BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy6258
         (136 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
 gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
          Length = 514

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 34/55 (61%), Positives = 43/55 (78%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DV +GGAT+FP L L VFP+KGSA+FWYN HAN  +D+R  H  CPV +G+KW
Sbjct: 437 LSDVAIGGATVFPQLGLGVFPQKGSAIFWYNLHANGTVDHRTLHGACPVFVGSKW 491



 Score = 81.6 bits (200), Expect = 9e-14,   Method: Composition-based stats.
 Identities = 34/64 (53%), Positives = 44/64 (68%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DV +GGAT+FP L L VFP+KGSA+FW+N HAN  +D+   H  CPV +
Sbjct: 428 NRIATVMYYLSDVAIGGATVFPQLGLGVFPQKGSAIFWYNLHANGTVDHRTLHGACPVFV 487

Query: 62  GNKW 65
           G+KW
Sbjct: 488 GSKW 491


>gi|170029530|ref|XP_001842645.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
 gi|167863229|gb|EDS26612.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
          Length = 522

 Score = 85.9 bits (211), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 36/67 (53%), Positives = 49/67 (73%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+   L+   L+DVE+GG T+FP L+L VFP KGSA+FW+N H N  +D RM H+GCP
Sbjct: 422 MGNRLATLMYY-LSDVEVGGGTVFPRLSLGVFPIKGSAIFWHNVHHNGNVDERMLHAGCP 480

Query: 121 VALGNKW 127
             +G+KW
Sbjct: 481 TLIGSKW 487



 Score = 83.2 bits (204), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 35/64 (54%), Positives = 44/64 (68%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DVE+GG T+FP L+L VFP KGSA+FWHN H N  +D  M H+GCP  +
Sbjct: 424 NRLATLMYYLSDVEVGGGTVFPRLSLGVFPIKGSAIFWHNVHHNGNVDERMLHAGCPTLI 483

Query: 62  GNKW 65
           G+KW
Sbjct: 484 GSKW 487


>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
          Length = 508

 Score = 82.0 bits (201), Expect = 8e-14,   Method: Composition-based stats.
 Identities = 33/55 (60%), Positives = 43/55 (78%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DVE+GGAT+F ++ + V PEKGSAVFWYN H N  LD +  H+GCPV +GNKW
Sbjct: 431 MSDVEVGGATVFTNVGVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKW 485



 Score = 77.8 bits (190), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 32/57 (56%), Positives = 43/57 (75%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + ++DVE+GGAT+F ++ + V PEKGSAVFW+N H N  LD    H+GCPV +GNKW
Sbjct: 429 IYMSDVEVGGATVFTNVGVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKW 485


>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
          Length = 490

 Score = 82.0 bits (201), Expect = 8e-14,   Method: Composition-based stats.
 Identities = 33/55 (60%), Positives = 43/55 (78%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DVE+GGAT+F ++ + V PEKGSAVFWYN H N  LD +  H+GCPV +GNKW
Sbjct: 413 MSDVEVGGATVFTNVGVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKW 467



 Score = 77.8 bits (190), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 32/57 (56%), Positives = 43/57 (75%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + ++DVE+GGAT+F ++ + V PEKGSAVFW+N H N  LD    H+GCPV +GNKW
Sbjct: 411 IYMSDVEVGGATVFTNVGVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKW 467


>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 509

 Score = 79.0 bits (193), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 34/67 (50%), Positives = 45/67 (67%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+   ++   L+DVE+GGAT+FP L L VFP KGSA+ WYN + N   D R  H+ CP
Sbjct: 417 VGNRIATVMFY-LSDVEIGGATVFPQLGLAVFPRKGSAILWYNLYRNGKGDRRTLHAACP 475

Query: 121 VALGNKW 127
           V  G+KW
Sbjct: 476 VLSGSKW 482



 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/64 (50%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DVE+GGAT+FP L L VFP KGSA+ W+N + N   D    H+ CPV  
Sbjct: 419 NRIATVMFYLSDVEIGGATVFPQLGLAVFPRKGSAILWYNLYRNGKGDRRTLHAACPVLS 478

Query: 62  GNKW 65
           G+KW
Sbjct: 479 GSKW 482


>gi|344252711|gb|EGW08815.1| Prolyl 4-hydroxylase subunit alpha-2 [Cricetulus griseus]
          Length = 584

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 33/61 (54%), Positives = 43/61 (70%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L S ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KWG
Sbjct: 466 LCSQMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWG 525

Query: 129 K 129
           K
Sbjct: 526 K 526



 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 29/58 (50%), Positives = 41/58 (70%)

Query: 10  ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGK 67
           +++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KWGK
Sbjct: 469 QMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWGK 526


>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           (Silurana) tropicalis]
 gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
          Length = 527

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 33/68 (48%), Positives = 46/68 (67%), Gaps = 1/68 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 448 GNRLATFL-NYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPV 506

Query: 122 ALGNKWGK 129
            +G+KWGK
Sbjct: 507 LVGSKWGK 514



 Score = 71.6 bits (174), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 28/57 (49%), Positives = 40/57 (70%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGK 67
           ++DVE GGAT+FP     ++P+KG+AVFW+N   +   DY   H+ CPV +G+KWGK
Sbjct: 458 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWGK 514


>gi|157114983|ref|XP_001658090.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108877085|gb|EAT41310.1| AAEL007032-PA, partial [Aedes aegypti]
          Length = 448

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 32/64 (50%), Positives = 42/64 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLL 132
           ++DVE GGAT+FP + + VFP+KGSA+FWYN   +   D R  H  CPV LG+KWG   +
Sbjct: 384 ISDVEQGGATVFPQIGVGVFPKKGSAIFWYNLLPDGTGDERTLHGACPVLLGSKWGMFCM 443

Query: 133 SGVF 136
              F
Sbjct: 444 QQYF 447



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 29/56 (51%), Positives = 39/56 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           ++DVE GGAT+FP + + VFP+KGSA+FW+N   +   D    H  CPV LG+KWG
Sbjct: 384 ISDVEQGGATVFPQIGVGVFPKKGSAIFWYNLLPDGTGDERTLHGACPVLLGSKWG 439


>gi|195341558|ref|XP_002037373.1| GM12146 [Drosophila sechellia]
 gi|194131489|gb|EDW53532.1| GM12146 [Drosophila sechellia]
          Length = 485

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 32/56 (57%), Positives = 39/56 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L DV  GGAT FP LN+TVFP+ G+ + WYN H   LL  R  H+GCPV +G+KWG
Sbjct: 430 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGLLHVRTMHTGCPVIVGSKWG 485



 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 42/66 (63%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           ++R   +   L DV  GGAT FP LN+TVFP+ G+ + W+N H   LL     H+GCPV 
Sbjct: 420 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGLLHVRTMHTGCPVI 479

Query: 61  LGNKWG 66
           +G+KWG
Sbjct: 480 VGSKWG 485


>gi|281348666|gb|EFB24250.1| hypothetical protein PANDA_000722 [Ailuropoda melanoleuca]
          Length = 505

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 33/63 (52%), Positives = 44/63 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KWG
Sbjct: 443 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWG 502

Query: 129 KLL 131
           K L
Sbjct: 503 KWL 505



 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLL 69
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KWGK L
Sbjct: 447 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWGKWL 505


>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
 gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
          Length = 534

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 39/103 (37%), Positives = 57/103 (55%), Gaps = 1/103 (0%)

Query: 25  LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIF 84
           L++T   +   A +    H     D++      P A GN+    +   L+DVE GGAT F
Sbjct: 400 LDMTFCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFY-LSDVEQGGATAF 458

Query: 85  PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           P LN  V P+ G+ +FWYN H ++ +D+R  H+GCPV  G+KW
Sbjct: 459 PFLNFAVRPQLGNILFWYNLHRSSDMDFRTKHAGCPVLKGSKW 501



 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L+DVE GGAT FP LN  V P+ G+ +FW+N H ++ +D+   H+GCPV  
Sbjct: 438 NRIATAIFYLSDVEQGGATAFPFLNFAVRPQLGNILFWYNLHRSSDMDFRTKHAGCPVLK 497

Query: 62  GNKW 65
           G+KW
Sbjct: 498 GSKW 501


>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
 gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
          Length = 553

 Score = 77.0 bits (188), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 33/66 (50%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DV  GGAT+FPSLNL ++P KG+A FW+N HA+   DY   H+ CPV
Sbjct: 461 GNRIATVLFY-MSDVTQGGATVFPSLNLALWPRKGTAAFWFNLHASGRGDYATRHAACPV 519

Query: 122 ALGNKW 127
             G KW
Sbjct: 520 LTGTKW 525



 Score = 74.3 bits (181), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 32/64 (50%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FPSLNL ++P KG+A FW N HA+   DY   H+ CPV  
Sbjct: 462 NRIATVLFYMSDVTQGGATVFPSLNLALWPRKGTAAFWFNLHASGRGDYATRHAACPVLT 521

Query: 62  GNKW 65
           G KW
Sbjct: 522 GTKW 525


>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
 gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
          Length = 533

 Score = 77.0 bits (188), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 43/113 (38%), Positives = 61/113 (53%), Gaps = 4/113 (3%)

Query: 15  ELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLT 74
           ++G AT    L++T   +   A +    H     D++      P A GN+    +   L+
Sbjct: 392 DVGDAT---GLDMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFY-LS 447

Query: 75  DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           DVE GGAT FP LN  V P+ G+ +FWYN H ++  DYR  H+GCPV  G+KW
Sbjct: 448 DVEQGGATAFPFLNFAVRPQLGNILFWYNLHRSSDEDYRTKHAGCPVLKGSKW 500



 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L+DVE GGAT FP LN  V P+ G+ +FW+N H ++  DY   H+GCPV  
Sbjct: 437 NRIATAIFYLSDVEQGGATAFPFLNFAVRPQLGNILFWYNLHRSSDEDYRTKHAGCPVLK 496

Query: 62  GNKW 65
           G+KW
Sbjct: 497 GSKW 500


>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
 gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
          Length = 538

 Score = 77.0 bits (188), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 39/103 (37%), Positives = 56/103 (54%), Gaps = 1/103 (0%)

Query: 25  LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIF 84
           L++T   +   A +    H     D+++     P   GN+    +   L+DVE GGAT F
Sbjct: 404 LDMTYCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGNRIATAIFY-LSDVEQGGATAF 462

Query: 85  PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           P LN  V P+ G+ +FWYN H +  +DYR  H+GCPV  G+KW
Sbjct: 463 PFLNFAVRPQLGNILFWYNLHRSLDMDYRTKHAGCPVLKGSKW 505



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L+DVE GGAT FP LN  V P+ G+ +FW+N H +  +DY   H+GCPV  
Sbjct: 442 NRIATAIFYLSDVEQGGATAFPFLNFAVRPQLGNILFWYNLHRSLDMDYRTKHAGCPVLK 501

Query: 62  GNKW 65
           G+KW
Sbjct: 502 GSKW 505


>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 520

 Score = 77.0 bits (188), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 46/68 (67%), Gaps = 1/68 (1%)

Query: 60  ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
            +GN+   L+   L+DV+ GG+T+FP LNL V P KG+A+FWYN H N   + +  H+ C
Sbjct: 431 GMGNRIATLMFY-LSDVQEGGSTVFPRLNLAVRPRKGTAIFWYNLHRNGKGNKKTLHAAC 489

Query: 120 PVALGNKW 127
           PV +G+KW
Sbjct: 490 PVLIGSKW 497



 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DV+ GG+T+FP LNL V P KG+A+FW+N H N   +    H+ CPV +
Sbjct: 434 NRIATLMFYLSDVQEGGSTVFPRLNLAVRPRKGTAIFWYNLHRNGKGNKKTLHAACPVLI 493

Query: 62  GNKW 65
           G+KW
Sbjct: 494 GSKW 497


>gi|194765172|ref|XP_001964701.1| GF23326 [Drosophila ananassae]
 gi|190614973|gb|EDV30497.1| GF23326 [Drosophila ananassae]
          Length = 885

 Score = 77.0 bits (188), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 35/60 (58%), Positives = 43/60 (71%)

Query: 68  LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +L + L+DVE GGATIF  LNLTVFP+ GSA+FWYN       D R  H+GCPV +G+KW
Sbjct: 799 VLSNALSDVEQGGATIFTKLNLTVFPQSGSALFWYNLDNWGNEDKRTEHAGCPVIVGSKW 858



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 42/92 (45%), Positives = 53/92 (57%), Gaps = 16/92 (17%)

Query: 8   SVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYW------MYHSGCPVAL 61
           S  L+DVE GGATIF  LNLTVFP+ GSA+FW+N      LD W        H+GCPV +
Sbjct: 801 SNALSDVEQGGATIFTKLNLTVFPQSGSALFWYN------LDNWGNEDKRTEHAGCPVIV 854

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFP 93
           G+KW  ++   +TD  +G     P     VFP
Sbjct: 855 GSKW--IMTKWVTD--MGQEFRKPCYKSGVFP 882


>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
          Length = 515

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 1/75 (1%)

Query: 53  YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
           + S    A+G++   +L   L+DVE GGAT+FP + ++V+P+KGSAV W+N H +   D 
Sbjct: 424 FQSTVDPAIGSRIETVLFY-LSDVEQGGATVFPEIQVSVWPQKGSAVVWFNLHPSGDGDQ 482

Query: 113 RMYHSGCPVALGNKW 127
           R  H+GCPV +G+KW
Sbjct: 483 RTKHAGCPVLIGSKW 497



 Score = 71.6 bits (174), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 43/64 (67%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R E     L+DVE GGAT+FP + ++V+P+KGSAV W N H +   D    H+GCPV +
Sbjct: 434 SRIETVLFYLSDVEQGGATVFPEIQVSVWPQKGSAVVWFNLHPSGDGDQRTKHAGCPVLI 493

Query: 62  GNKW 65
           G+KW
Sbjct: 494 GSKW 497


>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
          Length = 509

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 1/75 (1%)

Query: 53  YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
           + S    A+G++   +L   L+DVE GGAT+FP + ++V+P+KGSAV W+N H +   D 
Sbjct: 418 FQSTVDPAIGSRIETVLFY-LSDVEQGGATVFPEIQVSVWPQKGSAVVWFNLHPSGDGDQ 476

Query: 113 RMYHSGCPVALGNKW 127
           R  H+GCPV +G+KW
Sbjct: 477 RTKHAGCPVLIGSKW 491



 Score = 71.6 bits (174), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 43/64 (67%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R E     L+DVE GGAT+FP + ++V+P+KGSAV W N H +   D    H+GCPV +
Sbjct: 428 SRIETVLFYLSDVEQGGATVFPEIQVSVWPQKGSAVVWFNLHPSGDGDQRTKHAGCPVLI 487

Query: 62  GNKW 65
           G+KW
Sbjct: 488 GSKW 491


>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
 gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
          Length = 540

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 32/55 (58%), Positives = 40/55 (72%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L++VE GG T FP L L V PE+GS +FWYN H +  LDYR  H+GCPV +G+KW
Sbjct: 453 LSNVEAGGGTAFPFLPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLMGSKW 507



 Score = 72.4 bits (176), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/64 (50%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L++VE GG T FP L L V PE+GS +FW+N H +  LDY   H+GCPV +
Sbjct: 444 NRVATGIYYLSNVEAGGGTAFPFLPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLM 503

Query: 62  GNKW 65
           G+KW
Sbjct: 504 GSKW 507


>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
 gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
          Length = 534

 Score = 76.3 bits (186), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 34/70 (48%), Positives = 44/70 (62%), Gaps = 1/70 (1%)

Query: 58  PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           P   GN+    +   L+DVE GGAT FP LN  V P+ G+ +FWYN H +  +DYR  H+
Sbjct: 433 PAEEGNRMATAIFY-LSDVEQGGATAFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHA 491

Query: 118 GCPVALGNKW 127
           GCPV  G+KW
Sbjct: 492 GCPVLKGSKW 501



 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L+DVE GGAT FP LN  V P+ G+ +FW+N H +  +DY   H+GCPV  
Sbjct: 438 NRMATAIFYLSDVEQGGATAFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLK 497

Query: 62  GNKW 65
           G+KW
Sbjct: 498 GSKW 501


>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
 gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
          Length = 534

 Score = 76.3 bits (186), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 34/70 (48%), Positives = 44/70 (62%), Gaps = 1/70 (1%)

Query: 58  PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           P   GN+    +   L+DVE GGAT FP LN  V P+ G+ +FWYN H +  +DYR  H+
Sbjct: 433 PAEEGNRMATAIFY-LSDVEQGGATAFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHA 491

Query: 118 GCPVALGNKW 127
           GCPV  G+KW
Sbjct: 492 GCPVLKGSKW 501



 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L+DVE GGAT FP LN  V P+ G+ +FW+N H +  +DY   H+GCPV  
Sbjct: 438 NRMATAIFYLSDVEQGGATAFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLK 497

Query: 62  GNKW 65
           G+KW
Sbjct: 498 GSKW 501


>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 531

 Score = 76.3 bits (186), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 34/66 (51%), Positives = 45/66 (68%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    LL  +++V+ GGAT+F  +  +V P+KGSAVFWYN H +   DYR  H+ CPV
Sbjct: 444 GNRIATWLLY-MSEVQAGGATVFTDIGASVSPKKGSAVFWYNLHPSGDGDYRTRHAACPV 502

Query: 122 ALGNKW 127
            LGNKW
Sbjct: 503 LLGNKW 508



 Score = 69.7 bits (169), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + +++V+ GGAT+F  +  +V P+KGSAVFW+N H +   DY   H+ CPV L
Sbjct: 445 NRIATWLLYMSEVQAGGATVFTDIGASVSPKKGSAVFWYNLHPSGDGDYRTRHAACPVLL 504

Query: 62  GNKW 65
           GNKW
Sbjct: 505 GNKW 508


>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
 gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
          Length = 516

 Score = 75.9 bits (185), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 33/66 (50%), Positives = 45/66 (68%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   L+   L+DVE GGAT+FP L + VFP+KG+A+FWYN  A+   D +  H  CPV
Sbjct: 429 GNRIATLMFY-LSDVEEGGATVFPHLGVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPV 487

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 488 LIGSKW 493



 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DVE GGAT+FP L + VFP+KG+A+FW+N  A+   D    H  CPV +
Sbjct: 430 NRIATLMFYLSDVEEGGATVFPHLGVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPVLI 489

Query: 62  GNKW 65
           G+KW
Sbjct: 490 GSKW 493


>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
          Length = 522

 Score = 75.9 bits (185), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 33/71 (46%), Positives = 44/71 (61%), Gaps = 8/71 (11%)

Query: 65  WGKL--------LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYH 116
           WGKL        +L  ++DV +GGAT+FP L +T+   KG+A FWYN H++  LDY   H
Sbjct: 429 WGKLKEVNRIATVLFYMSDVSIGGATVFPKLGVTLEARKGTAAFWYNLHSSGELDYSTLH 488

Query: 117 SGCPVALGNKW 127
             CPV +G KW
Sbjct: 489 GACPVLIGEKW 499



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 29/65 (44%), Positives = 41/65 (63%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           +NR       ++DV +GGAT+FP L +T+   KG+A FW+N H++  LDY   H  CPV 
Sbjct: 435 VNRIATVLFYMSDVSIGGATVFPKLGVTLEARKGTAAFWYNLHSSGELDYSTLHGACPVL 494

Query: 61  LGNKW 65
           +G KW
Sbjct: 495 IGEKW 499


>gi|195505216|ref|XP_002099408.1| GE23378 [Drosophila yakuba]
 gi|194185509|gb|EDW99120.1| GE23378 [Drosophila yakuba]
          Length = 546

 Score = 75.9 bits (185), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 32/55 (58%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GGAT FP LN+TVFP+ G+A+FWYN     LL  R  H+GCPV +G+KW
Sbjct: 457 LNDVPQGGATHFPGLNITVFPKFGTALFWYNLDTKGLLRLRTMHTGCPVIVGSKW 511



 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/65 (49%), Positives = 42/65 (64%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           M+R   +   L DV  GGAT FP LN+TVFP+ G+A+FW+N     LL     H+GCPV 
Sbjct: 447 MDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTALFWYNLDTKGLLRLRTMHTGCPVI 506

Query: 61  LGNKW 65
           +G+KW
Sbjct: 507 VGSKW 511


>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
          Length = 235

 Score = 75.9 bits (185), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 32/55 (58%), Positives = 42/55 (76%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DVE+GGAT+F ++ + V PEKGSAVFW N H N  LD +  H+GCPV +GNKW
Sbjct: 158 MSDVEVGGATVFTNVGVAVKPEKGSAVFWNNLHKNGELDLKTKHAGCPVLVGNKW 212



 Score = 75.1 bits (183), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 32/55 (58%), Positives = 42/55 (76%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE+GGAT+F ++ + V PEKGSAVFW+N H N  LD    H+GCPV +GNKW
Sbjct: 158 MSDVEVGGATVFTNVGVAVKPEKGSAVFWNNLHKNGELDLKTKHAGCPVLVGNKW 212


>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
           rubripes]
          Length = 538

 Score = 75.9 bits (185), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 40/59 (67%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP     ++P KG+AVFWYN   +   DYR  H+ CPV +GNKW
Sbjct: 457 FLNYMSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFKSGEGDYRTRHAACPVLVGNKW 515



 Score = 69.3 bits (168), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP     ++P KG+AVFW+N   +   DY   H+ CPV +GNKW
Sbjct: 461 MSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFKSGEGDYRTRHAACPVLVGNKW 515


>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 548

 Score = 75.5 bits (184), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 33/66 (50%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +   V+P+KG+AVFWYN  AN   DY   H+ CPV
Sbjct: 461 GNRIATWLFY-MSDVAAGGATVFPDVGAAVWPQKGTAVFWYNLFANGEGDYSTRHAACPV 519

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 520 LVGNKW 525



 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +   V+P+KG+AVFW+N  AN   DY   H+ CPV +
Sbjct: 462 NRIATWLFYMSDVAAGGATVFPDVGAAVWPQKGTAVFWYNLFANGEGDYSTRHAACPVLV 521

Query: 62  GNKW 65
           GNKW
Sbjct: 522 GNKW 525


>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
 gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
          Length = 496

 Score = 75.5 bits (184), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 32/55 (58%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LTDVE GGAT+F  LN  V P++G+A+FWYN H N   D R  H GCPV +G+KW
Sbjct: 420 LTDVEQGGATVFTILNQAVSPKRGTALFWYNLHRNGTGDTRTLHGGCPVLVGSKW 474



 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+F  LN  V P++G+A+FW+N H N   D    H GCPV +G+KW
Sbjct: 420 LTDVEQGGATVFTILNQAVSPKRGTALFWYNLHRNGTGDTRTLHGGCPVLVGSKW 474


>gi|195575113|ref|XP_002105524.1| GD16980 [Drosophila simulans]
 gi|194201451|gb|EDX15027.1| GD16980 [Drosophila simulans]
          Length = 518

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 31/55 (56%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GGAT FP LN+TVFP+ G+ + WYN H   LL  R  H+GCPV +G+KW
Sbjct: 429 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGLLHVRTMHTGCPVIVGSKW 483



 Score = 70.1 bits (170), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/65 (46%), Positives = 41/65 (63%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           ++R   +   L DV  GGAT FP LN+TVFP+ G+ + W+N H   LL     H+GCPV 
Sbjct: 419 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGLLHVRTMHTGCPVI 478

Query: 61  LGNKW 65
           +G+KW
Sbjct: 479 VGSKW 483


>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
 gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
          Length = 539

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 31/55 (56%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GGAT FP LN+TVFP  G+A+FWYN     +L  R  H+GCPV +G+KW
Sbjct: 450 LNDVPQGGATHFPGLNITVFPRLGAALFWYNLDTQGMLQVRTMHTGCPVIVGSKW 504



 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/65 (46%), Positives = 41/65 (63%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           ++R   +   L DV  GGAT FP LN+TVFP  G+A+FW+N     +L     H+GCPV 
Sbjct: 440 IDRIATTLFYLNDVPQGGATHFPGLNITVFPRLGAALFWYNLDTQGMLQVRTMHTGCPVI 499

Query: 61  LGNKW 65
           +G+KW
Sbjct: 500 VGSKW 504


>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
 gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
          Length = 511

 Score = 75.1 bits (183), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 42/125 (33%), Positives = 63/125 (50%), Gaps = 12/125 (9%)

Query: 3   RAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG 62
           R E   V++T++ + G+  +  +N  +             H     D++          G
Sbjct: 376 RIERRVVDMTELSMQGSEAYNIMNYGL-----------GGHYAAHYDFFNTTKQQTSETG 424

Query: 63  NKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 122
           ++   +L   L+DVE GGAT+FP+L L V PE+G A+FWYN   N   D R  H GCPV 
Sbjct: 425 DRIATVLFY-LSDVEQGGATVFPNLKLAVSPERGMALFWYNLLDNGTGDTRTLHGGCPVL 483

Query: 123 LGNKW 127
           +G+KW
Sbjct: 484 VGSKW 488


>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
           niloticus]
          Length = 594

 Score = 75.1 bits (183), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 34/66 (51%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    LL  +TDV+ GGAT+F  +   V P+KG+AVFWYN + +   DYR  H+ CPV
Sbjct: 507 GNRIATWLLY-MTDVQAGGATVFTDIGAAVKPKKGTAVFWYNLYPSGEGDYRTRHAACPV 565

Query: 122 ALGNKW 127
            LGNKW
Sbjct: 566 LLGNKW 571



 Score = 68.9 bits (167), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + +TDV+ GGAT+F  +   V P+KG+AVFW+N + +   DY   H+ CPV L
Sbjct: 508 NRIATWLLYMTDVQAGGATVFTDIGAAVKPKKGTAVFWYNLYPSGEGDYRTRHAACPVLL 567

Query: 62  GNKW 65
           GNKW
Sbjct: 568 GNKW 571


>gi|157114985|ref|XP_001658091.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108877086|gb|EAT41311.1| AAEL007038-PA [Aedes aegypti]
          Length = 545

 Score = 75.1 bits (183), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DV  GGAT+FPSL L ++P+KG+A FW+N HA+   DY   H+ CPV
Sbjct: 454 GNRIATVLFY-MSDVTQGGATVFPSLRLALWPKKGAAAFWFNLHASGQGDYSTRHAACPV 512

Query: 122 ALGNKW 127
             G KW
Sbjct: 513 LTGTKW 518



 Score = 72.0 bits (175), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FPSL L ++P+KG+A FW N HA+   DY   H+ CPV  
Sbjct: 455 NRIATVLFYMSDVTQGGATVFPSLRLALWPKKGAAAFWFNLHASGQGDYSTRHAACPVLT 514

Query: 62  GNKW 65
           G KW
Sbjct: 515 GTKW 518


>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
 gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
          Length = 536

 Score = 75.1 bits (183), Expect = 9e-12,   Method: Composition-based stats.
 Identities = 38/103 (36%), Positives = 56/103 (54%), Gaps = 1/103 (0%)

Query: 25  LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIF 84
           L++T   +   A +    H     D++      P   GN+    +   L++VE GGAT F
Sbjct: 402 LDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEEGNRIATAIYY-LSEVEQGGATAF 460

Query: 85  PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           P LN  V P+ G+ +FWYN H ++ +DYR  H+GCPV  G+KW
Sbjct: 461 PFLNFAVRPQLGNVLFWYNLHRSSDMDYRTKHAGCPVLKGSKW 503



 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L++VE GGAT FP LN  V P+ G+ +FW+N H ++ +DY   H+GCPV  
Sbjct: 440 NRIATAIYYLSEVEQGGATAFPFLNFAVRPQLGNVLFWYNLHRSSDMDYRTKHAGCPVLK 499

Query: 62  GNKW 65
           G+KW
Sbjct: 500 GSKW 503


>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 511

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DV+ GGAT+F  +  +V P+KGSAVFWYN   +   DYR  H+ CPV LGNKW
Sbjct: 436 MSDVQAGGATVFTDIGASVLPQKGSAVFWYNLRPSGDGDYRTRHAACPVLLGNKW 490



 Score = 69.7 bits (169), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 28/57 (49%), Positives = 39/57 (68%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + ++DV+ GGAT+F  +  +V P+KGSAVFW+N   +   DY   H+ CPV LGNKW
Sbjct: 434 LRMSDVQAGGATVFTDIGASVLPQKGSAVFWYNLRPSGDGDYRTRHAACPVLLGNKW 490


>gi|261245137|gb|ACX54875.1| FI12021p [Drosophila melanogaster]
          Length = 538

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GGAT FP LN+TVFP+ G+ + WYN H   +L  R  H+GCPV +G+KW
Sbjct: 449 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 503



 Score = 69.3 bits (168), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 29/65 (44%), Positives = 41/65 (63%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           ++R   +   L DV  GGAT FP LN+TVFP+ G+ + W+N H   +L     H+GCPV 
Sbjct: 439 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVI 498

Query: 61  LGNKW 65
           +G+KW
Sbjct: 499 VGSKW 503


>gi|116008537|ref|NP_733379.2| CG31524, isoform A [Drosophila melanogaster]
 gi|113194861|gb|AAN14239.2| CG31524, isoform A [Drosophila melanogaster]
          Length = 536

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GGAT FP LN+TVFP+ G+ + WYN H   +L  R  H+GCPV +G+KW
Sbjct: 447 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 501



 Score = 69.3 bits (168), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 29/65 (44%), Positives = 41/65 (63%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           ++R   +   L DV  GGAT FP LN+TVFP+ G+ + W+N H   +L     H+GCPV 
Sbjct: 437 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVI 496

Query: 61  LGNKW 65
           +G+KW
Sbjct: 497 VGSKW 501


>gi|116008130|ref|NP_001036777.1| CG31524, isoform B [Drosophila melanogaster]
 gi|113194860|gb|ABI31221.1| CG31524, isoform B [Drosophila melanogaster]
          Length = 535

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GGAT FP LN+TVFP+ G+ + WYN H   +L  R  H+GCPV +G+KW
Sbjct: 446 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 500



 Score = 69.3 bits (168), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 29/65 (44%), Positives = 41/65 (63%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           ++R   +   L DV  GGAT FP LN+TVFP+ G+ + W+N H   +L     H+GCPV 
Sbjct: 436 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVI 495

Query: 61  LGNKW 65
           +G+KW
Sbjct: 496 VGSKW 500


>gi|66771513|gb|AAY55068.1| IP12095p [Drosophila melanogaster]
          Length = 538

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GGAT FP LN+TVFP+ G+ + WYN H   +L  R  H+GCPV +G+KW
Sbjct: 449 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 503



 Score = 69.3 bits (168), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 29/65 (44%), Positives = 41/65 (63%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           ++R   +   L DV  GGAT FP LN+TVFP+ G+ + W+N H   +L     H+GCPV 
Sbjct: 439 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVI 498

Query: 61  LGNKW 65
           +G+KW
Sbjct: 499 VGSKW 503


>gi|66770643|gb|AAY54633.1| IP12395p [Drosophila melanogaster]
          Length = 538

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GGAT FP LN+TVFP+ G+ + WYN H   +L  R  H+GCPV +G+KW
Sbjct: 449 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 503



 Score = 69.3 bits (168), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 29/65 (44%), Positives = 41/65 (63%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           ++R   +   L DV  GGAT FP LN+TVFP+ G+ + W+N H   +L     H+GCPV 
Sbjct: 439 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVI 498

Query: 61  LGNKW 65
           +G+KW
Sbjct: 499 VGSKW 503


>gi|55925444|ref|NP_001007286.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Danio rerio]
 gi|49900294|gb|AAH76508.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide 2 [Danio rerio]
 gi|182891794|gb|AAI65288.1| P4ha2 protein [Danio rerio]
          Length = 514

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 30/58 (51%), Positives = 40/58 (68%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            S ++DVE GGAT+FP    +V+P KG+AVFWYN   +   DYR  H+ CPV +G+KW
Sbjct: 434 FSRMSDVEAGGATVFPDFGASVWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 491



 Score = 68.2 bits (165), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP    +V+P KG+AVFW+N   +   DY   H+ CPV +G+KW
Sbjct: 437 MSDVEAGGATVFPDFGASVWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 491


>gi|301613006|ref|XP_002936013.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
           (Silurana) tropicalis]
          Length = 504

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DVE GGAT+FP +   V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 427 MSDVEAGGATVFPEVGAAVYPKKGTAVFWYNLFESGEGDYSTRHAACPVLVGNKW 481



 Score = 72.0 bits (175), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+FP +   V+P+KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 418 NRVATWLFYMSDVEAGGATVFPEVGAAVYPKKGTAVFWYNLFESGEGDYSTRHAACPVLV 477

Query: 62  GNKW 65
           GNKW
Sbjct: 478 GNKW 481


>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 523

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    LL  ++DV+ GG T+F  +   V+P+KG+AVFWYN H +   DYR  H+ CPV
Sbjct: 436 GNRIATWLLY-MSDVQAGGNTVFTDIGAVVWPKKGTAVFWYNLHRSGEGDYRTRHAACPV 494

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 495 LVGNKW 500



 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + ++DV+ GG T+F  +   V+P+KG+AVFW+N H +   DY   H+ CPV +
Sbjct: 437 NRIATWLLYMSDVQAGGNTVFTDIGAVVWPKKGTAVFWYNLHRSGEGDYRTRHAACPVLV 496

Query: 62  GNKW 65
           GNKW
Sbjct: 497 GNKW 500


>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
          Length = 545

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 45/66 (68%), Gaps = 3/66 (4%)

Query: 65  WGKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           WG  + + L   +DVE GGAT+FP LNL+++P+KGSA FW+N + N   +    H+GCPV
Sbjct: 455 WGGRIATWLFYMSDVEAGGATVFPKLNLSLWPQKGSAAFWFNLYPNGEGNEMTQHAGCPV 514

Query: 122 ALGNKW 127
             G+KW
Sbjct: 515 LTGSKW 520



 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 40/55 (72%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP LNL+++P+KGSA FW N + N   +    H+GCPV  G+KW
Sbjct: 466 MSDVEAGGATVFPKLNLSLWPQKGSAAFWFNLYPNGEGNEMTQHAGCPVLTGSKW 520


>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
           latipes]
          Length = 532

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP    +++P KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 445 GNRLATFL-NYMSDVEAGGATVFPDFGASIWPRKGTAVFWYNLFRSGEGDYRTRHAACPV 503

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 504 LVGSKW 509



 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP    +++P KG+AVFW+N   +   DY   H+ CPV +G+KW
Sbjct: 455 MSDVEAGGATVFPDFGASIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 509


>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Sarcophilus harrisii]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 45/66 (68%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP    T++P+KG++VFWYN   +   DYR  H+ CPV
Sbjct: 447 GNRLATFL-NYMSDVEAGGATVFPDFGATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPV 505

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 506 LVGSKW 511



 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 39/55 (70%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP    T++P+KG++VFW+N   +   DY   H+ CPV +G+KW
Sbjct: 457 MSDVEAGGATVFPDFGATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 511


>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
           lupus familiaris]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
 gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
 gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
           [Rattus norvegicus]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
          Length = 454

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 367 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 425

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 426 LVGNKW 431



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 368 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 427

Query: 62  GNKW 65
           GNKW
Sbjct: 428 GNKW 431


>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
 gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide [Mus musculus]
 gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
 gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
 gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
           musculus]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|410975458|ref|XP_003994148.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Felis catus]
          Length = 567

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 480 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 538

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 539 LVGNKW 544



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 481 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 540

Query: 62  GNKW 65
           GNKW
Sbjct: 541 GNKW 544


>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
          Length = 488

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 401 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 459

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 460 LVGNKW 465



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 402 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 461

Query: 62  GNKW 65
           GNKW
Sbjct: 462 GNKW 465


>gi|395820528|ref|XP_003783616.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Otolemur
           garnettii]
          Length = 516

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 429 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 487

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 488 LVGNKW 493



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 430 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 489

Query: 62  GNKW 65
           GNKW
Sbjct: 490 GNKW 493


>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
           garnettii]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
           garnettii]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Sarcophilus harrisii]
          Length = 536

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 29/59 (49%), Positives = 42/59 (71%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP    T++P+KG++VFWYN   +   DYR  H+ CPV +G+KW
Sbjct: 455 FLNYMSDVEAGGATVFPDFGATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 513



 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 39/55 (70%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP    T++P+KG++VFW+N   +   DY   H+ CPV +G+KW
Sbjct: 459 MSDVEAGGATVFPDFGATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 513


>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
 gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
 gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
 gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
 gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Cricetulus griseus]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Cricetulus griseus]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
           porcellus]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
           anatinus]
          Length = 493

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 406 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 464

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 465 LVGNKW 470



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 407 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 466

Query: 62  GNKW 65
           GNKW
Sbjct: 467 GNKW 470


>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
          Length = 507

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 420 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 478

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 479 LVGNKW 484



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 421 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 480

Query: 62  GNKW 65
           GNKW
Sbjct: 481 GNKW 484


>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Monodelphis domestica]
          Length = 537

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 450 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 508

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 509 LVGNKW 514



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 451 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 510

Query: 62  GNKW 65
           GNKW
Sbjct: 511 GNKW 514


>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Monodelphis domestica]
          Length = 537

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 450 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 508

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 509 LVGNKW 514



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 451 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 510

Query: 62  GNKW 65
           GNKW
Sbjct: 511 GNKW 514


>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1 [Nomascus leucogenys]
          Length = 502

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 415 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 473

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 474 LVGNKW 479



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 416 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 475

Query: 62  GNKW 65
           GNKW
Sbjct: 476 GNKW 479


>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
           melanoleuca]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
           jacchus]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|291404186|ref|XP_002718473.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 3
           [Oryctolagus cuniculus]
          Length = 516

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 429 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 487

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 488 LVGNKW 493



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 430 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 489

Query: 62  GNKW 65
           GNKW
Sbjct: 490 GNKW 493


>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
           [Oryctolagus cuniculus]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
           [Oryctolagus cuniculus]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
          Length = 511

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
 gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|217272851|ref|NP_001136068.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Homo
           sapiens]
 gi|114631189|ref|XP_001140871.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 10 [Pan
           troglodytes]
          Length = 516

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 429 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 487

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 488 LVGNKW 493



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 430 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 489

Query: 62  GNKW 65
           GNKW
Sbjct: 490 GNKW 493


>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
 gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
 gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
          Length = 526

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 439 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 497

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 498 LVGNKW 503



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 440 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 499

Query: 62  GNKW 65
           GNKW
Sbjct: 500 GNKW 503


>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
           sapiens]
 gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
           troglodytes]
 gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I variant [Homo
           sapiens]
 gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
           sapiens]
 gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
           sapiens]
 gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
 gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
           troglodytes]
 gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
           troglodytes]
 gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
 gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
 gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
 gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
           sapiens]
 gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
           sapiens]
 gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
 gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
           [Rattus norvegicus]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
 gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
          Length = 525

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LTDVE GGAT+FP++   VFP++GS V WYN   N  +D +  H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKW 503



 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+FP++   VFP++GS V W+N   N  +D    H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKW 503


>gi|344274276|ref|XP_003408943.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3
           [Loxodonta africana]
          Length = 516

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 429 GNRIATWLFY-MSDVSAGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 487

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 488 LVGNKW 493



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 430 NRIATWLFYMSDVSAGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 489

Query: 62  GNKW 65
           GNKW
Sbjct: 490 GNKW 493


>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Loxodonta africana]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Loxodonta africana]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
          Length = 212

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 35/69 (50%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           + LGN+   L+   L+DVE GGAT+FP + + VFP+KGSA+FWYN   +   D R  H  
Sbjct: 121 MGLGNRIATLMYY-LSDVEQGGATVFPQIGVGVFPKKGSAIFWYNLLPDGTGDERTLHGA 179

Query: 119 CPVALGNKW 127
           CPV LG+KW
Sbjct: 180 CPVLLGSKW 188



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 31/64 (48%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DVE GGAT+FP + + VFP+KGSA+FW+N   +   D    H  CPV L
Sbjct: 125 NRIATLMYYLSDVEQGGATVFPQIGVGVFPKKGSAIFWYNLLPDGTGDERTLHGACPVLL 184

Query: 62  GNKW 65
           G+KW
Sbjct: 185 GSKW 188


>gi|297301157|ref|XP_001103971.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Macaca
           mulatta]
          Length = 512

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 40/55 (72%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 435 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 489



 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 40/55 (72%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +GNKW
Sbjct: 435 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 489


>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
           niloticus]
          Length = 615

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +   V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 528 GNRIATWLFY-MSDVSAGGATVFPDVGAAVWPQKGTAVFWYNLFASGEGDYSTRHAACPV 586

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 587 LVGNKW 592



 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +   V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 529 NRIATWLFYMSDVSAGGATVFPDVGAAVWPQKGTAVFWYNLFASGEGDYSTRHAACPVLV 588

Query: 62  GNKW 65
           GNKW
Sbjct: 589 GNKW 592


>gi|198466403|ref|XP_002135183.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
 gi|198150584|gb|EDY73810.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 47/69 (68%), Gaps = 1/69 (1%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           + LG++ G ++    +DV  GGAT+FP   +++FP KGS+VFWYN + +  +D R  HS 
Sbjct: 441 LTLGDRLGSIIFYA-SDVPQGGATVFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSV 499

Query: 119 CPVALGNKW 127
           CPV +G++W
Sbjct: 500 CPVIVGDRW 508



 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 25/54 (46%), Positives = 38/54 (70%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +DV  GGAT+FP   +++FP KGS+VFW+N + +  +D    HS CPV +G++W
Sbjct: 455 SDVPQGGATVFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGDRW 508


>gi|195166681|ref|XP_002024163.1| GL22882 [Drosophila persimilis]
 gi|194107518|gb|EDW29561.1| GL22882 [Drosophila persimilis]
          Length = 534

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 47/69 (68%), Gaps = 1/69 (1%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           + LG++ G ++    +DV  GGAT+FP   +++FP KGS+VFWYN + +  +D R  HS 
Sbjct: 441 LTLGDRLGSIIFYA-SDVPQGGATVFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSV 499

Query: 119 CPVALGNKW 127
           CPV +G++W
Sbjct: 500 CPVIVGDRW 508



 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 25/54 (46%), Positives = 38/54 (70%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +DV  GGAT+FP   +++FP KGS+VFW+N + +  +D    HS CPV +G++W
Sbjct: 455 SDVPQGGATVFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGDRW 508


>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
 gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
          Length = 525

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LTDVE GGAT+FP++   VFP++GS V WYN   N  +D +  H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKW 503



 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+FP++   VFP++GS V W+N   N  +D    H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKW 503


>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
 gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
           melanogaster]
 gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
          Length = 525

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LTDVE GGAT+FP++   VFP++GS V WYN   N  +D +  H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKW 503



 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+FP++   VFP++GS V W+N   N  +D    H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKW 503


>gi|195110923|ref|XP_002000029.1| GI22757 [Drosophila mojavensis]
 gi|193916623|gb|EDW15490.1| GI22757 [Drosophila mojavensis]
          Length = 535

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 32/55 (58%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L++VE GGAT FP L L V PE+GS +FWYN H +   DYR  H+GCPV  G+KW
Sbjct: 450 LSNVEAGGATAFPFLPLLVKPEQGSLLFWYNLHRSGDADYRTKHAGCPVLQGSKW 504



 Score = 69.3 bits (168), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 32/64 (50%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L++VE GGAT FP L L V PE+GS +FW+N H +   DY   H+GCPV  
Sbjct: 441 NRIATGIFYLSNVEAGGATAFPFLPLLVKPEQGSLLFWYNLHRSGDADYRTKHAGCPVLQ 500

Query: 62  GNKW 65
           G+KW
Sbjct: 501 GSKW 504


>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
           (Silurana) tropicalis]
          Length = 526

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DVE GGAT+FP +   V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 449 MSDVEAGGATVFPEVGAAVYPKKGTAVFWYNLLESGEGDYSTRHAACPVLVGNKW 503



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+FP +   V+P+KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 440 NRVATWLFYMSDVEAGGATVFPEVGAAVYPKKGTAVFWYNLLESGEGDYSTRHAACPVLV 499

Query: 62  GNKW 65
           GNKW
Sbjct: 500 GNKW 503


>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
           domestica]
          Length = 534

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 30/66 (45%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP     ++P+KG++VFWYN   +   DYR  H+ CPV
Sbjct: 447 GNRLATFL-NYMSDVEAGGATVFPDFGAAIWPKKGTSVFWYNLFRSGECDYRTRHAACPV 505

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 506 LVGSKW 511



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP     ++P+KG++VFW+N   +   DY   H+ CPV +G+KW
Sbjct: 457 MSDVEAGGATVFPDFGAAIWPKKGTSVFWYNLFRSGECDYRTRHAACPVLVGSKW 511


>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
 gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
          Length = 535

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 35/69 (50%), Positives = 43/69 (62%), Gaps = 3/69 (4%)

Query: 62  GNKWGKLLLSG---LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           G+  G  + +G   L+DVE GG T FP L L V PEKGS +FWYN H +   DYR  H+ 
Sbjct: 434 GDDRGNRIATGIYYLSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHESGDQDYRTKHAA 493

Query: 119 CPVALGNKW 127
           CPV  G+KW
Sbjct: 494 CPVLQGSKW 502



 Score = 68.9 bits (167), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 32/64 (50%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DVE GG T FP L L V PEKGS +FW+N H +   DY   H+ CPV  
Sbjct: 439 NRIATGIYYLSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHESGDQDYRTKHAACPVLQ 498

Query: 62  GNKW 65
           G+KW
Sbjct: 499 GSKW 502


>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
 gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
          Length = 537

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 36/86 (41%), Positives = 49/86 (56%), Gaps = 1/86 (1%)

Query: 42  AHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFW 101
            H     D++   S  P A GN+    +   L++VE GGAT FP L+  V P+ G+ +FW
Sbjct: 420 GHYEPHWDFFRDPSHYPAAEGNRIATAIFY-LSEVEQGGATAFPFLDFAVKPQLGNVLFW 478

Query: 102 YNAHANTLLDYRMYHSGCPVALGNKW 127
           YN H +   DYR  H+GCPV  G+KW
Sbjct: 479 YNLHRSLDKDYRTKHAGCPVLKGSKW 504



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L++VE GGAT FP L+  V P+ G+ +FW+N H +   DY   H+GCPV  
Sbjct: 441 NRIATAIFYLSEVEQGGATAFPFLDFAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 500

Query: 62  GNKW 65
           G+KW
Sbjct: 501 GSKW 504


>gi|390459659|ref|XP_002806656.2| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-2 [Callithrix jacchus]
          Length = 579

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 498 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGXGDYRTRHAACPVLVGCKW 556



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 502 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGXGDYRTRHAACPVLVGCKW 556


>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
 gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
          Length = 537

 Score = 73.6 bits (179), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 33/70 (47%), Positives = 44/70 (62%), Gaps = 1/70 (1%)

Query: 58  PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           P   GN+    +   L+DVE GGAT FP L+  V P+ G+ +FWYN H +  +DYR  H+
Sbjct: 436 PAEEGNRIATAIFY-LSDVEQGGATAFPFLDFAVKPQLGNVLFWYNLHRSLDMDYRTKHA 494

Query: 118 GCPVALGNKW 127
           GCPV  G+KW
Sbjct: 495 GCPVLKGSKW 504



 Score = 69.3 bits (168), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L+DVE GGAT FP L+  V P+ G+ +FW+N H +  +DY   H+GCPV  
Sbjct: 441 NRIATAIFYLSDVEQGGATAFPFLDFAVKPQLGNVLFWYNLHRSLDMDYRTKHAGCPVLK 500

Query: 62  GNKW 65
           G+KW
Sbjct: 501 GSKW 504


>gi|426365135|ref|XP_004049642.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Gorilla gorilla
           gorilla]
          Length = 500

 Score = 73.6 bits (179), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 413 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 471

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 472 LVGNKW 477



 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 414 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 473

Query: 62  GNKW 65
           GNKW
Sbjct: 474 GNKW 477


>gi|443712762|gb|ELU05926.1| hypothetical protein CAPTEDRAFT_153364 [Capitella teleta]
          Length = 491

 Score = 73.6 bits (179), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/55 (58%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+ V LGGAT+FP LN+ V P K  A FWYNA  N   D R  H+GCPV LG KW
Sbjct: 411 LSRVHLGGATVFPKLNVRVPPVKNGAAFWYNARPNGEHDKRTLHAGCPVVLGEKW 465



 Score = 68.2 bits (165), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+ V LGGAT+FP LN+ V P K  A FW+NA  N   D    H+GCPV LG KW
Sbjct: 411 LSRVHLGGATVFPKLNVRVPPVKNGAAFWYNARPNGEHDKRTLHAGCPVVLGEKW 465


>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
          Length = 541

 Score = 73.6 bits (179), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DVE GGAT+FP +NL ++P+KGSA FW+N H N   D    H+ CPV  G+KW
Sbjct: 462 MSDVEAGGATVFPQINLALWPQKGSAAFWFNLHPNGEGDDLTQHAACPVLTGSKW 516



 Score = 72.4 bits (176), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+FP +NL ++P+KGSA FW N H N   D    H+ CPV  
Sbjct: 453 NRIATWMFYMSDVEAGGATVFPQINLALWPQKGSAAFWFNLHPNGEGDDLTQHAACPVLT 512

Query: 62  GNKW 65
           G+KW
Sbjct: 513 GSKW 516


>gi|194751833|ref|XP_001958228.1| GF10815 [Drosophila ananassae]
 gi|190625510|gb|EDV41034.1| GF10815 [Drosophila ananassae]
          Length = 273

 Score = 73.6 bits (179), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 34/63 (53%), Positives = 42/63 (66%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           LL  L+DVE GG T+FP LNL V  +KGS + WYN  +N   D R+ H+ CPV +GNKW 
Sbjct: 208 LLKNLSDVEQGGDTVFPHLNLKVPAQKGSLMVWYNLLSNGTTDSRVLHASCPVLMGNKWS 267

Query: 129 KLL 131
           K L
Sbjct: 268 KYL 270



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 33/70 (47%), Positives = 46/70 (65%), Gaps = 1/70 (1%)

Query: 1   MNRAEMSSVE-LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPV 59
           +N  E+S ++ L+DVE GG T+FP LNL V  +KGS + W+N  +N   D  + H+ CPV
Sbjct: 201 INEKEISLLKNLSDVEQGGDTVFPHLNLKVPAQKGSLMVWYNLLSNGTTDSRVLHASCPV 260

Query: 60  ALGNKWGKLL 69
            +GNKW K L
Sbjct: 261 LMGNKWSKYL 270


>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
          Length = 561

 Score = 73.6 bits (179), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|312383453|gb|EFR28539.1| hypothetical protein AND_03427 [Anopheles darlingi]
          Length = 341

 Score = 73.6 bits (179), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 32/60 (53%), Positives = 41/60 (68%)

Query: 6   MSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +S  +++DV  GGAT+FPSLNL + P KG+A FWHN HA+   DY   H+ CPV  G KW
Sbjct: 254 LSVGQMSDVTQGGATVFPSLNLALRPRKGTAAFWHNLHASGNGDYATRHAACPVLTGTKW 313



 Score = 70.5 bits (171), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 31/60 (51%), Positives = 40/60 (66%)

Query: 68  LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L +  ++DV  GGAT+FPSLNL + P KG+A FW+N HA+   DY   H+ CPV  G KW
Sbjct: 254 LSVGQMSDVTQGGATVFPSLNLALRPRKGTAAFWHNLHASGNGDYATRHAACPVLTGTKW 313


>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
 gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
          Length = 549

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/67 (47%), Positives = 45/67 (67%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+ G +L   L+DVE GGAT FP+L   + P+K +A FWYN HA+ + D R  H  CP
Sbjct: 445 MGNRIGTVLFY-LSDVEQGGATAFPALKQLLRPKKHAAAFWYNLHASGVGDARTMHGACP 503

Query: 121 VALGNKW 127
           + +G+KW
Sbjct: 504 IIVGSKW 510



 Score = 63.5 bits (153), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DVE GGAT FP+L   + P+K +A FW+N HA+ + D    H  CP+ +
Sbjct: 447 NRIGTVLFYLSDVEQGGATAFPALKQLLRPKKHAAAFWYNLHASGVGDARTMHGACPIIV 506

Query: 62  GNKW 65
           G+KW
Sbjct: 507 GSKW 510


>gi|195499025|ref|XP_002096772.1| GE25857 [Drosophila yakuba]
 gi|194182873|gb|EDW96484.1| GE25857 [Drosophila yakuba]
          Length = 490

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 37/79 (46%), Positives = 48/79 (60%), Gaps = 1/79 (1%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY  Y +      G++   LL   L +V+ GGATIFP +N+TV P+KGSAVFWYN H + 
Sbjct: 384 DYHKYTNTTRAKYGDRIATLLFY-LGEVDSGGATIFPRINITVTPKKGSAVFWYNLHNSG 442

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L     HS CPV  G+K+
Sbjct: 443 ALHLETLHSACPVISGSKY 461



 Score = 65.5 bits (158), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 30/55 (54%), Positives = 39/55 (70%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +V+ GGATIFP +N+TV P+KGSAVFW+N H +  L     HS CPV  G+K+
Sbjct: 407 LGEVDSGGATIFPRINITVTPKKGSAVFWYNLHNSGALHLETLHSACPVISGSKY 461


>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 575

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 488 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 546

Query: 122 ALGNKW 127
            +G KW
Sbjct: 547 LVGCKW 552



 Score = 67.0 bits (162), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 498 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 552


>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Loxodonta africana]
          Length = 534

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 447 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 505

Query: 122 ALGNKW 127
            +G KW
Sbjct: 506 LVGCKW 511



 Score = 67.0 bits (162), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 457 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 511


>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
          Length = 504

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 417 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 475

Query: 122 ALGNKW 127
            +G KW
Sbjct: 476 LVGCKW 481



 Score = 67.0 bits (162), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 427 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 481


>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
 gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
          Length = 538

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 29/59 (49%), Positives = 40/59 (67%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 457 FLNYMSDVEAGGATVFPDFGAAIYPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 515



 Score = 67.0 bits (162), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP     ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 461 MSDVEAGGATVFPDFGAAIYPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 515


>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           laevis]
 gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
          Length = 533

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 29/58 (50%), Positives = 40/58 (68%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+ ++DVE GGAT+FP     ++P KG+AVFWYN   +   DYR  H+ CPV +G+KW
Sbjct: 453 LNYMSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 510



 Score = 67.0 bits (162), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP     ++P KG+AVFW+N   +   DY   H+ CPV +G+KW
Sbjct: 456 MSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 510


>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
 gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
          Length = 536

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 45/66 (68%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DVE GGAT+FPS+ ++++P+KGSA FWYN H +   D    H+ CPV
Sbjct: 444 GNRIATVLFY-MSDVEQGGATVFPSIQVSLWPQKGSAAFWYNLHPSGDGDKMTRHAACPV 502

Query: 122 ALGNKW 127
             G+KW
Sbjct: 503 LTGSKW 508



 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+FPS+ ++++P+KGSA FW+N H +   D    H+ CPV  
Sbjct: 445 NRIATVLFYMSDVEQGGATVFPSIQVSLWPQKGSAAFWYNLHPSGDGDKMTRHAACPVLT 504

Query: 62  GNKW 65
           G+KW
Sbjct: 505 GSKW 508


>gi|432109537|gb|ELK33711.1| Prolyl 4-hydroxylase subunit alpha-2 [Myotis davidii]
          Length = 555

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 474 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532



 Score = 67.0 bits (162), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532


>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
           catus]
          Length = 535

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 67.0 bits (162), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 577

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 496 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 554



 Score = 67.0 bits (162), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 500 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 554


>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Loxodonta africana]
          Length = 536

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 455 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 513



 Score = 67.0 bits (162), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 459 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 513


>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_c [Rattus norvegicus]
          Length = 506

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 425 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 483



 Score = 67.0 bits (162), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 429 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 483


>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 506

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 425 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 483



 Score = 67.0 bits (162), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 429 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 483


>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
           catus]
 gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
           catus]
          Length = 533

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
           boliviensis boliviensis]
 gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
           boliviensis boliviensis]
 gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
           boliviensis boliviensis]
          Length = 533

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
           gorilla]
          Length = 565

 Score = 73.2 bits (178), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 478 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 536

Query: 122 ALGNKW 127
            +G KW
Sbjct: 537 LVGCKW 542



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 488 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 542


>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_c
           [Homo sapiens]
          Length = 565

 Score = 73.2 bits (178), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 478 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 536

Query: 122 ALGNKW 127
            +G KW
Sbjct: 537 LVGCKW 542



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 488 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 542


>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
 gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
 gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
          Length = 533

 Score = 73.2 bits (178), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
           garnettii]
          Length = 540

 Score = 73.2 bits (178), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 459 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 517



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 463 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 517


>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
           boliviensis boliviensis]
 gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
           boliviensis boliviensis]
          Length = 535

 Score = 73.2 bits (178), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
           garnettii]
          Length = 538

 Score = 73.2 bits (178), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 457 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 515



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 461 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 515


>gi|197215651|gb|ACH53042.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Otolemur garnettii]
          Length = 555

 Score = 73.2 bits (178), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 474 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532


>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_f
           [Homo sapiens]
          Length = 567

 Score = 73.2 bits (178), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 486 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 544



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 490 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 544


>gi|167045848|gb|ABZ10515.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Callithrix jacchus]
          Length = 555

 Score = 73.2 bits (178), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 474 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532


>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
 gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
 gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
           mulatta]
          Length = 535

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
           troglodytes]
 gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
           troglodytes]
 gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
           paniscus]
 gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
           paniscus]
 gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
          Length = 535

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
 gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
          Length = 534

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 33/73 (45%), Positives = 47/73 (64%), Gaps = 1/73 (1%)

Query: 55  SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
           SG  + LG++   +L   LTDVE GGAT+F  +  +V+P+ G+A+FWYN   N   D R 
Sbjct: 435 SGYSMDLGDRIATVLFY-LTDVEQGGATVFADVGYSVYPQAGTAIFWYNLDTNGKGDPRT 493

Query: 115 YHSGCPVALGNKW 127
            H+ CPV +G+KW
Sbjct: 494 RHAACPVIVGSKW 506



 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+F  +  +V+P+ G+A+FW+N   N   D    H+ CPV +G+KW
Sbjct: 452 LTDVEQGGATVFADVGYSVYPQAGTAIFWYNLDTNGKGDPRTRHAACPVIVGSKW 506


>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
           sapiens]
 gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
           sapiens]
 gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
 gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
 gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_b
           [Homo sapiens]
 gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_b
           [Homo sapiens]
          Length = 535

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
          Length = 535

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
           leucogenys]
          Length = 537

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 456 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 460 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514


>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
           leucogenys]
 gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
           leucogenys]
          Length = 535

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 448 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 506

Query: 122 ALGNKW 127
            +G KW
Sbjct: 507 LVGCKW 512



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
           abelii]
 gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 533

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
           abelii]
          Length = 535

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|170649696|gb|ACB21278.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Callicebus moloch]
          Length = 555

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 474 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532


>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
           troglodytes]
 gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
           troglodytes]
 gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
           troglodytes]
 gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
           paniscus]
 gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
           paniscus]
 gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
           paniscus]
 gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
          Length = 533

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
          Length = 533

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
 gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
 gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
 gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
 gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
          Length = 533

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 595

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 42/66 (63%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +   V+P+KGSAVFWYN   +   DY   H+ CPV
Sbjct: 508 GNRIATWLFY-MSDVAAGGATVFPDVGAAVWPQKGSAVFWYNLFTSGEGDYSTRHAACPV 566

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 567 LVGNKW 572



 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +   V+P+KGSAVFW+N   +   DY   H+ CPV +
Sbjct: 509 NRIATWLFYMSDVAAGGATVFPDVGAAVWPQKGSAVFWYNLFTSGEGDYSTRHAACPVLV 568

Query: 62  GNKW 65
           GNKW
Sbjct: 569 GNKW 572


>gi|426255748|ref|XP_004021510.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Ovis
           aries]
          Length = 516

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 429 GNRIATWLFY-MSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 487

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 488 LVGNKW 493



 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 430 NRIATWLFYMSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 489

Query: 62  GNKW 65
           GNKW
Sbjct: 490 GNKW 493


>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
           aries]
          Length = 534

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
           aries]
          Length = 534

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
          Length = 545

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 43/66 (65%), Gaps = 3/66 (4%)

Query: 65  WGKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           WG  + + L   ++VE GGAT+FP LNL ++P+KGSA FWYN H N   +    H+ CPV
Sbjct: 455 WGNRVATWLFYMSEVEAGGATVFPKLNLALWPQKGSAAFWYNLHPNGEGNELTRHAACPV 514

Query: 122 ALGNKW 127
             G+KW
Sbjct: 515 LTGSKW 520



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       +++VE GGAT+FP LNL ++P+KGSA FW+N H N   +    H+ CPV  
Sbjct: 457 NRVATWLFYMSEVEAGGATVFPKLNLALWPQKGSAAFWYNLHPNGEGNELTRHAACPVLT 516

Query: 62  GNKW 65
           G+KW
Sbjct: 517 GSKW 520


>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
          Length = 534

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
 gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
          Length = 534

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 448 NRIATWLFYMSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
 gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_a [Rattus norvegicus]
          Length = 535

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
           musculus]
 gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
           musculus]
          Length = 537

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 456 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 460 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514


>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
 gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
 gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
          Length = 537

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 456 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 460 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514


>gi|226874889|ref|NP_001152881.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Bos
           taurus]
 gi|296485624|tpg|DAA27739.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Bos taurus]
          Length = 535

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|426229219|ref|XP_004008688.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Ovis aries]
          Length = 535

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
          Length = 316

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LTDVE GGAT+FP++   VFP++GS V WYN   N  +D +  H+ CPV +G+KW
Sbjct: 240 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKW 294



 Score = 70.1 bits (170), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+FP++   VFP++GS V W+N   N  +D    H+ CPV +G+KW
Sbjct: 240 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKW 294


>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
 gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
          Length = 545

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DV  GGAT+FPS+ + + P+KG+A FWYN HA+   DY   H+ CPV
Sbjct: 454 GNRIATVLFY-MSDVSQGGATVFPSIRVALRPKKGTAAFWYNLHASGHGDYATRHAACPV 512

Query: 122 ALGNKW 127
             G KW
Sbjct: 513 LTGTKW 518



 Score = 69.7 bits (169), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FPS+ + + P+KG+A FW+N HA+   DY   H+ CPV  
Sbjct: 455 NRIATVLFYMSDVSQGGATVFPSIRVALRPKKGTAAFWYNLHASGHGDYATRHAACPVLT 514

Query: 62  GNKW 65
           G KW
Sbjct: 515 GTKW 518


>gi|66772331|gb|AAY55477.1| IP03959p [Drosophila melanogaster]
 gi|66772361|gb|AAY55492.1| IP03859p [Drosophila melanogaster]
          Length = 541

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 32/55 (58%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV LGGAT FP L L V P KGS + WYN H++T  D+R  H+GCPV  G+KW
Sbjct: 455 LNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 509



 Score = 68.2 bits (165), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 32/64 (50%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L DV LGGAT FP L L V P KGS + W+N H++T  D+   H+GCPV  
Sbjct: 446 NRLLTALFYLNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQ 505

Query: 62  GNKW 65
           G+KW
Sbjct: 506 GSKW 509


>gi|431892682|gb|ELK03115.1| Prolyl 4-hydroxylase subunit alpha-2 [Pteropus alecto]
          Length = 629

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 480 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 538



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 484 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 538


>gi|195505207|ref|XP_002099404.1| GE23380 [Drosophila yakuba]
 gi|194185505|gb|EDW99116.1| GE23380 [Drosophila yakuba]
          Length = 540

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 35/66 (53%), Positives = 42/66 (63%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   L DV LGGAT FP L L V P KGS + WYN H++T  D+R  H+GCPV
Sbjct: 444 GNRLATALFY-LNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPV 502

Query: 122 ALGNKW 127
             G+KW
Sbjct: 503 LQGSKW 508



 Score = 68.6 bits (166), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 32/64 (50%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L DV LGGAT FP L L V P KGS + W+N H++T  D+   H+GCPV  
Sbjct: 445 NRLATALFYLNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQ 504

Query: 62  GNKW 65
           G+KW
Sbjct: 505 GSKW 508


>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
           musculus]
 gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
 gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
 gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
          Length = 535

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 448 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 506

Query: 122 ALGNKW 127
            +G KW
Sbjct: 507 LVGCKW 512



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
           leucogenys]
          Length = 558

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 477 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 535



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 481 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 535


>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
           leucogenys]
          Length = 556

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 469 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 527

Query: 122 ALGNKW 127
            +G KW
Sbjct: 528 LVGCKW 533



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 479 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 533


>gi|301754231|ref|XP_002912939.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Ailuropoda
           melanoleuca]
          Length = 535

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
           kowalevskii]
          Length = 533

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 41/66 (62%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DVE GGAT+FP +   + PEKGSA FWYN   N   DY   H+ CPV
Sbjct: 445 GNRIATFLFY-MSDVEAGGATVFPQVGARLIPEKGSAAFWYNLLKNGEGDYSTRHAACPV 503

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 504 LVGSKW 509



 Score = 69.3 bits (168), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+FP +   + PEKGSA FW+N   N   DY   H+ CPV +
Sbjct: 446 NRIATFLFYMSDVEAGGATVFPQVGARLIPEKGSAAFWYNLLKNGEGDYSTRHAACPVLV 505

Query: 62  GNKW 65
           G+KW
Sbjct: 506 GSKW 509


>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
           musculus]
          Length = 593

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 506 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 564

Query: 122 ALGNKW 127
            +G KW
Sbjct: 565 LVGCKW 570



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 516 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 570


>gi|24651424|ref|NP_733376.1| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
 gi|23172697|gb|AAF57059.2| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
 gi|66772443|gb|AAY55533.1| IP03659p [Drosophila melanogaster]
 gi|220951214|gb|ACL88150.1| PH4alphaSG1-PA [synthetic construct]
 gi|220959938|gb|ACL92512.1| PH4alphaSG1-PA [synthetic construct]
          Length = 540

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/55 (58%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV LGGAT FP L L V P KGS + WYN H++T  D+R  H+GCPV  G+KW
Sbjct: 454 LNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 508



 Score = 68.2 bits (165), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 32/64 (50%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L DV LGGAT FP L L V P KGS + W+N H++T  D+   H+GCPV  
Sbjct: 445 NRLLTALFYLNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQ 504

Query: 62  GNKW 65
           G+KW
Sbjct: 505 GSKW 508


>gi|184185444|gb|ACC68850.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Rhinolophus ferrumequinum]
          Length = 555

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 474 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532


>gi|20269816|gb|AAM18063.1|AF495541_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG1
           [Drosophila melanogaster]
          Length = 540

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/55 (58%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV LGGAT FP L L V P KGS + WYN H++T  D+R  H+GCPV  G+KW
Sbjct: 454 LNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 508



 Score = 68.2 bits (165), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 32/64 (50%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L DV LGGAT FP L L V P KGS + W+N H++T  D+   H+GCPV  
Sbjct: 445 NRLLTALFYLNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQ 504

Query: 62  GNKW 65
           G+KW
Sbjct: 505 GSKW 508


>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 555

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 463 GNRIATWLFY-MSDVSAGGATVFPDVGASVGPQKGTAVFWYNLFASGEGDYSTRHAACPV 521

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 522 LVGNKW 527



 Score = 69.7 bits (169), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V P+KG+AVFW+N  A+   DY   H+ CPV +
Sbjct: 464 NRIATWLFYMSDVSAGGATVFPDVGASVGPQKGTAVFWYNLFASGEGDYSTRHAACPVLV 523

Query: 62  GNKW 65
           GNKW
Sbjct: 524 GNKW 527


>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
          Length = 535

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 448 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 506

Query: 122 ALGNKW 127
            +G KW
Sbjct: 507 LVGCKW 512



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Cricetulus griseus]
          Length = 535

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|281183175|ref|NP_001162504.1| prolyl 4-hydroxylase subunit alpha-2 [Papio anubis]
 gi|159461520|gb|ABW96795.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase, alpha
           polypeptide II, isoform 1 (predicted) [Papio anubis]
          Length = 578

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 497 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 555



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 501 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 555


>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
           taurus]
 gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
          Length = 533

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|229368743|gb|ACQ63024.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Dasypus novemcinctus]
          Length = 556

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 475 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 533



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 479 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 533


>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
          Length = 532

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Cricetulus griseus]
          Length = 533

 Score = 72.4 bits (176), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_d
           [Homo sapiens]
          Length = 488

 Score = 72.4 bits (176), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 401 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 459

Query: 122 ALGNKW 127
            +G KW
Sbjct: 460 LVGCKW 465



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 411 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 465


>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
           lupus familiaris]
          Length = 533

 Score = 72.4 bits (176), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
 gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
          Length = 525

 Score = 72.4 bits (176), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LTDVE GGAT+FP++   VFP++GS + WYN   N   D +  H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVIMWYNLQDNGQTDNKTLHAACPVIVGSKW 503



 Score = 68.6 bits (166), Expect = 9e-10,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+FP++   VFP++GS + W+N   N   D    H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVIMWYNLQDNGQTDNKTLHAACPVIVGSKW 503


>gi|195341548|ref|XP_002037368.1| GM12149 [Drosophila sechellia]
 gi|194131484|gb|EDW53527.1| GM12149 [Drosophila sechellia]
          Length = 537

 Score = 72.4 bits (176), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/55 (58%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV LGGAT FP L L V P KGS + WYN H++T  D+R  H+GCPV  G+KW
Sbjct: 451 LNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 505



 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 32/64 (50%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L DV LGGAT FP L L V P KGS + W+N H++T  D+   H+GCPV  
Sbjct: 442 NRLLTALFYLNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQ 501

Query: 62  GNKW 65
           G+KW
Sbjct: 502 GSKW 505


>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
           occidentalis]
          Length = 525

 Score = 72.4 bits (176), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DV+ GGAT+FP L LTV+PEKGSA FW+N H +   D    H+ CPV  G+KW
Sbjct: 448 MSDVKAGGATVFPRLGLTVWPEKGSAAFWWNLHRSGEGDILTRHAACPVLAGSKW 502



 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DV+ GGAT+FP L LTV+PEKGSA FW N H +   D    H+ CPV  G+KW
Sbjct: 448 MSDVKAGGATVFPRLGLTVWPEKGSAAFWWNLHRSGEGDILTRHAACPVLAGSKW 502


>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
          Length = 533

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP     ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 510


>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
 gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
          Length = 242

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 32/55 (58%), Positives = 40/55 (72%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L++VE GG T FP L L V PE+GS +FWYN H +  LDYR  H+GCPV +G+KW
Sbjct: 155 LSNVEAGGGTAFPFLPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLMGSKW 209



 Score = 68.2 bits (165), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 32/64 (50%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L++VE GG T FP L L V PE+GS +FW+N H +  LDY   H+GCPV +
Sbjct: 146 NRVATGIYYLSNVEAGGGTAFPFLPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLM 205

Query: 62  GNKW 65
           G+KW
Sbjct: 206 GSKW 209


>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 3 [Oryctolagus
           cuniculus]
          Length = 535

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
 gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
          Length = 550

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 32/69 (46%), Positives = 44/69 (63%), Gaps = 1/69 (1%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           + LGN+   +L   ++DVE GGAT+F SL+  +FP+KG+A FW N H +   D R  H+ 
Sbjct: 455 INLGNRIATVLFY-MSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAA 513

Query: 119 CPVALGNKW 127
           CPV  G KW
Sbjct: 514 CPVLTGTKW 522



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+F SL+  +FP+KG+A FW N H +   D    H+ CPV  
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLT 518

Query: 62  GNKW 65
           G KW
Sbjct: 519 GTKW 522


>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
           carolinensis]
          Length = 554

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 29/59 (49%), Positives = 40/59 (67%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 473 FLNYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 531



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP     ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 477 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 531


>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 1 [Oryctolagus
           cuniculus]
          Length = 533

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
           araneus]
          Length = 533

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Ovis aries]
          Length = 487

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 400 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 458

Query: 122 ALGNKW 127
            +G KW
Sbjct: 459 LVGCKW 464



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 410 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 464


>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
          Length = 487

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 400 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 458

Query: 122 ALGNKW 127
            +G KW
Sbjct: 459 LVGCKW 464



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 410 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 464


>gi|291387302|ref|XP_002710242.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 2 [Oryctolagus
           cuniculus]
 gi|217273039|gb|ACK28132.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Oryctolagus cuniculus]
          Length = 555

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 474 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532


>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
 gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
          Length = 534

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 447 GNRLATFL-NYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPV 505

Query: 122 ALGNKW 127
            +G KW
Sbjct: 506 LVGCKW 511



 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP     ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 457 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 511


>gi|351706369|gb|EHB09288.1| Prolyl 4-hydroxylase subunit alpha-2 [Heterocephalus glaber]
          Length = 535

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
           gallopavo]
          Length = 535

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 448 GNRLATFL-NYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPV 506

Query: 122 ALGNKW 127
            +G KW
Sbjct: 507 LVGCKW 512



 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP     ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 512


>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
          Length = 538

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 29/59 (49%), Positives = 40/59 (67%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 458 FLNYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 516



 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP     ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 462 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 516


>gi|240974259|ref|XP_002401836.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215491070|gb|EEC00711.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 490

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 30/58 (51%), Positives = 40/58 (68%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+ ++DV  GGAT+FP L LT++PEKG+A FWYN H +   D    H+ CPV  G+KW
Sbjct: 410 LNYMSDVPAGGATVFPQLRLTLWPEKGAAAFWYNLHRSGEGDMLTRHAACPVLAGSKW 467



 Score = 69.7 bits (169), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DV  GGAT+FP L LT++PEKG+A FW+N H +   D    H+ CPV  G+KW
Sbjct: 413 MSDVPAGGATVFPQLRLTLWPEKGAAAFWYNLHRSGEGDMLTRHAACPVLAGSKW 467


>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
           niloticus]
          Length = 536

 Score = 72.0 bits (175), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 28/59 (47%), Positives = 40/59 (67%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP     ++P KG++VFWYN   +   DYR  H+ CPV +G+KW
Sbjct: 455 FLNYMSDVEAGGATVFPDFGAAIWPRKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 513



 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP     ++P KG++VFW+N   +   DY   H+ CPV +G+KW
Sbjct: 459 MSDVEAGGATVFPDFGAAIWPRKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 513


>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
          Length = 538

 Score = 72.0 bits (175), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 29/59 (49%), Positives = 40/59 (67%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 457 FLNYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 515



 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP     ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 461 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 515


>gi|348557542|ref|XP_003464578.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Cavia porcellus]
          Length = 535

 Score = 72.0 bits (175), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512



 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Cavia porcellus]
          Length = 533

 Score = 72.0 bits (175), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504

Query: 122 ALGNKW 127
            +G KW
Sbjct: 505 LVGCKW 510



 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
           guttata]
          Length = 539

 Score = 72.0 bits (175), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L + ++DVE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV
Sbjct: 452 GNRLATFL-NYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPV 510

Query: 122 ALGNKW 127
            +G KW
Sbjct: 511 LVGCKW 516



 Score = 65.9 bits (159), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP     ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 462 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 516


>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1
          Length = 516

 Score = 72.0 bits (175), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV
Sbjct: 429 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 487

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 488 LVGNKW 493



 Score = 68.9 bits (167), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 430 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 489

Query: 62  GNKW 65
           GNKW
Sbjct: 490 GNKW 493


>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
           gallus]
          Length = 489

 Score = 72.0 bits (175), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV
Sbjct: 402 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 460

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 461 LVGNKW 466



 Score = 68.9 bits (167), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 403 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 462

Query: 62  GNKW 65
           GNKW
Sbjct: 463 GNKW 466


>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
           [Drosophila melanogaster]
 gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
 gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
          Length = 550

 Score = 72.0 bits (175), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 32/67 (47%), Positives = 43/67 (64%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LGN+   +L   ++DVE GGAT+F SL+  +FP+KG+A FW N H +   D R  H+ CP
Sbjct: 457 LGNRIATVLFY-MSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACP 515

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 516 VLTGTKW 522



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+F SL+  +FP+KG+A FW N H +   D    H+ CPV  
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLT 518

Query: 62  GNKW 65
           G KW
Sbjct: 519 GTKW 522


>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
 gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
          Length = 550

 Score = 72.0 bits (175), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 32/67 (47%), Positives = 43/67 (64%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LGN+   +L   ++DVE GGAT+F SL+  +FP+KG+A FW N H +   D R  H+ CP
Sbjct: 457 LGNRIATVLFY-MSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACP 515

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 516 VLTGTKW 522



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+F SL+  +FP+KG+A FW N H +   D    H+ CPV  
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLT 518

Query: 62  GNKW 65
           G KW
Sbjct: 519 GTKW 522


>gi|194905436|ref|XP_001981196.1| GG11753 [Drosophila erecta]
 gi|190655834|gb|EDV53066.1| GG11753 [Drosophila erecta]
          Length = 550

 Score = 72.0 bits (175), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 32/67 (47%), Positives = 43/67 (64%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LGN+   +L   ++DVE GGAT+F SL+  +FP+KG+A FW N H +   D R  H+ CP
Sbjct: 457 LGNRIATVLFY-MSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACP 515

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 516 VLTGTKW 522



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+F SL+  +FP+KG+A FW N H +   D    H+ CPV  
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLT 518

Query: 62  GNKW 65
           G KW
Sbjct: 519 GTKW 522


>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
           vitripennis]
          Length = 556

 Score = 72.0 bits (175), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 30/66 (45%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DVE GG T+F  +N++++P+KGSA FWYN   N   DY+  H+ CPV
Sbjct: 461 GNRIATVLYY-MSDVEQGGGTVFTKINISLWPKKGSAAFWYNLKPNGEGDYKTRHAACPV 519

Query: 122 ALGNKW 127
             G+KW
Sbjct: 520 LTGSKW 525



 Score = 67.8 bits (164), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GG T+F  +N++++P+KGSA FW+N   N   DY   H+ CPV  
Sbjct: 462 NRIATVLYYMSDVEQGGGTVFTKINISLWPKKGSAAFWYNLKPNGEGDYKTRHAACPVLT 521

Query: 62  GNKW 65
           G+KW
Sbjct: 522 GSKW 525


>gi|326923465|ref|XP_003207956.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 3
           [Meleagris gallopavo]
          Length = 518

 Score = 72.0 bits (175), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV
Sbjct: 431 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 489

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 490 LVGNKW 495



 Score = 68.9 bits (167), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 432 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 491

Query: 62  GNKW 65
           GNKW
Sbjct: 492 GNKW 495


>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
           gallus]
          Length = 536

 Score = 72.0 bits (175), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 508 LVGNKW 513



 Score = 68.9 bits (167), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509

Query: 62  GNKW 65
           GNKW
Sbjct: 510 GNKW 513


>gi|195452730|ref|XP_002073475.1| GK13125 [Drosophila willistoni]
 gi|194169560|gb|EDW84461.1| GK13125 [Drosophila willistoni]
          Length = 539

 Score = 71.6 bits (174), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 31/55 (56%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L++VE GG T FP L L V PE+GS VFWYN H +   DYR  H+ CPV  G+KW
Sbjct: 452 LSEVEAGGGTAFPFLPLLVTPERGSLVFWYNLHRSGDQDYRTKHAACPVLQGSKW 506



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L++VE GG T FP L L V PE+GS VFW+N H +   DY   H+ CPV  
Sbjct: 443 NRIATGIYYLSEVEAGGGTAFPFLPLLVTPERGSLVFWYNLHRSGDQDYRTKHAACPVLQ 502

Query: 62  GNKW 65
           G+KW
Sbjct: 503 GSKW 506


>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
           gallus]
          Length = 536

 Score = 71.6 bits (174), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 508 LVGNKW 513



 Score = 68.9 bits (167), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509

Query: 62  GNKW 65
           GNKW
Sbjct: 510 GNKW 513


>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
           gallus]
          Length = 536

 Score = 71.6 bits (174), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 508 LVGNKW 513



 Score = 68.9 bits (167), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509

Query: 62  GNKW 65
           GNKW
Sbjct: 510 GNKW 513


>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Meleagris gallopavo]
          Length = 536

 Score = 71.6 bits (174), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 508 LVGNKW 513



 Score = 68.9 bits (167), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509

Query: 62  GNKW 65
           GNKW
Sbjct: 510 GNKW 513


>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
 gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
           adhaerens]
          Length = 495

 Score = 71.6 bits (174), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 40/55 (72%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L++VE+GGATIFP L + V P+KGSA+FW N   N L + +  H+ CPV +G+KW
Sbjct: 418 LSNVEIGGATIFPRLGVAVRPQKGSALFWINLKRNGLTNRQTLHAACPVVIGSKW 472



 Score = 69.7 bits (169), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L++VE+GGATIFP L + V P+KGSA+FW N   N L +    H+ CPV +G+KW
Sbjct: 418 LSNVEIGGATIFPRLGVAVRPQKGSALFWINLKRNGLTNRQTLHAACPVVIGSKW 472


>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
           gallus]
          Length = 536

 Score = 71.6 bits (174), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 508 LVGNKW 513



 Score = 68.9 bits (167), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509

Query: 62  GNKW 65
           GNKW
Sbjct: 510 GNKW 513


>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Meleagris gallopavo]
          Length = 536

 Score = 71.6 bits (174), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 508 LVGNKW 513



 Score = 68.9 bits (167), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509

Query: 62  GNKW 65
           GNKW
Sbjct: 510 GNKW 513


>gi|194765194|ref|XP_001964712.1| GF22904 [Drosophila ananassae]
 gi|190614984|gb|EDV30508.1| GF22904 [Drosophila ananassae]
          Length = 547

 Score = 71.6 bits (174), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 32/67 (47%), Positives = 43/67 (64%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LGN+   +L   ++DVE GGAT+F SL+  +FP+KG+A FW N H +   D R  H+ CP
Sbjct: 454 LGNRIATVLFY-MSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGEGDVRTRHAACP 512

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 513 VLTGTKW 519



 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+F SL+  +FP+KG+A FW N H +   D    H+ CPV  
Sbjct: 456 NRIATVLFYMSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGEGDVRTRHAACPVLT 515

Query: 62  GNKW 65
           G KW
Sbjct: 516 GTKW 519


>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
 gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
          Length = 534

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 36/90 (40%), Positives = 51/90 (56%), Gaps = 8/90 (8%)

Query: 38  FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
           F  + H +T   Y M        LG++   +L   LTDVE GGAT+F  +  +V+P+ G+
Sbjct: 425 FASSNHTDTRSSYSM-------DLGDRIATVLFY-LTDVEQGGATVFADVGYSVYPQAGT 476

Query: 98  AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           A+FWYN   N   D R  H+ CPV +G+KW
Sbjct: 477 AIFWYNLDTNGKGDPRTKHAACPVIVGSKW 506



 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+F  +  +V+P+ G+A+FW+N   N   D    H+ CPV +G+KW
Sbjct: 452 LTDVEQGGATVFADVGYSVYPQAGTAIFWYNLDTNGKGDPRTKHAACPVIVGSKW 506


>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
          Length = 536

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P KG+AVFWYN   +   DY   H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPV 507

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 508 LVGNKW 513



 Score = 68.9 bits (167), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509

Query: 62  GNKW 65
           GNKW
Sbjct: 510 GNKW 513


>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
           carolinensis]
          Length = 542

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P KG+AVFWYN   +   DY   H+ CPV
Sbjct: 455 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPV 513

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 514 LVGNKW 519



 Score = 68.9 bits (167), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 456 NRIATWLFYMSDVSAGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLV 515

Query: 62  GNKW 65
           GNKW
Sbjct: 516 GNKW 519


>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
           guttata]
          Length = 536

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P KG+AVFWYN   +   DY   H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPV 507

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 508 LVGNKW 513



 Score = 68.9 bits (167), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509

Query: 62  GNKW 65
           GNKW
Sbjct: 510 GNKW 513


>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
 gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
          Length = 535

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/55 (56%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DVE GG T FP L L V PEKGS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 448 LSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502



 Score = 67.0 bits (162), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DVE GG T FP L L V PEKGS +FW+N H +   D+   H+ CPV  
Sbjct: 439 NRIATGIYYLSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQ 498

Query: 62  GNKW 65
           G+KW
Sbjct: 499 GSKW 502


>gi|195505190|ref|XP_002099397.1| GE10881 [Drosophila yakuba]
 gi|194185498|gb|EDW99109.1| GE10881 [Drosophila yakuba]
          Length = 487

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 32/67 (47%), Positives = 42/67 (62%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LGN+   +L   ++DVE GGAT+F SL+  +FP KG+A FW N H +   D R  H+ CP
Sbjct: 394 LGNRIATVLFY-MSDVEQGGATVFTSLHTALFPRKGTAAFWMNLHRDGQGDVRTRHAACP 452

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 453 VLTGTKW 459



 Score = 65.9 bits (159), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+F SL+  +FP KG+A FW N H +   D    H+ CPV  
Sbjct: 396 NRIATVLFYMSDVEQGGATVFTSLHTALFPRKGTAAFWMNLHRDGQGDVRTRHAACPVLT 455

Query: 62  GNKW 65
           G KW
Sbjct: 456 GTKW 459


>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
          Length = 543

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV
Sbjct: 456 GNRIATWLFY-MSDVAAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 514

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 515 LVGNKW 520



 Score = 68.6 bits (166), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +  +V+P+KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 457 NRIATWLFYMSDVAAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 516

Query: 62  GNKW 65
           GNKW
Sbjct: 517 GNKW 520


>gi|221512818|ref|NP_730346.2| CG32201 [Drosophila melanogaster]
 gi|220902638|gb|AAN11679.2| CG32201 [Drosophila melanogaster]
          Length = 520

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 45/67 (67%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LG++   +L    ++V  GGAT+FP +N+TVFP+KGS ++W+N H +   D R  HS CP
Sbjct: 436 LGDRLASILFYA-SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDIRSLHSVCP 494

Query: 121 VALGNKW 127
           V  G++W
Sbjct: 495 VLNGDRW 501



 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 37/54 (68%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++V  GGAT+FP +N+TVFP+KGS ++W N H +   D    HS CPV  G++W
Sbjct: 448 SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDIRSLHSVCPVLNGDRW 501


>gi|66770649|gb|AAY54636.1| IP12415p [Drosophila melanogaster]
 gi|66772017|gb|AAY55320.1| IP12615p [Drosophila melanogaster]
          Length = 512

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 45/67 (67%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LG++   +L    ++V  GGAT+FP +N+TVFP+KGS ++W+N H +   D R  HS CP
Sbjct: 428 LGDRLASILFYA-SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDIRSLHSVCP 486

Query: 121 VALGNKW 127
           V  G++W
Sbjct: 487 VLNGDRW 493



 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 37/54 (68%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++V  GGAT+FP +N+TVFP+KGS ++W N H +   D    HS CPV  G++W
Sbjct: 440 SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDIRSLHSVCPVLNGDRW 493


>gi|66771935|gb|AAY55279.1| IP12715p [Drosophila melanogaster]
          Length = 451

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 45/67 (67%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LG++   +L    ++V  GGAT+FP +N+TVFP+KGS ++W+N H +   D R  HS CP
Sbjct: 367 LGDRLASILFYA-SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDIRSLHSVCP 425

Query: 121 VALGNKW 127
           V  G++W
Sbjct: 426 VLNGDRW 432



 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 37/54 (68%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++V  GGAT+FP +N+TVFP+KGS ++W N H +   D    HS CPV  G++W
Sbjct: 379 SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDIRSLHSVCPVLNGDRW 432


>gi|195330780|ref|XP_002032081.1| GM23710 [Drosophila sechellia]
 gi|194121024|gb|EDW43067.1| GM23710 [Drosophila sechellia]
          Length = 490

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 51/79 (64%), Gaps = 1/79 (1%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D+  Y +   +  G++   +L   L +V+ GGATIFP LN+TV P+KGSAVFWYN H + 
Sbjct: 384 DFHEYTNTTRLKRGDRIATVLFY-LGEVDSGGATIFPMLNITVTPKKGSAVFWYNLHNSG 442

Query: 109 LLDYRMYHSGCPVALGNKW 127
            ++ +  HS CPV  G+K+
Sbjct: 443 AVNSKTLHSACPVISGSKY 461



 Score = 65.1 bits (157), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 30/55 (54%), Positives = 40/55 (72%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +V+ GGATIFP LN+TV P+KGSAVFW+N H +  ++    HS CPV  G+K+
Sbjct: 407 LGEVDSGGATIFPMLNITVTPKKGSAVFWYNLHNSGAVNSKTLHSACPVISGSKY 461


>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
 gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
          Length = 525

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LTDVE GGAT+FP++   +FP+KG+AV WYN   N   D +  H+ CPV +G+KW
Sbjct: 450 LTDVEQGGATVFPNIKQAIFPKKGTAVMWYNLRHNNDGDPQTLHAACPVIVGSKW 504



 Score = 68.2 bits (165), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+FP++   +FP+KG+AV W+N   N   D    H+ CPV +G+KW
Sbjct: 450 LTDVEQGGATVFPNIKQAIFPKKGTAVMWYNLRHNNDGDPQTLHAACPVIVGSKW 504


>gi|345326417|ref|XP_001510155.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
           [Ornithorhynchus anatinus]
          Length = 888

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 29/59 (49%), Positives = 40/59 (67%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 807 FLNYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 865



 Score = 65.5 bits (158), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP     ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 811 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 865


>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
 gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
          Length = 535

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/55 (56%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DVE GG T FP L L V PEKGS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 448 LSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502



 Score = 67.0 bits (162), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DVE GG T FP L L V PEKGS +FW+N H +   D+   H+ CPV  
Sbjct: 439 NRIATGIYYLSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQ 498

Query: 62  GNKW 65
           G+KW
Sbjct: 499 GSKW 502


>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
 gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
          Length = 535

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 31/55 (56%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DVE GG T FP L L V PEKGS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 448 LSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502



 Score = 67.0 bits (162), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DVE GG T FP L L V PEKGS +FW+N H +   D+   H+ CPV  
Sbjct: 439 NRIATGIYYLSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQ 498

Query: 62  GNKW 65
           G+KW
Sbjct: 499 GSKW 502


>gi|195572621|ref|XP_002104294.1| GD18523 [Drosophila simulans]
 gi|194200221|gb|EDX13797.1| GD18523 [Drosophila simulans]
          Length = 490

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 51/79 (64%), Gaps = 1/79 (1%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D+  Y +   +  G++   +L   L +V+ GGATIFP LN+TV P+KGSAVFWYN H + 
Sbjct: 384 DFHEYTNTTRLKQGDRIATVLFY-LGEVDSGGATIFPMLNITVTPKKGSAVFWYNLHNSG 442

Query: 109 LLDYRMYHSGCPVALGNKW 127
            ++ +  HS CPV  G+K+
Sbjct: 443 AVNSKTLHSACPVISGSKY 461



 Score = 65.1 bits (157), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 30/55 (54%), Positives = 40/55 (72%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +V+ GGATIFP LN+TV P+KGSAVFW+N H +  ++    HS CPV  G+K+
Sbjct: 407 LGEVDSGGATIFPMLNITVTPKKGSAVFWYNLHNSGAVNSKTLHSACPVISGSKY 461


>gi|195591302|ref|XP_002085381.1| GD14757 [Drosophila simulans]
 gi|194197390|gb|EDX10966.1| GD14757 [Drosophila simulans]
          Length = 525

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 45/67 (67%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LG++   +L    ++V  GGAT+FP +N+TVFP+KGS ++W+N H +   D R  HS CP
Sbjct: 436 LGDRLASILFYA-SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSKHSVCP 494

Query: 121 VALGNKW 127
           V  G++W
Sbjct: 495 VINGDRW 501



 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 37/54 (68%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++V  GGAT+FP +N+TVFP+KGS ++W N H +   D    HS CPV  G++W
Sbjct: 448 SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSKHSVCPVINGDRW 501


>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
 gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
          Length = 528

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LTDVE GGAT+FP++   VFP++GS + WYN   +  +D +  H+ CPV +G+KW
Sbjct: 452 LTDVEQGGATVFPNIRKAVFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGSKW 506



 Score = 68.2 bits (165), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+FP++   VFP++GS + W+N   +  +D    H+ CPV +G+KW
Sbjct: 452 LTDVEQGGATVFPNIRKAVFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGSKW 506


>gi|195352182|ref|XP_002042593.1| GM14980 [Drosophila sechellia]
 gi|194124477|gb|EDW46520.1| GM14980 [Drosophila sechellia]
          Length = 520

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 45/67 (67%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LG++   +L    ++V  GGAT+FP +N+TVFP+KGS ++W+N H +   D R  HS CP
Sbjct: 436 LGDRLASILFYA-SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSKHSVCP 494

Query: 121 VALGNKW 127
           V  G++W
Sbjct: 495 VINGDRW 501



 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 37/54 (68%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++V  GGAT+FP +N+TVFP+KGS ++W N H +   D    HS CPV  G++W
Sbjct: 448 SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSKHSVCPVINGDRW 501


>gi|78706702|ref|NP_001027154.1| CG18749 [Drosophila melanogaster]
 gi|21429852|gb|AAM50604.1| GH05783p [Drosophila melanogaster]
 gi|23175900|gb|AAN14309.1| CG18749 [Drosophila melanogaster]
 gi|220956638|gb|ACL90862.1| CG18749-PB [synthetic construct]
          Length = 491

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 51/79 (64%), Gaps = 1/79 (1%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D+  Y +   +  G++   +L   L +V+ GGAT+FP LN+TV P+KGSAVFWYN H + 
Sbjct: 385 DFHEYTNTTRLKQGDRIATVLFY-LREVDSGGATVFPMLNITVMPKKGSAVFWYNLHNSG 443

Query: 109 LLDYRMYHSGCPVALGNKW 127
            ++ +  H+ CPV  G+K+
Sbjct: 444 AVNSKTLHTACPVISGSKY 462



 Score = 65.5 bits (158), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 40/55 (72%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +V+ GGAT+FP LN+TV P+KGSAVFW+N H +  ++    H+ CPV  G+K+
Sbjct: 408 LREVDSGGATVFPMLNITVMPKKGSAVFWYNLHNSGAVNSKTLHTACPVISGSKY 462


>gi|47210159|emb|CAF93191.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 78

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DVE GGAT+FP     ++P KG+AVFWYN   +   DYR  H+ CPV +GNKW
Sbjct: 1   MSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGNKW 55



 Score = 65.9 bits (159), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 37/55 (67%)

Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
          ++DVE GGAT+FP     ++P KG+AVFW+N   +   DY   H+ CPV +GNKW
Sbjct: 1  MSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGNKW 55


>gi|442747045|gb|JAA65682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 538

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 34/83 (40%), Positives = 51/83 (61%), Gaps = 6/83 (7%)

Query: 45  NTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 104
           N+  DY++ +       G++   L++  +TDVE GGAT+FPSL + + P+KG A FW+N 
Sbjct: 433 NSSADYYLQY-----GTGDRLATLMIY-MTDVEEGGATVFPSLGIRLTPKKGDAAFWWNL 486

Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
            A+   D    H+GCPV  G+KW
Sbjct: 487 KASGEGDRLTTHAGCPVLYGSKW 509



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 29/57 (50%), Positives = 38/57 (66%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + +TDVE GGAT+FPSL + + P+KG A FW N  A+   D    H+GCPV  G+KW
Sbjct: 453 IYMTDVEEGGATVFPSLGIRLTPKKGDAAFWWNLKASGEGDRLTTHAGCPVLYGSKW 509


>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
          Length = 528

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 3/66 (4%)

Query: 65  WGKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           WG  + + L   +DVE GGAT+FP++   ++P+KGSA FWYN   N   D    H+GCPV
Sbjct: 440 WGNRIATWLFYMSDVEAGGATVFPAVGAALWPQKGSAAFWYNLRPNGNGDEDTLHAGCPV 499

Query: 122 ALGNKW 127
             G+KW
Sbjct: 500 LTGSKW 505



 Score = 67.0 bits (162), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+FP++   ++P+KGSA FW+N   N   D    H+GCPV  
Sbjct: 442 NRIATWLFYMSDVEAGGATVFPAVGAALWPQKGSAAFWYNLRPNGNGDEDTLHAGCPVLT 501

Query: 62  GNKW 65
           G+KW
Sbjct: 502 GSKW 505


>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
 gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
          Length = 510

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 4/79 (5%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D++  H   P  +GN+   +L   LTDVE GGAT+F      V P +G+A+FWYN H + 
Sbjct: 413 DWFTQH---PEVMGNRLATVLFY-LTDVEQGGATMFNKAEHKVLPRRGTALFWYNLHTDG 468

Query: 109 LLDYRMYHSGCPVALGNKW 127
             D+   H+ CP+ +G+KW
Sbjct: 469 EGDWSTTHAACPIIVGSKW 487



 Score = 63.5 bits (153), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       LTDVE GGAT+F      V P +G+A+FW+N H +   D+   H+ CP+ +
Sbjct: 424 NRLATVLFYLTDVEQGGATMFNKAEHKVLPRRGTALFWYNLHTDGEGDWSTTHAACPIIV 483

Query: 62  GNKW 65
           G+KW
Sbjct: 484 GSKW 487


>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
 gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
          Length = 535

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 4/79 (5%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D++  H   P  +GN+   +L   LTDVE GGAT+F      V P +G+A+FWYN H + 
Sbjct: 438 DWFTQH---PEVMGNRLATVLFY-LTDVEQGGATMFNKAEHKVLPRRGTALFWYNLHTDG 493

Query: 109 LLDYRMYHSGCPVALGNKW 127
             D+   H+ CP+ +G+KW
Sbjct: 494 EGDWSTTHAACPIIVGSKW 512



 Score = 63.5 bits (153), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       LTDVE GGAT+F      V P +G+A+FW+N H +   D+   H+ CP+ +
Sbjct: 449 NRLATVLFYLTDVEQGGATMFNKAEHKVLPRRGTALFWYNLHTDGEGDWSTTHAACPIIV 508

Query: 62  GNKW 65
           G+KW
Sbjct: 509 GSKW 512


>gi|194760358|ref|XP_001962408.1| GF14452 [Drosophila ananassae]
 gi|190616105|gb|EDV31629.1| GF14452 [Drosophila ananassae]
          Length = 498

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 28/54 (51%), Positives = 41/54 (75%)

Query: 74  TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +DVE+GGATIFP+ N+++ P+KGSA+FWYN H +   +    H+ CP+ LGN+W
Sbjct: 426 SDVEMGGATIFPAANISIKPKKGSALFWYNLHNDWEPNPLSRHAVCPMVLGNRW 479



 Score = 68.9 bits (167), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 27/54 (50%), Positives = 41/54 (75%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +DVE+GGATIFP+ N+++ P+KGSA+FW+N H +   +    H+ CP+ LGN+W
Sbjct: 426 SDVEMGGATIFPAANISIKPKKGSALFWYNLHNDWEPNPLSRHAVCPMVLGNRW 479


>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           impatiens]
          Length = 557

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 45/66 (68%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DVE GG T+F ++N++++P+KGSA FWYN   N   D++  H+ CPV
Sbjct: 462 GNRIATVLYY-MSDVEQGGGTVFTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPV 520

Query: 122 ALGNKW 127
             G+KW
Sbjct: 521 LTGSKW 526



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GG T+F ++N++++P+KGSA FW+N   N   D+   H+ CPV  
Sbjct: 463 NRIATVLYYMSDVEQGGGTVFTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLT 522

Query: 62  GNKW 65
           G+KW
Sbjct: 523 GSKW 526


>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           terrestris]
          Length = 557

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 45/66 (68%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DVE GG T+F ++N++++P+KGSA FWYN   N   D++  H+ CPV
Sbjct: 462 GNRIATVLYY-MSDVEQGGGTVFTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPV 520

Query: 122 ALGNKW 127
             G+KW
Sbjct: 521 LTGSKW 526



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GG T+F ++N++++P+KGSA FW+N   N   D+   H+ CPV  
Sbjct: 463 NRIATVLYYMSDVEQGGGTVFTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLT 522

Query: 62  GNKW 65
           G+KW
Sbjct: 523 GSKW 526


>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
 gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
          Length = 513

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DVE GGAT+FP +   VFP++G+A+ WYN   N   D    H+ CPV +G+KW
Sbjct: 439 LSDVEQGGATVFPKIQKAVFPQRGTAIIWYNLKENGDFDTNTIHAACPVIVGSKW 493



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DVE GGAT+FP +   VFP++G+A+ W+N   N   D    H+ CPV +G+KW
Sbjct: 439 LSDVEQGGATVFPKIQKAVFPQRGTAIIWYNLKENGDFDTNTIHAACPVIVGSKW 493


>gi|195494563|ref|XP_002094891.1| GE19960 [Drosophila yakuba]
 gi|194180992|gb|EDW94603.1| GE19960 [Drosophila yakuba]
          Length = 152

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 29/62 (46%), Positives = 42/62 (67%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L+S   +V  GGAT+FP +N+TVFP+KGS ++W+N H +   D R  H  CPV  G++W 
Sbjct: 54  LISKAGEVRQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSQHLVCPVVNGDRWS 113

Query: 129 KL 130
           +L
Sbjct: 114 RL 115



 Score = 64.7 bits (156), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 37/56 (66%)

Query: 13  DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKL 68
           +V  GGAT+FP +N+TVFP+KGS ++W N H +   D    H  CPV  G++W +L
Sbjct: 60  EVRQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSQHLVCPVVNGDRWSRL 115


>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
 gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
          Length = 511

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 41/125 (32%), Positives = 63/125 (50%), Gaps = 16/125 (12%)

Query: 3   RAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG 62
           R E    ++ D+ L G+  F  +N  +  +  +   W    A+T+ D             
Sbjct: 378 RIERRIADMVDLNLEGSEPFYVINYGIGGQYKA--HWDFFGADTVED------------- 422

Query: 63  NKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 122
           N+   +L   + DVE GGAT+FP L  TV  ++G+A+FWYN   N  +D R  H GCP+ 
Sbjct: 423 NRLATVLFY-MNDVEQGGATVFPRLGQTVRAKRGNALFWYNMQHNGTVDDRTLHGGCPIL 481

Query: 123 LGNKW 127
           +G+KW
Sbjct: 482 VGSKW 486



 Score = 65.9 bits (159), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       + DVE GGAT+FP L  TV  ++G+A+FW+N   N  +D    H GCP+ +
Sbjct: 423 NRLATVLFYMNDVEQGGATVFPRLGQTVRAKRGNALFWYNMQHNGTVDDRTLHGGCPILV 482

Query: 62  GNKW 65
           G+KW
Sbjct: 483 GSKW 486


>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
 gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
          Length = 539

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L++VE GG T FP L L V PE+GS +FWYN H +   DYR  H+ CPV  G+KW
Sbjct: 452 LSNVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDADYRTKHAACPVLQGSKW 506



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L++VE GG T FP L L V PE+GS +FW+N H +   DY   H+ CPV  
Sbjct: 443 NRLATGIYYLSNVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDADYRTKHAACPVLQ 502

Query: 62  GNKW 65
           G+KW
Sbjct: 503 GSKW 506


>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
 gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 31/67 (46%), Positives = 45/67 (67%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G   G +L   ++D++ GGATIFPS+N+TV P KGSA+ W+N + +   D R  HS CP
Sbjct: 439 MGGPLGTILFY-VSDLQQGGATIFPSINITVSPRKGSALLWFNLYDDGEPDPRTLHSSCP 497

Query: 121 VALGNKW 127
           V  G++W
Sbjct: 498 VIEGDRW 504



 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++D++ GGATIFPS+N+TV P KGSA+ W N + +   D    HS CPV  G++W
Sbjct: 450 VSDLQQGGATIFPSINITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGDRW 504


>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
 gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
          Length = 509

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DV  GGATIFP L L+VFP+KGSA+ WYN       D R  HS CP  +G++W
Sbjct: 434 LSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACPTVVGSRW 488



 Score = 65.5 bits (158), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   +   L+DV  GGATIFP L L+VFP+KGSA+ W+N       D    HS CP  +
Sbjct: 425 DRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACPTVV 484

Query: 62  GNKW 65
           G++W
Sbjct: 485 GSRW 488


>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
 gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
          Length = 535

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DVE GG T FP L L V PE+GS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 448 LSDVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L+DVE GG T FP L L V PE+GS +FW+N H +   D+   H+ CPV  
Sbjct: 439 NRIATAIYYLSDVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQ 498

Query: 62  GNKW 65
           G+KW
Sbjct: 499 GSKW 502


>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
          Length = 415

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DVE GG T+F ++N++++P KGSA FWYN   N   D++  H+ CPV
Sbjct: 320 GNRIATVLYY-MSDVEQGGGTVFTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPV 378

Query: 122 ALGNKW 127
             G+KW
Sbjct: 379 LTGSKW 384



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GG T+F ++N++++P KGSA FW+N   N   D+   H+ CPV  
Sbjct: 321 NRIATVLYYMSDVEQGGGTVFTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLT 380

Query: 62  GNKW 65
           G+KW
Sbjct: 381 GSKW 384


>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
          Length = 476

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DVE GG T+F ++N++++P KGSA FWYN   N   D++  H+ CPV
Sbjct: 381 GNRIATVLYY-MSDVEQGGGTVFTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPV 439

Query: 122 ALGNKW 127
             G+KW
Sbjct: 440 LTGSKW 445



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GG T+F ++N++++P KGSA FW+N   N   D+   H+ CPV  
Sbjct: 382 NRIATVLYYMSDVEQGGGTVFTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLT 441

Query: 62  GNKW 65
           G+KW
Sbjct: 442 GSKW 445


>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
          Length = 537

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 32/70 (45%), Positives = 44/70 (62%), Gaps = 1/70 (1%)

Query: 58  PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           P   GN+    +   L++VE GGAT FP L++ V P+ G+ +FWYN H +   DYR  H+
Sbjct: 436 PAEEGNRIATAIFY-LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHA 494

Query: 118 GCPVALGNKW 127
           GCPV  G+KW
Sbjct: 495 GCPVLKGSKW 504



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L++VE GGAT FP L++ V P+ G+ +FW+N H +   DY   H+GCPV  
Sbjct: 441 NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 500

Query: 62  GNKW 65
           G+KW
Sbjct: 501 GSKW 504


>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 532

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 32/69 (46%), Positives = 47/69 (68%), Gaps = 3/69 (4%)

Query: 62  GNKWGK---LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           GN++G     L+  ++DVE GGAT+FPSL + V P+KG AVFW+N  ++   +   +H+G
Sbjct: 435 GNRFGNRVATLMIYMSDVEEGGATVFPSLGVRVSPKKGDAVFWWNIMSSWEGEMLTWHAG 494

Query: 119 CPVALGNKW 127
           CPV  G+KW
Sbjct: 495 CPVLYGSKW 503



 Score = 68.9 bits (167), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 42/64 (65%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + ++DVE GGAT+FPSL + V P+KG AVFW N  ++   +   +H+GCPV  
Sbjct: 440 NRVATLMIYMSDVEEGGATVFPSLGVRVSPKKGDAVFWWNIMSSWEGEMLTWHAGCPVLY 499

Query: 62  GNKW 65
           G+KW
Sbjct: 500 GSKW 503


>gi|194871348|ref|XP_001972831.1| GG13664 [Drosophila erecta]
 gi|190654614|gb|EDV51857.1| GG13664 [Drosophila erecta]
          Length = 520

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 29/67 (43%), Positives = 44/67 (65%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LG++   ++     +V  GGAT+FP +N+TVFP+KGS ++W+N H +   D R  HS CP
Sbjct: 436 LGDRLASIIFYA-GEVLQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSQHSACP 494

Query: 121 VALGNKW 127
           V  G++W
Sbjct: 495 VVNGDRW 501



 Score = 65.1 bits (157), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 26/53 (49%), Positives = 36/53 (67%)

Query: 13  DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +V  GGAT+FP +N+TVFP+KGS ++W N H +   D    HS CPV  G++W
Sbjct: 449 EVLQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSQHSACPVVNGDRW 501


>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
           humanus corporis]
 gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
           humanus corporis]
          Length = 534

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DV  GGAT+FP + ++++P+KG+A FWYN   N   DY   H+ CPV
Sbjct: 446 GNRIATILFY-MSDVSQGGATVFPGIKVSLWPKKGTAAFWYNLRKNGEGDYLTRHAACPV 504

Query: 122 ALGNKW 127
             G+KW
Sbjct: 505 LTGSKW 510



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP + ++++P+KG+A FW+N   N   DY   H+ CPV  
Sbjct: 447 NRIATILFYMSDVSQGGATVFPGIKVSLWPKKGTAAFWYNLRKNGEGDYLTRHAACPVLT 506

Query: 62  GNKW 65
           G+KW
Sbjct: 507 GSKW 510


>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
 gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
          Length = 537

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 32/70 (45%), Positives = 44/70 (62%), Gaps = 1/70 (1%)

Query: 58  PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           P   GN+    +   L++VE GGAT FP L++ V P+ G+ +FWYN H +   DYR  H+
Sbjct: 436 PAEEGNRIATAIFY-LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHA 494

Query: 118 GCPVALGNKW 127
           GCPV  G+KW
Sbjct: 495 GCPVLKGSKW 504



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L++VE GGAT FP L++ V P+ G+ +FW+N H +   DY   H+GCPV  
Sbjct: 441 NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 500

Query: 62  GNKW 65
           G+KW
Sbjct: 501 GSKW 504


>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
 gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
          Length = 537

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 32/70 (45%), Positives = 44/70 (62%), Gaps = 1/70 (1%)

Query: 58  PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           P   GN+    +   L++VE GGAT FP L++ V P+ G+ +FWYN H +   DYR  H+
Sbjct: 436 PAEEGNRIATAIFY-LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHA 494

Query: 118 GCPVALGNKW 127
           GCPV  G+KW
Sbjct: 495 GCPVLKGSKW 504



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L++VE GGAT FP L++ V P+ G+ +FW+N H +   DY   H+GCPV  
Sbjct: 441 NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 500

Query: 62  GNKW 65
           G+KW
Sbjct: 501 GSKW 504


>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
          Length = 537

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DVE GG T+F ++N+ ++P+KGSA FWYN   N   D++  H+ CPV
Sbjct: 442 GNRIATVLYY-MSDVEQGGGTVFTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPV 500

Query: 122 ALGNKW 127
             G+KW
Sbjct: 501 LTGSKW 506



 Score = 66.2 bits (160), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GG T+F ++N+ ++P+KGSA FW+N   N   D+   H+ CPV  
Sbjct: 443 NRIATVLYYMSDVEQGGGTVFTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLT 502

Query: 62  GNKW 65
           G+KW
Sbjct: 503 GSKW 506


>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
          Length = 415

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DVE GG T+F ++N+ ++P+KGSA FWYN   N   D++  H+ CPV
Sbjct: 320 GNRIATVLYY-MSDVEQGGGTVFTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPV 378

Query: 122 ALGNKW 127
             G+KW
Sbjct: 379 LTGSKW 384



 Score = 66.2 bits (160), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GG T+F ++N+ ++P+KGSA FW+N   N   D+   H+ CPV  
Sbjct: 321 NRIATVLYYMSDVEQGGGTVFTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLT 380

Query: 62  GNKW 65
           G+KW
Sbjct: 381 GSKW 384


>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
          Length = 535

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DVE GG T FP L L V PE+GS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 448 LSDVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502



 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DVE GG T FP L L V PE+GS +FW+N H +   D+   H+ CPV  
Sbjct: 439 NRMATGIYYLSDVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQ 498

Query: 62  GNKW 65
           G+KW
Sbjct: 499 GSKW 502


>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
          Length = 467

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 32/70 (45%), Positives = 44/70 (62%), Gaps = 1/70 (1%)

Query: 58  PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           P   GN+    +   L++VE GGAT FP L++ V P+ G+ +FWYN H +   DYR  H+
Sbjct: 366 PAEEGNRIATAIFY-LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHA 424

Query: 118 GCPVALGNKW 127
           GCPV  G+KW
Sbjct: 425 GCPVLKGSKW 434



 Score = 66.2 bits (160), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L++VE GGAT FP L++ V P+ G+ +FW+N H +   DY   H+GCPV  
Sbjct: 371 NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 430

Query: 62  GNKW 65
           G+KW
Sbjct: 431 GSKW 434


>gi|442747091|gb|JAA65705.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 533

 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 31/67 (46%), Positives = 45/67 (67%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
            G++   L++  ++DVE GGAT+FPSL + V P KG AVFW+N  ++   D   +H+GCP
Sbjct: 439 FGDRVATLMIY-MSDVEEGGATVFPSLGVRVSPRKGDAVFWWNIKSSWEGDVLTWHAGCP 497

Query: 121 VALGNKW 127
           V  G+KW
Sbjct: 498 VLYGSKW 504



 Score = 68.9 bits (167), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R     + ++DVE GGAT+FPSL + V P KG AVFW N  ++   D   +H+GCPV  
Sbjct: 441 DRVATLMIYMSDVEEGGATVFPSLGVRVSPRKGDAVFWWNIKSSWEGDVLTWHAGCPVLY 500

Query: 62  GNKW 65
           G+KW
Sbjct: 501 GSKW 504


>gi|390176894|ref|XP_002136933.2| GA26862 [Drosophila pseudoobscura pseudoobscura]
 gi|388858830|gb|EDY67491.2| GA26862 [Drosophila pseudoobscura pseudoobscura]
          Length = 520

 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DV  GGAT+F  LN+TVFP+ G+ +FWYN   +   D R  H+GCPV +G+KW
Sbjct: 438 LSDVPQGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKW 492



 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DV  GGAT+F  LN+TVFP+ G+ +FW+N   +   D    H+GCPV +G+KW
Sbjct: 438 LSDVPQGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKW 492


>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
 gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
          Length = 537

 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L++VE GGAT FP L++ V P+ G+ +FWYN H +   DYR  H+GCPV  G+KW
Sbjct: 450 LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L++VE GGAT FP L++ V P+ G+ +FW+N H +   DY   H+GCPV  
Sbjct: 441 NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 500

Query: 62  GNKW 65
           G+KW
Sbjct: 501 GSKW 504


>gi|195166675|ref|XP_002024160.1| GL22879 [Drosophila persimilis]
 gi|194107515|gb|EDW29558.1| GL22879 [Drosophila persimilis]
          Length = 484

 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 38/59 (64%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
           L +V  GGA  FP L  TV+P KGSA+ W+N + N  LDYR+ H  CPV +G+KW K  
Sbjct: 415 LNEVPHGGAISFPQLEFTVWPRKGSALVWHNLNNNLELDYRVAHISCPVIVGSKWSKFF 473



 Score = 69.7 bits (169), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 31/69 (44%), Positives = 40/69 (57%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           M+       +L +V  GGA  FP L  TV+P KGSA+ WHN + N  LDY + H  CPV 
Sbjct: 405 MDNLAKEEQQLNEVPHGGAISFPQLEFTVWPRKGSALVWHNLNNNLELDYRVAHISCPVI 464

Query: 61  LGNKWGKLL 69
           +G+KW K  
Sbjct: 465 VGSKWSKFF 473


>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
           musculus]
 gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_d [Rattus norvegicus]
          Length = 189

 Score = 70.1 bits (170), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 108 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 166



 Score = 64.3 bits (155), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 112 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 166


>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
          Length = 187

 Score = 70.1 bits (170), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 30/59 (50%), Positives = 41/59 (69%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            L+ ++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 106 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 164



 Score = 64.3 bits (155), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP L   ++P+KG+AVFW+N   +   DY   H+ CPV +G KW
Sbjct: 110 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 164


>gi|195391754|ref|XP_002054525.1| GJ24502 [Drosophila virilis]
 gi|194152611|gb|EDW68045.1| GJ24502 [Drosophila virilis]
          Length = 487

 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 31/67 (46%), Positives = 44/67 (65%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LGN+   +L   ++DVE GGAT+F SL+  ++P+KG+A FW N H +   D R  H+ CP
Sbjct: 394 LGNRIATMLFY-MSDVEQGGATVFTSLHAALWPKKGTAAFWMNLHRSGEGDVRTRHAACP 452

Query: 121 VALGNKW 127
           V  G+KW
Sbjct: 453 VLTGSKW 459



 Score = 63.5 bits (153), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+F SL+  ++P+KG+A FW N H +   D    H+ CPV  
Sbjct: 396 NRIATMLFYMSDVEQGGATVFTSLHAALWPKKGTAAFWMNLHRSGEGDVRTRHAACPVLT 455

Query: 62  GNKW 65
           G+KW
Sbjct: 456 GSKW 459


>gi|442751927|gb|JAA68123.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 522

 Score = 69.7 bits (169), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 35/87 (40%), Positives = 54/87 (62%), Gaps = 7/87 (8%)

Query: 41  NAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF 100
           +AH N+  DY++ +       G++   L++  +TDVE+GGAT+FP L + + P+KG A F
Sbjct: 414 HAH-NSTADYYLQNKE-----GDRLATLMIY-MTDVEVGGATVFPRLGVRLVPKKGDAAF 466

Query: 101 WYNAHANTLLDYRMYHSGCPVALGNKW 127
           W+N  A+   D    H+GCPV  G+KW
Sbjct: 467 WWNLKASGEGDTLTMHAGCPVLYGSKW 493



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 28/57 (49%), Positives = 38/57 (66%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + +TDVE+GGAT+FP L + + P+KG A FW N  A+   D    H+GCPV  G+KW
Sbjct: 437 IYMTDVEVGGATVFPRLGVRLVPKKGDAAFWWNLKASGEGDTLTMHAGCPVLYGSKW 493


>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
           harrisii]
          Length = 385

 Score = 69.7 bits (169), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 40/55 (72%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 308 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 362



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 40/55 (72%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +GNKW
Sbjct: 308 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 362


>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
 gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
          Length = 549

 Score = 69.7 bits (169), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 36/69 (52%), Positives = 43/69 (62%), Gaps = 2/69 (2%)

Query: 61  LGNKWGKLL--LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
            G K  +LL  L  L DV LGGAT FP L+L V P KGS + WYN H +   D+R  H+G
Sbjct: 449 FGWKGNRLLTALFYLNDVPLGGATAFPFLHLAVPPVKGSLLVWYNLHRSLHKDFRTKHAG 508

Query: 119 CPVALGNKW 127
           CPV  G+KW
Sbjct: 509 CPVLKGSKW 517



 Score = 65.1 bits (157), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L DV LGGAT FP L+L V P KGS + W+N H +   D+   H+GCPV  
Sbjct: 454 NRLLTALFYLNDVPLGGATAFPFLHLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLK 513

Query: 62  GNKW 65
           G+KW
Sbjct: 514 GSKW 517


>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
 gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
          Length = 509

 Score = 69.7 bits (169), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DV  GGAT FP L L+VFPEKG+A+ WYN     + D R  HS CP  +G++W
Sbjct: 434 LSDVPQGGATTFPDLGLSVFPEKGAALLWYNLDHKGVGDNRTAHSACPTIVGSRW 488


>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
 gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score = 69.7 bits (169), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 36/69 (52%), Positives = 43/69 (62%), Gaps = 2/69 (2%)

Query: 61  LGNKWGKLL--LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
            G K  +LL  L  L DV LGGAT FP L+L V P KGS + WYN H +   D+R  H+G
Sbjct: 449 FGWKGNRLLTALFYLNDVPLGGATAFPFLHLAVPPVKGSLLVWYNLHRSLHKDFRTKHAG 508

Query: 119 CPVALGNKW 127
           CPV  G+KW
Sbjct: 509 CPVLKGSKW 517



 Score = 65.1 bits (157), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L DV LGGAT FP L+L V P KGS + W+N H +   D+   H+GCPV  
Sbjct: 454 NRLLTALFYLNDVPLGGATAFPFLHLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLK 513

Query: 62  GNKW 65
           G+KW
Sbjct: 514 GSKW 517


>gi|194905397|ref|XP_001981189.1| GG11929 [Drosophila erecta]
 gi|190655827|gb|EDV53059.1| GG11929 [Drosophila erecta]
          Length = 538

 Score = 69.7 bits (169), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           + DV LGGAT FP L L V PEKGS + W N H++   DYR  H+GCP+  G+KW
Sbjct: 452 INDVALGGATAFPFLRLAVPPEKGSLLMWNNLHSSLHKDYRSKHAGCPILQGSKW 506



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + DV LGGAT FP L L V PEKGS + W+N H++   DY   H+GCP+  G+KW
Sbjct: 452 INDVALGGATAFPFLRLAVPPEKGSLLMWNNLHSSLHKDYRSKHAGCPILQGSKW 506


>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
          Length = 409

 Score = 69.7 bits (169), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DVE GG T FP L L V PE+GS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 322 LADVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 376



 Score = 65.5 bits (158), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L DVE GG T FP L L V PE+GS +FW+N H +   D+   H+ CPV  
Sbjct: 313 NRMATGIYYLADVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQ 372

Query: 62  GNKW 65
           G+KW
Sbjct: 373 GSKW 376


>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
 gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
 gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
          Length = 535

 Score = 69.7 bits (169), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DVE GG T FP L L V PE+GS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 448 LADVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502



 Score = 65.5 bits (158), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L DVE GG T FP L L V PE+GS +FW+N H +   D+   H+ CPV  
Sbjct: 439 NRMATGIYYLADVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQ 498

Query: 62  GNKW 65
           G+KW
Sbjct: 499 GSKW 502


>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
           caballus]
          Length = 302

 Score = 69.7 bits (169), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 40/55 (72%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 225 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 279



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 40/55 (72%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +GNKW
Sbjct: 225 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 279


>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
           [Papio anubis]
          Length = 379

 Score = 69.7 bits (169), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 40/55 (72%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 302 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 356



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 40/55 (72%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +GNKW
Sbjct: 302 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 356


>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
 gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
          Length = 473

 Score = 69.7 bits (169), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DV  GGATIFP L L+VFP+KGSA+ WYN       D R  HS CP  +G++W
Sbjct: 398 LSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACPTIVGSRW 452



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L+DV  GGATIFP L L+VFP+KGSA+ W+N       D    HS CP  +
Sbjct: 389 NRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACPTIV 448

Query: 62  GNKW 65
           G++W
Sbjct: 449 GSRW 452


>gi|195156517|ref|XP_002019146.1| GL25581 [Drosophila persimilis]
 gi|194115299|gb|EDW37342.1| GL25581 [Drosophila persimilis]
          Length = 206

 Score = 69.7 bits (169), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 31/57 (54%), Positives = 39/57 (68%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 129
           +TDV LGGATIFP +NLT+ P+KGSA+FWYN H +        H+ CP   GN+W K
Sbjct: 130 MTDVPLGGATIFPKINLTIQPKKGSALFWYNIHNDWEPHVLTRHAVCPTIEGNRWSK 186



 Score = 68.6 bits (166), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 30/64 (46%), Positives = 43/64 (67%)

Query: 4   AEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGN 63
           +++ +  +TDV LGGATIFP +NLT+ P+KGSA+FW+N H +        H+ CP   GN
Sbjct: 123 SDIKANPMTDVPLGGATIFPKINLTIQPKKGSALFWYNIHNDWEPHVLTRHAVCPTIEGN 182

Query: 64  KWGK 67
           +W K
Sbjct: 183 RWSK 186


>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
          Length = 244

 Score = 69.7 bits (169), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 40/55 (72%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 167 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 221



 Score = 67.0 bits (162), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 40/55 (72%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +GNKW
Sbjct: 167 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 221


>gi|403274090|ref|XP_003928822.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saimiri
           boliviensis boliviensis]
          Length = 149

 Score = 69.7 bits (169), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 40/55 (72%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 72  MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 126



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 40/55 (72%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DV  GGAT+FP +  +V+P+KG+AVFW+N  A+   DY   H+ CPV +GNKW
Sbjct: 72  MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 126


>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
           precursor (4-PH alpha-1)
           (Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1) [Ciona intestinalis]
          Length = 527

 Score = 69.3 bits (168), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 30/56 (53%), Positives = 36/56 (64%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L+DVE+GG T F +  ++  P KGSAVFWYN   +   D R YH  CPVA GNKW 
Sbjct: 441 LSDVEVGGRTAFVNAGVSAKPIKGSAVFWYNVFPSGEPDLRTYHGACPVAFGNKWA 496



 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 28/58 (48%), Positives = 36/58 (62%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           + L+DVE+GG T F +  ++  P KGSAVFW+N   +   D   YH  CPVA GNKW 
Sbjct: 439 IYLSDVEVGGRTAFVNAGVSAKPIKGSAVFWYNVFPSGEPDLRTYHGACPVAFGNKWA 496


>gi|195159309|ref|XP_002020524.1| GL13466 [Drosophila persimilis]
 gi|194117293|gb|EDW39336.1| GL13466 [Drosophila persimilis]
          Length = 643

 Score = 69.3 bits (168), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DV  GGAT+F  LN+TVFP+ G+ +FWYN   +   D R  H+GCPV +G+KW
Sbjct: 561 LSDVPHGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKW 615



 Score = 63.9 bits (154), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DV  GGAT+F  LN+TVFP+ G+ +FW+N   +   D    H+GCPV +G+KW
Sbjct: 561 LSDVPHGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKW 615


>gi|195452726|ref|XP_002073473.1| GK14136 [Drosophila willistoni]
 gi|194169558|gb|EDW84459.1| GK14136 [Drosophila willistoni]
          Length = 550

 Score = 69.3 bits (168), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 31/67 (46%), Positives = 43/67 (64%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LGN+   +L   ++DVE GGAT+F SL+  ++P+KG+A FW N H +   D R  H+ CP
Sbjct: 457 LGNRIATVLFY-MSDVEQGGATVFTSLHAALWPKKGTAAFWMNLHRDGEGDVRTRHAACP 515

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 516 VLTGTKW 522



 Score = 63.5 bits (153), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+F SL+  ++P+KG+A FW N H +   D    H+ CPV  
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLHAALWPKKGTAAFWMNLHRDGEGDVRTRHAACPVLT 518

Query: 62  GNKW 65
           G KW
Sbjct: 519 GTKW 522


>gi|281361323|ref|NP_652183.2| CG15864 [Drosophila melanogaster]
 gi|272476864|gb|AAF54202.3| CG15864 [Drosophila melanogaster]
          Length = 490

 Score = 69.3 bits (168), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 50/79 (63%), Gaps = 1/79 (1%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D+  Y +      G++   +L   L +V+ GGATIFP +N+TV P+KGSAVFWYN H + 
Sbjct: 384 DFHEYTNTTRPKQGDRIATVLFY-LGEVDSGGATIFPMINITVTPKKGSAVFWYNLHNSG 442

Query: 109 LLDYRMYHSGCPVALGNKW 127
            ++ +  HS CPV  G+K+
Sbjct: 443 AMNLKSLHSACPVISGSKY 461



 Score = 64.7 bits (156), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 40/55 (72%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +V+ GGATIFP +N+TV P+KGSAVFW+N H +  ++    HS CPV  G+K+
Sbjct: 407 LGEVDSGGATIFPMINITVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGSKY 461


>gi|198466397|ref|XP_002135180.1| GA23908 [Drosophila pseudoobscura pseudoobscura]
 gi|198150581|gb|EDY73807.1| GA23908 [Drosophila pseudoobscura pseudoobscura]
          Length = 403

 Score = 69.3 bits (168), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 32/69 (46%), Positives = 41/69 (59%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           M+       +L +V  GGA  FP L  TV+P KGSA+ WHN + N  LDY + H  CPV 
Sbjct: 324 MDNLAKEEQQLNEVPHGGAISFPQLEFTVWPRKGSALVWHNLNNNLELDYRVAHISCPVI 383

Query: 61  LGNKWGKLL 69
           +G+KW K L
Sbjct: 384 VGSKWSKFL 392



 Score = 69.3 bits (168), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 31/61 (50%), Positives = 39/61 (63%)

Query: 71  SGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKL 130
             L +V  GGA  FP L  TV+P KGSA+ W+N + N  LDYR+ H  CPV +G+KW K 
Sbjct: 332 QQLNEVPHGGAISFPQLEFTVWPRKGSALVWHNLNNNLELDYRVAHISCPVIVGSKWSKF 391

Query: 131 L 131
           L
Sbjct: 392 L 392


>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
           [Drosophila melanogaster]
          Length = 286

 Score = 69.3 bits (168), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 48/79 (60%), Gaps = 1/79 (1%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D++   +  P   GN+    +   L++VE GGAT FP L++ V P+ G+ +FWYN H + 
Sbjct: 176 DFFRDPNHYPAEEGNRIATAIFY-LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSL 234

Query: 109 LLDYRMYHSGCPVALGNKW 127
             DYR  H+GCPV  G+KW
Sbjct: 235 DKDYRTKHAGCPVLKGSKW 253



 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L++VE GGAT FP L++ V P+ G+ +FW+N H +   DY   H+GCPV  
Sbjct: 190 NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 249

Query: 62  GNKW 65
           G+KW
Sbjct: 250 GSKW 253


>gi|194751825|ref|XP_001958224.1| GF23629 [Drosophila ananassae]
 gi|190625506|gb|EDV41030.1| GF23629 [Drosophila ananassae]
          Length = 523

 Score = 69.3 bits (168), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 45/68 (66%), Gaps = 1/68 (1%)

Query: 60  ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
           ALG++ G ++    +DV  GGAT+FP + +++ P KGS+VFWYN + +   D R  HS C
Sbjct: 436 ALGDRLGTIIFYA-SDVLQGGATVFPDIKVSITPRKGSSVFWYNLYDDGRPDIRSRHSVC 494

Query: 120 PVALGNKW 127
           PV  G++W
Sbjct: 495 PVINGDRW 502



 Score = 58.9 bits (141), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 24/54 (44%), Positives = 36/54 (66%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +DV  GGAT+FP + +++ P KGS+VFW+N + +   D    HS CPV  G++W
Sbjct: 449 SDVLQGGATVFPDIKVSITPRKGSSVFWYNLYDDGRPDIRSRHSVCPVINGDRW 502


>gi|195159299|ref|XP_002020519.1| GL13471 [Drosophila persimilis]
 gi|194117288|gb|EDW39331.1| GL13471 [Drosophila persimilis]
          Length = 238

 Score = 68.9 bits (167), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 30/60 (50%), Positives = 41/60 (68%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLL 132
           L+DV  GGAT+F  LN+TVFP+ G+ +FWYN   +   D R  H+GCPV +G+KW +  L
Sbjct: 176 LSDVPQGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKWSRFSL 235



 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 28/60 (46%), Positives = 40/60 (66%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLL 70
           L+DV  GGAT+F  LN+TVFP+ G+ +FW+N   +   D    H+GCPV +G+KW +  L
Sbjct: 176 LSDVPQGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKWSRFSL 235


>gi|194904100|ref|XP_001981000.1| GG23922 [Drosophila erecta]
 gi|190652703|gb|EDV49958.1| GG23922 [Drosophila erecta]
          Length = 490

 Score = 68.9 bits (167), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 47/79 (59%), Gaps = 1/79 (1%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY  Y +      G++   +L   L +V  GGATIFP +N+TV P+KGSAVFWYN H + 
Sbjct: 384 DYHEYSNMTREKYGDRIATVLFY-LGEVHSGGATIFPRINITVTPKKGSAVFWYNLHNSG 442

Query: 109 LLDYRMYHSGCPVALGNKW 127
            +     HS CPV  G+K+
Sbjct: 443 AMHSETLHSACPVISGSKY 461



 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +V  GGATIFP +N+TV P+KGSAVFW+N H +  +     HS CPV  G+K+
Sbjct: 407 LGEVHSGGATIFPRINITVTPKKGSAVFWYNLHNSGAMHSETLHSACPVISGSKY 461


>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
          Length = 187

 Score = 68.9 bits (167), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 48/79 (60%), Gaps = 1/79 (1%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D++   +  P   GN+    +   L++VE GGAT FP L++ V P+ G+ +FWYN H + 
Sbjct: 77  DFFRDPNHYPAEEGNRIATAIFY-LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSL 135

Query: 109 LLDYRMYHSGCPVALGNKW 127
             DYR  H+GCPV  G+KW
Sbjct: 136 DKDYRTKHAGCPVLKGSKW 154



 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L++VE GGAT FP L++ V P+ G+ +FW+N H +   DY   H+GCPV  
Sbjct: 91  NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 150

Query: 62  GNKW 65
           G+KW
Sbjct: 151 GSKW 154


>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
 gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
          Length = 542

 Score = 68.9 bits (167), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 33/65 (50%), Positives = 40/65 (61%), Gaps = 3/65 (4%)

Query: 66  GKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 122
           G  +L+ L    DV LGGAT FP L L V P KGS + WYN H +   D+R  H+GCPV 
Sbjct: 446 GNRMLTALFYINDVALGGATAFPFLQLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVI 505

Query: 123 LGNKW 127
            G+KW
Sbjct: 506 KGSKW 510



 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   + DV LGGAT FP L L V P KGS + W+N H +   D+   H+GCPV  
Sbjct: 447 NRMLTALFYINDVALGGATAFPFLQLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVIK 506

Query: 62  GNKW 65
           G+KW
Sbjct: 507 GSKW 510


>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
           rotundata]
          Length = 550

 Score = 68.9 bits (167), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DVE GG T+F ++N++++P KGSA FW+N   N   D R  H+ CPV
Sbjct: 455 GNRIATVLYY-MSDVEQGGGTVFTAINISLWPRKGSAAFWFNLKPNGEGDLRTRHAACPV 513

Query: 122 ALGNKW 127
             G+KW
Sbjct: 514 LTGSKW 519



 Score = 63.5 bits (153), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GG T+F ++N++++P KGSA FW N   N   D    H+ CPV  
Sbjct: 456 NRIATVLYYMSDVEQGGGTVFTAINISLWPRKGSAAFWFNLKPNGEGDLRTRHAACPVLT 515

Query: 62  GNKW 65
           G+KW
Sbjct: 516 GSKW 519


>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
          Length = 415

 Score = 68.9 bits (167), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GG T+F ++N++++P KGSA FWHN   N   D+   H+ CPV  
Sbjct: 321 NRIATVLYYMSDVEQGGGTVFTAINISLWPRKGSAAFWHNLKPNGEGDFKTRHAACPVLT 380

Query: 62  GNKW 65
           G+KW
Sbjct: 381 GSKW 384



 Score = 68.6 bits (166), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 28/66 (42%), Positives = 44/66 (66%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DVE GG T+F ++N++++P KGSA FW+N   N   D++  H+ CPV
Sbjct: 320 GNRIATVLYY-MSDVEQGGGTVFTAINISLWPRKGSAAFWHNLKPNGEGDFKTRHAACPV 378

Query: 122 ALGNKW 127
             G+KW
Sbjct: 379 LTGSKW 384


>gi|195159305|ref|XP_002020522.1| GL13469 [Drosophila persimilis]
 gi|194117291|gb|EDW39334.1| GL13469 [Drosophila persimilis]
          Length = 253

 Score = 68.6 bits (166), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 30/61 (49%), Positives = 42/61 (68%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLL 132
           L+DV LGGAT+FP L+L+VFP+KG+ + WYN         +  HS CPV +G++WGK+ L
Sbjct: 159 LSDVPLGGATVFPLLDLSVFPKKGAVLMWYNLDHKGQGMEKTIHSACPVVVGSRWGKINL 218

Query: 133 S 133
            
Sbjct: 219 K 219



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 32/66 (48%), Positives = 47/66 (71%), Gaps = 2/66 (3%)

Query: 10  ELTDVELGGATIFPSLNLTVFPEKGSAVFWHN-AHANTLLDYWMYHSGCPVALGNKWGKL 68
           EL+DV LGGAT+FP L+L+VFP+KG+ + W+N  H    ++  + HS CPV +G++WGK+
Sbjct: 158 ELSDVPLGGATVFPLLDLSVFPKKGAVLMWYNLDHKGQGMEKTI-HSACPVVVGSRWGKI 216

Query: 69  LLSGLT 74
            L   T
Sbjct: 217 NLKRET 222


>gi|195452742|ref|XP_002073480.1| GK13123 [Drosophila willistoni]
 gi|194169565|gb|EDW84466.1| GK13123 [Drosophila willistoni]
          Length = 540

 Score = 68.6 bits (166), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 33/66 (50%), Positives = 42/66 (63%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    LL  L +V +GGAT FP L L V P KGS + WYN H +   D+R  H+GCPV
Sbjct: 439 GNRLLTALLY-LNEVPMGGATAFPYLKLAVPPVKGSLLVWYNLHRSLDPDFRTKHAGCPV 497

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 498 LMGSKW 503



 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   + + L +V +GGAT FP L L V P KGS + W+N H +   D+   H+GCPV +
Sbjct: 440 NRLLTALLYLNEVPMGGATAFPYLKLAVPPVKGSLLVWYNLHRSLDPDFRTKHAGCPVLM 499

Query: 62  GNKW 65
           G+KW
Sbjct: 500 GSKW 503


>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide precursor [Salmo
           salar]
 gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
          Length = 545

 Score = 68.6 bits (166), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L+  ++DV  GGAT+F  +   V+P+KGSAVFWYN   +   DY   H+ CPV
Sbjct: 458 GNRIATWLIY-MSDVPSGGATVFTDVGAAVWPKKGSAVFWYNLFPSGEGDYSTRHAACPV 516

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 517 LVGNKW 522



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + ++DV  GGAT+F  +   V+P+KGSAVFW+N   +   DY   H+ CPV +
Sbjct: 459 NRIATWLIYMSDVPSGGATVFTDVGAAVWPKKGSAVFWYNLFPSGEGDYSTRHAACPVLV 518

Query: 62  GNKW 65
           GNKW
Sbjct: 519 GNKW 522


>gi|195575111|ref|XP_002105523.1| GD16991 [Drosophila simulans]
 gi|194201450|gb|EDX15026.1| GD16991 [Drosophila simulans]
          Length = 542

 Score = 68.6 bits (166), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L +V  GG T FP +NLTVFP+ GSA+FWYN   N        H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRINLTVFPQPGSALFWYNLDTNGNDHMGSLHTGCPVIVGSKW 507



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 36/55 (65%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +V  GG T FP +NLTVFP+ GSA+FW+N   N        H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRINLTVFPQPGSALFWYNLDTNGNDHMGSLHTGCPVIVGSKW 507


>gi|195572619|ref|XP_002104293.1| GD18524 [Drosophila simulans]
 gi|194200220|gb|EDX13796.1| GD18524 [Drosophila simulans]
          Length = 472

 Score = 68.6 bits (166), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 49/79 (62%), Gaps = 1/79 (1%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D+  Y +      G++   +L   L +V+ GGATIFP +N+ V P+KGSAVFWYN H + 
Sbjct: 366 DFHEYTNTTRPKQGDRIATVLFY-LGEVDSGGATIFPKINIAVTPKKGSAVFWYNLHNSG 424

Query: 109 LLDYRMYHSGCPVALGNKW 127
            ++ +  HS CPV  G+K+
Sbjct: 425 AMNLKSLHSACPVISGSKY 443



 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 39/55 (70%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +V+ GGATIFP +N+ V P+KGSAVFW+N H +  ++    HS CPV  G+K+
Sbjct: 389 LGEVDSGGATIFPKINIAVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGSKY 443


>gi|405967005|gb|EKC32220.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 303

 Score = 68.6 bits (166), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 39/100 (39%), Positives = 52/100 (52%), Gaps = 8/100 (8%)

Query: 29  VFPEKGSAVFWHNAHA-NTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSL 87
           + P+ G    W+   A       W  H+G  +A    W    +  L+DVE GGAT+FP L
Sbjct: 176 IDPDIGDEFLWNMTEAVQGPRALWRKHTGQRIAT---W----MFYLSDVEAGGATVFPKL 228

Query: 88  NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
              V   KG+A FWYN   +  +D R  H+GCPV LG+KW
Sbjct: 229 EARVPVVKGAAAFWYNLTPSGKIDRRTQHAGCPVILGSKW 268



 Score = 62.0 bits (149), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 36/55 (65%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DVE GGAT+FP L   V   KG+A FW+N   +  +D    H+GCPV LG+KW
Sbjct: 214 LSDVEAGGATVFPKLEARVPVVKGAAAFWYNLTPSGKIDRRTQHAGCPVILGSKW 268


>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
 gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
          Length = 520

 Score = 68.2 bits (165), Expect = 9e-10,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 40/55 (72%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DVE GGAT+FP++  TV+P++G+AV WYN   +   D +  H+ CPV +G+KW
Sbjct: 446 MSDVEQGGATVFPNIYKTVYPQRGTAVMWYNLKDDGQPDEQTLHAACPVLVGSKW 500



 Score = 65.1 bits (157), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 39/55 (70%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP++  TV+P++G+AV W+N   +   D    H+ CPV +G+KW
Sbjct: 446 MSDVEQGGATVFPNIYKTVYPQRGTAVMWYNLKDDGQPDEQTLHAACPVLVGSKW 500


>gi|195330778|ref|XP_002032080.1| GM23711 [Drosophila sechellia]
 gi|194121023|gb|EDW43066.1| GM23711 [Drosophila sechellia]
          Length = 490

 Score = 68.2 bits (165), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 49/79 (62%), Gaps = 1/79 (1%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D+  Y +      G++   +L   L +V+ GGATIFP +N+ V P+KGSAVFWYN H + 
Sbjct: 384 DFHEYTNTTRPKQGDRIATVLFY-LGEVDSGGATIFPKINIAVTPKKGSAVFWYNLHNSG 442

Query: 109 LLDYRMYHSGCPVALGNKW 127
            ++ +  HS CPV  G+K+
Sbjct: 443 AMNLKSLHSACPVISGSKY 461



 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 39/55 (70%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +V+ GGATIFP +N+ V P+KGSAVFW+N H +  ++    HS CPV  G+K+
Sbjct: 407 LGEVDSGGATIFPKINIAVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGSKY 461


>gi|198477148|ref|XP_002136736.1| GA29214 [Drosophila pseudoobscura pseudoobscura]
 gi|198145041|gb|EDY71753.1| GA29214 [Drosophila pseudoobscura pseudoobscura]
          Length = 520

 Score = 68.2 bits (165), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DV  GGAT+F  LN+TVFP+ G+ +FWYN   +   D R  H+GCPV  G+KW
Sbjct: 438 LSDVPQGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIGGSKW 492



 Score = 62.8 bits (151), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DV  GGAT+F  LN+TVFP+ G+ +FW+N   +   D    H+GCPV  G+KW
Sbjct: 438 LSDVPQGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIGGSKW 492


>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
 gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
          Length = 493

 Score = 68.2 bits (165), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LTDVE GGAT+FP++   VFP++GSA+ WYN   +   + +  H+ CPV +G+KW
Sbjct: 416 LTDVEQGGATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKW 470



 Score = 65.1 bits (157), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 29/67 (43%), Positives = 44/67 (65%), Gaps = 2/67 (2%)

Query: 1   MNRAEMSSV--ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP 58
           MN   +++V   LTDVE GGAT+FP++   VFP++GSA+ W+N   +   +    H+ CP
Sbjct: 404 MNGDRIATVLFYLTDVEQGGATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACP 463

Query: 59  VALGNKW 65
           V +G+KW
Sbjct: 464 VLVGSKW 470


>gi|195055779|ref|XP_001994790.1| GH14110 [Drosophila grimshawi]
 gi|193892553|gb|EDV91419.1| GH14110 [Drosophila grimshawi]
          Length = 487

 Score = 68.2 bits (165), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 43/67 (64%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LGN+   +L   ++DV+ GGAT+F SL   ++P+KG+A FW N H +   D R  H+ CP
Sbjct: 394 LGNRIATMLFY-MSDVQQGGATVFTSLRTALWPKKGTAAFWMNLHRSGEGDARTRHAACP 452

Query: 121 VALGNKW 127
           V  G+KW
Sbjct: 453 VLTGSKW 459



 Score = 62.0 bits (149), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV+ GGAT+F SL   ++P+KG+A FW N H +   D    H+ CPV  
Sbjct: 396 NRIATMLFYMSDVQQGGATVFTSLRTALWPKKGTAAFWMNLHRSGEGDARTRHAACPVLT 455

Query: 62  GNKW 65
           G+KW
Sbjct: 456 GSKW 459


>gi|195159319|ref|XP_002020529.1| GL14044 [Drosophila persimilis]
 gi|194117298|gb|EDW39341.1| GL14044 [Drosophila persimilis]
          Length = 536

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DV  GG T FP L L V PE+GS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 449 LSDVVAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 503



 Score = 63.9 bits (154), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L+DV  GG T FP L L V PE+GS +FW+N H +   D+   H+ CPV  
Sbjct: 440 NRLATAIYYLSDVVAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQ 499

Query: 62  GNKW 65
           G+KW
Sbjct: 500 GSKW 503


>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
 gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
          Length = 545

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 31/67 (46%), Positives = 41/67 (61%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+   +L   LTDVE GG T FP L   + P+K +A FWYN HA+   D R  H  CP
Sbjct: 441 MGNRIATVLFY-LTDVEQGGGTAFPVLKQLLKPKKYAAAFWYNLHASGAGDVRTMHGACP 499

Query: 121 VALGNKW 127
           + +G+KW
Sbjct: 500 IIVGSKW 506



 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       LTDVE GG T FP L   + P+K +A FW+N HA+   D    H  CP+ +
Sbjct: 443 NRIATVLFYLTDVEQGGGTAFPVLKQLLKPKKYAAAFWYNLHASGAGDVRTMHGACPIIV 502

Query: 62  GNKW 65
           G+KW
Sbjct: 503 GSKW 506


>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
          Length = 541

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +   V P KG+AVFWYN   +   DY   H+ CPV
Sbjct: 454 GNRIATWLFY-MSDVAAGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPV 512

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 513 LVGNKW 518



 Score = 65.1 bits (157), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +   V P KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 455 NRIATWLFYMSDVAAGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLV 514

Query: 62  GNKW 65
           GNKW
Sbjct: 515 GNKW 518


>gi|198449648|ref|XP_001357666.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
 gi|198130700|gb|EAL26801.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
          Length = 536

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DV  GG T FP L L V PE+GS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 449 LSDVVAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 503



 Score = 63.9 bits (154), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L+DV  GG T FP L L V PE+GS +FW+N H +   D+   H+ CPV  
Sbjct: 440 NRLATAIYYLSDVVAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQ 499

Query: 62  GNKW 65
           G+KW
Sbjct: 500 GSKW 503


>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
          Length = 522

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+FP +   V P KG+AVFWYN   +   DY   H+ CPV
Sbjct: 435 GNRIATWLFY-MSDVAAGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPV 493

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 494 LVGNKW 499



 Score = 65.1 bits (157), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP +   V P KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 436 NRIATWLFYMSDVAAGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLV 495

Query: 62  GNKW 65
           GNKW
Sbjct: 496 GNKW 499


>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
 gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Danio rerio]
          Length = 536

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 30/66 (45%), Positives = 41/66 (62%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    L   ++DV  GGAT+F  +   V+P+KG+AVFWYN   +   DY   H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFTDVGAAVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 508 LVGNKW 513



 Score = 65.1 bits (157), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+F  +   V+P+KG+AVFW+N   +   DY   H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFTDVGAAVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509

Query: 62  GNKW 65
           GNKW
Sbjct: 510 GNKW 513


>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
 gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
          Length = 528

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LTDVE GGAT+FP++   VFP++GSA+ WYN   +   + +  H+ CPV +G+KW
Sbjct: 451 LTDVEQGGATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKW 505



 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 29/67 (43%), Positives = 44/67 (65%), Gaps = 2/67 (2%)

Query: 1   MNRAEMSSV--ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP 58
           MN   +++V   LTDVE GGAT+FP++   VFP++GSA+ W+N   +   +    H+ CP
Sbjct: 439 MNGDRIATVLFYLTDVEQGGATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACP 498

Query: 59  VALGNKW 65
           V +G+KW
Sbjct: 499 VLVGSKW 505


>gi|195110919|ref|XP_002000027.1| GI24860 [Drosophila mojavensis]
 gi|193916621|gb|EDW15488.1| GI24860 [Drosophila mojavensis]
          Length = 487

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 31/67 (46%), Positives = 43/67 (64%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LGN+   +L   ++DVE GGAT+F SL+  + P+KG+A FW N H +   D R  H+ CP
Sbjct: 394 LGNRIATVLFY-MSDVEQGGATVFTSLHAVLKPKKGTAAFWMNLHRSGEGDVRTRHAACP 452

Query: 121 VALGNKW 127
           V  G+KW
Sbjct: 453 VLTGSKW 459



 Score = 62.0 bits (149), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+F SL+  + P+KG+A FW N H +   D    H+ CPV  
Sbjct: 396 NRIATVLFYMSDVEQGGATVFTSLHAVLKPKKGTAAFWMNLHRSGEGDVRTRHAACPVLT 455

Query: 62  GNKW 65
           G+KW
Sbjct: 456 GSKW 459


>gi|198471971|ref|XP_002133305.1| GA28042 [Drosophila pseudoobscura pseudoobscura]
 gi|198139547|gb|EDY70707.1| GA28042 [Drosophila pseudoobscura pseudoobscura]
          Length = 203

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 30/55 (54%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +TDV LGGATIFP +NLT+ P+KGSA+FWYN H +        H+ CP   GN+W
Sbjct: 130 MTDVPLGGATIFPKINLTIQPKKGSALFWYNIHNDWEPHVLTRHAVCPTIEGNRW 184



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 29/62 (46%), Positives = 42/62 (67%)

Query: 4   AEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGN 63
           +++ +  +TDV LGGATIFP +NLT+ P+KGSA+FW+N H +        H+ CP   GN
Sbjct: 123 SDIKANPMTDVPLGGATIFPKINLTIQPKKGSALFWYNIHNDWEPHVLTRHAVCPTIEGN 182

Query: 64  KW 65
           +W
Sbjct: 183 RW 184


>gi|195145314|ref|XP_002013641.1| GL24244 [Drosophila persimilis]
 gi|194102584|gb|EDW24627.1| GL24244 [Drosophila persimilis]
          Length = 496

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DV LGG T FP+L L + P +GSA+ W+NA+     DYR  H+ CPV LG +W
Sbjct: 422 LSDVRLGGFTSFPALGLGIRPNRGSALVWHNANNAGNCDYRALHAACPVLLGTRW 476



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 36/55 (65%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DV LGG T FP+L L + P +GSA+ WHNA+     DY   H+ CPV LG +W
Sbjct: 422 LSDVRLGGFTSFPALGLGIRPNRGSALVWHNANNAGNCDYRALHAACPVLLGTRW 476


>gi|156352054|ref|XP_001622587.1| predicted protein [Nematostella vectensis]
 gi|156209158|gb|EDO30487.1| predicted protein [Nematostella vectensis]
          Length = 531

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           + +GN+   +L   ++DVE GGAT+F      V P KG A FWYN H N   D R  H+ 
Sbjct: 441 LGIGNRIATVLYY-MSDVEAGGATVFVDAEAIVKPSKGDAAFWYNLHKNGKGDERTRHAA 499

Query: 119 CPVALGNKW 127
           CP+ +G+KW
Sbjct: 500 CPIIVGSKW 508



 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+F      V P KG A FW+N H N   D    H+ CP+ +
Sbjct: 445 NRIATVLYYMSDVEAGGATVFVDAEAIVKPSKGDAAFWYNLHKNGKGDERTRHAACPIIV 504

Query: 62  GNKW 65
           G+KW
Sbjct: 505 GSKW 508


>gi|195341556|ref|XP_002037372.1| GM12148 [Drosophila sechellia]
 gi|194131488|gb|EDW53531.1| GM12148 [Drosophila sechellia]
          Length = 542

 Score = 67.8 bits (164), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 31/57 (54%), Positives = 38/57 (66%), Gaps = 4/57 (7%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYN--AHANTLLDYRMYHSGCPVALGNKW 127
           L +V  GG T FP LNLTVFP+ GSA+FWYN     N  +D    H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMD--SLHTGCPVIVGSKW 507



 Score = 65.5 bits (158), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 30/57 (52%), Positives = 38/57 (66%), Gaps = 4/57 (7%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHN--AHANTLLDYWMYHSGCPVALGNKW 65
           L +V  GG T FP LNLTVFP+ GSA+FW+N     N  +D    H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMD--SLHTGCPVIVGSKW 507


>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
 gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
 gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
 gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
          Length = 386

 Score = 67.8 bits (164), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 30/55 (54%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DV  GGATIFP L L+VFP+KGSA+ WYN       D R  HS CP  +G++W
Sbjct: 311 LSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACPTVVGSRW 365



 Score = 62.8 bits (151), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 29/64 (45%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   +   L+DV  GGATIFP L L+VFP+KGSA+ W+N       D    HS CP  +
Sbjct: 302 DRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACPTVV 361

Query: 62  GNKW 65
           G++W
Sbjct: 362 GSRW 365


>gi|241598357|ref|XP_002404731.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215500462|gb|EEC09956.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 218

 Score = 67.8 bits (164), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/83 (40%), Positives = 51/83 (61%), Gaps = 6/83 (7%)

Query: 45  NTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 104
           N+  DY++ H       G++   L++  +TDVE GGAT+FPSL + + P+KG A FW+N 
Sbjct: 113 NSSADYYLQH-----GRGDRIATLMIY-MTDVEAGGATVFPSLGIRLTPKKGDAAFWWNL 166

Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
            A+   +    H+GCPV  G+KW
Sbjct: 167 KASGEGERLTMHAGCPVLYGSKW 189



 Score = 62.4 bits (150), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 28/57 (49%), Positives = 38/57 (66%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + +TDVE GGAT+FPSL + + P+KG A FW N  A+   +    H+GCPV  G+KW
Sbjct: 133 IYMTDVEAGGATVFPSLGIRLTPKKGDAAFWWNLKASGEGERLTMHAGCPVLYGSKW 189


>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
 gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
          Length = 529

 Score = 67.4 bits (163), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LTDVE GGAT+FP++   VFP+ G+AV WYN   +   D +  H+ CPV +G+KW
Sbjct: 451 LTDVEQGGATVFPNIEKAVFPKSGTAVVWYNLRHDGNGDPQTLHAACPVIVGSKW 505



 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+FP++   VFP+ G+AV W+N   +   D    H+ CPV +G+KW
Sbjct: 451 LTDVEQGGATVFPNIEKAVFPKSGTAVVWYNLRHDGNGDPQTLHAACPVIVGSKW 505


>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
          Length = 535

 Score = 67.4 bits (163), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 30/66 (45%), Positives = 42/66 (63%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    LL  +TDV+ GGAT+FP +++ V P+KG+AVFWYN   +        H+ CPV
Sbjct: 448 GNRIATFLLY-MTDVDAGGATVFPIIDVRVLPKKGTAVFWYNLRRSGKGIMETRHAACPV 506

Query: 122 ALGNKW 127
            +G KW
Sbjct: 507 LVGTKW 512



 Score = 62.8 bits (151), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 40/64 (62%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + +TDV+ GGAT+FP +++ V P+KG+AVFW+N   +        H+ CPV +
Sbjct: 449 NRIATFLLYMTDVDAGGATVFPIIDVRVLPKKGTAVFWYNLRRSGKGIMETRHAACPVLV 508

Query: 62  GNKW 65
           G KW
Sbjct: 509 GTKW 512


>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
 gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
          Length = 487

 Score = 67.4 bits (163), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DVE GGAT+FP +   VFP++GSA+ WYN   +   +    H+GCPV +G+KW
Sbjct: 409 LNDVEQGGATVFPEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAGCPVIVGSKW 463



 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GGAT+FP +   VFP++GSA+ W+N   +   +    H+GCPV +G+KW
Sbjct: 409 LNDVEQGGATVFPEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAGCPVIVGSKW 463


>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
 gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
          Length = 581

 Score = 67.4 bits (163), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 43/68 (63%), Gaps = 1/68 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+   +L+  L++VE GGAT+FP   + V P +GSA FWYN H N   +    H+ CP
Sbjct: 466 IGNRIATVLIY-LSNVEAGGATVFPKAGVRVEPRQGSAAFWYNMHRNGEGNKLSVHAACP 524

Query: 121 VALGNKWG 128
           V +G+KW 
Sbjct: 525 VLIGSKWA 532



 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 33/91 (36%), Positives = 49/91 (53%), Gaps = 9/91 (9%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L++VE GGAT+FP   + V P +GSA FW+N H N   +    H+ CPV +
Sbjct: 468 NRIATVLIYLSNVEAGGATVFPKAGVRVEPRQGSAAFWYNMHRNGEGNKLSVHAACPVLI 527

Query: 62  GNKW---------GKLLLSGLTDVELGGATI 83
           G+KW         G+ L   LT++ +  A +
Sbjct: 528 GSKWAANLWFREVGQELRHTLTEINIASAIL 558


>gi|198449641|ref|XP_002136935.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
 gi|198130697|gb|EDY67493.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
          Length = 508

 Score = 67.0 bits (162), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 31/75 (41%), Positives = 46/75 (61%), Gaps = 1/75 (1%)

Query: 53  YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
           Y+ G    L ++   LL   L+DV LGGAT+FP L+L+VFP++G+ + WYN         
Sbjct: 415 YYEGSNPELPDRVATLLFY-LSDVPLGGATVFPLLDLSVFPKRGAVLMWYNLDHKGQGIE 473

Query: 113 RMYHSGCPVALGNKW 127
           +  HS CPV +G++W
Sbjct: 474 KTVHSACPVVVGSRW 488



 Score = 60.8 bits (146), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DV LGGAT+FP L+L+VFP++G+ + W+N            HS CPV +G++W
Sbjct: 434 LSDVPLGGATVFPLLDLSVFPKRGAVLMWYNLDHKGQGIEKTVHSACPVVVGSRW 488


>gi|211938649|gb|ACJ13221.1| FI08532p [Drosophila melanogaster]
          Length = 543

 Score = 67.0 bits (162), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 35/55 (63%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L +V  GG T FP LNLTVFP+ GSA+FWYN            H+GCPV +G+KW
Sbjct: 454 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 508



 Score = 65.1 bits (157), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +V  GG T FP LNLTVFP+ GSA+FW+N            H+GCPV +G+KW
Sbjct: 454 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 508


>gi|24651430|ref|NP_733378.1| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
 gi|23172699|gb|AAF57061.2| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
          Length = 542

 Score = 67.0 bits (162), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 35/55 (63%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L +V  GG T FP LNLTVFP+ GSA+FWYN            H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 507



 Score = 65.1 bits (157), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +V  GG T FP LNLTVFP+ GSA+FW+N            H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 507


>gi|20269814|gb|AAM18062.1|AF495540_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE2
           [Drosophila melanogaster]
 gi|19528175|gb|AAL90202.1| AT27756p [Drosophila melanogaster]
          Length = 542

 Score = 67.0 bits (162), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 35/55 (63%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L +V  GG T FP LNLTVFP+ GSA+FWYN            H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 507



 Score = 65.1 bits (157), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +V  GG T FP LNLTVFP+ GSA+FW+N            H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 507


>gi|195159323|ref|XP_002020531.1| GL13463 [Drosophila persimilis]
 gi|194117300|gb|EDW39343.1| GL13463 [Drosophila persimilis]
          Length = 487

 Score = 67.0 bits (162), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 29/67 (43%), Positives = 42/67 (62%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LGN+   +L   ++DVE GGAT+F +L   ++P++G+A FW N H +   D R  H+ CP
Sbjct: 394 LGNRIATVLFY-MSDVEQGGATVFTTLRTALWPKRGTAAFWMNLHRDGEGDKRTQHAACP 452

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 453 VLTGTKW 459



 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+F +L   ++P++G+A FW N H +   D    H+ CPV  
Sbjct: 396 NRIATVLFYMSDVEQGGATVFTTLRTALWPKRGTAAFWMNLHRDGEGDKRTQHAACPVLT 455

Query: 62  GNKW 65
           G KW
Sbjct: 456 GTKW 459


>gi|125772807|ref|XP_001357662.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
 gi|54637394|gb|EAL26796.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score = 67.0 bits (162), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 29/67 (43%), Positives = 42/67 (62%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LGN+   +L   ++DVE GGAT+F +L   ++P++G+A FW N H +   D R  H+ CP
Sbjct: 456 LGNRIATVLFY-MSDVEQGGATVFTTLRTALWPKRGTAAFWMNLHRDGEGDKRTQHAACP 514

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 515 VLTGTKW 521



 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+F +L   ++P++G+A FW N H +   D    H+ CPV  
Sbjct: 458 NRIATVLFYMSDVEQGGATVFTTLRTALWPKRGTAAFWMNLHRDGEGDKRTQHAACPVLT 517

Query: 62  GNKW 65
           G KW
Sbjct: 518 GTKW 521


>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
 gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
          Length = 547

 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 32/61 (52%), Positives = 39/61 (63%), Gaps = 12/61 (19%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNA------HANTLLDYRMYHSGCPVALGNK 126
           L +V  GG T FP LNLTVFP+ GSA+FWYN       H +TL      H+GCPV +G+K
Sbjct: 458 LNEVRQGGGTHFPRLNLTVFPQPGSALFWYNLDTRGNDHTSTL------HTGCPVIVGSK 511

Query: 127 W 127
           W
Sbjct: 512 W 512



 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 31/61 (50%), Positives = 39/61 (63%), Gaps = 12/61 (19%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNA------HANTLLDYWMYHSGCPVALGNK 64
           L +V  GG T FP LNLTVFP+ GSA+FW+N       H +TL      H+GCPV +G+K
Sbjct: 458 LNEVRQGGGTHFPRLNLTVFPQPGSALFWYNLDTRGNDHTSTL------HTGCPVIVGSK 511

Query: 65  W 65
           W
Sbjct: 512 W 512


>gi|198477150|ref|XP_002136737.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
 gi|198145042|gb|EDY71754.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
          Length = 508

 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 29/59 (49%), Positives = 40/59 (67%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LL  L+DV LGGAT+FP L+L+VFP+KG+ + WYN         +  HS CPV +G++W
Sbjct: 430 LLFYLSDVPLGGATVFPLLDLSVFPKKGAVLMWYNLDHKGQGMEKTIHSACPVVVGSRW 488



 Score = 62.0 bits (149), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 27/56 (48%), Positives = 41/56 (73%), Gaps = 2/56 (3%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNA-HANTLLDYWMYHSGCPVALGNKW 65
           L+DV LGGAT+FP L+L+VFP+KG+ + W+N  H    ++  + HS CPV +G++W
Sbjct: 434 LSDVPLGGATVFPLLDLSVFPKKGAVLMWYNLDHKGQGMEKTI-HSACPVVVGSRW 488


>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
 gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
          Length = 537

 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 42/67 (62%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+   +L   L+DV  GG T FP L   + P+K +A FWYN HA+   D+R  H GCP
Sbjct: 444 MGNRIATVLFY-LSDVAQGGGTAFPILKQLLKPKKYAAAFWYNLHASGKGDWRNLHGGCP 502

Query: 121 VALGNKW 127
           + +G+KW
Sbjct: 503 IIVGSKW 509



 Score = 60.1 bits (144), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DV  GG T FP L   + P+K +A FW+N HA+   D+   H GCP+ +
Sbjct: 446 NRIATVLFYLSDVAQGGGTAFPILKQLLKPKKYAAAFWYNLHASGKGDWRNLHGGCPIIV 505

Query: 62  GNKW 65
           G+KW
Sbjct: 506 GSKW 509


>gi|194905381|ref|XP_001981186.1| GG11928 [Drosophila erecta]
 gi|190655824|gb|EDV53056.1| GG11928 [Drosophila erecta]
          Length = 543

 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 35/55 (63%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L  V  GGAT FP LNLTVFP+ GSA+FWYN            H+GCPV +G+KW
Sbjct: 454 LNGVRQGGATHFPLLNLTVFPQPGSALFWYNLDTKGNDQRSTMHTGCPVIVGSKW 508



 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 30/64 (46%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   +   L  V  GGAT FP LNLTVFP+ GSA+FW+N            H+GCPV +
Sbjct: 445 DRIATTVFYLNGVRQGGATHFPLLNLTVFPQPGSALFWYNLDTKGNDQRSTMHTGCPVIV 504

Query: 62  GNKW 65
           G+KW
Sbjct: 505 GSKW 508


>gi|67084101|gb|AAY66985.1| truncated prolyl 4-hydroxylase alpha subunit [Ixodes scapularis]
          Length = 452

 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 34/87 (39%), Positives = 54/87 (62%), Gaps = 7/87 (8%)

Query: 41  NAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF 100
           +AH N+  DY++ ++      G++   L++  +TDV+ GGAT+FP L + + P+KG A F
Sbjct: 344 HAH-NSTADYYLQNNE-----GDRLATLMIY-MTDVKEGGATVFPRLGVRLVPKKGDAAF 396

Query: 101 WYNAHANTLLDYRMYHSGCPVALGNKW 127
           W+N  A+   D    H+GCPV  G+KW
Sbjct: 397 WWNLKASGEGDTLTMHAGCPVLYGSKW 423



 Score = 62.8 bits (151), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 27/57 (47%), Positives = 37/57 (64%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + +TDV+ GGAT+FP L + + P+KG A FW N  A+   D    H+GCPV  G+KW
Sbjct: 367 IYMTDVKEGGATVFPRLGVRLVPKKGDAAFWWNLKASGEGDTLTMHAGCPVLYGSKW 423


>gi|195452744|ref|XP_002073481.1| GK14140 [Drosophila willistoni]
 gi|194169566|gb|EDW84467.1| GK14140 [Drosophila willistoni]
          Length = 454

 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 38/55 (69%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GGAT+F +LNLTVFP+ G+A+FW+N       D    H+GCPV +G+KW
Sbjct: 380 LQDVPQGGATLFNNLNLTVFPKAGAALFWFNLDNAGDTDLFTVHTGCPVIVGSKW 434



 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DV  GGAT+F +LNLTVFP+ G+A+FW N       D +  H+GCPV +G+KW
Sbjct: 380 LQDVPQGGATLFNNLNLTVFPKAGAALFWFNLDNAGDTDLFTVHTGCPVIVGSKW 434


>gi|195109817|ref|XP_001999478.1| GI23043 [Drosophila mojavensis]
 gi|193916072|gb|EDW14939.1| GI23043 [Drosophila mojavensis]
          Length = 491

 Score = 66.6 bits (161), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 30/65 (46%), Positives = 41/65 (63%), Gaps = 1/65 (1%)

Query: 63  NKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 122
           N+   +LL  L +V++GG T FP+L L V P KGSA+ W+N +     DYR  H+ CP  
Sbjct: 405 NRIATILLY-LNEVQMGGITSFPALGLGVQPIKGSALIWHNMNHEIECDYRTLHAACPTL 463

Query: 123 LGNKW 127
           LGN+W
Sbjct: 464 LGNRW 468



 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L +V++GG T FP+L L V P KGSA+ WHN +     DY   H+ CP  L
Sbjct: 405 NRIATILLYLNEVQMGGITSFPALGLGVQPIKGSALIWHNMNHEIECDYRTLHAACPTLL 464

Query: 62  GNKW 65
           GN+W
Sbjct: 465 GNRW 468


>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
 gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
          Length = 520

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 28/59 (47%), Positives = 38/59 (64%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +L  ++DVE GGAT+FP +   V PEKG+  FWYN   +   D    H+GCPV +G+KW
Sbjct: 439 MLFYMSDVEAGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKW 497



 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 36/55 (65%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP +   V PEKG+  FW+N   +   D    H+GCPV +G+KW
Sbjct: 443 MSDVEAGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKW 497


>gi|195379218|ref|XP_002048377.1| GJ13934 [Drosophila virilis]
 gi|194155535|gb|EDW70719.1| GJ13934 [Drosophila virilis]
          Length = 469

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 43/67 (64%), Gaps = 1/67 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   ++    +DV+LGGATIFP L L V P++GSA+ WYN +A    D    H+ CPV
Sbjct: 386 GNRIATVIFYA-SDVQLGGATIFPRLQLVVKPKRGSALLWYNLNAAGKPDPLTRHAVCPV 444

Query: 122 ALGNKWG 128
            +G++W 
Sbjct: 445 VVGSRWA 451



 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           +DV+LGGATIFP L L V P++GSA+ W+N +A    D    H+ CPV +G++W 
Sbjct: 397 SDVQLGGATIFPRLQLVVKPKRGSALLWYNLNAAGKPDPLTRHAVCPVVVGSRWA 451


>gi|195591300|ref|XP_002085380.1| GD14756 [Drosophila simulans]
 gi|194197389|gb|EDX10965.1| GD14756 [Drosophila simulans]
          Length = 477

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 31/61 (50%), Positives = 39/61 (63%), Gaps = 5/61 (8%)

Query: 68  LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           + +S L +VELGGAT+FP L ++VFP+KGS + WYN       D R     CPV  GNKW
Sbjct: 421 IYISLLNNVELGGATVFPKLKISVFPQKGSCLIWYNTP-----DPRSEPLECPVLQGNKW 475

Query: 128 G 128
           G
Sbjct: 476 G 476



 Score = 59.7 bits (143), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 28/56 (50%), Positives = 35/56 (62%), Gaps = 5/56 (8%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L +VELGGAT+FP L ++VFP+KGS + W+N       D       CPV  GNKWG
Sbjct: 426 LNNVELGGATVFPKLKISVFPQKGSCLIWYNTP-----DPRSEPLECPVLQGNKWG 476


>gi|449668268|ref|XP_002154169.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 531

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 40/67 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+DVE GG T+F  L L++ P+ G A FWHN + N    Y   H+ CPV  
Sbjct: 456 NRIATLLIYLSDVERGGDTVFTRLGLSLKPKLGDAAFWHNLYKNGSGIYATEHASCPVVS 515

Query: 62  GNKWGKL 68
           G+KWGK+
Sbjct: 516 GSKWGKI 522



 Score = 65.5 bits (158), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 31/69 (44%), Positives = 43/69 (62%), Gaps = 1/69 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   LL+  L+DVE GG T+F  L L++ P+ G A FW+N + N    Y   H+ CPV
Sbjct: 455 GNRIATLLIY-LSDVERGGDTVFTRLGLSLKPKLGDAAFWHNLYKNGSGIYATEHASCPV 513

Query: 122 ALGNKWGKL 130
             G+KWGK+
Sbjct: 514 VSGSKWGKI 522


>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
 gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
          Length = 519

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 28/59 (47%), Positives = 38/59 (64%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +L  ++DVE GGAT+FP +   V PEKG+  FWYN   +   D    H+GCPV +G+KW
Sbjct: 438 MLFYMSDVEAGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKW 496



 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 36/55 (65%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP +   V PEKG+  FW+N   +   D    H+GCPV +G+KW
Sbjct: 442 MSDVEAGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKW 496


>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
 gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
          Length = 460

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 31/79 (39%), Positives = 48/79 (60%), Gaps = 1/79 (1%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D+++Y        G +   +L   L DVEL G+T+FP LN+++ P+KGSAV WYN H + 
Sbjct: 358 DFYVYSEPLRFLRGERIVTVLFY-LGDVELSGSTVFPFLNISITPKKGSAVMWYNLHNSG 416

Query: 109 LLDYRMYHSGCPVALGNKW 127
            +  +  H  CPV +G+K+
Sbjct: 417 DVHQKTQHCACPVVVGSKY 435



 Score = 62.0 bits (149), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVEL G+T+FP LN+++ P+KGSAV W+N H +  +     H  CPV +G+K+
Sbjct: 381 LGDVELSGSTVFPFLNISITPKKGSAVMWYNLHNSGDVHQKTQHCACPVVVGSKY 435


>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
          Length = 150

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 30/55 (54%), Positives = 34/55 (61%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DVE GG T+FP LNL + P K SAVFW+N   +   D R  H  CPV LG KW
Sbjct: 76  LNDVEAGGRTVFPRLNLVIEPIKNSAVFWHNLLDDGQQDDRTIHGACPVVLGRKW 130



 Score = 65.5 bits (158), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 34/57 (59%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + L DVE GG T+FP LNL + P K SAVFWHN   +   D    H  CPV LG KW
Sbjct: 74  IYLNDVEAGGRTVFPRLNLVIEPIKNSAVFWHNLLDDGQQDDRTIHGACPVVLGRKW 130


>gi|195166679|ref|XP_002024162.1| GL22881 [Drosophila persimilis]
 gi|194107517|gb|EDW29560.1| GL22881 [Drosophila persimilis]
          Length = 288

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 30/60 (50%), Positives = 40/60 (66%)

Query: 75  DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLLSG 134
           D++ GGATIFP +N+TV P KGSA+ W+N + +   D R  HS CPV  G++W K   SG
Sbjct: 218 DLQQGGATIFPLINITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGDRWMKQNGSG 277



 Score = 62.0 bits (149), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 30/63 (47%), Positives = 39/63 (61%)

Query: 13  DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSG 72
           D++ GGATIFP +N+TV P KGSA+ W N + +   D    HS CPV  G++W K   SG
Sbjct: 218 DLQQGGATIFPLINITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGDRWMKQNGSG 277

Query: 73  LTD 75
             D
Sbjct: 278 SLD 280


>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
 gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
          Length = 533

 Score = 66.2 bits (160), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 31/75 (41%), Positives = 48/75 (64%), Gaps = 4/75 (5%)

Query: 53  YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
           +HS   + LG++   +L   LTDVE GGAT+F ++  +V+P+ G+A+FWYN   +   D 
Sbjct: 435 HHS---IVLGDRIATVLFY-LTDVEQGGATVFGNVGYSVYPQAGTAIFWYNLDTDGNGDP 490

Query: 113 RMYHSGCPVALGNKW 127
              H+ CPV +G+KW
Sbjct: 491 LTRHASCPVVVGSKW 505



 Score = 62.4 bits (150), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+F ++  +V+P+ G+A+FW+N   +   D    H+ CPV +G+KW
Sbjct: 451 LTDVEQGGATVFGNVGYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPVVVGSKW 505


>gi|268562483|ref|XP_002638619.1| Hypothetical protein CBG05671 [Caenorhabditis briggsae]
          Length = 520

 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 6/111 (5%)

Query: 21  IFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHS-----GCPVALGNKWGKLLLSGLTD 75
           + P+LN+    +  +  +    H     D+  Y S     G     GN++G L+++  T 
Sbjct: 377 LIPTLNIESSEDIVALSYIRGGHYAAHHDFLEYPSEKEWDGWMKDYGNRFGTLIMAFET- 435

Query: 76  VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
            ELGGATIFPSLN  + P  G A FW+NA  NT  +    H GCP+  G K
Sbjct: 436 AELGGATIFPSLNAAIRPNTGDAFFWFNAMGNTKQEDLSDHGGCPIYEGKK 486



 Score = 57.8 bits (138), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 27/50 (54%), Positives = 30/50 (60%)

Query: 15  ELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           ELGGATIFPSLN  + P  G A FW NA  NT  +    H GCP+  G K
Sbjct: 437 ELGGATIFPSLNAAIRPNTGDAFFWFNAMGNTKQEDLSDHGGCPIYEGKK 486



 Score = 45.8 bits (107), Expect = 0.005,   Method: Composition-based stats.
 Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L+  L   + GG T+FP +N+ + P+ G  V W N   +   D +  H+ CP+
Sbjct: 170 GNRIATVLVI-LQIAKKGGTTVFPKININIRPKIGDVVVWLNTVPDGESDSQTLHAACPI 228

Query: 122 ALGNKWGKLL 131
             G K G  L
Sbjct: 229 KEGTKIGATL 238



 Score = 45.1 bits (105), Expect = 0.009,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 32/68 (47%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L   + GG T+FP +N+ + P+ G  V W N   +   D    H+ CP+  
Sbjct: 171 NRIATVLVILQIAKKGGTTVFPKININIRPKIGDVVVWLNTVPDGESDSQTLHAACPIKE 230

Query: 62  GNKWGKLL 69
           G K G  L
Sbjct: 231 GTKIGATL 238


>gi|195440206|ref|XP_002067933.1| GK11220 [Drosophila willistoni]
 gi|194164018|gb|EDW78919.1| GK11220 [Drosophila willistoni]
          Length = 459

 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 25/71 (35%), Positives = 46/71 (64%), Gaps = 1/71 (1%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           + LG++   ++    ++V LGGAT+FP L++ + P+KG+ + WYN + ++  D R  H+ 
Sbjct: 371 LTLGDRLASVIFYA-SEVHLGGATVFPRLDVAITPKKGAGLVWYNTYDDSTHDQRSQHAV 429

Query: 119 CPVALGNKWGK 129
           CP  +G++W K
Sbjct: 430 CPTLMGSRWSK 440



 Score = 59.7 bits (143), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 38/56 (67%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGK 67
           ++V LGGAT+FP L++ + P+KG+ + W+N + ++  D    H+ CP  +G++W K
Sbjct: 385 SEVHLGGATVFPRLDVAITPKKGAGLVWYNTYDDSTHDQRSQHAVCPTLMGSRWSK 440


>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
 gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
          Length = 534

 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 30/66 (45%), Positives = 42/66 (63%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L+  L+ VELGGAT+F S  + + P +GSA FWYN H N   +    H+ CPV
Sbjct: 447 GNRIATMLIY-LSSVELGGATVFSSAGVRIEPRQGSAAFWYNLHRNGNGNNLTRHAACPV 505

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 506 LIGSKW 511



 Score = 63.9 bits (154), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VELGGAT+F S  + + P +GSA FW+N H N   +    H+ CPV +
Sbjct: 448 NRIATMLIYLSSVELGGATVFSSAGVRIEPRQGSAAFWYNLHRNGNGNNLTRHAACPVLI 507

Query: 62  GNKW 65
           G+KW
Sbjct: 508 GSKW 511


>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
 gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
          Length = 485

 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 34/55 (61%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DVE GG T+FP +     PE+G A+ WYN H N   D R  H  CPV +G+KW
Sbjct: 408 LSDVEQGGDTVFPRIEQAFKPERGKALLWYNLHRNGTGDKRTEHGACPVLVGSKW 462



 Score = 63.2 bits (152), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 26/59 (44%), Positives = 36/59 (61%)

Query: 7   SSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++ EL+DVE GG T+FP +     PE+G A+ W+N H N   D    H  CPV +G+KW
Sbjct: 404 NTAELSDVEQGGDTVFPRIEQAFKPERGKALLWYNLHRNGTGDKRTEHGACPVLVGSKW 462


>gi|194751827|ref|XP_001958225.1| GF23630 [Drosophila ananassae]
 gi|190625507|gb|EDV41031.1| GF23630 [Drosophila ananassae]
          Length = 431

 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 33/69 (47%), Positives = 42/69 (60%), Gaps = 1/69 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G   G +L     DV+ GGAT+FP L ++VFP+KGS + WYN   +  LD R  HS CP
Sbjct: 360 IGGPVGAILFFISDDVQ-GGATVFPKLKVSVFPKKGSCLVWYNIKDDGRLDPRTTHSICP 418

Query: 121 VALGNKWGK 129
           V  GN  GK
Sbjct: 419 VLEGNSLGK 427



 Score = 58.9 bits (141), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 27/57 (47%), Positives = 36/57 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGK 67
           ++D   GGAT+FP L ++VFP+KGS + W+N   +  LD    HS CPV  GN  GK
Sbjct: 371 ISDDVQGGATVFPKLKVSVFPKKGSCLVWYNIKDDGRLDPRTTHSICPVLEGNSLGK 427


>gi|195352180|ref|XP_002042592.1| GM14979 [Drosophila sechellia]
 gi|194124476|gb|EDW46519.1| GM14979 [Drosophila sechellia]
          Length = 461

 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 32/60 (53%), Positives = 38/60 (63%), Gaps = 5/60 (8%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           LL  L +VELGGAT+FP L ++VFP+KGS + WYN       D R     CPV  GNKWG
Sbjct: 406 LLFFLNNVELGGATVFPKLKISVFPQKGSCLIWYNTP-----DPRSDPLECPVLQGNKWG 460



 Score = 59.7 bits (143), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 28/59 (47%), Positives = 37/59 (62%), Gaps = 11/59 (18%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNA---HANTLLDYWMYHSGCPVALGNKWG 66
           L +VELGGAT+FP L ++VFP+KGS + W+N     ++ L         CPV  GNKWG
Sbjct: 410 LNNVELGGATVFPKLKISVFPQKGSCLIWYNTPDPRSDPL--------ECPVLQGNKWG 460


>gi|442762205|gb|JAA73261.1| Putative prolyl 4-hydroxylase alpha subunit, partial [Ixodes
           ricinus]
          Length = 482

 Score = 65.9 bits (159), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 29/67 (43%), Positives = 44/67 (65%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G++   L++  ++DVE GGAT+FP L + + P+KG A FW+N  AN   +    H+GCP
Sbjct: 388 VGDRVATLMIY-MSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCP 446

Query: 121 VALGNKW 127
           V  G+KW
Sbjct: 447 VLYGSKW 453



 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R     + ++DVE GGAT+FP L + + P+KG A FW N  AN   +    H+GCPV  
Sbjct: 390 DRVATLMIYMSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCPVLY 449

Query: 62  GNKW 65
           G+KW
Sbjct: 450 GSKW 453


>gi|198418585|ref|XP_002122034.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1 (4-PH
           alpha-1)
           (Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1) [Ciona intestinalis]
          Length = 525

 Score = 65.9 bits (159), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 41/68 (60%), Gaps = 1/68 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+   +LL  L DV+ GG T F    +   P KGSAVFWYN + + L D R  H+ CP
Sbjct: 432 VGNRIATMLLY-LNDVKEGGRTAFIEPKIVAKPIKGSAVFWYNLYPSGLGDPRTRHASCP 490

Query: 121 VALGNKWG 128
           V +GNKW 
Sbjct: 491 VVIGNKWA 498



 Score = 58.2 bits (139), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 28/65 (43%), Positives = 36/65 (55%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DV+ GG T F    +   P KGSAVFW+N + + L D    H+ CPV +
Sbjct: 434 NRIATMLLYLNDVKEGGRTAFIEPKIVAKPIKGSAVFWYNLYPSGLGDPRTRHASCPVVI 493

Query: 62  GNKWG 66
           GNKW 
Sbjct: 494 GNKWA 498


>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
 gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
          Length = 502

 Score = 65.9 bits (159), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 28/56 (50%), Positives = 38/56 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           +TDV+ GGAT+FP+L +T FP+KGSA+ + N   N   D    H+GCPV  G+KW 
Sbjct: 408 MTDVQQGGATVFPALRITNFPKKGSALIFRNLDNNISPDPSTLHAGCPVLFGSKWA 463



 Score = 65.5 bits (158), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 28/58 (48%), Positives = 39/58 (67%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           + +TDV+ GGAT+FP+L +T FP+KGSA+ + N   N   D    H+GCPV  G+KW 
Sbjct: 406 IYMTDVQQGGATVFPALRITNFPKKGSALIFRNLDNNISPDPSTLHAGCPVLFGSKWA 463


>gi|195390805|ref|XP_002054058.1| GJ23004 [Drosophila virilis]
 gi|194152144|gb|EDW67578.1| GJ23004 [Drosophila virilis]
          Length = 446

 Score = 65.5 bits (158), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 30/55 (54%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DVELGGAT FP LNLT+ PEKG+AV WHN   +        H+ CPV +G+K+
Sbjct: 368 LSDVELGGATTFPLLNLTISPEKGTAVLWHNLKDSGTPHPKTVHAACPVIVGSKY 422



 Score = 65.5 bits (158), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DVELGGAT FP LNLT+ PEKG+AV W+N   +     +  H+ CPV +G+K+
Sbjct: 368 LSDVELGGATTFPLLNLTISPEKGTAVLWHNLKDSGTPHPKTVHAACPVIVGSKY 422


>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
 gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
          Length = 211

 Score = 65.5 bits (158), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 45/142 (31%), Positives = 68/142 (47%), Gaps = 14/142 (9%)

Query: 1   MNRAEM-SSVELTDVELGGATIFPSLNLTVFPEKGSA----VFWHNAHANTLLDY----- 50
           MNR+++ S  E++D+    +T  P  +LT   EK  A    V   +     +L+Y     
Sbjct: 59  MNRSKIGSQHEVSDIRTSSSTFLPEDDLTNRIEKRVAQIMNVPVEHGEGLHILNYKQGQE 118

Query: 51  ----WMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 106
               + Y      A  N     L+  L DVE GG T FP +NL++ P KG AV++   ++
Sbjct: 119 YKAHYDYFRSKAKAANNPRISTLVLYLNDVEEGGETYFPHMNLSISPHKGMAVYFEYFYS 178

Query: 107 NTLLDYRMYHSGCPVALGNKWG 128
           + L++ R  H G PV  G KW 
Sbjct: 179 DPLINERTLHGGSPVTSGEKWA 200


>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
          Length = 549

 Score = 65.5 bits (158), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DVE GGAT+FP+LN+ ++P+KGSA FWYN   N   +    H+ CPV  G+KW
Sbjct: 470 MSDVEAGGATVFPALNVALWPQKGSAAFWYNLFPNGEGNELTRHAACPVLTGSKW 524



 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 39/55 (70%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DVE GGAT+FP+LN+ ++P+KGSA FW+N   N   +    H+ CPV  G+KW
Sbjct: 470 MSDVEAGGATVFPALNVALWPQKGSAAFWYNLFPNGEGNELTRHAACPVLTGSKW 524


>gi|443721482|gb|ELU10773.1| hypothetical protein CAPTEDRAFT_174752 [Capitella teleta]
          Length = 525

 Score = 65.5 bits (158), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 41/77 (53%), Gaps = 4/77 (5%)

Query: 54  HSGCPV---ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLL 110
           HS  P    A GN+    +   L DV  GGAT+FP L + + P K  A FWYN   N  +
Sbjct: 428 HSSLPEHVRASGNRLATFMFY-LNDVHAGGATVFPKLKVGIPPTKNGAAFWYNIGLNGDV 486

Query: 111 DYRMYHSGCPVALGNKW 127
           D    H+GCPV LG KW
Sbjct: 487 DPLTEHAGCPVLLGQKW 503



 Score = 62.4 bits (150), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L DV  GGAT+FP L + + P K  A FW+N   N  +D    H+GCPV L
Sbjct: 440 NRLATFMFYLNDVHAGGATVFPKLKVGIPPTKNGAAFWYNIGLNGDVDPLTEHAGCPVLL 499

Query: 62  GNKW 65
           G KW
Sbjct: 500 GQKW 503


>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
          Length = 264

 Score = 65.5 bits (158), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 33/57 (57%), Positives = 39/57 (68%), Gaps = 1/57 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 128
           L DVE GG T FP+LNLTV P KG+AV WY+A+ NT  +D R  H   PVA G K+G
Sbjct: 188 LNDVEEGGGTRFPNLNLTVQPAKGNAVLWYSAYPNTTRMDSRTDHEAMPVAKGMKYG 244



 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 31/57 (54%), Positives = 38/57 (66%), Gaps = 1/57 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKWG 66
           L DVE GG T FP+LNLTV P KG+AV W++A+ NT  +D    H   PVA G K+G
Sbjct: 188 LNDVEEGGGTRFPNLNLTVQPAKGNAVLWYSAYPNTTRMDSRTDHEAMPVAKGMKYG 244


>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
          Length = 522

 Score = 65.5 bits (158), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 32/58 (55%), Positives = 35/58 (60%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LS LT VE GG T+F  L L +   KGSAVFWYN   N   D R  H+ CPV  GNKW
Sbjct: 437 LSYLTTVEQGGGTVFTELGLHIRSIKGSAVFWYNLLPNGSGDERTRHAACPVLRGNKW 494



 Score = 57.8 bits (138), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LT VE GG T+F  L L +   KGSAVFW+N   N   D    H+ CPV  GNKW
Sbjct: 440 LTTVEQGGGTVFTELGLHIRSIKGSAVFWYNLLPNGSGDERTRHAACPVLRGNKW 494


>gi|321461762|gb|EFX72791.1| hypothetical protein DAPPUDRAFT_308081 [Daphnia pulex]
          Length = 561

 Score = 65.5 bits (158), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 28/59 (47%), Positives = 37/59 (62%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+  L+DVE GGAT+FP   +T +P KGSA FW+N + +   D    H  CPV  G+KW
Sbjct: 473 LMVYLSDVEAGGATVFPRAGVTCWPRKGSAAFWWNLYKSGEPDLTTRHGACPVLHGSKW 531



 Score = 63.2 bits (152), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 28/57 (49%), Positives = 35/57 (61%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L+DVE GGAT+FP   +T +P KGSA FW N + +   D    H  CPV  G+KW
Sbjct: 475 VYLSDVEAGGATVFPRAGVTCWPRKGSAAFWWNLYKSGEPDLTTRHGACPVLHGSKW 531


>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
          Length = 534

 Score = 65.1 bits (157), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 43/69 (62%), Gaps = 1/69 (1%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           + LG++   +L   LTDVE GGAT+F  +   V P+ G+A+FWYN   +   D R  H+ 
Sbjct: 439 IDLGDRIATVLFY-LTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAA 497

Query: 119 CPVALGNKW 127
           CPV +G+KW
Sbjct: 498 CPVIVGSKW 506



 Score = 58.5 bits (140), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+F  +   V P+ G+A+FW+N   +   D    H+ CPV +G+KW
Sbjct: 452 LTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKW 506


>gi|198417608|ref|XP_002125299.1| PREDICTED: similar to prolyl-4-hydroxylase-alpha EFB CG31022-PA
           [Ciona intestinalis]
          Length = 471

 Score = 65.1 bits (157), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 28/56 (50%), Positives = 37/56 (66%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L++V+ GG+T F   N+   P KGSAVFWYN + +  LD R  H+ CPV +GNKW 
Sbjct: 389 LSEVQKGGSTAFFYPNIVAEPIKGSAVFWYNLYPSGALDKRTLHAACPVLIGNKWA 444



 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 39/64 (60%)

Query: 3   RAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG 62
           R   + V L++V+ GG+T F   N+   P KGSAVFW+N + +  LD    H+ CPV +G
Sbjct: 381 RIATALVYLSEVQKGGSTAFFYPNIVAEPIKGSAVFWYNLYPSGALDKRTLHAACPVLIG 440

Query: 63  NKWG 66
           NKW 
Sbjct: 441 NKWA 444


>gi|195577074|ref|XP_002078398.1| GD23422 [Drosophila simulans]
 gi|194190407|gb|EDX03983.1| GD23422 [Drosophila simulans]
          Length = 513

 Score = 65.1 bits (157), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +TD+ +GGATIFP   L + P+KGSA+FWYN H N   +    H+ CP  +G++W
Sbjct: 440 MTDIPVGGATIFPGAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 494



 Score = 63.5 bits (153), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +TD+ +GGATIFP   L + P+KGSA+FW+N H N   +    H+ CP  +G++W
Sbjct: 440 MTDIPVGGATIFPGAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 494


>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
 gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
          Length = 534

 Score = 65.1 bits (157), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 40/126 (31%), Positives = 64/126 (50%), Gaps = 18/126 (14%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKL 68
           V++T  +L  +  F  +N  +    G   F H       +DY+ Y S       ++  K+
Sbjct: 392 VDMTGFDLADSEDFQVINYGI----GGHYFLH-------MDYFDYASSNYTGPRSRQSKV 440

Query: 69  L-------LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           L       L  L+DVE GGAT+F ++  +V+P+ G+A+FWYN   +   D    H+ CPV
Sbjct: 441 LGDRIATVLFYLSDVEQGGATVFGNVGYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPV 500

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 501 IVGSKW 506


>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
 gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
          Length = 534

 Score = 65.1 bits (157), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 43/69 (62%), Gaps = 1/69 (1%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           + LG++   +L   LTDVE GGAT+F  +   V P+ G+A+FWYN   +   D R  H+ 
Sbjct: 439 IDLGDRIATVLFY-LTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAA 497

Query: 119 CPVALGNKW 127
           CPV +G+KW
Sbjct: 498 CPVIVGSKW 506



 Score = 58.5 bits (140), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+F  +   V P+ G+A+FW+N   +   D    H+ CPV +G+KW
Sbjct: 452 LTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKW 506


>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
          Length = 537

 Score = 65.1 bits (157), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 30/66 (45%), Positives = 39/66 (59%), Gaps = 3/66 (4%)

Query: 65  WGKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           WG  + + L   +DVE GGAT+FP     V+P KGSA FWYN + N   +    H+ CPV
Sbjct: 447 WGNRIATWLFYMSDVEAGGATVFPPTGAAVWPRKGSAAFWYNLYPNGKGNELTRHAACPV 506

Query: 122 ALGNKW 127
             G+KW
Sbjct: 507 LSGSKW 512



 Score = 62.8 bits (151), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE GGAT+FP     V+P KGSA FW+N + N   +    H+ CPV  
Sbjct: 449 NRIATWLFYMSDVEAGGATVFPPTGAAVWPRKGSAAFWYNLYPNGKGNELTRHAACPVLS 508

Query: 62  GNKW 65
           G+KW
Sbjct: 509 GSKW 512


>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
 gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
          Length = 521

 Score = 65.1 bits (157), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DVE GGAT+FP +   VFP++GSA+ WYN   +   +    H+ CPV +G+KW
Sbjct: 446 LNDVEQGGATVFPEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAACPVIVGSKW 500



 Score = 62.0 bits (149), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 36/55 (65%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GGAT+FP +   VFP++GSA+ W+N   +   +    H+ CPV +G+KW
Sbjct: 446 LNDVEQGGATVFPEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAACPVIVGSKW 500


>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
 gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
          Length = 541

 Score = 65.1 bits (157), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 28/59 (47%), Positives = 37/59 (62%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            LS L+DVE GG T+F  +  TV+P+KG A FWYN   +   D    H+ CPV +G+KW
Sbjct: 449 FLSYLSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKW 507



 Score = 60.1 bits (144), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DVE GG T+F  +  TV+P+KG A FW+N   +   D    H+ CPV +G+KW
Sbjct: 453 LSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKW 507


>gi|195471732|ref|XP_002088156.1| GE14021 [Drosophila yakuba]
 gi|194174257|gb|EDW87868.1| GE14021 [Drosophila yakuba]
          Length = 265

 Score = 65.1 bits (157), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 46/78 (58%)

Query: 50  YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
           Y+  HS       +    LL + ++DV +GGATIFP+  L + P+KGSA+FWYN H N  
Sbjct: 169 YFQEHSDYVDIKLHPASSLLPTSISDVPVGGATIFPAAKLAIQPKKGSALFWYNLHNNGD 228

Query: 110 LDYRMYHSGCPVALGNKW 127
            +    H+ CP  +G++W
Sbjct: 229 PNPLTRHAVCPTIVGSRW 246



 Score = 60.8 bits (146), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 38/55 (69%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DV +GGATIFP+  L + P+KGSA+FW+N H N   +    H+ CP  +G++W
Sbjct: 192 ISDVPVGGATIFPAAKLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 246


>gi|194765140|ref|XP_001964685.1| GF23318 [Drosophila ananassae]
 gi|190614957|gb|EDV30481.1| GF23318 [Drosophila ananassae]
          Length = 412

 Score = 65.1 bits (157), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 28/59 (47%), Positives = 37/59 (62%), Gaps = 4/59 (6%)

Query: 73  LTDVELGGATIFPSLN----LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DVE GGAT+FP ++     TV+P  G+A  WYN H + L D    H  CPV +G+KW
Sbjct: 327 LSDVEQGGATVFPGISADSAYTVYPRAGTAAMWYNLHTDGLGDPTTLHVACPVIVGSKW 385



 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 4/66 (6%)

Query: 4   AEMSSVELTDVELGGATIFPSLN----LTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPV 59
           A+  + +L+DVE GGAT+FP ++     TV+P  G+A  W+N H + L D    H  CPV
Sbjct: 320 ADAENFQLSDVEQGGATVFPGISADSAYTVYPRAGTAAMWYNLHTDGLGDPTTLHVACPV 379

Query: 60  ALGNKW 65
            +G+KW
Sbjct: 380 IVGSKW 385


>gi|386771382|ref|NP_649044.3| CG18233 [Drosophila melanogaster]
 gi|383291998|gb|AAF49254.3| CG18233 [Drosophila melanogaster]
          Length = 515

 Score = 65.1 bits (157), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 28/67 (41%), Positives = 44/67 (65%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LG++ G ++     +V  GG T+FP L + V P+KG+A+FW+NA  ++  D R  HS CP
Sbjct: 426 LGDRIGSIIFYA-GEVSQGGQTVFPDLKVAVEPKKGNALFWFNAFDDSTPDPRSLHSVCP 484

Query: 121 VALGNKW 127
           V +G++W
Sbjct: 485 VLVGSRW 491



 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 24/53 (45%), Positives = 35/53 (66%)

Query: 13  DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +V  GG T+FP L + V P+KG+A+FW NA  ++  D    HS CPV +G++W
Sbjct: 439 EVSQGGQTVFPDLKVAVEPKKGNALFWFNAFDDSTPDPRSLHSVCPVLVGSRW 491


>gi|198466399|ref|XP_002135181.1| GA23909 [Drosophila pseudoobscura pseudoobscura]
 gi|198150582|gb|EDY73808.1| GA23909 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score = 65.1 bits (157), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 27/67 (40%), Positives = 45/67 (67%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LG++ G ++    ++V  GGATIFP + +TV P+KG+++FW+N   ++  D R  H+ CP
Sbjct: 439 LGDRTGSIIFYA-SEVPQGGATIFPDIQVTVTPQKGNSLFWFNTFDDSTPDPRSLHAICP 497

Query: 121 VALGNKW 127
           V  G++W
Sbjct: 498 VIAGSRW 504



 Score = 57.0 bits (136), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 23/54 (42%), Positives = 36/54 (66%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++V  GGATIFP + +TV P+KG+++FW N   ++  D    H+ CPV  G++W
Sbjct: 451 SEVPQGGATIFPDIQVTVTPQKGNSLFWFNTFDDSTPDPRSLHAICPVIAGSRW 504


>gi|195392288|ref|XP_002054791.1| GJ24631 [Drosophila virilis]
 gi|194152877|gb|EDW68311.1| GJ24631 [Drosophila virilis]
          Length = 499

 Score = 65.1 bits (157), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 30/65 (46%), Positives = 40/65 (61%), Gaps = 1/65 (1%)

Query: 63  NKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 122
           N+   +LL  LTDV+LGG T FP+L L V P  GSA+ W+N +     D R  H+ CP+ 
Sbjct: 413 NRIATILLY-LTDVQLGGLTSFPALGLAVQPSPGSALIWHNMNNAAECDRRTLHAACPLL 471

Query: 123 LGNKW 127
           LG +W
Sbjct: 472 LGTRW 476



 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + LTDV+LGG T FP+L L V P  GSA+ WHN +     D    H+ CP+ L
Sbjct: 413 NRIATILLYLTDVQLGGLTSFPALGLAVQPSPGSALIWHNMNNAAECDRRTLHAACPLLL 472

Query: 62  GNKW 65
           G +W
Sbjct: 473 GTRW 476


>gi|195128347|ref|XP_002008625.1| GI13597 [Drosophila mojavensis]
 gi|193920234|gb|EDW19101.1| GI13597 [Drosophila mojavensis]
          Length = 457

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 27/54 (50%), Positives = 36/54 (66%)

Query: 75  DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           DV +GGATIFP L L V P +G+A+ WYN +A+   D    H+ CPV LG++W 
Sbjct: 385 DVAIGGATIFPKLRLLVQPRRGTALLWYNLNADGAADPLAKHAVCPVVLGSRWA 438



 Score = 63.2 bits (152), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 36/54 (66%)

Query: 13  DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           DV +GGATIFP L L V P +G+A+ W+N +A+   D    H+ CPV LG++W 
Sbjct: 385 DVAIGGATIFPKLRLLVQPRRGTALLWYNLNADGAADPLAKHAVCPVVLGSRWA 438


>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
          Length = 235

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 31/66 (46%), Positives = 41/66 (62%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DV  GGAT+F  L L+VFP +GSAVFW N H +   D    H+ CPV
Sbjct: 132 GNRIATVLFY-MSDVAQGGATVFTELGLSVFPRRGSAVFWLNLHPSGEGDLATRHAACPV 190

Query: 122 ALGNKW 127
             G+KW
Sbjct: 191 LRGSKW 196



 Score = 62.8 bits (151), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 36/55 (65%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DV  GGAT+F  L L+VFP +GSAVFW N H +   D    H+ CPV  G+KW
Sbjct: 142 MSDVAQGGATVFTELGLSVFPRRGSAVFWLNLHPSGEGDLATRHAACPVLRGSKW 196


>gi|161076739|ref|NP_001097101.1| CG34345 [Drosophila melanogaster]
 gi|157400090|gb|ABV53635.1| CG34345 [Drosophila melanogaster]
          Length = 504

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +TDV +GG TIFP   L + P+KGSA+FWYN H N   +    H+ CP  +G++W
Sbjct: 431 MTDVPVGGTTIFPGAQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGSRW 485



 Score = 63.2 bits (152), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 36/55 (65%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +TDV +GG TIFP   L + P+KGSA+FW+N H N   +    H+ CP  +G++W
Sbjct: 431 MTDVPVGGTTIFPGAQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGSRW 485


>gi|92109908|gb|ABE73278.1| IP10618p [Drosophila melanogaster]
          Length = 501

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +TDV +GG TIFP   L + P+KGSA+FWYN H N   +    H+ CP  +G++W
Sbjct: 428 MTDVPVGGTTIFPGAQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGSRW 482



 Score = 63.2 bits (152), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 36/55 (65%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +TDV +GG TIFP   L + P+KGSA+FW+N H N   +    H+ CP  +G++W
Sbjct: 428 MTDVPVGGTTIFPGAQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGSRW 482


>gi|195591298|ref|XP_002085379.1| GD14755 [Drosophila simulans]
 gi|194197388|gb|EDX10964.1| GD14755 [Drosophila simulans]
          Length = 515

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 28/67 (41%), Positives = 44/67 (65%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LG++ G ++     +V  GG T+FP L + V P+KG+A+FW+NA  ++  D R  HS CP
Sbjct: 426 LGDRIGSIIFYA-GEVSQGGQTVFPDLKVAVEPKKGNALFWFNAFDDSSPDPRTLHSVCP 484

Query: 121 VALGNKW 127
           V +G++W
Sbjct: 485 VIVGSRW 491



 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 24/53 (45%), Positives = 35/53 (66%)

Query: 13  DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +V  GG T+FP L + V P+KG+A+FW NA  ++  D    HS CPV +G++W
Sbjct: 439 EVSQGGQTVFPDLKVAVEPKKGNALFWFNAFDDSSPDPRTLHSVCPVIVGSRW 491


>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
 gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
          Length = 525

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 29/67 (43%), Positives = 41/67 (61%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+   +L   L+DV  GG T FP L + + P+K +A FWYN HA+ + D R  H  CP
Sbjct: 413 MGNRIATVLFY-LSDVTQGGGTAFPHLRVLLKPKKYAAAFWYNLHASGVGDPRTQHGACP 471

Query: 121 VALGNKW 127
           +  G+KW
Sbjct: 472 IISGSKW 478



 Score = 57.8 bits (138), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DV  GG T FP L + + P+K +A FW+N HA+ + D    H  CP+  
Sbjct: 415 NRIATVLFYLSDVTQGGGTAFPHLRVLLKPKKYAAAFWYNLHASGVGDPRTQHGACPIIS 474

Query: 62  GNKW 65
           G+KW
Sbjct: 475 GSKW 478


>gi|195505253|ref|XP_002099424.1| GE23369 [Drosophila yakuba]
 gi|194185525|gb|EDW99136.1| GE23369 [Drosophila yakuba]
          Length = 164

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DVE GGAT+FP    T++P  G+A+ WYN H +   D    H+ CPV +G+KW
Sbjct: 88  LSDVEQGGATVFPMSGYTIYPRAGTALLWYNLHTDGHCDPSTLHAACPVMVGSKW 142



 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 26/61 (42%), Positives = 38/61 (62%)

Query: 5   EMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           E    +L+DVE GGAT+FP    T++P  G+A+ W+N H +   D    H+ CPV +G+K
Sbjct: 82  EGEKFQLSDVEQGGATVFPMSGYTIYPRAGTALLWYNLHTDGHCDPSTLHAACPVMVGSK 141

Query: 65  W 65
           W
Sbjct: 142 W 142


>gi|194765182|ref|XP_001964706.1| GF22908 [Drosophila ananassae]
 gi|190614978|gb|EDV30502.1| GF22908 [Drosophila ananassae]
          Length = 509

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 28/63 (44%), Positives = 42/63 (66%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   S   L++V+ GG T+FP+LN+ + P KGS V WHN H +  +D   +H+GCPV +
Sbjct: 415 DRISTSMFYLSNVQQGGYTVFPNLNVFLPPVKGSMVLWHNLHYSLDVDARTWHAGCPVIV 474

Query: 62  GNK 64
           G+K
Sbjct: 475 GSK 477



 Score = 63.9 bits (154), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 40/54 (74%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L++V+ GG T+FP+LN+ + P KGS V W+N H +  +D R +H+GCPV +G+K
Sbjct: 424 LSNVQQGGYTVFPNLNVFLPPVKGSMVLWHNLHYSLDVDARTWHAGCPVIVGSK 477


>gi|195064500|ref|XP_001996577.1| GH12091 [Drosophila grimshawi]
 gi|193895397|gb|EDV94263.1| GH12091 [Drosophila grimshawi]
          Length = 521

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 27/67 (40%), Positives = 43/67 (64%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LG++   ++    +DV LGGAT+FP + + V P+KG+++ WYN   +   D R  HS CP
Sbjct: 434 LGDRLASIIFYS-SDVPLGGATVFPDIQVAVQPQKGNSLLWYNLFDDGTPDPRSLHSVCP 492

Query: 121 VALGNKW 127
           V +G++W
Sbjct: 493 VVVGSRW 499



 Score = 58.5 bits (140), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 23/54 (42%), Positives = 36/54 (66%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +DV LGGAT+FP + + V P+KG+++ W+N   +   D    HS CPV +G++W
Sbjct: 446 SDVPLGGATVFPDIQVAVQPQKGNSLLWYNLFDDGTPDPRSLHSVCPVVVGSRW 499


>gi|443707037|gb|ELU02831.1| hypothetical protein CAPTEDRAFT_181697 [Capitella teleta]
          Length = 538

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 26/51 (50%), Positives = 33/51 (64%)

Query: 77  ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           E GGAT+FP L + ++PEKGS   W+N   N   DYR  H+GCP   G+KW
Sbjct: 465 EAGGATVFPDLGVKLWPEKGSCAVWWNLMRNGEGDYRTKHAGCPTITGSKW 515



 Score = 59.3 bits (142), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 25/51 (49%), Positives = 31/51 (60%)

Query: 15  ELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           E GGAT+FP L + ++PEKGS   W N   N   DY   H+GCP   G+KW
Sbjct: 465 EAGGATVFPDLGVKLWPEKGSCAVWWNLMRNGEGDYRTKHAGCPTITGSKW 515


>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
 gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
          Length = 286

 Score = 64.3 bits (155), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 43/73 (58%)

Query: 55  SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
           S   +A G +    L+  L +VE GGATIFP L L+V P+KGSAV++   ++   LD R 
Sbjct: 200 SAVQMATGGQRVSTLVMYLNEVEDGGATIFPELGLSVLPKKGSAVYFEYTNSRGQLDPRT 259

Query: 115 YHSGCPVALGNKW 127
            H G PV  G KW
Sbjct: 260 LHGGAPVLRGEKW 272



 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +VE GGATIFP L L+V P+KGSAV++   ++   LD    H G PV  G KW
Sbjct: 218 LNEVEDGGATIFPELGLSVLPKKGSAVYFEYTNSRGQLDPRTLHGGAPVLRGEKW 272


>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
 gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
          Length = 495

 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V ++ VE GGAT+FP +   + P+KG AVFWHN   +   D+   H+GCPV  
Sbjct: 409 NRIATVLVWMSQVESGGATVFPYVGARILPQKGDAVFWHNLLRSGDGDFRTRHAGCPVLS 468

Query: 62  GNKW 65
           G KW
Sbjct: 469 GIKW 472



 Score = 63.9 bits (154), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 42/66 (63%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L+  ++ VE GGAT+FP +   + P+KG AVFW+N   +   D+R  H+GCPV
Sbjct: 408 GNRIATVLV-WMSQVESGGATVFPYVGARILPQKGDAVFWHNLLRSGDGDFRTRHAGCPV 466

Query: 122 ALGNKW 127
             G KW
Sbjct: 467 LSGIKW 472


>gi|390176836|ref|XP_003736216.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
 gi|388858809|gb|EIM52289.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
          Length = 567

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 2/79 (2%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D++  H+      G++   ++   L DVE GGAT FP+L+L V  E+G+ +FW+N    T
Sbjct: 459 DFFSVHTPTSRLHGDRIATVIFY-LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGET 517

Query: 109 L-LDYRMYHSGCPVALGNK 126
             LDYR  H  CPV +G K
Sbjct: 518 YDLDYRTLHGACPVIVGTK 536



 Score = 62.8 bits (151), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L DVE GGAT FP+L+L V  E+G+ +FWHN    T  LDY   H  CPV +G K
Sbjct: 482 LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 536


>gi|198449518|ref|XP_002136915.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
 gi|198130643|gb|EDY67473.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
          Length = 543

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 2/79 (2%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D++  H+      G++   ++   L DVE GGAT FP+L+L V  E+G+ +FW+N    T
Sbjct: 435 DFFSVHTPTSRLHGDRIATVIFY-LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGET 493

Query: 109 L-LDYRMYHSGCPVALGNK 126
             LDYR  H  CPV +G K
Sbjct: 494 YDLDYRTLHGACPVIVGTK 512



 Score = 62.8 bits (151), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L DVE GGAT FP+L+L V  E+G+ +FWHN    T  LDY   H  CPV +G K
Sbjct: 458 LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 512


>gi|198449506|ref|XP_002136910.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
 gi|198130637|gb|EDY67468.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
          Length = 543

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 2/79 (2%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D++  H+      G++   ++   L DVE GGAT FP+L+L V  E+G+ +FW+N    T
Sbjct: 435 DFFSVHTPTSRLHGDRIATVIFY-LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGET 493

Query: 109 L-LDYRMYHSGCPVALGNK 126
             LDYR  H  CPV +G K
Sbjct: 494 YDLDYRTLHGACPVIVGTK 512



 Score = 62.8 bits (151), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L DVE GGAT FP+L+L V  E+G+ +FWHN    T  LDY   H  CPV +G K
Sbjct: 458 LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 512


>gi|195159160|ref|XP_002020450.1| GL13507 [Drosophila persimilis]
 gi|194117219|gb|EDW39262.1| GL13507 [Drosophila persimilis]
          Length = 543

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 2/79 (2%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D++  H+      G++   ++   L DVE GGAT FP+L+L V  E+G+ +FW+N    T
Sbjct: 435 DFFSVHTPTSRLHGDRIATVIFY-LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGET 493

Query: 109 L-LDYRMYHSGCPVALGNK 126
             LDYR  H  CPV +G K
Sbjct: 494 YDLDYRTLHGACPVIVGTK 512



 Score = 62.8 bits (151), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L DVE GGAT FP+L+L V  E+G+ +FWHN    T  LDY   H  CPV +G K
Sbjct: 458 LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 512


>gi|195159148|ref|XP_002020444.1| GL13996 [Drosophila persimilis]
 gi|194117213|gb|EDW39256.1| GL13996 [Drosophila persimilis]
          Length = 559

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 2/79 (2%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D++  H+      G++   ++   L DVE GGAT FP+L+L V  E+G+ +FW+N    T
Sbjct: 451 DFFSVHTPTSRLHGDRIATVIFY-LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGET 509

Query: 109 L-LDYRMYHSGCPVALGNK 126
             LDYR  H  CPV +G K
Sbjct: 510 YDLDYRTLHGACPVIVGTK 528



 Score = 62.8 bits (151), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L DVE GGAT FP+L+L V  E+G+ +FWHN    T  LDY   H  CPV +G K
Sbjct: 474 LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 528


>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
 gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
          Length = 193

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 31/74 (41%), Positives = 44/74 (59%), Gaps = 1/74 (1%)

Query: 54  HSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR 113
            S   + LG++   +L   LTDVE GGAT+F  +   V P+ G+A+FWYN   +   D R
Sbjct: 93  RSRYSIDLGDRIATVLFY-LTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPR 151

Query: 114 MYHSGCPVALGNKW 127
             H+ CPV +G+KW
Sbjct: 152 TRHAACPVIVGSKW 165



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+F  +   V P+ G+A+FW+N   +   D    H+ CPV +G+KW
Sbjct: 111 LTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKW 165


>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
 gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
          Length = 521

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 36/110 (32%), Positives = 53/110 (48%), Gaps = 8/110 (7%)

Query: 24  SLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL------GNKWGKLLLSGLTDVE 77
           +LN+    +   A +    H    +D W Y +     L      GN+   +L   L+DV 
Sbjct: 371 NLNMKYAEDHQFANYGIGGHYGQHMD-WFYQTTIDAGLISSPEMGNRIATVLFY-LSDVS 428

Query: 78  LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            GG T FP L   + P+K +A FW+N HA+ + D R  H  CP+  G+KW
Sbjct: 429 QGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 478



 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DV  GG T FP L   + P+K +A FWHN HA+ + D    H  CP+  
Sbjct: 415 NRIATVLFYLSDVSQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIA 474

Query: 62  GNKW 65
           G+KW
Sbjct: 475 GSKW 478


>gi|195438148|ref|XP_002066999.1| GK24258 [Drosophila willistoni]
 gi|194163084|gb|EDW77985.1| GK24258 [Drosophila willistoni]
          Length = 217

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 30/68 (44%), Positives = 43/68 (63%), Gaps = 1/68 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G++   +L   L+DV  GG TIFP  N+TV P+KGSA+FW+N H +   + +  H  CP+
Sbjct: 150 GDRLATILFY-LSDVAQGGHTIFPLANVTVQPKKGSALFWFNLHNDGEPNIKSLHGVCPI 208

Query: 122 ALGNKWGK 129
             GN+W K
Sbjct: 209 IEGNRWSK 216



 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/57 (49%), Positives = 36/57 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGK 67
           L+DV  GG TIFP  N+TV P+KGSA+FW N H +   +    H  CP+  GN+W K
Sbjct: 160 LSDVAQGGHTIFPLANVTVQPKKGSALFWFNLHNDGEPNIKSLHGVCPIIEGNRWSK 216


>gi|341884171|gb|EGT40106.1| CBN-PHY-2 protein [Caenorhabditis brenneri]
          Length = 607

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 30/66 (45%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++  ELGGAT+F  L   VFP K  A+FWYN   +   D R  H+ CPV
Sbjct: 505 GNRIATVLFY-MSQPELGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPV 563

Query: 122 ALGNKW 127
            LG KW
Sbjct: 564 LLGVKW 569



 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 34/64 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++  ELGGAT+F  L   VFP K  A+FW+N   +   D    H+ CPV L
Sbjct: 506 NRIATVLFYMSQPELGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLL 565

Query: 62  GNKW 65
           G KW
Sbjct: 566 GVKW 569


>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
 gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
          Length = 498

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 31/86 (36%), Positives = 50/86 (58%), Gaps = 1/86 (1%)

Query: 42  AHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFW 101
            H +   D   Y     + LG++   +L   + +V+ GGAT FP +N++V P+KGSAV W
Sbjct: 386 GHYDVHYDSHNYSEANRLILGDRIATVLFY-VGEVDSGGATTFPYINVSVTPKKGSAVLW 444

Query: 102 YNAHANTLLDYRMYHSGCPVALGNKW 127
           YN      ++ +  H+GCPV +G+K+
Sbjct: 445 YNLDNAGQMNPKAIHAGCPVIVGSKY 470



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 24/53 (45%), Positives = 37/53 (69%)

Query: 13  DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +V+ GGAT FP +N++V P+KGSAV W+N      ++    H+GCPV +G+K+
Sbjct: 418 EVDSGGATTFPYINVSVTPKKGSAVLWYNLDNAGQMNPKAIHAGCPVIVGSKY 470


>gi|443697961|gb|ELT98195.1| hypothetical protein CAPTEDRAFT_181380 [Capitella teleta]
          Length = 530

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 33/55 (60%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+++E GG T++P++   V P K S   WYN   N   DYR YH+ CP+  G KW
Sbjct: 453 LSELEAGGYTVYPTVGAAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILYGYKW 507



 Score = 58.9 bits (141), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 34/64 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+++E GG T++P++   V P K S   W+N   N   DY  YH+ CP+  
Sbjct: 444 NRVATFICYLSELEAGGYTVYPTVGAAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILY 503

Query: 62  GNKW 65
           G KW
Sbjct: 504 GYKW 507


>gi|195166677|ref|XP_002024161.1| GL22880 [Drosophila persimilis]
 gi|194107516|gb|EDW29559.1| GL22880 [Drosophila persimilis]
          Length = 507

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 44/67 (65%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LG++ G ++    ++V  GG TIFP + +TV P+KG+++FW+N   ++  D R  H+ CP
Sbjct: 382 LGDRTGSIIFYA-SEVPQGGTTIFPDIQVTVTPQKGNSLFWFNTFDDSTPDPRSLHAICP 440

Query: 121 VALGNKW 127
           V  G++W
Sbjct: 441 VIAGSRW 447



 Score = 55.5 bits (132), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 22/54 (40%), Positives = 35/54 (64%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++V  GG TIFP + +TV P+KG+++FW N   ++  D    H+ CPV  G++W
Sbjct: 394 SEVPQGGTTIFPDIQVTVTPQKGNSLFWFNTFDDSTPDPRSLHAICPVIAGSRW 447


>gi|449673565|ref|XP_002167120.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 571

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           + +GN+   +L   L+DVE GGAT+F      VFP KG AVFW+N   N   +    H+ 
Sbjct: 481 LGMGNRIATVLFY-LSDVEAGGATVFTVGKTAVFPSKGDAVFWFNLKRNGKGNPNTRHAA 539

Query: 119 CPVALGNKW 127
           CPV +G KW
Sbjct: 540 CPVLVGQKW 548



 Score = 58.5 bits (140), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 34/64 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DVE GGAT+F      VFP KG AVFW N   N   +    H+ CPV +
Sbjct: 485 NRIATVLFYLSDVEAGGATVFTVGKTAVFPSKGDAVFWFNLKRNGKGNPNTRHAACPVLV 544

Query: 62  GNKW 65
           G KW
Sbjct: 545 GQKW 548


>gi|427783867|gb|JAA57385.1| Putative prolyl 4-hydroxylase subunit alpha-1 [Rhipicephalus
           pulchellus]
          Length = 548

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 27/59 (45%), Positives = 40/59 (67%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+  L+DVE GGAT+FP L + + P+KG+A FW+N +++   +    H GCPV  G+KW
Sbjct: 461 LMFYLSDVEEGGATVFPHLGVRLTPKKGNAAFWWNLNSDGEGEQLTKHGGCPVLYGSKW 519



 Score = 61.6 bits (148), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DVE GGAT+FP L + + P+KG+A FW N +++   +    H GCPV  G+KW
Sbjct: 465 LSDVEEGGATVFPHLGVRLTPKKGNAAFWWNLNSDGEGEQLTKHGGCPVLYGSKW 519


>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
 gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
          Length = 547

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 28/67 (41%), Positives = 40/67 (59%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+   +L   L+DV  GG T FP L   + P+K +A FW+N HA+ + D R  H  CP
Sbjct: 444 MGNRIATVLFY-LSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACP 502

Query: 121 VALGNKW 127
           +  G+KW
Sbjct: 503 IIAGSKW 509



 Score = 60.8 bits (146), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DV  GG T FP L   + P+K +A FWHN HA+ + D    H  CP+  
Sbjct: 446 NRIATVLFYLSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIA 505

Query: 62  GNKW 65
           G+KW
Sbjct: 506 GSKW 509


>gi|195452728|ref|XP_002073474.1| GK14137 [Drosophila willistoni]
 gi|194169559|gb|EDW84460.1| GK14137 [Drosophila willistoni]
          Length = 536

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 29/63 (46%), Positives = 40/63 (63%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   S   L+DVELGG T+F  LN+ + P KG+ V WHN H +  +D    H+GCPV +
Sbjct: 442 DRISTSMFYLSDVELGGNTVFIKLNVFLPPIKGAMVMWHNLHYSLDVDRRTIHAGCPVLI 501

Query: 62  GNK 64
           G+K
Sbjct: 502 GSK 504



 Score = 62.4 bits (150), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 27/54 (50%), Positives = 38/54 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DVELGG T+F  LN+ + P KG+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 451 LSDVELGGNTVFIKLNVFLPPIKGAMVMWHNLHYSLDVDRRTIHAGCPVLIGSK 504


>gi|195069793|ref|XP_001997027.1| GH12976 [Drosophila grimshawi]
 gi|193891496|gb|EDV90362.1| GH12976 [Drosophila grimshawi]
          Length = 83

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DV+ GGAT+F SL   ++P+KG+A FW N H +   D R  H+ CPV  G+KW
Sbjct: 1   MSDVQQGGATVFTSLRTALWPKKGTAAFWMNLHRSGEGDARTRHAACPVLTGSKW 55



 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 36/55 (65%)

Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
          ++DV+ GGAT+F SL   ++P+KG+A FW N H +   D    H+ CPV  G+KW
Sbjct: 1  MSDVQQGGATVFTSLRTALWPKKGTAAFWMNLHRSGEGDARTRHAACPVLTGSKW 55


>gi|350014318|dbj|GAA37183.1| prolyl 4-hydroxylase [Clonorchis sinensis]
          Length = 595

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   ++   LTDV+ GGAT+F     +V P +G+A FWYN H +   D R  H  CPV
Sbjct: 501 GNRIATIIFY-LTDVKAGGATVFNRFGASVKPVRGAAGFWYNLHPSGEGDLRTRHVACPV 559

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 560 LVGSKW 565



 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       LTDV+ GGAT+F     +V P +G+A FW+N H +   D    H  CPV +
Sbjct: 502 NRIATIIFYLTDVKAGGATVFNRFGASVKPVRGAAGFWYNLHPSGEGDLRTRHVACPVLV 561

Query: 62  GNKW 65
           G+KW
Sbjct: 562 GSKW 565


>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
          Length = 546

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 28/67 (41%), Positives = 40/67 (59%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+   +L   L+DV  GG T FP L   + P+K +A FW+N HA+ + D R  H  CP
Sbjct: 444 MGNRIAAVLFY-LSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACP 502

Query: 121 VALGNKW 127
           +  G+KW
Sbjct: 503 IIAGSKW 509



 Score = 60.8 bits (146), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DV  GG T FP L   + P+K +A FWHN HA+ + D    H  CP+  
Sbjct: 446 NRIAAVLFYLSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIA 505

Query: 62  GNKW 65
           G+KW
Sbjct: 506 GSKW 509


>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
 gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
          Length = 550

 Score = 63.2 bits (152), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 28/67 (41%), Positives = 40/67 (59%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+   +L   L+DV  GG T FP L   + P+K +A FW+N HA+ + D R  H  CP
Sbjct: 441 MGNRIATVLFY-LSDVSQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACP 499

Query: 121 VALGNKW 127
           +  G+KW
Sbjct: 500 IIAGSKW 506



 Score = 60.8 bits (146), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DV  GG T FP L   + P+K +A FWHN HA+ + D    H  CP+  
Sbjct: 443 NRIATVLFYLSDVSQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIA 502

Query: 62  GNKW 65
           G+KW
Sbjct: 503 GSKW 506


>gi|390352104|ref|XP_003727818.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
           [Strongylocentrotus purpuratus]
          Length = 121

 Score = 63.2 bits (152), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 30/66 (45%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   L+DV  GG T+F      + PEKGSA+FWYN   N  +D R  H+ CPV
Sbjct: 34  GNRIASMLFY-LSDVAKGGDTVFIDAGAKIKPEKGSAIFWYNLFKNGKVDERTKHASCPV 92

Query: 122 ALGNKW 127
             G+KW
Sbjct: 93  ISGSKW 98



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 33/55 (60%)

Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
          L+DV  GG T+F      + PEKGSA+FW+N   N  +D    H+ CPV  G+KW
Sbjct: 44 LSDVAKGGDTVFIDAGAKIKPEKGSAIFWYNLFKNGKVDERTKHASCPVISGSKW 98


>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
          Length = 532

 Score = 63.2 bits (152), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 35/55 (63%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +TDVE GGAT+F    + V+PEKGSA  W+N   +   D R  H+ CPV  G+KW
Sbjct: 457 MTDVEAGGATVFLDAGVKVYPEKGSAAVWHNLLPSGEGDMRTRHAACPVLTGSKW 511



 Score = 62.0 bits (149), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 34/55 (61%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +TDVE GGAT+F    + V+PEKGSA  WHN   +   D    H+ CPV  G+KW
Sbjct: 457 MTDVEAGGATVFLDAGVKVYPEKGSAAVWHNLLPSGEGDMRTRHAACPVLTGSKW 511


>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
 gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
          Length = 517

 Score = 63.2 bits (152), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 35/55 (63%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LTDV+ GGAT+FP L L+ FP+KGSA+ + N       D    HS CPV  GNKW
Sbjct: 441 LTDVQQGGATVFPFLRLSYFPKKGSALIFRNLDNAMSGDKDSTHSACPVLFGNKW 495



 Score = 62.4 bits (150), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDV+ GGAT+FP L L+ FP+KGSA+ + N       D    HS CPV  GNKW
Sbjct: 441 LTDVQQGGATVFPFLRLSYFPKKGSALIFRNLDNAMSGDKDSTHSACPVLFGNKW 495


>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
           queenslandica]
          Length = 525

 Score = 62.8 bits (151), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 28/66 (42%), Positives = 42/66 (63%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   LL+  ++DVE GGAT+FP +   + P K +A +W+N   +   DY   H+GCPV
Sbjct: 438 GNRISTLLIY-MSDVEKGGATVFPGVGARLVPIKRAAAYWWNLKRSGDGDYSTRHAGCPV 496

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 497 LVGSKW 502



 Score = 59.7 bits (143), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + ++DVE GGAT+FP +   + P K +A +W N   +   DY   H+GCPV +
Sbjct: 439 NRISTLLIYMSDVEKGGATVFPGVGARLVPIKRAAAYWWNLKRSGDGDYSTRHAGCPVLV 498

Query: 62  GNKW 65
           G+KW
Sbjct: 499 GSKW 502


>gi|195145084|ref|XP_002013526.1| GL24185 [Drosophila persimilis]
 gi|194102469|gb|EDW24512.1| GL24185 [Drosophila persimilis]
          Length = 229

 Score = 62.8 bits (151), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 51/85 (60%), Gaps = 1/85 (1%)

Query: 43  HANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWY 102
           H +   D   Y     + LG++   +L   + +V+ GGAT FP +N++V P+KGSAV WY
Sbjct: 118 HYDVHYDSHNYSEANRLILGDRIATVLFY-VGEVDSGGATTFPYINVSVTPKKGSAVLWY 176

Query: 103 NAHANTLLDYRMYHSGCPVALGNKW 127
           N   +  ++ +  H+GCPV +G+K+
Sbjct: 177 NLDNSGQMNPKAIHAGCPVIVGSKY 201



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 38/53 (71%)

Query: 13  DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +V+ GGAT FP +N++V P+KGSAV W+N   +  ++    H+GCPV +G+K+
Sbjct: 149 EVDSGGATTFPYINVSVTPKKGSAVLWYNLDNSGQMNPKAIHAGCPVIVGSKY 201


>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
 gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
          Length = 215

 Score = 62.8 bits (151), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 44/78 (56%)

Query: 50  YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
           ++ + +    A  N     L+  L DVE GG T FP LNL+VFP KG AV++   ++N  
Sbjct: 124 HFDFFADTSRASANNRISTLVMYLNDVEEGGETTFPMLNLSVFPSKGMAVYFEYFYSNHE 183

Query: 110 LDYRMYHSGCPVALGNKW 127
           L+ R  H+G PV  G KW
Sbjct: 184 LNERTLHAGAPVRKGEKW 201



 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+VFP KG AV++   ++N  L+    H+G PV  G KW
Sbjct: 147 LNDVEEGGETTFPMLNLSVFPSKGMAVYFEYFYSNHELNERTLHAGAPVRKGEKW 201


>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
 gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
          Length = 541

 Score = 62.8 bits (151), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 28/67 (41%), Positives = 40/67 (59%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+   +L   L+DV  GG T FP L   + P+K +A FW+N HA+ + D R  H  CP
Sbjct: 441 MGNRIATVLFY-LSDVAQGGGTAFPHLKQLLQPKKYAAAFWHNLHASGVGDLRTLHGACP 499

Query: 121 VALGNKW 127
           +  G+KW
Sbjct: 500 IIAGSKW 506



 Score = 60.5 bits (145), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DV  GG T FP L   + P+K +A FWHN HA+ + D    H  CP+  
Sbjct: 443 NRIATVLFYLSDVAQGGGTAFPHLKQLLQPKKYAAAFWHNLHASGVGDLRTLHGACPIIA 502

Query: 62  GNKW 65
           G+KW
Sbjct: 503 GSKW 506


>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 591

 Score = 62.8 bits (151), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 33/55 (60%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV+ GGAT+FP +   +   KG A FWYN   +   D R  H GCPV +G+KW
Sbjct: 507 LNDVKAGGATVFPEVKTRIPVAKGGAAFWYNVRPSGATDPRTLHGGCPVLVGSKW 561



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DV+ GGAT+FP +   +   KG A FW+N   +   D    H GCPV +G+KW
Sbjct: 507 LNDVKAGGATVFPEVKTRIPVAKGGAAFWYNVRPSGATDPRTLHGGCPVLVGSKW 561


>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
 gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
          Length = 144

 Score = 62.8 bits (151), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 28/59 (47%), Positives = 37/59 (62%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            LS L+DVE GG T+F  +  TV+P+KG A FWYN   +   D    H+ CPV +G+KW
Sbjct: 69  FLSYLSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKW 127



 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DVE GG T+F  +  TV+P+KG A FW+N   +   D    H+ CPV +G+KW
Sbjct: 73  LSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKW 127


>gi|195591296|ref|XP_002085378.1| GD14754 [Drosophila simulans]
 gi|194197387|gb|EDX10963.1| GD14754 [Drosophila simulans]
          Length = 508

 Score = 62.8 bits (151), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 33/73 (45%), Positives = 45/73 (61%), Gaps = 13/73 (17%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM------ 114
           LG++   +L   +TDV  GGA  FP+LNLT++P+KGSA+ W N      LD+RM      
Sbjct: 412 LGDRLTSILFF-MTDVVQGGAFAFPNLNLTIWPQKGSALVWRN------LDHRMQPNKDL 464

Query: 115 YHSGCPVALGNKW 127
            H  CPV +G+KW
Sbjct: 465 LHVSCPVVVGSKW 477



 Score = 60.8 bits (146), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 36/55 (65%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +TDV  GGA  FP+LNLT++P+KGSA+ W N       +  + H  CPV +G+KW
Sbjct: 423 MTDVVQGGAFAFPNLNLTIWPQKGSALVWRNLDHRMQPNKDLLHVSCPVVVGSKW 477


>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
          Length = 534

 Score = 62.8 bits (151), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 30/66 (45%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L+  ++DVE GGAT+F  L   VFP K  A+FWYN   +   D R  H+ CPV
Sbjct: 437 GNRIATILIY-MSDVESGGATVFNHLGNAVFPSKYDALFWYNLRRDGEGDLRTRHAACPV 495

Query: 122 ALGNKW 127
             G KW
Sbjct: 496 LTGIKW 501



 Score = 57.0 bits (136), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + ++DVE GGAT+F  L   VFP K  A+FW+N   +   D    H+ CPV  
Sbjct: 438 NRIATILIYMSDVESGGATVFNHLGNAVFPSKYDALFWYNLRRDGEGDLRTRHAACPVLT 497

Query: 62  GNKW 65
           G KW
Sbjct: 498 GIKW 501


>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
 gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
          Length = 526

 Score = 62.4 bits (150), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 32/89 (35%), Positives = 46/89 (51%), Gaps = 4/89 (4%)

Query: 42  AHANTLLDYWMYHSGCPVA---LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSA 98
            H     D+ +     P+A    GN+    +   L++VE GG+T+F    +   P KG A
Sbjct: 415 GHYEPHFDHSLDMENSPIASLGQGNRIATFMFY-LSEVEAGGSTVFIKTGVKTNPFKGGA 473

Query: 99  VFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           VFWYN   +   D+   H+GCPV +GNKW
Sbjct: 474 VFWYNLKKSGEGDWDSLHAGCPVLIGNKW 502



 Score = 59.3 bits (142), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L++VE GG+T+F    +   P KG AVFW+N   +   D+   H+GCPV +
Sbjct: 439 NRIATFMFYLSEVEAGGSTVFIKTGVKTNPFKGGAVFWYNLKKSGEGDWDSLHAGCPVLI 498

Query: 62  GNKW 65
           GNKW
Sbjct: 499 GNKW 502


>gi|195575105|ref|XP_002105520.1| GD21524 [Drosophila simulans]
 gi|194201447|gb|EDX15023.1| GD21524 [Drosophila simulans]
          Length = 448

 Score = 62.4 bits (150), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 28/48 (58%), Positives = 32/48 (66%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           L+DV  GGATIFP L L+VFP+KGSA+ WYN       D R  HS CP
Sbjct: 398 LSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACP 445



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 27/57 (47%), Positives = 34/57 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP 58
           +R   +   L+DV  GGATIFP L L+VFP+KGSA+ W+N       D    HS CP
Sbjct: 389 DRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACP 445


>gi|194905424|ref|XP_001981193.1| GG11755 [Drosophila erecta]
 gi|190655831|gb|EDV53063.1| GG11755 [Drosophila erecta]
          Length = 527

 Score = 62.4 bits (150), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 28/63 (44%), Positives = 39/63 (61%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   S   L+DVE GG T+FP LN+ + P  G+ V WHN H +  +D    H+GCPV +
Sbjct: 433 DRISTSMFYLSDVEQGGYTVFPKLNVFLPPVSGALVMWHNLHRSLDVDARTLHAGCPVIV 492

Query: 62  GNK 64
           G+K
Sbjct: 493 GSK 495



 Score = 61.6 bits (148), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 37/54 (68%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DVE GG T+FP LN+ + P  G+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 442 LSDVEQGGYTVFPKLNVFLPPVSGALVMWHNLHRSLDVDARTLHAGCPVIVGSK 495


>gi|443705944|gb|ELU02240.1| hypothetical protein CAPTEDRAFT_227850 [Capitella teleta]
          Length = 475

 Score = 62.4 bits (150), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 37/55 (67%)

Query: 79  GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLLS 133
           GGAT+FP L +++  +KG+A FW+N H +  LD R  H+ CPV  G KW  +++S
Sbjct: 317 GGATVFPLLPMSIPIQKGAAAFWFNLHPDGSLDRRTLHAACPVIRGTKWECVIVS 371



 Score = 57.0 bits (136), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 35/55 (63%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLS 71
           GGAT+FP L +++  +KG+A FW N H +  LD    H+ CPV  G KW  +++S
Sbjct: 317 GGATVFPLLPMSIPIQKGAAAFWFNLHPDGSLDRRTLHAACPVIRGTKWECVIVS 371


>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
 gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
          Length = 216

 Score = 62.4 bits (150), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPQLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
           LL+    H G PV  G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204



 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPQLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204


>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
 gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
          Length = 536

 Score = 62.4 bits (150), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 36/92 (39%), Positives = 49/92 (53%), Gaps = 6/92 (6%)

Query: 38  FWHNAHANTLLDYWMYHSGCPV--ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEK 95
           + +  H +T  DY   +   P+  ALG++   +L   L DV+ GG+T+FP L L V  E+
Sbjct: 420 YGYGGHYDTHFDY--LNDSLPITQALGDRMATVLFY-LNDVKHGGSTVFPVLQLKVPSER 476

Query: 96  GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
           G  + WYN H  T  LD R  H  CPV  G K
Sbjct: 477 GKVLVWYNMHGETHDLDSRTLHGSCPVIDGAK 508



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 31/55 (56%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L DV+ GG+T+FP L L V  E+G  + W+N H  T  LD    H  CPV  G K
Sbjct: 454 LNDVKHGGSTVFPVLQLKVPSERGKVLVWYNMHGETHDLDSRTLHGSCPVIDGAK 508


>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
 gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score = 62.4 bits (150), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 36/92 (39%), Positives = 49/92 (53%), Gaps = 6/92 (6%)

Query: 38  FWHNAHANTLLDYWMYHSGCPV--ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEK 95
           + +  H +T  DY   +   P+  ALG++   +L   L DV+ GG+T+FP L L V  E+
Sbjct: 414 YGYGGHYDTHFDY--LNDSLPITQALGDRMATVLFY-LNDVKHGGSTVFPVLKLKVPSER 470

Query: 96  GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
           G  + WYN H  T  LD R  H  CPV  G K
Sbjct: 471 GKVLVWYNMHGETHDLDSRTLHGSCPVIDGAK 502



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 31/55 (56%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L DV+ GG+T+FP L L V  E+G  + W+N H  T  LD    H  CPV  G K
Sbjct: 448 LNDVKHGGSTVFPVLKLKVPSERGKVLVWYNMHGETHDLDSRTLHGSCPVIDGAK 502


>gi|405970696|gb|EKC35577.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 171

 Score = 62.0 bits (149), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LTDVE GGAT+FP   + V   KG+A+FWYN   N+  D R  ++ CPV LG+K+
Sbjct: 90  LTDVEKGGATVFPEAKVRVPVTKGAALFWYNIKRNSEKDQRSLNADCPVILGSKF 144



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           LTDVE GGAT+FP   + V   KG+A+FW+N   N+  D    ++ CPV LG+K+
Sbjct: 90  LTDVEKGGATVFPEAKVRVPVTKGAALFWYNIKRNSEKDQRSLNADCPVILGSKF 144


>gi|112984520|ref|NP_001037195.1| prolyl 4-hydroxylase alpha subunit precursor [Bombyx mori]
 gi|37543673|gb|AAM21932.1| prolyl 4-hydroxylase alpha subunit [Bombyx mori]
          Length = 550

 Score = 62.0 bits (149), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DV  GGAT+F  L L++FP K +A FW N HA+   D    H+ CPV
Sbjct: 447 GNRIATVLFY-MSDVAQGGATVFTELGLSLFPIKRAAAFWLNLHASGEGDLATRHAACPV 505

Query: 122 ALGNKW 127
             G+KW
Sbjct: 506 LRGSKW 511



 Score = 60.5 bits (145), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+F  L L++FP K +A FW N HA+   D    H+ CPV  
Sbjct: 448 NRIATVLFYMSDVAQGGATVFTELGLSLFPIKRAAAFWLNLHASGEGDLATRHAACPVLR 507

Query: 62  GNKW 65
           G+KW
Sbjct: 508 GSKW 511


>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
 gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
          Length = 216

 Score = 62.0 bits (149), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
           LL+    H G PV  G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204



 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204


>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
 gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
          Length = 216

 Score = 62.0 bits (149), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   H + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFHQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   H +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFHQDQSLNELTLHGGAPVTKGEKW 204


>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
 gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
          Length = 216

 Score = 62.0 bits (149), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
           LL+    H G PV  G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204



 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204


>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
 gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
          Length = 216

 Score = 62.0 bits (149), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
           LL+    H G PV  G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204



 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204


>gi|241710335|ref|XP_002412046.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
 gi|215505101|gb|EEC14595.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
          Length = 65

 Score = 62.0 bits (149), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++DVE GGAT+FP L + + P+KG A FW+N  AN   +    H+GCPV  G+KW
Sbjct: 11  MSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCPVLYGSKW 65



 Score = 61.6 bits (148), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 39/64 (60%)

Query: 2  NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
          +R     + ++DVE GGAT+FP L + + P+KG A FW N  AN   +    H+GCPV  
Sbjct: 2  DRVATLMIYMSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCPVLY 61

Query: 62 GNKW 65
          G+KW
Sbjct: 62 GSKW 65


>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
 gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
          Length = 216

 Score = 62.0 bits (149), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
           LL+    H G PV  G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204



 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204


>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
 gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
          Length = 216

 Score = 62.0 bits (149), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
           LL+    H G PV  G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204



 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204


>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
          Length = 535

 Score = 62.0 bits (149), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 33/55 (60%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DVE GG T+F +    + PEKG+A  WYN H +   D    H+ CPV  GNKW
Sbjct: 458 LSDVEAGGGTVFMTAGTKLRPEKGAAAVWYNLHPDGTGDDETKHAACPVLTGNKW 512



 Score = 58.9 bits (141), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DVE GG T+F +    + PEKG+A  W+N H +   D    H+ CPV  GNKW
Sbjct: 458 LSDVEAGGGTVFMTAGTKLRPEKGAAAVWYNLHPDGTGDDETKHAACPVLTGNKW 512


>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 542

 Score = 62.0 bits (149), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   LL   +T  E GGAT+F  +  TV P K  A+FWYN   +   D R  H+ CPV
Sbjct: 445 GNRLATLLFY-MTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPV 503

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 504 LIGSKW 509



 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 34/64 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       +T  E GGAT+F  +  TV P K  A+FW+N   +   D    H+ CPV +
Sbjct: 446 NRLATLLFYMTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLI 505

Query: 62  GNKW 65
           G+KW
Sbjct: 506 GSKW 509


>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 541

 Score = 62.0 bits (149), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   LL   +T  E GGAT+F  +  TV P K  A+FWYN   +   D R  H+ CPV
Sbjct: 444 GNRLATLLFY-MTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPV 502

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 503 LIGSKW 508



 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 34/64 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       +T  E GGAT+F  +  TV P K  A+FW+N   +   D    H+ CPV +
Sbjct: 445 NRLATLLFYMTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLI 504

Query: 62  GNKW 65
           G+KW
Sbjct: 505 GSKW 508


>gi|194751829|ref|XP_001958226.1| GF23628 [Drosophila ananassae]
 gi|190625508|gb|EDV41032.1| GF23628 [Drosophila ananassae]
          Length = 484

 Score = 62.0 bits (149), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 42/67 (62%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LG++   ++   + DV  GG T+FP + + V P+KGS++FW+N   +   D R  HS CP
Sbjct: 396 LGDRIASIIFY-VGDVVHGGQTVFPDIQIAVKPQKGSSLFWFNTFDDATPDPRSLHSVCP 454

Query: 121 VALGNKW 127
           V +G++W
Sbjct: 455 VLIGDRW 461



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 23/53 (43%), Positives = 33/53 (62%)

Query: 13  DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           DV  GG T+FP + + V P+KGS++FW N   +   D    HS CPV +G++W
Sbjct: 409 DVVHGGQTVFPDIQIAVKPQKGSSLFWFNTFDDATPDPRSLHSVCPVLIGDRW 461


>gi|195352176|ref|XP_002042590.1| GM14977 [Drosophila sechellia]
 gi|194124474|gb|EDW46517.1| GM14977 [Drosophila sechellia]
          Length = 485

 Score = 62.0 bits (149), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 45/76 (59%), Gaps = 13/76 (17%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM------ 114
           LG++   +L   + DV  GGA  FP+LNLT++P KGSA+ W N      LD+RM      
Sbjct: 413 LGDRLTSILFF-MNDVVQGGAFAFPNLNLTIWPHKGSALVWRN------LDHRMQPNKDL 465

Query: 115 YHSGCPVALGNKWGKL 130
            H  CPV +G+KW +L
Sbjct: 466 LHVSCPVVVGSKWSEL 481



 Score = 60.1 bits (144), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 26/58 (44%), Positives = 36/58 (62%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKL 68
           + DV  GGA  FP+LNLT++P KGSA+ W N       +  + H  CPV +G+KW +L
Sbjct: 424 MNDVVQGGAFAFPNLNLTIWPHKGSALVWRNLDHRMQPNKDLLHVSCPVVVGSKWSEL 481


>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
 gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
 gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
 gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
          Length = 216

 Score = 62.0 bits (149), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
           LL+    H G PV  G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204



 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 29/64 (45%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  
Sbjct: 141 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTK 200

Query: 62  GNKW 65
           G KW
Sbjct: 201 GEKW 204


>gi|21358309|ref|NP_651801.1| prolyl-4-hydroxylase-alpha SG2 [Drosophila melanogaster]
 gi|20269808|gb|AAM18059.1|AF495537_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG2
           [Drosophila melanogaster]
 gi|10726875|gb|AAG22175.1| prolyl-4-hydroxylase-alpha SG2 [Drosophila melanogaster]
          Length = 527

 Score = 61.6 bits (148), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 28/63 (44%), Positives = 39/63 (61%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   S   L+DVE GG T+F  LN+ + P KG+ V WHN H +  +D    H+GCPV +
Sbjct: 433 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIV 492

Query: 62  GNK 64
           G+K
Sbjct: 493 GSK 495



 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 37/54 (68%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DVE GG T+F  LN+ + P KG+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 442 LSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSK 495


>gi|198449650|ref|XP_001357661.2| GA13747 [Drosophila pseudoobscura pseudoobscura]
 gi|198130701|gb|EAL26795.2| GA13747 [Drosophila pseudoobscura pseudoobscura]
          Length = 533

 Score = 61.6 bits (148), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 28/63 (44%), Positives = 40/63 (63%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   S   L+DVE GG T+F  LN+ + P KG+ V WHN H +  +D   +H+GCPV +
Sbjct: 439 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIV 498

Query: 62  GNK 64
           G+K
Sbjct: 499 GSK 501



 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 38/54 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DVE GG T+F  LN+ + P KG+ V W+N H +  +D R +H+GCPV +G+K
Sbjct: 448 LSDVEQGGYTVFTKLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIVGSK 501


>gi|195159321|ref|XP_002020530.1| GL13464 [Drosophila persimilis]
 gi|194117299|gb|EDW39342.1| GL13464 [Drosophila persimilis]
          Length = 533

 Score = 61.6 bits (148), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 28/63 (44%), Positives = 40/63 (63%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   S   L+DVE GG T+F  LN+ + P KG+ V WHN H +  +D   +H+GCPV +
Sbjct: 439 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIV 498

Query: 62  GNK 64
           G+K
Sbjct: 499 GSK 501



 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 38/54 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DVE GG T+F  LN+ + P KG+ V W+N H +  +D R +H+GCPV +G+K
Sbjct: 448 LSDVEQGGYTVFTKLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIVGSK 501


>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
 gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
          Length = 455

 Score = 61.6 bits (148), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 28/56 (50%), Positives = 35/56 (62%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L DVE GG T+FP L +   P KGSAV +YN +++   D R  H GCPV +G KW 
Sbjct: 380 LNDVEQGGKTVFPRLGIFRSPMKGSAVVFYNMNSSLQGDPRTEHGGCPVLVGTKWA 435



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 26/56 (46%), Positives = 34/56 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L DVE GG T+FP L +   P KGSAV ++N +++   D    H GCPV +G KW 
Sbjct: 380 LNDVEQGGKTVFPRLGIFRSPMKGSAVVFYNMNSSLQGDPRTEHGGCPVLVGTKWA 435


>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
 gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
          Length = 584

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   ++   L+DV+ GGAT+F  +   V P+KG+A FW+N   N   D R  H+ CPV
Sbjct: 490 GNRIATIIFY-LSDVQAGGATVFNRIGTRVVPKKGAAGFWFNLLPNGEGDLRTRHAACPV 548

Query: 122 ALGNKW 127
             G+KW
Sbjct: 549 LAGSKW 554



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DV+ GGAT+F  +   V P+KG+A FW N   N   D    H+ CPV  
Sbjct: 491 NRIATIIFYLSDVQAGGATVFNRIGTRVVPKKGAAGFWFNLLPNGEGDLRTRHAACPVLA 550

Query: 62  GNKW 65
           G+KW
Sbjct: 551 GSKW 554


>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
          Length = 550

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   LT+ E+GG T+F  L   V P K  A+FWYN + +   D R  H+ CPV
Sbjct: 441 GNRLATVLFY-LTEPEIGGGTVFTELRTAVMPSKNGALFWYNLYRSGEGDLRTRHAACPV 499

Query: 122 ALGNKW 127
            +G KW
Sbjct: 500 LVGIKW 505



 Score = 55.1 bits (131), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 34/64 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       LT+ E+GG T+F  L   V P K  A+FW+N + +   D    H+ CPV +
Sbjct: 442 NRLATVLFYLTEPEIGGGTVFTELRTAVMPSKNGALFWYNLYRSGEGDLRTRHAACPVLV 501

Query: 62  GNKW 65
           G KW
Sbjct: 502 GIKW 505


>gi|195452738|ref|XP_002073478.1| GK14139 [Drosophila willistoni]
 gi|194169563|gb|EDW84464.1| GK14139 [Drosophila willistoni]
          Length = 215

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 41/55 (74%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GGAT+F +++L+VFP+ G+A+FWYN +     ++ + H+GCPV +G+KW
Sbjct: 141 LQDVPQGGATLFNNISLSVFPKAGAALFWYNLNNAGDTEWNVAHTGCPVIVGSKW 195



 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 42/56 (75%)

Query: 10  ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           EL DV  GGAT+F +++L+VFP+ G+A+FW+N +     ++ + H+GCPV +G+KW
Sbjct: 140 ELQDVPQGGATLFNNISLSVFPKAGAALFWYNLNNAGDTEWNVAHTGCPVIVGSKW 195


>gi|195575115|ref|XP_002105525.1| GD21527 [Drosophila simulans]
 gi|194201452|gb|EDX15028.1| GD21527 [Drosophila simulans]
          Length = 495

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 30/87 (34%), Positives = 44/87 (50%), Gaps = 4/87 (4%)

Query: 45  NTLLDYWMYHSGCPVALGNKWGK----LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF 100
           N  + Y   H      +G  +G+       + L+DV  GG T FP L   + P+K +A F
Sbjct: 365 NLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTLSDVAQGGGTAFPQLRTLLKPKKYAAAF 424

Query: 101 WYNAHANTLLDYRMYHSGCPVALGNKW 127
           W+N HA+ + D R  H  CP+  G+KW
Sbjct: 425 WHNLHASGVGDVRTQHGACPIIAGSKW 451



 Score = 60.5 bits (145), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DV  GG T FP L   + P+K +A FWHN HA+ + D    H  CP+  G+KW
Sbjct: 397 LSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 451


>gi|195341560|ref|XP_002037374.1| GM12888 [Drosophila sechellia]
 gi|194131490|gb|EDW53533.1| GM12888 [Drosophila sechellia]
          Length = 501

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 30/87 (34%), Positives = 44/87 (50%), Gaps = 4/87 (4%)

Query: 45  NTLLDYWMYHSGCPVALGNKWGK----LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF 100
           N  + Y   H      +G  +G+       + L+DV  GG T FP L   + P+K +A F
Sbjct: 371 NLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTLSDVAQGGGTAFPQLRTLLKPKKYAAAF 430

Query: 101 WYNAHANTLLDYRMYHSGCPVALGNKW 127
           W+N HA+ + D R  H  CP+  G+KW
Sbjct: 431 WHNLHASGVGDVRTQHGACPIIAGSKW 457



 Score = 60.5 bits (145), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DV  GG T FP L   + P+K +A FWHN HA+ + D    H  CP+  G+KW
Sbjct: 403 LSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 457


>gi|195574593|ref|XP_002105269.1| GD21390 [Drosophila simulans]
 gi|194201196|gb|EDX14772.1| GD21390 [Drosophila simulans]
          Length = 478

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 27/60 (45%), Positives = 35/60 (58%)

Query: 10  ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLL 69
           EL+DV++GG   FP L     P +GSA+ WHN   +   D     + CPV LGN+WGK L
Sbjct: 382 ELSDVQMGGYASFPDLGFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWGKSL 441



 Score = 60.5 bits (145), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 26/59 (44%), Positives = 35/59 (59%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
           L+DV++GG   FP L     P +GSA+ W+N   +   D R   + CPV LGN+WGK L
Sbjct: 383 LSDVQMGGYASFPDLGFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWGKSL 441


>gi|195575095|ref|XP_002105515.1| GD21523 [Drosophila simulans]
 gi|194201442|gb|EDX15018.1| GD21523 [Drosophila simulans]
          Length = 527

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 28/63 (44%), Positives = 39/63 (61%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   S   L+DVE GG T+F  LN+ + P KG+ V WHN H +  +D    H+GCPV +
Sbjct: 433 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIV 492

Query: 62  GNK 64
           G+K
Sbjct: 493 GSK 495



 Score = 60.8 bits (146), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 37/54 (68%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DVE GG T+F  LN+ + P KG+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 442 LSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIVGSK 495


>gi|195505197|ref|XP_002099400.1| GE10884 [Drosophila yakuba]
 gi|194185501|gb|EDW99112.1| GE10884 [Drosophila yakuba]
          Length = 527

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 28/63 (44%), Positives = 39/63 (61%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   S   L+DVE GG T+F  LN+ + P KG+ V WHN H +  +D    H+GCPV +
Sbjct: 433 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIV 492

Query: 62  GNK 64
           G+K
Sbjct: 493 GSK 495



 Score = 60.8 bits (146), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 37/54 (68%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DVE GG T+F  LN+ + P KG+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 442 LSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIVGSK 495


>gi|198459366|ref|XP_002138685.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
 gi|198136669|gb|EDY69243.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
          Length = 448

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 28/56 (50%), Positives = 35/56 (62%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L DVE GG T+FP L +   P KGSAV +YN +++   D R  H GCPV +G KW 
Sbjct: 373 LNDVEQGGKTVFPRLGIFRSPMKGSAVVFYNLNSSLQGDPRTEHGGCPVLVGTKWA 428



 Score = 55.8 bits (133), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 26/56 (46%), Positives = 34/56 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L DVE GG T+FP L +   P KGSAV ++N +++   D    H GCPV +G KW 
Sbjct: 373 LNDVEQGGKTVFPRLGIFRSPMKGSAVVFYNLNSSLQGDPRTEHGGCPVLVGTKWA 428


>gi|221512810|ref|NP_649043.3| CG18234 [Drosophila melanogaster]
 gi|66771545|gb|AAY55084.1| IP12246p [Drosophila melanogaster]
 gi|220902636|gb|AAF49255.4| CG18234 [Drosophila melanogaster]
          Length = 515

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 29/61 (47%), Positives = 38/61 (62%), Gaps = 12/61 (19%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM------YHSGCPVALGNK 126
           + DV  GGA  FP+LNLT++P+KGSA+ W N      LD+RM       H  CPV +G+K
Sbjct: 424 MNDVAQGGALAFPNLNLTIWPQKGSALVWRN------LDHRMQPNQDLLHVSCPVVVGSK 477

Query: 127 W 127
           W
Sbjct: 478 W 478



 Score = 60.5 bits (145), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + DV  GGA  FP+LNLT++P+KGSA+ W N       +  + H  CPV +G+KW
Sbjct: 424 MNDVAQGGALAFPNLNLTIWPQKGSALVWRNLDHRMQPNQDLLHVSCPVVVGSKW 478


>gi|195441323|ref|XP_002068462.1| GK20483 [Drosophila willistoni]
 gi|194164547|gb|EDW79448.1| GK20483 [Drosophila willistoni]
          Length = 550

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 28/53 (52%), Positives = 34/53 (64%)

Query: 75  DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           DV  GGAT FP   L V P+KGSA+FWYN   +   D R  HS CPV +G++W
Sbjct: 473 DVTEGGATNFPRNQLVVQPKKGSALFWYNKFDDGSPDPRSLHSICPVVVGSRW 525



 Score = 55.8 bits (133), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 26/53 (49%), Positives = 33/53 (62%)

Query: 13  DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           DV  GGAT FP   L V P+KGSA+FW+N   +   D    HS CPV +G++W
Sbjct: 473 DVTEGGATNFPRNQLVVQPKKGSALFWYNKFDDGSPDPRSLHSICPVVVGSRW 525


>gi|195390825|ref|XP_002054068.1| GJ24233 [Drosophila virilis]
 gi|194152154|gb|EDW67588.1| GJ24233 [Drosophila virilis]
          Length = 533

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 28/55 (50%), Positives = 34/55 (61%), Gaps = 1/55 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
           L DV  GGAT+FP L+L V PE+G  + WYN   +T   D R  H GCPV +G K
Sbjct: 446 LNDVVRGGATVFPKLDLVVQPERGKVLHWYNMLPDTFDYDRRSLHGGCPVLIGEK 500



 Score = 57.0 bits (136), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVA 60
           +R   + + L DV  GGAT+FP L+L V PE+G  + W+N   +T   D    H GCPV 
Sbjct: 437 DRIATTLIYLNDVVRGGATVFPKLDLVVQPERGKVLHWYNMLPDTFDYDRRSLHGGCPVL 496

Query: 61  LGNK 64
           +G K
Sbjct: 497 IGEK 500


>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
 gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 539

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++  E GGAT+F  L   VFP K  A+FWYN   +   D R  H+ CPV
Sbjct: 437 GNRIATVLFY-MSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPV 495

Query: 122 ALGNKW 127
            LG KW
Sbjct: 496 LLGVKW 501



 Score = 55.1 bits (131), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 33/64 (51%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++  E GGAT+F  L   VFP K  A+FW+N   +   D    H+ CPV L
Sbjct: 438 NRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLL 497

Query: 62  GNKW 65
           G KW
Sbjct: 498 GVKW 501


>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
 gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
          Length = 539

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++  E GGAT+F  L   VFP K  A+FWYN   +   D R  H+ CPV
Sbjct: 437 GNRIATVLFY-MSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPV 495

Query: 122 ALGNKW 127
            LG KW
Sbjct: 496 LLGVKW 501



 Score = 55.1 bits (131), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 33/64 (51%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++  E GGAT+F  L   VFP K  A+FW+N   +   D    H+ CPV L
Sbjct: 438 NRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLL 497

Query: 62  GNKW 65
           G KW
Sbjct: 498 GVKW 501


>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
 gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
          Length = 573

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++  E GGAT+F  L   VFP K  A+FWYN   +   D R  H+ CPV
Sbjct: 471 GNRIATVLFY-MSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPV 529

Query: 122 ALGNKW 127
            LG KW
Sbjct: 530 LLGVKW 535



 Score = 55.1 bits (131), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 33/64 (51%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++  E GGAT+F  L   VFP K  A+FW+N   +   D    H+ CPV L
Sbjct: 472 NRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLL 531

Query: 62  GNKW 65
           G KW
Sbjct: 532 GVKW 535


>gi|308451420|ref|XP_003088665.1| CRE-PHY-2 protein [Caenorhabditis remanei]
 gi|308246199|gb|EFO90151.1| CRE-PHY-2 protein [Caenorhabditis remanei]
          Length = 609

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++  E GGAT+F  L   VFP K  A+FWYN   +   D R  H+ CPV
Sbjct: 507 GNRIATVLFY-MSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPV 565

Query: 122 ALGNKW 127
            LG KW
Sbjct: 566 LLGVKW 571



 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 33/64 (51%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++  E GGAT+F  L   VFP K  A+FW+N   +   D    H+ CPV L
Sbjct: 508 NRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLL 567

Query: 62  GNKW 65
           G KW
Sbjct: 568 GVKW 571


>gi|51490656|emb|CAF31507.1| prolyl 4-hydroxylase 2 precursor [Brugia malayi]
          Length = 551

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L+  +T+ E+GG T+F  L  +V   K +A+FWYN   +  +D R YH+ CPV
Sbjct: 438 GNRIATILIY-MTEPEIGGGTVFIDLKTSVSCTKNAALFWYNLMRSGAVDMRSYHAACPV 496

Query: 122 ALGNKW 127
             G KW
Sbjct: 497 LTGTKW 502



 Score = 55.1 bits (131), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + +T+ E+GG T+F  L  +V   K +A+FW+N   +  +D   YH+ CPV  
Sbjct: 439 NRIATILIYMTEPEIGGGTVFIDLKTSVSCTKNAALFWYNLMRSGAVDMRSYHAACPVLT 498

Query: 62  GNKW 65
           G KW
Sbjct: 499 GTKW 502


>gi|241598362|ref|XP_002404733.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
 gi|215500464|gb|EEC09958.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
          Length = 340

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/66 (42%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G++   L++  +TDVE GG T+FP+L + + P+KG A FW+N  A+   +    H+GCPV
Sbjct: 143 GDRLATLMIY-MTDVEEGGTTVFPNLGIRLTPKKGDAAFWWNLKASGDGERLTTHAGCPV 201

Query: 122 ALGNKW 127
             G+KW
Sbjct: 202 LYGSKW 207



 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 26/57 (45%), Positives = 37/57 (64%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + +TDVE GG T+FP+L + + P+KG A FW N  A+   +    H+GCPV  G+KW
Sbjct: 151 IYMTDVEEGGTTVFPNLGIRLTPKKGDAAFWWNLKASGDGERLTTHAGCPVLYGSKW 207


>gi|195452736|ref|XP_002073477.1| GK14138 [Drosophila willistoni]
 gi|194169562|gb|EDW84463.1| GK14138 [Drosophila willistoni]
          Length = 518

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 39/55 (70%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GGAT+  +++L+VFP+ G+A+FWYN +     ++ + H+ CPV +G+KW
Sbjct: 444 LQDVPQGGATLLNNISLSVFPKAGAALFWYNLNNAGDTEWNVAHTACPVIVGSKW 498



 Score = 59.3 bits (142), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 40/56 (71%)

Query: 10  ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           EL DV  GGAT+  +++L+VFP+ G+A+FW+N +     ++ + H+ CPV +G+KW
Sbjct: 443 ELQDVPQGGATLLNNISLSVFPKAGAALFWYNLNNAGDTEWNVAHTACPVIVGSKW 498


>gi|321458081|gb|EFX69155.1| hypothetical protein DAPPUDRAFT_228756 [Daphnia pulex]
          Length = 570

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 27/59 (45%), Positives = 37/59 (62%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            ++ L DVE GGAT+F    + V P+KG AVFW+N  +++  D    H GCPV  G+KW
Sbjct: 474 FMTYLDDVEAGGATVFTHAGVVVRPKKGMAVFWWNLKSDSNGDTLTRHGGCPVLHGSKW 532



 Score = 58.9 bits (141), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 34/55 (61%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GGAT+F    + V P+KG AVFW N  +++  D    H GCPV  G+KW
Sbjct: 478 LDDVEAGGATVFTHAGVVVRPKKGMAVFWWNLKSDSNGDTLTRHGGCPVLHGSKW 532


>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
          Length = 541

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   LL   +T  E GGAT+F  +  TV P K  A+FWYN   +   D R  H+ CPV
Sbjct: 444 GNRLATLLFY-MTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPV 502

Query: 122 ALGNKW 127
             G KW
Sbjct: 503 LTGTKW 508



 Score = 52.4 bits (124), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 32/64 (50%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       +T  E GGAT+F  +  TV P K  A+FW+N   +   D    H+ CPV  
Sbjct: 445 NRLATLLFYMTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLT 504

Query: 62  GNKW 65
           G KW
Sbjct: 505 GTKW 508


>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
 gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
          Length = 300

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DVE GGAT+F ++N+ V P KG+ + WYN   +  L     H+GCPV +G+KW
Sbjct: 226 LSDVERGGATVFTNINVRVLPRKGNVIIWYNYLPDGNLHPGTLHAGCPVLVGSKW 280



 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 37/55 (67%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DVE GGAT+F ++N+ V P KG+ + W+N   +  L     H+GCPV +G+KW
Sbjct: 226 LSDVERGGATVFTNINVRVLPRKGNVIIWYNYLPDGNLHPGTLHAGCPVLVGSKW 280


>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
          Length = 541

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   LL   +T  E GGAT+F  +  TV P K  A+FWYN   +   D R  H+ CPV
Sbjct: 444 GNRLATLLFY-MTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPV 502

Query: 122 ALGNKW 127
             G KW
Sbjct: 503 LTGTKW 508



 Score = 52.4 bits (124), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 32/64 (50%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       +T  E GGAT+F  +  TV P K  A+FW+N   +   D    H+ CPV  
Sbjct: 445 NRLATLLFYMTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLT 504

Query: 62  GNKW 65
           G KW
Sbjct: 505 GTKW 508


>gi|195505244|ref|XP_002099420.1| GE10895 [Drosophila yakuba]
 gi|194185521|gb|EDW99132.1| GE10895 [Drosophila yakuba]
          Length = 533

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 36/55 (65%), Gaps = 1/55 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
           L +VE GGAT+FPS+NL V  +KGSA+FW+N    +   D R +H  CP+  G K
Sbjct: 444 LNNVEHGGATVFPSINLAVPTQKGSALFWHNLDGQSYDYDTRTFHGACPLISGTK 498



 Score = 59.7 bits (143), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L +VE GGAT+FPS+NL V  +KGSA+FWHN    +   D   +H  CP+  G K
Sbjct: 444 LNNVEHGGATVFPSINLAVPTQKGSALFWHNLDGQSYDYDTRTFHGACPLISGTK 498


>gi|195113247|ref|XP_002001179.1| GI22114 [Drosophila mojavensis]
 gi|193917773|gb|EDW16640.1| GI22114 [Drosophila mojavensis]
          Length = 487

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 26/63 (41%), Positives = 38/63 (60%), Gaps = 3/63 (4%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM--YHSGCPVALGNKWGKL 130
           L DV  GGAT+FP + + V PE+G  + WYN +  +  DY +  YH  CPV +G K G  
Sbjct: 426 LNDVARGGATVFPDVEIAVHPERGKVIHWYNMNPKS-FDYELHSYHGACPVLIGQKIGNG 484

Query: 131 LLS 133
           +++
Sbjct: 485 IIT 487



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 25/65 (38%), Positives = 39/65 (60%), Gaps = 3/65 (4%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWM--YHSGCPVALGNKWG 66
           + L DV  GGAT+FP + + V PE+G  + W+N +  +  DY +  YH  CPV +G K G
Sbjct: 424 IYLNDVARGGATVFPDVEIAVHPERGKVIHWYNMNPKS-FDYELHSYHGACPVLIGQKIG 482

Query: 67  KLLLS 71
             +++
Sbjct: 483 NGIIT 487


>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
 gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
          Length = 211

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 63/145 (43%), Gaps = 14/145 (9%)

Query: 1   MNRAEM-SSVELTDVELGGATIFPSLNLTVFPEKGSAVFW----------HNAHANTLLD 49
           +NR+++ S  +++D+    +   P   LT   EK  A             H  H     +
Sbjct: 59  VNRSKIGSDHDVSDIRTSSSAFLPDDELTGRIEKRLAQIMNVPVEHGEGIHILHYKPGQE 118

Query: 50  YWMYHS---GCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 106
           Y  +H        A  N     L+  L DVE GG T FP +NLTV P KG AV++   + 
Sbjct: 119 YKAHHDYFRSTSRAAKNPRISTLVLYLNDVEEGGETYFPEMNLTVSPHKGMAVYFEYFYN 178

Query: 107 NTLLDYRMYHSGCPVALGNKWGKLL 131
           +  ++ R  H G PV  G KW   +
Sbjct: 179 DPAINERTLHGGSPVTAGEKWAATM 203


>gi|221512814|ref|NP_649045.2| CG18231 [Drosophila melanogaster]
 gi|220902637|gb|AAF49253.3| CG18231 [Drosophila melanogaster]
          Length = 470

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 30/58 (51%), Positives = 37/58 (63%), Gaps = 5/58 (8%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           LL  L +VELGGA +FP L ++VFP+KGS +FW+N      LD R     CPV  GNK
Sbjct: 393 LLFFLNNVELGGAMVFPKLKISVFPQKGSCLFWHNT-----LDPRSEPLECPVLQGNK 445



 Score = 58.5 bits (140), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 28/54 (51%), Positives = 34/54 (62%), Gaps = 5/54 (9%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           L +VELGGA +FP L ++VFP+KGS +FWHN      LD       CPV  GNK
Sbjct: 397 LNNVELGGAMVFPKLKISVFPQKGSCLFWHNT-----LDPRSEPLECPVLQGNK 445


>gi|66771505|gb|AAY55064.1| IP12044p [Drosophila melanogaster]
          Length = 484

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 30/58 (51%), Positives = 37/58 (63%), Gaps = 5/58 (8%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           LL  L +VELGGA +FP L ++VFP+KGS +FW+N      LD R     CPV  GNK
Sbjct: 407 LLFFLNNVELGGAMVFPKLKISVFPQKGSCLFWHNT-----LDPRSEPLECPVLQGNK 459



 Score = 58.5 bits (140), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 28/54 (51%), Positives = 34/54 (62%), Gaps = 5/54 (9%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           L +VELGGA +FP L ++VFP+KGS +FWHN      LD       CPV  GNK
Sbjct: 411 LNNVELGGAMVFPKLKISVFPQKGSCLFWHNT-----LDPRSEPLECPVLQGNK 459


>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
 gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
          Length = 220

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 45/79 (56%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL++ P+KGSAV++   + + 
Sbjct: 133 DYFAEHSRA--AENNRISTLVMY-LNDVEEGGETFFPKLNLSIAPKKGSAVYFEYFYNDK 189

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 190 SLNELTLHGGAPVIKGEKW 208



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 34/55 (61%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL++ P+KGSAV++   + +  L+    H G PV  G KW
Sbjct: 154 LNDVEEGGETFFPKLNLSIAPKKGSAVYFEYFYNDKSLNELTLHGGAPVIKGEKW 208


>gi|195503448|ref|XP_002098656.1| GE23815 [Drosophila yakuba]
 gi|194184757|gb|EDW98368.1| GE23815 [Drosophila yakuba]
          Length = 472

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 26/59 (44%), Positives = 35/59 (59%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
           L+DV++GG   FP L     P +GSA+ W+N  +    D R   + CPV LGN+WGK L
Sbjct: 377 LSDVQMGGYASFPDLGFGFKPSRGSALVWHNTDSAGNCDTRSLQATCPVLLGNQWGKWL 435



 Score = 59.7 bits (143), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 26/60 (43%), Positives = 35/60 (58%)

Query: 10  ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLL 69
           +L+DV++GG   FP L     P +GSA+ WHN  +    D     + CPV LGN+WGK L
Sbjct: 376 KLSDVQMGGYASFPDLGFGFKPSRGSALVWHNTDSAGNCDTRSLQATCPVLLGNQWGKWL 435


>gi|195338688|ref|XP_002035956.1| GM16188 [Drosophila sechellia]
 gi|194129836|gb|EDW51879.1| GM16188 [Drosophila sechellia]
          Length = 392

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +TDV +GG TIFP   L + P+KGSA+FWYN H N   +    H+ CP  +G++W
Sbjct: 319 MTDVLVGGGTIFPGAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 373



 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 36/55 (65%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +TDV +GG TIFP   L + P+KGSA+FW+N H N   +    H+ CP  +G++W
Sbjct: 319 MTDVLVGGGTIFPGAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 373


>gi|405965633|gb|EKC30995.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 617

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 26/60 (43%), Positives = 40/60 (66%), Gaps = 4/60 (6%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH--ANTLLDYRMYHSGCPVALGNKW 127
           L+ +T+V+ GGAT+F  + + +FP KG+A FWYN +   + + D R  H+ CPV +G KW
Sbjct: 537 LTYMTNVDAGGATVFTHIGVKLFPIKGAAAFWYNLYRSGDGIFDTR--HAACPVLVGQKW 594



 Score = 55.8 bits (133), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       +T+V+ GGAT+F  + + +FP KG+A FW+N + +    +   H+ CPV +
Sbjct: 531 NRIATWLTYMTNVDAGGATVFTHIGVKLFPIKGAAAFWYNLYRSGDGIFDTRHAACPVLV 590

Query: 62  GNKW 65
           G KW
Sbjct: 591 GQKW 594


>gi|194751823|ref|XP_001958223.1| GF23631 [Drosophila ananassae]
 gi|190625505|gb|EDV41029.1| GF23631 [Drosophila ananassae]
          Length = 502

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 32/89 (35%), Positives = 46/89 (51%), Gaps = 3/89 (3%)

Query: 42  AHANTLLDYWMYHSGCPVALGNKWGKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSA 98
            H    +DY       P   G+  G  + + L   +DV LGG T+F  L L++ P+KGSA
Sbjct: 395 GHFGKHVDYVELAKRPPNFFGDLGGDRIATALLYASDVPLGGTTVFTKLKLSIEPKKGSA 454

Query: 99  VFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           + W+N +     D    HS CPV LG++W
Sbjct: 455 LIWFNLNNAGDPDPMSEHSACPVVLGSRW 483



 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 25/54 (46%), Positives = 34/54 (62%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +DV LGG T+F  L L++ P+KGSA+ W N +     D    HS CPV LG++W
Sbjct: 430 SDVPLGGTTVFTKLKLSIEPKKGSALIWFNLNNAGDPDPMSEHSACPVVLGSRW 483


>gi|443697959|gb|ELT98193.1| hypothetical protein CAPTEDRAFT_162820 [Capitella teleta]
          Length = 347

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 33/55 (60%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+++E GG T++P++   V P K S   WYN   N   DYR YH+ CP+  G KW
Sbjct: 270 LSELEAGGYTVYPTVGAAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILYGYKW 324



 Score = 55.5 bits (132), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 34/65 (52%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
            NR       L+++E GG T++P++   V P K S   W+N   N   DY  YH+ CP+ 
Sbjct: 260 RNRVATFICYLSELEAGGYTVYPTVGAAVVPRKNSCALWYNLMRNGTGDYRTYHAACPIL 319

Query: 61  LGNKW 65
            G KW
Sbjct: 320 YGYKW 324


>gi|291224083|ref|XP_002732036.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit-like [Saccoglossus
           kowalevskii]
          Length = 491

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 37/55 (67%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +++V+ GG T+FP ++  V P K +AVFWYN  A+   D    H+GCPV +G+KW
Sbjct: 415 MSEVKAGGYTVFPEVDAFVPPVKNAAVFWYNLKASGESDDLTRHAGCPVLIGSKW 469



 Score = 58.5 bits (140), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       +++V+ GG T+FP ++  V P K +AVFW+N  A+   D    H+GCPV +
Sbjct: 406 NRIATWMFYMSEVKAGGYTVFPEVDAFVPPVKNAAVFWYNLKASGESDDLTRHAGCPVLI 465

Query: 62  GNKW 65
           G+KW
Sbjct: 466 GSKW 469


>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           G9842]
 gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           G9842]
          Length = 216

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A+ N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AVNNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.2 bits (134), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|312092237|ref|XP_003147267.1| hypothetical protein LOAG_11701 [Loa loa]
          Length = 553

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L+  +T+ E+GG T+F +L  +V   K +A+FWYN   +  +D R YH+ CPV
Sbjct: 437 GNRIATILIY-MTEPEIGGRTVFINLKASVPCTKNAALFWYNLMRSGAVDMRSYHAACPV 495

Query: 122 ALGNKW 127
             G KW
Sbjct: 496 LTGTKW 501



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + +T+ E+GG T+F +L  +V   K +A+FW+N   +  +D   YH+ CPV  
Sbjct: 438 NRIATILIYMTEPEIGGRTVFINLKASVPCTKNAALFWYNLMRSGAVDMRSYHAACPVLT 497

Query: 62  GNKW 65
           G KW
Sbjct: 498 GTKW 501


>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
 gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
          Length = 248

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A+ N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 161 DYFAEHSRS--AVNNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
 gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 28/67 (41%), Positives = 38/67 (56%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+   +L   L+DV  GG T FP L   + P+K +A FW+N HA    D R  H  CP
Sbjct: 439 MGNRIATVLFY-LSDVAQGGGTAFPYLKQHLRPKKYAAAFWHNLHAAGRGDARTQHGACP 497

Query: 121 VALGNKW 127
           +  G+KW
Sbjct: 498 IIAGSKW 504



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 33/64 (51%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L+DV  GG T FP L   + P+K +A FWHN HA    D    H  CP+  
Sbjct: 441 NRIATVLFYLSDVAQGGGTAFPYLKQHLRPKKYAAAFWHNLHAAGRGDARTQHGACPIIA 500

Query: 62  GNKW 65
           G+KW
Sbjct: 501 GSKW 504


>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
 gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
 gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
 gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
 gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
 gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
 gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
 gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
 gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
 gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
 gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Acyrthosiphon pisum]
          Length = 552

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DV  GGAT+FP L + + P KG+A  W+N + +   D R  H+ CPV
Sbjct: 459 GNRIATVLFY-MSDVAQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPV 517

Query: 122 ALGNKW 127
             G+KW
Sbjct: 518 LQGSKW 523



 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP L + + P KG+A  W N + +   D    H+ CPV  
Sbjct: 460 NRIATVLFYMSDVAQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQ 519

Query: 62  GNKW 65
           G+KW
Sbjct: 520 GSKW 523


>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
 gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
 gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|25012370|gb|AAN71294.1| RE09701p [Drosophila melanogaster]
          Length = 301

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 28/63 (44%), Positives = 39/63 (61%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   S   L+DVE GG T+F  LN+ + P KG+ V WHN H +  +D    H+GCPV +
Sbjct: 207 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIV 266

Query: 62  GNK 64
           G+K
Sbjct: 267 GSK 269



 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 26/54 (48%), Positives = 37/54 (68%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DVE GG T+F  LN+ + P KG+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 216 LSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSK 269


>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
 gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
 gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
           Ancestor']
 gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
 gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0488]
 gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0442]
 gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0193]
 gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0465]
 gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0389]
 gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0174]
 gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Tsiankovskii-I]
 gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           W]
 gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           anthracis str. CDC 684]
 gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
 gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0248]
 gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. CNEVA-9066]
 gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A1055]
 gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Western North America USA6153]
 gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Kruger B]
 gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Vollum]
 gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Australia 94]
 gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
 gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
 gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
 gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Ames]
 gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. 'Ames Ancestor']
 gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0488]
 gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0193]
 gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0442]
 gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0389]
 gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0465]
 gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0174]
 gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Tsiankovskii-I]
 gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           W]
 gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. CDC 684]
 gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
 gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0248]
 gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
 gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
 gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
          Length = 216

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|72000637|ref|NP_507251.2| Protein PHY-3 [Caenorhabditis elegans]
 gi|30145726|emb|CAB04736.2| Protein PHY-3 [Caenorhabditis elegans]
          Length = 318

 Score = 59.3 bits (142), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 52/119 (43%), Gaps = 6/119 (5%)

Query: 13  DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPV-----ALGNKWGK 67
           ++++      P LNLTV     +  +    H     DY  Y S           GN+ G 
Sbjct: 152 EIKMQAQKRIPGLNLTVAEHFSALSYLPGGHYAVHYDYLDYRSKQDYDWWMNKTGNRIGT 211

Query: 68  LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+   L   E GG T+FPS+  TV    G A FW+NA A+   +    H GCP+  G K
Sbjct: 212 LIFV-LKPAEKGGGTVFPSIGSTVRANAGDAFFWFNAQADEEKEMLSNHGGCPIYEGRK 269



 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 42/94 (44%), Gaps = 10/94 (10%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK-----W 65
           L   E GG T+FPS+  TV    G A FW NA A+   +    H GCP+  G K     W
Sbjct: 216 LKPAEKGGGTVFPSIGSTVRANAGDAFFWFNAQADEEKEMLSNHGGCPIYEGRKVIATIW 275

Query: 66  GKLLLSGLTDVELGGATI-----FPSLNLTVFPE 94
            +     +  +   G++I      PSL+    PE
Sbjct: 276 IRAYNQRILPMAPAGSSIHASWLIPSLSNRFRPE 309


>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
 gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
          Length = 216

 Score = 59.3 bits (142), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
 gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
          Length = 216

 Score = 59.3 bits (142), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Acyrthosiphon pisum]
          Length = 534

 Score = 59.3 bits (142), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DV  GGAT+FP L + + P KG+A  W+N + +   D R  H+ CPV
Sbjct: 441 GNRIATVLFY-MSDVAQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPV 499

Query: 122 ALGNKW 127
             G+KW
Sbjct: 500 LQGSKW 505



 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DV  GGAT+FP L + + P KG+A  W N + +   D    H+ CPV  
Sbjct: 442 NRIATVLFYMSDVAQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQ 501

Query: 62  GNKW 65
           G+KW
Sbjct: 502 GSKW 505


>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
 gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           ATCC 10987]
          Length = 216

 Score = 59.3 bits (142), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|20177113|gb|AAM12259.1| RE23792p [Drosophila melanogaster]
 gi|220948174|gb|ACL86630.1| PH4alphaSG2-PB [synthetic construct]
 gi|220960438|gb|ACL92755.1| PH4alphaSG2-PB [synthetic construct]
          Length = 301

 Score = 59.3 bits (142), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 28/63 (44%), Positives = 39/63 (61%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   S   L+DVE GG T+F  LN+ + P KG+ V WHN H +  +D    H+GCPV +
Sbjct: 207 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIV 266

Query: 62  GNK 64
           G+K
Sbjct: 267 GSK 269



 Score = 58.5 bits (140), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 26/54 (48%), Positives = 37/54 (68%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DVE GG T+F  LN+ + P KG+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 216 LSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSK 269


>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
 gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
          Length = 216

 Score = 59.3 bits (142), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
 gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
          Length = 216

 Score = 59.3 bits (142), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
 gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
          Length = 513

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 35/56 (62%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L+DV  GGAT+FP LN++V P +G A+ WYN +     +    H+ CP+  G+KW 
Sbjct: 440 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWA 495



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 35/56 (62%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L+DV  GGAT+FP LN++V P +G A+ W+N +     +    H+ CP+  G+KW 
Sbjct: 440 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWA 495


>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NVH0597-99]
 gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           AH820]
 gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB102]
 gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
 gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NVH0597-99]
 gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH820]
 gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB102]
 gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
          Length = 216

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
 gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
          Length = 216

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
 gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
          Length = 517

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 35/56 (62%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L+DV  GGAT+FP LN++V P +G A+ WYN +     +    H+ CP+  G+KW 
Sbjct: 444 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWA 499



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 35/56 (62%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L+DV  GGAT+FP LN++V P +G A+ W+N +     +    H+ CP+  G+KW 
Sbjct: 444 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWA 499


>gi|195379216|ref|XP_002048376.1| GJ13933 [Drosophila virilis]
 gi|194155534|gb|EDW70718.1| GJ13933 [Drosophila virilis]
          Length = 521

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 31/74 (41%), Positives = 44/74 (59%), Gaps = 2/74 (2%)

Query: 55  SGCPVA-LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR 113
           SG  VA LG +   L+    +DV  GGAT FP + + V P+KG+++FWYN   +   D R
Sbjct: 426 SGEAVAELGERIATLIFYA-SDVAQGGATNFPDIQVAVQPQKGNSLFWYNMFDDGTPDPR 484

Query: 114 MYHSGCPVALGNKW 127
             HS CP  +G++W
Sbjct: 485 SLHSVCPTIVGSRW 498



 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 22/54 (40%), Positives = 34/54 (62%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +DV  GGAT FP + + V P+KG+++FW+N   +   D    HS CP  +G++W
Sbjct: 445 SDVAQGGATNFPDIQVAVQPQKGNSLFWYNMFDDGTPDPRSLHSVCPTIVGSRW 498


>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
 gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
          Length = 216

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
 gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
          Length = 216

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVIY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 141 NRISTLVIYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 200

Query: 62  GNKW 65
           G KW
Sbjct: 201 GEKW 204


>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
 gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
          Length = 216

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
           anthracis str. CI]
 gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
           anthracis str. CI]
          Length = 216

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 141 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 200

Query: 62  GNKW 65
           G KW
Sbjct: 201 GEKW 204


>gi|194871359|ref|XP_001972833.1| GG13662 [Drosophila erecta]
 gi|190654616|gb|EDV51859.1| GG13662 [Drosophila erecta]
          Length = 515

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 25/67 (37%), Positives = 42/67 (62%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LG++   +++    +V  GG T+FP + + V P+KG A+FW+N   ++  D R  HS CP
Sbjct: 426 LGDRLASIIIYA-GEVSQGGQTVFPDIKVAVEPKKGKALFWFNDFDDSSPDPRSLHSVCP 484

Query: 121 VALGNKW 127
           V +G++W
Sbjct: 485 VIVGSRW 491



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 22/53 (41%), Positives = 33/53 (62%)

Query: 13  DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +V  GG T+FP + + V P+KG A+FW N   ++  D    HS CPV +G++W
Sbjct: 439 EVSQGGQTVFPDIKVAVEPKKGKALFWFNDFDDSSPDPRSLHSVCPVIVGSRW 491


>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB108]
 gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
           F837/76]
 gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB108]
 gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
           F837/76]
          Length = 216

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
 gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
          Length = 216

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 141 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 200

Query: 62  GNKW 65
           G KW
Sbjct: 201 GEKW 204


>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus cereus AH187]
 gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           Q1]
 gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
 gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus cereus NC7401]
 gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
 gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
 gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
 gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH187]
 gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           Q1]
 gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
 gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NC7401]
 gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
 gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
 gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
          Length = 216

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|195159303|ref|XP_002020521.1| GL13468 [Drosophila persimilis]
 gi|194117290|gb|EDW39333.1| GL13468 [Drosophila persimilis]
          Length = 415

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 35/56 (62%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L+DV  GGAT+FP LN++V P +G A+ WYN +     +    H+ CP+  G+KW 
Sbjct: 342 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWA 397



 Score = 58.2 bits (139), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 36/57 (63%)

Query: 10  ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           +L+DV  GGAT+FP LN++V P +G A+ W+N +     +    H+ CP+  G+KW 
Sbjct: 341 QLSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWA 397


>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
 gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
          Length = 232

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216

Query: 62  GNKW 65
           G KW
Sbjct: 217 GEKW 220


>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
           konkukian str. 97-27]
 gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
           konkukian str. 97-27]
          Length = 232

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216

Query: 62  GNKW 65
           G KW
Sbjct: 217 GEKW 220


>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
 gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
          Length = 216

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 141 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 200

Query: 62  GNKW 65
           G KW
Sbjct: 201 GEKW 204


>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
 gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
          Length = 232

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216

Query: 62  GNKW 65
           G KW
Sbjct: 217 GEKW 220


>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
 gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           monterrey BGSC 4AJ1]
 gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Sterne]
 gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           monterrey BGSC 4AJ1]
          Length = 232

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216

Query: 62  GNKW 65
           G KW
Sbjct: 217 GEKW 220


>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           finitimus YBT-020]
 gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           finitimus YBT-020]
          Length = 216

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDR 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.6 bits (135), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 141 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDRSLNELTLHGGAPVTK 200

Query: 62  GNKW 65
           G KW
Sbjct: 201 GEKW 204


>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
 gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
          Length = 232

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216

Query: 62  GNKW 65
           G KW
Sbjct: 217 GEKW 220


>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
 gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
          Length = 232

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216

Query: 62  GNKW 65
           G KW
Sbjct: 217 GEKW 220


>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
 gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
          Length = 232

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216

Query: 62  GNKW 65
           G KW
Sbjct: 217 GEKW 220


>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
           Hakam]
 gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
           Hakam]
          Length = 232

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 145 DYFAEHSRS--AANNRISTLVIY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 157 NRISTLVIYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216

Query: 62  GNKW 65
           G KW
Sbjct: 217 GEKW 220


>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
 gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
          Length = 232

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216

Query: 62  GNKW 65
           G KW
Sbjct: 217 GEKW 220


>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
 gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
          Length = 254

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 167 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 223

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 224 SLNELTLHGGAPVTKGEKW 242



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 188 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 242


>gi|312385117|gb|EFR29691.1| hypothetical protein AND_01144 [Anopheles darlingi]
          Length = 295

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 36/60 (60%)

Query: 68  LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
             +S +   E GG   FP L L+V P+KG+A+FWYN H +   D RM +S CP+   N+W
Sbjct: 200 FFMSDVDGAEGGGRIAFPYLGLSVLPQKGAALFWYNLHDSGRPDERMTYSICPLLADNRW 259



 Score = 52.8 bits (125), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 36/60 (60%), Gaps = 2/60 (3%)

Query: 6   MSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           MS V+    E GG   FP L L+V P+KG+A+FW+N H +   D  M +S CP+   N+W
Sbjct: 202 MSDVD--GAEGGGRIAFPYLGLSVLPQKGAALFWYNLHDSGRPDERMTYSICPLLADNRW 259


>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
 gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
          Length = 248

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|198466393|ref|XP_001353986.2| GA18007 [Drosophila pseudoobscura pseudoobscura]
 gi|198150579|gb|EAL29722.2| GA18007 [Drosophila pseudoobscura pseudoobscura]
          Length = 455

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 24/59 (40%), Positives = 39/59 (66%), Gaps = 6/59 (10%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
           +++V+ GG T+FP L+L V P+KGS + W N       D+ + H+ CP+ +GN WG+L+
Sbjct: 387 MSNVQQGGETVFPFLSLRVKPQKGSLLLWRNT------DWSVLHNSCPLIIGNMWGELI 439



 Score = 58.9 bits (141), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 25/66 (37%), Positives = 42/66 (63%), Gaps = 6/66 (9%)

Query: 4   AEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGN 63
           A  + + +++V+ GG T+FP L+L V P+KGS + W N       D+ + H+ CP+ +GN
Sbjct: 380 AATALIFMSNVQQGGETVFPFLSLRVKPQKGSLLLWRNT------DWSVLHNSCPLIIGN 433

Query: 64  KWGKLL 69
            WG+L+
Sbjct: 434 MWGELI 439


>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
           Anthracis
 gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
           Anthracis
          Length = 216

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 41/79 (51%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    +  N     L+  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSR---SAANNRISTLVXYLNDVEEGGETFFPKLNLSVHPRKGXAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 57.0 bits (136), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGXAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|195341061|ref|XP_002037130.1| GM12749 [Drosophila sechellia]
 gi|194131246|gb|EDW53289.1| GM12749 [Drosophila sechellia]
          Length = 467

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 29/79 (36%), Positives = 38/79 (48%), Gaps = 3/79 (3%)

Query: 10  ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKL- 68
           +L+DV++GG   FP L     P +GSA+ WHN   +   D     + CPV LGN+WG   
Sbjct: 382 KLSDVQMGGYASFPDLGFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWGSRE 441

Query: 69  --LLSGLTDVELGGATIFP 85
                G      GG   FP
Sbjct: 442 WEFGKGFEGFRAGGNNAFP 460



 Score = 57.8 bits (138), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 33/56 (58%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L+DV++GG   FP L     P +GSA+ W+N   +   D R   + CPV LGN+WG
Sbjct: 383 LSDVQMGGYASFPDLGFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWG 438


>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH1134]
 gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
 gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
 gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
 gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH1134]
 gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
 gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
 gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
          Length = 216

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus thuringiensis HD-771]
 gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
 gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-771]
 gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
          Length = 216

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.2 bits (134), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
 gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
          Length = 216

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.2 bits (134), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
 gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
          Length = 248

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
 gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
          Length = 216

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.2 bits (134), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
 gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
          Length = 248

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
 gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
          Length = 216

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.2 bits (134), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
 gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
          Length = 248

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|194871352|ref|XP_001972832.1| GG13663 [Drosophila erecta]
 gi|190654615|gb|EDV51858.1| GG13663 [Drosophila erecta]
          Length = 420

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 28/59 (47%), Positives = 37/59 (62%), Gaps = 5/59 (8%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           LL  L +V+ GGAT+FP+L + VFP++GS +FW+       LD R     CPV  GNKW
Sbjct: 365 LLFFLDNVKQGGATVFPNLKIAVFPQRGSCLFWHKT-----LDTRNEPLECPVLQGNKW 418



 Score = 57.0 bits (136), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 5/55 (9%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +V+ GGAT+FP+L + VFP++GS +FWH       LD       CPV  GNKW
Sbjct: 369 LDNVKQGGATVFPNLKIAVFPQRGSCLFWHKT-----LDTRNEPLECPVLQGNKW 418


>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           berliner ATCC 10792]
 gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
           thuringiensis serovar thuringiensis str. IS5056]
 gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           berliner ATCC 10792]
 gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
           thuringiensis serovar thuringiensis str. IS5056]
          Length = 216

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 56.2 bits (134), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
 gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
          Length = 248

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
 gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
          Length = 248

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
 gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
          Length = 248

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
 gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
          Length = 248

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
 gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
          Length = 248

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
 gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
 gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
 gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
          Length = 248

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
 gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
          Length = 248

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|341896599|gb|EGT52534.1| hypothetical protein CAEBREN_17302 [Caenorhabditis brenneri]
          Length = 314

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+    ++S +   ++GGAT+FPS+  TV P  G A  W+N   N  +D   +H GCP
Sbjct: 218 MGNRMTTFIIS-VEKADVGGATVFPSIKATVRPNPGDAFMWFNLLENQEIDNSAFHGGCP 276

Query: 121 VALGNK 126
           V  G K
Sbjct: 277 VLAGRK 282



 Score = 55.1 bits (131), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/57 (49%), Positives = 32/57 (56%), Gaps = 2/57 (3%)

Query: 8   SVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           SVE  DV  GGAT+FPS+  TV P  G A  W N   N  +D   +H GCPV  G K
Sbjct: 228 SVEKADV--GGATVFPSIKATVRPNPGDAFMWFNLLENQEIDNSAFHGGCPVLAGRK 282


>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
 gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
          Length = 232

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 201

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220



 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 166 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
 gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
          Length = 578

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 35/56 (62%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L+DV  GGAT+FP LN++V P +G A+ WYN +     +    H+ CP+  G+KW 
Sbjct: 505 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWA 560



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 35/56 (62%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L+DV  GGAT+FP LN++V P +G A+ W+N +     +    H+ CP+  G+KW 
Sbjct: 505 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWA 560


>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
 gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
 gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
 gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
          Length = 232

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 201

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220



 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 166 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
 gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
          Length = 232

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 201

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220



 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 166 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|198428011|ref|XP_002120302.1| PREDICTED: similar to prolyl 4-hydroxylase alpha-2 subunit, partial
           [Ciona intestinalis]
          Length = 233

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 34/60 (56%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
            L+  +DVE GG T F    +   P KGSA+FWY+   +   D R  H+ CPV LGNKW 
Sbjct: 145 FLTYFSDVEYGGNTPFVYQEVVAEPIKGSAIFWYDVFNDGSADERTEHAACPVVLGNKWA 204



 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 31/56 (55%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
            +DVE GG T F    +   P KGSA+FW++   +   D    H+ CPV LGNKW 
Sbjct: 149 FSDVEYGGNTPFVYQEVVAEPIKGSAIFWYDVFNDGSADERTEHAACPVVLGNKWA 204


>gi|195452772|ref|XP_002073493.1| GK14149 [Drosophila willistoni]
 gi|194169578|gb|EDW84479.1| GK14149 [Drosophila willistoni]
          Length = 496

 Score = 58.5 bits (140), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 29/57 (50%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 128
           L  V+ GGAT+FPSLNL V  EKG  +FWYN    +   D    H GCPV  G K G
Sbjct: 416 LNTVKHGGATVFPSLNLKVPAEKGKVLFWYNLDGESFDFDENTEHGGCPVVDGIKLG 472



 Score = 55.5 bits (132), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 28/57 (49%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKWG 66
           L  V+ GGAT+FPSLNL V  EKG  +FW+N    +   D    H GCPV  G K G
Sbjct: 416 LNTVKHGGATVFPSLNLKVPAEKGKVLFWYNLDGESFDFDENTEHGGCPVVDGIKLG 472


>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
 gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
          Length = 216

 Score = 58.5 bits (140), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 141 NRISTLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 200

Query: 62  GNKW 65
           G KW
Sbjct: 201 GEKW 204


>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
 gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
          Length = 216

 Score = 58.5 bits (140), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  
Sbjct: 141 NRISTLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 200

Query: 62  GNKW 65
           G KW
Sbjct: 201 GEKW 204


>gi|195494568|ref|XP_002094893.1| GE19959 [Drosophila yakuba]
 gi|194180994|gb|EDW94605.1| GE19959 [Drosophila yakuba]
          Length = 486

 Score = 58.5 bits (140), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 23/53 (43%), Positives = 36/53 (67%)

Query: 75  DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +V  GG T+FP L + V P++G+A+FW+N   ++  D R  HS CPV +G++W
Sbjct: 391 EVSQGGQTVFPDLKVVVEPKRGNALFWFNKLDDSSPDPRSLHSVCPVIVGSRW 443



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 22/53 (41%), Positives = 34/53 (64%)

Query: 13  DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +V  GG T+FP L + V P++G+A+FW N   ++  D    HS CPV +G++W
Sbjct: 391 EVSQGGQTVFPDLKVVVEPKRGNALFWFNKLDDSSPDPRSLHSVCPVIVGSRW 443


>gi|195166671|ref|XP_002024158.1| GL22696 [Drosophila persimilis]
 gi|194107513|gb|EDW29556.1| GL22696 [Drosophila persimilis]
          Length = 491

 Score = 58.5 bits (140), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 24/59 (40%), Positives = 39/59 (66%), Gaps = 6/59 (10%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
           +++V+ GG T+FP L+L V P+KGS + W N       D+ + H+ CP+ +GN WG+L+
Sbjct: 423 MSNVQQGGETVFPFLSLRVKPQKGSLLLWRNT------DWSVLHNSCPLIIGNMWGELI 475



 Score = 58.5 bits (140), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 25/66 (37%), Positives = 42/66 (63%), Gaps = 6/66 (9%)

Query: 4   AEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGN 63
           A  + + +++V+ GG T+FP L+L V P+KGS + W N       D+ + H+ CP+ +GN
Sbjct: 416 AATALIFMSNVQQGGETVFPFLSLRVKPQKGSLLLWRNT------DWSVLHNSCPLIIGN 469

Query: 64  KWGKLL 69
            WG+L+
Sbjct: 470 MWGELI 475


>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
 gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
          Length = 229

 Score = 58.5 bits (140), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 142 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVNPRKGMAVYFEYFYQDQ 198

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 199 SLNELTLHGGAPVTKGEKW 217



 Score = 55.8 bits (133), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 163 LNDVEEGGETFFPKLNLSVNPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 217


>gi|194871344|ref|XP_001972830.1| GG13666 [Drosophila erecta]
 gi|190654613|gb|EDV51856.1| GG13666 [Drosophila erecta]
          Length = 539

 Score = 58.5 bits (140), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 24/54 (44%), Positives = 35/54 (64%)

Query: 74  TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +DV LGG T+FP L + V P+KG+A+ W+N +     D    HS CP+ LG++W
Sbjct: 467 SDVPLGGTTVFPKLKIAVQPKKGNALVWFNLNHAGEPDPLTEHSVCPIVLGSRW 520



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   + +  +DV LGG T+FP L + V P+KG+A+ W N +     D    HS CP+ L
Sbjct: 457 DRIATAVLYASDVPLGGTTVFPKLKIAVQPKKGNALVWFNLNHAGEPDPLTEHSVCPIVL 516

Query: 62  GNKW 65
           G++W
Sbjct: 517 GSRW 520


>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
 gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
          Length = 311

 Score = 58.5 bits (140), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G + G +++  L DV  GG T FP +  +  P KGSAV++   +A+  LDYR  H+G PV
Sbjct: 234 GQRIGTVVMY-LNDVPEGGETAFPDIGFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPV 292

Query: 122 ALGNKW 127
             G+KW
Sbjct: 293 IRGDKW 298



 Score = 55.8 bits (133), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DV  GG T FP +  +  P KGSAV++   +A+  LDY   H+G PV  G+KW
Sbjct: 244 LNDVPEGGETAFPDIGFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPVIRGDKW 298


>gi|405964866|gb|EKC30308.1| KRR1 small subunit processome component-like protein [Crassostrea
           gigas]
          Length = 885

 Score = 58.5 bits (140), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 32/55 (58%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           + D + GGAT+FP +   +   KG A FW+N   +   D R  H GCPV +G+KW
Sbjct: 801 MNDAKAGGATVFPEVRTRIPVAKGGAAFWFNLRPSGATDPRTLHGGCPVLVGSKW 855



 Score = 53.1 bits (126), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 22/55 (40%), Positives = 30/55 (54%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + D + GGAT+FP +   +   KG A FW N   +   D    H GCPV +G+KW
Sbjct: 801 MNDAKAGGATVFPEVRTRIPVAKGGAAFWFNLRPSGATDPRTLHGGCPVLVGSKW 855


>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
           jacchus]
          Length = 544

 Score = 58.5 bits (140), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   NL+V   K +A+FW+N H +   D    H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPV 515

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 516 LVGNKW 521



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   NL+V   K +A+FW N H +   D    H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANLSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLV 517

Query: 62  GNKW 65
           GNKW
Sbjct: 518 GNKW 521


>gi|195391756|ref|XP_002054526.1| GJ24503 [Drosophila virilis]
 gi|194152612|gb|EDW68046.1| GJ24503 [Drosophila virilis]
          Length = 519

 Score = 58.5 bits (140), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 28/60 (46%), Positives = 36/60 (60%), Gaps = 5/60 (8%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA-----LGNKW 127
           L+DV  GG T+FP LN+ + P KG+ V WYN H +   D R  H+GCPV      +GN W
Sbjct: 442 LSDVLNGGYTVFPKLNVFLKPVKGAMVSWYNLHRSLNKDSRTLHAGCPVIEGVKRIGNIW 501



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 5/69 (7%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA- 60
           NR   +   L+DV  GG T+FP LN+ + P KG+ V W+N H +   D    H+GCPV  
Sbjct: 433 NRIATNMFYLSDVLNGGYTVFPKLNVFLKPVKGAMVSWYNLHRSLNKDSRTLHAGCPVIE 492

Query: 61  ----LGNKW 65
               +GN W
Sbjct: 493 GVKRIGNIW 501


>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           B4264]
 gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           B4264]
          Length = 216

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            ++    H G PV  G KW
Sbjct: 186 SINELTLHGGAPVTKGEKW 204



 Score = 55.5 bits (132), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  ++    H G PV  G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSINELTLHGGAPVTKGEKW 204


>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
 gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
          Length = 272

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 27/54 (50%), Positives = 34/54 (62%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DVE GG T+FP L L V+P+KGSA+++ N   N   D R  H G PV  G K
Sbjct: 206 LSDVEQGGGTVFPGLGLEVYPKKGSALWFLNTDINHQPDKRTLHGGAPVVRGTK 259



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 26/54 (48%), Positives = 33/54 (61%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           L+DVE GG T+FP L L V+P+KGSA+++ N   N   D    H G PV  G K
Sbjct: 206 LSDVEQGGGTVFPGLGLEVYPKKGSALWFLNTDINHQPDKRTLHGGAPVVRGTK 259


>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3, partial [Saimiri boliviensis boliviensis]
          Length = 534

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   NL+V   K +A+FW+N H +   D    H+GCPV
Sbjct: 447 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPV 505

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 506 LVGNKW 511



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   NL+V   K +A+FW N H +   D    H+GCPV +
Sbjct: 448 NRVATFMIYLSSVEAGGATAFIYANLSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLV 507

Query: 62  GNKW 65
           GNKW
Sbjct: 508 GNKW 511


>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Alteromonas sp. S89]
          Length = 294

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/66 (43%), Positives = 41/66 (62%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G + G L++  L+DVE GGAT+FP + L V P+KG+A+F+     +  LD +  H G PV
Sbjct: 216 GQRVGTLIMY-LSDVESGGATVFPRVGLEVQPQKGAALFFSYVGEHGKLDLQSLHGGSPV 274

Query: 122 ALGNKW 127
             G KW
Sbjct: 275 LAGEKW 280



 Score = 55.5 bits (132), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 34/55 (61%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DVE GGAT+FP + L V P+KG+A+F+     +  LD    H G PV  G KW
Sbjct: 226 LSDVESGGATVFPRVGLEVQPQKGAALFFSYVGEHGKLDLQSLHGGSPVLAGEKW 280


>gi|195110921|ref|XP_002000028.1| GI24861 [Drosophila mojavensis]
 gi|193916622|gb|EDW15489.1| GI24861 [Drosophila mojavensis]
          Length = 508

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 27/54 (50%), Positives = 35/54 (64%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DV  GG T+FP LN+ + P KGS V W N H +   D R+ H+GCPV  G+K
Sbjct: 431 LSDVLQGGYTVFPFLNVFLRPVKGSLVIWPNVHRSLAPDSRVLHAGCPVLEGSK 484



 Score = 57.0 bits (136), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 28/63 (44%), Positives = 37/63 (58%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   +   L+DV  GG T+FP LN+ + P KGS V W N H +   D  + H+GCPV  
Sbjct: 422 NRITTNMFYLSDVLQGGYTVFPFLNVFLRPVKGSLVIWPNVHRSLAPDSRVLHAGCPVLE 481

Query: 62  GNK 64
           G+K
Sbjct: 482 GSK 484


>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
 gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
          Length = 248

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217

Query: 109 LLDYRMYHSGCPVALGNKW 127
            ++    H G PV  G KW
Sbjct: 218 SINELTLHGGAPVTKGEKW 236



 Score = 55.5 bits (132), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  ++    H G PV  G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSINELTLHGGAPVTKGEKW 236


>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
 gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
          Length = 212

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 34/56 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L DVE GG T FP LN +V P+KG AV++   + N  L+    H G PV +G+KW 
Sbjct: 146 LNDVEEGGETYFPKLNFSVSPQKGMAVYFEYFYDNQELNDLTLHGGAPVIIGDKWA 201



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 34/56 (60%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L DVE GG T FP LN +V P+KG AV++   + N  L+    H G PV +G+KW 
Sbjct: 146 LNDVEEGGETYFPKLNFSVSPQKGMAVYFEYFYDNQELNDLTLHGGAPVIIGDKWA 201


>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
 gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
          Length = 498

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 27/56 (48%), Positives = 33/56 (58%), Gaps = 4/56 (7%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM--YHSGCPVALGNK 126
           L DV  GG TIFP +   V PEKG  + WYN ++N  LDY +   H  CPV +G K
Sbjct: 410 LNDVTRGGGTIFPQVKQAVHPEKGKLILWYNMNSN--LDYELASLHGACPVLIGRK 463



 Score = 55.1 bits (131), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 26/58 (44%), Positives = 34/58 (58%), Gaps = 4/58 (6%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWM--YHSGCPVALGNK 64
           + L DV  GG TIFP +   V PEKG  + W+N ++N  LDY +   H  CPV +G K
Sbjct: 408 IYLNDVTRGGGTIFPQVKQAVHPEKGKLILWYNMNSN--LDYELASLHGACPVLIGRK 463


>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
          Length = 523

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DVE GG+T FP   + V P KG A FW+N   +   D    H  CPV LG+KW
Sbjct: 449 LNDVERGGSTAFPRAGVAVKPVKGGAAFWFNLKRSGKPDPLTLHGACPVLLGHKW 503



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 30/55 (54%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG+T FP   + V P KG A FW N   +   D    H  CPV LG+KW
Sbjct: 449 LNDVERGGSTAFPRAGVAVKPVKGGAAFWFNLKRSGKPDPLTLHGACPVLLGHKW 503


>gi|195494570|ref|XP_002094894.1| GE19958 [Drosophila yakuba]
 gi|194180995|gb|EDW94606.1| GE19958 [Drosophila yakuba]
          Length = 498

 Score = 57.8 bits (138), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 28/61 (45%), Positives = 38/61 (62%), Gaps = 12/61 (19%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM------YHSGCPVALGNK 126
           + DV  GGA +F  LNLT++P+KGSA+ W N      LD+RM       H+ CPV +G+K
Sbjct: 425 MNDVAQGGAFVFLHLNLTIWPQKGSALVWRN------LDHRMQPNEDLLHASCPVIVGSK 478

Query: 127 W 127
           W
Sbjct: 479 W 479



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + DV  GGA +F  LNLT++P+KGSA+ W N       +  + H+ CPV +G+KW
Sbjct: 425 MNDVAQGGAFVFLHLNLTIWPQKGSALVWRNLDHRMQPNEDLLHASCPVIVGSKW 479


>gi|194905305|ref|XP_001981170.1| GG11767 [Drosophila erecta]
 gi|190655808|gb|EDV53040.1| GG11767 [Drosophila erecta]
          Length = 536

 Score = 57.8 bits (138), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+   +    A+G++   +L   L +V+ GGAT FP++NL V  +KGSA+FW+N    +
Sbjct: 424 DYFDSKTSYLEAVGDRIATVLFY-LNNVDHGGATAFPNINLAVPTQKGSALFWHNLDGKS 482

Query: 109 L-LDYRMYHSGCPVALGNK 126
              D R +H  CP+  G K
Sbjct: 483 YDYDTRTFHGACPLISGTK 501



 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 34/55 (61%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L +V+ GGAT FP++NL V  +KGSA+FWHN    +   D   +H  CP+  G K
Sbjct: 447 LNNVDHGGATAFPNINLAVPTQKGSALFWHNLDGKSYDYDTRTFHGACPLISGTK 501


>gi|194871364|ref|XP_001972834.1| GG13661 [Drosophila erecta]
 gi|190654617|gb|EDV51860.1| GG13661 [Drosophila erecta]
          Length = 506

 Score = 57.8 bits (138), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 26/56 (46%), Positives = 37/56 (66%), Gaps = 1/56 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNA-HANTLLDYWMYHSGCPVALGNKW 65
           + DV LGGA +F + NLT+FP+KGSA+ W N  H+    +  + H  CPV +G+KW
Sbjct: 425 MNDVALGGAFVFLNANLTIFPQKGSALVWRNLDHSLQPKEDLLQHLSCPVIVGSKW 480



 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 43/68 (63%), Gaps = 2/68 (2%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA-HANTLLDYRMYHSGC 119
           LG++   +L   + DV LGGA +F + NLT+FP+KGSA+ W N  H+    +  + H  C
Sbjct: 414 LGDRLTSILFF-MNDVALGGAFVFLNANLTIFPQKGSALVWRNLDHSLQPKEDLLQHLSC 472

Query: 120 PVALGNKW 127
           PV +G+KW
Sbjct: 473 PVIVGSKW 480


>gi|21358233|ref|NP_651814.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
 gi|20269810|gb|AAM18060.1|AF495538_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE3
           [Drosophila melanogaster]
 gi|15291443|gb|AAK92990.1| GH21465p [Drosophila melanogaster]
 gi|23172714|gb|AAN14251.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
 gi|220945610|gb|ACL85348.1| PH4alphaNE3-PA [synthetic construct]
 gi|220955396|gb|ACL90241.1| PH4alphaNE3-PA [synthetic construct]
          Length = 481

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 25/56 (44%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKW 127
           L D   GGAT+FP LN+ V  E+G  +FWYN + +T  ++    H+ CPV  G+KW
Sbjct: 416 LNDAPYGGATVFPRLNVKVPAERGKVLFWYNLNGDTHDMEPTTMHAACPVFHGSKW 471



 Score = 55.1 bits (131), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKW 65
           L D   GGAT+FP LN+ V  E+G  +FW+N + +T  ++    H+ CPV  G+KW
Sbjct: 416 LNDAPYGGATVFPRLNVKVPAERGKVLFWYNLNGDTHDMEPTTMHAACPVFHGSKW 471


>gi|195061021|ref|XP_001995909.1| GH14207 [Drosophila grimshawi]
 gi|193891701|gb|EDV90567.1| GH14207 [Drosophila grimshawi]
          Length = 477

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 28/57 (49%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 128
           L DV  GGATIFP L L V PE+G  + W+N    T  L+ R  H  CPV +G K G
Sbjct: 405 LNDVARGGATIFPDLELLVQPERGKVLHWHNMDLGTYDLEKRSLHGACPVIMGKKEG 461



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 28/57 (49%), Positives = 32/57 (56%), Gaps = 1/57 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKWG 66
           L DV  GGATIFP L L V PE+G  + WHN    T  L+    H  CPV +G K G
Sbjct: 405 LNDVARGGATIFPDLELLVQPERGKVLHWHNMDLGTYDLEKRSLHGACPVIMGKKEG 461


>gi|281362877|ref|NP_733393.3| CG31016, isoform B [Drosophila melanogaster]
 gi|442621939|ref|NP_001263119.1| CG31016, isoform C [Drosophila melanogaster]
 gi|272477249|gb|AAF57071.5| CG31016, isoform B [Drosophila melanogaster]
 gi|440218076|gb|AGB96498.1| CG31016, isoform C [Drosophila melanogaster]
          Length = 536

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
           L +V+ GGAT+FP LNL V  +KGSA+FW+N    +   D R +H  CP+  G K
Sbjct: 447 LNNVDHGGATVFPKLNLAVPTQKGSALFWHNIDRKSYDYDTRTFHGACPLISGTK 501



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L +V+ GGAT+FP LNL V  +KGSA+FWHN    +   D   +H  CP+  G K
Sbjct: 447 LNNVDHGGATVFPKLNLAVPTQKGSALFWHNIDRKSYDYDTRTFHGACPLISGTK 501


>gi|159884097|gb|ABX00727.1| IP12176p [Drosophila melanogaster]
          Length = 538

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
           L +V+ GGAT+FP LNL V  +KGSA+FW+N    +   D R +H  CP+  G K
Sbjct: 449 LNNVDHGGATVFPKLNLAVPTQKGSALFWHNIDRKSYDYDTRTFHGACPLISGTK 503



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L +V+ GGAT+FP LNL V  +KGSA+FWHN    +   D   +H  CP+  G K
Sbjct: 449 LNNVDHGGATVFPKLNLAVPTQKGSALFWHNIDRKSYDYDTRTFHGACPLISGTK 503


>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
 gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
          Length = 216

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 42/79 (53%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   +   
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQGQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G PV  G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   +    L+    H G PV  G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQGQSLNELTLHGGAPVTKGEKW 204


>gi|195494561|ref|XP_002094890.1| GE19962 [Drosophila yakuba]
 gi|194180991|gb|EDW94602.1| GE19962 [Drosophila yakuba]
          Length = 539

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 25/54 (46%), Positives = 35/54 (64%)

Query: 74  TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +DV LGG T+F  L L V P+KG+A+ W+N + +   D    HS CPV LG++W
Sbjct: 467 SDVPLGGTTVFSKLKLAVQPKKGNALVWFNLNHDGKPDPLTEHSVCPVVLGSRW 520



 Score = 56.2 bits (134), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 25/54 (46%), Positives = 34/54 (62%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +DV LGG T+F  L L V P+KG+A+ W N + +   D    HS CPV LG++W
Sbjct: 467 SDVPLGGTTVFSKLKLAVQPKKGNALVWFNLNHDGKPDPLTEHSVCPVVLGSRW 520


>gi|195113263|ref|XP_002001187.1| GI10646 [Drosophila mojavensis]
 gi|193917781|gb|EDW16648.1| GI10646 [Drosophila mojavensis]
          Length = 471

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 27/55 (49%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVEL GATIFP LNLT+ PEKG+A+ WHN  +         ++ CPV   +K+
Sbjct: 393 LEDVELVGATIFPRLNLTIKPEKGTALLWHNLESCGSSHPKALYAACPVISSSKY 447



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DVEL GATIFP LNLT+ PEKG+A+ W+N  +      +  ++ CPV   +K+
Sbjct: 393 LEDVELVGATIFPRLNLTIKPEKGTALLWHNLESCGSSHPKALYAACPVISSSKY 447


>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
 gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
          Length = 209

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/64 (45%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FPSL+ +V P+KGSAV++   + +T L+    H+G PV  
Sbjct: 134 NRISTLVMYLNDVEEGGETYFPSLHFSVTPKKGSAVYFEYFYNDTRLNELTLHAGHPVEA 193

Query: 62  GNKW 65
           G KW
Sbjct: 194 GEKW 197



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/68 (42%), Positives = 43/68 (63%), Gaps = 1/68 (1%)

Query: 60  ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
           ++ N+   L++  L DVE GG T FPSL+ +V P+KGSAV++   + +T L+    H+G 
Sbjct: 131 SMNNRISTLVMY-LNDVEEGGETYFPSLHFSVTPKKGSAVYFEYFYNDTRLNELTLHAGH 189

Query: 120 PVALGNKW 127
           PV  G KW
Sbjct: 190 PVEAGEKW 197


>gi|390178051|ref|XP_002137433.2| GA30144 [Drosophila pseudoobscura pseudoobscura]
 gi|388859305|gb|EDY67991.2| GA30144 [Drosophila pseudoobscura pseudoobscura]
          Length = 546

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 24/48 (50%), Positives = 32/48 (66%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           L+DV LGG T FP+L L + P +GSA+ W+NA+     DYR  H+ CP
Sbjct: 423 LSDVRLGGFTSFPALGLGIRPNRGSALVWHNANNAGNCDYRALHAACP 470



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 24/48 (50%), Positives = 31/48 (64%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP 58
           L+DV LGG T FP+L L + P +GSA+ WHNA+     DY   H+ CP
Sbjct: 423 LSDVRLGGFTSFPALGLGIRPNRGSALVWHNANNAGNCDYRALHAACP 470


>gi|198449508|ref|XP_002136911.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
 gi|198130638|gb|EDY67469.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
          Length = 516

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 33/90 (36%), Positives = 44/90 (48%), Gaps = 2/90 (2%)

Query: 38  FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
           + +  H +   DY+   S    A G++   +L   L DV+ GG+T F  L L V  E+G 
Sbjct: 402 YGYGGHYDIHFDYFNTSSPITKARGDRMATVLFY-LNDVKHGGSTAFTDLQLKVPSERGK 460

Query: 98  AVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
            +FWYN    T  LD R  H  CPV  G K
Sbjct: 461 VLFWYNMRGETHDLDSRTLHGACPVIDGTK 490



 Score = 46.6 bits (109), Expect = 0.003,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L DV+ GG+T F  L L V  E+G  +FW+N    T  LD    H  CPV  G K
Sbjct: 436 LNDVKHGGSTAFTDLQLKVPSERGKVLFWYNMRGETHDLDSRTLHGACPVIDGTK 490


>gi|195159150|ref|XP_002020445.1| GL13509 [Drosophila persimilis]
 gi|194117214|gb|EDW39257.1| GL13509 [Drosophila persimilis]
          Length = 554

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 33/90 (36%), Positives = 44/90 (48%), Gaps = 2/90 (2%)

Query: 38  FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
           + +  H +   DY+   S    A G++   +L   L DV+ GG+T F  L L V  E+G 
Sbjct: 440 YGYGGHYDIHFDYFNTSSPITKARGDRMATVLFY-LNDVKHGGSTAFTDLQLKVPSERGK 498

Query: 98  AVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
            +FWYN    T  LD R  H  CPV  G K
Sbjct: 499 VLFWYNMRGETHDLDSRTLHGACPVIDGTK 528



 Score = 46.6 bits (109), Expect = 0.003,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L DV+ GG+T F  L L V  E+G  +FW+N    T  LD    H  CPV  G K
Sbjct: 474 LNDVKHGGSTAFTDLQLKVPSERGKVLFWYNMRGETHDLDSRTLHGACPVIDGTK 528


>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 215

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 50  YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
           Y  +    P A  N+   L++  L DVE GG T+FP L+L V PE+GSA+++   +    
Sbjct: 120 YDFFGPNSPSASNNRISTLIIY-LNDVEAGGETVFPLLDLEVKPERGSALYFEYFYRQQE 178

Query: 110 LDYRMYHSGCPVALGNKW 127
           L+    HS  PV  G KW
Sbjct: 179 LNNLTLHSSVPVVRGEKW 196



 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T+FP L+L V PE+GSA+++   +    L+    HS  PV  
Sbjct: 133 NRISTLIIYLNDVEAGGETVFPLLDLEVKPERGSALYFEYFYRQQELNNLTLHSSVPVVR 192

Query: 62  GNKW 65
           G KW
Sbjct: 193 GEKW 196


>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
 gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
          Length = 217

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 27/56 (48%), Positives = 34/56 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L DVE GG T FP LN +V P+KGSAV++   + N  L+    H G PV  G+KW 
Sbjct: 152 LNDVEEGGETYFPKLNFSVNPQKGSAVYFEYFYDNQDLNDLTLHGGAPVIKGSKWA 207



 Score = 55.8 bits (133), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 27/56 (48%), Positives = 34/56 (60%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L DVE GG T FP LN +V P+KGSAV++   + N  L+    H G PV  G+KW 
Sbjct: 152 LNDVEEGGETYFPKLNFSVNPQKGSAVYFEYFYDNQDLNDLTLHGGAPVIKGSKWA 207


>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 296

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 42/80 (52%), Gaps = 1/80 (1%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 107
            DY+      P   G + G  L+  L DV  GG T+FP   +++ P+KGSAV++   ++ 
Sbjct: 204 FDYFPQSKVVPEKGGQRVGTFLIY-LNDVPAGGETVFPKAGVSIVPKKGSAVYFQYGNSK 262

Query: 108 TLLDYRMYHSGCPVALGNKW 127
             +D    HS  PV+ G KW
Sbjct: 263 GEVDRMSLHSSIPVSEGEKW 282



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DV  GG T+FP   +++ P+KGSAV++   ++   +D    HS  PV+ G KW
Sbjct: 228 LNDVPAGGETVFPKAGVSIVPKKGSAVYFQYGNSKGEVDRMSLHSSIPVSEGEKW 282


>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
 gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
          Length = 229

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 42/79 (53%)

Query: 50  YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
           ++ Y S     + N     L+  L DVE GG T FP LN  V P+KG AV++   + +T 
Sbjct: 133 HYDYFSSGNSKVNNPRISTLVMYLNDVEAGGETYFPKLNFYVAPKKGMAVYFEYFYNDTT 192

Query: 110 LDYRMYHSGCPVALGNKWG 128
           L+    H G PV +G+KW 
Sbjct: 193 LNELTLHGGAPVVIGDKWA 211



 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 34/56 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L DVE GG T FP LN  V P+KG AV++   + +T L+    H G PV +G+KW 
Sbjct: 156 LNDVEAGGETYFPKLNFYVAPKKGMAVYFEYFYNDTTLNELTLHGGAPVVIGDKWA 211


>gi|15808767|gb|AAL08490.1|AF369789_1 prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
           volvulus]
          Length = 571

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 41/67 (61%), Gaps = 2/67 (2%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPS-LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           GN+    L+  LT+ E+GG T+F S L ++V   K +A+FWYN   N  +D R  H+ CP
Sbjct: 449 GNRIATFLIY-LTEPEIGGRTVFTSNLKISVPCVKNAALFWYNLMRNGEVDTRSLHAACP 507

Query: 121 VALGNKW 127
           VA G KW
Sbjct: 508 VATGIKW 514



 Score = 52.0 bits (123), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 27/65 (41%), Positives = 38/65 (58%), Gaps = 1/65 (1%)

Query: 2   NRAEMSSVELTDVELGGATIFPS-LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           NR     + LT+ E+GG T+F S L ++V   K +A+FW+N   N  +D    H+ CPVA
Sbjct: 450 NRIATFLIYLTEPEIGGRTVFTSNLKISVPCVKNAALFWYNLMRNGEVDTRSLHAACPVA 509

Query: 61  LGNKW 65
            G KW
Sbjct: 510 TGIKW 514


>gi|15808763|gb|AAL08488.1| prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
           volvulus]
          Length = 571

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 41/67 (61%), Gaps = 2/67 (2%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPS-LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           GN+    L+  LT+ E+GG T+F S L ++V   K +A+FWYN   N  +D R  H+ CP
Sbjct: 449 GNRIATFLIY-LTEPEIGGRTVFTSNLKISVPCVKNAALFWYNLMRNGEVDTRSLHAACP 507

Query: 121 VALGNKW 127
           VA G KW
Sbjct: 508 VATGIKW 514



 Score = 52.0 bits (123), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 27/65 (41%), Positives = 38/65 (58%), Gaps = 1/65 (1%)

Query: 2   NRAEMSSVELTDVELGGATIFPS-LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           NR     + LT+ E+GG T+F S L ++V   K +A+FW+N   N  +D    H+ CPVA
Sbjct: 450 NRIATFLIYLTEPEIGGRTVFTSNLKISVPCVKNAALFWYNLMRNGEVDTRSLHAACPVA 509

Query: 61  LGNKW 65
            G KW
Sbjct: 510 TGIKW 514


>gi|328713119|ref|XP_003244997.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
           pisum]
          Length = 487

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 33/74 (44%), Positives = 42/74 (56%), Gaps = 4/74 (5%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAH-ANTLLDYWMYHSGCPVALGNKWGK-- 67
           LTDV+  G T FP LN+    EKGSA+ W+N H ++  L Y   H  CP+  GNKW K  
Sbjct: 411 LTDVQNDGYTSFPMLNIIAPAEKGSALVWNNLHMSDGQLCYESLHGACPLLKGNKWSKYT 470

Query: 68  -LLLSGLTDVELGG 80
            LL + L +   GG
Sbjct: 471 ALLRAVLKNKSWGG 484



 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 31/70 (44%), Positives = 39/70 (55%), Gaps = 2/70 (2%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH-ANTLLDYRMYHSGC 119
            GN+   +L   LTDV+  G T FP LN+    EKGSA+ W N H ++  L Y   H  C
Sbjct: 400 FGNRLVTVLFY-LTDVQNDGYTSFPMLNIIAPAEKGSALVWNNLHMSDGQLCYESLHGAC 458

Query: 120 PVALGNKWGK 129
           P+  GNKW K
Sbjct: 459 PLLKGNKWSK 468


>gi|194906709|ref|XP_001981416.1| GG11627 [Drosophila erecta]
 gi|190656054|gb|EDV53286.1| GG11627 [Drosophila erecta]
          Length = 462

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 26/59 (44%), Positives = 33/59 (55%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
           L DV++GG   FP L     P +GSA+ W+N       D R   + CPV LGN+WGK L
Sbjct: 367 LNDVQMGGYASFPDLGFGFKPVRGSALVWHNTDNCGNCDIRGLQATCPVLLGNQWGKWL 425



 Score = 56.6 bits (135), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 26/60 (43%), Positives = 33/60 (55%)

Query: 10  ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLL 69
           +L DV++GG   FP L     P +GSA+ WHN       D     + CPV LGN+WGK L
Sbjct: 366 KLNDVQMGGYASFPDLGFGFKPVRGSALVWHNTDNCGNCDIRGLQATCPVLLGNQWGKWL 425


>gi|443719426|gb|ELU09607.1| hypothetical protein CAPTEDRAFT_229373 [Capitella teleta]
          Length = 576

 Score = 57.0 bits (136), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 35/55 (63%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+ V+ GGAT+FP L++ +   KG+A FW+N   +   +    H+GCPV LG KW
Sbjct: 494 LSSVKAGGATVFPKLHVRIPAVKGAAAFWFNIRRSGDREPLTQHAGCPVLLGEKW 548



 Score = 55.5 bits (132), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 34/55 (61%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+ V+ GGAT+FP L++ +   KG+A FW N   +   +    H+GCPV LG KW
Sbjct: 494 LSSVKAGGATVFPKLHVRIPAVKGAAAFWFNIRRSGDREPLTQHAGCPVLLGEKW 548


>gi|291235355|ref|XP_002737609.1| PREDICTED: GI24860-like [Saccoglossus kowalevskii]
          Length = 85

 Score = 57.0 bits (136), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 35/55 (63%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DV+ GG T F ++   V P K +AVFW+N   +   DYR  H+ CPV LG+KW
Sbjct: 5   LSDVDAGGYTAFTAIKTIVKPIKNAAVFWWNLLPSGERDYRTLHAACPVLLGSKW 59



 Score = 52.0 bits (123), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 26/58 (44%), Positives = 34/58 (58%)

Query: 8  SVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L+DV+ GG T F ++   V P K +AVFW N   +   DY   H+ CPV LG+KW
Sbjct: 2  DVLLSDVDAGGYTAFTAIKTIVKPIKNAAVFWWNLLPSGERDYRTLHAACPVLLGSKW 59


>gi|194765184|ref|XP_001964707.1| GF22906 [Drosophila ananassae]
 gi|190614979|gb|EDV30503.1| GF22906 [Drosophila ananassae]
          Length = 708

 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 26/63 (41%), Positives = 39/63 (61%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   S   L++VE GG T+F  LN+ + P +G+ V WHN H +  +D    H+GCPV +
Sbjct: 614 DRISTSMFYLSNVEQGGYTVFTKLNVFLPPIQGAMVMWHNLHRSLDVDARTLHAGCPVLV 673

Query: 62  GNK 64
           G+K
Sbjct: 674 GSK 676



 Score = 56.2 bits (134), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 24/54 (44%), Positives = 37/54 (68%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L++VE GG T+F  LN+ + P +G+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 623 LSNVEQGGYTVFTKLNVFLPPIQGAMVMWHNLHRSLDVDARTLHAGCPVLVGSK 676


>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
           garnettii]
          Length = 544

 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 28/66 (42%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H N   D    H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRNGEGDSDTLHAGCPV 515

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 516 LVGDKW 521



 Score = 55.8 bits (133), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H N   D    H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRNGEGDSDTLHAGCPVLV 517

Query: 62  GNKW 65
           G+KW
Sbjct: 518 GDKW 521


>gi|24666354|ref|NP_730347.1| CG32199 [Drosophila melanogaster]
 gi|23093193|gb|AAF49251.3| CG32199 [Drosophila melanogaster]
          Length = 509

 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 24/54 (44%), Positives = 34/54 (62%)

Query: 74  TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +D+ LGG T+F  L + V P+KGSA+ W+N +     D    HS CPV LG++W
Sbjct: 437 SDIPLGGTTVFTKLKIAVQPKKGSALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490



 Score = 55.8 bits (133), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   + +  +D+ LGG T+F  L + V P+KGSA+ W N +     D    HS CPV L
Sbjct: 427 DRIATALIYASDIPLGGTTVFTKLKIAVQPKKGSALIWFNLNHAGEPDPLTEHSVCPVVL 486

Query: 62  GNKW 65
           G++W
Sbjct: 487 GSRW 490


>gi|194905313|ref|XP_001981171.1| GG11766 [Drosophila erecta]
 gi|190655809|gb|EDV53041.1| GG11766 [Drosophila erecta]
          Length = 496

 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 25/56 (44%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKW 127
           L D   GGAT+FP +N+ V  E+G  +FWYN    T  +D +  H+ CPV  G+KW
Sbjct: 416 LNDAPNGGATVFPRINVKVPAERGKVLFWYNLDGETHDVDPKTVHAACPVFHGSKW 471



 Score = 52.4 bits (124), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKW 65
           L D   GGAT+FP +N+ V  E+G  +FW+N    T  +D    H+ CPV  G+KW
Sbjct: 416 LNDAPNGGATVFPRINVKVPAERGKVLFWYNLDGETHDVDPKTVHAACPVFHGSKW 471


>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
 gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
          Length = 220

 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 39/64 (60%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T+FP L+L+VFP KG AV++   ++N  L+ +  H+G  V  
Sbjct: 143 NRISTLVMYLNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYSNQELNDFTLHAGTQVIH 202

Query: 62  GNKW 65
           G KW
Sbjct: 203 GEKW 206



 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 50  YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
           Y  +      +  N+   L++  L DVE GG T+FP L+L+VFP KG AV++   ++N  
Sbjct: 130 YDFFAETSRASTNNRISTLVMY-LNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYSNQE 188

Query: 110 LDYRMYHSGCPVALGNKW 127
           L+    H+G  V  G KW
Sbjct: 189 LNDFTLHAGTQVIHGEKW 206


>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
 gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
          Length = 221

 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 36/72 (50%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLL 70
           L DVE GG T FP + + V P KG AV ++N   N  +D    H G PV  GNKW  + L
Sbjct: 149 LCDVEEGGETTFPEVGIKVKPIKGDAVLFYNCKPNGDVDPLSLHQGDPVLKGNKWVAIKL 208

Query: 71  SGLTDVELGGAT 82
                 E G  T
Sbjct: 209 INQKSKETGTPT 220



 Score = 56.2 bits (134), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 28/59 (47%), Positives = 33/59 (55%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +L  L DVE GG T FP + + V P KG AV +YN   N  +D    H G PV  GNKW
Sbjct: 145 ILMYLCDVEEGGETTFPEVGIKVKPIKGDAVLFYNCKPNGDVDPLSLHQGDPVLKGNKW 203


>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
 gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
          Length = 216

 Score = 56.6 bits (135), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 25/56 (44%), Positives = 34/56 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L DVE GG T FP LN +V P+KG AV++   + +  L+    H G PV +G+KW 
Sbjct: 151 LNDVEQGGETYFPKLNFSVSPQKGMAVYFEYFYNDQNLNDLTLHGGAPVVMGDKWA 206



 Score = 56.6 bits (135), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 42/79 (53%)

Query: 50  YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
           ++ + S    A  N     L+  L DVE GG T FP LN +V P+KG AV++   + +  
Sbjct: 128 HFDFFSSTSRAASNPRISTLVMYLNDVEQGGETYFPKLNFSVSPQKGMAVYFEYFYNDQN 187

Query: 110 LDYRMYHSGCPVALGNKWG 128
           L+    H G PV +G+KW 
Sbjct: 188 LNDLTLHGGAPVVMGDKWA 206


>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
           4-dioxygenase (proline 4-hydroxylase), alpha 1
           polypeptide [Ciona intestinalis]
          Length = 195

 Score = 56.6 bits (135), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 40/67 (59%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +GN+     L+ +++VE GG+T+F    + V P KGSAVFWYN   +   D R  H+ CP
Sbjct: 102 VGNRIA-TFLTYMSNVEQGGSTVFLHPGIAVRPIKGSAVFWYNLLPSGAGDERTRHAACP 160

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 161 VLTGVKW 167



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +++VE GG+T+F    + V P KGSAVFW+N   +   D    H+ CPV  G KW
Sbjct: 113 MSNVEQGGSTVFLHPGIAVRPIKGSAVFWYNLLPSGAGDERTRHAACPVLTGVKW 167


>gi|402584932|gb|EJW78873.1| hypothetical protein WUBG_10221 [Wuchereria bancrofti]
          Length = 187

 Score = 56.6 bits (135), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L+  +T  E+GG T+F  L  ++   K +A+FWYN   +  +D R YH+ CPV
Sbjct: 69  GNRIATILIY-MTKPEIGGGTVFIDLKTSISCTKNAALFWYNLMRSGAVDIRSYHAACPV 127

Query: 122 ALGNKW 127
             G KW
Sbjct: 128 LTGTKW 133



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + +T  E+GG T+F  L  ++   K +A+FW+N   +  +D   YH+ CPV  
Sbjct: 70  NRIATILIYMTKPEIGGGTVFIDLKTSISCTKNAALFWYNLMRSGAVDIRSYHAACPVLT 129

Query: 62  GNKW 65
           G KW
Sbjct: 130 GTKW 133


>gi|198449528|ref|XP_002136919.1| GA26870 [Drosophila pseudoobscura pseudoobscura]
 gi|198130648|gb|EDY67477.1| GA26870 [Drosophila pseudoobscura pseudoobscura]
          Length = 491

 Score = 56.6 bits (135), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L++V  GG T+FP+L + V P+ G+A+ +YN   + + D R  H GCPV  GNKW
Sbjct: 419 LSNVSQGGYTVFPNLEVAVKPQAGTALVFYNLLDSLVGDTRTRHFGCPVIDGNKW 473



 Score = 52.0 bits (123), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L++V  GG T+FP+L + V P+ G+A+ ++N   + + D    H GCPV  
Sbjct: 410 NRLFTIQIFLSNVSQGGYTVFPNLEVAVKPQAGTALVFYNLLDSLVGDTRTRHFGCPVID 469

Query: 62  GNKW 65
           GNKW
Sbjct: 470 GNKW 473


>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
 gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
          Length = 216

 Score = 56.6 bits (135), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 42/79 (53%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G  V  G KW
Sbjct: 186 SLNELTLHGGASVTKGEKW 204



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G  V  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGASVTKGEKW 204


>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
 gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
          Length = 216

 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 42/79 (53%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+  HS    A  N+   L++  L DVE GG T FP LNL+V P KG AV++   + + 
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185

Query: 109 LLDYRMYHSGCPVALGNKW 127
            L+    H G  V  G KW
Sbjct: 186 SLNELTLHGGASVTKGEKW 204



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP LNL+V P KG AV++   + +  L+    H G  V  G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGASVTKGEKW 204


>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3 [Papio anubis]
          Length = 535

 Score = 56.2 bits (134), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   NL+V   K +A+FW+N H +   D    H+GCPV
Sbjct: 448 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPV 506

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 507 LVGDKW 512



 Score = 55.5 bits (132), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   NL+V   K +A+FW N H +   D    H+GCPV +
Sbjct: 449 NRVATFMIYLSSVEAGGATAFIYANLSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLV 508

Query: 62  GNKW 65
           G+KW
Sbjct: 509 GDKW 512


>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Collimonas fungivorans Ter331]
 gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
           [Collimonas fungivorans Ter331]
          Length = 289

 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 39/69 (56%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           +A G +    L+  L DV+ GG TIFP   L++ P KGSA+++   +A   LD   +H G
Sbjct: 209 LARGGQRTATLIIYLNDVDGGGETIFPRNGLSIVPAKGSAIYFSYTNAENQLDSLSFHGG 268

Query: 119 CPVALGNKW 127
            PV  G KW
Sbjct: 269 SPVIEGEKW 277



 Score = 55.5 bits (132), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 34/57 (59%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + L DV+ GG TIFP   L++ P KGSA+++   +A   LD   +H G PV  G KW
Sbjct: 221 IYLNDVDGGGETIFPRNGLSIVPAKGSAIYFSYTNAENQLDSLSFHGGSPVIEGEKW 277


>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
           C-169]
          Length = 222

 Score = 56.2 bits (134), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 38/144 (26%), Positives = 62/144 (43%), Gaps = 22/144 (15%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           +NR E   +E  +  +   T  P  N      +G  +  + A       +  +H      
Sbjct: 59  LNRGEDDVIERIEARIAKYTAIPKENG-----EGLQILHYQASEEYRPHFDYFHDNFNTQ 113

Query: 61  LGNKWGKLLLSGLTDVELGGATIFP------SLNLTVF-----------PEKGSAVFWYN 103
            G +    +L  L+DVE GG T+FP      ++  T F           P+KG A+F+Y+
Sbjct: 114 NGGQRIATMLMYLSDVEDGGETVFPESSDKPNVGNTKFSQCAQAGAAAKPKKGDALFFYS 173

Query: 104 AHANTLLDYRMYHSGCPVALGNKW 127
              +  +D +  H+GCPV  G+KW
Sbjct: 174 LTPDGRMDEKSLHAGCPVMKGDKW 197


>gi|195166673|ref|XP_002024159.1| GL22695 [Drosophila persimilis]
 gi|194107514|gb|EDW29557.1| GL22695 [Drosophila persimilis]
          Length = 360

 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 37/67 (55%), Gaps = 8/67 (11%)

Query: 5   EMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           +   ++L++V  GG   FPSLNL V P+KGS + W             YH  CPV +GNK
Sbjct: 300 QFDDLQLSNVRQGGEIHFPSLNLRVTPQKGSLLVWPRTGD--------YHIECPVIVGNK 351

Query: 65  WGKLLLS 71
           WGK  +S
Sbjct: 352 WGKSEIS 358



 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/61 (45%), Positives = 34/61 (55%), Gaps = 8/61 (13%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLL 132
           L++V  GG   FPSLNL V P+KGS + W             YH  CPV +GNKWGK  +
Sbjct: 306 LSNVRQGGEIHFPSLNLRVTPQKGSLLVWPRTGD--------YHIECPVIVGNKWGKSEI 357

Query: 133 S 133
           S
Sbjct: 358 S 358


>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
           pallidum PN500]
          Length = 251

 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 28/59 (47%), Positives = 33/59 (55%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +L  L DVE GG T FP +N+ V P KG AV +YN   N  +D    H G PV  G KW
Sbjct: 177 ILMYLADVEEGGETSFPEVNVKVKPIKGDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKW 235



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP +N+ V P KG AV ++N   N  +D    H G PV  G KW
Sbjct: 181 LADVEEGGETSFPEVNVKVKPIKGDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKW 235


>gi|386766694|ref|NP_651648.5| CG11828 [Drosophila melanogaster]
 gi|383293009|gb|AAF56834.5| CG11828 [Drosophila melanogaster]
          Length = 458

 Score = 55.8 bits (133), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 26/63 (41%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW-GKLL 131
           L+DV++GG   FP L     P +GSA+ W+N   +   D R   + CPV LGN+W  K  
Sbjct: 385 LSDVQMGGYASFPDLGFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWVAKKW 444

Query: 132 LSG 134
           +SG
Sbjct: 445 ISG 447



 Score = 55.5 bits (132), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 1/71 (1%)

Query: 3   RAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG 62
           R+  +   L+DV++GG   FP L     P +GSA+ WHN   +   D     + CPV LG
Sbjct: 377 RSASAMFYLSDVQMGGYASFPDLGFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLG 436

Query: 63  NKW-GKLLLSG 72
           N+W  K  +SG
Sbjct: 437 NQWVAKKWISG 447


>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
          Length = 210

 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 25/56 (44%), Positives = 35/56 (62%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L DVE GG T FP+L L+V P KG AV++   + + +L+    H G PV +G+KW 
Sbjct: 144 LNDVEEGGDTYFPNLKLSVSPHKGMAVYFEYFYDDPMLNELTLHGGAPVTIGDKWA 199



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 25/56 (44%), Positives = 35/56 (62%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L DVE GG T FP+L L+V P KG AV++   + + +L+    H G PV +G+KW 
Sbjct: 144 LNDVEEGGDTYFPNLKLSVSPHKGMAVYFEYFYDDPMLNELTLHGGAPVTIGDKWA 199


>gi|195352184|ref|XP_002042594.1| GM14981 [Drosophila sechellia]
 gi|194124478|gb|EDW46521.1| GM14981 [Drosophila sechellia]
          Length = 539

 Score = 55.8 bits (133), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 24/54 (44%), Positives = 34/54 (62%)

Query: 74  TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +DV LGG T+F  L + V P+KG+A+ W+N +     D    HS CPV LG++W
Sbjct: 467 SDVPLGGTTVFTKLKIAVKPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 520



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 24/54 (44%), Positives = 33/54 (61%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +DV LGG T+F  L + V P+KG+A+ W N +     D    HS CPV LG++W
Sbjct: 467 SDVPLGGTTVFTKLKIAVKPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 520


>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
          Length = 249

 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++DV  GGAT+FP L + + P KG+A  W+N + +   D R  H+ CPV
Sbjct: 156 GNRIATVLFY-MSDVAQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPV 214

Query: 122 ALGNKW 127
             G+KW
Sbjct: 215 LQGSKW 220



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           ++DV  GGAT+FP L + + P KG+A  W N + +   D    H+ CPV  G+KW
Sbjct: 166 MSDVAQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKW 220


>gi|195055777|ref|XP_001994789.1| GH14121 [Drosophila grimshawi]
 gi|193892552|gb|EDV91418.1| GH14121 [Drosophila grimshawi]
          Length = 517

 Score = 55.8 bits (133), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 26/60 (43%), Positives = 36/60 (60%), Gaps = 5/60 (8%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA-----LGNKW 127
           L+DV+ GG T+FP+ N+ V P KGS + WYN   +   D+R  H+GC V      +GN W
Sbjct: 440 LSDVQQGGYTVFPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIEGIKRIGNIW 499



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 26/69 (37%), Positives = 39/69 (56%), Gaps = 5/69 (7%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA- 60
           +R   S + L+DV+ GG T+FP+ N+ V P KGS + W+N   +   D+   H+GC V  
Sbjct: 431 DRISTSMIYLSDVQQGGYTVFPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIE 490

Query: 61  ----LGNKW 65
               +GN W
Sbjct: 491 GIKRIGNIW 499


>gi|195069799|ref|XP_001997030.1| GH12979 [Drosophila grimshawi]
 gi|193891499|gb|EDV90365.1| GH12979 [Drosophila grimshawi]
          Length = 517

 Score = 55.8 bits (133), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 26/60 (43%), Positives = 36/60 (60%), Gaps = 5/60 (8%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA-----LGNKW 127
           L+DV+ GG T+FP+ N+ V P KGS + WYN   +   D+R  H+GC V      +GN W
Sbjct: 440 LSDVQQGGYTVFPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIEGIKRIGNIW 499



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 26/69 (37%), Positives = 39/69 (56%), Gaps = 5/69 (7%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA- 60
           +R   S + L+DV+ GG T+FP+ N+ V P KGS + W+N   +   D+   H+GC V  
Sbjct: 431 DRISTSMIYLSDVQQGGYTVFPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIE 490

Query: 61  ----LGNKW 65
               +GN W
Sbjct: 491 GIKRIGNIW 499


>gi|195591304|ref|XP_002085382.1| GD14758 [Drosophila simulans]
 gi|194197391|gb|EDX10967.1| GD14758 [Drosophila simulans]
          Length = 509

 Score = 55.8 bits (133), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 24/54 (44%), Positives = 34/54 (62%)

Query: 74  TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +DV LGG T+F  L + V P+KG+A+ W+N +     D    HS CPV LG++W
Sbjct: 437 SDVPLGGTTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 24/54 (44%), Positives = 33/54 (61%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +DV LGG T+F  L + V P+KG+A+ W N +     D    HS CPV LG++W
Sbjct: 437 SDVPLGGTTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490


>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 533

 Score = 55.8 bits (133), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 40/68 (58%), Gaps = 3/68 (4%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSL--NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
           GN+   +LL  L+DV+ GG T+F +    +   P KG+ VFWYN   N   + +  H+GC
Sbjct: 444 GNRIATMLLY-LSDVDWGGRTVFTNTAPGVGTDPIKGAGVFWYNLLRNGKSNPKTQHAGC 502

Query: 120 PVALGNKW 127
           PV LG KW
Sbjct: 503 PVVLGQKW 510



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 36/66 (54%), Gaps = 2/66 (3%)

Query: 2   NRAEMSSVELTDVELGGATIFPSL--NLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPV 59
           NR     + L+DV+ GG T+F +    +   P KG+ VFW+N   N   +    H+GCPV
Sbjct: 445 NRIATMLLYLSDVDWGGRTVFTNTAPGVGTDPIKGAGVFWYNLLRNGKSNPKTQHAGCPV 504

Query: 60  ALGNKW 65
            LG KW
Sbjct: 505 VLGQKW 510


>gi|198466405|ref|XP_001353987.2| GA16752 [Drosophila pseudoobscura pseudoobscura]
 gi|198150585|gb|EAL29723.2| GA16752 [Drosophila pseudoobscura pseudoobscura]
          Length = 510

 Score = 55.8 bits (133), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 27/74 (36%), Positives = 42/74 (56%), Gaps = 4/74 (5%)

Query: 58  PVALGNKWGKLLLSGL----TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR 113
           P     +WG  +L+ +    +DV LGGA +FP L ++V P+KG+A+ W N +     D  
Sbjct: 418 PANYFTEWGGDVLATVLLYASDVRLGGAVVFPKLKISVEPKKGNALIWDNLNNAGNPDKL 477

Query: 114 MYHSGCPVALGNKW 127
             H+ CPV +G+ W
Sbjct: 478 SKHAVCPVVMGSHW 491



 Score = 55.1 bits (131), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 23/54 (42%), Positives = 34/54 (62%)

Query: 12  TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +DV LGGA +FP L ++V P+KG+A+ W N +     D    H+ CPV +G+ W
Sbjct: 438 SDVRLGGAVVFPKLKISVEPKKGNALIWDNLNNAGNPDKLSKHAVCPVVMGSHW 491


>gi|195069738|ref|XP_001997014.1| GH23597 [Drosophila grimshawi]
 gi|193892024|gb|EDV90890.1| GH23597 [Drosophila grimshawi]
          Length = 239

 Score = 55.8 bits (133), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 32/55 (58%), Gaps = 1/55 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
           L DV  GGATIFP L L V PE+G  + W+N    T  L+ R  H  CPV +G K
Sbjct: 165 LNDVARGGATIFPDLELLVQPERGKVLHWHNMDLGTYDLEKRSLHGACPVIMGKK 219



 Score = 55.1 bits (131), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 31/55 (56%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L DV  GGATIFP L L V PE+G  + WHN    T  L+    H  CPV +G K
Sbjct: 165 LNDVARGGATIFPDLELLVQPERGKVLHWHNMDLGTYDLEKRSLHGACPVIMGKK 219


>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
          Length = 544

 Score = 55.5 bits (132), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   NL+V   + +A+FW+N H +   D    H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 515

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 516 LVGDKW 521



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   NL+V   + +A+FW N H +   D    H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLV 517

Query: 62  GNKW 65
           G+KW
Sbjct: 518 GDKW 521


>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
 gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
          Length = 215

 Score = 55.5 bits (132), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T+FP L+L+VFP KG AV++   + N  ++ +  H+G  V  
Sbjct: 138 NRISTLVMYLNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYRNQEVNEFTLHAGAQVIH 197

Query: 62  GNKW 65
           G KW
Sbjct: 198 GEKW 201



 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 50  YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
           Y  +      +  N+   L++  L DVE GG T+FP L+L+VFP KG AV++   + N  
Sbjct: 125 YDFFAETSRASTNNRISTLVMY-LNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYRNQE 183

Query: 110 LDYRMYHSGCPVALGNKW 127
           ++    H+G  V  G KW
Sbjct: 184 VNEFTLHAGAQVIHGEKW 201


>gi|195494565|ref|XP_002094892.1| GE19961 [Drosophila yakuba]
 gi|194180993|gb|EDW94604.1| GE19961 [Drosophila yakuba]
          Length = 441

 Score = 55.5 bits (132), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 28/60 (46%), Positives = 35/60 (58%), Gaps = 5/60 (8%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           LL  L DV+ GGAT+FP L + VFP+KGS +FW      + LD       CPV   N+WG
Sbjct: 386 LLFFLEDVKQGGATVFPKLKIAVFPQKGSCLFW-----QSTLDPGSKPLECPVLQRNRWG 440



 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 5/56 (8%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L DV+ GGAT+FP L + VFP+KGS +FW +      LD       CPV   N+WG
Sbjct: 390 LEDVKQGGATVFPKLKIAVFPQKGSCLFWQST-----LDPGSKPLECPVLQRNRWG 440


>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
          Length = 528

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   NL+V   + +A+FW+N H +   D    H+GCPV
Sbjct: 441 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 499

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 500 LVGDKW 505



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   NL+V   + +A+FW N H +   D    H+GCPV +
Sbjct: 442 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLV 501

Query: 62  GNKW 65
           G+KW
Sbjct: 502 GDKW 505


>gi|308508235|ref|XP_003116301.1| CRE-PHY-3 protein [Caenorhabditis remanei]
 gi|308251245|gb|EFO95197.1| CRE-PHY-3 protein [Caenorhabditis remanei]
          Length = 253

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 49/110 (44%), Gaps = 6/110 (5%)

Query: 22  FPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP-----VALGNKWGKLLLSGLTDV 76
            P+LN T      +  +    H +   DY  Y S          + N+ G L+   L   
Sbjct: 97  IPALNFTSAELFSALSYLPGGHYSVHYDYLTYRSEKEHDWWMKNMRNRIGTLIFV-LKTA 155

Query: 77  ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           E GG T+FPS+  T+    G A FW+N  A+   + R+ H GCP+  G K
Sbjct: 156 EKGGGTVFPSIGTTIRINPGDAFFWFNTQADESQEMRVNHGGCPIYEGRK 205



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 43/94 (45%), Gaps = 10/94 (10%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK-----W 65
           L   E GG T+FPS+  T+    G A FW N  A+   +  + H GCP+  G K     W
Sbjct: 152 LKTAEKGGGTVFPSIGTTIRINPGDAFFWFNTQADESQEMRVNHGGCPIYEGRKVITTLW 211

Query: 66  GK-----LLLSGLTDVELGGATIFPSLNLTVFPE 94
            +     +L    TD  +  + + PS + T  PE
Sbjct: 212 VRAKNQPILPMASTDSSIHASWLIPSFSSTFRPE 245


>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
 gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
          Length = 219

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 39/75 (52%)

Query: 53  YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
           Y +    A  N     L+  L DVE GG T FP L L+V P KG AV++   +++  L+ 
Sbjct: 132 YFTSASKASKNNRISTLVMYLNDVEEGGETYFPKLGLSVSPTKGMAVYFEYFYSDAELND 191

Query: 113 RMYHSGCPVALGNKW 127
           R  H G PV  G KW
Sbjct: 192 RTLHGGAPVIKGEKW 206



 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP L L+V P KG AV++   +++  L+    H G PV  G KW
Sbjct: 152 LNDVEEGGETYFPKLGLSVSPTKGMAVYFEYFYSDAELNDRTLHGGAPVIKGEKW 206


>gi|38454288|ref|NP_942070.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Rattus norvegicus]
 gi|81870816|sp|Q6W3E9.1|P4HA3_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962768|gb|AAQ87605.1| collagen prolyl 4-hydroxylase alpha III subunit [Rattus norvegicus]
          Length = 544

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   L++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 457 GNRAATLMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDDDTLHAGCPV 515

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 516 LVGDKW 521



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NRA    + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 458 NRAATLMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDDDTLHAGCPVLV 517

Query: 62  GNKW 65
           G+KW
Sbjct: 518 GDKW 521


>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
          Length = 278

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 2/90 (2%)

Query: 38  FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
           + H    +   DY  +H    +A G      +L  L++V  GG T+FP   + + P+KG+
Sbjct: 130 YEHGQKYDAHFDY--FHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQVCLKPKKGN 187

Query: 98  AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           A+ ++N   + + D    H GCPV  G KW
Sbjct: 188 ALLFFNLQQDAIPDPFSLHGGCPVIEGEKW 217



 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L++V  GG T+FP   + + P+KG+A+ + N   + + D +  H GCPV  G KW
Sbjct: 163 LSNVTKGGETVFPDAQVCLKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKW 217


>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
           leucogenys]
          Length = 544

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   NL+V   + +A+FW+N H +   D    H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 515

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 516 LVGDKW 521



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   NL+V   + +A+FW N H +   D    H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLV 517

Query: 62  GNKW 65
           G+KW
Sbjct: 518 GDKW 521


>gi|195128345|ref|XP_002008624.1| GI13596 [Drosophila mojavensis]
 gi|193920233|gb|EDW19100.1| GI13596 [Drosophila mojavensis]
          Length = 527

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 23/49 (46%), Positives = 33/49 (67%)

Query: 79  GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           GGAT+FP++ +TV P+KGSA+ WYN   +   +    HS CPV  G++W
Sbjct: 453 GGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLTLHSVCPVISGSRW 501



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 22/49 (44%), Positives = 33/49 (67%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GGAT+FP++ +TV P+KGSA+ W+N   +   +    HS CPV  G++W
Sbjct: 453 GGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLTLHSVCPVISGSRW 501


>gi|195128343|ref|XP_002008623.1| GI13594 [Drosophila mojavensis]
 gi|193920232|gb|EDW19099.1| GI13594 [Drosophila mojavensis]
          Length = 511

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 22/49 (44%), Positives = 33/49 (67%)

Query: 79  GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           GGAT+FP++ +TV P+KGSA+ WYN   +   +    H+ CPV  G++W
Sbjct: 441 GGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLSLHTACPVISGSRW 489



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 21/49 (42%), Positives = 33/49 (67%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GGAT+FP++ +TV P+KGSA+ W+N   +   +    H+ CPV  G++W
Sbjct: 441 GGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLSLHTACPVISGSRW 489


>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
           gorilla gorilla]
          Length = 517

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   NL+V   + +A+FW+N H +   D    H+GCPV
Sbjct: 430 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 488

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 489 LVGDKW 494



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   NL+V   + +A+FW N H +   D    H+GCPV +
Sbjct: 431 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLV 490

Query: 62  GNKW 65
           G+KW
Sbjct: 491 GDKW 494


>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
 gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
           KWC4]
          Length = 215

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 5/83 (6%)

Query: 49  DYWMYHS----GCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 104
           +Y  +H     G P A  N+   L++  L DVE GG T+FP L + + P++G+A+++   
Sbjct: 115 EYKAHHDFFAPGSPAARNNRISTLIVY-LNDVEEGGETVFPLLGIAMKPKRGAALYFEYF 173

Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
           + N  L+    HS  PV  G KW
Sbjct: 174 YRNQALNDLTLHSSVPVVRGEKW 196



 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L DVE GG T+FP L + + P++G+A+++   + N  L+    HS  PV  
Sbjct: 133 NRISTLIVYLNDVEEGGETVFPLLGIAMKPKRGAALYFEYFYRNQALNDLTLHSSVPVVR 192

Query: 62  GNKW 65
           G KW
Sbjct: 193 GEKW 196


>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
 gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
          Length = 219

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 39/75 (52%)

Query: 53  YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
           Y +    A  N     L+  L DVE GG T FP L L+V P KG AV++   +++  L+ 
Sbjct: 132 YFTSASKASKNNRISTLVMYLNDVEEGGETYFPKLGLSVSPTKGMAVYFEYFYSDAELND 191

Query: 113 RMYHSGCPVALGNKW 127
           R  H G PV  G KW
Sbjct: 192 RTLHGGAPVIKGEKW 206



 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP L L+V P KG AV++   +++  L+    H G PV  G KW
Sbjct: 152 LNDVEEGGETYFPKLGLSVSPTKGMAVYFEYFYSDAELNDRTLHGGAPVIKGEKW 206


>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
 gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
 gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
 gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
 gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
 gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III, isoform CRA_b
           [Homo sapiens]
 gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
          Length = 544

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   NL+V   + +A+FW+N H +   D    H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 515

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 516 LVGDKW 521



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   NL+V   + +A+FW N H +   D    H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLV 517

Query: 62  GNKW 65
           G+KW
Sbjct: 518 GDKW 521


>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 309

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 44/107 (41%), Gaps = 30/107 (28%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN-------------------- 88
           DY  +H    V  G +    +L  L+DVE GG T+FP                       
Sbjct: 181 DY--FHDEVNVKNGGQRAATVLMYLSDVEEGGETVFPRGTPLGGAAPEKSGVTQGNACER 238

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L V P +G A+ ++N H N  +D R  H+GCPV  G KW
Sbjct: 239 ALRGDPNVLAVKPRRGDALLFFNVHLNGEVDERARHAGCPVVRGTKW 285



 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 33/83 (39%), Gaps = 28/83 (33%)

Query: 11  LTDVELGGATIFPSLN----------------------------LTVFPEKGSAVFWHNA 42
           L+DVE GG T+FP                               L V P +G A+ + N 
Sbjct: 203 LSDVEEGGETVFPRGTPLGGAAPEKSGVTQGNACERALRGDPNVLAVKPRRGDALLFFNV 262

Query: 43  HANTLLDYWMYHSGCPVALGNKW 65
           H N  +D    H+GCPV  G KW
Sbjct: 263 HLNGEVDERARHAGCPVVRGTKW 285


>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 217

 Score = 55.5 bits (132), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 40/69 (57%)

Query: 60  ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
           A+ N     L+  L+DVE GG T FP L+ +V P+KG AV++   + +  L+    H G 
Sbjct: 138 AINNPRISTLVMYLSDVEQGGETYFPKLHFSVSPQKGMAVYFEYFYNDQTLNELTLHGGA 197

Query: 120 PVALGNKWG 128
           PV +G+KW 
Sbjct: 198 PVIVGDKWA 206



 Score = 55.1 bits (131), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L+DVE GG T FP L+ +V P+KG AV++   + +  L+    H G PV +G+KW 
Sbjct: 151 LSDVEQGGETYFPKLHFSVSPQKGMAVYFEYFYNDQTLNELTLHGGAPVIVGDKWA 206


>gi|116496629|gb|AAI26171.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
          Length = 544

 Score = 55.5 bits (132), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   NL+V   + +A+FW+N H +   D    H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 515

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 516 LVGDKW 521



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   NL+V   + +A+FW N H +   D    H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLV 517

Query: 62  GNKW 65
           G+KW
Sbjct: 518 GDKW 521


>gi|20177086|gb|AAM12247.1| AT28279p [Drosophila melanogaster]
          Length = 509

 Score = 55.5 bits (132), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 23/54 (42%), Positives = 34/54 (62%)

Query: 74  TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +D+ LGG T+F  L + V P+KG+A+ W+N +     D    HS CPV LG++W
Sbjct: 437 SDIPLGGTTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490



 Score = 54.3 bits (129), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 37/64 (57%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   + +  +D+ LGG T+F  L + V P+KG+A+ W N +     D    HS CPV L
Sbjct: 427 DRIATALIYASDIPLGGTTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVL 486

Query: 62  GNKW 65
           G++W
Sbjct: 487 GSRW 490


>gi|195069797|ref|XP_001997029.1| GH12978 [Drosophila grimshawi]
 gi|193891498|gb|EDV90364.1| GH12978 [Drosophila grimshawi]
          Length = 518

 Score = 55.5 bits (132), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 26/63 (41%), Positives = 37/63 (58%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   + + L+DV+ GG T+FP+ N+ V P KG+ V WHN   +   D    H+GCPV  
Sbjct: 432 DRISTNMIYLSDVQQGGYTVFPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIE 491

Query: 62  GNK 64
           G K
Sbjct: 492 GTK 494



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 25/54 (46%), Positives = 34/54 (62%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DV+ GG T+FP+ N+ V P KG+ V W+N   +   D R  H+GCPV  G K
Sbjct: 441 LSDVQQGGYTVFPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIEGTK 494


>gi|195069795|ref|XP_001997028.1| GH12977 [Drosophila grimshawi]
 gi|193891497|gb|EDV90363.1| GH12977 [Drosophila grimshawi]
          Length = 517

 Score = 55.5 bits (132), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 26/63 (41%), Positives = 37/63 (58%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R   + + L+DV+ GG T+FP+ N+ V P KG+ V WHN   +   D    H+GCPV  
Sbjct: 431 DRISTNMIYLSDVQQGGYTVFPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIE 490

Query: 62  GNK 64
           G K
Sbjct: 491 GTK 493



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 25/54 (46%), Positives = 34/54 (62%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DV+ GG T+FP+ N+ V P KG+ V W+N   +   D R  H+GCPV  G K
Sbjct: 440 LSDVQQGGYTVFPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIEGTK 493


>gi|195159146|ref|XP_002020443.1| GL13510 [Drosophila persimilis]
 gi|194117212|gb|EDW39255.1| GL13510 [Drosophila persimilis]
          Length = 527

 Score = 55.1 bits (131), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 2/90 (2%)

Query: 38  FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
           + +  H +   DY+   +    A G++   +L   L D++ GG+T F  L L V  E+G 
Sbjct: 413 YGYGGHYDIHFDYFNTSTPITKARGDRMATVLFY-LNDMKHGGSTAFTDLQLKVPSERGK 471

Query: 98  AVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
            +FWYN    T  LD R  H  CPV  G K
Sbjct: 472 VLFWYNMRGETHDLDSRTLHGACPVINGTK 501



 Score = 45.4 bits (106), Expect = 0.007,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L D++ GG+T F  L L V  E+G  +FW+N    T  LD    H  CPV  G K
Sbjct: 447 LNDMKHGGSTAFTDLQLKVPSERGKVLFWYNMRGETHDLDSRTLHGACPVINGTK 501


>gi|195341584|ref|XP_002037386.1| GM12898 [Drosophila sechellia]
 gi|194131502|gb|EDW53545.1| GM12898 [Drosophila sechellia]
          Length = 536

 Score = 55.1 bits (131), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 33/55 (60%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L +V+ GGAT FP LNL V  +KGSA+FWHN    +   D   +H  CP+  G K
Sbjct: 447 LNNVDHGGATAFPKLNLVVPTQKGSALFWHNLDRKSYDYDTCTFHGACPLISGTK 501



 Score = 52.4 bits (124), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 33/55 (60%), Gaps = 1/55 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
           L +V+ GGAT FP LNL V  +KGSA+FW+N    +   D   +H  CP+  G K
Sbjct: 447 LNNVDHGGATAFPKLNLVVPTQKGSALFWHNLDRKSYDYDTCTFHGACPLISGTK 501


>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
 gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
          Length = 219

 Score = 55.1 bits (131), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 39/75 (52%)

Query: 53  YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
           Y +    A  N     L+  L DVE GG T FP L L++ P KG AV++   +++  L+ 
Sbjct: 132 YFTSASKASKNNRISTLVMYLNDVEEGGETYFPKLGLSISPTKGMAVYFEYFYSDAELND 191

Query: 113 RMYHSGCPVALGNKW 127
           R  H G PV  G KW
Sbjct: 192 RTLHGGAPVIKGEKW 206



 Score = 51.6 bits (122), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP L L++ P KG AV++   +++  L+    H G PV  G KW
Sbjct: 152 LNDVEEGGETYFPKLGLSISPTKGMAVYFEYFYSDAELNDRTLHGGAPVIKGEKW 206


>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
 gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
          Length = 269

 Score = 55.1 bits (131), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 48/104 (46%), Gaps = 22/104 (21%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSL------------ 87
           +++H N     + +H       GN+W   +L+ LTD E GG T+FP +            
Sbjct: 134 YDSHVN-----YFFHKEGSANGGNRWA-TVLTYLTDTEEGGETVFPKIPAPGGVNVGFSE 187

Query: 88  ----NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
               NL V P KG A+ +++   N  L+ R  H  CPV  G K+
Sbjct: 188 CAKYNLAVKPRKGDAILFHSMKTNGQLEERSLHGACPVIKGEKF 231



 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 32/71 (45%), Gaps = 16/71 (22%)

Query: 11  LTDVELGGATIFPSL----------------NLTVFPEKGSAVFWHNAHANTLLDYWMYH 54
           LTD E GG T+FP +                NL V P KG A+ +H+   N  L+    H
Sbjct: 161 LTDTEEGGETVFPKIPAPGGVNVGFSECAKYNLAVKPRKGDAILFHSMKTNGQLEERSLH 220

Query: 55  SGCPVALGNKW 65
             CPV  G K+
Sbjct: 221 GACPVIKGEKF 231


>gi|194751835|ref|XP_001958229.1| GF10816 [Drosophila ananassae]
 gi|190625511|gb|EDV41035.1| GF10816 [Drosophila ananassae]
          Length = 419

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 22/59 (37%), Positives = 36/59 (61%), Gaps = 6/59 (10%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
           L +V+ GG T+FPSLN+ + P  G+ + W N+      + R++H  CP+ +GN W K +
Sbjct: 367 LNNVKQGGETVFPSLNIKLKPRMGTMLLWLNS------EKRVWHHQCPLIVGNMWSKFM 419



 Score = 51.6 bits (122), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 34/59 (57%), Gaps = 6/59 (10%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLL 69
           L +V+ GG T+FPSLN+ + P  G+ + W N+         ++H  CP+ +GN W K +
Sbjct: 367 LNNVKQGGETVFPSLNIKLKPRMGTMLLWLNSEKR------VWHHQCPLIVGNMWSKFM 419


>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
 gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
          Length = 293

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 40/73 (54%)

Query: 55  SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
           S   + +G +    LL  L DV  GGAT+FP+L L V P KG AV++  ++ +  +D   
Sbjct: 202 SEAQMVVGGQRVSTLLIYLNDVAQGGATVFPTLGLRVLPRKGMAVYFEYSNRDGQVDPLT 261

Query: 115 YHSGCPVALGNKW 127
            H G PV  G KW
Sbjct: 262 LHGGEPVEKGEKW 274



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DV  GGAT+FP+L L V P KG AV++  ++ +  +D    H G PV  G KW
Sbjct: 220 LNDVAQGGATVFPTLGLRVLPRKGMAVYFEYSNRDGQVDPLTLHGGEPVEKGEKW 274


>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
           caballus]
          Length = 548

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 461 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPV 519

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 520 LVGDKW 525



 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 462 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLV 521

Query: 62  GNKW 65
           G+KW
Sbjct: 522 GDKW 525


>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
          Length = 492

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 28/55 (50%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L  VE GG T FP L   V P  G AVFWYN   +   D    H  CPV  G+KW
Sbjct: 420 LNQVEQGGYTAFPRLGAAVEPGHGDAVFWYNLLPDGESDNNTLHGACPVLQGSKW 474



 Score = 51.6 bits (122), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 31/64 (48%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R     + L  VE GG T FP L   V P  G AVFW+N   +   D    H  CPV  
Sbjct: 411 DRIATFMIYLNQVEQGGYTAFPRLGAAVEPGHGDAVFWYNLLPDGESDNNTLHGACPVLQ 470

Query: 62  GNKW 65
           G+KW
Sbjct: 471 GSKW 474


>gi|260787668|ref|XP_002588874.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
 gi|229274045|gb|EEN44885.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
          Length = 151

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 36/55 (65%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DVE GGAT+F  LNLTV   K SAV +H+   +   +    H+GCPV +G+KW
Sbjct: 74  LSDVEAGGATVFTRLNLTVPAVKNSAVLFHDLKRSLEFEKDSEHAGCPVLMGSKW 128



 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DVE GGAT+F  LNLTV   K SAV +++   +   +    H+GCPV +G+KW
Sbjct: 74  LSDVEAGGATVFTRLNLTVPAVKNSAVLFHDLKRSLEFEKDSEHAGCPVLMGSKW 128


>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
          Length = 572

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 485 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 543

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 544 LVGDKW 549



 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 486 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 545

Query: 62  GNKW 65
           G+KW
Sbjct: 546 GDKW 549


>gi|195341582|ref|XP_002037385.1| GM12897 [Drosophila sechellia]
 gi|194131501|gb|EDW53544.1| GM12897 [Drosophila sechellia]
          Length = 467

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 24/59 (40%), Positives = 37/59 (62%), Gaps = 1/59 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
           L D   GGAT+FP LN+ V  E+G  + WYN + ++  D    H+ CPV  G+K+G+++
Sbjct: 403 LNDAPYGGATVFPRLNVKVPAERGKVLLWYNLNGDS-QDVTTVHAVCPVFHGSKYGEIV 460



 Score = 52.0 bits (123), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 23/59 (38%), Positives = 37/59 (62%), Gaps = 1/59 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLL 69
           L D   GGAT+FP LN+ V  E+G  + W+N + ++  D    H+ CPV  G+K+G+++
Sbjct: 403 LNDAPYGGATVFPRLNVKVPAERGKVLLWYNLNGDS-QDVTTVHAVCPVFHGSKYGEIV 460


>gi|195159168|ref|XP_002020454.1| GL13504 [Drosophila persimilis]
 gi|194117223|gb|EDW39266.1| GL13504 [Drosophila persimilis]
          Length = 491

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 36/55 (65%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L++V  GG T+FP+L + V P+ G+A+ +YN   + + D R  H GCPV  G+KW
Sbjct: 419 LSNVSQGGYTVFPNLEVAVKPQAGTALVFYNLLDSLVGDTRTRHFGCPVIDGDKW 473



 Score = 50.4 bits (119), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 38/64 (59%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L++V  GG T+FP+L + V P+ G+A+ ++N   + + D    H GCPV  
Sbjct: 410 NRLFTIQIFLSNVSQGGYTVFPNLEVAVKPQAGTALVFYNLLDSLVGDTRTRHFGCPVID 469

Query: 62  GNKW 65
           G+KW
Sbjct: 470 GDKW 473


>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
           melanoleuca]
          Length = 539

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 452 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 510

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 511 LVGDKW 516



 Score = 53.5 bits (127), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 453 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 512

Query: 62  GNKW 65
           G+KW
Sbjct: 513 GDKW 516


>gi|47204411|emb|CAF95476.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 284

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +++  L+ VE GG+T F   N +V   K +A+FW+N H N   D    H+GCPV
Sbjct: 197 GNRVATVMIY-LSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPV 255

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 256 LIGDKW 261



 Score = 53.1 bits (126), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GG+T F   N +V   K +A+FW N H N   D    H+GCPV +
Sbjct: 198 NRVATVMIYLSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLI 257

Query: 62  GNKW 65
           G+KW
Sbjct: 258 GDKW 261


>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
 gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
          Length = 283

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T+F    L+  P+KG+A+++H  +A   LD    HS  PV  G KW
Sbjct: 215 LNDVEDGGETVFSKAGLSFVPKKGAAIYFHYGNAQGQLDRLSVHSSVPVRKGEKW 269



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 41/80 (51%), Gaps = 1/80 (1%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 107
            D++  H       G + G  L+  L DVE GG T+F    L+  P+KG+A++++  +A 
Sbjct: 191 FDFFPPHMADASKGGQRVGTFLIY-LNDVEDGGETVFSKAGLSFVPKKGAAIYFHYGNAQ 249

Query: 108 TLLDYRMYHSGCPVALGNKW 127
             LD    HS  PV  G KW
Sbjct: 250 GQLDRLSVHSSVPVRKGEKW 269


>gi|194765144|ref|XP_001964687.1| GF22917 [Drosophila ananassae]
 gi|190614959|gb|EDV30483.1| GF22917 [Drosophila ananassae]
          Length = 529

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 32/86 (37%), Positives = 41/86 (47%), Gaps = 2/86 (2%)

Query: 42  AHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFW 101
           AH     DY+       V  G++   +L   L D   GGAT+FP LN+ V  EKG  +FW
Sbjct: 415 AHYAQHHDYFNMSLKSTVLQGDRIATVLFY-LNDAPHGGATVFPMLNVKVPAEKGKILFW 473

Query: 102 YNAHANTL-LDYRMYHSGCPVALGNK 126
           YN    T   D +  H  CP+  G K
Sbjct: 474 YNLKGETHDFDEKTLHGACPIFHGTK 499



 Score = 48.9 bits (115), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L D   GGAT+FP LN+ V  EKG  +FW+N    T   D    H  CP+  G K
Sbjct: 445 LNDAPHGGATVFPMLNVKVPAEKGKILFWYNLKGETHDFDEKTLHGACPIFHGTK 499


>gi|198452400|ref|XP_002137470.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
 gi|198131917|gb|EDY68028.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
          Length = 348

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 43/87 (49%), Gaps = 4/87 (4%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAV 99
           H+ H +    YW Y        G++   ++   L DVE GG T+FP L L + P KGSA+
Sbjct: 241 HDPHVDYHDLYWEYEYH---PFGDRQASVVFY-LNDVEDGGETVFPKLQLVIPPTKGSAL 296

Query: 100 FWYNAHANTLLDYRMYHSGCPVALGNK 126
            W+N       D R  H+ CPV  G K
Sbjct: 297 MWHNLRPWGEGDPRTQHASCPVLSGYK 323



 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/68 (42%), Positives = 36/68 (52%), Gaps = 2/68 (2%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKL-- 68
           L DVE GG T+FP L L + P KGSA+ WHN       D    H+ CPV  G K   +  
Sbjct: 270 LNDVEDGGETVFPKLQLVIPPTKGSALMWHNLRPWGEGDPRTQHASCPVLSGYKQVAIQW 329

Query: 69  LLSGLTDV 76
           +L G  D+
Sbjct: 330 ILQGTRDL 337


>gi|195575139|ref|XP_002105537.1| GD21537 [Drosophila simulans]
 gi|194201464|gb|EDX15040.1| GD21537 [Drosophila simulans]
          Length = 536

 Score = 54.3 bits (129), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 30/79 (37%), Positives = 43/79 (54%), Gaps = 2/79 (2%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY+   +     +G++   +L   L +V+ GGAT FP LNL V  +KGSA+FW+N    +
Sbjct: 424 DYFNSKTSYLERVGDRMATVLFY-LNNVDHGGATAFPKLNLVVPTQKGSALFWHNLDRKS 482

Query: 109 L-LDYRMYHSGCPVALGNK 126
              D R  H  CP+  G K
Sbjct: 483 YDYDTRTSHGACPLISGTK 501



 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 32/55 (58%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L +V+ GGAT FP LNL V  +KGSA+FWHN    +   D    H  CP+  G K
Sbjct: 447 LNNVDHGGATAFPKLNLVVPTQKGSALFWHNLDRKSYDYDTRTSHGACPLISGTK 501


>gi|347966278|ref|XP_003435891.1| AGAP013377-PA [Anopheles gambiae str. PEST]
 gi|333470133|gb|EGK97522.1| AGAP013377-PA [Anopheles gambiae str. PEST]
          Length = 290

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 33/55 (60%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+DV  GG TIFP L + + P+KG+A FWYN       + +M +S CP+    KW
Sbjct: 209 LSDVTNGGYTIFPKLRVAIRPQKGTAAFWYNLKDTGEGNVQMKYSICPLQDDQKW 263



 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DV  GG TIFP L + + P+KG+A FW+N       +  M +S CP+    KW
Sbjct: 209 LSDVTNGGYTIFPKLRVAIRPQKGTAAFWYNLKDTGEGNVQMKYSICPLQDDQKW 263


>gi|410972729|ref|XP_003992809.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Felis catus]
          Length = 533

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 446 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 504

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 505 LVGDKW 510



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 447 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 506

Query: 62  GNKW 65
           G+KW
Sbjct: 507 GDKW 510


>gi|328718393|ref|XP_001945742.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Acyrthosiphon pisum]
          Length = 511

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 30/70 (42%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH-ANTLLDYRMYHS 117
           V  GN+   +L   LTDV+  G T FP LN+    EKG+A+ W N H +N    Y   H 
Sbjct: 423 VKFGNRLVTVLFY-LTDVQNDGYTSFPMLNIIAPAEKGAALVWNNLHMSNGQKFYETLHG 481

Query: 118 GCPVALGNKW 127
            CP+  GNKW
Sbjct: 482 SCPLLKGNKW 491



 Score = 52.8 bits (125), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAH-ANTLLDYWMYHSGCPVALGNKW 65
           LTDV+  G T FP LN+    EKG+A+ W+N H +N    Y   H  CP+  GNKW
Sbjct: 436 LTDVQNDGYTSFPMLNIIAPAEKGAALVWNNLHMSNGQKFYETLHGSCPLLKGNKW 491


>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
           latipes]
          Length = 517

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GG+T F   N +V   K +A+FW+N H N   D    H+GCPV
Sbjct: 430 GNRVATFMIY-LSSVEAGGSTAFIYANFSVPVLKKAAIFWWNLHRNGRGDAETLHAGCPV 488

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 489 LIGDKW 494



 Score = 52.8 bits (125), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GG+T F   N +V   K +A+FW N H N   D    H+GCPV +
Sbjct: 431 NRVATFMIYLSSVEAGGSTAFIYANFSVPVLKKAAIFWWNLHRNGRGDAETLHAGCPVLI 490

Query: 62  GNKW 65
           G+KW
Sbjct: 491 GDKW 494


>gi|73988166|ref|XP_851718.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Canis lupus
           familiaris]
          Length = 544

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 515

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 516 LVGDKW 521



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 517

Query: 62  GNKW 65
           G+KW
Sbjct: 518 GDKW 521


>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
          Length = 477

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 391 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 449

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 450 LVGDKW 455



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 392 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 451

Query: 62  GNKW 65
           G+KW
Sbjct: 452 GDKW 455


>gi|324510827|gb|ADY44523.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
          Length = 551

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L+  +T+ E+GG T+F  +  +V   K +A+FWYN   +  +D R  H+ CPV
Sbjct: 441 GNRVATILVY-MTEPEIGGGTVFTEVKTSVACTKNAALFWYNLLRSGEVDMRSRHAACPV 499

Query: 122 ALGNKW 127
             G KW
Sbjct: 500 LTGVKW 505



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 23/64 (35%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V +T+ E+GG T+F  +  +V   K +A+FW+N   +  +D    H+ CPV  
Sbjct: 442 NRVATILVYMTEPEIGGGTVFTEVKTSVACTKNAALFWYNLLRSGEVDMRSRHAACPVLT 501

Query: 62  GNKW 65
           G KW
Sbjct: 502 GVKW 505


>gi|328718391|ref|XP_003246474.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Acyrthosiphon pisum]
          Length = 514

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 30/70 (42%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH-ANTLLDYRMYHS 117
           V  GN+   +L   LTDV+  G T FP LN+    EKG+A+ W N H +N    Y   H 
Sbjct: 426 VKFGNRLVTVLFY-LTDVQNDGYTSFPMLNIIAPAEKGAALVWNNLHMSNGQKFYETLHG 484

Query: 118 GCPVALGNKW 127
            CP+  GNKW
Sbjct: 485 SCPLLKGNKW 494



 Score = 52.8 bits (125), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAH-ANTLLDYWMYHSGCPVALGNKW 65
           LTDV+  G T FP LN+    EKG+A+ W+N H +N    Y   H  CP+  GNKW
Sbjct: 439 LTDVQNDGYTSFPMLNIIAPAEKGAALVWNNLHMSNGQKFYETLHGSCPLLKGNKW 494


>gi|417402564|gb|JAA48127.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
          Length = 544

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 515

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 516 LVGDKW 521



 Score = 53.5 bits (127), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 517

Query: 62  GNKW 65
           G+KW
Sbjct: 518 GDKW 521


>gi|292621357|ref|XP_691737.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Danio rerio]
          Length = 538

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 25/66 (37%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +++  L+ V+ GG+T F   N +V   + +A+FW+N H N   +    H+GCPV
Sbjct: 451 GNRVATIMIY-LSPVQAGGSTAFIYANFSVPVVQNAALFWWNLHKNGQGNVDTLHAGCPV 509

Query: 122 ALGNKW 127
            +GNKW
Sbjct: 510 IVGNKW 515



 Score = 52.8 bits (125), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ V+ GG+T F   N +V   + +A+FW N H N   +    H+GCPV +
Sbjct: 452 NRVATIMIYLSPVQAGGSTAFIYANFSVPVVQNAALFWWNLHKNGQGNVDTLHAGCPVIV 511

Query: 62  GNKW 65
           GNKW
Sbjct: 512 GNKW 515


>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
 gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
          Length = 211

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/64 (40%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L DVE GG T FP L  +V P KG AV++   +++  L+ +  H G PV  
Sbjct: 136 NRISTLVMYLNDVEQGGETFFPHLKFSVSPRKGMAVYFEYFYSDQTLNDFTLHGGAPVVE 195

Query: 62  GNKW 65
           G KW
Sbjct: 196 GEKW 199



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/59 (42%), Positives = 33/59 (55%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+  L DVE GG T FP L  +V P KG AV++   +++  L+    H G PV  G KW
Sbjct: 141 LVMYLNDVEQGGETFFPHLKFSVSPRKGMAVYFEYFYSDQTLNDFTLHGGAPVVEGEKW 199


>gi|195145080|ref|XP_002013524.1| GL24183 [Drosophila persimilis]
 gi|194102467|gb|EDW24510.1| GL24183 [Drosophila persimilis]
          Length = 296

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 42/87 (48%), Gaps = 1/87 (1%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAV 99
           H+ H +    YW          G++   ++   L DVE GG T+FP L L + P KGSA+
Sbjct: 173 HDPHVDYHDLYWHPQEYEYHPFGDRQASVVFY-LNDVEDGGETVFPKLQLVIPPTKGSAL 231

Query: 100 FWYNAHANTLLDYRMYHSGCPVALGNK 126
            W+N       D R  H+ CPV  G K
Sbjct: 232 MWHNLRPWGEGDPRTQHASCPVLSGYK 258



 Score = 52.8 bits (125), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 29/68 (42%), Positives = 36/68 (52%), Gaps = 2/68 (2%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKL-- 68
           L DVE GG T+FP L L + P KGSA+ WHN       D    H+ CPV  G K   +  
Sbjct: 205 LNDVEDGGETVFPKLQLVIPPTKGSALMWHNLRPWGEGDPRTQHASCPVLSGYKQVAIQW 264

Query: 69  LLSGLTDV 76
           +L G  D+
Sbjct: 265 ILQGTRDL 272


>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 285

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +++  L+ VE GG+T F   N +V   K +A+FW+N H N   D    H+GCPV
Sbjct: 198 GNRVATVMIY-LSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPV 256

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 257 LIGDKW 262



 Score = 52.8 bits (125), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GG+T F   N +V   K +A+FW N H N   D    H+GCPV +
Sbjct: 199 NRVATVMIYLSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLI 258

Query: 62  GNKW 65
           G+KW
Sbjct: 259 GDKW 262


>gi|198449520|ref|XP_002136916.1| GA26928 [Drosophila pseudoobscura pseudoobscura]
 gi|198130644|gb|EDY67474.1| GA26928 [Drosophila pseudoobscura pseudoobscura]
          Length = 532

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 60  ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSG 118
           A+G++   +L   L DV+ GG+T F  L L V  E+G  +FWYN    T  LD R  H  
Sbjct: 433 AMGDRMATVLFY-LNDVKHGGSTAFTDLQLKVPSERGKVLFWYNMRGETHNLDRRTVHGS 491

Query: 119 CPVALGNK 126
           CPV  G K
Sbjct: 492 CPVIDGTK 499



 Score = 46.2 bits (108), Expect = 0.004,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L DV+ GG+T F  L L V  E+G  +FW+N    T  LD    H  CPV  G K
Sbjct: 445 LNDVKHGGSTAFTDLQLKVPSERGKVLFWYNMRGETHNLDRRTVHGSCPVIDGTK 499


>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
           rubripes]
          Length = 540

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GG+T F   N +V   K +A+FW+N H N   D    H+GCPV
Sbjct: 453 GNRVATFMIY-LSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGEGDADTLHAGCPV 511

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 512 LIGDKW 517



 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GG+T F   N +V   K +A+FW N H N   D    H+GCPV +
Sbjct: 454 NRVATFMIYLSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGEGDADTLHAGCPVLI 513

Query: 62  GNKW 65
           G+KW
Sbjct: 514 GDKW 517


>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
          Length = 542

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 455 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 513

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 514 LVGDKW 519



 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 456 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 515

Query: 62  GNKW 65
           G+KW
Sbjct: 516 GDKW 519


>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
           griseus]
          Length = 509

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 422 GNRVATFMIY-LSAVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 480

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 481 LVGDKW 486



 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 423 NRVATFMIYLSAVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 482

Query: 62  GNKW 65
           G+KW
Sbjct: 483 GDKW 486


>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
          Length = 542

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 455 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 513

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 514 LVGDKW 519



 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 456 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 515

Query: 62  GNKW 65
           G+KW
Sbjct: 516 GDKW 519


>gi|198449504|ref|XP_002136909.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
 gi|198130636|gb|EDY67467.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
          Length = 527

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 30/90 (33%), Positives = 44/90 (48%), Gaps = 2/90 (2%)

Query: 38  FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
           + +  H +   DY+   +    A G++   +L   L D++ GG+T F  L L V  E+G 
Sbjct: 413 YGYGGHYDIHFDYFNTSTPITKARGDRMATVLFY-LNDMKHGGSTAFTDLQLKVPSERGK 471

Query: 98  AVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
            +FWYN    T  +D R  H  CPV  G K
Sbjct: 472 VLFWYNMRGETHDVDSRTLHGACPVINGTK 501



 Score = 44.3 bits (103), Expect = 0.018,   Method: Composition-based stats.
 Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
           L D++ GG+T F  L L V  E+G  +FW+N    T  +D    H  CPV  G K
Sbjct: 447 LNDMKHGGSTAFTDLQLKVPSERGKVLFWYNMRGETHDVDSRTLHGACPVINGTK 501


>gi|348555277|ref|XP_003463450.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cavia porcellus]
          Length = 584

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 497 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 555

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 556 LVGDKW 561



 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 498 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 557

Query: 62  GNKW 65
           G+KW
Sbjct: 558 GDKW 561


>gi|268569334|ref|XP_002648232.1| C. briggsae CBR-PHY-3 protein [Caenorhabditis briggsae]
          Length = 246

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 7/78 (8%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D+WM        + N+ G L+   +   E GG T+FPS+  T+  ++G A FW+NA A+ 
Sbjct: 127 DWWMQ------MMRNRIGTLIFV-IKTAEKGGGTVFPSIGTTIKIDEGDAFFWFNAQADE 179

Query: 109 LLDYRMYHSGCPVALGNK 126
             +    H GCP+  G K
Sbjct: 180 SQEMLSDHGGCPIYSGRK 197



 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 30/54 (55%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           +   E GG T+FPS+  T+  ++G A FW NA A+   +    H GCP+  G K
Sbjct: 144 IKTAEKGGGTVFPSIGTTIKIDEGDAFFWFNAQADESQEMLSDHGGCPIYSGRK 197


>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
           niloticus]
          Length = 517

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GG+T F   N +V   + +A+FW+N H N   D    H+GCPV
Sbjct: 430 GNRVATFMIY-LSPVEAGGSTAFIYANFSVPVVEKAAIFWWNLHRNGEGDDDTLHAGCPV 488

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 489 LIGDKW 494



 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GG+T F   N +V   + +A+FW N H N   D    H+GCPV +
Sbjct: 431 NRVATFMIYLSPVEAGGSTAFIYANFSVPVVEKAAIFWWNLHRNGEGDDDTLHAGCPVLI 490

Query: 62  GNKW 65
           G+KW
Sbjct: 491 GDKW 494


>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
 gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
          Length = 319

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/66 (43%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G +   L++  L DV  GGAT FP L L V P +G+AVF+     +  LD R  H+G PV
Sbjct: 240 GQRMATLVIY-LNDVPAGGATAFPKLGLRVNPVQGNAVFFAYLGEDGSLDERTLHAGLPV 298

Query: 122 ALGNKW 127
             G KW
Sbjct: 299 EQGEKW 304



 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DV  GGAT FP L L V P +G+AVF+     +  LD    H+G PV  G KW
Sbjct: 250 LNDVPAGGATAFPKLGLRVNPVQGNAVFFAYLGEDGSLDERTLHAGLPVEQGEKW 304


>gi|195452770|ref|XP_002073492.1| GK14148 [Drosophila willistoni]
 gi|194169577|gb|EDW84478.1| GK14148 [Drosophila willistoni]
          Length = 444

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 28/57 (49%), Positives = 32/57 (56%), Gaps = 1/57 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 128
           L  V+ GGAT+FP LNL V  EKG  +FWYN    +L  D    H  CPV  G K G
Sbjct: 387 LNTVKHGGATVFPLLNLKVPAEKGKVLFWYNLDGESLDFDENTEHGVCPVVDGIKLG 443



 Score = 50.4 bits (119), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 27/57 (47%), Positives = 32/57 (56%), Gaps = 1/57 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKWG 66
           L  V+ GGAT+FP LNL V  EKG  +FW+N    +L  D    H  CPV  G K G
Sbjct: 387 LNTVKHGGATVFPLLNLKVPAEKGKVLFWYNLDGESLDFDENTEHGVCPVVDGIKLG 443


>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
 gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
          Length = 433

 Score = 53.5 bits (127), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 32/97 (32%), Positives = 45/97 (46%), Gaps = 21/97 (21%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN-----------------LT 90
            DY+ +  G    + N     +L  L+DV+ GG T+FPS                   +T
Sbjct: 265 FDYFFHEGG----MKNNRIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHACAKNGIT 320

Query: 91  VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           V P+KG A+ ++N      LD    H+GCPV LG KW
Sbjct: 321 VIPKKGDAILFWNMKVGGDLDGGSTHAGCPVVLGEKW 357



 Score = 51.2 bits (121), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 29/81 (35%), Positives = 38/81 (46%), Gaps = 17/81 (20%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLN-----------------LTVFPEKGSAVFWHNAHA 44
           NR     + L+DV+ GG T+FPS                   +TV P+KG A+ + N   
Sbjct: 277 NRIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHACAKNGITVIPKKGDAILFWNMKV 336

Query: 45  NTLLDYWMYHSGCPVALGNKW 65
              LD    H+GCPV LG KW
Sbjct: 337 GGDLDGGSTHAGCPVVLGEKW 357


>gi|241044303|ref|XP_002407179.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215492129|gb|EEC01770.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 456

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 24/54 (44%), Positives = 29/54 (53%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+DV LGG+T+FP     V P+ GSA FWYN   +   D    H  C V  G K
Sbjct: 378 LSDVNLGGSTVFPYAKAGVMPKMGSAAFWYNMREDGSYDRATLHGACSVLHGTK 431



 Score = 51.2 bits (121), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 23/54 (42%), Positives = 29/54 (53%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           L+DV LGG+T+FP     V P+ GSA FW+N   +   D    H  C V  G K
Sbjct: 378 LSDVNLGGSTVFPYAKAGVMPKMGSAAFWYNMREDGSYDRATLHGACSVLHGTK 431


>gi|444731524|gb|ELW71877.1| Prolyl 4-hydroxylase subunit alpha-3 [Tupaia chinensis]
          Length = 562

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   +    H+GCPV
Sbjct: 475 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGNSDTLHAGCPV 533

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 534 LVGDKW 539



 Score = 52.8 bits (125), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   +    H+GCPV +
Sbjct: 476 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGNSDTLHAGCPVLV 535

Query: 62  GNKW 65
           G+KW
Sbjct: 536 GDKW 539


>gi|323445926|gb|EGB02303.1| hypothetical protein AURANDRAFT_39521 [Aureococcus anophagefferens]
          Length = 239

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 6/61 (9%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA------LGNK 126
           L+DV  GGAT+FP+L L+V P +G  + + N   +T +D R  H+G P+A      + NK
Sbjct: 155 LSDVGAGGATVFPALGLSVAPRRGRLLVFANVADDTTVDARTVHAGEPIAGDTEKWIANK 214

Query: 127 W 127
           W
Sbjct: 215 W 215



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 36/61 (59%), Gaps = 6/61 (9%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA------LGNK 64
           L+DV  GGAT+FP+L L+V P +G  + + N   +T +D    H+G P+A      + NK
Sbjct: 155 LSDVGAGGATVFPALGLSVAPRRGRLLVFANVADDTTVDARTVHAGEPIAGDTEKWIANK 214

Query: 65  W 65
           W
Sbjct: 215 W 215


>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
 gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
          Length = 283

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 37/67 (55%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G +    L+  L  V+ GGAT FP L L V P KG+AVF+     + +LD    H+G P
Sbjct: 202 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGMLDDNTLHAGLP 261

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 262 VERGEKW 268



 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  V+ GGAT FP L L V P KG+AVF+     + +LD    H+G PV  G KW
Sbjct: 214 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGMLDDNTLHAGLPVERGEKW 268


>gi|301626782|ref|XP_002942567.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Xenopus
           (Silurana) tropicalis]
          Length = 716

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 35/55 (63%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+ V+LGG+T F   N +    K +AVFW+N H N L D    H+GCPV +G+KW
Sbjct: 639 LSHVDLGGSTAFVFANFSSPVVKNAAVFWWNLHRNGLGDEDTLHAGCPVIIGSKW 693



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 34/55 (61%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+ V+LGG+T F   N +    K +AVFW N H N L D    H+GCPV +G+KW
Sbjct: 639 LSHVDLGGSTAFVFANFSSPVVKNAAVFWWNLHRNGLGDEDTLHAGCPVIIGSKW 693


>gi|335294484|ref|XP_003357239.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Sus scrofa]
          Length = 545

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 458 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 516

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 517 LVGDKW 522



 Score = 52.8 bits (125), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 459 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 518

Query: 62  GNKW 65
           G+KW
Sbjct: 519 GDKW 522


>gi|221484253|gb|EEE22549.1| prolyl 4-hydroxylase alpha subunit, putative [Toxoplasma gondii
           GT1]
          Length = 967

 Score = 53.1 bits (126), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 26/59 (44%), Positives = 33/59 (55%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +L  L DV  GG T FP L L + P KGSAV W N   +  +D R+ H+G P   G K+
Sbjct: 593 ILVYLNDVTHGGETEFPHLGLKLSPTKGSAVMWRNVFESNQIDPRVLHAGLPTLAGQKY 651



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 31/57 (54%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L DV  GG T FP L L + P KGSAV W N   +  +D  + H+G P   G K+
Sbjct: 595 VYLNDVTHGGETEFPHLGLKLSPTKGSAVMWRNVFESNQIDPRVLHAGLPTLAGQKY 651


>gi|221505765|gb|EEE31410.1| prolyl 4-hydroxylase alpha subunit, putative [Toxoplasma gondii
           VEG]
          Length = 918

 Score = 53.1 bits (126), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 26/59 (44%), Positives = 33/59 (55%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +L  L DV  GG T FP L L + P KGSAV W N   +  +D R+ H+G P   G K+
Sbjct: 593 ILVYLNDVTHGGETEFPHLGLKLSPTKGSAVMWRNVFESNQIDPRVLHAGLPTLAGQKY 651



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 31/57 (54%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L DV  GG T FP L L + P KGSAV W N   +  +D  + H+G P   G K+
Sbjct: 595 VYLNDVTHGGETEFPHLGLKLSPTKGSAVMWRNVFESNQIDPRVLHAGLPTLAGQKY 651


>gi|237838357|ref|XP_002368476.1| 2OG-Fe(II) oxygenase family protein [Toxoplasma gondii ME49]
 gi|211966140|gb|EEB01336.1| 2OG-Fe(II) oxygenase family protein [Toxoplasma gondii ME49]
          Length = 967

 Score = 53.1 bits (126), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 26/59 (44%), Positives = 33/59 (55%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +L  L DV  GG T FP L L + P KGSAV W N   +  +D R+ H+G P   G K+
Sbjct: 593 ILVYLNDVAHGGETEFPHLGLKLSPTKGSAVMWRNVFESNQIDPRVLHAGLPTLAGQKY 651



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 31/57 (54%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L DV  GG T FP L L + P KGSAV W N   +  +D  + H+G P   G K+
Sbjct: 595 VYLNDVAHGGETEFPHLGLKLSPTKGSAVMWRNVFESNQIDPRVLHAGLPTLAGQKY 651


>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
 gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
          Length = 205

 Score = 53.1 bits (126), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 1/57 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNA-HANTLLDYWMYHSGCPVALGNKWG 66
           L DVE GGAT FP +N++V P KG  V +HN     T ++    H G PV  G KW 
Sbjct: 138 LNDVEEGGATDFPKINVSVKPNKGDVVVFHNCIEGTTEINPQALHGGSPVVAGEKWA 194



 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 35/60 (58%), Gaps = 1/60 (1%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA-HANTLLDYRMYHSGCPVALGNKWG 128
           L+ L DVE GGAT FP +N++V P KG  V ++N     T ++ +  H G PV  G KW 
Sbjct: 135 LAYLNDVEEGGATDFPKINVSVKPNKGDVVVFHNCIEGTTEINPQALHGGSPVVAGEKWA 194


>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
           harrisii]
          Length = 521

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GG+T F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 434 GNRVATFMIY-LSSVEAGGSTAFIYANFSVPVVKNAALFWWNLHRSGQGDGDTLHAGCPV 492

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 493 LVGDKW 498



 Score = 52.4 bits (124), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GG+T F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 435 NRVATFMIYLSSVEAGGSTAFIYANFSVPVVKNAALFWWNLHRSGQGDGDTLHAGCPVLV 494

Query: 62  GNKW 65
           G+KW
Sbjct: 495 GDKW 498


>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 280

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 37/65 (56%), Gaps = 1/65 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G +   ++L  LTDVE GG T FP + L V P+KG A+F+ N     + D +  H+G PV
Sbjct: 205 GQRLATIILY-LTDVEEGGGTSFPGIGLDVHPQKGGALFFRNTTPYGVPDRKTQHAGLPV 263

Query: 122 ALGNK 126
             G K
Sbjct: 264 EKGTK 268



 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 31/54 (57%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           LTDVE GG T FP + L V P+KG A+F+ N     + D    H+G PV  G K
Sbjct: 215 LTDVEEGGGTSFPGIGLDVHPQKGGALFFRNTTPYGVPDRKTQHAGLPVEKGTK 268


>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
          Length = 548

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 29/55 (52%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L  V+ GG T FP L   V P  G AVFWYN   + + D    H  CPV  G+KW
Sbjct: 452 LNPVKQGGFTAFPRLGAAVQPGYGDAVFWYNLQPSGVGDPLTLHGACPVLRGSKW 506



 Score = 51.6 bits (122), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/57 (42%), Positives = 30/57 (52%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + L  V+ GG T FP L   V P  G AVFW+N   + + D    H  CPV  G+KW
Sbjct: 450 IYLNPVKQGGFTAFPRLGAAVQPGYGDAVFWYNLQPSGVGDPLTLHGACPVLRGSKW 506


>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
           domestica]
          Length = 559

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GG+T F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 472 GNRVATFMIY-LSSVEAGGSTAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 530

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 531 LVGDKW 536



 Score = 52.4 bits (124), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GG+T F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 473 NRVATFMIYLSSVEAGGSTAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 532

Query: 62  GNKW 65
           G+KW
Sbjct: 533 GDKW 536


>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
 gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
          Length = 211

 Score = 52.8 bits (125), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L DVE GG T FP+L +   P+KG+AV++   + +  L+    HSG PV  
Sbjct: 133 NRISTLVVYLNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHSGEPVIQ 192

Query: 62  GNKW 65
           G KW
Sbjct: 193 GEKW 196



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/70 (38%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 58  PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           P +  N+   L++  L DVE GG T FP+L +   P+KG+AV++   + +  L+    HS
Sbjct: 128 PSSSNNRISTLVVY-LNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHS 186

Query: 118 GCPVALGNKW 127
           G PV  G KW
Sbjct: 187 GEPVIQGEKW 196


>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
 gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
          Length = 220

 Score = 52.8 bits (125), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 32/59 (54%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +L  L DVE GG T FP + + V P +G AV +YN   N  +D    H G PV  G KW
Sbjct: 145 ILMYLCDVEEGGETSFPEVGIKVKPIRGDAVLFYNCKPNGDVDPLSLHQGDPVTKGTKW 203



 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 30/55 (54%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP + + V P +G AV ++N   N  +D    H G PV  G KW
Sbjct: 149 LCDVEEGGETSFPEVGIKVKPIRGDAVLFYNCKPNGDVDPLSLHQGDPVTKGTKW 203


>gi|341893248|gb|EGT49183.1| hypothetical protein CAEBREN_01919 [Caenorhabditis brenneri]
          Length = 305

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 34/112 (30%), Positives = 53/112 (47%), Gaps = 6/112 (5%)

Query: 20  TIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHS-----GCPVALGNKWGKLLLSGLT 74
           ++ P+LN  +  +  +  +    H     DY +Y S     G     GN++  L++   T
Sbjct: 154 SLIPNLNFEIAEDIIALSYKSAGHYAPHHDYLVYPSENEWDGWMRNYGNRFATLIMVFET 213

Query: 75  DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
            +  GGAT+FP L  +V P  G A FW+NA  +   +    H GCP+  G K
Sbjct: 214 AIS-GGATVFPQLGASVKPNPGDAFFWFNAMGDGEQEDLSEHGGCPIYAGTK 264



 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 26/48 (54%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           GGAT+FP L  +V P  G A FW NA  +   +    H GCP+  G K
Sbjct: 217 GGATVFPQLGASVKPNPGDAFFWFNAMGDGEQEDLSEHGGCPIYAGTK 264


>gi|195505241|ref|XP_002099419.1| GE10893 [Drosophila yakuba]
 gi|194185520|gb|EDW99131.1| GE10893 [Drosophila yakuba]
          Length = 508

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 23/56 (41%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKW 65
           L D   GGAT+FP +++ V  E+G  +FWHN +  T  ++    H+ CPV  G+KW
Sbjct: 428 LNDAPNGGATVFPRIDVKVTAERGKVLFWHNLNGETHDVEPNTLHAACPVFQGSKW 483



 Score = 51.6 bits (122), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKW 127
           L D   GGAT+FP +++ V  E+G  +FW+N +  T  ++    H+ CPV  G+KW
Sbjct: 428 LNDAPNGGATVFPRIDVKVTAERGKVLFWHNLNGETHDVEPNTLHAACPVFQGSKW 483


>gi|328707957|ref|XP_001947811.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
           pisum]
          Length = 507

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 29/71 (40%), Positives = 37/71 (52%), Gaps = 2/71 (2%)

Query: 58  PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT-LLDYRMYH 116
           P   GN+   +L   +TDV   G T+FP+LN      KGSA+ W N   N   +    +H
Sbjct: 412 PKLNGNRLVTILFY-MTDVPDDGYTVFPNLNYVAHCRKGSALVWLNLRLNNGSVHSGTFH 470

Query: 117 SGCPVALGNKW 127
            GCPV  GNKW
Sbjct: 471 GGCPVIKGNKW 481



 Score = 51.2 bits (121), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 25/56 (44%), Positives = 31/56 (55%), Gaps = 1/56 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANT-LLDYWMYHSGCPVALGNKW 65
           +TDV   G T+FP+LN      KGSA+ W N   N   +    +H GCPV  GNKW
Sbjct: 426 MTDVPDDGYTVFPNLNYVAHCRKGSALVWLNLRLNNGSVHSGTFHGGCPVIKGNKW 481


>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
          Length = 483

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+ CPV
Sbjct: 396 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDSDTLHAACPV 454

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 455 LVGDKW 460



 Score = 52.0 bits (123), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+ CPV +
Sbjct: 397 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDSDTLHAACPVLV 456

Query: 62  GNKW 65
           G+KW
Sbjct: 457 GDKW 460


>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
 gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
          Length = 283

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G +   L+L  L  VE GG T FP + LT+ P +G+A+++   +A    D R  H+G PV
Sbjct: 204 GQRIATLILY-LNQVEEGGDTTFPDIGLTIHPRRGAALYFEYVNALGQTDPRTLHAGMPV 262

Query: 122 ALGNKW 127
             G KW
Sbjct: 263 ERGEKW 268



 Score = 49.3 bits (116), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  VE GG T FP + LT+ P +G+A+++   +A    D    H+G PV  G KW
Sbjct: 214 LNQVEEGGDTTFPDIGLTIHPRRGAALYFEYVNALGQTDPRTLHAGMPVERGEKW 268


>gi|308475644|ref|XP_003100040.1| hypothetical protein CRE_20852 [Caenorhabditis remanei]
 gi|308266092|gb|EFP10045.1| hypothetical protein CRE_20852 [Caenorhabditis remanei]
          Length = 300

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/112 (24%), Positives = 49/112 (43%), Gaps = 3/112 (2%)

Query: 18  GATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGL---T 74
            + + P+++ ++  +  +  +    H     DY  Y  G      N++G  + + +    
Sbjct: 156 ASRLIPAIDFSISEDISALSYNPGGHYAVHYDYLEYEEGSDDEFMNEFGNRMATFIMVFK 215

Query: 75  DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
               GG T+FPS    V  + G A  W+++  N  +D  + H GCPV  G K
Sbjct: 216 KATSGGGTLFPSFGTVVRADAGDAFLWFDSKENGEMDMNVEHGGCPVYDGQK 267



 Score = 42.7 bits (99), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 25/48 (52%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           GG T+FPS    V  + G A  W ++  N  +D  + H GCPV  G K
Sbjct: 220 GGGTLFPSFGTVVRADAGDAFLWFDSKENGEMDMNVEHGGCPVYDGQK 267


>gi|328718395|ref|XP_003246475.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
           pisum]
          Length = 518

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH-ANTLLDYRMYHSGC 119
            GN+   +L   LTDV+  G T FP LN+    EKG+A+ W N H +N    Y   H  C
Sbjct: 428 FGNRLVTVLFY-LTDVQNDGYTSFPLLNIIAPAEKGAALVWNNLHMSNGQKFYESLHGSC 486

Query: 120 PVALGNKW 127
           P+  GNKW
Sbjct: 487 PLLKGNKW 494



 Score = 52.0 bits (123), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAH-ANTLLDYWMYHSGCPVALGNKW 65
           LTDV+  G T FP LN+    EKG+A+ W+N H +N    Y   H  CP+  GNKW
Sbjct: 439 LTDVQNDGYTSFPLLNIIAPAEKGAALVWNNLHMSNGQKFYESLHGSCPLLKGNKW 494


>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
 gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
          Length = 297

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 39/69 (56%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           +++G +    L+  L   E GGAT FP + L V P KG+AV++     +  LD R  H+G
Sbjct: 214 LSVGGQRIATLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGALDERTLHAG 273

Query: 119 CPVALGNKW 127
            PVA G KW
Sbjct: 274 LPVAFGEKW 282



 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 30/51 (58%)

Query: 15  ELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           E GGAT FP + L V P KG+AV++     +  LD    H+G PVA G KW
Sbjct: 232 EAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGALDERTLHAGLPVAFGEKW 282


>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
 gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
          Length = 219

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D+++ +S    A  N+   L++  L  VE GG T FP LNL+V P+KG AV++   + + 
Sbjct: 132 DFFVENSAA--ASNNRMSTLVMY-LNHVEEGGETFFPKLNLSVSPKKGMAVYFEYFYQDE 188

Query: 109 LLDYRMYHSGCPVALGNKW 127
            ++    H G PV  G KW
Sbjct: 189 SINKLTLHGGAPVIKGEKW 207



 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  VE GG T FP LNL+V P+KG AV++   + +  ++    H G PV  G KW
Sbjct: 153 LNHVEEGGETFFPKLNLSVSPKKGMAVYFEYFYQDESINKLTLHGGAPVIKGEKW 207


>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
           africana]
          Length = 544

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N ++   K +A+FW+N H +   D    H+GCPV
Sbjct: 457 GNRVATFMIY-LSAVEAGGATAFIYANFSMPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 515

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 516 LVGDKW 521



 Score = 51.6 bits (122), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N ++   K +A+FW N H +   D    H+GCPV +
Sbjct: 458 NRVATFMIYLSAVEAGGATAFIYANFSMPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 517

Query: 62  GNKW 65
           G+KW
Sbjct: 518 GDKW 521


>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
 gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
          Length = 219

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D+++ +S    A  N+   L++  L  VE GG T FP LNL+V P+KG AV++   + + 
Sbjct: 132 DFFVENSAA--ASNNRMSTLVMY-LNHVEEGGETFFPKLNLSVSPKKGMAVYFEYFYQDE 188

Query: 109 LLDYRMYHSGCPVALGNKW 127
            ++    H G PV  G KW
Sbjct: 189 SINKLTLHGGAPVIKGEKW 207



 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  VE GG T FP LNL+V P+KG AV++   + +  ++    H G PV  G KW
Sbjct: 153 LNHVEEGGETFFPKLNLSVSPKKGMAVYFEYFYQDESINKLTLHGGAPVIKGEKW 207


>gi|341902051|gb|EGT57986.1| hypothetical protein CAEBREN_04818 [Caenorhabditis brenneri]
          Length = 314

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 48/113 (42%), Gaps = 5/113 (4%)

Query: 18  GATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP----VALGNKWGKLLLSGL 73
            A   P++N     +     +    H +   D+  +    P    +  GN+    ++S +
Sbjct: 171 AARHIPAMNFKYSEQVLGLSYLPGGHYSAHHDFLEFQGEAPEHAHLEEGNRIATFIIS-V 229

Query: 74  TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
              ++GG T+FP L  TV    G A  W+N   N  +D   YH GCPV  G K
Sbjct: 230 EKADVGGGTVFPYLKTTVRANPGDAYIWFNMLENQEIDDMSYHGGCPVISGKK 282



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/57 (47%), Positives = 29/57 (50%), Gaps = 2/57 (3%)

Query: 8   SVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           SVE  DV  GG T+FP L  TV    G A  W N   N  +D   YH GCPV  G K
Sbjct: 228 SVEKADV--GGGTVFPYLKTTVRANPGDAYIWFNMLENQEIDDMSYHGGCPVISGKK 282


>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
 gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
          Length = 288

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 219 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKW 273



 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 219 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKW 273


>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
 gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
          Length = 454

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 28/66 (42%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G + G L++  L     GGAT FP L L V P KG+AV++    ++  LD R  H+G PV
Sbjct: 245 GQRVGTLVIY-LNSPLAGGATAFPKLGLEVAPVKGNAVYFSYRKSDGALDERTLHAGLPV 303

Query: 122 ALGNKW 127
             G KW
Sbjct: 304 EAGEKW 309



 Score = 49.3 bits (116), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 23/49 (46%), Positives = 29/49 (59%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GGAT FP L L V P KG+AV++    ++  LD    H+G PV  G KW
Sbjct: 261 GGATAFPKLGLEVAPVKGNAVYFSYRKSDGALDERTLHAGLPVEAGEKW 309


>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
           fasciculatum]
          Length = 244

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 33/59 (55%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +L  L++VE GG T FP + + V P KG AV +YN   N  +D    H G PV  G KW
Sbjct: 146 ILMYLSEVEEGGETSFPEVGVKVKPVKGDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKW 204



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L++VE GG T FP + + V P KG AV ++N   N  +D    H G PV  G KW
Sbjct: 150 LSEVEEGGETSFPEVGVKVKPVKGDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKW 204


>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
 gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
          Length = 288

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 219 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKW 273



 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 219 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKW 273


>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
          Length = 289

 Score = 52.4 bits (124), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 36/67 (53%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G +    L+  L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G P
Sbjct: 208 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 267

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 268 VERGEKW 274



 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 220 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274


>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
           bacterium R229]
          Length = 289

 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 36/67 (53%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G +    L+  L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G P
Sbjct: 208 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 267

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 268 VERGEKW 274



 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 220 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274


>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
 gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           PSI07]
          Length = 289

 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 36/67 (53%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G +    L+  L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G P
Sbjct: 208 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 267

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 268 VERGEKW 274



 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 220 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274


>gi|195575137|ref|XP_002105536.1| GD21536 [Drosophila simulans]
 gi|194201463|gb|EDX15039.1| GD21536 [Drosophila simulans]
          Length = 465

 Score = 52.0 bits (123), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 1/55 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L D   GGAT+FP LN+ V  E+G  + WYN + +T  D    H+ CPV  G+++
Sbjct: 403 LNDAPYGGATVFPHLNVKVPAERGKVLLWYNLNGDT-QDVTTVHAACPVFHGSEY 456



 Score = 49.7 bits (117), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +R +     L D   GGAT+FP LN+ V  E+G  + W+N + +T  D    H+ CPV  
Sbjct: 394 DRMDTVMFYLNDAPYGGATVFPHLNVKVPAERGKVLLWYNLNGDT-QDVTTVHAACPVFH 452

Query: 62  GNKW 65
           G+++
Sbjct: 453 GSEY 456


>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
           CFBP2957]
 gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CFBP2957]
          Length = 289

 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 36/67 (53%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G +    L+  L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G P
Sbjct: 208 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 267

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 268 VERGEKW 274



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 220 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274


>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
 gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum IPO1609]
          Length = 280

 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 36/67 (53%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G +    L+  L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G P
Sbjct: 199 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 258

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 259 VERGEKW 265



 Score = 50.4 bits (119), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 211 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 265


>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
 gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
          Length = 288

 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 36/67 (53%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G +    L+  L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G P
Sbjct: 207 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 266

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 267 VERGEKW 273



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 219 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 273


>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CMR15]
          Length = 289

 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 36/67 (53%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G +    L+  L  V  GGAT FP L L V P KG+AVF+     +  LD +  H+G P
Sbjct: 208 VGGQRVATLVIYLNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDKTLHAGLP 267

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 268 VERGEKW 274



 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 30/55 (54%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  V  GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 220 LNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDKTLHAGLPVERGEKW 274


>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
 gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
          Length = 289

 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 36/67 (53%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G +    L+  L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G P
Sbjct: 208 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 267

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 268 VERGEKW 274



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 220 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274


>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
 gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
          Length = 292

 Score = 52.0 bits (123), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 36/67 (53%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G +    L+  L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G P
Sbjct: 211 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 270

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 271 VERGEKW 277



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 223 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 277


>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
          Length = 331

 Score = 52.0 bits (123), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G++   +LL  L +VE GGAT FP+    V P+KG A+++    A+  +D    H GC V
Sbjct: 247 GHRIATVLLY-LNEVEAGGATFFPNPGFEVRPQKGGALYFAYQQADGSMDPTSLHEGCAV 305

Query: 122 ALGNKW 127
             G KW
Sbjct: 306 TQGEKW 311



 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +VE GGAT FP+    V P+KG A+++    A+  +D    H GC V  G KW
Sbjct: 257 LNEVEAGGATFFPNPGFEVRPQKGGALYFAYQQADGSMDPTSLHEGCAVTQGEKW 311


>gi|260806889|ref|XP_002598316.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
 gi|229283588|gb|EEN54328.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
          Length = 531

 Score = 52.0 bits (123), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 42/66 (63%), Gaps = 3/66 (4%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    LL  L+DV+ GGAT+FP +++ V P   +AVFW    +  ++   + H+GCPV
Sbjct: 450 GNRVVTFLLY-LSDVDAGGATVFPIVDVAV-PINSAAVFWSMERSGAVVPNSL-HAGCPV 506

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 507 LIGSKW 512



 Score = 51.2 bits (121), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 40/64 (62%), Gaps = 2/64 (3%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+DV+ GGAT+FP +++ V P   +AVFW    +  ++   + H+GCPV +
Sbjct: 451 NRVVTFLLYLSDVDAGGATVFPIVDVAV-PINSAAVFWSMERSGAVVPNSL-HAGCPVLI 508

Query: 62  GNKW 65
           G+KW
Sbjct: 509 GSKW 512


>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
 gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
          Length = 297

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 39/69 (56%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           +++G +    L+  L   E GGAT FP + L V P KG+AV++     +  LD R  H+G
Sbjct: 214 LSVGGQRIATLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDERTLHAG 273

Query: 119 CPVALGNKW 127
            PVA G KW
Sbjct: 274 LPVASGEKW 282



 Score = 49.3 bits (116), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L   E GGAT FP + L V P KG+AV++     +  LD    H+G PVA G KW
Sbjct: 228 LNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDERTLHAGLPVASGEKW 282


>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
 gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
          Length = 294

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 42/83 (50%), Gaps = 7/83 (8%)

Query: 49  DYWMY-HSGCPVAL---GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 104
           DY+     G P  L   G + G L++  L + E GGATIFP + L V P +G+AVF+   
Sbjct: 203 DYFAPGEPGTPTILKRGGQRVGTLVIY-LNEPERGGATIFPEVPLQVVPRRGNAVFFSYE 261

Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
             +     R  H G PV  G KW
Sbjct: 262 RPDP--STRTLHGGAPVLAGEKW 282



 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L + E GGATIFP + L V P +G+AVF+     +        H G PV  G KW
Sbjct: 230 LNEPERGGATIFPEVPLQVVPRRGNAVFFSYERPDP--STRTLHGGAPVLAGEKW 282


>gi|284035817|ref|YP_003385747.1| 2OG-Fe(II) oxygenase [Spirosoma linguale DSM 74]
 gi|283815110|gb|ADB36948.1| 2OG-Fe(II) oxygenase [Spirosoma linguale DSM 74]
          Length = 328

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/60 (41%), Positives = 36/60 (60%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           +L  L D  +GG T FP LN+ V P++GSA+++ N   N LL     H+G P+A G K+ 
Sbjct: 258 MLVYLNDDFVGGETYFPELNMNVHPKRGSALYFLNRDDNNLLLLNSVHAGLPIAQGMKYA 317



 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/58 (43%), Positives = 35/58 (60%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           V L D  +GG T FP LN+ V P++GSA+++ N   N LL     H+G P+A G K+ 
Sbjct: 260 VYLNDDFVGGETYFPELNMNVHPKRGSALYFLNRDDNNLLLLNSVHAGLPIAQGMKYA 317


>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
 gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
          Length = 294

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 42/83 (50%), Gaps = 7/83 (8%)

Query: 49  DYWMY-HSGCPVAL---GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 104
           DY+     G P  L   G + G L++  L + E GGATIFP + L V P +G+AVF+   
Sbjct: 203 DYFAPGEPGTPTILKRGGQRVGTLVIY-LNEPERGGATIFPEVPLQVVPRRGNAVFFSYE 261

Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
             +     R  H G PV  G KW
Sbjct: 262 RPDP--STRTLHGGAPVLAGEKW 282



 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L + E GGATIFP + L V P +G+AVF+     +        H G PV  G KW
Sbjct: 230 LNEPERGGATIFPEVPLQVVPRRGNAVFFSYERPDP--STRTLHGGAPVLAGEKW 282


>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
 gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
          Length = 211

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L DVE GG T FP+L +   P+KG+AV++   + +  L+    HSG PV  
Sbjct: 133 NRICTLVVYLNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHSGEPVIQ 192

Query: 62  GNKW 65
           G KW
Sbjct: 193 GEKW 196



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/70 (38%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 58  PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           P +  N+   L++  L DVE GG T FP+L +   P+KG+AV++   + +  L+    HS
Sbjct: 128 PSSSNNRICTLVVY-LNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHS 186

Query: 118 GCPVALGNKW 127
           G PV  G KW
Sbjct: 187 GEPVIQGEKW 196


>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
           19424]
 gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
           taiwanensis LMG 19424]
          Length = 296

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 39/69 (56%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           +++G +    L+  L   E GGAT FP + L V P KG+AV++     +  LD R  H+G
Sbjct: 213 LSVGGQRIATLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDDRTLHAG 272

Query: 119 CPVALGNKW 127
            PVA G KW
Sbjct: 273 LPVAAGEKW 281



 Score = 48.9 bits (115), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 30/51 (58%)

Query: 15  ELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           E GGAT FP + L V P KG+AV++     +  LD    H+G PVA G KW
Sbjct: 231 EAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDDRTLHAGLPVAAGEKW 281


>gi|341902054|gb|EGT57989.1| hypothetical protein CAEBREN_08644 [Caenorhabditis brenneri]
          Length = 314

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 49/113 (43%), Gaps = 5/113 (4%)

Query: 18  GATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP----VALGNKWGKLLLSGL 73
            A + PSLN  +        +    H +   DY  +         +A GN+    +++ +
Sbjct: 171 AARLIPSLNFKLSEPIVGLSYMPGGHYSAHHDYLDFDDEAAKQEHLANGNRMVTFIIA-V 229

Query: 74  TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
              ++GG T+FP +  TV    G A  W+N   N  +D   +H GCPV  G K
Sbjct: 230 EKADVGGGTVFPEIQTTVRANPGDAYLWFNMLENQEMDDMSFHGGCPVLAGKK 282



 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 30/63 (47%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + +   ++GG T+FP +  TV    G A  W N   N  +D   +H GCPV  
Sbjct: 220 NRMVTFIIAVEKADVGGGTVFPEIQTTVRANPGDAYLWFNMLENQEMDDMSFHGGCPVLA 279

Query: 62  GNK 64
           G K
Sbjct: 280 GKK 282


>gi|341896531|gb|EGT52466.1| hypothetical protein CAEBREN_02034 [Caenorhabditis brenneri]
          Length = 317

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 1/65 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +++      E+GG T+FP L  TV  E G A+ W+N   N  ++  + H+GCPV
Sbjct: 222 GNRMTTVIMV-FQKAEVGGGTMFPLLKTTVRAEPGDAIVWFNMRKNGEMEPLVEHAGCPV 280

Query: 122 ALGNK 126
             G K
Sbjct: 281 RSGRK 285



 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 32/63 (50%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     +     E+GG T+FP L  TV  E G A+ W N   N  ++  + H+GCPV  
Sbjct: 223 NRMTTVIMVFQKAEVGGGTMFPLLKTTVRAEPGDAIVWFNMRKNGEMEPLVEHAGCPVRS 282

Query: 62  GNK 64
           G K
Sbjct: 283 GRK 285


>gi|321466507|gb|EFX77502.1| hypothetical protein DAPPUDRAFT_25542 [Daphnia pulex]
          Length = 92

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 37/60 (61%), Gaps = 1/60 (1%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA-NTLLDYRMYHSGCPVALGNKW 127
           L+  L +VE GGAT+FP +   V P KGSA+FWYN +  N   +    H+ CP+  G+KW
Sbjct: 33  LIIYLNEVENGGATVFPIVKTRVKPVKGSALFWYNLNPDNGEGNPTTLHASCPILSGSKW 92



 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 25/58 (43%), Positives = 36/58 (62%), Gaps = 1/58 (1%)

Query: 9  VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHA-NTLLDYWMYHSGCPVALGNKW 65
          + L +VE GGAT+FP +   V P KGSA+FW+N +  N   +    H+ CP+  G+KW
Sbjct: 35 IYLNEVENGGATVFPIVKTRVKPVKGSALFWYNLNPDNGEGNPTTLHASCPILSGSKW 92


>gi|428671901|gb|EKX72816.1| conserved hypothetical protein [Babesia equi]
          Length = 234

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 33/55 (60%), Gaps = 1/55 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DVE GG T+FP+L L + P   SAVFW N +    +D RM H+G    +G K+
Sbjct: 166 LNDVE-GGETVFPNLGLAIKPVGNSAVFWRNLNGENEMDERMIHAGTTPKVGTKY 219



 Score = 49.3 bits (116), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 32/55 (58%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T+FP+L L + P   SAVFW N +    +D  M H+G    +G K+
Sbjct: 166 LNDVE-GGETVFPNLGLAIKPVGNSAVFWRNLNGENEMDERMIHAGTTPKVGTKY 219


>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 226

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 15/80 (18%)

Query: 2   NRAEMSSV--ELTDVELGGATIFPSLN-------------LTVFPEKGSAVFWHNAHANT 46
           +R  M++V   L+DVE GG T+FP  N             L V PEKG  + +++   + 
Sbjct: 132 HRNRMATVFWYLSDVEKGGETVFPRFNGAQETSMKDCKTGLKVKPEKGKVIIFYSMTPDG 191

Query: 47  LLDYWMYHSGCPVALGNKWG 66
            LD +  H  CPV  G KW 
Sbjct: 192 ALDEYSLHGACPVQKGTKWA 211



 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 32/69 (46%), Gaps = 13/69 (18%)

Query: 73  LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
           L+DVE GG T+FP  N             L V PEKG  + +Y+   +  LD    H  C
Sbjct: 143 LSDVEKGGETVFPRFNGAQETSMKDCKTGLKVKPEKGKVIIFYSMTPDGALDEYSLHGAC 202

Query: 120 PVALGNKWG 128
           PV  G KW 
Sbjct: 203 PVQKGTKWA 211


>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
 gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
          Length = 296

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 36/69 (52%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           +A   +    L++ L D   GG T+FP L L V P +G+A ++     N  +D R  H+ 
Sbjct: 209 IARSGQRVSTLVTYLNDAPEGGQTVFPQLGLAVSPIRGNACYFEYCDGNGRVDARSLHAS 268

Query: 119 CPVALGNKW 127
            PV  G+KW
Sbjct: 269 APVTRGDKW 277



 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 29/55 (52%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L D   GG T+FP L L V P +G+A ++     N  +D    H+  PV  G+KW
Sbjct: 223 LNDAPEGGQTVFPQLGLAVSPIRGNACYFEYCDGNGRVDARSLHASAPVTRGDKW 277


>gi|328718387|ref|XP_001952104.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
           pisum]
          Length = 293

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/70 (42%), Positives = 39/70 (55%), Gaps = 2/70 (2%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH-ANTLLDYRMYHS 117
           V  GN+   +L   LTDV+  G T FP LN+    +KG+A+ W N H +N  L Y   H 
Sbjct: 205 VEFGNRLVTVLFY-LTDVQNDGYTSFPLLNINAPVDKGAALVWNNLHMSNGQLFYESLHG 263

Query: 118 GCPVALGNKW 127
            CP+  GNKW
Sbjct: 264 SCPLLKGNKW 273



 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAH-ANTLLDYWMYHSGCPVALGNKW 65
           LTDV+  G T FP LN+    +KG+A+ W+N H +N  L Y   H  CP+  GNKW
Sbjct: 218 LTDVQNDGYTSFPLLNINAPVDKGAALVWNNLHMSNGQLFYESLHGSCPLLKGNKW 273


>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 267

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 40/79 (50%), Gaps = 14/79 (17%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA-------------HANT 108
           GN+   +L+  L DV  GGAT F ++ L V P  G+A+FWYN              +   
Sbjct: 170 GNRLATMLIY-LADVAGGGATAFINMRLAVKPTLGTALFWYNLKPYDGPIVNESFWNQRR 228

Query: 109 LLDYRMYHSGCPVALGNKW 127
             D R +H GCPV  G+KW
Sbjct: 229 FGDPRTFHMGCPVLTGSKW 247



 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 36/77 (46%), Gaps = 13/77 (16%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNA-------------HANTLL 48
           NR     + L DV  GGAT F ++ L V P  G+A+FW+N              +     
Sbjct: 171 NRLATMLIYLADVAGGGATAFINMRLAVKPTLGTALFWYNLKPYDGPIVNESFWNQRRFG 230

Query: 49  DYWMYHSGCPVALGNKW 65
           D   +H GCPV  G+KW
Sbjct: 231 DPRTFHMGCPVLTGSKW 247


>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
 gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
 gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
          Length = 297

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 47/106 (44%), Gaps = 24/106 (22%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           ++AH     DY  +H    + LG +    +L  LTDV+ GG T+FP+             
Sbjct: 133 YDAH----FDY--FHDRNNLKLGGQRVATVLMYLTDVKKGGETVFPNAEGSHLQYKDETW 186

Query: 89  -------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                  L V P+KG A+ ++N H N   D    H  CPV  G KW
Sbjct: 187 SECSRSGLAVKPKKGDALLFFNLHVNATADTGSLHGSCPVIEGEKW 232



 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 32/73 (43%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
           LTDV+ GG T+FP+                    L V P+KG A+ + N H N   D   
Sbjct: 160 LTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNATADTGS 219

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 220 LHGSCPVIEGEKW 232


>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
           aries]
          Length = 514

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+ CPV
Sbjct: 427 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPV 485

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 486 LVGDKW 491



 Score = 50.4 bits (119), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+ CPV +
Sbjct: 428 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLV 487

Query: 62  GNKW 65
           G+KW
Sbjct: 488 GDKW 491


>gi|339236271|ref|XP_003379690.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
 gi|316977627|gb|EFV60702.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
          Length = 558

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 25/66 (37%), Positives = 33/66 (50%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++  E GG T+F S  +TV P K  A FW+N       D    H+ CPV
Sbjct: 469 GNRIATVLFY-ISQPEAGGGTVFTSHKITVEPSKYDAAFWFNVLQGGEPDMSTRHAACPV 527

Query: 122 ALGNKW 127
             G KW
Sbjct: 528 LAGTKW 533



 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 29/64 (45%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++  E GG T+F S  +TV P K  A FW N       D    H+ CPV  
Sbjct: 470 NRIATVLFYISQPEAGGGTVFTSHKITVEPSKYDAAFWFNVLQGGEPDMSTRHAACPVLA 529

Query: 62  GNKW 65
           G KW
Sbjct: 530 GTKW 533


>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
 gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
 gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
          Length = 544

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+ CPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPV 515

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 516 LVGDKW 521



 Score = 50.4 bits (119), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+ CPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLV 517

Query: 62  GNKW 65
           G+KW
Sbjct: 518 GDKW 521


>gi|407682954|ref|YP_006798128.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
           'English Channel 673']
 gi|407244565|gb|AFT73751.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
           'English Channel 673']
          Length = 376

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 29/48 (60%)

Query: 80  GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           GAT FP L +T++PEKG  V + N   N  L+ + YH G PV+   KW
Sbjct: 309 GATHFPKLGITIYPEKGKLVSFLNVDKNLALEKQSYHCGQPVSTNEKW 356



 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 28/48 (58%)

Query: 18  GATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GAT FP L +T++PEKG  V + N   N  L+   YH G PV+   KW
Sbjct: 309 GATHFPKLGITIYPEKGKLVSFLNVDKNLALEKQSYHCGQPVSTNEKW 356


>gi|406596009|ref|YP_006747139.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii ATCC
           27126]
 gi|406373330|gb|AFS36585.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii ATCC
           27126]
          Length = 376

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 29/48 (60%)

Query: 80  GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           GAT FP L +T++PEKG  V + N   N  L+ + YH G PV+   KW
Sbjct: 309 GATHFPKLGITIYPEKGKLVSFLNVDKNLALEKQSYHCGQPVSTNEKW 356



 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 28/48 (58%)

Query: 18  GATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GAT FP L +T++PEKG  V + N   N  L+   YH G PV+   KW
Sbjct: 309 GATHFPKLGITIYPEKGKLVSFLNVDKNLALEKQSYHCGQPVSTNEKW 356


>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
          Length = 478

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+ CPV
Sbjct: 391 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPV 449

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 450 LVGDKW 455



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 35/64 (54%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+ CPV +
Sbjct: 392 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLV 451

Query: 62  GNKW 65
           G+KW
Sbjct: 452 GDKW 455


>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
 gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
          Length = 282

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 36/67 (53%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G +    L+  L   + GGAT FP + L V P KG+AV++     +  LD R  H+G P
Sbjct: 201 VGGQRTATLVIYLNTPQAGGATAFPRIGLEVAPVKGNAVYFSYLQPDGKLDERTLHAGLP 260

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 261 VQSGEKW 267



 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 30/55 (54%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L   + GGAT FP + L V P KG+AV++     +  LD    H+G PV  G KW
Sbjct: 213 LNTPQAGGATAFPRIGLEVAPVKGNAVYFSYLQPDGKLDERTLHAGLPVQSGEKW 267


>gi|281353153|gb|EFB28737.1| hypothetical protein PANDA_003344 [Ailuropoda melanoleuca]
          Length = 456

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 392 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 450

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 451 LVGDKW 456



 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 393 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 452

Query: 62  GNKW 65
           G+KW
Sbjct: 453 GDKW 456


>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 299

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 56/138 (40%), Gaps = 26/138 (18%)

Query: 8   SVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGK 67
           S+E  + ++  AT+ P  N   F      V  +N        Y ++    P   G +  +
Sbjct: 158 SLEQVEEKMAKATMIPRENGEAF-----NVLKYNVGQKYDCHYDVFD---PAEYGPQPSQ 209

Query: 68  LLLS---GLTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTL 109
            + S    L+DVE GG T+FP  N               + V P +G A+ +Y+ H N  
Sbjct: 210 RMASFLLYLSDVEEGGETMFPFENFQNMNIGFDYKKCIGMKVKPRQGDALLFYSMHPNGT 269

Query: 110 LDYRMYHSGCPVALGNKW 127
            D    H  CPV  G KW
Sbjct: 270 FDKSALHGSCPVIKGEKW 287


>gi|242003035|ref|XP_002436120.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215499456|gb|EEC08950.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 173

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 36/67 (53%), Gaps = 12/67 (17%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYN---AHANTLL---------DYRMYHSGCP 120
           L+ VE GGAT+F  L + V PE G A+FW+N      N+L          D R  H  CP
Sbjct: 77  LSSVERGGATLFKRLRVRVQPEAGMALFWHNLPPGSTNSLPSCCVHRSVGDERTEHGACP 136

Query: 121 VALGNKW 127
           V +G+KW
Sbjct: 137 VLVGSKW 143



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/69 (42%), Positives = 36/69 (52%), Gaps = 12/69 (17%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHN---AHANTLL---------DYWMYHSG 56
           V L+ VE GGAT+F  L + V PE G A+FWHN      N+L          D    H  
Sbjct: 75  VYLSSVERGGATLFKRLRVRVQPEAGMALFWHNLPPGSTNSLPSCCVHRSVGDERTEHGA 134

Query: 57  CPVALGNKW 65
           CPV +G+KW
Sbjct: 135 CPVLVGSKW 143


>gi|341893180|gb|EGT49115.1| CBN-PHY-4 protein [Caenorhabditis brenneri]
          Length = 282

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L+  L   + GG T+FP +NL + P+ G  + W N  ++   D +  H+ CP+
Sbjct: 180 GNRIATVLVI-LQIAKKGGTTVFPKINLNIRPKAGDVIVWLNTLSSGESDPQTLHAACPI 238

Query: 122 ALGNKWGKLL 131
             GNK G  L
Sbjct: 239 KEGNKIGATL 248



 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 34/68 (50%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L   + GG T+FP +NL + P+ G  + W N  ++   D    H+ CP+  
Sbjct: 181 NRIATVLVILQIAKKGGTTVFPKINLNIRPKAGDVIVWLNTLSSGESDPQTLHAACPIKE 240

Query: 62  GNKWGKLL 69
           GNK G  L
Sbjct: 241 GNKIGATL 248


>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
 gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Ralstonia solanacearum GMI1000]
          Length = 289

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 35/67 (52%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +G +    L+  L  V  GGAT FP L L V P KG+AVF+     +  LD    H+G P
Sbjct: 208 VGGQRVATLVIYLNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 267

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 268 VERGEKW 274



 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 30/55 (54%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L  V  GGAT FP L L V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 220 LNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274


>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 303

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 39/90 (43%), Gaps = 21/90 (23%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLN---------------------LTVFPEKGS 97
           +A G      +L  L+DVE GG T+FPS                       + V P KG 
Sbjct: 152 IARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGD 211

Query: 98  AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           A+ +++ H N + D    H GCPV  G KW
Sbjct: 212 ALLFFSLHPNAIPDTSSLHGGCPVIEGEKW 241



 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 33/76 (43%), Gaps = 21/76 (27%)

Query: 11  LTDVELGGATIFPSLN---------------------LTVFPEKGSAVFWHNAHANTLLD 49
           L+DVE GG T+FPS                       + V P KG A+ + + H N + D
Sbjct: 166 LSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPD 225

Query: 50  YWMYHSGCPVALGNKW 65
               H GCPV  G KW
Sbjct: 226 TSSLHGGCPVIEGEKW 241


>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
 gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
          Length = 311

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 45/103 (43%), Gaps = 28/103 (27%)

Query: 53  YHSGCPVALGNKWGKLLLSGLTDVELGGATIFP------------------------SLN 88
           +H    V  G +    +L  L+DVE GG T+FP                        S N
Sbjct: 173 FHDEHNVKNGGQRVATILMYLSDVEEGGETVFPLGTPLGGRDPEKSGVTGDNACELASQN 232

Query: 89  ----LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
               L V P +G A+ ++NAH +  +D +  H+GCPV  G KW
Sbjct: 233 DPRVLAVKPRRGDALLFFNAHLSGEMDEKANHAGCPVNRGTKW 275



 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 28/83 (33%)

Query: 11  LTDVELGGATIFP------------------------SLN----LTVFPEKGSAVFWHNA 42
           L+DVE GG T+FP                        S N    L V P +G A+ + NA
Sbjct: 193 LSDVEEGGETVFPLGTPLGGRDPEKSGVTGDNACELASQNDPRVLAVKPRRGDALLFFNA 252

Query: 43  HANTLLDYWMYHSGCPVALGNKW 65
           H +  +D    H+GCPV  G KW
Sbjct: 253 HLSGEMDEKANHAGCPVNRGTKW 275


>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 273

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 44/95 (46%), Gaps = 18/95 (18%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSL----------------NLTVF 92
           DY+ +  G     GN+W  +LL  LT+ E GG T+FP +                NL V 
Sbjct: 139 DYFFHKDGSSNG-GNRWATVLLY-LTETEEGGETVFPKIPAPNGINVGFSECAKYNLAVK 196

Query: 93  PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           P KG A+ +++      L+ R  H  CPV  G K+
Sbjct: 197 PHKGDALLFHSMKPTGELEERSMHGACPVIRGEKF 231



 Score = 41.6 bits (96), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 31/71 (43%), Gaps = 16/71 (22%)

Query: 11  LTDVELGGATIFPSL----------------NLTVFPEKGSAVFWHNAHANTLLDYWMYH 54
           LT+ E GG T+FP +                NL V P KG A+ +H+      L+    H
Sbjct: 161 LTETEEGGETVFPKIPAPNGINVGFSECAKYNLAVKPHKGDALLFHSMKPTGELEERSMH 220

Query: 55  SGCPVALGNKW 65
             CPV  G K+
Sbjct: 221 GACPVIRGEKF 231


>gi|301104296|ref|XP_002901233.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262101167|gb|EEY59219.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 535

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 19/118 (16%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLL 70
           L+DV  GG T+FP   L   PE     + H    N+  DY         A+G    +L  
Sbjct: 321 LSDVPRGGQTVFP---LAEMPEGLPTEYQHPP--NSAQDYE--------AIG---AELFE 364

Query: 71  SGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
            G  ++++       S  L  +P KG AV +Y+   N  LD +  H GCPV  G KWG
Sbjct: 365 PGSWEMDMVRKC---STKLASYPSKGGAVLFYSQKPNGELDPKSLHGGCPVLEGTKWG 419


>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
 gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
          Length = 287

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 40/79 (50%), Gaps = 10/79 (12%)

Query: 54  HSGCPVAL---GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANT 108
             G P  L   G + G L++  L   E GG T FP +NL V P KG+AVF  +  AH +T
Sbjct: 202 QPGTPTILKRGGQRVGTLVMY-LNTPERGGGTTFPDVNLEVAPIKGNAVFFSYERAHPST 260

Query: 109 LLDYRMYHSGCPVALGNKW 127
               R  H G PV  G KW
Sbjct: 261 ----RSLHGGAPVLAGEKW 275



 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 26/57 (45%), Positives = 30/57 (52%), Gaps = 6/57 (10%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
           L   E GG T FP +NL V P KG+AVF  +  AH +T       H G PV  G KW
Sbjct: 223 LNTPERGGGTTFPDVNLEVAPIKGNAVFFSYERAHPSTR----SLHGGAPVLAGEKW 275


>gi|255085194|ref|XP_002505028.1| predicted protein [Micromonas sp. RCC299]
 gi|226520297|gb|ACO66286.1| predicted protein [Micromonas sp. RCC299]
          Length = 439

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 23/59 (38%), Positives = 31/59 (52%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L+ L + E GG T FP L     P+ G A+ W+N   N   D++  H+G PV  G KW 
Sbjct: 342 LAYLVEPERGGRTYFPKLRAGFEPKVGDALLWWNVDENGAEDFKTLHAGEPVEAGAKWA 400



 Score = 45.8 bits (107), Expect = 0.005,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 29/64 (45%)

Query: 3   RAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG 62
           RA      L + E GG T FP L     P+ G A+ W N   N   D+   H+G PV  G
Sbjct: 337 RAVTCLAYLVEPERGGRTYFPKLRAGFEPKVGDALLWWNVDENGAEDFKTLHAGEPVEAG 396

Query: 63  NKWG 66
            KW 
Sbjct: 397 AKWA 400


>gi|363729586|ref|XP_417248.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Gallus gallus]
          Length = 542

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +++  L+ VE GG+T F   N +V   K +A+FW+N   N   D    H+GCPV
Sbjct: 455 GNRIATVMIY-LSAVEAGGSTAFIYANFSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPV 513

Query: 122 ALGNKW 127
             G+KW
Sbjct: 514 LAGDKW 519



 Score = 49.3 bits (116), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 34/64 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GG+T F   N +V   K +A+FW N   N   D    H+GCPV  
Sbjct: 456 NRIATVMIYLSAVEAGGSTAFIYANFSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLA 515

Query: 62  GNKW 65
           G+KW
Sbjct: 516 GDKW 519


>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
          Length = 297

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 42/98 (42%), Gaps = 20/98 (20%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------L 89
            DY  +H    + LG +    +L  LTDV  GG T+FP+                    L
Sbjct: 137 FDY--FHDRNNLKLGGQRVATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGL 194

Query: 90  TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            V P+KG A+ ++N H N   D    H  CPV  G KW
Sbjct: 195 AVKPKKGDALLFFNLHVNATADTGSLHGSCPVIEGEKW 232



 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 31/73 (42%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
           LTDV  GG T+FP+                    L V P+KG A+ + N H N   D   
Sbjct: 160 LTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNATADTGS 219

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 220 LHGSCPVIEGEKW 232


>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
 gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
          Length = 285

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 35/69 (50%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
            A G +    L+  L+DVE GG T+FP   L V   +G A+++   +    LD    H G
Sbjct: 205 TARGGQRVATLIVYLSDVEGGGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGG 264

Query: 119 CPVALGNKW 127
            PV  G+KW
Sbjct: 265 APVTSGDKW 273



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 31/57 (54%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L+DVE GG T+FP   L V   +G A+++   +    LD    H G PV  G+KW
Sbjct: 217 VYLSDVEGGGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKW 273


>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 284

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 58/144 (40%), Gaps = 28/144 (19%)

Query: 4   AEMSSVELTDVE--LGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           +E  +  L +VE  +  AT+ P  N   F      V  +N        Y ++    P   
Sbjct: 137 SEDKTQSLAEVEEKMARATMIPRQNGEAF-----NVLRYNPGQKYDCHYDVFD---PAEY 188

Query: 62  GNKWGKLLLS---GLTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYN 103
           G +  + + S    L+DVE GG T+FP  N               L V P +G A+ +Y+
Sbjct: 189 GPQPSQRMASFLLYLSDVEEGGETMFPFENFQNMNTGYNYKDCIGLKVKPRQGDALLFYS 248

Query: 104 AHANTLLDYRMYHSGCPVALGNKW 127
            H N   D    H  CPV  G KW
Sbjct: 249 MHPNGTFDKTALHGSCPVIKGEKW 272


>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
 gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
          Length = 282

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 35/69 (50%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
            A G +    L+  L+DVE GG T+FP   L V   +G A+++   +    LD    H G
Sbjct: 202 TARGGQRVATLIVYLSDVEGGGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGG 261

Query: 119 CPVALGNKW 127
            PV  G+KW
Sbjct: 262 APVTSGDKW 270



 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 31/57 (54%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L+DVE GG T+FP   L V   +G A+++   +    LD    H G PV  G+KW
Sbjct: 214 VYLSDVEGGGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKW 270


>gi|326914688|ref|XP_003203656.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Meleagris
           gallopavo]
          Length = 539

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +++  L+ VE GG+T F   N +V   K +A+FW+N   N   D    H+GCPV
Sbjct: 452 GNRIATVMIY-LSAVEAGGSTAFIYANFSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPV 510

Query: 122 ALGNKW 127
             G+KW
Sbjct: 511 LAGDKW 516



 Score = 48.9 bits (115), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 34/64 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GG+T F   N +V   K +A+FW N   N   D    H+GCPV  
Sbjct: 453 NRIATVMIYLSAVEAGGSTAFIYANFSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLA 512

Query: 62  GNKW 65
           G+KW
Sbjct: 513 GDKW 516


>gi|308467521|ref|XP_003096008.1| CRE-PHY-4 protein [Caenorhabditis remanei]
 gi|308244157|gb|EFO88109.1| CRE-PHY-4 protein [Caenorhabditis remanei]
          Length = 198

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 1/72 (1%)

Query: 60  ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
           A GN+   +L+  L   + GG T+FP +NL + P+ G  V W N   +   D +  H+ C
Sbjct: 101 ARGNRIATVLVI-LQIAKKGGTTVFPKINLNIRPKAGDVVVWLNTLPSGESDSQTLHAAC 159

Query: 120 PVALGNKWGKLL 131
           P+  G K G  L
Sbjct: 160 PIKEGTKIGATL 171



 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 32/68 (47%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L   + GG T+FP +NL + P+ G  V W N   +   D    H+ CP+  
Sbjct: 104 NRIATVLVILQIAKKGGTTVFPKINLNIRPKAGDVVVWLNTLPSGESDSQTLHAACPIKE 163

Query: 62  GNKWGKLL 69
           G K G  L
Sbjct: 164 GTKIGATL 171


>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
 gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
          Length = 285

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 35/69 (50%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           V  G +    L+  L DV  GG T+FP   ++V P +G AV++   +    LD    H+G
Sbjct: 205 VRTGGQRVATLIVYLNDVPSGGETVFPEAGISVVPRRGDAVYFRYMNRLRQLDPATLHAG 264

Query: 119 CPVALGNKW 127
            PV  G KW
Sbjct: 265 APVRDGEKW 273



 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 31/57 (54%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L DV  GG T+FP   ++V P +G AV++   +    LD    H+G PV  G KW
Sbjct: 217 VYLNDVPSGGETVFPEAGISVVPRRGDAVYFRYMNRLRQLDPATLHAGAPVRDGEKW 273


>gi|325920649|ref|ZP_08182559.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas gardneri ATCC 19865]
 gi|325548839|gb|EGD19783.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas gardneri ATCC 19865]
          Length = 422

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 33/64 (51%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR   + V L DV+ GG T FP   + V P  GS V + N  A+   D    H+G PV  
Sbjct: 345 NRLRTACVYLNDVDAGGETEFPVAGIRVQPRAGSVVCFDNLQADGCPDPDSLHAGLPVTT 404

Query: 62  GNKW 65
           G+KW
Sbjct: 405 GSKW 408



 Score = 45.8 bits (107), Expect = 0.006,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 29/55 (52%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV+ GG T FP   + V P  GS V + N  A+   D    H+G PV  G+KW
Sbjct: 354 LNDVDAGGETEFPVAGIRVQPRAGSVVCFDNLQADGCPDPDSLHAGLPVTTGSKW 408


>gi|156352046|ref|XP_001622583.1| predicted protein [Nematostella vectensis]
 gi|156209154|gb|EDO30483.1| predicted protein [Nematostella vectensis]
          Length = 497

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 33/57 (57%), Gaps = 3/57 (5%)

Query: 71  SGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           + L  V+ GG+ +F    L+V P+KGSAVFW N   +        H+ CPV +G+KW
Sbjct: 423 NALNFVDAGGSNVFLDSELSVSPQKGSAVFWLNMRRS---GKETLHAACPVIVGHKW 476



 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 9/58 (15%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHN---AHANTLLDYWMYHSGCPVALGNKW 65
           L  V+ GG+ +F    L+V P+KGSAVFW N   +   TL      H+ CPV +G+KW
Sbjct: 425 LNFVDAGGSNVFLDSELSVSPQKGSAVFWLNMRRSGKETL------HAACPVIVGHKW 476


>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
          Length = 404

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV
Sbjct: 317 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 375

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 376 LVGDKW 381



 Score = 48.5 bits (114), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +
Sbjct: 318 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 377

Query: 62  GNKW 65
           G+KW
Sbjct: 378 GDKW 381


>gi|224013908|ref|XP_002296618.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968970|gb|EED87314.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 601

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 26/58 (44%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN--TLLDYRMYHSGCPVALGNKWG 128
           L DVE GG T FP L+LTV P++G+A+ W +   +     D R  H   PVA G K+G
Sbjct: 525 LNDVEEGGGTRFPLLDLTVQPKRGNAILWASVRDDDPEEKDIRTDHEALPVAKGMKYG 582



 Score = 46.6 bits (109), Expect = 0.004,   Method: Composition-based stats.
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHAN--TLLDYWMYHSGCPVALGNKWG 66
           L DVE GG T FP L+LTV P++G+A+ W +   +     D    H   PVA G K+G
Sbjct: 525 LNDVEEGGGTRFPLLDLTVQPKRGNAILWASVRDDDPEEKDIRTDHEALPVAKGMKYG 582


>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
 gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
          Length = 296

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 40/94 (42%), Gaps = 16/94 (17%)

Query: 50  YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLT----------------VFP 93
           Y  +H     ALG      +L  L+DV  GG T+FPS   T                V P
Sbjct: 144 YDYFHDKYNQALGGHRIATVLMYLSDVVKGGETVFPSSEDTTVKDDSWSDCAKKGIAVKP 203

Query: 94  EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            KG A+ +Y+ H +   D    H GCPV  G KW
Sbjct: 204 RKGDALLFYSLHPDATPDESSLHGGCPVIEGEKW 237



 Score = 43.9 bits (102), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 32/71 (45%), Gaps = 16/71 (22%)

Query: 11  LTDVELGGATIFPSLNLT----------------VFPEKGSAVFWHNAHANTLLDYWMYH 54
           L+DV  GG T+FPS   T                V P KG A+ +++ H +   D    H
Sbjct: 167 LSDVVKGGETVFPSSEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYSLHPDATPDESSLH 226

Query: 55  SGCPVALGNKW 65
            GCPV  G KW
Sbjct: 227 GGCPVIEGEKW 237


>gi|341893923|gb|EGT49858.1| CBN-PHY-3 protein [Caenorhabditis brenneri]
          Length = 275

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 39/78 (50%), Gaps = 7/78 (8%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           D+WM +      + N+ G L+   L   E GG T+FPS+  T+      A FW+NA A+ 
Sbjct: 157 DWWMKN------MRNRIGTLIFV-LKTAEKGGGTVFPSIGTTIRINPRDAFFWFNAQADE 209

Query: 109 LLDYRMYHSGCPVALGNK 126
             +    H GCP+  G K
Sbjct: 210 EKEMLADHGGCPIYEGRK 227



 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 27/54 (50%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           L   E GG T+FPS+  T+      A FW NA A+   +    H GCP+  G K
Sbjct: 174 LKTAEKGGGTVFPSIGTTIRINPRDAFFWFNAQADEEKEMLADHGGCPIYEGRK 227


>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
 gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
          Length = 281

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           LGN+    L+  L+DV  GGAT+FP  N+TV   K + + + +   +   D    H+GCP
Sbjct: 193 LGNRMATFLMY-LSDVTAGGATVFPVANVTVPVVKNAGLLFMDLLRSGRGDVNSLHAGCP 251

Query: 121 VALGNKW 127
           V +G+KW
Sbjct: 252 VVIGSKW 258



 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+DV  GGAT+FP  N+TV   K + + + +   +   D    H+GCPV +G+KW
Sbjct: 204 LSDVTAGGATVFPVANVTVPVVKNAGLLFMDLLRSGRGDVNSLHAGCPVVIGSKW 258


>gi|260806885|ref|XP_002598314.1| hypothetical protein BRAFLDRAFT_204780 [Branchiostoma floridae]
 gi|229283586|gb|EEN54326.1| hypothetical protein BRAFLDRAFT_204780 [Branchiostoma floridae]
          Length = 282

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L++VE GGAT+F   N+ V   K SAV + N +   +      H+GCPV +G+KW
Sbjct: 205 LSEVEAGGATVFTEANIAVPVVKNSAVLFENTNKALVRSRASVHAGCPVLIGSKW 259



 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 33/55 (60%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L++VE GGAT+F   N+ V   K SAV + N +   +      H+GCPV +G+KW
Sbjct: 205 LSEVEAGGATVFTEANIAVPVVKNSAVLFENTNKALVRSRASVHAGCPVLIGSKW 259


>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 324

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 56/149 (37%), Gaps = 31/149 (20%)

Query: 4   AEMSSVELTDV------ELGGATIFPSLNLTVFPEKGSAVFWHNAHA-NTLLDYWMYHSG 56
           A M S+E+ D+      +L   T  P  N     E    + + N        DY  +H  
Sbjct: 119 ANMDSLEIDDIVSNVEAKLAAWTFLPEEN----GESMQILHYENGQKYEPHFDY--FHDQ 172

Query: 57  CPVALGNKWGKLLLSGLTDVELGGATIFP------------------SLNLTVFPEKGSA 98
             + LG      +L  L++VE GG T+FP                       V P KG A
Sbjct: 173 ANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDA 232

Query: 99  VFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           + ++N H N   D    H  CPV  G KW
Sbjct: 233 LLFFNLHPNATTDSNSLHGSCPVVEGEKW 261


>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
 gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
          Length = 300

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 32/57 (56%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L DV  GG TIFP+  L+V  ++G AV++   +    LD    H G PV  G+KW
Sbjct: 232 VYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVRAGDKW 288



 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 37/73 (50%)

Query: 55  SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
           S    A G +    L+  L DV  GG TIFP+  L+V  ++G AV++   +    LD   
Sbjct: 216 SAVHTARGGQRVATLVVYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLT 275

Query: 115 YHSGCPVALGNKW 127
            H G PV  G+KW
Sbjct: 276 LHGGAPVRAGDKW 288


>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
 gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
 gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
          Length = 308

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   LL+  L+DVE GG TIFP  N           
Sbjct: 195 YEPHFDYFLDEFNTKNG-----GQRMATLLMY-LSDVEEGGETIFPDANVNVSSLPWYNE 248

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P+ G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 249 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKW 295



 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG TIFP  N                   L+V P+ G A+ + +   +  LD  
Sbjct: 222 LSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDATLDPL 281

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 282 SLHGGCPVIRGNKW 295


>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
 gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 50/107 (46%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           ++AH +  LD +   +G     G +   LL+  L+DVE GG T+FP+             
Sbjct: 175 YDAHYDYFLDEFNTKNG-----GQRIATLLMY-LSDVEEGGETVFPATKANFSSVPWWNE 228

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P+ G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 229 LSECGKKGLSVKPKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKW 275



 Score = 43.5 bits (101), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+                     L+V P+ G A+ + +   +  LD  
Sbjct: 202 LSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSVKPKMGDALLFWSMRPDATLDPS 261

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 262 SLHGGCPVIKGNKW 275


>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
 gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
          Length = 296

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 56/108 (51%), Gaps = 5/108 (4%)

Query: 25  LNLTVFPEKGSAVFWHNAHANTL--LDYWMYHSGCPVALGNKWGK---LLLSGLTDVELG 79
           +NL V   +G  V  + A A ++   D+ +  +    A   + G+    L+S L +VE G
Sbjct: 169 MNLPVENGEGLQVLCYPAGAQSMPHFDFLVPSNAANKASLARSGQRVSTLVSYLNEVEEG 228

Query: 80  GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           G TIFP    +V P +GSAV++   ++   +D+   H+G PV  G KW
Sbjct: 229 GETIFPECGWSVPPRRGSAVYFEYCNSLGQVDHASLHAGGPVLHGEKW 276



 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +VE GG TIFP    +V P +GSAV++   ++   +D+   H+G PV  G KW
Sbjct: 222 LNEVEEGGETIFPECGWSVPPRRGSAVYFEYCNSLGQVDHASLHAGGPVLHGEKW 276


>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
 gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
          Length = 180

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   LL+  L+DVE GG TIFP  N           
Sbjct: 67  YEPHFDYFLDEFNTKNG-----GQRMATLLMH-LSDVEEGGETIFPDANVNDSSLPWYNE 120

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P+ G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 121 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKW 167



 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 35/76 (46%), Gaps = 19/76 (25%)

Query: 9   VELTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLD 49
           + L+DVE GG TIFP  N                   L+V P+ G A+ + +   +  LD
Sbjct: 92  MHLSDVEEGGETIFPDANVNDSSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDATLD 151

Query: 50  YWMYHSGCPVALGNKW 65
               H GCPV  GNKW
Sbjct: 152 PLSLHGGCPVIRGNKW 167


>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
 gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
          Length = 307

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   LL+  L+DVE GG TIFP  N           
Sbjct: 194 YEPHFDYFLDEFNTKNG-----GQRMATLLMY-LSDVEEGGETIFPDANVNASSLPWYNE 247

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P+ G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 248 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKW 294



 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG TIFP  N                   L+V P+ G A+ + +   +  LD  
Sbjct: 221 LSDVEEGGETIFPDANVNASSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDATLDPL 280

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 281 SLHGGCPVIRGNKW 294


>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
          Length = 310

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 47/107 (43%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   LL+  L+DVE GG TIFP  N           
Sbjct: 197 YEPHFDYFLDEYNTKNG-----GQRMATLLMY-LSDVEEGGETIFPDANVNSSSLPWYNE 250

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L V P+ G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 251 LSECARKGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKW 297



 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 33/74 (44%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG TIFP  N                   L V P+ G A+ + +   +  LD  
Sbjct: 224 LSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDPL 283

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 284 SLHGGCPVIKGNKW 297


>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
 gi|194706408|gb|ACF87288.1| unknown [Zea mays]
 gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
 gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
          Length = 217

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 42/98 (42%), Gaps = 20/98 (20%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------L 89
            DY  +H    + LG +    +L  LTDV  GG T+FP+                    L
Sbjct: 57  FDY--FHDRNNLKLGGQRVATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGL 114

Query: 90  TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            V P+KG A+ ++N H N   D    H  CPV  G KW
Sbjct: 115 AVKPKKGDALLFFNLHVNATADTGSLHGSCPVIEGEKW 152



 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 31/73 (42%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
           LTDV  GG T+FP+                    L V P+KG A+ + N H N   D   
Sbjct: 80  LTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNATADTGS 139

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 140 LHGSCPVIEGEKW 152


>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
 gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
          Length = 309

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 39/76 (51%), Gaps = 8/76 (10%)

Query: 56  GCPVAL--GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLD 111
           G P  L  G +    ++  L D E GG T FP ++L V P +G+AVF  +   H +T   
Sbjct: 226 GTPTILRRGGQRVATIVIYLNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERPHPST--- 282

Query: 112 YRMYHSGCPVALGNKW 127
            R  H G PV  G+KW
Sbjct: 283 -RTLHGGAPVVAGDKW 297



 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 31/57 (54%), Gaps = 6/57 (10%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
           L D E GG T FP ++L V P +G+AVF  +   H +T       H G PV  G+KW
Sbjct: 245 LNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERPHPSTR----TLHGGAPVVAGDKW 297


>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
          Length = 188

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 49/107 (45%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   +L+  L+DVE GG TIFP+             
Sbjct: 75  YEPHYDYFLDEFNTKNG-----GQRIATVLMY-LSDVEEGGETIFPAAKANFSSVPWYND 128

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P++G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 129 LSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKW 175



 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG TIFP+                     L+V P++G A+ + +   +  LD  
Sbjct: 102 LSDVEEGGETIFPAAKANFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPS 161

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 162 SLHGGCPVIRGNKW 175


>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
 gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
          Length = 296

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G + G LL+  L DVE GG T+FP +  +V P++G A+++   +   L D    H+  P+
Sbjct: 215 GQRMGTLLMY-LKDVEGGGETVFPQIGWSVAPQRGHALYFEYGNRFGLCDPSSLHASTPL 273

Query: 122 ALGNKW 127
            +G+KW
Sbjct: 274 RVGDKW 279



 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T+FP +  +V P++G A+++   +   L D    H+  P+ +G+KW
Sbjct: 225 LKDVEGGGETVFPQIGWSVAPQRGHALYFEYGNRFGLCDPSSLHASTPLRVGDKW 279


>gi|26352077|dbj|BAC39675.1| unnamed protein product [Mus musculus]
          Length = 383

 Score = 49.3 bits (116), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 34/55 (61%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+ VE GGAT F   N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 306 LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 360



 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L+ VE GGAT F   N +V   K +A+FW N H +   D    H+GCPV +G+KW
Sbjct: 306 LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 360


>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 289

 Score = 49.3 bits (116), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   +L+  LTDVE GG T+FP+             
Sbjct: 176 YEPHYDYFLDEFNTKNG-----GQRIATVLMY-LTDVEEGGETVFPAAKGNFSNVPWYNE 229

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L++ P++G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 230 LSDCGKKGLSIKPKRGDALLFWSMKPDATLDASSLHGGCPVIKGNKW 276



 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           LTDVE GG T+FP+                     L++ P++G A+ + +   +  LD  
Sbjct: 203 LTDVEEGGETVFPAAKGNFSNVPWYNELSDCGKKGLSIKPKRGDALLFWSMKPDATLDAS 262

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 263 SLHGGCPVIKGNKW 276


>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
 gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
          Length = 296

 Score = 49.3 bits (116), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 56/108 (51%), Gaps = 5/108 (4%)

Query: 25  LNLTVFPEKGSAVFWHNAHANTL--LDYWMYHSGCPVALGNKWGK---LLLSGLTDVELG 79
           +NL V   +G  V  + A A +L   D+ +  +    A   + G+    L++ L +VE G
Sbjct: 169 MNLPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLVAYLNEVEEG 228

Query: 80  GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           G T+FP    +V P++G AV++   ++   +D+   H+G PV  G KW
Sbjct: 229 GETVFPETGWSVSPQRGGAVYFEYCNSLGQVDHASLHAGAPVLSGEKW 276



 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +VE GG T+FP    +V P++G AV++   ++   +D+   H+G PV  G KW
Sbjct: 222 LNEVEEGGETVFPETGWSVSPQRGGAVYFEYCNSLGQVDHASLHAGAPVLSGEKW 276


>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
 gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
          Length = 277

 Score = 49.3 bits (116), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 38/76 (50%), Gaps = 4/76 (5%)

Query: 54  HSGCPVAL--GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLD 111
             G P  L  G +    L+  L + E GGAT+FP + L V P +G+AVF+    AN    
Sbjct: 192 EPGTPAILQRGGQRVATLIMYLNEPEQGGATVFPDIGLQVTPRRGTAVFFSYPAANPASL 251

Query: 112 YRMYHSGCPVALGNKW 127
            R  H G PV  G KW
Sbjct: 252 TR--HGGEPVKAGEKW 265



 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 30/55 (54%), Gaps = 2/55 (3%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L + E GGAT+FP + L V P +G+AVF+    AN        H G PV  G KW
Sbjct: 213 LNEPEQGGATVFPDIGLQVTPRRGTAVFFSYPAANPA--SLTRHGGEPVKAGEKW 265


>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
 gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
          Length = 298

 Score = 49.3 bits (116), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 39/76 (51%), Gaps = 8/76 (10%)

Query: 56  GCPVAL--GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLD 111
           G P  L  G +    ++  L D E GG T FP ++L V P +G+AVF  +   H +T   
Sbjct: 215 GTPTILRRGGQRVATIVIYLNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERPHPST--- 271

Query: 112 YRMYHSGCPVALGNKW 127
            R  H G PV  G+KW
Sbjct: 272 -RTLHGGAPVVAGDKW 286



 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 31/57 (54%), Gaps = 6/57 (10%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
           L D E GG T FP ++L V P +G+AVF  +   H +T       H G PV  G+KW
Sbjct: 234 LNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERPHPSTR----TLHGGAPVVAGDKW 286


>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
          Length = 180

 Score = 48.9 bits (115), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   LL+  L+DVE GG TIFP  N           
Sbjct: 67  YEPHFDYFLDEFNTKNG-----GQRMATLLMY-LSDVEEGGETIFPDANVNVSSLPWYNE 120

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P+ G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 121 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKW 167



 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG TIFP  N                   L+V P+ G A+ + +   +  LD  
Sbjct: 94  LSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDATLDPL 153

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 154 SLHGGCPVIRGNKW 167


>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
 gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
 gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
 gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
 gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
 gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
 gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
          Length = 307

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   LL+  L+DVE GG TIFP  N           
Sbjct: 194 YEPHFDYFLDEFNTKNG-----GQRIATLLMY-LSDVEEGGETIFPDANVNASSLPWYNE 247

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P+ G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 248 LSDCAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKW 294



 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG TIFP  N                   L+V P+ G A+ + +   +  LD  
Sbjct: 221 LSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDATLDPL 280

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 281 SLHGGCPVIKGNKW 294


>gi|21107513|gb|AAM36222.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 273

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 26/64 (40%), Positives = 32/64 (50%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV  
Sbjct: 196 NRQRTVCVYLNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 255

Query: 62  GNKW 65
           G+KW
Sbjct: 256 GSKW 259



 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 29/55 (52%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV  G+KW
Sbjct: 205 LNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 259


>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
 gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
          Length = 307

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   LL+  L+DVE GG TIFP  N           
Sbjct: 194 YEPHFDYFLDEFNTKNG-----GQRIATLLMY-LSDVEEGGETIFPDANVNASSLPWYNE 247

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P+ G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 248 LSDCAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKW 294



 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG TIFP  N                   L+V P+ G A+ + +   +  LD  
Sbjct: 221 LSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDATLDPL 280

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 281 SLHGGCPVIKGNKW 294


>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
 gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
          Length = 300

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 32/57 (56%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L DV  GG TIFP+  L+V  ++G AV++   +    LD    H G PV  G+KW
Sbjct: 232 VYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVHAGDKW 288



 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 37/73 (50%)

Query: 55  SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
           S    A G +    L+  L DV  GG TIFP+  L+V  ++G AV++   +    LD   
Sbjct: 216 SAVHTARGGQRVATLVVYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLT 275

Query: 115 YHSGCPVALGNKW 127
            H G PV  G+KW
Sbjct: 276 LHGGAPVHAGDKW 288


>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
 gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
          Length = 313

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 31/57 (54%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L DV  GG TIFP   L+V  ++G AV++   +    LD    H G PV  G+KW
Sbjct: 245 VYLNDVPDGGETIFPEAGLSVAAQQGGAVYFRYMNGRRQLDPLTLHGGAPVLSGDKW 301



 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 35/69 (50%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
            A G +    L+  L DV  GG TIFP   L+V  ++G AV++   +    LD    H G
Sbjct: 233 TARGGQRVATLVVYLNDVPDGGETIFPEAGLSVAAQQGGAVYFRYMNGRRQLDPLTLHGG 292

Query: 119 CPVALGNKW 127
            PV  G+KW
Sbjct: 293 APVLSGDKW 301


>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 318

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 48/107 (44%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   +L+  L+DVE GG TIFP  N           
Sbjct: 205 YEPHFDYFLDEFNTKNG-----GQRMATILMY-LSDVEEGGETIFPDANVNSSSLPWHNE 258

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L V P+ G A+ +++ + +  LD    H GCPV  GNKW
Sbjct: 259 LSECARKGLAVKPKMGDALLFWSMNPDATLDPLSLHGGCPVIRGNKW 305



 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG TIFP  N                   L V P+ G A+ + + + +  LD  
Sbjct: 232 LSDVEEGGETIFPDANVNSSSLPWHNELSECARKGLAVKPKMGDALLFWSMNPDATLDPL 291

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 292 SLHGGCPVIRGNKW 305


>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
 gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
          Length = 302

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G + G LL+  L DVE GG T+FP +  +V P +G A ++   + +   D    H+  P+
Sbjct: 215 GQRVGTLLMY-LNDVESGGETLFPQVGCSVVPRRGQAFYFEYGNGSGRSDPASLHASSPI 273

Query: 122 ALGNKW 127
             G+KW
Sbjct: 274 GSGDKW 279



 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 30/55 (54%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T+FP +  +V P +G A ++   + +   D    H+  P+  G+KW
Sbjct: 225 LNDVESGGETLFPQVGCSVVPRRGQAFYFEYGNGSGRSDPASLHASSPIGSGDKW 279


>gi|78046960|ref|YP_363135.1| hypothetical protein XCV1404 [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
 gi|78035390|emb|CAJ23035.1| conserved hypothetical protein [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
          Length = 418

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 32/64 (50%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV  
Sbjct: 341 NRQRTVCVYLNDVGAGGETEFPVAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 400

Query: 62  GNKW 65
           G+KW
Sbjct: 401 GSKW 404



 Score = 45.4 bits (106), Expect = 0.008,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 29/55 (52%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV  G+KW
Sbjct: 350 LNDVGAGGETEFPVAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 404


>gi|346724248|ref|YP_004850917.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346648995|gb|AEO41619.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 418

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 32/64 (50%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV  
Sbjct: 341 NRQRTVCVYLNDVGAGGETEFPVAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 400

Query: 62  GNKW 65
           G+KW
Sbjct: 401 GSKW 404



 Score = 45.4 bits (106), Expect = 0.008,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 29/55 (52%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV  G+KW
Sbjct: 350 LNDVGAGGETEFPVAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 404


>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 328

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 33/73 (45%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSL------------------NLTVFPEKGSAVFWHNAHANTLLDYWM 52
           LTDVE GG TIFP                     L V P KG A+F+ + H N   D   
Sbjct: 183 LTDVEEGGETIFPDAEAGANPGGGDDASSCAAGKLAVKPRKGDALFFRSLHHNGTSDAMS 242

Query: 53  YHSGCPVALGNKW 65
            H+GCPV  G K+
Sbjct: 243 SHAGCPVVKGVKF 255



 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 40/98 (40%), Gaps = 18/98 (18%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSL------------------NL 89
            DY+          G +    +L  LTDVE GG TIFP                     L
Sbjct: 158 FDYFQDEFNQKREKGGQRVATVLMYLTDVEEGGETIFPDAEAGANPGGGDDASSCAAGKL 217

Query: 90  TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            V P KG A+F+ + H N   D    H+GCPV  G K+
Sbjct: 218 AVKPRKGDALFFRSLHHNGTSDAMSSHAGCPVVKGVKF 255


>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 294

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 39/146 (26%), Positives = 58/146 (39%), Gaps = 25/146 (17%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           + RAE   +E  +  +   T  P  +   F      V  +  H      Y  +H    V 
Sbjct: 84  LGRAEDDVIEAIEARIAAWTHVPESHGEGF-----QVLRYEKHQEYRAHYDYFHDKFNVK 138

Query: 61  L---GNKWGKLLLSGLTDVELGGATIFPSLN----------------LTVFPEKGSAVFW 101
               G + G +L+  L+DVE GG T+FP                   L V P KG A+F+
Sbjct: 139 REKGGQRMGTVLMY-LSDVEEGGETVFPKFEDGTPAGSEASECARNKLAVRPRKGDALFF 197

Query: 102 YNAHANTLLDYRMYHSGCPVALGNKW 127
            +   + + D    H+GCPV  G K+
Sbjct: 198 RSLRHDGVPDTFSEHAGCPVIRGVKF 223


>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
          Length = 321

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 39/84 (46%), Gaps = 19/84 (22%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLN------------------LTVFPEKGSAVFWYN 103
           G++   +L+  L++V++GG TIFP                       V P KGSAV +++
Sbjct: 175 GDRVATVLMY-LSNVKMGGETIFPDCEARLSQPKDETWSDCAEQGFAVKPAKGSAVLFFS 233

Query: 104 AHANTLLDYRMYHSGCPVALGNKW 127
            H N  LD    H  CPV  G KW
Sbjct: 234 LHPNATLDTDSLHGSCPVIEGEKW 257



 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 32/73 (43%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
           L++V++GG TIFP                       V P KGSAV + + H N  LD   
Sbjct: 185 LSNVKMGGETIFPDCEARLSQPKDETWSDCAEQGFAVKPAKGSAVLFFSLHPNATLDTDS 244

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 245 LHGSCPVIEGEKW 257


>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
 gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
          Length = 307

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 30/55 (54%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DV  GG TIFP   L+V  ++G AV++   +    LD    H G PV  G+KW
Sbjct: 241 LNDVPDGGETIFPEAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVRAGDKW 295



 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 35/68 (51%)

Query: 60  ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
           A G +    L+  L DV  GG TIFP   L+V  ++G AV++   +    LD    H G 
Sbjct: 228 ARGGQRVATLVIYLNDVPDGGETIFPEAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGA 287

Query: 120 PVALGNKW 127
           PV  G+KW
Sbjct: 288 PVRAGDKW 295


>gi|390989336|ref|ZP_10259634.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
 gi|372555840|emb|CCF66609.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
          Length = 228

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 26/64 (40%), Positives = 32/64 (50%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV  
Sbjct: 151 NRQRTVCVYLNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 210

Query: 62  GNKW 65
           G+KW
Sbjct: 211 GSKW 214



 Score = 44.3 bits (103), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 34/66 (51%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+  + +   L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV
Sbjct: 150 GNRQ-RTVCVYLNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPV 208

Query: 122 ALGNKW 127
             G+KW
Sbjct: 209 TAGSKW 214


>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
 gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
          Length = 279

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G + G  L+  L DV+ GGAT FP+LN  + P+KG A+F+ N   N   +    H+G PV
Sbjct: 204 GQRVGTFLVY-LCDVDAGGATRFPALNFEIRPKKGMALFFANTLPNGEGNPLTLHAGVPV 262

Query: 122 ALGNKW 127
             G K+
Sbjct: 263 VSGVKY 268



 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 34/57 (59%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L DV+ GGAT FP+LN  + P+KG A+F+ N   N   +    H+G PV  G K+
Sbjct: 212 VYLCDVDAGGATRFPALNFEIRPKKGMALFFANTLPNGEGNPLTLHAGVPVVSGVKY 268


>gi|268572523|ref|XP_002641343.1| C. briggsae CBR-DPY-18 protein [Caenorhabditis briggsae]
 gi|94442971|emb|CAJ98658.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 559

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 23/66 (34%), Positives = 34/66 (51%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++    GG T+F  +  TV P K  A+FWYN +     +    H+ CPV
Sbjct: 440 GNRIATVLFY-MSQPSHGGGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAACPV 498

Query: 122 ALGNKW 127
            +G KW
Sbjct: 499 LVGIKW 504



 Score = 45.4 bits (106), Expect = 0.008,   Method: Composition-based stats.
 Identities = 19/49 (38%), Positives = 27/49 (55%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GG T+F  +  TV P K  A+FW+N +     +    H+ CPV +G KW
Sbjct: 456 GGGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 504


>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
 gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
          Length = 225

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 19/78 (24%)

Query: 69  LLSGLTDVELGGATIFPS-------------------LNLTVFPEKGSAVFWYNAHANTL 109
           LL  LTDVE GG T+FP                      ++V P++G A+ +++   +  
Sbjct: 135 LLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQ 194

Query: 110 LDYRMYHSGCPVALGNKW 127
           LD+   H GCPV  G+KW
Sbjct: 195 LDHSSLHGGCPVIKGDKW 212



 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPS-------------------LNLTVFPEKGSAVFWHNAHANTLLDYW 51
           LTDVE GG T+FP                      ++V P++G A+ + +   +  LD+ 
Sbjct: 139 LTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQLDHS 198

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  G+KW
Sbjct: 199 SLHGGCPVIKGDKW 212


>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
 gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
          Length = 213

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 19/78 (24%)

Query: 69  LLSGLTDVELGGATIFPS-------------------LNLTVFPEKGSAVFWYNAHANTL 109
           LL  LTDVE GG T+FP                      ++V P++G A+ +++   +  
Sbjct: 123 LLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQ 182

Query: 110 LDYRMYHSGCPVALGNKW 127
           LD+   H GCPV  G+KW
Sbjct: 183 LDHSSLHGGCPVIKGDKW 200



 Score = 44.7 bits (104), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPS-------------------LNLTVFPEKGSAVFWHNAHANTLLDYW 51
           LTDVE GG T+FP                      ++V P++G A+ + +   +  LD+ 
Sbjct: 127 LTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQLDHS 186

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  G+KW
Sbjct: 187 SLHGGCPVIKGDKW 200


>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
          Length = 290

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   +L+  LTDVE GG T+FP+             
Sbjct: 177 YEPHYDYFLDDFNTKNG-----GQRIATVLMY-LTDVEEGGETVFPAAKGNFSSVPWWNE 230

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L++ P++G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 231 LSECGKKGLSIKPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGNKW 277



 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           LTDVE GG T+FP+                     L++ P++G A+ + +   +  LD  
Sbjct: 204 LTDVEEGGETVFPAAKGNFSSVPWWNELSECGKKGLSIKPKRGDALLFWSMKPDATLDPS 263

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 264 SLHGGCPVIKGNKW 277


>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
          Length = 180

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 37/78 (47%), Gaps = 19/78 (24%)

Query: 69  LLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTL 109
           LL  L+DVE GG TIFP  N                   L+V P+ G A+ +++   +  
Sbjct: 90  LLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDAT 149

Query: 110 LDYRMYHSGCPVALGNKW 127
           LD    H GCPV  GNKW
Sbjct: 150 LDPLSLHGGCPVIKGNKW 167



 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG TIFP  N                   L+V P+ G A+ + +   +  LD  
Sbjct: 94  LSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDATLDPL 153

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 154 SLHGGCPVIKGNKW 167


>gi|224006556|ref|XP_002292238.1| hypothetical protein THAPSDRAFT_269421 [Thalassiosira pseudonana
           CCMP1335]
 gi|220971880|gb|EED90213.1| hypothetical protein THAPSDRAFT_269421 [Thalassiosira pseudonana
           CCMP1335]
          Length = 348

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 17/99 (17%)

Query: 45  NTLLDYWMYHSGCPVALGNKWGKLL---LSGLTDVELGGATIF---PS----------LN 88
             L  ++  +    V   N+ G+ L   L  L DVE GG T F   P+          LN
Sbjct: 225 QALAPHYDANRSADVEDKNRGGQTLATLLVYLNDVENGGTTRFGKLPAADDKRQVKGELN 284

Query: 89  LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L + P+KG A+ ++ A  N + D R  H GCP A+  KW
Sbjct: 285 LNIIPKKGDALLFFPADKNGMFDERTEHEGCP-AVDEKW 322



 Score = 43.1 bits (100), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 27/70 (38%), Positives = 34/70 (48%), Gaps = 14/70 (20%)

Query: 9   VELTDVELGGATIF---PS----------LNLTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           V L DVE GG T F   P+          LNL + P+KG A+ +  A  N + D    H 
Sbjct: 254 VYLNDVENGGTTRFGKLPAADDKRQVKGELNLNIIPKKGDALLFFPADKNGMFDERTEHE 313

Query: 56  GCPVALGNKW 65
           GCP A+  KW
Sbjct: 314 GCP-AVDEKW 322


>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 286

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 31/55 (56%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DV  GG T+FP+  L+V  + G AV++   +A   LD    H G PV  G+KW
Sbjct: 220 LNDVADGGETVFPTAGLSVAAQAGGAVYFRYMNAERQLDPSTLHGGAPVLAGDKW 274



 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G +   L++  L DV  GG T+FP+  L+V  + G AV++   +A   LD    H G PV
Sbjct: 210 GQRVATLIIY-LNDVADGGETVFPTAGLSVAAQAGGAVYFRYMNAERQLDPSTLHGGAPV 268

Query: 122 ALGNKW 127
             G+KW
Sbjct: 269 LAGDKW 274


>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
           C-169]
          Length = 327

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 33/75 (44%), Gaps = 20/75 (26%)

Query: 73  LTDVELGGATIFPS--------LNLTVF------------PEKGSAVFWYNAHANTLLDY 112
           LTDVE GG TIFP         L +T F            P  G A+ +Y+ H N   D 
Sbjct: 237 LTDVEEGGETIFPLEGRYGPDLLKMTGFNYKSCTTGFKYKPRMGDALMFYSMHPNGTFDK 296

Query: 113 RMYHSGCPVALGNKW 127
              H GCPV  G KW
Sbjct: 297 HALHGGCPVMAGEKW 311



 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 33/75 (44%), Gaps = 20/75 (26%)

Query: 11  LTDVELGGATIFPS--------LNLTVF------------PEKGSAVFWHNAHANTLLDY 50
           LTDVE GG TIFP         L +T F            P  G A+ +++ H N   D 
Sbjct: 237 LTDVEEGGETIFPLEGRYGPDLLKMTGFNYKSCTTGFKYKPRMGDALMFYSMHPNGTFDK 296

Query: 51  WMYHSGCPVALGNKW 65
              H GCPV  G KW
Sbjct: 297 HALHGGCPVMAGEKW 311


>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
 gi|255647110|gb|ACU24023.1| unknown [Glycine max]
          Length = 289

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   +L+  LTDVE GG T+FP+             
Sbjct: 176 YEPHYDYFLDDFNTKNG-----GQRIATVLMY-LTDVEEGGETVFPAAKGNFSFVPWWNE 229

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L++ P++G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 230 LFECGKKGLSIKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 276



 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           LTDVE GG T+FP+                     L++ P++G A+ + +   +  LD  
Sbjct: 203 LTDVEEGGETVFPAAKGNFSFVPWWNELFECGKKGLSIKPKRGDALLFWSMKPDASLDPS 262

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 263 SLHGGCPVIKGNKW 276


>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
           from Gallus gallus gi|212530 [Arabidopsis thaliana]
 gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
 gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 287

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 19/78 (24%)

Query: 69  LLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTL 109
           +L  L+DVE GG T+FP+ N                   L+V P  G A+ +++   +  
Sbjct: 197 MLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDAT 256

Query: 110 LDYRMYHSGCPVALGNKW 127
           LD    H GCPV  GNKW
Sbjct: 257 LDPTSLHGGCPVIRGNKW 274



 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+ N                   L+V P  G A+ + +   +  LD  
Sbjct: 201 LSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPT 260

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 261 SLHGGCPVIRGNKW 274


>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 19/78 (24%)

Query: 69  LLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTL 109
           +L  L+DVE GG T+FP+ N                   L+V P  G A+ +++   +  
Sbjct: 197 MLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDAT 256

Query: 110 LDYRMYHSGCPVALGNKW 127
           LD    H GCPV  GNKW
Sbjct: 257 LDPTSLHGGCPVIRGNKW 274



 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+ N                   L+V P  G A+ + +   +  LD  
Sbjct: 201 LSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPT 260

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 261 SLHGGCPVIRGNKW 274


>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
 gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
          Length = 268

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G + G  L+  L++V+ GG+T FP+LN  + P KGSA+++ N +    ++    H+G PV
Sbjct: 192 GQRVGTFLMY-LSEVDSGGSTRFPNLNFEIRPNKGSALYFANTNLKAEIEPLTLHAGMPV 250

Query: 122 ALGNKW 127
             G K+
Sbjct: 251 TEGVKY 256



 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 35/55 (63%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L++V+ GG+T FP+LN  + P KGSA+++ N +    ++    H+G PV  G K+
Sbjct: 202 LSEVDSGGSTRFPNLNFEIRPNKGSALYFANTNLKAEIEPLTLHAGMPVTEGVKY 256


>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 287

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 19/78 (24%)

Query: 69  LLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTL 109
           +L  L+DVE GG T+FP+ N                   L+V P  G A+ +++   +  
Sbjct: 197 MLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDAT 256

Query: 110 LDYRMYHSGCPVALGNKW 127
           LD    H GCPV  GNKW
Sbjct: 257 LDPTSLHGGCPVIRGNKW 274



 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+ N                   L+V P  G A+ + +   +  LD  
Sbjct: 201 LSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPT 260

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 261 SLHGGCPVIRGNKW 274


>gi|397620797|gb|EJK65916.1| hypothetical protein THAOC_13183 [Thalassiosira oceanica]
          Length = 520

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/55 (43%), Positives = 31/55 (56%), Gaps = 1/55 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DVE GG T FP ++L + P++G AV  + A      D R  H G P AL +KW
Sbjct: 438 LNDVEEGGTTYFPEVDLDIRPKRGMAVIHFPADVEMREDKRTLHQGSP-ALDDKW 491



 Score = 43.1 bits (100), Expect = 0.041,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP ++L + P++G AV    A      D    H G P AL +KW
Sbjct: 438 LNDVEEGGTTYFPEVDLDIRPKRGMAVIHFPADVEMREDKRTLHQGSP-ALDDKW 491


>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
 gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
          Length = 299

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 40/79 (50%), Gaps = 10/79 (12%)

Query: 54  HSGCPVAL---GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANT 108
             G P  L   G + G L++  L   E GG T FP +++ V P++G+AVF  +  AH  T
Sbjct: 214 EPGTPTILKRGGQRVGTLVMY-LNTPEKGGGTTFPDVHVEVAPQRGNAVFFSYERAHPAT 272

Query: 109 LLDYRMYHSGCPVALGNKW 127
               R  H G PV  G KW
Sbjct: 273 ----RTLHGGAPVIAGEKW 287



 Score = 43.5 bits (101), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 30/57 (52%), Gaps = 6/57 (10%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
           L   E GG T FP +++ V P++G+AVF  +  AH  T       H G PV  G KW
Sbjct: 235 LNTPEKGGGTTFPDVHVEVAPQRGNAVFFSYERAHPATR----TLHGGAPVIAGEKW 287


>gi|313242424|emb|CBY34571.1| unnamed protein product [Oikopleura dioica]
          Length = 503

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 31/57 (54%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+ L + + GG T+F    +   P   SAVFWYN   +   DYR  H+ CPV +G K
Sbjct: 414 LTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 470



 Score = 42.0 bits (97), Expect = 0.075,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 30/63 (47%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L + + GG T+F    +   P   SAVFW+N   +   DY   H+ CPV +
Sbjct: 408 NRIATWLTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLI 467

Query: 62  GNK 64
           G K
Sbjct: 468 GQK 470


>gi|313241587|emb|CBY33829.1| unnamed protein product [Oikopleura dioica]
          Length = 541

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 31/57 (54%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+ L + + GG T+F    +   P   SAVFWYN   +   DYR  H+ CPV +G K
Sbjct: 452 LTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 508



 Score = 42.0 bits (97), Expect = 0.075,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 30/63 (47%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L + + GG T+F    +   P   SAVFW+N   +   DY   H+ CPV +
Sbjct: 446 NRIATWLTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLI 505

Query: 62  GNK 64
           G K
Sbjct: 506 GQK 508


>gi|313229343|emb|CBY23930.1| unnamed protein product [Oikopleura dioica]
          Length = 542

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 31/57 (54%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+ L + + GG T+F    +   P   SAVFWYN   +   DYR  H+ CPV +G K
Sbjct: 453 LTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 509



 Score = 42.0 bits (97), Expect = 0.075,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 30/63 (47%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L + + GG T+F    +   P   SAVFW+N   +   DY   H+ CPV +
Sbjct: 447 NRIATWLTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLI 506

Query: 62  GNK 64
           G K
Sbjct: 507 GQK 509


>gi|313213106|emb|CBY36968.1| unnamed protein product [Oikopleura dioica]
          Length = 541

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 31/57 (54%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L+ L + + GG T+F    +   P   SAVFWYN   +   DYR  H+ CPV +G K
Sbjct: 452 LTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 508



 Score = 42.0 bits (97), Expect = 0.075,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 30/63 (47%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       L + + GG T+F    +   P   SAVFW+N   +   DY   H+ CPV +
Sbjct: 446 NRIATWLTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLI 505

Query: 62  GNK 64
           G K
Sbjct: 506 GQK 508


>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   +L+  L+DVE GG T+FP+             
Sbjct: 174 YEPHFDYFLDEYNTKNG-----GQRIATVLMY-LSDVEEGGETVFPAAKGNFSSVPWWNE 227

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P++G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 228 LSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 274



 Score = 44.7 bits (104), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+                     L+V P++G A+ + +   +  LD  
Sbjct: 201 LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPS 260

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 261 SLHGGCPVIKGNKW 274


>gi|397617249|gb|EJK64350.1| hypothetical protein THAOC_14927, partial [Thalassiosira oceanica]
          Length = 620

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 29/65 (44%), Positives = 37/65 (56%), Gaps = 4/65 (6%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNA--HANTLLDYRMYHSGCPV--ALGNKWG 128
           L DVE GG T FP L++TV P+KGSA+ W +    A    DYR  H   PV   L N   
Sbjct: 517 LNDVEEGGGTSFPLLDVTVQPKKGSALLWPSVLDAAPESKDYRTDHEALPVIKGLNNASQ 576

Query: 129 KLLLS 133
           +L+L+
Sbjct: 577 QLILN 581



 Score = 45.1 bits (105), Expect = 0.010,   Method: Composition-based stats.
 Identities = 28/65 (43%), Positives = 36/65 (55%), Gaps = 4/65 (6%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNA--HANTLLDYWMYHSGCPV--ALGNKWG 66
           L DVE GG T FP L++TV P+KGSA+ W +    A    DY   H   PV   L N   
Sbjct: 517 LNDVEEGGGTSFPLLDVTVQPKKGSALLWPSVLDAAPESKDYRTDHEALPVIKGLNNASQ 576

Query: 67  KLLLS 71
           +L+L+
Sbjct: 577 QLILN 581


>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
 gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
          Length = 286

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G +   ++L  L +VE GG T FP + L + P +GSA+++   +     D +  H+G PV
Sbjct: 207 GQRIATVILY-LNEVEQGGDTTFPDIGLAIHPRRGSALYFEYVNELGQSDPKTLHAGTPV 265

Query: 122 ALGNKW 127
             G KW
Sbjct: 266 EKGEKW 271



 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 30/55 (54%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L +VE GG T FP + L + P +GSA+++   +     D    H+G PV  G KW
Sbjct: 217 LNEVEQGGDTTFPDIGLAIHPRRGSALYFEYVNELGQSDPKTLHAGTPVEKGEKW 271


>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   +L+  L+DVE GG T+FP+             
Sbjct: 174 YEPHFDYFLDEYNTKNG-----GQRIATVLMY-LSDVEEGGETVFPAAKGNFSSVPWWNE 227

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P++G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 228 LSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 274



 Score = 44.7 bits (104), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+                     L+V P++G A+ + +   +  LD  
Sbjct: 201 LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPS 260

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 261 SLHGGCPVIKGNKW 274


>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
 gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
          Length = 293

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 28/49 (57%)

Query: 79  GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           GGAT FP + L V P KG+AV +     +  LD R  H+G PV  G KW
Sbjct: 230 GGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAGEKW 278



 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 27/49 (55%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GGAT FP + L V P KG+AV +     +  LD    H+G PV  G KW
Sbjct: 230 GGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAGEKW 278


>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 290

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 49/108 (45%), Gaps = 25/108 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   +L+  L+DVE GG T+FP+ N           
Sbjct: 177 YEPHYDYFLDEFNTKNG-----GQRIATVLMY-LSDVEEGGETVFPAANANFSSVPWWND 230

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
                   L+V P+ G A+ +++   +  LD    H GCPV  GNKW 
Sbjct: 231 LSQCARKGLSVKPKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKWS 278



 Score = 45.1 bits (105), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 35/75 (46%), Gaps = 19/75 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+ N                   L+V P+ G A+ + +   +  LD  
Sbjct: 204 LSDVEEGGETVFPAANANFSSVPWWNDLSQCARKGLSVKPKMGDALLFWSMRPDATLDPS 263

Query: 52  MYHSGCPVALGNKWG 66
             H GCPV  GNKW 
Sbjct: 264 SLHGGCPVIKGNKWS 278


>gi|77748579|ref|NP_641686.2| hypothetical protein XAC1351 [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 418

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 32/64 (50%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV  
Sbjct: 341 NRQRTVCVYLNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 400

Query: 62  GNKW 65
           G+KW
Sbjct: 401 GSKW 404



 Score = 44.7 bits (104), Expect = 0.013,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 29/55 (52%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV  G+KW
Sbjct: 350 LNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 404


>gi|418521653|ref|ZP_13087695.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410702188|gb|EKQ60697.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
          Length = 418

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 32/64 (50%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV  
Sbjct: 341 NRQRTVCVYLNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 400

Query: 62  GNKW 65
           G+KW
Sbjct: 401 GSKW 404



 Score = 44.7 bits (104), Expect = 0.013,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 29/55 (52%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV  G+KW
Sbjct: 350 LNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 404


>gi|381173085|ref|ZP_09882194.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380686458|emb|CCG38681.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 418

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 32/64 (50%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV  
Sbjct: 341 NRQRTVCVYLNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 400

Query: 62  GNKW 65
           G+KW
Sbjct: 401 GSKW 404



 Score = 44.7 bits (104), Expect = 0.013,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 29/55 (52%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GG T FP   + V P  G+ V + N HA+   D    H+G PV  G+KW
Sbjct: 350 LNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 404


>gi|323453438|gb|EGB09310.1| hypothetical protein AURANDRAFT_63401 [Aureococcus anophagefferens]
          Length = 572

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 3/52 (5%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWH--NAHANTLLDYW-MYHSGCPVALGNKW 65
           GG T FP+ N+ V P KG AV++H      + +++ W   HSGCPV  G KW
Sbjct: 494 GGGTAFPNANVHVVPAKGQAVYFHFRGPEPDGVVEDWHTEHSGCPVRSGEKW 545



 Score = 46.2 bits (108), Expect = 0.004,   Method: Composition-based stats.
 Identities = 26/75 (34%), Positives = 36/75 (48%), Gaps = 5/75 (6%)

Query: 56  GCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWY---NAHANTLLDY 112
           G P   G +   +L+        GG T FP+ N+ V P KG AV+++         + D+
Sbjct: 473 GSPFVAGGRLATVLM--YCAAADGGGTAFPNANVHVVPAKGQAVYFHFRGPEPDGVVEDW 530

Query: 113 RMYHSGCPVALGNKW 127
              HSGCPV  G KW
Sbjct: 531 HTEHSGCPVRSGEKW 545


>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
 gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
          Length = 480

 Score = 47.8 bits (112), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 32/56 (57%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
           + L +VE GG T+FP L +    +KG AV W+N + +  ++Y   H G PV  G K
Sbjct: 211 IYLNEVEQGGDTVFPELAIGFKAKKGMAVIWNNINPDGSVNYQTLHQGMPVQKGEK 266



 Score = 47.8 bits (112), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/54 (40%), Positives = 31/54 (57%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           L +VE GG T+FP L +    +KG AV W N + +  ++Y+  H G PV  G K
Sbjct: 213 LNEVEQGGDTVFPELAIGFKAKKGMAVIWNNINPDGSVNYQTLHQGMPVQKGEK 266


>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
 gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
          Length = 288

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 50/107 (46%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           + AH +  LD +   +G     G +   LL+  L+DVE GG T+FP+ N           
Sbjct: 175 YEAHFDYFLDEFNTKNG-----GQRTATLLMY-LSDVEEGGETVFPAANMNISAVPWWNE 228

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L++ P+ G+A+ +++   +  LD    H  CPV  GNKW
Sbjct: 229 LSECAKQGLSLKPKMGNALLFWSTRPDATLDPSSLHGSCPVIRGNKW 275



 Score = 42.7 bits (99), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+ N                   L++ P+ G+A+ + +   +  LD  
Sbjct: 202 LSDVEEGGETVFPAANMNISAVPWWNELSECAKQGLSLKPKMGNALLFWSTRPDATLDPS 261

Query: 52  MYHSGCPVALGNKW 65
             H  CPV  GNKW
Sbjct: 262 SLHGSCPVIRGNKW 275


>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 272

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 32/70 (45%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           L+ VE GG T+FP  N               L V P +G A+F+YN   N  +D    H 
Sbjct: 191 LSSVEEGGETMFPFENGRNMNGRYDYETCIGLRVKPRQGDAIFFYNLLPNRTIDQTSLHG 250

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 251 SCPVIKGEKW 260



 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 32/70 (45%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           L+ VE GG T+FP  N               L V P +G A+F++N   N  +D    H 
Sbjct: 191 LSSVEEGGETMFPFENGRNMNGRYDYETCIGLRVKPRQGDAIFFYNLLPNRTIDQTSLHG 250

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 251 SCPVIKGEKW 260


>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
 gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
          Length = 307

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 47/107 (43%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   LL+  L+DVE GG TIFP  N           
Sbjct: 194 YEPHFDYFLDEFNTKNG-----GQRIATLLMY-LSDVEEGGETIFPDANVNASSLPWYNE 247

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P+ G A+ +++      LD    H GCPV  GNKW
Sbjct: 248 LSDCAKRGLSVKPKMGDALLFWSMKPGATLDPLSLHGGCPVIKGNKW 294



 Score = 45.1 bits (105), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 33/74 (44%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG TIFP  N                   L+V P+ G A+ + +      LD  
Sbjct: 221 LSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPGATLDPL 280

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 281 SLHGGCPVIKGNKW 294


>gi|313217217|emb|CBY38368.1| unnamed protein product [Oikopleura dioica]
 gi|313239835|emb|CBY17758.1| unnamed protein product [Oikopleura dioica]
          Length = 521

 Score = 47.8 bits (112), Expect = 0.001,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 31/58 (53%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+ +++  +GG T+F    +   P + SAVFWYN   +   D    H+ CPV  G KW
Sbjct: 436 LAYMSEPNMGGGTVFIQAGIQARPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGTKW 493



 Score = 43.5 bits (101), Expect = 0.025,   Method: Composition-based stats.
 Identities = 19/55 (34%), Positives = 29/55 (52%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +++  +GG T+F    +   P + SAVFW+N   +   D    H+ CPV  G KW
Sbjct: 439 MSEPNMGGGTVFIQAGIQARPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGTKW 493


>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 296

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G + G LL+  L DVE GG T+FP +  ++ P++G A+++   +   + D    H+  P+
Sbjct: 215 GQRMGTLLMY-LKDVEGGGETVFPQVGWSIVPQRGHALYFEYGNRYGMCDPSSLHASTPL 273

Query: 122 ALGNKW 127
             G+KW
Sbjct: 274 RTGDKW 279



 Score = 44.7 bits (104), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 18/55 (32%), Positives = 32/55 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T+FP +  ++ P++G A+++   +   + D    H+  P+  G+KW
Sbjct: 225 LKDVEGGGETVFPQVGWSIVPQRGHALYFEYGNRYGMCDPSSLHASTPLRTGDKW 279


>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 315

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 47/107 (43%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   +L+  L+D+E GG TIFP  N           
Sbjct: 202 YEPHFDYFLDEFNTKNG-----GQRMATILMY-LSDIEEGGETIFPDANVNSSSLPWYNE 255

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L V P+ G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 256 LSECARKGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKW 302



 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 33/74 (44%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+D+E GG TIFP  N                   L V P+ G A+ + +   +  LD  
Sbjct: 229 LSDIEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDPL 288

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 289 SLHGGCPVIKGNKW 302


>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
 gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
          Length = 293

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 28/49 (57%)

Query: 79  GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           GGAT FP + L V P KG+AV +     +  LD R  H+G PV  G KW
Sbjct: 230 GGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAGEKW 278



 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 27/49 (55%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GGAT FP + L V P KG+AV +     +  LD    H+G PV  G KW
Sbjct: 230 GGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAGEKW 278


>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
 gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
          Length = 297

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G + G LL+  L DVE GG T+FP +  +V P +G A+++   +   + D    H+  P+
Sbjct: 215 GQRVGTLLMY-LNDVEGGGETVFPQVGCSVVPRRGQALYFEYCNRAGVCDPASLHASTPL 273

Query: 122 ALGNKW 127
             G KW
Sbjct: 274 RSGEKW 279



 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 30/55 (54%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T+FP +  +V P +G A+++   +   + D    H+  P+  G KW
Sbjct: 225 LNDVEGGGETVFPQVGCSVVPRRGQALYFEYCNRAGVCDPASLHASTPLRSGEKW 279


>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
 gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
          Length = 256

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 43/98 (43%), Gaps = 21/98 (21%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNL------------------- 89
           DY  +H       G +    +L  L+DVE GG T+FPS  +                   
Sbjct: 149 DY--FHDKVNTKNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGV 206

Query: 90  TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +V P KG A+ +++   +  LD    H GCPV  GNKW
Sbjct: 207 SVKPRKGDALLFWSMSPDAELDPFSLHGGCPVIKGNKW 244



 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLNL-------------------TVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FPS  +                   +V P KG A+ + +   +  LD +
Sbjct: 171 LSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGVSVKPRKGDALLFWSMSPDAELDPF 230

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 231 SLHGGCPVIKGNKW 244


>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 316

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 39/98 (39%), Gaps = 20/98 (20%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFP------------------SLNL 89
            DY  +H    + LG      +L  L++VE GG T+FP                      
Sbjct: 158 FDY--FHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTECAKQGY 215

Query: 90  TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            V P KG A+ ++N H N   D    H  CPV  G KW
Sbjct: 216 AVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKW 253



 Score = 42.0 bits (97), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 29/73 (39%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFP------------------SLNLTVFPEKGSAVFWHNAHANTLLDYWM 52
           L++VE GG T+FP                       V P KG A+ + N H N   D   
Sbjct: 181 LSNVEKGGETVFPMWKGKTTQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNS 240

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 241 LHGSCPVVEGEKW 253


>gi|397567288|gb|EJK45500.1| hypothetical protein THAOC_35883, partial [Thalassiosira oceanica]
          Length = 664

 Score = 47.8 bits (112), Expect = 0.001,   Method: Composition-based stats.
 Identities = 27/58 (46%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNA--HANTLLDYRMYHSGCPVALGNKWG 128
           L DVE GG T FP L++TV P+KGSA+ W +    A    D R  H   PV  G K+G
Sbjct: 413 LNDVEEGGGTHFPLLDITVQPKKGSALLWPSVLDEAPESKDGRTDHEALPVIKGLKYG 470



 Score = 45.1 bits (105), Expect = 0.011,   Method: Composition-based stats.
 Identities = 26/60 (43%), Positives = 33/60 (55%), Gaps = 2/60 (3%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNA--HANTLLDYWMYHSGCPVALGNKWG 66
           + L DVE GG T FP L++TV P+KGSA+ W +    A    D    H   PV  G K+G
Sbjct: 411 IYLNDVEEGGGTHFPLLDITVQPKKGSALLWPSVLDEAPESKDGRTDHEALPVIKGLKYG 470


>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
 gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
          Length = 316

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 39/98 (39%), Gaps = 20/98 (20%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFP------------------SLNL 89
            DY  +H    + LG      +L  L++VE GG T+FP                      
Sbjct: 158 FDY--FHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGY 215

Query: 90  TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            V P KG A+ ++N H N   D    H  CPV  G KW
Sbjct: 216 AVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKW 253



 Score = 42.0 bits (97), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 29/73 (39%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFP------------------SLNLTVFPEKGSAVFWHNAHANTLLDYWM 52
           L++VE GG T+FP                       V P KG A+ + N H N   D   
Sbjct: 181 LSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNS 240

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 241 LHGSCPVVEGEKW 253


>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
 gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
          Length = 196

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/66 (42%), Positives = 33/66 (50%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+ G L+L  L  VE GG T FP L L V P  G  VF+ N +          H+G PV
Sbjct: 120 GNRQGTLILY-LNAVEEGGETAFPELGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPV 178

Query: 122 ALGNKW 127
             G KW
Sbjct: 179 RKGEKW 184



 Score = 43.5 bits (101), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 29/64 (45%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L  VE GG T FP L L V P  G  VF+ N +          H+G PV  
Sbjct: 121 NRQGTLILYLNAVEEGGETAFPELGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRK 180

Query: 62  GNKW 65
           G KW
Sbjct: 181 GEKW 184


>gi|449284064|gb|EMC90646.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Columba livia]
          Length = 174

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   L++  L+ V  GG+T F   NL+V   K +A+FW+N   N   D    H+GCPV
Sbjct: 87  GNRIATLMIY-LSAVGAGGSTAFVHANLSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPV 145

Query: 122 ALGNKW 127
             G+KW
Sbjct: 146 LAGDKW 151



 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 26/64 (40%), Positives = 34/64 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ V  GG+T F   NL+V   K +A+FW N   N   D    H+GCPV  
Sbjct: 88  NRIATLMIYLSAVGAGGSTAFVHANLSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLA 147

Query: 62  GNKW 65
           G+KW
Sbjct: 148 GDKW 151


>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
 gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
          Length = 256

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 43/98 (43%), Gaps = 21/98 (21%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNL------------------- 89
           DY  +H       G +    +L  L+DVE GG T+FPS  +                   
Sbjct: 149 DY--FHDKVNTKNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGV 206

Query: 90  TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +V P KG A+ +++   +  LD    H GCPV  GNKW
Sbjct: 207 SVKPRKGDALLFWSMSPDAELDPFSLHGGCPVIKGNKW 244



 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLNL-------------------TVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FPS  +                   +V P KG A+ + +   +  LD +
Sbjct: 171 LSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGVSVKPRKGDALLFWSMSPDAELDPF 230

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 231 SLHGGCPVIKGNKW 244


>gi|298712929|emb|CBJ26831.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 294

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 32/55 (58%), Gaps = 1/55 (1%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV  GG T FP L + V P KG AV ++ A  + +LD R+ H+  P A+  KW
Sbjct: 219 LNDVPSGGCTAFPQLGMKVQPRKGMAVVFFPATLDGVLDSRLLHAAEP-AIDTKW 272



 Score = 43.5 bits (101), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L DV  GG T FP L + V P KG AV +  A  + +LD  + H+  P A+  KW
Sbjct: 217 VYLNDVPSGGCTAFPQLGMKVQPRKGMAVVFFPATLDGVLDSRLLHAAEP-AIDTKW 272


>gi|37912909|gb|AAR05245.1| conserved hypothetical protein [uncultured marine proteobacterium
           ANT32C12]
          Length = 186

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 32/57 (56%), Gaps = 1/57 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKWG 66
           L DVE GGAT FP ++++V P KG  V +HN    T  ++    H G PV  G KW 
Sbjct: 119 LNDVEDGGATDFPDIHVSVKPNKGDVVVFHNCKDGTSDINPNSLHGGSPVISGEKWA 175



 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/60 (41%), Positives = 34/60 (56%), Gaps = 1/60 (1%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 128
           L+ L DVE GGAT FP ++++V P KG  V ++N    T  ++    H G PV  G KW 
Sbjct: 116 LAYLNDVEDGGATDFPDIHVSVKPNKGDVVVFHNCKDGTSDINPNSLHGGSPVISGEKWA 175


>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
 gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 316

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 39/98 (39%), Gaps = 20/98 (20%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFP------------------SLNL 89
            DY  +H    + LG      +L  L++VE GG T+FP                      
Sbjct: 158 FDY--FHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGY 215

Query: 90  TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            V P KG A+ ++N H N   D    H  CPV  G KW
Sbjct: 216 AVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKW 253



 Score = 42.0 bits (97), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 29/73 (39%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFP------------------SLNLTVFPEKGSAVFWHNAHANTLLDYWM 52
           L++VE GG T+FP                       V P KG A+ + N H N   D   
Sbjct: 181 LSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNS 240

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 241 LHGSCPVVEGEKW 253


>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 274

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 32/70 (45%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           L+ VE GG T+FP  N               L V P +G A+F+YN   N  +D    H 
Sbjct: 193 LSSVEEGGETMFPFENGRNMNGRYDYEKCVGLKVKPRQGDAIFFYNLFPNGTIDQTSLHG 252

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 253 SCPVIKGEKW 262



 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 32/70 (45%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           L+ VE GG T+FP  N               L V P +G A+F++N   N  +D    H 
Sbjct: 193 LSSVEEGGETMFPFENGRNMNGRYDYEKCVGLKVKPRQGDAIFFYNLFPNGTIDQTSLHG 252

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 253 SCPVIKGEKW 262


>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 332

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 39/98 (39%), Gaps = 20/98 (20%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFP------------------SLNL 89
            DY  +H    + LG      +L  L++VE GG T+FP                      
Sbjct: 174 FDY--FHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGY 231

Query: 90  TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            V P KG A+ ++N H N   D    H  CPV  G KW
Sbjct: 232 AVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKW 269



 Score = 42.4 bits (98), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 29/73 (39%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFP------------------SLNLTVFPEKGSAVFWHNAHANTLLDYWM 52
           L++VE GG T+FP                       V P KG A+ + N H N   D   
Sbjct: 197 LSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNS 256

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 257 LHGSCPVVEGEKW 269


>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 683

 Score = 47.8 bits (112), Expect = 0.002,   Method: Composition-based stats.
 Identities = 32/96 (33%), Positives = 44/96 (45%), Gaps = 23/96 (23%)

Query: 11  LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
           L+DVE GG T+FP+               L+V P+ G A+ + +   +  LD    H G 
Sbjct: 467 LSDVEEGGETVFPNAKPNINSVSKYPEKGLSVKPKMGDALLFRSMKPDGTLDTSSLHGGS 526

Query: 58  PVALGNKWG----------KLLLSGLTDVELGGATI 83
           PV  GNKW           K+L + L  V+ GG TI
Sbjct: 527 PVIRGNKWASTKWLHLTEFKVLGTALPVVDDGGTTI 562



 Score = 46.2 bits (108), Expect = 0.005,   Method: Composition-based stats.
 Identities = 28/85 (32%), Positives = 38/85 (44%), Gaps = 13/85 (15%)

Query: 11  LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
           L+DVE GG T+FP                L+V P+ G A+ + +   +  LD    H G 
Sbjct: 237 LSDVEEGGETVFPDAQPNFTSVSKCSGDGLSVKPKMGDALLFWSMKPDGTLDTSSLHGGS 296

Query: 58  PVALGNKWGKLLLSGLTDVELGGAT 82
           PV  GNKW       L + +L G T
Sbjct: 297 PVIRGNKWASTKWLHLRECKLSGTT 321



 Score = 45.4 bits (106), Expect = 0.007,   Method: Composition-based stats.
 Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 13/73 (17%)

Query: 69  LLSGLTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMY 115
           +L  L+DVE GG T+FP                L+V P+ G A+ +++   +  LD    
Sbjct: 233 VLMYLSDVEEGGETVFPDAQPNFTSVSKCSGDGLSVKPKMGDALLFWSMKPDGTLDTSSL 292

Query: 116 HSGCPVALGNKWG 128
           H G PV  GNKW 
Sbjct: 293 HGGSPVIRGNKWA 305



 Score = 44.3 bits (103), Expect = 0.017,   Method: Composition-based stats.
 Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 13/73 (17%)

Query: 69  LLSGLTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMY 115
           +L  L+DVE GG T+FP+               L+V P+ G A+ + +   +  LD    
Sbjct: 463 VLMYLSDVEEGGETVFPNAKPNINSVSKYPEKGLSVKPKMGDALLFRSMKPDGTLDTSSL 522

Query: 116 HSGCPVALGNKWG 128
           H G PV  GNKW 
Sbjct: 523 HGGSPVIRGNKWA 535


>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
 gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
          Length = 283

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 39/95 (41%), Gaps = 17/95 (17%)

Query: 50  YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN-----------------LTVF 92
           Y  +H     ALG      +L  L+D   GG T+FPS                   + V 
Sbjct: 130 YDYFHDKYNQALGGHRIATVLMYLSDAVKGGETVFPSSEEDTTVKDDSWSDCAKKGIAVK 189

Query: 93  PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           P KG A+ +Y+ H +   D    H GCPV  G KW
Sbjct: 190 PRKGDALLFYSLHPDATPDESSLHGGCPVIEGEKW 224



 Score = 42.0 bits (97), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 31/72 (43%), Gaps = 17/72 (23%)

Query: 11  LTDVELGGATIFPSLN-----------------LTVFPEKGSAVFWHNAHANTLLDYWMY 53
           L+D   GG T+FPS                   + V P KG A+ +++ H +   D    
Sbjct: 153 LSDAVKGGETVFPSSEEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYSLHPDATPDESSL 212

Query: 54  HSGCPVALGNKW 65
           H GCPV  G KW
Sbjct: 213 HGGCPVIEGEKW 224


>gi|321466285|gb|EFX77281.1| hypothetical protein DAPPUDRAFT_106233 [Daphnia pulex]
          Length = 128

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 35/66 (53%), Gaps = 3/66 (4%)

Query: 65  WGKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           WG  + + L   +DVE G AT+FP     V+P KGSA F YN + N   +    H+  PV
Sbjct: 38  WGNRIATCLFYMSDVEAGSATVFPPTGAAVWPRKGSAAFCYNLYPNDKGNEFTRHATFPV 97

Query: 122 ALGNKW 127
              +KW
Sbjct: 98  IFLSKW 103



 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR       ++DVE G AT+FP     V+P KGSA F +N + N   + +  H+  PV  
Sbjct: 40  NRIATCLFYMSDVEAGSATVFPPTGAAVWPRKGSAAFCYNLYPNDKGNEFTRHATFPVIF 99

Query: 62  GNKW 65
            +KW
Sbjct: 100 LSKW 103


>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
 gi|255641119|gb|ACU20838.1| unknown [Glycine max]
          Length = 297

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 38/90 (42%), Gaps = 21/90 (23%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLN---------------------LTVFPEKGS 97
           +A G      +L  LTDV  GG T+FPS                       + V P +G 
Sbjct: 147 IARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAVKPRRGD 206

Query: 98  AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           A+ +++ H N   D    H+GCPV  G KW
Sbjct: 207 ALLFFSLHTNATPDTSSLHAGCPVIEGEKW 236



 Score = 44.7 bits (104), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 32/76 (42%), Gaps = 21/76 (27%)

Query: 11  LTDVELGGATIFPSLN---------------------LTVFPEKGSAVFWHNAHANTLLD 49
           LTDV  GG T+FPS                       + V P +G A+ + + H N   D
Sbjct: 161 LTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAVKPRRGDALLFFSLHTNATPD 220

Query: 50  YWMYHSGCPVALGNKW 65
               H+GCPV  G KW
Sbjct: 221 TSSLHAGCPVIEGEKW 236


>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
           Arabidopsis thaliana
 gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
 gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
           thaliana]
 gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 291

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 73  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYR 113
           L+DV+ GG T+FP+                     L+V P+K  A+ ++N   +  LD  
Sbjct: 205 LSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASLDPS 264

Query: 114 MYHSGCPVALGNKW 127
             H GCPV  GNKW
Sbjct: 265 SLHGGCPVVKGNKW 278



 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DV+ GG T+FP+                     L+V P+K  A+ + N   +  LD  
Sbjct: 205 LSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASLDPS 264

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 265 SLHGGCPVVKGNKW 278


>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 263

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 43/97 (44%), Gaps = 19/97 (19%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN-----------------LT 90
            DY  +H     ALG      +L  L+DV  GG T+FPS                   + 
Sbjct: 108 FDY--FHDKNNQALGGHRIATVLMYLSDVVKGGETVFPSSEDRGGPKDDSWSACGKTGVA 165

Query: 91  VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           V P KG A+ +++ H + + D    H+GCPV  G KW
Sbjct: 166 VKPRKGDALLFFSLHPSAVPDESSLHTGCPVIEGEKW 202



 Score = 42.7 bits (99), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 33/72 (45%), Gaps = 17/72 (23%)

Query: 11  LTDVELGGATIFPSLN-----------------LTVFPEKGSAVFWHNAHANTLLDYWMY 53
           L+DV  GG T+FPS                   + V P KG A+ + + H + + D    
Sbjct: 131 LSDVVKGGETVFPSSEDRGGPKDDSWSACGKTGVAVKPRKGDALLFFSLHPSAVPDESSL 190

Query: 54  HSGCPVALGNKW 65
           H+GCPV  G KW
Sbjct: 191 HTGCPVIEGEKW 202


>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 306

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 34/73 (46%), Gaps = 18/73 (24%)

Query: 73  LTDVELGGATIFPSLNL------------------TVFPEKGSAVFWYNAHANTLLDYRM 114
           L+DV++GG T+FP+ +                    V P KG AV +++ H N   D   
Sbjct: 166 LSDVKMGGETVFPNSDAKTLQPKDDTQSECARRGYAVKPVKGDAVLFFSLHPNGTTDRDS 225

Query: 115 YHSGCPVALGNKW 127
            H GCPV  G KW
Sbjct: 226 LHGGCPVIEGEKW 238



 Score = 45.1 bits (105), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 33/73 (45%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSLNL------------------TVFPEKGSAVFWHNAHANTLLDYWM 52
           L+DV++GG T+FP+ +                    V P KG AV + + H N   D   
Sbjct: 166 LSDVKMGGETVFPNSDAKTLQPKDDTQSECARRGYAVKPVKGDAVLFFSLHPNGTTDRDS 225

Query: 53  YHSGCPVALGNKW 65
            H GCPV  G KW
Sbjct: 226 LHGGCPVIEGEKW 238


>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 289

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 49/107 (45%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           + AH +  LD +   +G     G +   LL+  L+DVE GG T+FP+             
Sbjct: 176 YEAHYDYFLDEFNTKNG-----GQRTATLLMY-LSDVEEGGETVFPAAKANISNVPSWNE 229

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P+ G+A+ +++   +  LD    H  CPV  GNKW
Sbjct: 230 LSECARQGLSVKPKMGNALLFWSTRPDATLDPASLHGSCPVIRGNKW 276



 Score = 42.4 bits (98), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+                     L+V P+ G+A+ + +   +  LD  
Sbjct: 203 LSDVEEGGETVFPAAKANISNVPSWNELSECARQGLSVKPKMGNALLFWSTRPDATLDPA 262

Query: 52  MYHSGCPVALGNKW 65
             H  CPV  GNKW
Sbjct: 263 SLHGSCPVIRGNKW 276


>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
 gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
          Length = 302

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 7/68 (10%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLDYRMYHSGC 119
           G + G L++  L   E GG T FP ++L V P++G+AVF  +   H +T    R  H G 
Sbjct: 228 GQRVGTLVMY-LNTPEKGGGTTFPDVHLEVAPQRGNAVFFSYERPHPST----RTLHGGA 282

Query: 120 PVALGNKW 127
           PV  G KW
Sbjct: 283 PVIAGEKW 290



 Score = 43.5 bits (101), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 30/57 (52%), Gaps = 6/57 (10%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
           L   E GG T FP ++L V P++G+AVF  +   H +T       H G PV  G KW
Sbjct: 238 LNTPEKGGGTTFPDVHLEVAPQRGNAVFFSYERPHPSTR----TLHGGAPVIAGEKW 290


>gi|332140647|ref|YP_004426385.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Deep ecotype']
 gi|327550669|gb|AEA97387.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Deep ecotype']
          Length = 376

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 34/66 (51%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G +  K  L  L  +  GG T+FP L++ + P+ G+ V + N+  N  L    YH+G P 
Sbjct: 287 GKQRIKTALVYLNTISEGGETLFPKLDIRIAPKSGTMVVFSNSDENGKLLLNSYHAGAPT 346

Query: 122 ALGNKW 127
              NKW
Sbjct: 347 VSENKW 352



 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 34/64 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
            R + + V L  +  GG T+FP L++ + P+ G+ V + N+  N  L    YH+G P   
Sbjct: 289 QRIKTALVYLNTISEGGETLFPKLDIRIAPKSGTMVVFSNSDENGKLLLNSYHAGAPTVS 348

Query: 62  GNKW 65
            NKW
Sbjct: 349 ENKW 352


>gi|219126281|ref|XP_002183389.1| hypothetical protein PHATRDRAFT_48891 [Phaeodactylum tricornutum
           CCAP 1055/1]
 gi|217405145|gb|EEC45089.1| hypothetical protein PHATRDRAFT_48891 [Phaeodactylum tricornutum
           CCAP 1055/1]
          Length = 427

 Score = 47.4 bits (111), Expect = 0.002,   Method: Composition-based stats.
 Identities = 27/58 (46%), Positives = 32/58 (55%), Gaps = 2/58 (3%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN--TLLDYRMYHSGCPVALGNKWG 128
           L+DVE GG T FP L LTV P+KG AV W +   +     D R  H   PV  G K+G
Sbjct: 352 LSDVEEGGGTNFPKLGLTVTPKKGRAVLWPSVLDDEPNQKDARSDHQALPVIKGVKYG 409



 Score = 44.3 bits (103), Expect = 0.015,   Method: Composition-based stats.
 Identities = 26/58 (44%), Positives = 31/58 (53%), Gaps = 2/58 (3%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHAN--TLLDYWMYHSGCPVALGNKWG 66
           L+DVE GG T FP L LTV P+KG AV W +   +     D    H   PV  G K+G
Sbjct: 352 LSDVEEGGGTNFPKLGLTVTPKKGRAVLWPSVLDDEPNQKDARSDHQALPVIKGVKYG 409


>gi|347527209|ref|YP_004833956.1| putative oxygenase [Sphingobium sp. SYK-6]
 gi|345135890|dbj|BAK65499.1| putative oxygenase [Sphingobium sp. SYK-6]
          Length = 229

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 42/88 (47%), Gaps = 12/88 (13%)

Query: 52  MYHSGCPVALGN--KWGKLLLSG----------LTDVELGGATIFPSLNLTVFPEKGSAV 99
            YH  C    GN   W  + +SG          L DVE GG T FP L LTV P  G+ +
Sbjct: 125 QYHLHCDYFPGNVHYWPAMRVSGGQRCWTAMAYLCDVEEGGETHFPRLGLTVPPRAGTLL 184

Query: 100 FWYNAHANTLLDYRMYHSGCPVALGNKW 127
            W N +A+   ++   H+  PV+ G K+
Sbjct: 185 MWNNMNADGSPNHDTLHAALPVSRGVKY 212



 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 33/55 (60%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T FP L LTV P  G+ + W+N +A+   ++   H+  PV+ G K+
Sbjct: 158 LCDVEEGGETHFPRLGLTVPPRAGTLLMWNNMNADGSPNHDTLHAALPVSRGVKY 212


>gi|410860761|ref|YP_006975995.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii AltDE1]
 gi|410818023|gb|AFV84640.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii AltDE1]
          Length = 376

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 34/66 (51%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G +  K  L  L  +  GG T+FP L++ + P+ G+ V + N+  N  L    YH+G P 
Sbjct: 287 GKQRIKTALVYLNTINEGGETLFPKLDIRIAPKSGTMVVFSNSDENGKLLLNSYHAGAPT 346

Query: 122 ALGNKW 127
              NKW
Sbjct: 347 VSENKW 352



 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 34/64 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
            R + + V L  +  GG T+FP L++ + P+ G+ V + N+  N  L    YH+G P   
Sbjct: 289 QRIKTALVYLNTINEGGETLFPKLDIRIAPKSGTMVVFSNSDENGKLLLNSYHAGAPTVS 348

Query: 62  GNKW 65
            NKW
Sbjct: 349 ENKW 352


>gi|308497208|ref|XP_003110791.1| CRE-DPY-18 protein [Caenorhabditis remanei]
 gi|308242671|gb|EFO86623.1| CRE-DPY-18 protein [Caenorhabditis remanei]
          Length = 559

 Score = 47.4 bits (111), Expect = 0.002,   Method: Composition-based stats.
 Identities = 23/66 (34%), Positives = 33/66 (50%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++    GG T+F  +  TV P K  A+FWYN       +    H+ CPV
Sbjct: 440 GNRIATVLFY-MSQPSHGGGTVFTEVKSTVLPTKNDALFWYNLFKQGDGNPDTRHAACPV 498

Query: 122 ALGNKW 127
            +G KW
Sbjct: 499 LVGIKW 504



 Score = 43.9 bits (102), Expect = 0.019,   Method: Composition-based stats.
 Identities = 19/49 (38%), Positives = 26/49 (53%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GG T+F  +  TV P K  A+FW+N       +    H+ CPV +G KW
Sbjct: 456 GGGTVFTEVKSTVLPTKNDALFWYNLFKQGDGNPDTRHAACPVLVGIKW 504


>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 291

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 73  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYR 113
           L+DV+ GG T+FP+                     L+V P+K  A+ ++N   +  LD  
Sbjct: 205 LSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASLDPS 264

Query: 114 MYHSGCPVALGNKW 127
             H GCPV  GNKW
Sbjct: 265 SLHGGCPVVKGNKW 278



 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DV+ GG T+FP+                     L+V P+K  A+ + N   +  LD  
Sbjct: 205 LSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASLDPS 264

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 265 SLHGGCPVVKGNKW 278


>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
          Length = 313

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 18/83 (21%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSL-----------------NLTVFPEKGSAVFWYNA 104
           GN++  +L+  L  VE GG T+FP++                 +L   P KGSAV +++ 
Sbjct: 145 GNRYATVLMY-LNTVEEGGETVFPNIPAPGGDNGPTFTECARRHLAAKPTKGSAVLFHSI 203

Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
             +  L+ R  H+ CPV  G KW
Sbjct: 204 KPSGDLERRSLHTACPVVKGEKW 226



 Score = 43.1 bits (100), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 33/72 (45%), Gaps = 17/72 (23%)

Query: 11  LTDVELGGATIFPSL-----------------NLTVFPEKGSAVFWHNAHANTLLDYWMY 53
           L  VE GG T+FP++                 +L   P KGSAV +H+   +  L+    
Sbjct: 155 LNTVEEGGETVFPNIPAPGGDNGPTFTECARRHLAAKPTKGSAVLFHSIKPSGDLERRSL 214

Query: 54  HSGCPVALGNKW 65
           H+ CPV  G KW
Sbjct: 215 HTACPVVKGEKW 226


>gi|114799222|ref|YP_760562.1| 2OG-Fe(II) oxygenase [Hyphomonas neptunium ATCC 15444]
 gi|114739396|gb|ABI77521.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Hyphomonas neptunium
           ATCC 15444]
          Length = 298

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 32/57 (56%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + L DVE GG T F  L  T+ PEKG AV W+N + +  ++ +  H G  V  G K+
Sbjct: 219 IYLNDVEEGGGTRFRRLEKTIMPEKGKAVIWNNLNPDGSVNPYTIHHGMKVRSGAKY 275



 Score = 44.7 bits (104), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 34/75 (45%), Gaps = 2/75 (2%)

Query: 53  YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
           Y   C       W  ++   L DVE GG T F  L  T+ PEKG AV W N + +  ++ 
Sbjct: 203 YQVHCQFTGQRTWTFMIY--LNDVEEGGGTRFRRLEKTIMPEKGKAVIWNNLNPDGSVNP 260

Query: 113 RMYHSGCPVALGNKW 127
              H G  V  G K+
Sbjct: 261 YTIHHGMKVRSGAKY 275


>gi|37496185|emb|CAE47803.1| Prolyl 4-hydroxylase alpha subunit [Sus scrofa]
          Length = 263

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 26/31 (83%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYN 103
           ++DV  GGAT+FP +  +V+P+KG+AVFWYN
Sbjct: 226 MSDVSAGGATVFPEVGASVWPKKGTAVFWYN 256



 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 26/31 (83%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHN 41
           ++DV  GGAT+FP +  +V+P+KG+AVFW+N
Sbjct: 226 MSDVSAGGATVFPEVGASVWPKKGTAVFWYN 256


>gi|313243209|emb|CBY39868.1| unnamed protein product [Oikopleura dioica]
          Length = 430

 Score = 47.4 bits (111), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 31/58 (53%)

Query: 70  LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L+ +++  +GG T+F    +   P + SAVFWYN   +   D    H+ CPV  G KW
Sbjct: 345 LAYMSEPNMGGGTVFIQAGIQARPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGTKW 402



 Score = 43.1 bits (100), Expect = 0.035,   Method: Composition-based stats.
 Identities = 19/55 (34%), Positives = 29/55 (52%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           +++  +GG T+F    +   P + SAVFW+N   +   D    H+ CPV  G KW
Sbjct: 348 MSEPNMGGGTVFIQAGIQARPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGTKW 402


>gi|156083831|ref|XP_001609399.1| prolyl 4-hydroxylase alpha-related protein PH4 [Babesia bovis T2Bo]
 gi|154796650|gb|EDO05831.1| prolyl 4-hydroxylase alpha-related protein PH4, putative [Babesia
           bovis]
          Length = 241

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 35/57 (61%), Gaps = 3/57 (5%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG-CPVALGNKWG 128
           L DV+ GG T+FP L + V P   SA++W N   + + D+RM H+G CP + G K+ 
Sbjct: 174 LNDVD-GGETVFPELKIAVRPVGNSALYWKNTTEDGVADFRMVHAGVCPKS-GTKYA 228



 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 34/57 (59%), Gaps = 3/57 (5%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSG-CPVALGNKWG 66
           L DV+ GG T+FP L + V P   SA++W N   + + D+ M H+G CP + G K+ 
Sbjct: 174 LNDVD-GGETVFPELKIAVRPVGNSALYWKNTTEDGVADFRMVHAGVCPKS-GTKYA 228


>gi|607947|gb|AAA62207.1| prolyl 4-hydroxylase alpha subunit [Caenorhabditis elegans]
          Length = 558

 Score = 47.0 bits (110), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++    GG T+F     T+ P K  A+FWYN +     +    H+ CPV
Sbjct: 439 GNRIATVLFY-MSQPSHGGGTVFTEAKSTILPTKNDALFWYNLYKQGDGNPDTRHAACPV 497

Query: 122 ALGNKW 127
            +G KW
Sbjct: 498 LVGIKW 503



 Score = 43.9 bits (102), Expect = 0.023,   Method: Composition-based stats.
 Identities = 18/49 (36%), Positives = 26/49 (53%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GG T+F     T+ P K  A+FW+N +     +    H+ CPV +G KW
Sbjct: 455 GGGTVFTEAKSTILPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 503


>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
 gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
          Length = 297

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 41/98 (41%), Gaps = 20/98 (20%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------L 89
            DY  +H    V  G +    +L  LTDV+ GG T+FP+                    L
Sbjct: 137 FDY--FHDKNNVKHGGQRFATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGL 194

Query: 90  TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            V P+KG A+ ++  H N   D    H  CPV  G KW
Sbjct: 195 AVKPKKGDALLFFGLHLNATTDTSSLHGSCPVIEGEKW 232



 Score = 42.7 bits (99), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 31/73 (42%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
           LTDV+ GG T+FP+                    L V P+KG A+ +   H N   D   
Sbjct: 160 LTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFGLHLNATTDTSS 219

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 220 LHGSCPVIEGEKW 232


>gi|17552840|ref|NP_499464.1| Protein DPY-18 [Caenorhabditis elegans]
 gi|20455505|sp|Q10576.2|P4HA1_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; AltName: Full=Protein dumpy-18; Flags:
           Precursor
 gi|3881011|emb|CAA21045.1| Protein DPY-18 [Caenorhabditis elegans]
 gi|6900013|emb|CAB71298.1| prolyl 4-hydroxylase alpha subunit 1 [Caenorhabditis elegans]
          Length = 559

 Score = 47.0 bits (110), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L   ++    GG T+F     T+ P K  A+FWYN +     +    H+ CPV
Sbjct: 440 GNRIATVLFY-MSQPSHGGGTVFTEAKSTILPTKNDALFWYNLYKQGDGNPDTRHAACPV 498

Query: 122 ALGNKW 127
            +G KW
Sbjct: 499 LVGIKW 504



 Score = 43.9 bits (102), Expect = 0.023,   Method: Composition-based stats.
 Identities = 18/49 (36%), Positives = 26/49 (53%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GG T+F     T+ P K  A+FW+N +     +    H+ CPV +G KW
Sbjct: 456 GGGTVFTEAKSTILPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 504


>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
 gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
          Length = 298

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 43/111 (38%), Gaps = 23/111 (20%)

Query: 38  FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNL-------- 89
           + H    +   DY  +H    +  G      +L  L++V  GG T+FP   +        
Sbjct: 129 YEHGQKYDAHFDY--FHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSE 186

Query: 90  -------------TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                         V P KG A+ ++N H + + D    H GCPV  G KW
Sbjct: 187 NEEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKW 237



 Score = 41.6 bits (96), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 31/76 (40%), Gaps = 21/76 (27%)

Query: 11  LTDVELGGATIFPSLNL---------------------TVFPEKGSAVFWHNAHANTLLD 49
           L++V  GG T+FP   +                      V P KG A+ + N H + + D
Sbjct: 162 LSNVTKGGETVFPDAEIPSRRVLSENEEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPD 221

Query: 50  YWMYHSGCPVALGNKW 65
               H GCPV  G KW
Sbjct: 222 PLSLHGGCPVIEGEKW 237


>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
          Length = 1062

 Score = 47.0 bits (110), Expect = 0.002,   Method: Composition-based stats.
 Identities = 26/77 (33%), Positives = 35/77 (45%), Gaps = 18/77 (23%)

Query: 69  LLSGLTDVELGGATIFPSL------------------NLTVFPEKGSAVFWYNAHANTLL 110
           +L  LTDV+ GG T+FP+                    L V P+KG A+ +++ H N   
Sbjct: 158 VLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNATT 217

Query: 111 DYRMYHSGCPVALGNKW 127
           D    H  CPV  G KW
Sbjct: 218 DPASLHGSCPVIEGEKW 234



 Score = 44.7 bits (104), Expect = 0.011,   Method: Composition-based stats.
 Identities = 25/73 (34%), Positives = 32/73 (43%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSL------------------NLTVFPEKGSAVFWHNAHANTLLDYWM 52
           LTDV+ GG T+FP+                    L V P+KG A+ + + H N   D   
Sbjct: 162 LTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNATTDPAS 221

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 222 LHGSCPVIEGEKW 234


>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
 gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
          Length = 303

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 7/68 (10%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLDYRMYHSGC 119
           G + G L++  L   + GG T FP ++L V P++G+AVF  +   H +T    R  H G 
Sbjct: 229 GQRVGTLVMY-LNTPDKGGGTTFPDVHLEVAPQRGNAVFFSYERPHPST----RTLHGGA 283

Query: 120 PVALGNKW 127
           PV  G+KW
Sbjct: 284 PVIAGDKW 291



 Score = 43.1 bits (100), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 29/51 (56%), Gaps = 6/51 (11%)

Query: 17  GGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
           GG T FP ++L V P++G+AVF  +   H +T       H G PV  G+KW
Sbjct: 245 GGGTTFPDVHLEVAPQRGNAVFFSYERPHPSTR----TLHGGAPVIAGDKW 291


>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
 gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 298

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 43/111 (38%), Gaps = 23/111 (20%)

Query: 38  FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNL-------- 89
           + H    +   DY  +H    +  G      +L  L++V  GG T+FP   +        
Sbjct: 129 YEHGQKYDAHFDY--FHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSE 186

Query: 90  -------------TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                         V P KG A+ ++N H + + D    H GCPV  G KW
Sbjct: 187 NKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKW 237



 Score = 41.6 bits (96), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 31/76 (40%), Gaps = 21/76 (27%)

Query: 11  LTDVELGGATIFPSLNL---------------------TVFPEKGSAVFWHNAHANTLLD 49
           L++V  GG T+FP   +                      V P KG A+ + N H + + D
Sbjct: 162 LSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPD 221

Query: 50  YWMYHSGCPVALGNKW 65
               H GCPV  G KW
Sbjct: 222 PLSLHGGCPVIEGEKW 237


>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 294

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 37/90 (41%), Gaps = 21/90 (23%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLN---------------------LTVFPEKGS 97
           +A G      +L  LT+VE GG T+FP                        + V P KG 
Sbjct: 144 IARGGHRLATVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKKGIPVKPRKGD 203

Query: 98  AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           A+ +Y+ H N   D    H GCPV  G KW
Sbjct: 204 ALLFYSLHPNATPDPLSLHGGCPVIQGEKW 233



 Score = 45.1 bits (105), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 32/76 (42%), Gaps = 21/76 (27%)

Query: 11  LTDVELGGATIFPSLN---------------------LTVFPEKGSAVFWHNAHANTLLD 49
           LT+VE GG T+FP                        + V P KG A+ +++ H N   D
Sbjct: 158 LTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKKGIPVKPRKGDALLFYSLHPNATPD 217

Query: 50  YWMYHSGCPVALGNKW 65
               H GCPV  G KW
Sbjct: 218 PLSLHGGCPVIQGEKW 233


>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
          Length = 302

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 39/89 (43%), Gaps = 19/89 (21%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAV 99
           V  G + G  +L  L+DVE GG T+FP                      L+V P+ G A+
Sbjct: 202 VKNGGQRGATMLMYLSDVEKGGETVFPQSKVNSSSVPWWDELSECGRSGLSVRPKMGDAL 261

Query: 100 FWYNAHANTLLDYRMYHSGCPVALGNKWG 128
            +++   +  LD    H  CPV  GNKW 
Sbjct: 262 LFWSVKPDASLDPSSLHGSCPVIQGNKWS 290



 Score = 40.8 bits (94), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 32/75 (42%), Gaps = 19/75 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP                      L+V P+ G A+ + +   +  LD  
Sbjct: 216 LSDVEKGGETVFPQSKVNSSSVPWWDELSECGRSGLSVRPKMGDALLFWSVKPDASLDPS 275

Query: 52  MYHSGCPVALGNKWG 66
             H  CPV  GNKW 
Sbjct: 276 SLHGSCPVIQGNKWS 290


>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           ++AH +  +D +    G     G +   LL+  L+DVE GG T+FP+             
Sbjct: 172 YDAHYDYFVDEYNIKKG-----GQRMATLLMY-LSDVEEGGETVFPAAKGNFSSVPWWNE 225

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P+ G A+ +++   +  LD    H  CPV  GNKW
Sbjct: 226 LSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKW 272



 Score = 40.8 bits (94), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 33/74 (44%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+                     L+V P+ G A+ + +   +  LD  
Sbjct: 199 LSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPT 258

Query: 52  MYHSGCPVALGNKW 65
             H  CPV  GNKW
Sbjct: 259 SLHGACPVIRGNKW 272


>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           ++AH +  +D +    G     G +   LL+  L+DVE GG T+FP+             
Sbjct: 172 YDAHYDYFVDEYNIKKG-----GQRMATLLMY-LSDVEEGGETVFPAAKGNFSSVPWWNE 225

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P+ G A+ +++   +  LD    H  CPV  GNKW
Sbjct: 226 LSKCGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKW 272



 Score = 40.8 bits (94), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 33/74 (44%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+                     L+V P+ G A+ + +   +  LD  
Sbjct: 199 LSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVKPKMGDALLFWSMKPDATLDPT 258

Query: 52  MYHSGCPVALGNKW 65
             H  CPV  GNKW
Sbjct: 259 SLHGACPVIRGNKW 272


>gi|193209070|ref|NP_001123049.1| Protein PHY-4, isoform b [Caenorhabditis elegans]
 gi|172051527|emb|CAQ35068.1| Protein PHY-4, isoform b [Caenorhabditis elegans]
          Length = 282

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L+  L   + GG T+FP LNL + P+ G  + W N  +    + +  H+ CP+
Sbjct: 183 GNRIATVLVI-LQIAKKGGTTVFPRLNLNIRPKAGDVIVWLNTLSTGESNSQTLHAACPI 241

Query: 122 ALGNKWGKLL 131
             G K G  L
Sbjct: 242 HEGTKIGATL 251



 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 32/68 (47%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L   + GG T+FP LNL + P+ G  + W N  +    +    H+ CP+  
Sbjct: 184 NRIATVLVILQIAKKGGTTVFPRLNLNIRPKAGDVIVWLNTLSTGESNSQTLHAACPIHE 243

Query: 62  GNKWGKLL 69
           G K G  L
Sbjct: 244 GTKIGATL 251


>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 293

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 40/98 (40%), Gaps = 20/98 (20%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------L 89
            DY  +H      LG      +L  LTDV+ GG T+FP+                    L
Sbjct: 133 FDY--FHDKNNQKLGGHRVATVLMYLTDVKKGGETVFPNAEGRHLQHKDETWSECARSGL 190

Query: 90  TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            V P KG A+ +++ H N   D    H  CPV  G KW
Sbjct: 191 AVKPRKGDALLFFSLHINATTDPSSLHGSCPVIEGEKW 228



 Score = 43.1 bits (100), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 31/73 (42%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
           LTDV+ GG T+FP+                    L V P KG A+ + + H N   D   
Sbjct: 156 LTDVKKGGETVFPNAEGRHLQHKDETWSECARSGLAVKPRKGDALLFFSLHINATTDPSS 215

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 216 LHGSCPVIEGEKW 228


>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 319

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 41/98 (41%), Gaps = 20/98 (20%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------L 89
            DY  +H      +G      +L  L+DVE GG TIFP+                     
Sbjct: 159 FDY--FHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGY 216

Query: 90  TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            V P+KG A+ +++ H +   D +  H  CPV  G KW
Sbjct: 217 AVKPQKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKW 254



 Score = 42.4 bits (98), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 35/85 (41%), Gaps = 18/85 (21%)

Query: 11  LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
           L+DVE GG TIFP+                      V P+KG A+ + + H +   D   
Sbjct: 182 LSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGYAVKPQKGDALLFFSLHLDASTDTKS 241

Query: 53  YHSGCPVALGNKWGKLLLSGLTDVE 77
            H  CPV  G KW       ++D E
Sbjct: 242 LHGSCPVIEGEKWSATKWIHVSDFE 266


>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 311

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/121 (28%), Positives = 47/121 (38%), Gaps = 21/121 (17%)

Query: 28  TVFPEKG--SAVFWHNAHANTLLDYWMY-HSGCPVALGNKWGKLLLSGLTDVELGGATIF 84
           T  PE+   S    H  H      ++ Y H      LG      +L  L++VE GG T+F
Sbjct: 126 TFLPEENGESMQILHYEHGQKYEPHFDYFHDKANQELGGHRVATVLMYLSNVEKGGETVF 185

Query: 85  PSLN------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
           P+                      V PEKG A+ +++ H +   D    H  CPV  G K
Sbjct: 186 PNAEGKLSQPKEDSWSDCAKGGYAVKPEKGDALLFFSLHPDATTDSDSLHGSCPVIEGEK 245

Query: 127 W 127
           W
Sbjct: 246 W 246



 Score = 40.8 bits (94), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 31/73 (42%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
           L++VE GG T+FP+                      V PEKG A+ + + H +   D   
Sbjct: 174 LSNVEKGGETVFPNAEGKLSQPKEDSWSDCAKGGYAVKPEKGDALLFFSLHPDATTDSDS 233

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 234 LHGSCPVIEGEKW 246


>gi|449485593|ref|XP_004175686.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3 [Taeniopygia guttata]
          Length = 567

 Score = 47.0 bits (110), Expect = 0.003,   Method: Composition-based stats.
 Identities = 25/66 (37%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +++  L+ VE GG+T     N +V   K +A+FW+N   N   D    H+GCPV
Sbjct: 480 GNRNATVMIY-LSAVEAGGSTALIYTNFSVPVVKNAALFWWNLRRNGNGDGDTLHAGCPV 538

Query: 122 ALGNKW 127
             G+KW
Sbjct: 539 LAGDKW 544



 Score = 46.2 bits (108), Expect = 0.004,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 33/64 (51%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L+ VE GG+T     N +V   K +A+FW N   N   D    H+GCPV  
Sbjct: 481 NRNATVMIYLSAVEAGGSTALIYTNFSVPVVKNAALFWWNLRRNGNGDGDTLHAGCPVLA 540

Query: 62  GNKW 65
           G+KW
Sbjct: 541 GDKW 544


>gi|341890029|gb|EGT45964.1| hypothetical protein CAEBREN_03684 [Caenorhabditis brenneri]
          Length = 293

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 32/63 (50%), Gaps = 1/63 (1%)

Query: 64  KWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 123
           +W + L+S +    +GG T+FP L      + G A+ W+N   N  L+    H GCPV  
Sbjct: 201 EWQRFLIS-IKAANVGGGTVFPYLGTVARTKPGDAILWFNMKQNRELEPLSLHGGCPVRS 259

Query: 124 GNK 126
           G K
Sbjct: 260 GEK 262



 Score = 44.3 bits (103), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 32/73 (43%), Gaps = 5/73 (6%)

Query: 16  LGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTD 75
           +GG T+FP L      + G A+ W N   N  L+    H GCPV  G K     +     
Sbjct: 214 VGGGTVFPYLGTVARTKPGDAILWFNMKQNRELEPLSLHGGCPVRSGEK-----IVAAIS 268

Query: 76  VELGGATIFPSLN 88
           VE  G  +F + +
Sbjct: 269 VERKGQELFKTFD 281


>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
 gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
          Length = 300

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 39/90 (43%), Gaps = 21/90 (23%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLN---------------------LTVFPEKGS 97
           +A G      +L  LTDVE GG T+FPS                       + V P +G 
Sbjct: 150 IARGGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKPRRGD 209

Query: 98  AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           A+ +++ +   + D    H+GCPV  G KW
Sbjct: 210 ALLFFSLYPTAVPDTSSIHAGCPVIEGEKW 239



 Score = 43.9 bits (102), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 33/76 (43%), Gaps = 21/76 (27%)

Query: 11  LTDVELGGATIFPSLN---------------------LTVFPEKGSAVFWHNAHANTLLD 49
           LTDVE GG T+FPS                       + V P +G A+ + + +   + D
Sbjct: 164 LTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKPRRGDALLFFSLYPTAVPD 223

Query: 50  YWMYHSGCPVALGNKW 65
               H+GCPV  G KW
Sbjct: 224 TSSIHAGCPVIEGEKW 239


>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 347

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           LTDVE GG T+FP  N               L V P KG  + +Y+   N  +D    H 
Sbjct: 267 LTDVEEGGETMFPYENGDNMNIGYDYEQCIGLKVKPRKGDGLLFYSLMVNGTIDPTSLHG 326

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 327 SCPVVRGEKW 336



 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           LTDVE GG T+FP  N               L V P KG  + +++   N  +D    H 
Sbjct: 267 LTDVEEGGETMFPYENGDNMNIGYDYEQCIGLKVKPRKGDGLLFYSLMVNGTIDPTSLHG 326

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 327 SCPVVRGEKW 336


>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
 gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 244

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 46/95 (48%), Gaps = 18/95 (18%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSL----------------NLTVF 92
           DY+ +  G     GN++  +L+  L D E GG T+FP++                +L   
Sbjct: 95  DYFFHKDGIANG-GNRYATVLMY-LVDTEEGGETVFPNIAAPGGENVGFSECARYHLAAK 152

Query: 93  PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           P+KG+A+ +++      L+ +  H+ CPV  G KW
Sbjct: 153 PKKGTAILFHSIKPTGELERKSLHTACPVIKGIKW 187



 Score = 42.7 bits (99), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 16/71 (22%)

Query: 11  LTDVELGGATIFPSL----------------NLTVFPEKGSAVFWHNAHANTLLDYWMYH 54
           L D E GG T+FP++                +L   P+KG+A+ +H+      L+    H
Sbjct: 117 LVDTEEGGETVFPNIAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERKSLH 176

Query: 55  SGCPVALGNKW 65
           + CPV  G KW
Sbjct: 177 TACPVIKGIKW 187


>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 249

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 20/86 (23%)

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 101
           +G +   LL+  L+DVE GG T+FP+                     L+V P+ G A+ +
Sbjct: 153 IGQRMATLLMY-LSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVKPKMGDALLF 211

Query: 102 YNAHANTLLDYRMYHSGCPVALGNKW 127
           ++   +T LD    H  CPV  GNKW
Sbjct: 212 WSMKPDTTLDPTSLHGACPVIRGNKW 237



 Score = 42.7 bits (99), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+                     L+V P+ G A+ + +   +T LD  
Sbjct: 164 LSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVKPKMGDALLFWSMKPDTTLDPT 223

Query: 52  MYHSGCPVALGNKW 65
             H  CPV  GNKW
Sbjct: 224 SLHGACPVIRGNKW 237


>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 296

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G + G LL+  L DVE GG T+FP    +V P +G A+++   +   L D    H+  P+
Sbjct: 215 GQRVGTLLMY-LNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPL 273

Query: 122 ALGNKW 127
            +G KW
Sbjct: 274 RVGEKW 279



 Score = 44.7 bits (104), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 30/55 (54%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T+FP    +V P +G A+++   +   L D    H+  P+ +G KW
Sbjct: 225 LNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLRVGEKW 279


>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 288

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           L+DVE GG T+FP  N               L V P KG  + +Y+   N  +D    H 
Sbjct: 206 LSDVEEGGETMFPFENGSNMGTGYDYKQCIGLKVKPRKGDGLLFYSVFPNGTIDQTSLHG 265

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 266 SCPVTKGEKW 275



 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           L+DVE GG T+FP  N               L V P KG  + +++   N  +D    H 
Sbjct: 206 LSDVEEGGETMFPFENGSNMGTGYDYKQCIGLKVKPRKGDGLLFYSVFPNGTIDQTSLHG 265

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 266 SCPVTKGEKW 275


>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Glycine max]
          Length = 297

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 39/86 (45%), Gaps = 17/86 (19%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNL-----------------TVFPEKGSAVFW 101
           +A G      +L  LTDV  GG T+FP+  L                  V P +G A+ +
Sbjct: 151 IARGGHRVATVLMYLTDVTKGGETVFPNAELKSSETKEDLSECAQKGIAVKPRRGDALLF 210

Query: 102 YNAHANTLLDYRMYHSGCPVALGNKW 127
           ++ + N + D    H+GCPV  G KW
Sbjct: 211 FSLYPNAIPDTMSLHAGCPVIEGEKW 236



 Score = 44.3 bits (103), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 33/72 (45%), Gaps = 17/72 (23%)

Query: 11  LTDVELGGATIFPSLNL-----------------TVFPEKGSAVFWHNAHANTLLDYWMY 53
           LTDV  GG T+FP+  L                  V P +G A+ + + + N + D    
Sbjct: 165 LTDVTKGGETVFPNAELKSSETKEDLSECAQKGIAVKPRRGDALLFFSLYPNAIPDTMSL 224

Query: 54  HSGCPVALGNKW 65
           H+GCPV  G KW
Sbjct: 225 HAGCPVIEGEKW 236


>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
          Length = 288

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           L+DVE GG T+FP  N               L V P KG  + +Y+   N  +D    H 
Sbjct: 206 LSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYSVFPNGTIDQTSLHG 265

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 266 SCPVTKGEKW 275



 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           L+DVE GG T+FP  N               L V P KG  + +++   N  +D    H 
Sbjct: 206 LSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYSVFPNGTIDQTSLHG 265

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 266 SCPVTKGEKW 275


>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
 gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
 gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
 gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 288

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           L+DVE GG T+FP  N               L V P KG  + +Y+   N  +D    H 
Sbjct: 206 LSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYSVFPNGTIDQTSLHG 265

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 266 SCPVTKGEKW 275



 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           L+DVE GG T+FP  N               L V P KG  + +++   N  +D    H 
Sbjct: 206 LSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYSVFPNGTIDQTSLHG 265

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 266 SCPVTKGEKW 275


>gi|303276456|ref|XP_003057522.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461874|gb|EEH59167.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 231

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 31/60 (51%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           LL+ L +   GG T FP L+L   P  G AV W+N   +   D R  H+G PV  G K+ 
Sbjct: 125 LLAYLREPTRGGRTTFPKLHLGFDPRVGDAVLWWNVDDDGKEDPRTTHAGEPVEAGEKFA 184



 Score = 39.3 bits (90), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 26/56 (46%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L +   GG T FP L+L   P  G AV W N   +   D    H+G PV  G K+ 
Sbjct: 129 LREPTRGGRTTFPKLHLGFDPRVGDAVLWWNVDDDGKEDPRTTHAGEPVEAGEKFA 184


>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 295

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           LTDVE GG T+FP  N               L V P KG  + +Y+   N  +D    H 
Sbjct: 215 LTDVEEGGETMFPYENGENMDIGYDYEQCIGLKVKPRKGDGLLFYSLMVNGTIDLTSLHG 274

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 275 SCPVIKGEKW 284



 Score = 44.7 bits (104), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           LTDVE GG T+FP  N               L V P KG  + +++   N  +D    H 
Sbjct: 215 LTDVEEGGETMFPYENGENMDIGYDYEQCIGLKVKPRKGDGLLFYSLMVNGTIDLTSLHG 274

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 275 SCPVIKGEKW 284


>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
 gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
          Length = 266

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 73  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYR 113
           L+DVE GG T+FPS                     L+V P+ G A+ +++   +  LD  
Sbjct: 180 LSDVEEGGETVFPSAKGNISSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDASLDPS 239

Query: 114 MYHSGCPVALGNKW 127
             H GCPV  GNKW
Sbjct: 240 SLHGGCPVIRGNKW 253



 Score = 43.9 bits (102), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FPS                     L+V P+ G A+ + +   +  LD  
Sbjct: 180 LSDVEEGGETVFPSAKGNISSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDASLDPS 239

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 240 SLHGGCPVIRGNKW 253


>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
 gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
          Length = 269

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 34/68 (50%), Gaps = 13/68 (19%)

Query: 73  LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
           L+DVE GG TIFPS+              L V P KG A+ +++A  +  +D    H GC
Sbjct: 190 LSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDAILFWSAALDGNVDSNSLHGGC 249

Query: 120 PVALGNKW 127
            V  G KW
Sbjct: 250 SVLRGEKW 257



 Score = 43.9 bits (102), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 33/68 (48%), Gaps = 13/68 (19%)

Query: 11  LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
           L+DVE GG TIFPS+              L V P KG A+ + +A  +  +D    H GC
Sbjct: 190 LSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDAILFWSAALDGNVDSNSLHGGC 249

Query: 58  PVALGNKW 65
            V  G KW
Sbjct: 250 SVLRGEKW 257


>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
 gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
 gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
          Length = 294

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           LTDVE GG T+FP  N               L V P KG  + +Y+   N  +D    H 
Sbjct: 214 LTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDRTSLHG 273

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 274 SCPVIKGEKW 283



 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           LTDVE GG T+FP  N               L V P KG  + +++   N  +D    H 
Sbjct: 214 LTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDRTSLHG 273

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 274 SCPVIKGEKW 283


>gi|119595340|gb|EAW74934.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III, isoform CRA_a
           [Homo sapiens]
          Length = 657

 Score = 46.6 bits (109), Expect = 0.003,   Method: Composition-based stats.
 Identities = 24/60 (40%), Positives = 35/60 (58%), Gaps = 1/60 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+    ++  L+ VE GGAT F   NL+V   + +A+FW+N H +   D    H+GCPV
Sbjct: 489 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 547



 Score = 45.8 bits (107), Expect = 0.006,   Method: Composition-based stats.
 Identities = 24/58 (41%), Positives = 32/58 (55%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPV 59
           NR     + L+ VE GGAT F   NL+V   + +A+FW N H +   D    H+GCPV
Sbjct: 490 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 547


>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
          Length = 344

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 62/147 (42%), Gaps = 28/147 (19%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLN---LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
           +NR   S +   +  +   T+ P+ N   L V        + H        DY+ + +G 
Sbjct: 121 LNRGHDSVIADIEARIAKWTLMPAGNGEGLQVLK------YEHGQEYEGHYDYFFHKAGT 174

Query: 58  PVALGNKWGKLLLSGLTDVELGGATIFPSLN-----------------LTVFPEKGSAVF 100
               GN++  +L+  L DVE GG T FP++                  L   P+KG+AV 
Sbjct: 175 ANG-GNRYLTVLMY-LNDVEEGGETCFPNIPSPNGDNGPEFSECARKVLAAKPKKGNAVL 232

Query: 101 WYNAHANTLLDYRMYHSGCPVALGNKW 127
           +++      L+ R  H+ CPV  G KW
Sbjct: 233 FHSIKPTGELERRSLHTACPVIKGVKW 259


>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
 gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
          Length = 294

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           LTDVE GG T+FP  N               L V P KG  + +Y+   N  +D    H 
Sbjct: 214 LTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDRTSLHG 273

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 274 SCPVIKGEKW 283



 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           LTDVE GG T+FP  N               L V P KG  + +++   N  +D    H 
Sbjct: 214 LTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDRTSLHG 273

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 274 SCPVIKGEKW 283


>gi|378706224|gb|AFC35025.1| hypothetical protein OtV6_117c [Ostreococcus tauri virus RT-2011]
          Length = 194

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 51/127 (40%), Gaps = 17/127 (13%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           +NR     V LTD  L        L +  + E G    ++  H +T  D           
Sbjct: 74  VNRVTQRCVSLTDRPLINCE---KLQVLRYKEGG----FYKPHQDTFSD----------V 116

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
            GNK    ++  L D   GG T+FP+L       KG A+F++      L+  +  H G P
Sbjct: 117 KGNKRMYTIILALNDDYEGGETVFPNLRRKYKLNKGDALFFHTLDNYELMTSKALHGGAP 176

Query: 121 VALGNKW 127
           V  G KW
Sbjct: 177 VKSGEKW 183


>gi|221482398|gb|EEE20746.1| prolyl 4-hydroxylase alpha subunit, putative [Toxoplasma gondii
           GT1]
 gi|221504447|gb|EEE30120.1| prolyl 4-hydroxylase alpha subunit, putative [Toxoplasma gondii
           VEG]
          Length = 401

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 40/80 (50%), Gaps = 6/80 (7%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
            Y+  HS      G    K +L  L DVE GG T F +L   V P KG+ V W N++  T
Sbjct: 307 QYFKLHSD-----GGFRPKTILLYLNDVEAGGETSFENLGFRVAPMKGAGVLWNNSYPGT 361

Query: 109 -LLDYRMYHSGCPVALGNKW 127
             +D R+ H+G P   G K+
Sbjct: 362 NEIDPRLIHAGLPPEKGVKF 381



 Score = 43.1 bits (100), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 1/56 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANT-LLDYWMYHSGCPVALGNKW 65
           L DVE GG T F +L   V P KG+ V W+N++  T  +D  + H+G P   G K+
Sbjct: 326 LNDVEAGGETSFENLGFRVAPMKGAGVLWNNSYPGTNEIDPRLIHAGLPPEKGVKF 381


>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 281

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/146 (25%), Positives = 65/146 (44%), Gaps = 27/146 (18%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLN---LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
           ++R E   V+  +  +   T+ P  N   L V   +    +  +AH     DY+ +  G 
Sbjct: 46  LDRGEDEIVKRVEERIAAWTLMPVGNGEGLQVLRYQKEQKY--DAH----WDYFFHKDGI 99

Query: 58  PVALGNKWGKLLLSGLTDVELGGATIFPSL----------------NLTVFPEKGSAVFW 101
               GN++  +L+  L D E GG T+FP++                +L   P+KG+A+ +
Sbjct: 100 TNG-GNRYATVLMY-LVDTEEGGETVFPNVAAPGGENVGFSECARYHLAAKPKKGTAILF 157

Query: 102 YNAHANTLLDYRMYHSGCPVALGNKW 127
           ++      L+ +  H+ CPV  G KW
Sbjct: 158 HSIKPTGELERKSLHTACPVIRGIKW 183


>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
           IL144]
 gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Rubrivivax gelatinosus IL144]
          Length = 279

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 33/60 (55%), Gaps = 4/60 (6%)

Query: 79  GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW--GKLLLSGVF 136
           GGAT FP + L V P KG+AVF+  ++   L   R  H G PV  G KW   K +  GVF
Sbjct: 221 GGATTFPEVGLEVQPVKGNAVFF--SYERPLASTRTLHGGAPVLDGEKWVATKWMREGVF 278



 Score = 42.4 bits (98), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 2/49 (4%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GGAT FP + L V P KG+AVF+  ++   L      H G PV  G KW
Sbjct: 221 GGATTFPEVGLEVQPVKGNAVFF--SYERPLASTRTLHGGAPVLDGEKW 267


>gi|325929527|ref|ZP_08190641.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas perforans 91-118]
 gi|325540037|gb|EGD11665.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas perforans 91-118]
          Length = 418

 Score = 46.6 bits (109), Expect = 0.004,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 31/64 (48%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L DV   G T FP   + V P  G+ V + N HA+   D    H+G PV  
Sbjct: 341 NRQRTVCVYLNDVGAAGETEFPVAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 400

Query: 62  GNKW 65
           G+KW
Sbjct: 401 GSKW 404



 Score = 43.1 bits (100), Expect = 0.040,   Method: Composition-based stats.
 Identities = 22/55 (40%), Positives = 28/55 (50%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           L DV   G T FP   + V P  G+ V + N HA+   D    H+G PV  G+KW
Sbjct: 350 LNDVGAAGETEFPVAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 404


>gi|237841319|ref|XP_002369957.1| 2OG-Fe(II) oxygenase family protein, putative [Toxoplasma gondii
           ME49]
 gi|211967621|gb|EEB02817.1| 2OG-Fe(II) oxygenase family protein, putative [Toxoplasma gondii
           ME49]
          Length = 401

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 40/80 (50%), Gaps = 6/80 (7%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
            Y+  HS      G    K +L  L DVE GG T F +L   V P KG+ V W N++  T
Sbjct: 307 QYFKLHSD-----GGFRPKTILLYLNDVEAGGETSFENLGFRVAPMKGAGVLWNNSYPGT 361

Query: 109 -LLDYRMYHSGCPVALGNKW 127
             +D R+ H+G P   G K+
Sbjct: 362 NEIDPRLIHAGLPPEKGVKF 381



 Score = 43.1 bits (100), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 1/56 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANT-LLDYWMYHSGCPVALGNKW 65
           L DVE GG T F +L   V P KG+ V W+N++  T  +D  + H+G P   G K+
Sbjct: 326 LNDVEAGGETSFENLGFRVAPMKGAGVLWNNSYPGTNEIDPRLIHAGLPPEKGVKF 381


>gi|193209068|ref|NP_001123048.1| Protein PHY-4, isoform a [Caenorhabditis elegans]
 gi|172051526|emb|CAQ35067.1| Protein PHY-4, isoform a [Caenorhabditis elegans]
          Length = 278

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   +L+  L   + GG T+FP LNL + P+ G  + W N  +    + +  H+ CP+
Sbjct: 183 GNRIATVLVI-LQIAKKGGTTVFPRLNLNIRPKAGDVIVWLNTLSTGESNSQTLHAACPI 241

Query: 122 ALGNKWGKLL 131
             G K G  L
Sbjct: 242 HEGTKIGATL 251



 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 32/68 (47%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     V L   + GG T+FP LNL + P+ G  + W N  +    +    H+ CP+  
Sbjct: 184 NRIATVLVILQIAKKGGTTVFPRLNLNIRPKAGDVIVWLNTLSTGESNSQTLHAACPIHE 243

Query: 62  GNKWGKLL 69
           G K G  L
Sbjct: 244 GTKIGATL 251


>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
          Length = 280

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 48/120 (40%), Gaps = 25/120 (20%)

Query: 33  KGSAVFWHNAHANTLLDYWM-------YHSGCPVALGNKWGKLLLS---GLTDVELGGAT 82
           K + +  H+     +L Y +       Y +  P   G +  + + S    LTDVE GG T
Sbjct: 148 KATMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQRVASFLLYLTDVEEGGET 207

Query: 83  IFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +FP  N               L V P KG  + +Y+   N  +D    H  CPV  G KW
Sbjct: 208 MFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDPTSLHGSCPVIKGEKW 267



 Score = 44.3 bits (103), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           LTDVE GG T+FP  N               L V P KG  + +++   N  +D    H 
Sbjct: 198 LTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDPTSLHG 257

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 258 SCPVIKGEKW 267


>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
 gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
          Length = 264

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 34/68 (50%), Gaps = 13/68 (19%)

Query: 73  LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
           L+DVE GG TIFPS+              L V P KG A+ +++A  +  +D    H GC
Sbjct: 189 LSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDAILFWSAALDGNVDSNSLHGGC 248

Query: 120 PVALGNKW 127
            V  G KW
Sbjct: 249 SVLRGEKW 256



 Score = 43.9 bits (102), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 33/68 (48%), Gaps = 13/68 (19%)

Query: 11  LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
           L+DVE GG TIFPS+              L V P KG A+ + +A  +  +D    H GC
Sbjct: 189 LSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDAILFWSAALDGNVDSNSLHGGC 248

Query: 58  PVALGNKW 65
            V  G KW
Sbjct: 249 SVLRGEKW 256


>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
          Length = 246

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 36/149 (24%), Positives = 56/149 (37%), Gaps = 31/149 (20%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLN----LTVFPEKGSAVFWHNAHANTLLDYWMYHSG 56
           + R +  ++   +  +   T  P  N      +  EKG     ++AH      Y  +H  
Sbjct: 46  LERRQDETITRIEKRIAAWTFLPEENGEPIQILHYEKGQK---YDAH------YDYFHDK 96

Query: 57  CPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------LTVFPEKGSA 98
               +G      +L  L+DV+ GG T+FP                       V P KG A
Sbjct: 97  NNQRVGGHRMATVLMYLSDVKKGGETVFPDAEGKLLQVKDDTWSDCARSGYAVKPRKGDA 156

Query: 99  VFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           + +++ H N   D    H+ CPV  G KW
Sbjct: 157 LLFFSCHPNATTDPNSLHASCPVIEGEKW 185


>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
 gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
 gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
 gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
 gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
          Length = 310

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 48/120 (40%), Gaps = 25/120 (20%)

Query: 33  KGSAVFWHNAHANTLLDYWM-------YHSGCPVALGNKWGKLLLS---GLTDVELGGAT 82
           K + +  H+     +L Y +       Y +  P   G +  + + S    LTDVE GG T
Sbjct: 178 KATMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQRVASFLLYLTDVEEGGET 237

Query: 83  IFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +FP  N               L V P KG  + +Y+   N  +D    H  CPV  G KW
Sbjct: 238 MFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDPTSLHGSCPVIKGEKW 297



 Score = 44.3 bits (103), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           LTDVE GG T+FP  N               L V P KG  + +++   N  +D    H 
Sbjct: 228 LTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDPTSLHG 287

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 288 SCPVIKGEKW 297


>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
 gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
          Length = 224

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/60 (46%), Positives = 33/60 (55%), Gaps = 4/60 (6%)

Query: 79  GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW--GKLLLSGVF 136
           GGAT FP + L V P KG+AVF+  ++   L   R  H G PV  G KW   K L  GVF
Sbjct: 166 GGATTFPEVGLEVQPIKGNAVFF--SYERPLASTRTLHGGAPVLDGEKWVATKWLREGVF 223



 Score = 42.0 bits (97), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 2/49 (4%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GGAT FP + L V P KG+AVF+  ++   L      H G PV  G KW
Sbjct: 166 GGATTFPEVGLEVQPIKGNAVFF--SYERPLASTRTLHGGAPVLDGEKW 212


>gi|404253277|ref|ZP_10957245.1| putative oxygenase [Sphingomonas sp. PAMC 26621]
          Length = 226

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 42/90 (46%), Gaps = 8/90 (8%)

Query: 40  HNAHANTLLDYW--MYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
           HN + N    YW  M  SG        W  ++   L DVE GGAT FP + + + P++G 
Sbjct: 129 HNDYFNEAQPYWPKMIESGGQ----RTWTAMIY--LNDVEEGGATWFPLVGVRIAPKRGL 182

Query: 98  AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            + W N  A+   +    H G PV  G K+
Sbjct: 183 LIAWNNMRADGSPNPDTLHEGMPVTAGTKY 212



 Score = 43.1 bits (100), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 32/57 (56%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + L DVE GGAT FP + + + P++G  + W+N  A+   +    H G PV  G K+
Sbjct: 156 IYLNDVEEGGATWFPLVGVRIAPKRGLLIAWNNMRADGSPNPDTLHEGMPVTAGTKY 212


>gi|395492951|ref|ZP_10424530.1| putative oxygenase [Sphingomonas sp. PAMC 26617]
          Length = 226

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 42/90 (46%), Gaps = 8/90 (8%)

Query: 40  HNAHANTLLDYW--MYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
           HN + N    YW  M  SG        W  ++   L DVE GGAT FP + + + P++G 
Sbjct: 129 HNDYFNEAQPYWPKMIESGGQ----RTWTAMIY--LNDVEEGGATWFPLVGVRIAPKRGL 182

Query: 98  AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            + W N  A+   +    H G PV  G K+
Sbjct: 183 LIAWNNMRADGSPNPDTLHEGMPVTAGTKY 212



 Score = 43.1 bits (100), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 32/57 (56%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           + L DVE GGAT FP + + + P++G  + W+N  A+   +    H G PV  G K+
Sbjct: 156 IYLNDVEEGGATWFPLVGVRIAPKRGLLIAWNNMRADGSPNPDTLHEGMPVTAGTKY 212


>gi|401397395|ref|XP_003880042.1| hypothetical protein NCLIV_004830 [Neospora caninum Liverpool]
 gi|325114451|emb|CBZ50007.1| hypothetical protein NCLIV_004830 [Neospora caninum Liverpool]
          Length = 308

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/62 (41%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 67  KLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT-LLDYRMYHSGCPVALGN 125
           K +L  L DVE GG T F +L   V P KG+ V W N++  T  +D R+ H+G P   G 
Sbjct: 227 KTVLLYLNDVEAGGETSFETLGFRVAPMKGAGVVWSNSYPGTHEIDPRLLHAGLPPERGV 286

Query: 126 KW 127
           K+
Sbjct: 287 KF 288



 Score = 42.7 bits (99), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 1/56 (1%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANT-LLDYWMYHSGCPVALGNKW 65
           L DVE GG T F +L   V P KG+ V W N++  T  +D  + H+G P   G K+
Sbjct: 233 LNDVEAGGETSFETLGFRVAPMKGAGVVWSNSYPGTHEIDPRLLHAGLPPERGVKF 288


>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
          Length = 303

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 37/68 (54%), Gaps = 7/68 (10%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLDYRMYHSGC 119
           G + G L++  L   E GG T FP +++ V P++G+AVF  +   H +T    R  H G 
Sbjct: 229 GQRVGTLVMY-LNTPEKGGGTTFPDVHVEVAPQRGNAVFFSYERPHPST----RTLHGGA 283

Query: 120 PVALGNKW 127
           PV  G KW
Sbjct: 284 PVLAGEKW 291



 Score = 42.4 bits (98), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 6/57 (10%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
           L   E GG T FP +++ V P++G+AVF  +   H +T       H G PV  G KW
Sbjct: 239 LNTPEKGGGTTFPDVHVEVAPQRGNAVFFSYERPHPSTR----TLHGGAPVLAGEKW 291


>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
 gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
          Length = 215

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 33/81 (40%), Gaps = 26/81 (32%)

Query: 11  LTDVELGGATIFPSLN--------------------------LTVFPEKGSAVFWHNAHA 44
           L+DVE GG T+FP                             L V P +G A+ +HN H 
Sbjct: 126 LSDVESGGETVFPKHGAWIEPDEMDVRGRSSSKDSSKCAKGALHVKPRRGDALLFHNCHL 185

Query: 45  NTLLDYWMYHSGCPVALGNKW 65
           N   D    H+GCPV  G KW
Sbjct: 186 NGREDPTSLHAGCPVLRGEKW 206



 Score = 44.3 bits (103), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 33/81 (40%), Gaps = 26/81 (32%)

Query: 73  LTDVELGGATIFPSLN--------------------------LTVFPEKGSAVFWYNAHA 106
           L+DVE GG T+FP                             L V P +G A+ ++N H 
Sbjct: 126 LSDVESGGETVFPKHGAWIEPDEMDVRGRSSSKDSSKCAKGALHVKPRRGDALLFHNCHL 185

Query: 107 NTLLDYRMYHSGCPVALGNKW 127
           N   D    H+GCPV  G KW
Sbjct: 186 NGREDPTSLHAGCPVLRGEKW 206


>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
          Length = 282

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 33/69 (47%), Gaps = 14/69 (20%)

Query: 73  LTDVELGGATIFP---SLNLTVF-----------PEKGSAVFWYNAHANTLLDYRMYHSG 118
           L+DVE GG T+FP    LN+  F           P +G  + +Y+   N  +D+   H  
Sbjct: 201 LSDVEEGGETMFPFEHDLNINTFNSRKCIGLKVKPRRGDGLLFYSVFPNGTIDWTSMHGS 260

Query: 119 CPVALGNKW 127
           CPV  G KW
Sbjct: 261 CPVIEGEKW 269



 Score = 43.9 bits (102), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 33/69 (47%), Gaps = 14/69 (20%)

Query: 11  LTDVELGGATIFP---SLNLTVF-----------PEKGSAVFWHNAHANTLLDYWMYHSG 56
           L+DVE GG T+FP    LN+  F           P +G  + +++   N  +D+   H  
Sbjct: 201 LSDVEEGGETMFPFEHDLNINTFNSRKCIGLKVKPRRGDGLLFYSVFPNGTIDWTSMHGS 260

Query: 57  CPVALGNKW 65
           CPV  G KW
Sbjct: 261 CPVIEGEKW 269


>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
 gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
          Length = 305

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 31/57 (54%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           V L+DV  GG T+FP+  L V   +G A+++   + +  LD    H G PV  G KW
Sbjct: 237 VYLSDVAGGGETVFPNAGLAVMARQGGAIYFRYLNGHRQLDPLTLHGGAPVTNGEKW 293



 Score = 45.1 bits (105), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 35/69 (50%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
            + G +    L+  L+DV  GG T+FP+  L V   +G A+++   + +  LD    H G
Sbjct: 225 TSRGGQRVATLIVYLSDVAGGGETVFPNAGLAVMARQGGAIYFRYLNGHRQLDPLTLHGG 284

Query: 119 CPVALGNKW 127
            PV  G KW
Sbjct: 285 APVTNGEKW 293


>gi|145345880|ref|XP_001417427.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577654|gb|ABO95720.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 378

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 42/85 (49%), Gaps = 6/85 (7%)

Query: 49  DYWMYHS-GCPVALGNKWG----KLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYN 103
           +Y+  H  G P+A+    G      +L  L DV+ GG T F  L + V P+KG A+ ++ 
Sbjct: 232 EYFKAHEDGFPIAVAADKGYQRRATILVYLNDVDEGGETRFEHLGIEVAPKKGKALVFFP 291

Query: 104 AHANTLLDYRMYHSGCPVALG-NKW 127
           + A  + D R  H+  P   G  KW
Sbjct: 292 SSAACMPDARTLHTATPAKEGHEKW 316



 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 1/58 (1%)

Query: 9   VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG-NKW 65
           V L DV+ GG T F  L + V P+KG A+ +  + A  + D    H+  P   G  KW
Sbjct: 259 VYLNDVDEGGETRFEHLGIEVAPKKGKALVFFPSSAACMPDARTLHTATPAKEGHEKW 316


>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 280

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 49/108 (45%), Gaps = 25/108 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   +L+  L+DVE GG T+FP+             
Sbjct: 167 YEPHYDYFLDEFNTKNG-----GQRVATVLMY-LSDVEEGGETVFPAAKANFSSVPWWND 220

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
                   L++ P+ G A+ +++   +  LD    H GCPV +GNKW 
Sbjct: 221 LSECARKGLSLKPKMGDALLFWSMRPDATLDASSLHGGCPVIVGNKWS 268



 Score = 43.5 bits (101), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 19/75 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+                     L++ P+ G A+ + +   +  LD  
Sbjct: 194 LSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMRPDATLDAS 253

Query: 52  MYHSGCPVALGNKWG 66
             H GCPV +GNKW 
Sbjct: 254 SLHGGCPVIVGNKWS 268


>gi|167524906|ref|XP_001746788.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774568|gb|EDQ88195.1| predicted protein [Monosiga brevicollis MX1]
          Length = 321

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 28/56 (50%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           L DV  GG T F  L L V P+KG AV WY+   +   + R  H   PV  G K+ 
Sbjct: 246 LNDVPAGGETSFTRLGLAVKPKKGKAVLWYSVTNDLEPEERTDHEARPVKQGTKYA 301



 Score = 41.2 bits (95), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 27/56 (48%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           L DV  GG T F  L L V P+KG AV W++   +   +    H   PV  G K+ 
Sbjct: 246 LNDVPAGGETSFTRLGLAVKPKKGKAVLWYSVTNDLEPEERTDHEARPVKQGTKYA 301


>gi|341878860|gb|EGT34795.1| hypothetical protein CAEBREN_10065 [Caenorhabditis brenneri]
          Length = 163

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 35/69 (50%), Gaps = 1/69 (1%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
           +  GN+   +L   ++    GG T+F  +  TV P K  A+FWYN +     +    H+ 
Sbjct: 41  LGTGNRIATVLFY-MSQPSHGGGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAA 99

Query: 119 CPVALGNKW 127
           CPV +G KW
Sbjct: 100 CPVLVGIKW 108



 Score = 42.7 bits (99), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 27/49 (55%)

Query: 17  GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           GG T+F  +  TV P K  A+FW+N +     +    H+ CPV +G KW
Sbjct: 60  GGGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 108


>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 298

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 43/111 (38%), Gaps = 23/111 (20%)

Query: 38  FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN--------- 88
           + H    +   DY  +H    +  G      +L  L++V  GG T+FP            
Sbjct: 129 YEHGQKYDAHFDY--FHDKVNIVRGGHRIATVLMYLSNVTKGGETVFPDAEVPSCRVLSE 186

Query: 89  ------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                       + V P KG A+ ++N H + + D    H GCPV  G KW
Sbjct: 187 NKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKW 237



 Score = 41.2 bits (95), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 31/76 (40%), Gaps = 21/76 (27%)

Query: 11  LTDVELGGATIFPSLN---------------------LTVFPEKGSAVFWHNAHANTLLD 49
           L++V  GG T+FP                        + V P KG A+ + N H + + D
Sbjct: 162 LSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPD 221

Query: 50  YWMYHSGCPVALGNKW 65
               H GCPV  G KW
Sbjct: 222 PLSLHGGCPVIEGEKW 237


>gi|448930555|gb|AGE54119.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus IL-5-2s1]
 gi|448934686|gb|AGE58238.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus NY-2B]
          Length = 230

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 59/128 (46%), Gaps = 10/128 (7%)

Query: 1   MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           +++ +  + EL D +      +   ++ V   K    ++H+   +   D       CP  
Sbjct: 103 IDKIQNKTRELLDSKRHCIDKYKFEDVQVARYKPGEYYYHHYDGDDCDD------ACP-- 154

Query: 61  LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR-MYHSGC 119
             ++    L+  L +   GG TIFP L   V P+KG AVF++ A+ +T   Y+   H+G 
Sbjct: 155 -KDQRLATLMVYLKEPNEGGKTIFPILKTQVVPKKGKAVFFWVANPSTRKLYKETLHAGL 213

Query: 120 PVALGNKW 127
           PV  G K+
Sbjct: 214 PVKSGEKF 221


>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 541

 Score = 46.2 bits (108), Expect = 0.005,   Method: Composition-based stats.
 Identities = 32/97 (32%), Positives = 46/97 (47%), Gaps = 21/97 (21%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFP------SLNLTVFPE------- 94
            DY+ + SG      N+   +LL  L+DVE GG T+FP      S N +++ E       
Sbjct: 360 FDYFFHKSG---MRNNRIATVLLY-LSDVEEGGETVFPNTDVPTSRNRSMYSECGNGGKA 415

Query: 95  ----KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
               KG A+ +++      LD    H+GCPV  G KW
Sbjct: 416 LKARKGDALLFWSMKPGGELDAGSSHAGCPVIKGEKW 452



 Score = 42.0 bits (97), Expect = 0.088,   Method: Composition-based stats.
 Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 17/81 (20%)

Query: 2   NRAEMSSVELTDVELGGATIFP------SLNLTVFPE-----------KGSAVFWHNAHA 44
           NR     + L+DVE GG T+FP      S N +++ E           KG A+ + +   
Sbjct: 372 NRIATVLLYLSDVEEGGETVFPNTDVPTSRNRSMYSECGNGGKALKARKGDALLFWSMKP 431

Query: 45  NTLLDYWMYHSGCPVALGNKW 65
              LD    H+GCPV  G KW
Sbjct: 432 GGELDAGSSHAGCPVIKGEKW 452


>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 292

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 29/55 (52%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DV  GG TIFP   ++V   +G AV++   +    LD    H G PV  G+KW
Sbjct: 226 LNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNGRRQLDPLTLHGGAPVLSGDKW 280



 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 34/69 (49%)

Query: 59  VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
            A G +    L+  L DV  GG TIFP   ++V   +G AV++   +    LD    H G
Sbjct: 212 TAQGGQRVATLVIYLNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNGRRQLDPLTLHGG 271

Query: 119 CPVALGNKW 127
            PV  G+KW
Sbjct: 272 APVLSGDKW 280


>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 297

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 31/68 (45%), Gaps = 13/68 (19%)

Query: 73  LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
           LTDV  GG T+FP  N             L V P KG A+ +Y+   N  +D    H  C
Sbjct: 217 LTDVPEGGETMFPYENGFNRDGNVEDCIGLRVRPRKGDALLFYSLLPNGTIDQTSAHGSC 276

Query: 120 PVALGNKW 127
           PV  G KW
Sbjct: 277 PVIKGEKW 284



 Score = 43.9 bits (102), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 31/68 (45%), Gaps = 13/68 (19%)

Query: 11  LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
           LTDV  GG T+FP  N             L V P KG A+ +++   N  +D    H  C
Sbjct: 217 LTDVPEGGETMFPYENGFNRDGNVEDCIGLRVRPRKGDALLFYSLLPNGTIDQTSAHGSC 276

Query: 58  PVALGNKW 65
           PV  G KW
Sbjct: 277 PVIKGEKW 284


>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
          Length = 288

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   +L+  L+DVE GG T+FP+             
Sbjct: 175 YEPHYDYFLDDYNTKNG-----GQRMATVLMY-LSDVEEGGETVFPAAKGNFSSVPWWNE 228

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P+ G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 229 LSXCGKEGLSVKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 275



 Score = 42.7 bits (99), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+                     L+V P+ G A+ + +   +  LD  
Sbjct: 202 LSDVEEGGETVFPAAKGNFSSVPWWNELSXCGKEGLSVKPKMGDALLFWSMKPDASLDPS 261

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 262 SLHGGCPVIKGNKW 275


>gi|429859077|gb|ELA33870.1| 2og-fe oxygenase family protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 294

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 27/42 (64%)

Query: 87  LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
           + L V P+KG+A+FW N H N   DYR+ H+G PV  G K G
Sbjct: 235 MGLLVKPKKGNAIFWNNMHMNGSGDYRVVHAGLPVHSGQKIG 276



 Score = 44.3 bits (103), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 27/42 (64%)

Query: 25  LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
           + L V P+KG+A+FW+N H N   DY + H+G PV  G K G
Sbjct: 235 MGLLVKPKKGNAIFWNNMHMNGSGDYRVVHAGLPVHSGQKIG 276


>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 25/107 (23%)

Query: 40  HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
           +  H +  LD +   +G     G +   +L+  L+DVE GG T+FP+             
Sbjct: 175 YEPHYDYFLDDYNTKNG-----GQRMATVLMY-LSDVEEGGETVFPAAKGNFSSVPWWNE 228

Query: 89  --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                   L+V P+ G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 229 LSDCGKEGLSVKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 275



 Score = 42.7 bits (99), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+                     L+V P+ G A+ + +   +  LD  
Sbjct: 202 LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKEGLSVKPKMGDALLFWSMKPDASLDPS 261

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 262 SLHGGCPVIKGNKW 275


>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 292

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 29/55 (52%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DV  GG TIFP   ++V   +G AV++   +    LD    H G PV  G+KW
Sbjct: 226 LNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNDRRQLDPLTLHGGAPVLAGDKW 280



 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 35/73 (47%)

Query: 55  SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
           S    A G +    L+  L DV  GG TIFP   ++V   +G AV++   +    LD   
Sbjct: 208 SAVHTAQGGQRVATLVIYLNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNDRRQLDPLT 267

Query: 115 YHSGCPVALGNKW 127
            H G PV  G+KW
Sbjct: 268 LHGGAPVLAGDKW 280


>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 318

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 40/98 (40%), Gaps = 20/98 (20%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLT----------------- 90
            DY  +H      +G      +L  L+DVE GG TIFP+                     
Sbjct: 158 FDY--FHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAKAKLLQPKDESWSECAHKGY 215

Query: 91  -VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            V P KG A+ +++ H +   D +  H  CPV  G KW
Sbjct: 216 AVKPRKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKW 253



 Score = 41.2 bits (95), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 30/73 (41%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSLNLT------------------VFPEKGSAVFWHNAHANTLLDYWM 52
           L+DVE GG TIFP+                      V P KG A+ + + H +   D   
Sbjct: 181 LSDVEKGGETIFPNAKAKLLQPKDESWSECAHKGYAVKPRKGDALLFFSLHLDASTDNKS 240

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 241 LHGSCPVIEGEKW 253


>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
 gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
          Length = 295

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 29/55 (52%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DV  GG TIFP   ++V   +G AV++   +    LD    H G PV  G+KW
Sbjct: 229 LNDVPDGGETIFPEAGISVAARQGGAVYFRYMNGQRQLDPLTLHGGAPVLGGDKW 283



 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 35/73 (47%)

Query: 55  SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
           S    A G +    L+  L DV  GG TIFP   ++V   +G AV++   +    LD   
Sbjct: 211 SAVHTAQGGQRVATLVIYLNDVPDGGETIFPEAGISVAARQGGAVYFRYMNGQRQLDPLT 270

Query: 115 YHSGCPVALGNKW 127
            H G PV  G+KW
Sbjct: 271 LHGGAPVLGGDKW 283


>gi|194871369|ref|XP_001972835.1| GG15736 [Drosophila erecta]
 gi|190654618|gb|EDV51861.1| GG15736 [Drosophila erecta]
          Length = 476

 Score = 45.8 bits (107), Expect = 0.006,   Method: Composition-based stats.
 Identities = 33/108 (30%), Positives = 50/108 (46%), Gaps = 20/108 (18%)

Query: 25  LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG-----NKWGKLLLSGLTDVELG 79
           L++T FP KG+ +         +++Y +  +  P         NK   L+   L + E G
Sbjct: 377 LDITGFPSKGNQML-------EVINYGIAGNYNPDDTARPRKQNKANALIF--LDNAERG 427

Query: 80  GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           G  +FPS +L V P KGS + W N   + +     YH  CP+  GN W
Sbjct: 428 GEIVFPSRHLKVRPRKGSMLVWMNLERSVI-----YHQ-CPILKGNMW 469



 Score = 44.7 bits (104), Expect = 0.011,   Method: Composition-based stats.
 Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 6/55 (10%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L + E GG  +FPS +L V P KGS + W N   + +     YH  CP+  GN W
Sbjct: 421 LDNAERGGEIVFPSRHLKVRPRKGSMLVWMNLERSVI-----YHQ-CPILKGNMW 469


>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
 gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
          Length = 286

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 39/77 (50%), Gaps = 10/77 (12%)

Query: 56  GCPVAL---GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLL 110
           G P  L   G + G L++  L + E GG T FP ++L V P++G  VF  +   H +T  
Sbjct: 203 GTPTILKRGGQRVGTLVMY-LGEPEKGGGTTFPDVHLEVAPKRGHGVFFSYERPHPST-- 259

Query: 111 DYRMYHSGCPVALGNKW 127
             R  H G PV  G KW
Sbjct: 260 --RTLHGGAPVLAGEKW 274



 Score = 41.2 bits (95), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 6/57 (10%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
           L + E GG T FP ++L V P++G  VF  +   H +T       H G PV  G KW
Sbjct: 222 LGEPEKGGGTTFPDVHLEVAPKRGHGVFFSYERPHPSTR----TLHGGAPVLAGEKW 274


>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
          Length = 284

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 32/70 (45%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFP---------------SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           L+DVE GG T+FP                + L V P++G  + +Y+   N  +D    H 
Sbjct: 202 LSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQRGDGLLFYSVFPNGTIDRTSLHG 261

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 262 SCPVIAGEKW 271



 Score = 43.9 bits (102), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 32/70 (45%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFP---------------SLNLTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           L+DVE GG T+FP                + L V P++G  + +++   N  +D    H 
Sbjct: 202 LSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQRGDGLLFYSVFPNGTIDRTSLHG 261

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 262 SCPVIAGEKW 271


>gi|156403977|ref|XP_001640184.1| predicted protein [Nematostella vectensis]
 gi|156227317|gb|EDO48121.1| predicted protein [Nematostella vectensis]
          Length = 414

 Score = 45.8 bits (107), Expect = 0.006,   Method: Composition-based stats.
 Identities = 29/88 (32%), Positives = 34/88 (38%), Gaps = 33/88 (37%)

Query: 73  LTDVELGGATIFP---------------------------SLNLTVFPEKGSAVFWYNAH 105
           L DVE GG T FP                             NL + P+K SA+ WYN +
Sbjct: 299 LNDVEEGGETAFPVADNETFSIEAWAGITKYRCDLSRHCHKANLYIKPKKRSAIMWYNHY 358

Query: 106 ANTL------LDYRMYHSGCPVALGNKW 127
            + L      LD   YH GC V  G KW
Sbjct: 359 TDELTGWFGGLDPMTYHGGCDVVRGRKW 386



 Score = 43.5 bits (101), Expect = 0.025,   Method: Composition-based stats.
 Identities = 28/88 (31%), Positives = 34/88 (38%), Gaps = 33/88 (37%)

Query: 11  LTDVELGGATIFP---------------------------SLNLTVFPEKGSAVFWHNAH 43
           L DVE GG T FP                             NL + P+K SA+ W+N +
Sbjct: 299 LNDVEEGGETAFPVADNETFSIEAWAGITKYRCDLSRHCHKANLYIKPKKRSAIMWYNHY 358

Query: 44  ANTL------LDYWMYHSGCPVALGNKW 65
            + L      LD   YH GC V  G KW
Sbjct: 359 TDELTGWFGGLDPMTYHGGCDVVRGRKW 386


>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 296

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 35/66 (53%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           G + G LL+  L DVE GG T+FP    +V P +G A+++   +   L D    H+  P+
Sbjct: 215 GQRVGTLLMY-LNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPL 273

Query: 122 ALGNKW 127
             G KW
Sbjct: 274 RAGEKW 279



 Score = 43.5 bits (101), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 29/55 (52%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
           L DVE GG T+FP    +V P +G A+++   +   L D    H+  P+  G KW
Sbjct: 225 LNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLRAGEKW 279


>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
          Length = 276

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 32/70 (45%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFP---------------SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           L+DVE GG T+FP                + L V P++G  + +Y+   N  +D    H 
Sbjct: 194 LSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQRGDGLLFYSVFPNGTIDRTSLHG 253

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 254 SCPVIAGEKW 263



 Score = 43.5 bits (101), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 32/70 (45%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFP---------------SLNLTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           L+DVE GG T+FP                + L V P++G  + +++   N  +D    H 
Sbjct: 194 LSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQRGDGLLFYSVFPNGTIDRTSLHG 253

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 254 SCPVIAGEKW 263


>gi|397613716|gb|EJK62387.1| hypothetical protein THAOC_17003 [Thalassiosira oceanica]
          Length = 578

 Score = 45.8 bits (107), Expect = 0.006,   Method: Composition-based stats.
 Identities = 22/52 (42%), Positives = 30/52 (57%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
           +L+ L DV  GGAT FP LN+ V P KG A+ ++ A  +  +D    H  CP
Sbjct: 447 VLTYLNDVSKGGATAFPRLNVEVRPRKGMALVFFPATVDGHVDGNALHEACP 498



 Score = 41.6 bits (96), Expect = 0.12,   Method: Composition-based stats.
 Identities = 21/48 (43%), Positives = 26/48 (54%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP 58
           L DV  GGAT FP LN+ V P KG A+ +  A  +  +D    H  CP
Sbjct: 451 LNDVSKGGATAFPRLNVEVRPRKGMALVFFPATVDGHVDGNALHEACP 498


>gi|224006045|ref|XP_002291983.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220972502|gb|EED90834.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 168

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 35/61 (57%)

Query: 60  ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
           A G +    +L+ L DV  GGAT FP+LN+ V P +G AV ++ +  + LLD    H+  
Sbjct: 106 ANGGQRTITVLTYLNDVPRGGATSFPNLNIQVQPRRGMAVVFFPSTLDGLLDKMALHAAM 165

Query: 120 P 120
           P
Sbjct: 166 P 166



 Score = 43.1 bits (100), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 28/48 (58%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP 58
           L DV  GGAT FP+LN+ V P +G AV +  +  + LLD    H+  P
Sbjct: 119 LNDVPRGGATSFPNLNIQVQPRRGMAVVFFPSTLDGLLDKMALHAAMP 166


>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
           expressed [Oryza sativa Japonica Group]
 gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
          Length = 299

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 33/73 (45%), Gaps = 18/73 (24%)

Query: 73  LTDVELGGATIFPSL------------------NLTVFPEKGSAVFWYNAHANTLLDYRM 114
           LTDV+ GG T+FP+                    L V P+KG A+ +++ H N   D   
Sbjct: 162 LTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNATTDPAS 221

Query: 115 YHSGCPVALGNKW 127
            H  CPV  G KW
Sbjct: 222 LHGSCPVIEGEKW 234



 Score = 43.5 bits (101), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 32/73 (43%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSL------------------NLTVFPEKGSAVFWHNAHANTLLDYWM 52
           LTDV+ GG T+FP+                    L V P+KG A+ + + H N   D   
Sbjct: 162 LTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNATTDPAS 221

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 222 LHGSCPVIEGEKW 234


>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
          Length = 487

 Score = 45.8 bits (107), Expect = 0.006,   Method: Composition-based stats.
 Identities = 31/98 (31%), Positives = 41/98 (41%), Gaps = 20/98 (20%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------L 89
            DY  +H     ALG      +L  L+ VE GG T+FP+                    L
Sbjct: 159 FDY--FHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGL 216

Query: 90  TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
            V P KG AV +++ H + + D    H  CPV  G KW
Sbjct: 217 AVKPVKGDAVLFFSLHIDGVPDPLSLHGSCPVIEGEKW 254



 Score = 40.4 bits (93), Expect = 0.26,   Method: Composition-based stats.
 Identities = 24/73 (32%), Positives = 31/73 (42%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
           L+ VE GG T+FP+                    L V P KG AV + + H + + D   
Sbjct: 182 LSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDAVLFFSLHIDGVPDPLS 241

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 242 LHGSCPVIEGEKW 254


>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 306

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 43/99 (43%), Gaps = 21/99 (21%)

Query: 48  LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------- 88
            DY  +H       G +    LL  L+DVE GG T+FPS                     
Sbjct: 197 FDY--FHDDFNTKNGGQRIATLLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRG 254

Query: 89  LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           ++V P+ G A+ +++   +  LD    H GCPV  G+KW
Sbjct: 255 ISVKPKMGDALLFWSMRPDGTLDPTSLHGGCPVIKGDKW 293



 Score = 42.4 bits (98), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 34/74 (45%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FPS                     ++V P+ G A+ + +   +  LD  
Sbjct: 220 LSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALLFWSMRPDGTLDPT 279

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  G+KW
Sbjct: 280 SLHGGCPVIKGDKW 293


>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 156

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 36/74 (48%), Gaps = 19/74 (25%)

Query: 73  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYR 113
           L+DVE GG T+FP+                     L++ P+ G A+ +++   +  LD  
Sbjct: 70  LSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMRPDATLDAS 129

Query: 114 MYHSGCPVALGNKW 127
             H GCPV +GNKW
Sbjct: 130 SLHGGCPVIVGNKW 143



 Score = 43.1 bits (100), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+                     L++ P+ G A+ + +   +  LD  
Sbjct: 70  LSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMRPDATLDAS 129

Query: 52  MYHSGCPVALGNKW 65
             H GCPV +GNKW
Sbjct: 130 SLHGGCPVIVGNKW 143


>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
 gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
          Length = 286

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 11/78 (14%)

Query: 52  MYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-- 109
           M  SG  VA        L+  L +V  GG T+FP + +++ P++GSA ++  A+ N+L  
Sbjct: 205 MTQSGQRVAT-------LVMYLNEVTEGGETVFPDVGISITPKRGSAAYF--AYCNSLGQ 255

Query: 110 LDYRMYHSGCPVALGNKW 127
           +D    H G PV  G KW
Sbjct: 256 VDPATLHGGAPVLTGEKW 273



 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 35/57 (61%), Gaps = 4/57 (7%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL--LDYWMYHSGCPVALGNKW 65
           L +V  GG T+FP + +++ P++GSA ++  A+ N+L  +D    H G PV  G KW
Sbjct: 219 LNEVTEGGETVFPDVGISITPKRGSAAYF--AYCNSLGQVDPATLHGGAPVLTGEKW 273


>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
 gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
           ferrooxidans ATCC 23270]
          Length = 248

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 32/66 (48%), Gaps = 1/66 (1%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
           GN+   L+L  L  VE GG T FP L L V P  G  VF+ N +          H+G PV
Sbjct: 172 GNRQATLILY-LNAVEEGGETAFPELGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPV 230

Query: 122 ALGNKW 127
             G KW
Sbjct: 231 RKGEKW 236



 Score = 44.3 bits (103), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 29/64 (45%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
           NR     + L  VE GG T FP L L V P  G  VF+ N +          H+G PV  
Sbjct: 173 NRQATLILYLNAVEEGGETAFPELGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRK 232

Query: 62  GNKW 65
           G KW
Sbjct: 233 GEKW 236


>gi|405955421|gb|EKC22545.1| Prolyl 4-hydroxylase subunit alpha-2 [Crassostrea gigas]
          Length = 173

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 38/127 (29%), Positives = 55/127 (43%), Gaps = 20/127 (15%)

Query: 3   RAEMSS--VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
           R  M++  V L DVE GG T FP L + V P KG A+ W+N + +   +    H+   V 
Sbjct: 44  RDRMATILVYLQDVEDGGETKFPELGIWVKPRKGRALVWNNMNEDGKCEALSVHNAAKVN 103

Query: 61  LGNKW--------------GKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 106
            G+K+              GK        V   G    P +N   + EKGS  ++   + 
Sbjct: 104 KGHKYILQRWYYYKNFYSLGKRPPEPELPVRASGT---PRVNCDEY-EKGSCRWYDEWNY 159

Query: 107 NTLLDYR 113
             L+DYR
Sbjct: 160 EHLVDYR 166



 Score = 41.6 bits (96), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 31/59 (52%)

Query: 69  LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           +L  L DVE GG T FP L + V P KG A+ W N + +   +    H+   V  G+K+
Sbjct: 50  ILVYLQDVEDGGETKFPELGIWVKPRKGRALVWNNMNEDGKCEALSVHNAAKVNKGHKY 108


>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 48/117 (41%), Gaps = 20/117 (17%)

Query: 31  PEKGSAV-FWHNAHANTLLDYWMY-HSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN 88
           PE G A+   H  +      ++ Y H     ALG      +L  L++VE GG TIFP+  
Sbjct: 116 PENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMYLSNVEKGGETIFPNAE 175

Query: 89  ------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
                               V P KG A+ +++ H ++  D    H  CPV  G KW
Sbjct: 176 GKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTDSDSLHGSCPVIEGQKW 232



 Score = 38.9 bits (89), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 31/73 (42%), Gaps = 18/73 (24%)

Query: 11  LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
           L++VE GG TIFP+                      V P KG A+ + + H ++  D   
Sbjct: 160 LSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTDSDS 219

Query: 53  YHSGCPVALGNKW 65
            H  CPV  G KW
Sbjct: 220 LHGSCPVIEGQKW 232


>gi|224013910|ref|XP_002296619.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968971|gb|EED87315.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 541

 Score = 45.4 bits (106), Expect = 0.007,   Method: Composition-based stats.
 Identities = 32/82 (39%), Positives = 41/82 (50%), Gaps = 5/82 (6%)

Query: 49  DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
           DY  Y    P   G +   L L  L DVE GG T FP L++T+ P+KG+AV W +   + 
Sbjct: 444 DYIEYQEDMPC--GVRMLTLFLY-LNDVEEGGGTHFPLLDITIQPKKGNAVIWPSVLDDK 500

Query: 109 --LLDYRMYHSGCPVALGNKWG 128
               D R  H   PV  G K+G
Sbjct: 501 PETKDPRTDHEALPVINGIKYG 522



 Score = 42.0 bits (97), Expect = 0.082,   Method: Composition-based stats.
 Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 2/58 (3%)

Query: 11  LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANT--LLDYWMYHSGCPVALGNKWG 66
           L DVE GG T FP L++T+ P+KG+AV W +   +     D    H   PV  G K+G
Sbjct: 465 LNDVEEGGGTHFPLLDITIQPKKGNAVIWPSVLDDKPETKDPRTDHEALPVINGIKYG 522


>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
 gi|194697650|gb|ACF82909.1| unknown [Zea mays]
 gi|194708468|gb|ACF88318.1| unknown [Zea mays]
 gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
 gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
 gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
          Length = 308

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 18/83 (21%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFP-----------------SLNLTVFPEKGSAVFWYNA 104
           G+++  +LL  LTDV  GG T+FP                    + V P KG A+ ++N 
Sbjct: 166 GHRYATVLLY-LTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNL 224

Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
           + +   D    H GCPV  G KW
Sbjct: 225 NPDGTTDSVSLHGGCPVIKGEKW 247



 Score = 41.6 bits (96), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 30/72 (41%), Gaps = 17/72 (23%)

Query: 11  LTDVELGGATIFP-----------------SLNLTVFPEKGSAVFWHNAHANTLLDYWMY 53
           LTDV  GG T+FP                    + V P KG A+ + N + +   D    
Sbjct: 176 LTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNLNPDGTTDSVSL 235

Query: 54  HSGCPVALGNKW 65
           H GCPV  G KW
Sbjct: 236 HGGCPVIKGEKW 247


>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 294

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           LTDVE GG T+FP  N               L V P +G  + +Y+   N  +D    H 
Sbjct: 209 LTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQGDGLLFYSVFPNGTIDPTSLHG 268

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 269 SCPVIKGQKW 278



 Score = 43.5 bits (101), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           LTDVE GG T+FP  N               L V P +G  + +++   N  +D    H 
Sbjct: 209 LTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQGDGLLFYSVFPNGTIDPTSLHG 268

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 269 SCPVIKGQKW 278


>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1-like [Cucumis sativus]
          Length = 294

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           LTDVE GG T+FP  N               L V P +G  + +Y+   N  +D    H 
Sbjct: 209 LTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQGDGLLFYSVFPNGTIDPTSLHG 268

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 269 SCPVIKGQKW 278



 Score = 43.5 bits (101), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           LTDVE GG T+FP  N               L V P +G  + +++   N  +D    H 
Sbjct: 209 LTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQGDGLLFYSVFPNGTIDPTSLHG 268

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 269 SCPVIKGQKW 278


>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
 gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
          Length = 294

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 40/93 (43%), Gaps = 18/93 (19%)

Query: 53  YHSGCPVALGNKWGKLLLS---GLTDVELGGATIFPSLN---------------LTVFPE 94
           Y +  PV  G +  + + S    LT+VE GG T+FP  N               L V P 
Sbjct: 191 YDAFDPVQYGPQKSQRVASFLLYLTNVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPR 250

Query: 95  KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
           KG  + +Y+   N  +D    H  CPV  G KW
Sbjct: 251 KGDGLLFYSLMVNGTIDRTSLHGSCPVIKGEKW 283



 Score = 42.7 bits (99), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           LT+VE GG T+FP  N               L V P KG  + +++   N  +D    H 
Sbjct: 214 LTNVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDRTSLHG 273

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 274 SCPVIKGEKW 283


>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 287

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 19/78 (24%)

Query: 69  LLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTL 109
           LL  L+DVE GG T+FP+                     L+V P  G A+ +++   +  
Sbjct: 197 LLMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSVKPNMGDALLFWSMKPDAT 256

Query: 110 LDYRMYHSGCPVALGNKW 127
           LD    H GCPV  GNKW
Sbjct: 257 LDPSSLHGGCPVINGNKW 274



 Score = 42.0 bits (97), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 33/74 (44%), Gaps = 19/74 (25%)

Query: 11  LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
           L+DVE GG T+FP+                     L+V P  G A+ + +   +  LD  
Sbjct: 201 LSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSVKPNMGDALLFWSMKPDATLDPS 260

Query: 52  MYHSGCPVALGNKW 65
             H GCPV  GNKW
Sbjct: 261 SLHGGCPVINGNKW 274


>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
 gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
          Length = 295

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 18/83 (21%)

Query: 62  GNKWGKLLLSGLTDVELGGATIFP-----------------SLNLTVFPEKGSAVFWYNA 104
           G+++  +LL  LTDV  GG T+FP                    + V P KG A+ ++N 
Sbjct: 153 GHRYATVLLY-LTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNL 211

Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
           + +   D    H GCPV  G KW
Sbjct: 212 NPDGTTDSVSLHGGCPVIKGEKW 234



 Score = 41.6 bits (96), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 30/72 (41%), Gaps = 17/72 (23%)

Query: 11  LTDVELGGATIFP-----------------SLNLTVFPEKGSAVFWHNAHANTLLDYWMY 53
           LTDV  GG T+FP                    + V P KG A+ + N + +   D    
Sbjct: 163 LTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNLNPDGTTDSVSL 222

Query: 54  HSGCPVALGNKW 65
           H GCPV  G KW
Sbjct: 223 HGGCPVIKGEKW 234


>gi|66359518|ref|XP_626937.1| prolyl 4-hydroxylase alpha subunit [Cryptosporidium parvum Iowa II]
 gi|46228084|gb|EAK88983.1| prolyl 4-hydroxylase alpha subunit [Cryptosporidium parvum Iowa II]
          Length = 717

 Score = 45.1 bits (105), Expect = 0.008,   Method: Composition-based stats.
 Identities = 23/54 (42%), Positives = 29/54 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           NR    SV L+DVE GG   FP   + + P+KG AV W N  +   LDY   H+
Sbjct: 578 NRTHTISVFLSDVENGGELDFPYAGIKIKPKKGFAVVWPNIDSQGKLDYTTVHA 631



 Score = 40.4 bits (93), Expect = 0.27,   Method: Composition-based stats.
 Identities = 19/45 (42%), Positives = 25/45 (55%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           L+DVE GG   FP   + + P+KG AV W N  +   LDY   H+
Sbjct: 587 LSDVENGGELDFPYAGIKIKPKKGFAVVWPNIDSQGKLDYTTVHA 631


>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
 gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
          Length = 292

 Score = 45.1 bits (105), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 32/70 (45%), Gaps = 15/70 (21%)

Query: 73  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           L+DVE GG T+FP  N               L V P++G A+ +Y+   N   D    H 
Sbjct: 210 LSDVEEGGETMFPFENYEHMNENYDYKECIGLKVKPKQGDALLFYSMFPNGTFDKTALHG 269

Query: 118 GCPVALGNKW 127
            CPV  G KW
Sbjct: 270 SCPVIKGEKW 279



 Score = 43.1 bits (100), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 32/70 (45%), Gaps = 15/70 (21%)

Query: 11  LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           L+DVE GG T+FP  N               L V P++G A+ +++   N   D    H 
Sbjct: 210 LSDVEEGGETMFPFENYEHMNENYDYKECIGLKVKPKQGDALLFYSMFPNGTFDKTALHG 269

Query: 56  GCPVALGNKW 65
            CPV  G KW
Sbjct: 270 SCPVIKGEKW 279


>gi|195452740|ref|XP_002073479.1| GK13122 [Drosophila willistoni]
 gi|194169564|gb|EDW84465.1| GK13122 [Drosophila willistoni]
          Length = 494

 Score = 45.1 bits (105), Expect = 0.009,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 30/59 (50%), Gaps = 5/59 (8%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-----LDYRMYHSGCPVALGNK 126
           L   ++GGA +FP L L +    GS ++W   + N        DYR  H+ CPV LG +
Sbjct: 433 LNPAQMGGAMVFPQLQLGINVPHGSLLYWRTCNGNEFSMDSESDYRSRHAVCPVVLGEQ 491



 Score = 43.1 bits (100), Expect = 0.033,   Method: Composition-based stats.
 Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 5/63 (7%)

Query: 7   SSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-----LDYWMYHSGCPVAL 61
           + V L   ++GGA +FP L L +    GS ++W   + N        DY   H+ CPV L
Sbjct: 429 ARVMLNPAQMGGAMVFPQLQLGINVPHGSLLYWRTCNGNEFSMDSESDYRSRHAVCPVVL 488

Query: 62  GNK 64
           G +
Sbjct: 489 GEQ 491


>gi|67598816|ref|XP_666240.1| hypothetical protein [Cryptosporidium hominis TU502]
 gi|54657195|gb|EAL36009.1| hypothetical protein Chro.30441 [Cryptosporidium hominis]
          Length = 718

 Score = 45.1 bits (105), Expect = 0.009,   Method: Composition-based stats.
 Identities = 23/54 (42%), Positives = 29/54 (53%)

Query: 2   NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
           NR    SV L+DVE GG   FP   + + P+KG AV W N  +   LDY   H+
Sbjct: 579 NRTHTISVFLSDVENGGELDFPYAGIKIKPKKGLAVVWPNIDSQGKLDYTTVHA 632



 Score = 40.0 bits (92), Expect = 0.27,   Method: Composition-based stats.
 Identities = 19/45 (42%), Positives = 25/45 (55%)

Query: 73  LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
           L+DVE GG   FP   + + P+KG AV W N  +   LDY   H+
Sbjct: 588 LSDVENGGELDFPYAGIKIKPKKGLAVVWPNIDSQGKLDYTTVHA 632


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.137    0.445 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,188,443,347
Number of Sequences: 23463169
Number of extensions: 86208845
Number of successful extensions: 166759
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1222
Number of HSP's successfully gapped in prelim test: 324
Number of HSP's that attempted gapping in prelim test: 163012
Number of HSP's gapped (non-prelim): 3144
length of query: 136
length of database: 8,064,228,071
effective HSP length: 101
effective length of query: 35
effective length of database: 9,989,415,298
effective search space: 349629535430
effective search space used: 349629535430
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 71 (32.0 bits)