BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy6258
(136 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
Length = 514
Score = 86.3 bits (212), Expect = 4e-15, Method: Composition-based stats.
Identities = 34/55 (61%), Positives = 43/55 (78%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DV +GGAT+FP L L VFP+KGSA+FWYN HAN +D+R H CPV +G+KW
Sbjct: 437 LSDVAIGGATVFPQLGLGVFPQKGSAIFWYNLHANGTVDHRTLHGACPVFVGSKW 491
Score = 81.6 bits (200), Expect = 9e-14, Method: Composition-based stats.
Identities = 34/64 (53%), Positives = 44/64 (68%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DV +GGAT+FP L L VFP+KGSA+FW+N HAN +D+ H CPV +
Sbjct: 428 NRIATVMYYLSDVAIGGATVFPQLGLGVFPQKGSAIFWYNLHANGTVDHRTLHGACPVFV 487
Query: 62 GNKW 65
G+KW
Sbjct: 488 GSKW 491
>gi|170029530|ref|XP_001842645.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
gi|167863229|gb|EDS26612.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
Length = 522
Score = 85.9 bits (211), Expect = 4e-15, Method: Composition-based stats.
Identities = 36/67 (53%), Positives = 49/67 (73%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ L+ L+DVE+GG T+FP L+L VFP KGSA+FW+N H N +D RM H+GCP
Sbjct: 422 MGNRLATLMYY-LSDVEVGGGTVFPRLSLGVFPIKGSAIFWHNVHHNGNVDERMLHAGCP 480
Query: 121 VALGNKW 127
+G+KW
Sbjct: 481 TLIGSKW 487
Score = 83.2 bits (204), Expect = 3e-14, Method: Composition-based stats.
Identities = 35/64 (54%), Positives = 44/64 (68%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DVE+GG T+FP L+L VFP KGSA+FWHN H N +D M H+GCP +
Sbjct: 424 NRLATLMYYLSDVEVGGGTVFPRLSLGVFPIKGSAIFWHNVHHNGNVDERMLHAGCPTLI 483
Query: 62 GNKW 65
G+KW
Sbjct: 484 GSKW 487
>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
Length = 508
Score = 82.0 bits (201), Expect = 8e-14, Method: Composition-based stats.
Identities = 33/55 (60%), Positives = 43/55 (78%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DVE+GGAT+F ++ + V PEKGSAVFWYN H N LD + H+GCPV +GNKW
Sbjct: 431 MSDVEVGGATVFTNVGVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKW 485
Score = 77.8 bits (190), Expect = 1e-12, Method: Composition-based stats.
Identities = 32/57 (56%), Positives = 43/57 (75%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ ++DVE+GGAT+F ++ + V PEKGSAVFW+N H N LD H+GCPV +GNKW
Sbjct: 429 IYMSDVEVGGATVFTNVGVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKW 485
>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
Length = 490
Score = 82.0 bits (201), Expect = 8e-14, Method: Composition-based stats.
Identities = 33/55 (60%), Positives = 43/55 (78%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DVE+GGAT+F ++ + V PEKGSAVFWYN H N LD + H+GCPV +GNKW
Sbjct: 413 MSDVEVGGATVFTNVGVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKW 467
Score = 77.8 bits (190), Expect = 1e-12, Method: Composition-based stats.
Identities = 32/57 (56%), Positives = 43/57 (75%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ ++DVE+GGAT+F ++ + V PEKGSAVFW+N H N LD H+GCPV +GNKW
Sbjct: 411 IYMSDVEVGGATVFTNVGVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKW 467
>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 509
Score = 79.0 bits (193), Expect = 7e-13, Method: Composition-based stats.
Identities = 34/67 (50%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ ++ L+DVE+GGAT+FP L L VFP KGSA+ WYN + N D R H+ CP
Sbjct: 417 VGNRIATVMFY-LSDVEIGGATVFPQLGLAVFPRKGSAILWYNLYRNGKGDRRTLHAACP 475
Query: 121 VALGNKW 127
V G+KW
Sbjct: 476 VLSGSKW 482
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/64 (50%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DVE+GGAT+FP L L VFP KGSA+ W+N + N D H+ CPV
Sbjct: 419 NRIATVMFYLSDVEIGGATVFPQLGLAVFPRKGSAILWYNLYRNGKGDRRTLHAACPVLS 478
Query: 62 GNKW 65
G+KW
Sbjct: 479 GSKW 482
>gi|344252711|gb|EGW08815.1| Prolyl 4-hydroxylase subunit alpha-2 [Cricetulus griseus]
Length = 584
Score = 78.2 bits (191), Expect = 1e-12, Method: Composition-based stats.
Identities = 33/61 (54%), Positives = 43/61 (70%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L S ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KWG
Sbjct: 466 LCSQMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWG 525
Query: 129 K 129
K
Sbjct: 526 K 526
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 29/58 (50%), Positives = 41/58 (70%)
Query: 10 ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGK 67
+++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KWGK
Sbjct: 469 QMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWGK 526
>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
(Silurana) tropicalis]
gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
Length = 527
Score = 78.2 bits (191), Expect = 1e-12, Method: Composition-based stats.
Identities = 33/68 (48%), Positives = 46/68 (67%), Gaps = 1/68 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 448 GNRLATFL-NYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPV 506
Query: 122 ALGNKWGK 129
+G+KWGK
Sbjct: 507 LVGSKWGK 514
Score = 71.6 bits (174), Expect = 9e-11, Method: Composition-based stats.
Identities = 28/57 (49%), Positives = 40/57 (70%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGK 67
++DVE GGAT+FP ++P+KG+AVFW+N + DY H+ CPV +G+KWGK
Sbjct: 458 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWGK 514
>gi|157114983|ref|XP_001658090.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108877085|gb|EAT41310.1| AAEL007032-PA, partial [Aedes aegypti]
Length = 448
Score = 77.8 bits (190), Expect = 1e-12, Method: Composition-based stats.
Identities = 32/64 (50%), Positives = 42/64 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLL 132
++DVE GGAT+FP + + VFP+KGSA+FWYN + D R H CPV LG+KWG +
Sbjct: 384 ISDVEQGGATVFPQIGVGVFPKKGSAIFWYNLLPDGTGDERTLHGACPVLLGSKWGMFCM 443
Query: 133 SGVF 136
F
Sbjct: 444 QQYF 447
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 29/56 (51%), Positives = 39/56 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
++DVE GGAT+FP + + VFP+KGSA+FW+N + D H CPV LG+KWG
Sbjct: 384 ISDVEQGGATVFPQIGVGVFPKKGSAIFWYNLLPDGTGDERTLHGACPVLLGSKWG 439
>gi|195341558|ref|XP_002037373.1| GM12146 [Drosophila sechellia]
gi|194131489|gb|EDW53532.1| GM12146 [Drosophila sechellia]
Length = 485
Score = 77.8 bits (190), Expect = 1e-12, Method: Composition-based stats.
Identities = 32/56 (57%), Positives = 39/56 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L DV GGAT FP LN+TVFP+ G+ + WYN H LL R H+GCPV +G+KWG
Sbjct: 430 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGLLHVRTMHTGCPVIVGSKWG 485
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 42/66 (63%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
++R + L DV GGAT FP LN+TVFP+ G+ + W+N H LL H+GCPV
Sbjct: 420 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGLLHVRTMHTGCPVI 479
Query: 61 LGNKWG 66
+G+KWG
Sbjct: 480 VGSKWG 485
>gi|281348666|gb|EFB24250.1| hypothetical protein PANDA_000722 [Ailuropoda melanoleuca]
Length = 505
Score = 77.8 bits (190), Expect = 1e-12, Method: Composition-based stats.
Identities = 33/63 (52%), Positives = 44/63 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KWG
Sbjct: 443 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWG 502
Query: 129 KLL 131
K L
Sbjct: 503 KWL 505
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLL 69
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KWGK L
Sbjct: 447 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWGKWL 505
>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
Length = 534
Score = 77.4 bits (189), Expect = 2e-12, Method: Composition-based stats.
Identities = 39/103 (37%), Positives = 57/103 (55%), Gaps = 1/103 (0%)
Query: 25 LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIF 84
L++T + A + H D++ P A GN+ + L+DVE GGAT F
Sbjct: 400 LDMTFCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFY-LSDVEQGGATAF 458
Query: 85 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
P LN V P+ G+ +FWYN H ++ +D+R H+GCPV G+KW
Sbjct: 459 PFLNFAVRPQLGNILFWYNLHRSSDMDFRTKHAGCPVLKGSKW 501
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+DVE GGAT FP LN V P+ G+ +FW+N H ++ +D+ H+GCPV
Sbjct: 438 NRIATAIFYLSDVEQGGATAFPFLNFAVRPQLGNILFWYNLHRSSDMDFRTKHAGCPVLK 497
Query: 62 GNKW 65
G+KW
Sbjct: 498 GSKW 501
>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
Length = 553
Score = 77.0 bits (188), Expect = 2e-12, Method: Composition-based stats.
Identities = 33/66 (50%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DV GGAT+FPSLNL ++P KG+A FW+N HA+ DY H+ CPV
Sbjct: 461 GNRIATVLFY-MSDVTQGGATVFPSLNLALWPRKGTAAFWFNLHASGRGDYATRHAACPV 519
Query: 122 ALGNKW 127
G KW
Sbjct: 520 LTGTKW 525
Score = 74.3 bits (181), Expect = 1e-11, Method: Composition-based stats.
Identities = 32/64 (50%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FPSLNL ++P KG+A FW N HA+ DY H+ CPV
Sbjct: 462 NRIATVLFYMSDVTQGGATVFPSLNLALWPRKGTAAFWFNLHASGRGDYATRHAACPVLT 521
Query: 62 GNKW 65
G KW
Sbjct: 522 GTKW 525
>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
Length = 533
Score = 77.0 bits (188), Expect = 2e-12, Method: Composition-based stats.
Identities = 43/113 (38%), Positives = 61/113 (53%), Gaps = 4/113 (3%)
Query: 15 ELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLT 74
++G AT L++T + A + H D++ P A GN+ + L+
Sbjct: 392 DVGDAT---GLDMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFY-LS 447
Query: 75 DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
DVE GGAT FP LN V P+ G+ +FWYN H ++ DYR H+GCPV G+KW
Sbjct: 448 DVEQGGATAFPFLNFAVRPQLGNILFWYNLHRSSDEDYRTKHAGCPVLKGSKW 500
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+DVE GGAT FP LN V P+ G+ +FW+N H ++ DY H+GCPV
Sbjct: 437 NRIATAIFYLSDVEQGGATAFPFLNFAVRPQLGNILFWYNLHRSSDEDYRTKHAGCPVLK 496
Query: 62 GNKW 65
G+KW
Sbjct: 497 GSKW 500
>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
Length = 538
Score = 77.0 bits (188), Expect = 2e-12, Method: Composition-based stats.
Identities = 39/103 (37%), Positives = 56/103 (54%), Gaps = 1/103 (0%)
Query: 25 LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIF 84
L++T + A + H D+++ P GN+ + L+DVE GGAT F
Sbjct: 404 LDMTYCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGNRIATAIFY-LSDVEQGGATAF 462
Query: 85 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
P LN V P+ G+ +FWYN H + +DYR H+GCPV G+KW
Sbjct: 463 PFLNFAVRPQLGNILFWYNLHRSLDMDYRTKHAGCPVLKGSKW 505
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+DVE GGAT FP LN V P+ G+ +FW+N H + +DY H+GCPV
Sbjct: 442 NRIATAIFYLSDVEQGGATAFPFLNFAVRPQLGNILFWYNLHRSLDMDYRTKHAGCPVLK 501
Query: 62 GNKW 65
G+KW
Sbjct: 502 GSKW 505
>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 520
Score = 77.0 bits (188), Expect = 2e-12, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 46/68 (67%), Gaps = 1/68 (1%)
Query: 60 ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
+GN+ L+ L+DV+ GG+T+FP LNL V P KG+A+FWYN H N + + H+ C
Sbjct: 431 GMGNRIATLMFY-LSDVQEGGSTVFPRLNLAVRPRKGTAIFWYNLHRNGKGNKKTLHAAC 489
Query: 120 PVALGNKW 127
PV +G+KW
Sbjct: 490 PVLIGSKW 497
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DV+ GG+T+FP LNL V P KG+A+FW+N H N + H+ CPV +
Sbjct: 434 NRIATLMFYLSDVQEGGSTVFPRLNLAVRPRKGTAIFWYNLHRNGKGNKKTLHAACPVLI 493
Query: 62 GNKW 65
G+KW
Sbjct: 494 GSKW 497
>gi|194765172|ref|XP_001964701.1| GF23326 [Drosophila ananassae]
gi|190614973|gb|EDV30497.1| GF23326 [Drosophila ananassae]
Length = 885
Score = 77.0 bits (188), Expect = 2e-12, Method: Composition-based stats.
Identities = 35/60 (58%), Positives = 43/60 (71%)
Query: 68 LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+L + L+DVE GGATIF LNLTVFP+ GSA+FWYN D R H+GCPV +G+KW
Sbjct: 799 VLSNALSDVEQGGATIFTKLNLTVFPQSGSALFWYNLDNWGNEDKRTEHAGCPVIVGSKW 858
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 42/92 (45%), Positives = 53/92 (57%), Gaps = 16/92 (17%)
Query: 8 SVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYW------MYHSGCPVAL 61
S L+DVE GGATIF LNLTVFP+ GSA+FW+N LD W H+GCPV +
Sbjct: 801 SNALSDVEQGGATIFTKLNLTVFPQSGSALFWYN------LDNWGNEDKRTEHAGCPVIV 854
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFP 93
G+KW ++ +TD +G P VFP
Sbjct: 855 GSKW--IMTKWVTD--MGQEFRKPCYKSGVFP 882
>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
Length = 515
Score = 76.6 bits (187), Expect = 3e-12, Method: Composition-based stats.
Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 1/75 (1%)
Query: 53 YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
+ S A+G++ +L L+DVE GGAT+FP + ++V+P+KGSAV W+N H + D
Sbjct: 424 FQSTVDPAIGSRIETVLFY-LSDVEQGGATVFPEIQVSVWPQKGSAVVWFNLHPSGDGDQ 482
Query: 113 RMYHSGCPVALGNKW 127
R H+GCPV +G+KW
Sbjct: 483 RTKHAGCPVLIGSKW 497
Score = 71.6 bits (174), Expect = 9e-11, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 43/64 (67%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R E L+DVE GGAT+FP + ++V+P+KGSAV W N H + D H+GCPV +
Sbjct: 434 SRIETVLFYLSDVEQGGATVFPEIQVSVWPQKGSAVVWFNLHPSGDGDQRTKHAGCPVLI 493
Query: 62 GNKW 65
G+KW
Sbjct: 494 GSKW 497
>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
Length = 509
Score = 76.6 bits (187), Expect = 3e-12, Method: Composition-based stats.
Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 1/75 (1%)
Query: 53 YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
+ S A+G++ +L L+DVE GGAT+FP + ++V+P+KGSAV W+N H + D
Sbjct: 418 FQSTVDPAIGSRIETVLFY-LSDVEQGGATVFPEIQVSVWPQKGSAVVWFNLHPSGDGDQ 476
Query: 113 RMYHSGCPVALGNKW 127
R H+GCPV +G+KW
Sbjct: 477 RTKHAGCPVLIGSKW 491
Score = 71.6 bits (174), Expect = 9e-11, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 43/64 (67%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R E L+DVE GGAT+FP + ++V+P+KGSAV W N H + D H+GCPV +
Sbjct: 428 SRIETVLFYLSDVEQGGATVFPEIQVSVWPQKGSAVVWFNLHPSGDGDQRTKHAGCPVLI 487
Query: 62 GNKW 65
G+KW
Sbjct: 488 GSKW 491
>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
Length = 540
Score = 76.6 bits (187), Expect = 3e-12, Method: Composition-based stats.
Identities = 32/55 (58%), Positives = 40/55 (72%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++VE GG T FP L L V PE+GS +FWYN H + LDYR H+GCPV +G+KW
Sbjct: 453 LSNVEAGGGTAFPFLPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLMGSKW 507
Score = 72.4 bits (176), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/64 (50%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L++VE GG T FP L L V PE+GS +FW+N H + LDY H+GCPV +
Sbjct: 444 NRVATGIYYLSNVEAGGGTAFPFLPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLM 503
Query: 62 GNKW 65
G+KW
Sbjct: 504 GSKW 507
>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
Length = 534
Score = 76.3 bits (186), Expect = 4e-12, Method: Composition-based stats.
Identities = 34/70 (48%), Positives = 44/70 (62%), Gaps = 1/70 (1%)
Query: 58 PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
P GN+ + L+DVE GGAT FP LN V P+ G+ +FWYN H + +DYR H+
Sbjct: 433 PAEEGNRMATAIFY-LSDVEQGGATAFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHA 491
Query: 118 GCPVALGNKW 127
GCPV G+KW
Sbjct: 492 GCPVLKGSKW 501
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+DVE GGAT FP LN V P+ G+ +FW+N H + +DY H+GCPV
Sbjct: 438 NRMATAIFYLSDVEQGGATAFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLK 497
Query: 62 GNKW 65
G+KW
Sbjct: 498 GSKW 501
>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 76.3 bits (186), Expect = 4e-12, Method: Composition-based stats.
Identities = 34/70 (48%), Positives = 44/70 (62%), Gaps = 1/70 (1%)
Query: 58 PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
P GN+ + L+DVE GGAT FP LN V P+ G+ +FWYN H + +DYR H+
Sbjct: 433 PAEEGNRMATAIFY-LSDVEQGGATAFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHA 491
Query: 118 GCPVALGNKW 127
GCPV G+KW
Sbjct: 492 GCPVLKGSKW 501
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+DVE GGAT FP LN V P+ G+ +FW+N H + +DY H+GCPV
Sbjct: 438 NRMATAIFYLSDVEQGGATAFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLK 497
Query: 62 GNKW 65
G+KW
Sbjct: 498 GSKW 501
>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 531
Score = 76.3 bits (186), Expect = 4e-12, Method: Composition-based stats.
Identities = 34/66 (51%), Positives = 45/66 (68%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ LL +++V+ GGAT+F + +V P+KGSAVFWYN H + DYR H+ CPV
Sbjct: 444 GNRIATWLLY-MSEVQAGGATVFTDIGASVSPKKGSAVFWYNLHPSGDGDYRTRHAACPV 502
Query: 122 ALGNKW 127
LGNKW
Sbjct: 503 LLGNKW 508
Score = 69.7 bits (169), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + +++V+ GGAT+F + +V P+KGSAVFW+N H + DY H+ CPV L
Sbjct: 445 NRIATWLLYMSEVQAGGATVFTDIGASVSPKKGSAVFWYNLHPSGDGDYRTRHAACPVLL 504
Query: 62 GNKW 65
GNKW
Sbjct: 505 GNKW 508
>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
Length = 516
Score = 75.9 bits (185), Expect = 4e-12, Method: Composition-based stats.
Identities = 33/66 (50%), Positives = 45/66 (68%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L+ L+DVE GGAT+FP L + VFP+KG+A+FWYN A+ D + H CPV
Sbjct: 429 GNRIATLMFY-LSDVEEGGATVFPHLGVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPV 487
Query: 122 ALGNKW 127
+G+KW
Sbjct: 488 LIGSKW 493
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DVE GGAT+FP L + VFP+KG+A+FW+N A+ D H CPV +
Sbjct: 430 NRIATLMFYLSDVEEGGATVFPHLGVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPVLI 489
Query: 62 GNKW 65
G+KW
Sbjct: 490 GSKW 493
>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
Length = 522
Score = 75.9 bits (185), Expect = 5e-12, Method: Composition-based stats.
Identities = 33/71 (46%), Positives = 44/71 (61%), Gaps = 8/71 (11%)
Query: 65 WGKL--------LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYH 116
WGKL +L ++DV +GGAT+FP L +T+ KG+A FWYN H++ LDY H
Sbjct: 429 WGKLKEVNRIATVLFYMSDVSIGGATVFPKLGVTLEARKGTAAFWYNLHSSGELDYSTLH 488
Query: 117 SGCPVALGNKW 127
CPV +G KW
Sbjct: 489 GACPVLIGEKW 499
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 29/65 (44%), Positives = 41/65 (63%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
+NR ++DV +GGAT+FP L +T+ KG+A FW+N H++ LDY H CPV
Sbjct: 435 VNRIATVLFYMSDVSIGGATVFPKLGVTLEARKGTAAFWYNLHSSGELDYSTLHGACPVL 494
Query: 61 LGNKW 65
+G KW
Sbjct: 495 IGEKW 499
>gi|195505216|ref|XP_002099408.1| GE23378 [Drosophila yakuba]
gi|194185509|gb|EDW99120.1| GE23378 [Drosophila yakuba]
Length = 546
Score = 75.9 bits (185), Expect = 5e-12, Method: Composition-based stats.
Identities = 32/55 (58%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GGAT FP LN+TVFP+ G+A+FWYN LL R H+GCPV +G+KW
Sbjct: 457 LNDVPQGGATHFPGLNITVFPKFGTALFWYNLDTKGLLRLRTMHTGCPVIVGSKW 511
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/65 (49%), Positives = 42/65 (64%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
M+R + L DV GGAT FP LN+TVFP+ G+A+FW+N LL H+GCPV
Sbjct: 447 MDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTALFWYNLDTKGLLRLRTMHTGCPVI 506
Query: 61 LGNKW 65
+G+KW
Sbjct: 507 VGSKW 511
>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
Length = 235
Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 32/55 (58%), Positives = 42/55 (76%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DVE+GGAT+F ++ + V PEKGSAVFW N H N LD + H+GCPV +GNKW
Sbjct: 158 MSDVEVGGATVFTNVGVAVKPEKGSAVFWNNLHKNGELDLKTKHAGCPVLVGNKW 212
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 32/55 (58%), Positives = 42/55 (76%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE+GGAT+F ++ + V PEKGSAVFW+N H N LD H+GCPV +GNKW
Sbjct: 158 MSDVEVGGATVFTNVGVAVKPEKGSAVFWNNLHKNGELDLKTKHAGCPVLVGNKW 212
>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
rubripes]
Length = 538
Score = 75.9 bits (185), Expect = 6e-12, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 40/59 (67%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP ++P KG+AVFWYN + DYR H+ CPV +GNKW
Sbjct: 457 FLNYMSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFKSGEGDYRTRHAACPVLVGNKW 515
Score = 69.3 bits (168), Expect = 4e-10, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P KG+AVFW+N + DY H+ CPV +GNKW
Sbjct: 461 MSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFKSGEGDYRTRHAACPVLVGNKW 515
>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 548
Score = 75.5 bits (184), Expect = 6e-12, Method: Composition-based stats.
Identities = 33/66 (50%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + V+P+KG+AVFWYN AN DY H+ CPV
Sbjct: 461 GNRIATWLFY-MSDVAAGGATVFPDVGAAVWPQKGTAVFWYNLFANGEGDYSTRHAACPV 519
Query: 122 ALGNKW 127
+GNKW
Sbjct: 520 LVGNKW 525
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + V+P+KG+AVFW+N AN DY H+ CPV +
Sbjct: 462 NRIATWLFYMSDVAAGGATVFPDVGAAVWPQKGTAVFWYNLFANGEGDYSTRHAACPVLV 521
Query: 62 GNKW 65
GNKW
Sbjct: 522 GNKW 525
>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
Length = 496
Score = 75.5 bits (184), Expect = 6e-12, Method: Composition-based stats.
Identities = 32/55 (58%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LTDVE GGAT+F LN V P++G+A+FWYN H N D R H GCPV +G+KW
Sbjct: 420 LTDVEQGGATVFTILNQAVSPKRGTALFWYNLHRNGTGDTRTLHGGCPVLVGSKW 474
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+F LN V P++G+A+FW+N H N D H GCPV +G+KW
Sbjct: 420 LTDVEQGGATVFTILNQAVSPKRGTALFWYNLHRNGTGDTRTLHGGCPVLVGSKW 474
>gi|195575113|ref|XP_002105524.1| GD16980 [Drosophila simulans]
gi|194201451|gb|EDX15027.1| GD16980 [Drosophila simulans]
Length = 518
Score = 75.5 bits (184), Expect = 7e-12, Method: Composition-based stats.
Identities = 31/55 (56%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GGAT FP LN+TVFP+ G+ + WYN H LL R H+GCPV +G+KW
Sbjct: 429 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGLLHVRTMHTGCPVIVGSKW 483
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/65 (46%), Positives = 41/65 (63%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
++R + L DV GGAT FP LN+TVFP+ G+ + W+N H LL H+GCPV
Sbjct: 419 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGLLHVRTMHTGCPVI 478
Query: 61 LGNKW 65
+G+KW
Sbjct: 479 VGSKW 483
>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
Length = 539
Score = 75.5 bits (184), Expect = 7e-12, Method: Composition-based stats.
Identities = 31/55 (56%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GGAT FP LN+TVFP G+A+FWYN +L R H+GCPV +G+KW
Sbjct: 450 LNDVPQGGATHFPGLNITVFPRLGAALFWYNLDTQGMLQVRTMHTGCPVIVGSKW 504
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/65 (46%), Positives = 41/65 (63%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
++R + L DV GGAT FP LN+TVFP G+A+FW+N +L H+GCPV
Sbjct: 440 IDRIATTLFYLNDVPQGGATHFPGLNITVFPRLGAALFWYNLDTQGMLQVRTMHTGCPVI 499
Query: 61 LGNKW 65
+G+KW
Sbjct: 500 VGSKW 504
>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
Length = 511
Score = 75.1 bits (183), Expect = 8e-12, Method: Composition-based stats.
Identities = 42/125 (33%), Positives = 63/125 (50%), Gaps = 12/125 (9%)
Query: 3 RAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG 62
R E V++T++ + G+ + +N + H D++ G
Sbjct: 376 RIERRVVDMTELSMQGSEAYNIMNYGL-----------GGHYAAHYDFFNTTKQQTSETG 424
Query: 63 NKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 122
++ +L L+DVE GGAT+FP+L L V PE+G A+FWYN N D R H GCPV
Sbjct: 425 DRIATVLFY-LSDVEQGGATVFPNLKLAVSPERGMALFWYNLLDNGTGDTRTLHGGCPVL 483
Query: 123 LGNKW 127
+G+KW
Sbjct: 484 VGSKW 488
>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 594
Score = 75.1 bits (183), Expect = 8e-12, Method: Composition-based stats.
Identities = 34/66 (51%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ LL +TDV+ GGAT+F + V P+KG+AVFWYN + + DYR H+ CPV
Sbjct: 507 GNRIATWLLY-MTDVQAGGATVFTDIGAAVKPKKGTAVFWYNLYPSGEGDYRTRHAACPV 565
Query: 122 ALGNKW 127
LGNKW
Sbjct: 566 LLGNKW 571
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + +TDV+ GGAT+F + V P+KG+AVFW+N + + DY H+ CPV L
Sbjct: 508 NRIATWLLYMTDVQAGGATVFTDIGAAVKPKKGTAVFWYNLYPSGEGDYRTRHAACPVLL 567
Query: 62 GNKW 65
GNKW
Sbjct: 568 GNKW 571
>gi|157114985|ref|XP_001658091.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108877086|gb|EAT41311.1| AAEL007038-PA [Aedes aegypti]
Length = 545
Score = 75.1 bits (183), Expect = 8e-12, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DV GGAT+FPSL L ++P+KG+A FW+N HA+ DY H+ CPV
Sbjct: 454 GNRIATVLFY-MSDVTQGGATVFPSLRLALWPKKGAAAFWFNLHASGQGDYSTRHAACPV 512
Query: 122 ALGNKW 127
G KW
Sbjct: 513 LTGTKW 518
Score = 72.0 bits (175), Expect = 7e-11, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FPSL L ++P+KG+A FW N HA+ DY H+ CPV
Sbjct: 455 NRIATVLFYMSDVTQGGATVFPSLRLALWPKKGAAAFWFNLHASGQGDYSTRHAACPVLT 514
Query: 62 GNKW 65
G KW
Sbjct: 515 GTKW 518
>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
Length = 536
Score = 75.1 bits (183), Expect = 9e-12, Method: Composition-based stats.
Identities = 38/103 (36%), Positives = 56/103 (54%), Gaps = 1/103 (0%)
Query: 25 LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIF 84
L++T + A + H D++ P GN+ + L++VE GGAT F
Sbjct: 402 LDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEEGNRIATAIYY-LSEVEQGGATAF 460
Query: 85 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
P LN V P+ G+ +FWYN H ++ +DYR H+GCPV G+KW
Sbjct: 461 PFLNFAVRPQLGNVLFWYNLHRSSDMDYRTKHAGCPVLKGSKW 503
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L++VE GGAT FP LN V P+ G+ +FW+N H ++ +DY H+GCPV
Sbjct: 440 NRIATAIYYLSEVEQGGATAFPFLNFAVRPQLGNVLFWYNLHRSSDMDYRTKHAGCPVLK 499
Query: 62 GNKW 65
G+KW
Sbjct: 500 GSKW 503
>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
Length = 511
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DV+ GGAT+F + +V P+KGSAVFWYN + DYR H+ CPV LGNKW
Sbjct: 436 MSDVQAGGATVFTDIGASVLPQKGSAVFWYNLRPSGDGDYRTRHAACPVLLGNKW 490
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 28/57 (49%), Positives = 39/57 (68%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ ++DV+ GGAT+F + +V P+KGSAVFW+N + DY H+ CPV LGNKW
Sbjct: 434 LRMSDVQAGGATVFTDIGASVLPQKGSAVFWYNLRPSGDGDYRTRHAACPVLLGNKW 490
>gi|261245137|gb|ACX54875.1| FI12021p [Drosophila melanogaster]
Length = 538
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GGAT FP LN+TVFP+ G+ + WYN H +L R H+GCPV +G+KW
Sbjct: 449 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 503
Score = 69.3 bits (168), Expect = 4e-10, Method: Composition-based stats.
Identities = 29/65 (44%), Positives = 41/65 (63%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
++R + L DV GGAT FP LN+TVFP+ G+ + W+N H +L H+GCPV
Sbjct: 439 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVI 498
Query: 61 LGNKW 65
+G+KW
Sbjct: 499 VGSKW 503
>gi|116008537|ref|NP_733379.2| CG31524, isoform A [Drosophila melanogaster]
gi|113194861|gb|AAN14239.2| CG31524, isoform A [Drosophila melanogaster]
Length = 536
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GGAT FP LN+TVFP+ G+ + WYN H +L R H+GCPV +G+KW
Sbjct: 447 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 501
Score = 69.3 bits (168), Expect = 4e-10, Method: Composition-based stats.
Identities = 29/65 (44%), Positives = 41/65 (63%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
++R + L DV GGAT FP LN+TVFP+ G+ + W+N H +L H+GCPV
Sbjct: 437 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVI 496
Query: 61 LGNKW 65
+G+KW
Sbjct: 497 VGSKW 501
>gi|116008130|ref|NP_001036777.1| CG31524, isoform B [Drosophila melanogaster]
gi|113194860|gb|ABI31221.1| CG31524, isoform B [Drosophila melanogaster]
Length = 535
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GGAT FP LN+TVFP+ G+ + WYN H +L R H+GCPV +G+KW
Sbjct: 446 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 500
Score = 69.3 bits (168), Expect = 4e-10, Method: Composition-based stats.
Identities = 29/65 (44%), Positives = 41/65 (63%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
++R + L DV GGAT FP LN+TVFP+ G+ + W+N H +L H+GCPV
Sbjct: 436 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVI 495
Query: 61 LGNKW 65
+G+KW
Sbjct: 496 VGSKW 500
>gi|66771513|gb|AAY55068.1| IP12095p [Drosophila melanogaster]
Length = 538
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GGAT FP LN+TVFP+ G+ + WYN H +L R H+GCPV +G+KW
Sbjct: 449 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 503
Score = 69.3 bits (168), Expect = 4e-10, Method: Composition-based stats.
Identities = 29/65 (44%), Positives = 41/65 (63%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
++R + L DV GGAT FP LN+TVFP+ G+ + W+N H +L H+GCPV
Sbjct: 439 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVI 498
Query: 61 LGNKW 65
+G+KW
Sbjct: 499 VGSKW 503
>gi|66770643|gb|AAY54633.1| IP12395p [Drosophila melanogaster]
Length = 538
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GGAT FP LN+TVFP+ G+ + WYN H +L R H+GCPV +G+KW
Sbjct: 449 LNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 503
Score = 69.3 bits (168), Expect = 4e-10, Method: Composition-based stats.
Identities = 29/65 (44%), Positives = 41/65 (63%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
++R + L DV GGAT FP LN+TVFP+ G+ + W+N H +L H+GCPV
Sbjct: 439 IDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVI 498
Query: 61 LGNKW 65
+G+KW
Sbjct: 499 VGSKW 503
>gi|55925444|ref|NP_001007286.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Danio rerio]
gi|49900294|gb|AAH76508.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide 2 [Danio rerio]
gi|182891794|gb|AAI65288.1| P4ha2 protein [Danio rerio]
Length = 514
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 30/58 (51%), Positives = 40/58 (68%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
S ++DVE GGAT+FP +V+P KG+AVFWYN + DYR H+ CPV +G+KW
Sbjct: 434 FSRMSDVEAGGATVFPDFGASVWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 491
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP +V+P KG+AVFW+N + DY H+ CPV +G+KW
Sbjct: 437 MSDVEAGGATVFPDFGASVWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 491
>gi|301613006|ref|XP_002936013.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 504
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DVE GGAT+FP + V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 427 MSDVEAGGATVFPEVGAAVYPKKGTAVFWYNLFESGEGDYSTRHAACPVLVGNKW 481
Score = 72.0 bits (175), Expect = 8e-11, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+FP + V+P+KG+AVFW+N + DY H+ CPV +
Sbjct: 418 NRVATWLFYMSDVEAGGATVFPEVGAAVYPKKGTAVFWYNLFESGEGDYSTRHAACPVLV 477
Query: 62 GNKW 65
GNKW
Sbjct: 478 GNKW 481
>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 523
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ LL ++DV+ GG T+F + V+P+KG+AVFWYN H + DYR H+ CPV
Sbjct: 436 GNRIATWLLY-MSDVQAGGNTVFTDIGAVVWPKKGTAVFWYNLHRSGEGDYRTRHAACPV 494
Query: 122 ALGNKW 127
+GNKW
Sbjct: 495 LVGNKW 500
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + ++DV+ GG T+F + V+P+KG+AVFW+N H + DY H+ CPV +
Sbjct: 437 NRIATWLLYMSDVQAGGNTVFTDIGAVVWPKKGTAVFWYNLHRSGEGDYRTRHAACPVLV 496
Query: 62 GNKW 65
GNKW
Sbjct: 497 GNKW 500
>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
Length = 545
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 45/66 (68%), Gaps = 3/66 (4%)
Query: 65 WGKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
WG + + L +DVE GGAT+FP LNL+++P+KGSA FW+N + N + H+GCPV
Sbjct: 455 WGGRIATWLFYMSDVEAGGATVFPKLNLSLWPQKGSAAFWFNLYPNGEGNEMTQHAGCPV 514
Query: 122 ALGNKW 127
G+KW
Sbjct: 515 LTGSKW 520
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 40/55 (72%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP LNL+++P+KGSA FW N + N + H+GCPV G+KW
Sbjct: 466 MSDVEAGGATVFPKLNLSLWPQKGSAAFWFNLYPNGEGNEMTQHAGCPVLTGSKW 520
>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
latipes]
Length = 532
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP +++P KG+AVFWYN + DYR H+ CPV
Sbjct: 445 GNRLATFL-NYMSDVEAGGATVFPDFGASIWPRKGTAVFWYNLFRSGEGDYRTRHAACPV 503
Query: 122 ALGNKW 127
+G+KW
Sbjct: 504 LVGSKW 509
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP +++P KG+AVFW+N + DY H+ CPV +G+KW
Sbjct: 455 MSDVEAGGATVFPDFGASIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 509
>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Sarcophilus harrisii]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 45/66 (68%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP T++P+KG++VFWYN + DYR H+ CPV
Sbjct: 447 GNRLATFL-NYMSDVEAGGATVFPDFGATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPV 505
Query: 122 ALGNKW 127
+G+KW
Sbjct: 506 LVGSKW 511
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 39/55 (70%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP T++P+KG++VFW+N + DY H+ CPV +G+KW
Sbjct: 457 MSDVEAGGATVFPDFGATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 511
>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
lupus familiaris]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
[Rattus norvegicus]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
Length = 454
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 367 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 425
Query: 122 ALGNKW 127
+GNKW
Sbjct: 426 LVGNKW 431
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 368 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 427
Query: 62 GNKW 65
GNKW
Sbjct: 428 GNKW 431
>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide [Mus musculus]
gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
musculus]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|410975458|ref|XP_003994148.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Felis catus]
Length = 567
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 480 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 538
Query: 122 ALGNKW 127
+GNKW
Sbjct: 539 LVGNKW 544
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 481 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 540
Query: 62 GNKW 65
GNKW
Sbjct: 541 GNKW 544
>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
Length = 488
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 401 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 459
Query: 122 ALGNKW 127
+GNKW
Sbjct: 460 LVGNKW 465
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 402 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 461
Query: 62 GNKW 65
GNKW
Sbjct: 462 GNKW 465
>gi|395820528|ref|XP_003783616.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Otolemur
garnettii]
Length = 516
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 429 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 487
Query: 122 ALGNKW 127
+GNKW
Sbjct: 488 LVGNKW 493
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 430 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 489
Query: 62 GNKW 65
GNKW
Sbjct: 490 GNKW 493
>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
garnettii]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
garnettii]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Sarcophilus harrisii]
Length = 536
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 29/59 (49%), Positives = 42/59 (71%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP T++P+KG++VFWYN + DYR H+ CPV +G+KW
Sbjct: 455 FLNYMSDVEAGGATVFPDFGATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 513
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 39/55 (70%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP T++P+KG++VFW+N + DY H+ CPV +G+KW
Sbjct: 459 MSDVEAGGATVFPDFGATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 513
>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Cricetulus griseus]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Cricetulus griseus]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
porcellus]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
anatinus]
Length = 493
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 406 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 464
Query: 122 ALGNKW 127
+GNKW
Sbjct: 465 LVGNKW 470
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 407 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 466
Query: 62 GNKW 65
GNKW
Sbjct: 467 GNKW 470
>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
Length = 507
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 420 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 478
Query: 122 ALGNKW 127
+GNKW
Sbjct: 479 LVGNKW 484
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 421 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 480
Query: 62 GNKW 65
GNKW
Sbjct: 481 GNKW 484
>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Monodelphis domestica]
Length = 537
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 450 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 508
Query: 122 ALGNKW 127
+GNKW
Sbjct: 509 LVGNKW 514
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 451 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 510
Query: 62 GNKW 65
GNKW
Sbjct: 511 GNKW 514
>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Monodelphis domestica]
Length = 537
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 450 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 508
Query: 122 ALGNKW 127
+GNKW
Sbjct: 509 LVGNKW 514
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 451 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 510
Query: 62 GNKW 65
GNKW
Sbjct: 511 GNKW 514
>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1 [Nomascus leucogenys]
Length = 502
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 415 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 473
Query: 122 ALGNKW 127
+GNKW
Sbjct: 474 LVGNKW 479
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 416 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 475
Query: 62 GNKW 65
GNKW
Sbjct: 476 GNKW 479
>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
melanoleuca]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
jacchus]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|291404186|ref|XP_002718473.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 3
[Oryctolagus cuniculus]
Length = 516
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 429 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 487
Query: 122 ALGNKW 127
+GNKW
Sbjct: 488 LVGNKW 493
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 430 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 489
Query: 62 GNKW 65
GNKW
Sbjct: 490 GNKW 493
>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
[Oryctolagus cuniculus]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
[Oryctolagus cuniculus]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
Length = 511
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|217272851|ref|NP_001136068.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Homo
sapiens]
gi|114631189|ref|XP_001140871.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 10 [Pan
troglodytes]
Length = 516
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 429 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 487
Query: 122 ALGNKW 127
+GNKW
Sbjct: 488 LVGNKW 493
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 430 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 489
Query: 62 GNKW 65
GNKW
Sbjct: 490 GNKW 493
>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
musculus]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
Length = 526
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 439 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 497
Query: 122 ALGNKW 127
+GNKW
Sbjct: 498 LVGNKW 503
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 440 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 499
Query: 62 GNKW 65
GNKW
Sbjct: 500 GNKW 503
>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
sapiens]
gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
troglodytes]
gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I variant [Homo
sapiens]
gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
sapiens]
gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
sapiens]
gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
troglodytes]
gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
troglodytes]
gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
sapiens]
gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
sapiens]
gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
[Rattus norvegicus]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
Length = 525
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LTDVE GGAT+FP++ VFP++GS V WYN N +D + H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKW 503
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+FP++ VFP++GS V W+N N +D H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKW 503
>gi|344274276|ref|XP_003408943.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3
[Loxodonta africana]
Length = 516
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 429 GNRIATWLFY-MSDVSAGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 487
Query: 122 ALGNKW 127
+GNKW
Sbjct: 488 LVGNKW 493
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 430 NRIATWLFYMSDVSAGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 489
Query: 62 GNKW 65
GNKW
Sbjct: 490 GNKW 493
>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Loxodonta africana]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Loxodonta africana]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
Length = 212
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/69 (50%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+ LGN+ L+ L+DVE GGAT+FP + + VFP+KGSA+FWYN + D R H
Sbjct: 121 MGLGNRIATLMYY-LSDVEQGGATVFPQIGVGVFPKKGSAIFWYNLLPDGTGDERTLHGA 179
Query: 119 CPVALGNKW 127
CPV LG+KW
Sbjct: 180 CPVLLGSKW 188
Score = 66.2 bits (160), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 31/64 (48%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DVE GGAT+FP + + VFP+KGSA+FW+N + D H CPV L
Sbjct: 125 NRIATLMYYLSDVEQGGATVFPQIGVGVFPKKGSAIFWYNLLPDGTGDERTLHGACPVLL 184
Query: 62 GNKW 65
G+KW
Sbjct: 185 GSKW 188
>gi|297301157|ref|XP_001103971.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Macaca
mulatta]
Length = 512
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 40/55 (72%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 435 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 489
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 40/55 (72%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +GNKW
Sbjct: 435 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 489
>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 615
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 528 GNRIATWLFY-MSDVSAGGATVFPDVGAAVWPQKGTAVFWYNLFASGEGDYSTRHAACPV 586
Query: 122 ALGNKW 127
+GNKW
Sbjct: 587 LVGNKW 592
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 529 NRIATWLFYMSDVSAGGATVFPDVGAAVWPQKGTAVFWYNLFASGEGDYSTRHAACPVLV 588
Query: 62 GNKW 65
GNKW
Sbjct: 589 GNKW 592
>gi|198466403|ref|XP_002135183.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
gi|198150584|gb|EDY73810.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 47/69 (68%), Gaps = 1/69 (1%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+ LG++ G ++ +DV GGAT+FP +++FP KGS+VFWYN + + +D R HS
Sbjct: 441 LTLGDRLGSIIFYA-SDVPQGGATVFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSV 499
Query: 119 CPVALGNKW 127
CPV +G++W
Sbjct: 500 CPVIVGDRW 508
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 25/54 (46%), Positives = 38/54 (70%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+DV GGAT+FP +++FP KGS+VFW+N + + +D HS CPV +G++W
Sbjct: 455 SDVPQGGATVFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGDRW 508
>gi|195166681|ref|XP_002024163.1| GL22882 [Drosophila persimilis]
gi|194107518|gb|EDW29561.1| GL22882 [Drosophila persimilis]
Length = 534
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 47/69 (68%), Gaps = 1/69 (1%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+ LG++ G ++ +DV GGAT+FP +++FP KGS+VFWYN + + +D R HS
Sbjct: 441 LTLGDRLGSIIFYA-SDVPQGGATVFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSV 499
Query: 119 CPVALGNKW 127
CPV +G++W
Sbjct: 500 CPVIVGDRW 508
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 25/54 (46%), Positives = 38/54 (70%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+DV GGAT+FP +++FP KGS+VFW+N + + +D HS CPV +G++W
Sbjct: 455 SDVPQGGATVFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGDRW 508
>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
Length = 525
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LTDVE GGAT+FP++ VFP++GS V WYN N +D + H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKW 503
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+FP++ VFP++GS V W+N N +D H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKW 503
>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
melanogaster]
gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
Length = 525
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LTDVE GGAT+FP++ VFP++GS V WYN N +D + H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKW 503
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+FP++ VFP++GS V W+N N +D H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKW 503
>gi|195110923|ref|XP_002000029.1| GI22757 [Drosophila mojavensis]
gi|193916623|gb|EDW15490.1| GI22757 [Drosophila mojavensis]
Length = 535
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/55 (58%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++VE GGAT FP L L V PE+GS +FWYN H + DYR H+GCPV G+KW
Sbjct: 450 LSNVEAGGATAFPFLPLLVKPEQGSLLFWYNLHRSGDADYRTKHAGCPVLQGSKW 504
Score = 69.3 bits (168), Expect = 5e-10, Method: Composition-based stats.
Identities = 32/64 (50%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L++VE GGAT FP L L V PE+GS +FW+N H + DY H+GCPV
Sbjct: 441 NRIATGIFYLSNVEAGGATAFPFLPLLVKPEQGSLLFWYNLHRSGDADYRTKHAGCPVLQ 500
Query: 62 GNKW 65
G+KW
Sbjct: 501 GSKW 504
>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 526
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DVE GGAT+FP + V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 449 MSDVEAGGATVFPEVGAAVYPKKGTAVFWYNLLESGEGDYSTRHAACPVLVGNKW 503
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+FP + V+P+KG+AVFW+N + DY H+ CPV +
Sbjct: 440 NRVATWLFYMSDVEAGGATVFPEVGAAVYPKKGTAVFWYNLLESGEGDYSTRHAACPVLV 499
Query: 62 GNKW 65
GNKW
Sbjct: 500 GNKW 503
>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
domestica]
Length = 534
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 30/66 (45%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP ++P+KG++VFWYN + DYR H+ CPV
Sbjct: 447 GNRLATFL-NYMSDVEAGGATVFPDFGAAIWPKKGTSVFWYNLFRSGECDYRTRHAACPV 505
Query: 122 ALGNKW 127
+G+KW
Sbjct: 506 LVGSKW 511
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P+KG++VFW+N + DY H+ CPV +G+KW
Sbjct: 457 MSDVEAGGATVFPDFGAAIWPKKGTSVFWYNLFRSGECDYRTRHAACPVLVGSKW 511
>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
Length = 535
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 35/69 (50%), Positives = 43/69 (62%), Gaps = 3/69 (4%)
Query: 62 GNKWGKLLLSG---LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
G+ G + +G L+DVE GG T FP L L V PEKGS +FWYN H + DYR H+
Sbjct: 434 GDDRGNRIATGIYYLSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHESGDQDYRTKHAA 493
Query: 119 CPVALGNKW 127
CPV G+KW
Sbjct: 494 CPVLQGSKW 502
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 32/64 (50%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DVE GG T FP L L V PEKGS +FW+N H + DY H+ CPV
Sbjct: 439 NRIATGIYYLSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHESGDQDYRTKHAACPVLQ 498
Query: 62 GNKW 65
G+KW
Sbjct: 499 GSKW 502
>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
Length = 537
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 36/86 (41%), Positives = 49/86 (56%), Gaps = 1/86 (1%)
Query: 42 AHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFW 101
H D++ S P A GN+ + L++VE GGAT FP L+ V P+ G+ +FW
Sbjct: 420 GHYEPHWDFFRDPSHYPAAEGNRIATAIFY-LSEVEQGGATAFPFLDFAVKPQLGNVLFW 478
Query: 102 YNAHANTLLDYRMYHSGCPVALGNKW 127
YN H + DYR H+GCPV G+KW
Sbjct: 479 YNLHRSLDKDYRTKHAGCPVLKGSKW 504
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L++VE GGAT FP L+ V P+ G+ +FW+N H + DY H+GCPV
Sbjct: 441 NRIATAIFYLSEVEQGGATAFPFLDFAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 500
Query: 62 GNKW 65
G+KW
Sbjct: 501 GSKW 504
>gi|390459659|ref|XP_002806656.2| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-2 [Callithrix jacchus]
Length = 579
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 498 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGXGDYRTRHAACPVLVGCKW 556
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 502 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGXGDYRTRHAACPVLVGCKW 556
>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
Length = 537
Score = 73.6 bits (179), Expect = 3e-11, Method: Composition-based stats.
Identities = 33/70 (47%), Positives = 44/70 (62%), Gaps = 1/70 (1%)
Query: 58 PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
P GN+ + L+DVE GGAT FP L+ V P+ G+ +FWYN H + +DYR H+
Sbjct: 436 PAEEGNRIATAIFY-LSDVEQGGATAFPFLDFAVKPQLGNVLFWYNLHRSLDMDYRTKHA 494
Query: 118 GCPVALGNKW 127
GCPV G+KW
Sbjct: 495 GCPVLKGSKW 504
Score = 69.3 bits (168), Expect = 5e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+DVE GGAT FP L+ V P+ G+ +FW+N H + +DY H+GCPV
Sbjct: 441 NRIATAIFYLSDVEQGGATAFPFLDFAVKPQLGNVLFWYNLHRSLDMDYRTKHAGCPVLK 500
Query: 62 GNKW 65
G+KW
Sbjct: 501 GSKW 504
>gi|426365135|ref|XP_004049642.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Gorilla gorilla
gorilla]
Length = 500
Score = 73.6 bits (179), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 413 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 471
Query: 122 ALGNKW 127
+GNKW
Sbjct: 472 LVGNKW 477
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 414 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 473
Query: 62 GNKW 65
GNKW
Sbjct: 474 GNKW 477
>gi|443712762|gb|ELU05926.1| hypothetical protein CAPTEDRAFT_153364 [Capitella teleta]
Length = 491
Score = 73.6 bits (179), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/55 (58%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ V LGGAT+FP LN+ V P K A FWYNA N D R H+GCPV LG KW
Sbjct: 411 LSRVHLGGATVFPKLNVRVPPVKNGAAFWYNARPNGEHDKRTLHAGCPVVLGEKW 465
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+ V LGGAT+FP LN+ V P K A FW+NA N D H+GCPV LG KW
Sbjct: 411 LSRVHLGGATVFPKLNVRVPPVKNGAAFWYNARPNGEHDKRTLHAGCPVVLGEKW 465
>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
Length = 541
Score = 73.6 bits (179), Expect = 3e-11, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DVE GGAT+FP +NL ++P+KGSA FW+N H N D H+ CPV G+KW
Sbjct: 462 MSDVEAGGATVFPQINLALWPQKGSAAFWFNLHPNGEGDDLTQHAACPVLTGSKW 516
Score = 72.4 bits (176), Expect = 5e-11, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+FP +NL ++P+KGSA FW N H N D H+ CPV
Sbjct: 453 NRIATWMFYMSDVEAGGATVFPQINLALWPQKGSAAFWFNLHPNGEGDDLTQHAACPVLT 512
Query: 62 GNKW 65
G+KW
Sbjct: 513 GSKW 516
>gi|194751833|ref|XP_001958228.1| GF10815 [Drosophila ananassae]
gi|190625510|gb|EDV41034.1| GF10815 [Drosophila ananassae]
Length = 273
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/63 (53%), Positives = 42/63 (66%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
LL L+DVE GG T+FP LNL V +KGS + WYN +N D R+ H+ CPV +GNKW
Sbjct: 208 LLKNLSDVEQGGDTVFPHLNLKVPAQKGSLMVWYNLLSNGTTDSRVLHASCPVLMGNKWS 267
Query: 129 KLL 131
K L
Sbjct: 268 KYL 270
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/70 (47%), Positives = 46/70 (65%), Gaps = 1/70 (1%)
Query: 1 MNRAEMSSVE-LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPV 59
+N E+S ++ L+DVE GG T+FP LNL V +KGS + W+N +N D + H+ CPV
Sbjct: 201 INEKEISLLKNLSDVEQGGDTVFPHLNLKVPAQKGSLMVWYNLLSNGTTDSRVLHASCPV 260
Query: 60 ALGNKWGKLL 69
+GNKW K L
Sbjct: 261 LMGNKWSKYL 270
>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
Length = 561
Score = 73.6 bits (179), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|312383453|gb|EFR28539.1| hypothetical protein AND_03427 [Anopheles darlingi]
Length = 341
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 32/60 (53%), Positives = 41/60 (68%)
Query: 6 MSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+S +++DV GGAT+FPSLNL + P KG+A FWHN HA+ DY H+ CPV G KW
Sbjct: 254 LSVGQMSDVTQGGATVFPSLNLALRPRKGTAAFWHNLHASGNGDYATRHAACPVLTGTKW 313
Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/60 (51%), Positives = 40/60 (66%)
Query: 68 LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L + ++DV GGAT+FPSLNL + P KG+A FW+N HA+ DY H+ CPV G KW
Sbjct: 254 LSVGQMSDVTQGGATVFPSLNLALRPRKGTAAFWHNLHASGNGDYATRHAACPVLTGTKW 313
>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
Length = 549
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/67 (47%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ G +L L+DVE GGAT FP+L + P+K +A FWYN HA+ + D R H CP
Sbjct: 445 MGNRIGTVLFY-LSDVEQGGATAFPALKQLLRPKKHAAAFWYNLHASGVGDARTMHGACP 503
Query: 121 VALGNKW 127
+ +G+KW
Sbjct: 504 IIVGSKW 510
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DVE GGAT FP+L + P+K +A FW+N HA+ + D H CP+ +
Sbjct: 447 NRIGTVLFYLSDVEQGGATAFPALKQLLRPKKHAAAFWYNLHASGVGDARTMHGACPIIV 506
Query: 62 GNKW 65
G+KW
Sbjct: 507 GSKW 510
>gi|195499025|ref|XP_002096772.1| GE25857 [Drosophila yakuba]
gi|194182873|gb|EDW96484.1| GE25857 [Drosophila yakuba]
Length = 490
Score = 73.2 bits (178), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 37/79 (46%), Positives = 48/79 (60%), Gaps = 1/79 (1%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY Y + G++ LL L +V+ GGATIFP +N+TV P+KGSAVFWYN H +
Sbjct: 384 DYHKYTNTTRAKYGDRIATLLFY-LGEVDSGGATIFPRINITVTPKKGSAVFWYNLHNSG 442
Query: 109 LLDYRMYHSGCPVALGNKW 127
L HS CPV G+K+
Sbjct: 443 ALHLETLHSACPVISGSKY 461
Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 30/55 (54%), Positives = 39/55 (70%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +V+ GGATIFP +N+TV P+KGSAVFW+N H + L HS CPV G+K+
Sbjct: 407 LGEVDSGGATIFPRINITVTPKKGSAVFWYNLHNSGALHLETLHSACPVISGSKY 461
>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 575
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 488 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 546
Query: 122 ALGNKW 127
+G KW
Sbjct: 547 LVGCKW 552
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 498 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 552
>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Loxodonta africana]
Length = 534
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 447 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 505
Query: 122 ALGNKW 127
+G KW
Sbjct: 506 LVGCKW 511
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 457 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 511
>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
Length = 504
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 417 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 475
Query: 122 ALGNKW 127
+G KW
Sbjct: 476 LVGCKW 481
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 427 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 481
>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
Length = 538
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 29/59 (49%), Positives = 40/59 (67%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 457 FLNYMSDVEAGGATVFPDFGAAIYPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 515
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 461 MSDVEAGGATVFPDFGAAIYPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 515
>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
laevis]
gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
Length = 533
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 29/58 (50%), Positives = 40/58 (68%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP ++P KG+AVFWYN + DYR H+ CPV +G+KW
Sbjct: 453 LNYMSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 510
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P KG+AVFW+N + DY H+ CPV +G+KW
Sbjct: 456 MSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 510
>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
Length = 536
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 45/66 (68%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DVE GGAT+FPS+ ++++P+KGSA FWYN H + D H+ CPV
Sbjct: 444 GNRIATVLFY-MSDVEQGGATVFPSIQVSLWPQKGSAAFWYNLHPSGDGDKMTRHAACPV 502
Query: 122 ALGNKW 127
G+KW
Sbjct: 503 LTGSKW 508
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+FPS+ ++++P+KGSA FW+N H + D H+ CPV
Sbjct: 445 NRIATVLFYMSDVEQGGATVFPSIQVSLWPQKGSAAFWYNLHPSGDGDKMTRHAACPVLT 504
Query: 62 GNKW 65
G+KW
Sbjct: 505 GSKW 508
>gi|432109537|gb|ELK33711.1| Prolyl 4-hydroxylase subunit alpha-2 [Myotis davidii]
Length = 555
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 474 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532
>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
catus]
Length = 535
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 577
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 496 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 554
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 500 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 554
>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Loxodonta africana]
Length = 536
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 455 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 513
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 459 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 513
>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_c [Rattus norvegicus]
Length = 506
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 425 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 483
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 429 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 483
>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
musculus]
Length = 506
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 425 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 483
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 429 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 483
>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
catus]
gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
catus]
Length = 533
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
boliviensis boliviensis]
gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
boliviensis boliviensis]
Length = 533
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
gorilla]
Length = 565
Score = 73.2 bits (178), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 478 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 536
Query: 122 ALGNKW 127
+G KW
Sbjct: 537 LVGCKW 542
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 488 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 542
>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_c
[Homo sapiens]
Length = 565
Score = 73.2 bits (178), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 478 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 536
Query: 122 ALGNKW 127
+G KW
Sbjct: 537 LVGCKW 542
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 488 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 542
>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
Length = 533
Score = 73.2 bits (178), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
garnettii]
Length = 540
Score = 73.2 bits (178), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 459 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 517
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 463 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 517
>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
boliviensis boliviensis]
gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
boliviensis boliviensis]
Length = 535
Score = 73.2 bits (178), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
garnettii]
Length = 538
Score = 73.2 bits (178), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 457 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 515
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 461 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 515
>gi|197215651|gb|ACH53042.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Otolemur garnettii]
Length = 555
Score = 73.2 bits (178), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 474 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532
>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_f
[Homo sapiens]
Length = 567
Score = 73.2 bits (178), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 486 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 544
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 490 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 544
>gi|167045848|gb|ABZ10515.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Callithrix jacchus]
Length = 555
Score = 73.2 bits (178), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 474 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532
>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
mulatta]
Length = 535
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
troglodytes]
gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
troglodytes]
gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
paniscus]
gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
paniscus]
gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 535
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
Length = 534
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 33/73 (45%), Positives = 47/73 (64%), Gaps = 1/73 (1%)
Query: 55 SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
SG + LG++ +L LTDVE GGAT+F + +V+P+ G+A+FWYN N D R
Sbjct: 435 SGYSMDLGDRIATVLFY-LTDVEQGGATVFADVGYSVYPQAGTAIFWYNLDTNGKGDPRT 493
Query: 115 YHSGCPVALGNKW 127
H+ CPV +G+KW
Sbjct: 494 RHAACPVIVGSKW 506
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+F + +V+P+ G+A+FW+N N D H+ CPV +G+KW
Sbjct: 452 LTDVEQGGATVFADVGYSVYPQAGTAIFWYNLDTNGKGDPRTRHAACPVIVGSKW 506
>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
Length = 535
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
Length = 535
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
leucogenys]
Length = 537
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 456 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 460 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514
>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
leucogenys]
gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
leucogenys]
Length = 535
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 448 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 506
Query: 122 ALGNKW 127
+G KW
Sbjct: 507 LVGCKW 512
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
abelii]
gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 533
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
abelii]
Length = 535
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|170649696|gb|ACB21278.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Callicebus moloch]
Length = 555
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 474 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532
>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
troglodytes]
gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
troglodytes]
gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
troglodytes]
gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
paniscus]
gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
paniscus]
gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
paniscus]
gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 533
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
Length = 533
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
Length = 533
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
Length = 595
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + V+P+KGSAVFWYN + DY H+ CPV
Sbjct: 508 GNRIATWLFY-MSDVAAGGATVFPDVGAAVWPQKGSAVFWYNLFTSGEGDYSTRHAACPV 566
Query: 122 ALGNKW 127
+GNKW
Sbjct: 567 LVGNKW 572
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + V+P+KGSAVFW+N + DY H+ CPV +
Sbjct: 509 NRIATWLFYMSDVAAGGATVFPDVGAAVWPQKGSAVFWYNLFTSGEGDYSTRHAACPVLV 568
Query: 62 GNKW 65
GNKW
Sbjct: 569 GNKW 572
>gi|426255748|ref|XP_004021510.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Ovis
aries]
Length = 516
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 429 GNRIATWLFY-MSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 487
Query: 122 ALGNKW 127
+GNKW
Sbjct: 488 LVGNKW 493
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 430 NRIATWLFYMSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 489
Query: 62 GNKW 65
GNKW
Sbjct: 490 GNKW 493
>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
aries]
Length = 534
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
aries]
Length = 534
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
Length = 545
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 43/66 (65%), Gaps = 3/66 (4%)
Query: 65 WGKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
WG + + L ++VE GGAT+FP LNL ++P+KGSA FWYN H N + H+ CPV
Sbjct: 455 WGNRVATWLFYMSEVEAGGATVFPKLNLALWPQKGSAAFWYNLHPNGEGNELTRHAACPV 514
Query: 122 ALGNKW 127
G+KW
Sbjct: 515 LTGSKW 520
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR +++VE GGAT+FP LNL ++P+KGSA FW+N H N + H+ CPV
Sbjct: 457 NRVATWLFYMSEVEAGGATVFPKLNLALWPQKGSAAFWYNLHPNGEGNELTRHAACPVLT 516
Query: 62 GNKW 65
G+KW
Sbjct: 517 GSKW 520
>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
Length = 534
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
Length = 534
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 447 GNRIATWLFY-MSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 448 NRIATWLFYMSDVLAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 535
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
musculus]
gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
musculus]
Length = 537
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 456 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 460 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514
>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
Length = 537
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 456 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 460 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514
>gi|226874889|ref|NP_001152881.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Bos
taurus]
gi|296485624|tpg|DAA27739.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Bos taurus]
Length = 535
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|426229219|ref|XP_004008688.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Ovis aries]
Length = 535
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
Length = 316
Score = 72.8 bits (177), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LTDVE GGAT+FP++ VFP++GS V WYN N +D + H+ CPV +G+KW
Sbjct: 240 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKW 294
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+FP++ VFP++GS V W+N N +D H+ CPV +G+KW
Sbjct: 240 LTDVEQGGATVFPNIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKW 294
>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
Length = 545
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DV GGAT+FPS+ + + P+KG+A FWYN HA+ DY H+ CPV
Sbjct: 454 GNRIATVLFY-MSDVSQGGATVFPSIRVALRPKKGTAAFWYNLHASGHGDYATRHAACPV 512
Query: 122 ALGNKW 127
G KW
Sbjct: 513 LTGTKW 518
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FPS+ + + P+KG+A FW+N HA+ DY H+ CPV
Sbjct: 455 NRIATVLFYMSDVSQGGATVFPSIRVALRPKKGTAAFWYNLHASGHGDYATRHAACPVLT 514
Query: 62 GNKW 65
G KW
Sbjct: 515 GTKW 518
>gi|66772331|gb|AAY55477.1| IP03959p [Drosophila melanogaster]
gi|66772361|gb|AAY55492.1| IP03859p [Drosophila melanogaster]
Length = 541
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 32/55 (58%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV LGGAT FP L L V P KGS + WYN H++T D+R H+GCPV G+KW
Sbjct: 455 LNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 509
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 32/64 (50%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DV LGGAT FP L L V P KGS + W+N H++T D+ H+GCPV
Sbjct: 446 NRLLTALFYLNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQ 505
Query: 62 GNKW 65
G+KW
Sbjct: 506 GSKW 509
>gi|431892682|gb|ELK03115.1| Prolyl 4-hydroxylase subunit alpha-2 [Pteropus alecto]
Length = 629
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 480 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 538
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 484 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 538
>gi|195505207|ref|XP_002099404.1| GE23380 [Drosophila yakuba]
gi|194185505|gb|EDW99116.1| GE23380 [Drosophila yakuba]
Length = 540
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 35/66 (53%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L L DV LGGAT FP L L V P KGS + WYN H++T D+R H+GCPV
Sbjct: 444 GNRLATALFY-LNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPV 502
Query: 122 ALGNKW 127
G+KW
Sbjct: 503 LQGSKW 508
Score = 68.6 bits (166), Expect = 8e-10, Method: Composition-based stats.
Identities = 32/64 (50%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DV LGGAT FP L L V P KGS + W+N H++T D+ H+GCPV
Sbjct: 445 NRLATALFYLNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQ 504
Query: 62 GNKW 65
G+KW
Sbjct: 505 GSKW 508
>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
musculus]
gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
Length = 535
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 448 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 506
Query: 122 ALGNKW 127
+G KW
Sbjct: 507 LVGCKW 512
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
leucogenys]
Length = 558
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 477 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 535
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 481 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 535
>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
leucogenys]
Length = 556
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 469 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 527
Query: 122 ALGNKW 127
+G KW
Sbjct: 528 LVGCKW 533
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 479 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 533
>gi|301754231|ref|XP_002912939.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Ailuropoda
melanoleuca]
Length = 535
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
kowalevskii]
Length = 533
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DVE GGAT+FP + + PEKGSA FWYN N DY H+ CPV
Sbjct: 445 GNRIATFLFY-MSDVEAGGATVFPQVGARLIPEKGSAAFWYNLLKNGEGDYSTRHAACPV 503
Query: 122 ALGNKW 127
+G+KW
Sbjct: 504 LVGSKW 509
Score = 69.3 bits (168), Expect = 5e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+FP + + PEKGSA FW+N N DY H+ CPV +
Sbjct: 446 NRIATFLFYMSDVEAGGATVFPQVGARLIPEKGSAAFWYNLLKNGEGDYSTRHAACPVLV 505
Query: 62 GNKW 65
G+KW
Sbjct: 506 GSKW 509
>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
musculus]
Length = 593
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 506 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 564
Query: 122 ALGNKW 127
+G KW
Sbjct: 565 LVGCKW 570
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 516 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 570
>gi|24651424|ref|NP_733376.1| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
gi|23172697|gb|AAF57059.2| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
gi|66772443|gb|AAY55533.1| IP03659p [Drosophila melanogaster]
gi|220951214|gb|ACL88150.1| PH4alphaSG1-PA [synthetic construct]
gi|220959938|gb|ACL92512.1| PH4alphaSG1-PA [synthetic construct]
Length = 540
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/55 (58%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV LGGAT FP L L V P KGS + WYN H++T D+R H+GCPV G+KW
Sbjct: 454 LNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 508
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 32/64 (50%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DV LGGAT FP L L V P KGS + W+N H++T D+ H+GCPV
Sbjct: 445 NRLLTALFYLNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQ 504
Query: 62 GNKW 65
G+KW
Sbjct: 505 GSKW 508
>gi|184185444|gb|ACC68850.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Rhinolophus ferrumequinum]
Length = 555
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 474 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532
>gi|20269816|gb|AAM18063.1|AF495541_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG1
[Drosophila melanogaster]
Length = 540
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/55 (58%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV LGGAT FP L L V P KGS + WYN H++T D+R H+GCPV G+KW
Sbjct: 454 LNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 508
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 32/64 (50%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DV LGGAT FP L L V P KGS + W+N H++T D+ H+GCPV
Sbjct: 445 NRLLTALFYLNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQ 504
Query: 62 GNKW 65
G+KW
Sbjct: 505 GSKW 508
>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 555
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 463 GNRIATWLFY-MSDVSAGGATVFPDVGASVGPQKGTAVFWYNLFASGEGDYSTRHAACPV 521
Query: 122 ALGNKW 127
+GNKW
Sbjct: 522 LVGNKW 527
Score = 69.7 bits (169), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V P+KG+AVFW+N A+ DY H+ CPV +
Sbjct: 464 NRIATWLFYMSDVSAGGATVFPDVGASVGPQKGTAVFWYNLFASGEGDYSTRHAACPVLV 523
Query: 62 GNKW 65
GNKW
Sbjct: 524 GNKW 527
>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
Length = 535
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 448 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 506
Query: 122 ALGNKW 127
+G KW
Sbjct: 507 LVGCKW 512
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Cricetulus griseus]
Length = 535
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|281183175|ref|NP_001162504.1| prolyl 4-hydroxylase subunit alpha-2 [Papio anubis]
gi|159461520|gb|ABW96795.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase, alpha
polypeptide II, isoform 1 (predicted) [Papio anubis]
Length = 578
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 497 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 555
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 501 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 555
>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
taurus]
gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
Length = 533
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|229368743|gb|ACQ63024.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Dasypus novemcinctus]
Length = 556
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 475 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 533
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 479 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 533
>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
Length = 532
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Cricetulus griseus]
Length = 533
Score = 72.4 bits (176), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_d
[Homo sapiens]
Length = 488
Score = 72.4 bits (176), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 401 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 459
Query: 122 ALGNKW 127
+G KW
Sbjct: 460 LVGCKW 465
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 411 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 465
>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
lupus familiaris]
Length = 533
Score = 72.4 bits (176), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
Length = 525
Score = 72.4 bits (176), Expect = 5e-11, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LTDVE GGAT+FP++ VFP++GS + WYN N D + H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVIMWYNLQDNGQTDNKTLHAACPVIVGSKW 503
Score = 68.6 bits (166), Expect = 9e-10, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+FP++ VFP++GS + W+N N D H+ CPV +G+KW
Sbjct: 449 LTDVEQGGATVFPNIRKAVFPQRGSVIMWYNLQDNGQTDNKTLHAACPVIVGSKW 503
>gi|195341548|ref|XP_002037368.1| GM12149 [Drosophila sechellia]
gi|194131484|gb|EDW53527.1| GM12149 [Drosophila sechellia]
Length = 537
Score = 72.4 bits (176), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/55 (58%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV LGGAT FP L L V P KGS + WYN H++T D+R H+GCPV G+KW
Sbjct: 451 LNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 505
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 32/64 (50%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DV LGGAT FP L L V P KGS + W+N H++T D+ H+GCPV
Sbjct: 442 NRLLTALFYLNDVALGGATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQ 501
Query: 62 GNKW 65
G+KW
Sbjct: 502 GSKW 505
>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
occidentalis]
Length = 525
Score = 72.4 bits (176), Expect = 5e-11, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DV+ GGAT+FP L LTV+PEKGSA FW+N H + D H+ CPV G+KW
Sbjct: 448 MSDVKAGGATVFPRLGLTVWPEKGSAAFWWNLHRSGEGDILTRHAACPVLAGSKW 502
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DV+ GGAT+FP L LTV+PEKGSA FW N H + D H+ CPV G+KW
Sbjct: 448 MSDVKAGGATVFPRLGLTVWPEKGSAAFWWNLHRSGEGDILTRHAACPVLAGSKW 502
>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
Length = 533
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 510
>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
Length = 242
Score = 72.4 bits (176), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 32/55 (58%), Positives = 40/55 (72%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++VE GG T FP L L V PE+GS +FWYN H + LDYR H+GCPV +G+KW
Sbjct: 155 LSNVEAGGGTAFPFLPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLMGSKW 209
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 32/64 (50%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L++VE GG T FP L L V PE+GS +FW+N H + LDY H+GCPV +
Sbjct: 146 NRVATGIYYLSNVEAGGGTAFPFLPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLM 205
Query: 62 GNKW 65
G+KW
Sbjct: 206 GSKW 209
>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 3 [Oryctolagus
cuniculus]
Length = 535
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
Length = 550
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 32/69 (46%), Positives = 44/69 (63%), Gaps = 1/69 (1%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+ LGN+ +L ++DVE GGAT+F SL+ +FP+KG+A FW N H + D R H+
Sbjct: 455 INLGNRIATVLFY-MSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAA 513
Query: 119 CPVALGNKW 127
CPV G KW
Sbjct: 514 CPVLTGTKW 522
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+F SL+ +FP+KG+A FW N H + D H+ CPV
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLT 518
Query: 62 GNKW 65
G KW
Sbjct: 519 GTKW 522
>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
carolinensis]
Length = 554
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 29/59 (49%), Positives = 40/59 (67%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 473 FLNYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 531
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 477 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 531
>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 1 [Oryctolagus
cuniculus]
Length = 533
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
araneus]
Length = 533
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Ovis aries]
Length = 487
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 400 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 458
Query: 122 ALGNKW 127
+G KW
Sbjct: 459 LVGCKW 464
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 410 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 464
>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
Length = 487
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 400 GNRLATFL-NYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 458
Query: 122 ALGNKW 127
+G KW
Sbjct: 459 LVGCKW 464
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 410 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 464
>gi|291387302|ref|XP_002710242.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 2 [Oryctolagus
cuniculus]
gi|217273039|gb|ACK28132.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Oryctolagus cuniculus]
Length = 555
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 474 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 532
>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
Length = 534
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 447 GNRLATFL-NYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPV 505
Query: 122 ALGNKW 127
+G KW
Sbjct: 506 LVGCKW 511
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 457 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 511
>gi|351706369|gb|EHB09288.1| Prolyl 4-hydroxylase subunit alpha-2 [Heterocephalus glaber]
Length = 535
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
gallopavo]
Length = 535
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 448 GNRLATFL-NYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPV 506
Query: 122 ALGNKW 127
+G KW
Sbjct: 507 LVGCKW 512
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 512
>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
Length = 538
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 29/59 (49%), Positives = 40/59 (67%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 458 FLNYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 516
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 462 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 516
>gi|240974259|ref|XP_002401836.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215491070|gb|EEC00711.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 490
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 30/58 (51%), Positives = 40/58 (68%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DV GGAT+FP L LT++PEKG+A FWYN H + D H+ CPV G+KW
Sbjct: 410 LNYMSDVPAGGATVFPQLRLTLWPEKGAAAFWYNLHRSGEGDMLTRHAACPVLAGSKW 467
Score = 69.7 bits (169), Expect = 3e-10, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DV GGAT+FP L LT++PEKG+A FW+N H + D H+ CPV G+KW
Sbjct: 413 MSDVPAGGATVFPQLRLTLWPEKGAAAFWYNLHRSGEGDMLTRHAACPVLAGSKW 467
>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
niloticus]
Length = 536
Score = 72.0 bits (175), Expect = 7e-11, Method: Composition-based stats.
Identities = 28/59 (47%), Positives = 40/59 (67%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP ++P KG++VFWYN + DYR H+ CPV +G+KW
Sbjct: 455 FLNYMSDVEAGGATVFPDFGAAIWPRKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 513
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P KG++VFW+N + DY H+ CPV +G+KW
Sbjct: 459 MSDVEAGGATVFPDFGAAIWPRKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 513
>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
Length = 538
Score = 72.0 bits (175), Expect = 7e-11, Method: Composition-based stats.
Identities = 29/59 (49%), Positives = 40/59 (67%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 457 FLNYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 515
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 461 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 515
>gi|348557542|ref|XP_003464578.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Cavia porcellus]
Length = 535
Score = 72.0 bits (175), Expect = 7e-11, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 454 FLNYMSDVEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 458 MSDVEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Cavia porcellus]
Length = 533
Score = 72.0 bits (175), Expect = 7e-11, Method: Composition-based stats.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 446 GNRLATFL-NYMSDVEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPV 504
Query: 122 ALGNKW 127
+G KW
Sbjct: 505 LVGCKW 510
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 456 MSDVEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
guttata]
Length = 539
Score = 72.0 bits (175), Expect = 7e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L + ++DVE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV
Sbjct: 452 GNRLATFL-NYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPV 510
Query: 122 ALGNKW 127
+G KW
Sbjct: 511 LVGCKW 516
Score = 65.9 bits (159), Expect = 6e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 462 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 516
>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1
Length = 516
Score = 72.0 bits (175), Expect = 7e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV
Sbjct: 429 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 487
Query: 122 ALGNKW 127
+GNKW
Sbjct: 488 LVGNKW 493
Score = 68.9 bits (167), Expect = 5e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N + DY H+ CPV +
Sbjct: 430 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 489
Query: 62 GNKW 65
GNKW
Sbjct: 490 GNKW 493
>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
gallus]
Length = 489
Score = 72.0 bits (175), Expect = 7e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV
Sbjct: 402 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 460
Query: 122 ALGNKW 127
+GNKW
Sbjct: 461 LVGNKW 466
Score = 68.9 bits (167), Expect = 5e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N + DY H+ CPV +
Sbjct: 403 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 462
Query: 62 GNKW 65
GNKW
Sbjct: 463 GNKW 466
>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
[Drosophila melanogaster]
gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
Length = 550
Score = 72.0 bits (175), Expect = 8e-11, Method: Composition-based stats.
Identities = 32/67 (47%), Positives = 43/67 (64%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LGN+ +L ++DVE GGAT+F SL+ +FP+KG+A FW N H + D R H+ CP
Sbjct: 457 LGNRIATVLFY-MSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACP 515
Query: 121 VALGNKW 127
V G KW
Sbjct: 516 VLTGTKW 522
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+F SL+ +FP+KG+A FW N H + D H+ CPV
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLT 518
Query: 62 GNKW 65
G KW
Sbjct: 519 GTKW 522
>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
Length = 550
Score = 72.0 bits (175), Expect = 8e-11, Method: Composition-based stats.
Identities = 32/67 (47%), Positives = 43/67 (64%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LGN+ +L ++DVE GGAT+F SL+ +FP+KG+A FW N H + D R H+ CP
Sbjct: 457 LGNRIATVLFY-MSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACP 515
Query: 121 VALGNKW 127
V G KW
Sbjct: 516 VLTGTKW 522
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+F SL+ +FP+KG+A FW N H + D H+ CPV
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLT 518
Query: 62 GNKW 65
G KW
Sbjct: 519 GTKW 522
>gi|194905436|ref|XP_001981196.1| GG11753 [Drosophila erecta]
gi|190655834|gb|EDV53066.1| GG11753 [Drosophila erecta]
Length = 550
Score = 72.0 bits (175), Expect = 8e-11, Method: Composition-based stats.
Identities = 32/67 (47%), Positives = 43/67 (64%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LGN+ +L ++DVE GGAT+F SL+ +FP+KG+A FW N H + D R H+ CP
Sbjct: 457 LGNRIATVLFY-MSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACP 515
Query: 121 VALGNKW 127
V G KW
Sbjct: 516 VLTGTKW 522
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+F SL+ +FP+KG+A FW N H + D H+ CPV
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLT 518
Query: 62 GNKW 65
G KW
Sbjct: 519 GTKW 522
>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
vitripennis]
Length = 556
Score = 72.0 bits (175), Expect = 8e-11, Method: Composition-based stats.
Identities = 30/66 (45%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DVE GG T+F +N++++P+KGSA FWYN N DY+ H+ CPV
Sbjct: 461 GNRIATVLYY-MSDVEQGGGTVFTKINISLWPKKGSAAFWYNLKPNGEGDYKTRHAACPV 519
Query: 122 ALGNKW 127
G+KW
Sbjct: 520 LTGSKW 525
Score = 67.8 bits (164), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GG T+F +N++++P+KGSA FW+N N DY H+ CPV
Sbjct: 462 NRIATVLYYMSDVEQGGGTVFTKINISLWPKKGSAAFWYNLKPNGEGDYKTRHAACPVLT 521
Query: 62 GNKW 65
G+KW
Sbjct: 522 GSKW 525
>gi|326923465|ref|XP_003207956.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 3
[Meleagris gallopavo]
Length = 518
Score = 72.0 bits (175), Expect = 8e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV
Sbjct: 431 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 489
Query: 122 ALGNKW 127
+GNKW
Sbjct: 490 LVGNKW 495
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N + DY H+ CPV +
Sbjct: 432 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 491
Query: 62 GNKW 65
GNKW
Sbjct: 492 GNKW 495
>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
gallus]
Length = 536
Score = 72.0 bits (175), Expect = 8e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507
Query: 122 ALGNKW 127
+GNKW
Sbjct: 508 LVGNKW 513
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N + DY H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509
Query: 62 GNKW 65
GNKW
Sbjct: 510 GNKW 513
>gi|195452730|ref|XP_002073475.1| GK13125 [Drosophila willistoni]
gi|194169560|gb|EDW84461.1| GK13125 [Drosophila willistoni]
Length = 539
Score = 71.6 bits (174), Expect = 8e-11, Method: Composition-based stats.
Identities = 31/55 (56%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++VE GG T FP L L V PE+GS VFWYN H + DYR H+ CPV G+KW
Sbjct: 452 LSEVEAGGGTAFPFLPLLVTPERGSLVFWYNLHRSGDQDYRTKHAACPVLQGSKW 506
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L++VE GG T FP L L V PE+GS VFW+N H + DY H+ CPV
Sbjct: 443 NRIATGIYYLSEVEAGGGTAFPFLPLLVTPERGSLVFWYNLHRSGDQDYRTKHAACPVLQ 502
Query: 62 GNKW 65
G+KW
Sbjct: 503 GSKW 506
>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
gallus]
Length = 536
Score = 71.6 bits (174), Expect = 8e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507
Query: 122 ALGNKW 127
+GNKW
Sbjct: 508 LVGNKW 513
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N + DY H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509
Query: 62 GNKW 65
GNKW
Sbjct: 510 GNKW 513
>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
gallus]
Length = 536
Score = 71.6 bits (174), Expect = 8e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507
Query: 122 ALGNKW 127
+GNKW
Sbjct: 508 LVGNKW 513
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N + DY H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509
Query: 62 GNKW 65
GNKW
Sbjct: 510 GNKW 513
>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Meleagris gallopavo]
Length = 536
Score = 71.6 bits (174), Expect = 9e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507
Query: 122 ALGNKW 127
+GNKW
Sbjct: 508 LVGNKW 513
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N + DY H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509
Query: 62 GNKW 65
GNKW
Sbjct: 510 GNKW 513
>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
adhaerens]
Length = 495
Score = 71.6 bits (174), Expect = 9e-11, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 40/55 (72%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++VE+GGATIFP L + V P+KGSA+FW N N L + + H+ CPV +G+KW
Sbjct: 418 LSNVEIGGATIFPRLGVAVRPQKGSALFWINLKRNGLTNRQTLHAACPVVIGSKW 472
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L++VE+GGATIFP L + V P+KGSA+FW N N L + H+ CPV +G+KW
Sbjct: 418 LSNVEIGGATIFPRLGVAVRPQKGSALFWINLKRNGLTNRQTLHAACPVVIGSKW 472
>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
gallus]
Length = 536
Score = 71.6 bits (174), Expect = 9e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507
Query: 122 ALGNKW 127
+GNKW
Sbjct: 508 LVGNKW 513
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N + DY H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509
Query: 62 GNKW 65
GNKW
Sbjct: 510 GNKW 513
>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Meleagris gallopavo]
Length = 536
Score = 71.6 bits (174), Expect = 9e-11, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507
Query: 122 ALGNKW 127
+GNKW
Sbjct: 508 LVGNKW 513
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N + DY H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509
Query: 62 GNKW 65
GNKW
Sbjct: 510 GNKW 513
>gi|194765194|ref|XP_001964712.1| GF22904 [Drosophila ananassae]
gi|190614984|gb|EDV30508.1| GF22904 [Drosophila ananassae]
Length = 547
Score = 71.6 bits (174), Expect = 9e-11, Method: Composition-based stats.
Identities = 32/67 (47%), Positives = 43/67 (64%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LGN+ +L ++DVE GGAT+F SL+ +FP+KG+A FW N H + D R H+ CP
Sbjct: 454 LGNRIATVLFY-MSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGEGDVRTRHAACP 512
Query: 121 VALGNKW 127
V G KW
Sbjct: 513 VLTGTKW 519
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+F SL+ +FP+KG+A FW N H + D H+ CPV
Sbjct: 456 NRIATVLFYMSDVEQGGATVFTSLHTALFPKKGTAAFWMNLHRDGEGDVRTRHAACPVLT 515
Query: 62 GNKW 65
G KW
Sbjct: 516 GTKW 519
>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
Length = 534
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 36/90 (40%), Positives = 51/90 (56%), Gaps = 8/90 (8%)
Query: 38 FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
F + H +T Y M LG++ +L LTDVE GGAT+F + +V+P+ G+
Sbjct: 425 FASSNHTDTRSSYSM-------DLGDRIATVLFY-LTDVEQGGATVFADVGYSVYPQAGT 476
Query: 98 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
A+FWYN N D R H+ CPV +G+KW
Sbjct: 477 AIFWYNLDTNGKGDPRTKHAACPVIVGSKW 506
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+F + +V+P+ G+A+FW+N N D H+ CPV +G+KW
Sbjct: 452 LTDVEQGGATVFADVGYSVYPQAGTAIFWYNLDTNGKGDPRTKHAACPVIVGSKW 506
>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
Length = 536
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P KG+AVFWYN + DY H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPV 507
Query: 122 ALGNKW 127
+GNKW
Sbjct: 508 LVGNKW 513
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P KG+AVFW+N + DY H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509
Query: 62 GNKW 65
GNKW
Sbjct: 510 GNKW 513
>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
carolinensis]
Length = 542
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P KG+AVFWYN + DY H+ CPV
Sbjct: 455 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPV 513
Query: 122 ALGNKW 127
+GNKW
Sbjct: 514 LVGNKW 519
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P KG+AVFW+N + DY H+ CPV +
Sbjct: 456 NRIATWLFYMSDVSAGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLV 515
Query: 62 GNKW 65
GNKW
Sbjct: 516 GNKW 519
>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
guttata]
Length = 536
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P KG+AVFWYN + DY H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPV 507
Query: 122 ALGNKW 127
+GNKW
Sbjct: 508 LVGNKW 513
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P KG+AVFW+N + DY H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509
Query: 62 GNKW 65
GNKW
Sbjct: 510 GNKW 513
>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
Length = 535
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/55 (56%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DVE GG T FP L L V PEKGS +FWYN H + D+R H+ CPV G+KW
Sbjct: 448 LSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DVE GG T FP L L V PEKGS +FW+N H + D+ H+ CPV
Sbjct: 439 NRIATGIYYLSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQ 498
Query: 62 GNKW 65
G+KW
Sbjct: 499 GSKW 502
>gi|195505190|ref|XP_002099397.1| GE10881 [Drosophila yakuba]
gi|194185498|gb|EDW99109.1| GE10881 [Drosophila yakuba]
Length = 487
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 32/67 (47%), Positives = 42/67 (62%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LGN+ +L ++DVE GGAT+F SL+ +FP KG+A FW N H + D R H+ CP
Sbjct: 394 LGNRIATVLFY-MSDVEQGGATVFTSLHTALFPRKGTAAFWMNLHRDGQGDVRTRHAACP 452
Query: 121 VALGNKW 127
V G KW
Sbjct: 453 VLTGTKW 459
Score = 65.9 bits (159), Expect = 6e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+F SL+ +FP KG+A FW N H + D H+ CPV
Sbjct: 396 NRIATVLFYMSDVEQGGATVFTSLHTALFPRKGTAAFWMNLHRDGQGDVRTRHAACPVLT 455
Query: 62 GNKW 65
G KW
Sbjct: 456 GTKW 459
>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
Length = 543
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV
Sbjct: 456 GNRIATWLFY-MSDVAAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 514
Query: 122 ALGNKW 127
+GNKW
Sbjct: 515 LVGNKW 520
Score = 68.6 bits (166), Expect = 7e-10, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + +V+P+KG+AVFW+N + DY H+ CPV +
Sbjct: 457 NRIATWLFYMSDVAAGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 516
Query: 62 GNKW 65
GNKW
Sbjct: 517 GNKW 520
>gi|221512818|ref|NP_730346.2| CG32201 [Drosophila melanogaster]
gi|220902638|gb|AAN11679.2| CG32201 [Drosophila melanogaster]
Length = 520
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LG++ +L ++V GGAT+FP +N+TVFP+KGS ++W+N H + D R HS CP
Sbjct: 436 LGDRLASILFYA-SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDIRSLHSVCP 494
Query: 121 VALGNKW 127
V G++W
Sbjct: 495 VLNGDRW 501
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 37/54 (68%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++V GGAT+FP +N+TVFP+KGS ++W N H + D HS CPV G++W
Sbjct: 448 SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDIRSLHSVCPVLNGDRW 501
>gi|66770649|gb|AAY54636.1| IP12415p [Drosophila melanogaster]
gi|66772017|gb|AAY55320.1| IP12615p [Drosophila melanogaster]
Length = 512
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LG++ +L ++V GGAT+FP +N+TVFP+KGS ++W+N H + D R HS CP
Sbjct: 428 LGDRLASILFYA-SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDIRSLHSVCP 486
Query: 121 VALGNKW 127
V G++W
Sbjct: 487 VLNGDRW 493
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 37/54 (68%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++V GGAT+FP +N+TVFP+KGS ++W N H + D HS CPV G++W
Sbjct: 440 SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDIRSLHSVCPVLNGDRW 493
>gi|66771935|gb|AAY55279.1| IP12715p [Drosophila melanogaster]
Length = 451
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LG++ +L ++V GGAT+FP +N+TVFP+KGS ++W+N H + D R HS CP
Sbjct: 367 LGDRLASILFYA-SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDIRSLHSVCP 425
Query: 121 VALGNKW 127
V G++W
Sbjct: 426 VLNGDRW 432
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 37/54 (68%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++V GGAT+FP +N+TVFP+KGS ++W N H + D HS CPV G++W
Sbjct: 379 SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDIRSLHSVCPVLNGDRW 432
>gi|195330780|ref|XP_002032081.1| GM23710 [Drosophila sechellia]
gi|194121024|gb|EDW43067.1| GM23710 [Drosophila sechellia]
Length = 490
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 51/79 (64%), Gaps = 1/79 (1%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D+ Y + + G++ +L L +V+ GGATIFP LN+TV P+KGSAVFWYN H +
Sbjct: 384 DFHEYTNTTRLKRGDRIATVLFY-LGEVDSGGATIFPMLNITVTPKKGSAVFWYNLHNSG 442
Query: 109 LLDYRMYHSGCPVALGNKW 127
++ + HS CPV G+K+
Sbjct: 443 AVNSKTLHSACPVISGSKY 461
Score = 65.1 bits (157), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 30/55 (54%), Positives = 40/55 (72%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +V+ GGATIFP LN+TV P+KGSAVFW+N H + ++ HS CPV G+K+
Sbjct: 407 LGEVDSGGATIFPMLNITVTPKKGSAVFWYNLHNSGAVNSKTLHSACPVISGSKY 461
>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
Length = 525
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LTDVE GGAT+FP++ +FP+KG+AV WYN N D + H+ CPV +G+KW
Sbjct: 450 LTDVEQGGATVFPNIKQAIFPKKGTAVMWYNLRHNNDGDPQTLHAACPVIVGSKW 504
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+FP++ +FP+KG+AV W+N N D H+ CPV +G+KW
Sbjct: 450 LTDVEQGGATVFPNIKQAIFPKKGTAVMWYNLRHNNDGDPQTLHAACPVIVGSKW 504
>gi|345326417|ref|XP_001510155.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Ornithorhynchus anatinus]
Length = 888
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 29/59 (49%), Positives = 40/59 (67%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 807 FLNYMSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 865
Score = 65.5 bits (158), Expect = 8e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 811 MSDVEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 865
>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
Length = 535
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/55 (56%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DVE GG T FP L L V PEKGS +FWYN H + D+R H+ CPV G+KW
Sbjct: 448 LSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DVE GG T FP L L V PEKGS +FW+N H + D+ H+ CPV
Sbjct: 439 NRIATGIYYLSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQ 498
Query: 62 GNKW 65
G+KW
Sbjct: 499 GSKW 502
>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
Length = 535
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 31/55 (56%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DVE GG T FP L L V PEKGS +FWYN H + D+R H+ CPV G+KW
Sbjct: 448 LSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DVE GG T FP L L V PEKGS +FW+N H + D+ H+ CPV
Sbjct: 439 NRIATGIYYLSDVEAGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQ 498
Query: 62 GNKW 65
G+KW
Sbjct: 499 GSKW 502
>gi|195572621|ref|XP_002104294.1| GD18523 [Drosophila simulans]
gi|194200221|gb|EDX13797.1| GD18523 [Drosophila simulans]
Length = 490
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 51/79 (64%), Gaps = 1/79 (1%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D+ Y + + G++ +L L +V+ GGATIFP LN+TV P+KGSAVFWYN H +
Sbjct: 384 DFHEYTNTTRLKQGDRIATVLFY-LGEVDSGGATIFPMLNITVTPKKGSAVFWYNLHNSG 442
Query: 109 LLDYRMYHSGCPVALGNKW 127
++ + HS CPV G+K+
Sbjct: 443 AVNSKTLHSACPVISGSKY 461
Score = 65.1 bits (157), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 30/55 (54%), Positives = 40/55 (72%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +V+ GGATIFP LN+TV P+KGSAVFW+N H + ++ HS CPV G+K+
Sbjct: 407 LGEVDSGGATIFPMLNITVTPKKGSAVFWYNLHNSGAVNSKTLHSACPVISGSKY 461
>gi|195591302|ref|XP_002085381.1| GD14757 [Drosophila simulans]
gi|194197390|gb|EDX10966.1| GD14757 [Drosophila simulans]
Length = 525
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LG++ +L ++V GGAT+FP +N+TVFP+KGS ++W+N H + D R HS CP
Sbjct: 436 LGDRLASILFYA-SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSKHSVCP 494
Query: 121 VALGNKW 127
V G++W
Sbjct: 495 VINGDRW 501
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 37/54 (68%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++V GGAT+FP +N+TVFP+KGS ++W N H + D HS CPV G++W
Sbjct: 448 SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSKHSVCPVINGDRW 501
>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
Length = 528
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LTDVE GGAT+FP++ VFP++GS + WYN + +D + H+ CPV +G+KW
Sbjct: 452 LTDVEQGGATVFPNIRKAVFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGSKW 506
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+FP++ VFP++GS + W+N + +D H+ CPV +G+KW
Sbjct: 452 LTDVEQGGATVFPNIRKAVFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGSKW 506
>gi|195352182|ref|XP_002042593.1| GM14980 [Drosophila sechellia]
gi|194124477|gb|EDW46520.1| GM14980 [Drosophila sechellia]
Length = 520
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LG++ +L ++V GGAT+FP +N+TVFP+KGS ++W+N H + D R HS CP
Sbjct: 436 LGDRLASILFYA-SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSKHSVCP 494
Query: 121 VALGNKW 127
V G++W
Sbjct: 495 VINGDRW 501
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 37/54 (68%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++V GGAT+FP +N+TVFP+KGS ++W N H + D HS CPV G++W
Sbjct: 448 SEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSKHSVCPVINGDRW 501
>gi|78706702|ref|NP_001027154.1| CG18749 [Drosophila melanogaster]
gi|21429852|gb|AAM50604.1| GH05783p [Drosophila melanogaster]
gi|23175900|gb|AAN14309.1| CG18749 [Drosophila melanogaster]
gi|220956638|gb|ACL90862.1| CG18749-PB [synthetic construct]
Length = 491
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 51/79 (64%), Gaps = 1/79 (1%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D+ Y + + G++ +L L +V+ GGAT+FP LN+TV P+KGSAVFWYN H +
Sbjct: 385 DFHEYTNTTRLKQGDRIATVLFY-LREVDSGGATVFPMLNITVMPKKGSAVFWYNLHNSG 443
Query: 109 LLDYRMYHSGCPVALGNKW 127
++ + H+ CPV G+K+
Sbjct: 444 AVNSKTLHTACPVISGSKY 462
Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 40/55 (72%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +V+ GGAT+FP LN+TV P+KGSAVFW+N H + ++ H+ CPV G+K+
Sbjct: 408 LREVDSGGATVFPMLNITVMPKKGSAVFWYNLHNSGAVNSKTLHTACPVISGSKY 462
>gi|47210159|emb|CAF93191.1| unnamed protein product [Tetraodon nigroviridis]
Length = 78
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DVE GGAT+FP ++P KG+AVFWYN + DYR H+ CPV +GNKW
Sbjct: 1 MSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGNKW 55
Score = 65.9 bits (159), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP ++P KG+AVFW+N + DY H+ CPV +GNKW
Sbjct: 1 MSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGNKW 55
>gi|442747045|gb|JAA65682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 538
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 34/83 (40%), Positives = 51/83 (61%), Gaps = 6/83 (7%)
Query: 45 NTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 104
N+ DY++ + G++ L++ +TDVE GGAT+FPSL + + P+KG A FW+N
Sbjct: 433 NSSADYYLQY-----GTGDRLATLMIY-MTDVEEGGATVFPSLGIRLTPKKGDAAFWWNL 486
Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
A+ D H+GCPV G+KW
Sbjct: 487 KASGEGDRLTTHAGCPVLYGSKW 509
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/57 (50%), Positives = 38/57 (66%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ +TDVE GGAT+FPSL + + P+KG A FW N A+ D H+GCPV G+KW
Sbjct: 453 IYMTDVEEGGATVFPSLGIRLTPKKGDAAFWWNLKASGEGDRLTTHAGCPVLYGSKW 509
>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
Length = 528
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 3/66 (4%)
Query: 65 WGKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
WG + + L +DVE GGAT+FP++ ++P+KGSA FWYN N D H+GCPV
Sbjct: 440 WGNRIATWLFYMSDVEAGGATVFPAVGAALWPQKGSAAFWYNLRPNGNGDEDTLHAGCPV 499
Query: 122 ALGNKW 127
G+KW
Sbjct: 500 LTGSKW 505
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+FP++ ++P+KGSA FW+N N D H+GCPV
Sbjct: 442 NRIATWLFYMSDVEAGGATVFPAVGAALWPQKGSAAFWYNLRPNGNGDEDTLHAGCPVLT 501
Query: 62 GNKW 65
G+KW
Sbjct: 502 GSKW 505
>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 4/79 (5%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D++ H P +GN+ +L LTDVE GGAT+F V P +G+A+FWYN H +
Sbjct: 413 DWFTQH---PEVMGNRLATVLFY-LTDVEQGGATMFNKAEHKVLPRRGTALFWYNLHTDG 468
Query: 109 LLDYRMYHSGCPVALGNKW 127
D+ H+ CP+ +G+KW
Sbjct: 469 EGDWSTTHAACPIIVGSKW 487
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR LTDVE GGAT+F V P +G+A+FW+N H + D+ H+ CP+ +
Sbjct: 424 NRLATVLFYLTDVEQGGATMFNKAEHKVLPRRGTALFWYNLHTDGEGDWSTTHAACPIIV 483
Query: 62 GNKW 65
G+KW
Sbjct: 484 GSKW 487
>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
Length = 535
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 4/79 (5%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D++ H P +GN+ +L LTDVE GGAT+F V P +G+A+FWYN H +
Sbjct: 438 DWFTQH---PEVMGNRLATVLFY-LTDVEQGGATMFNKAEHKVLPRRGTALFWYNLHTDG 493
Query: 109 LLDYRMYHSGCPVALGNKW 127
D+ H+ CP+ +G+KW
Sbjct: 494 EGDWSTTHAACPIIVGSKW 512
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR LTDVE GGAT+F V P +G+A+FW+N H + D+ H+ CP+ +
Sbjct: 449 NRLATVLFYLTDVEQGGATMFNKAEHKVLPRRGTALFWYNLHTDGEGDWSTTHAACPIIV 508
Query: 62 GNKW 65
G+KW
Sbjct: 509 GSKW 512
>gi|194760358|ref|XP_001962408.1| GF14452 [Drosophila ananassae]
gi|190616105|gb|EDV31629.1| GF14452 [Drosophila ananassae]
Length = 498
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 28/54 (51%), Positives = 41/54 (75%)
Query: 74 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+DVE+GGATIFP+ N+++ P+KGSA+FWYN H + + H+ CP+ LGN+W
Sbjct: 426 SDVEMGGATIFPAANISIKPKKGSALFWYNLHNDWEPNPLSRHAVCPMVLGNRW 479
Score = 68.9 bits (167), Expect = 5e-10, Method: Composition-based stats.
Identities = 27/54 (50%), Positives = 41/54 (75%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+DVE+GGATIFP+ N+++ P+KGSA+FW+N H + + H+ CP+ LGN+W
Sbjct: 426 SDVEMGGATIFPAANISIKPKKGSALFWYNLHNDWEPNPLSRHAVCPMVLGNRW 479
>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
impatiens]
Length = 557
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 45/66 (68%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DVE GG T+F ++N++++P+KGSA FWYN N D++ H+ CPV
Sbjct: 462 GNRIATVLYY-MSDVEQGGGTVFTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPV 520
Query: 122 ALGNKW 127
G+KW
Sbjct: 521 LTGSKW 526
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GG T+F ++N++++P+KGSA FW+N N D+ H+ CPV
Sbjct: 463 NRIATVLYYMSDVEQGGGTVFTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLT 522
Query: 62 GNKW 65
G+KW
Sbjct: 523 GSKW 526
>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
terrestris]
Length = 557
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 45/66 (68%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DVE GG T+F ++N++++P+KGSA FWYN N D++ H+ CPV
Sbjct: 462 GNRIATVLYY-MSDVEQGGGTVFTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPV 520
Query: 122 ALGNKW 127
G+KW
Sbjct: 521 LTGSKW 526
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GG T+F ++N++++P+KGSA FW+N N D+ H+ CPV
Sbjct: 463 NRIATVLYYMSDVEQGGGTVFTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLT 522
Query: 62 GNKW 65
G+KW
Sbjct: 523 GSKW 526
>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
Length = 513
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DVE GGAT+FP + VFP++G+A+ WYN N D H+ CPV +G+KW
Sbjct: 439 LSDVEQGGATVFPKIQKAVFPQRGTAIIWYNLKENGDFDTNTIHAACPVIVGSKW 493
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DVE GGAT+FP + VFP++G+A+ W+N N D H+ CPV +G+KW
Sbjct: 439 LSDVEQGGATVFPKIQKAVFPQRGTAIIWYNLKENGDFDTNTIHAACPVIVGSKW 493
>gi|195494563|ref|XP_002094891.1| GE19960 [Drosophila yakuba]
gi|194180992|gb|EDW94603.1| GE19960 [Drosophila yakuba]
Length = 152
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 42/62 (67%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L+S +V GGAT+FP +N+TVFP+KGS ++W+N H + D R H CPV G++W
Sbjct: 54 LISKAGEVRQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSQHLVCPVVNGDRWS 113
Query: 129 KL 130
+L
Sbjct: 114 RL 115
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 37/56 (66%)
Query: 13 DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKL 68
+V GGAT+FP +N+TVFP+KGS ++W N H + D H CPV G++W +L
Sbjct: 60 EVRQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSQHLVCPVVNGDRWSRL 115
>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
Length = 511
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 41/125 (32%), Positives = 63/125 (50%), Gaps = 16/125 (12%)
Query: 3 RAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG 62
R E ++ D+ L G+ F +N + + + W A+T+ D
Sbjct: 378 RIERRIADMVDLNLEGSEPFYVINYGIGGQYKA--HWDFFGADTVED------------- 422
Query: 63 NKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 122
N+ +L + DVE GGAT+FP L TV ++G+A+FWYN N +D R H GCP+
Sbjct: 423 NRLATVLFY-MNDVEQGGATVFPRLGQTVRAKRGNALFWYNMQHNGTVDDRTLHGGCPIL 481
Query: 123 LGNKW 127
+G+KW
Sbjct: 482 VGSKW 486
Score = 65.9 bits (159), Expect = 6e-09, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + DVE GGAT+FP L TV ++G+A+FW+N N +D H GCP+ +
Sbjct: 423 NRLATVLFYMNDVEQGGATVFPRLGQTVRAKRGNALFWYNMQHNGTVDDRTLHGGCPILV 482
Query: 62 GNKW 65
G+KW
Sbjct: 483 GSKW 486
>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
Length = 539
Score = 70.9 bits (172), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++VE GG T FP L L V PE+GS +FWYN H + DYR H+ CPV G+KW
Sbjct: 452 LSNVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDADYRTKHAACPVLQGSKW 506
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L++VE GG T FP L L V PE+GS +FW+N H + DY H+ CPV
Sbjct: 443 NRLATGIYYLSNVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDADYRTKHAACPVLQ 502
Query: 62 GNKW 65
G+KW
Sbjct: 503 GSKW 506
>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 31/67 (46%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G G +L ++D++ GGATIFPS+N+TV P KGSA+ W+N + + D R HS CP
Sbjct: 439 MGGPLGTILFY-VSDLQQGGATIFPSINITVSPRKGSALLWFNLYDDGEPDPRTLHSSCP 497
Query: 121 VALGNKW 127
V G++W
Sbjct: 498 VIEGDRW 504
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++D++ GGATIFPS+N+TV P KGSA+ W N + + D HS CPV G++W
Sbjct: 450 VSDLQQGGATIFPSINITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGDRW 504
>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
Length = 509
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DV GGATIFP L L+VFP+KGSA+ WYN D R HS CP +G++W
Sbjct: 434 LSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACPTVVGSRW 488
Score = 65.5 bits (158), Expect = 7e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R + L+DV GGATIFP L L+VFP+KGSA+ W+N D HS CP +
Sbjct: 425 DRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACPTVV 484
Query: 62 GNKW 65
G++W
Sbjct: 485 GSRW 488
>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
Length = 535
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DVE GG T FP L L V PE+GS +FWYN H + D+R H+ CPV G+KW
Sbjct: 448 LSDVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+DVE GG T FP L L V PE+GS +FW+N H + D+ H+ CPV
Sbjct: 439 NRIATAIYYLSDVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQ 498
Query: 62 GNKW 65
G+KW
Sbjct: 499 GSKW 502
>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
Length = 415
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DVE GG T+F ++N++++P KGSA FWYN N D++ H+ CPV
Sbjct: 320 GNRIATVLYY-MSDVEQGGGTVFTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPV 378
Query: 122 ALGNKW 127
G+KW
Sbjct: 379 LTGSKW 384
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GG T+F ++N++++P KGSA FW+N N D+ H+ CPV
Sbjct: 321 NRIATVLYYMSDVEQGGGTVFTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLT 380
Query: 62 GNKW 65
G+KW
Sbjct: 381 GSKW 384
>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
Length = 476
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DVE GG T+F ++N++++P KGSA FWYN N D++ H+ CPV
Sbjct: 381 GNRIATVLYY-MSDVEQGGGTVFTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPV 439
Query: 122 ALGNKW 127
G+KW
Sbjct: 440 LTGSKW 445
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GG T+F ++N++++P KGSA FW+N N D+ H+ CPV
Sbjct: 382 NRIATVLYYMSDVEQGGGTVFTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLT 441
Query: 62 GNKW 65
G+KW
Sbjct: 442 GSKW 445
>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
Length = 537
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 32/70 (45%), Positives = 44/70 (62%), Gaps = 1/70 (1%)
Query: 58 PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
P GN+ + L++VE GGAT FP L++ V P+ G+ +FWYN H + DYR H+
Sbjct: 436 PAEEGNRIATAIFY-LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHA 494
Query: 118 GCPVALGNKW 127
GCPV G+KW
Sbjct: 495 GCPVLKGSKW 504
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L++VE GGAT FP L++ V P+ G+ +FW+N H + DY H+GCPV
Sbjct: 441 NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 500
Query: 62 GNKW 65
G+KW
Sbjct: 501 GSKW 504
>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 532
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 32/69 (46%), Positives = 47/69 (68%), Gaps = 3/69 (4%)
Query: 62 GNKWGK---LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
GN++G L+ ++DVE GGAT+FPSL + V P+KG AVFW+N ++ + +H+G
Sbjct: 435 GNRFGNRVATLMIYMSDVEEGGATVFPSLGVRVSPKKGDAVFWWNIMSSWEGEMLTWHAG 494
Query: 119 CPVALGNKW 127
CPV G+KW
Sbjct: 495 CPVLYGSKW 503
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 42/64 (65%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + ++DVE GGAT+FPSL + V P+KG AVFW N ++ + +H+GCPV
Sbjct: 440 NRVATLMIYMSDVEEGGATVFPSLGVRVSPKKGDAVFWWNIMSSWEGEMLTWHAGCPVLY 499
Query: 62 GNKW 65
G+KW
Sbjct: 500 GSKW 503
>gi|194871348|ref|XP_001972831.1| GG13664 [Drosophila erecta]
gi|190654614|gb|EDV51857.1| GG13664 [Drosophila erecta]
Length = 520
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 29/67 (43%), Positives = 44/67 (65%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LG++ ++ +V GGAT+FP +N+TVFP+KGS ++W+N H + D R HS CP
Sbjct: 436 LGDRLASIIFYA-GEVLQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSQHSACP 494
Query: 121 VALGNKW 127
V G++W
Sbjct: 495 VVNGDRW 501
Score = 65.1 bits (157), Expect = 9e-09, Method: Composition-based stats.
Identities = 26/53 (49%), Positives = 36/53 (67%)
Query: 13 DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+V GGAT+FP +N+TVFP+KGS ++W N H + D HS CPV G++W
Sbjct: 449 EVLQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSQHSACPVVNGDRW 501
>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
Length = 534
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DV GGAT+FP + ++++P+KG+A FWYN N DY H+ CPV
Sbjct: 446 GNRIATILFY-MSDVSQGGATVFPGIKVSLWPKKGTAAFWYNLRKNGEGDYLTRHAACPV 504
Query: 122 ALGNKW 127
G+KW
Sbjct: 505 LTGSKW 510
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + ++++P+KG+A FW+N N DY H+ CPV
Sbjct: 447 NRIATILFYMSDVSQGGATVFPGIKVSLWPKKGTAAFWYNLRKNGEGDYLTRHAACPVLT 506
Query: 62 GNKW 65
G+KW
Sbjct: 507 GSKW 510
>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
Length = 537
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 32/70 (45%), Positives = 44/70 (62%), Gaps = 1/70 (1%)
Query: 58 PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
P GN+ + L++VE GGAT FP L++ V P+ G+ +FWYN H + DYR H+
Sbjct: 436 PAEEGNRIATAIFY-LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHA 494
Query: 118 GCPVALGNKW 127
GCPV G+KW
Sbjct: 495 GCPVLKGSKW 504
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L++VE GGAT FP L++ V P+ G+ +FW+N H + DY H+GCPV
Sbjct: 441 NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 500
Query: 62 GNKW 65
G+KW
Sbjct: 501 GSKW 504
>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
Length = 537
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 32/70 (45%), Positives = 44/70 (62%), Gaps = 1/70 (1%)
Query: 58 PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
P GN+ + L++VE GGAT FP L++ V P+ G+ +FWYN H + DYR H+
Sbjct: 436 PAEEGNRIATAIFY-LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHA 494
Query: 118 GCPVALGNKW 127
GCPV G+KW
Sbjct: 495 GCPVLKGSKW 504
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L++VE GGAT FP L++ V P+ G+ +FW+N H + DY H+GCPV
Sbjct: 441 NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 500
Query: 62 GNKW 65
G+KW
Sbjct: 501 GSKW 504
>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
Length = 537
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DVE GG T+F ++N+ ++P+KGSA FWYN N D++ H+ CPV
Sbjct: 442 GNRIATVLYY-MSDVEQGGGTVFTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPV 500
Query: 122 ALGNKW 127
G+KW
Sbjct: 501 LTGSKW 506
Score = 66.2 bits (160), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GG T+F ++N+ ++P+KGSA FW+N N D+ H+ CPV
Sbjct: 443 NRIATVLYYMSDVEQGGGTVFTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLT 502
Query: 62 GNKW 65
G+KW
Sbjct: 503 GSKW 506
>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
Length = 415
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DVE GG T+F ++N+ ++P+KGSA FWYN N D++ H+ CPV
Sbjct: 320 GNRIATVLYY-MSDVEQGGGTVFTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPV 378
Query: 122 ALGNKW 127
G+KW
Sbjct: 379 LTGSKW 384
Score = 66.2 bits (160), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GG T+F ++N+ ++P+KGSA FW+N N D+ H+ CPV
Sbjct: 321 NRIATVLYYMSDVEQGGGTVFTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLT 380
Query: 62 GNKW 65
G+KW
Sbjct: 381 GSKW 384
>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
Length = 535
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DVE GG T FP L L V PE+GS +FWYN H + D+R H+ CPV G+KW
Sbjct: 448 LSDVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DVE GG T FP L L V PE+GS +FW+N H + D+ H+ CPV
Sbjct: 439 NRMATGIYYLSDVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQ 498
Query: 62 GNKW 65
G+KW
Sbjct: 499 GSKW 502
>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
Length = 467
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 32/70 (45%), Positives = 44/70 (62%), Gaps = 1/70 (1%)
Query: 58 PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
P GN+ + L++VE GGAT FP L++ V P+ G+ +FWYN H + DYR H+
Sbjct: 366 PAEEGNRIATAIFY-LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHA 424
Query: 118 GCPVALGNKW 127
GCPV G+KW
Sbjct: 425 GCPVLKGSKW 434
Score = 66.2 bits (160), Expect = 5e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L++VE GGAT FP L++ V P+ G+ +FW+N H + DY H+GCPV
Sbjct: 371 NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 430
Query: 62 GNKW 65
G+KW
Sbjct: 431 GSKW 434
>gi|442747091|gb|JAA65705.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 533
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 31/67 (46%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
G++ L++ ++DVE GGAT+FPSL + V P KG AVFW+N ++ D +H+GCP
Sbjct: 439 FGDRVATLMIY-MSDVEEGGATVFPSLGVRVSPRKGDAVFWWNIKSSWEGDVLTWHAGCP 497
Query: 121 VALGNKW 127
V G+KW
Sbjct: 498 VLYGSKW 504
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R + ++DVE GGAT+FPSL + V P KG AVFW N ++ D +H+GCPV
Sbjct: 441 DRVATLMIYMSDVEEGGATVFPSLGVRVSPRKGDAVFWWNIKSSWEGDVLTWHAGCPVLY 500
Query: 62 GNKW 65
G+KW
Sbjct: 501 GSKW 504
>gi|390176894|ref|XP_002136933.2| GA26862 [Drosophila pseudoobscura pseudoobscura]
gi|388858830|gb|EDY67491.2| GA26862 [Drosophila pseudoobscura pseudoobscura]
Length = 520
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DV GGAT+F LN+TVFP+ G+ +FWYN + D R H+GCPV +G+KW
Sbjct: 438 LSDVPQGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKW 492
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DV GGAT+F LN+TVFP+ G+ +FW+N + D H+GCPV +G+KW
Sbjct: 438 LSDVPQGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKW 492
>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
Length = 537
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++VE GGAT FP L++ V P+ G+ +FWYN H + DYR H+GCPV G+KW
Sbjct: 450 LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L++VE GGAT FP L++ V P+ G+ +FW+N H + DY H+GCPV
Sbjct: 441 NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 500
Query: 62 GNKW 65
G+KW
Sbjct: 501 GSKW 504
>gi|195166675|ref|XP_002024160.1| GL22879 [Drosophila persimilis]
gi|194107515|gb|EDW29558.1| GL22879 [Drosophila persimilis]
Length = 484
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 38/59 (64%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
L +V GGA FP L TV+P KGSA+ W+N + N LDYR+ H CPV +G+KW K
Sbjct: 415 LNEVPHGGAISFPQLEFTVWPRKGSALVWHNLNNNLELDYRVAHISCPVIVGSKWSKFF 473
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 31/69 (44%), Positives = 40/69 (57%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
M+ +L +V GGA FP L TV+P KGSA+ WHN + N LDY + H CPV
Sbjct: 405 MDNLAKEEQQLNEVPHGGAISFPQLEFTVWPRKGSALVWHNLNNNLELDYRVAHISCPVI 464
Query: 61 LGNKWGKLL 69
+G+KW K
Sbjct: 465 VGSKWSKFF 473
>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
musculus]
gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_d [Rattus norvegicus]
Length = 189
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 108 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 166
Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 112 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 166
>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
Length = 187
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 30/59 (50%), Positives = 41/59 (69%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ ++DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 106 FLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 164
Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP L ++P+KG+AVFW+N + DY H+ CPV +G KW
Sbjct: 110 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 164
>gi|195391754|ref|XP_002054525.1| GJ24502 [Drosophila virilis]
gi|194152611|gb|EDW68045.1| GJ24502 [Drosophila virilis]
Length = 487
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 31/67 (46%), Positives = 44/67 (65%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LGN+ +L ++DVE GGAT+F SL+ ++P+KG+A FW N H + D R H+ CP
Sbjct: 394 LGNRIATMLFY-MSDVEQGGATVFTSLHAALWPKKGTAAFWMNLHRSGEGDVRTRHAACP 452
Query: 121 VALGNKW 127
V G+KW
Sbjct: 453 VLTGSKW 459
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+F SL+ ++P+KG+A FW N H + D H+ CPV
Sbjct: 396 NRIATMLFYMSDVEQGGATVFTSLHAALWPKKGTAAFWMNLHRSGEGDVRTRHAACPVLT 455
Query: 62 GNKW 65
G+KW
Sbjct: 456 GSKW 459
>gi|442751927|gb|JAA68123.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 522
Score = 69.7 bits (169), Expect = 3e-10, Method: Composition-based stats.
Identities = 35/87 (40%), Positives = 54/87 (62%), Gaps = 7/87 (8%)
Query: 41 NAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF 100
+AH N+ DY++ + G++ L++ +TDVE+GGAT+FP L + + P+KG A F
Sbjct: 414 HAH-NSTADYYLQNKE-----GDRLATLMIY-MTDVEVGGATVFPRLGVRLVPKKGDAAF 466
Query: 101 WYNAHANTLLDYRMYHSGCPVALGNKW 127
W+N A+ D H+GCPV G+KW
Sbjct: 467 WWNLKASGEGDTLTMHAGCPVLYGSKW 493
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 28/57 (49%), Positives = 38/57 (66%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ +TDVE+GGAT+FP L + + P+KG A FW N A+ D H+GCPV G+KW
Sbjct: 437 IYMTDVEVGGATVFPRLGVRLVPKKGDAAFWWNLKASGEGDTLTMHAGCPVLYGSKW 493
>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
harrisii]
Length = 385
Score = 69.7 bits (169), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 40/55 (72%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 308 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 362
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 40/55 (72%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +GNKW
Sbjct: 308 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 362
>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
Length = 549
Score = 69.7 bits (169), Expect = 3e-10, Method: Composition-based stats.
Identities = 36/69 (52%), Positives = 43/69 (62%), Gaps = 2/69 (2%)
Query: 61 LGNKWGKLL--LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
G K +LL L L DV LGGAT FP L+L V P KGS + WYN H + D+R H+G
Sbjct: 449 FGWKGNRLLTALFYLNDVPLGGATAFPFLHLAVPPVKGSLLVWYNLHRSLHKDFRTKHAG 508
Query: 119 CPVALGNKW 127
CPV G+KW
Sbjct: 509 CPVLKGSKW 517
Score = 65.1 bits (157), Expect = 9e-09, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DV LGGAT FP L+L V P KGS + W+N H + D+ H+GCPV
Sbjct: 454 NRLLTALFYLNDVPLGGATAFPFLHLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLK 513
Query: 62 GNKW 65
G+KW
Sbjct: 514 GSKW 517
>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
Length = 509
Score = 69.7 bits (169), Expect = 3e-10, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DV GGAT FP L L+VFPEKG+A+ WYN + D R HS CP +G++W
Sbjct: 434 LSDVPQGGATTFPDLGLSVFPEKGAALLWYNLDHKGVGDNRTAHSACPTIVGSRW 488
>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 36/69 (52%), Positives = 43/69 (62%), Gaps = 2/69 (2%)
Query: 61 LGNKWGKLL--LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
G K +LL L L DV LGGAT FP L+L V P KGS + WYN H + D+R H+G
Sbjct: 449 FGWKGNRLLTALFYLNDVPLGGATAFPFLHLAVPPVKGSLLVWYNLHRSLHKDFRTKHAG 508
Query: 119 CPVALGNKW 127
CPV G+KW
Sbjct: 509 CPVLKGSKW 517
Score = 65.1 bits (157), Expect = 9e-09, Method: Composition-based stats.
Identities = 31/64 (48%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DV LGGAT FP L+L V P KGS + W+N H + D+ H+GCPV
Sbjct: 454 NRLLTALFYLNDVPLGGATAFPFLHLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLK 513
Query: 62 GNKW 65
G+KW
Sbjct: 514 GSKW 517
>gi|194905397|ref|XP_001981189.1| GG11929 [Drosophila erecta]
gi|190655827|gb|EDV53059.1| GG11929 [Drosophila erecta]
Length = 538
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+ DV LGGAT FP L L V PEKGS + W N H++ DYR H+GCP+ G+KW
Sbjct: 452 INDVALGGATAFPFLRLAVPPEKGSLLMWNNLHSSLHKDYRSKHAGCPILQGSKW 506
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ DV LGGAT FP L L V PEKGS + W+N H++ DY H+GCP+ G+KW
Sbjct: 452 INDVALGGATAFPFLRLAVPPEKGSLLMWNNLHSSLHKDYRSKHAGCPILQGSKW 506
>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
Length = 409
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DVE GG T FP L L V PE+GS +FWYN H + D+R H+ CPV G+KW
Sbjct: 322 LADVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 376
Score = 65.5 bits (158), Expect = 7e-09, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L DVE GG T FP L L V PE+GS +FW+N H + D+ H+ CPV
Sbjct: 313 NRMATGIYYLADVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQ 372
Query: 62 GNKW 65
G+KW
Sbjct: 373 GSKW 376
>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
Length = 535
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DVE GG T FP L L V PE+GS +FWYN H + D+R H+ CPV G+KW
Sbjct: 448 LADVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
Score = 65.5 bits (158), Expect = 7e-09, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L DVE GG T FP L L V PE+GS +FW+N H + D+ H+ CPV
Sbjct: 439 NRMATGIYYLADVEAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQ 498
Query: 62 GNKW 65
G+KW
Sbjct: 499 GSKW 502
>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
caballus]
Length = 302
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 40/55 (72%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 225 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 279
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 40/55 (72%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +GNKW
Sbjct: 225 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 279
>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
[Papio anubis]
Length = 379
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 40/55 (72%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 302 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 356
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 40/55 (72%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +GNKW
Sbjct: 302 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 356
>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
Length = 473
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DV GGATIFP L L+VFP+KGSA+ WYN D R HS CP +G++W
Sbjct: 398 LSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACPTIVGSRW 452
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+DV GGATIFP L L+VFP+KGSA+ W+N D HS CP +
Sbjct: 389 NRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACPTIV 448
Query: 62 GNKW 65
G++W
Sbjct: 449 GSRW 452
>gi|195156517|ref|XP_002019146.1| GL25581 [Drosophila persimilis]
gi|194115299|gb|EDW37342.1| GL25581 [Drosophila persimilis]
Length = 206
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 31/57 (54%), Positives = 39/57 (68%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 129
+TDV LGGATIFP +NLT+ P+KGSA+FWYN H + H+ CP GN+W K
Sbjct: 130 MTDVPLGGATIFPKINLTIQPKKGSALFWYNIHNDWEPHVLTRHAVCPTIEGNRWSK 186
Score = 68.6 bits (166), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 30/64 (46%), Positives = 43/64 (67%)
Query: 4 AEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGN 63
+++ + +TDV LGGATIFP +NLT+ P+KGSA+FW+N H + H+ CP GN
Sbjct: 123 SDIKANPMTDVPLGGATIFPKINLTIQPKKGSALFWYNIHNDWEPHVLTRHAVCPTIEGN 182
Query: 64 KWGK 67
+W K
Sbjct: 183 RWSK 186
>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
Length = 244
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 40/55 (72%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 167 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 221
Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 40/55 (72%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +GNKW
Sbjct: 167 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 221
>gi|403274090|ref|XP_003928822.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saimiri
boliviensis boliviensis]
Length = 149
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 40/55 (72%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 72 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 126
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 40/55 (72%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DV GGAT+FP + +V+P+KG+AVFW+N A+ DY H+ CPV +GNKW
Sbjct: 72 MSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 126
>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
precursor (4-PH alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 527
Score = 69.3 bits (168), Expect = 4e-10, Method: Composition-based stats.
Identities = 30/56 (53%), Positives = 36/56 (64%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L+DVE+GG T F + ++ P KGSAVFWYN + D R YH CPVA GNKW
Sbjct: 441 LSDVEVGGRTAFVNAGVSAKPIKGSAVFWYNVFPSGEPDLRTYHGACPVAFGNKWA 496
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/58 (48%), Positives = 36/58 (62%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
+ L+DVE+GG T F + ++ P KGSAVFW+N + D YH CPVA GNKW
Sbjct: 439 IYLSDVEVGGRTAFVNAGVSAKPIKGSAVFWYNVFPSGEPDLRTYHGACPVAFGNKWA 496
>gi|195159309|ref|XP_002020524.1| GL13466 [Drosophila persimilis]
gi|194117293|gb|EDW39336.1| GL13466 [Drosophila persimilis]
Length = 643
Score = 69.3 bits (168), Expect = 4e-10, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DV GGAT+F LN+TVFP+ G+ +FWYN + D R H+GCPV +G+KW
Sbjct: 561 LSDVPHGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKW 615
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DV GGAT+F LN+TVFP+ G+ +FW+N + D H+GCPV +G+KW
Sbjct: 561 LSDVPHGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKW 615
>gi|195452726|ref|XP_002073473.1| GK14136 [Drosophila willistoni]
gi|194169558|gb|EDW84459.1| GK14136 [Drosophila willistoni]
Length = 550
Score = 69.3 bits (168), Expect = 5e-10, Method: Composition-based stats.
Identities = 31/67 (46%), Positives = 43/67 (64%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LGN+ +L ++DVE GGAT+F SL+ ++P+KG+A FW N H + D R H+ CP
Sbjct: 457 LGNRIATVLFY-MSDVEQGGATVFTSLHAALWPKKGTAAFWMNLHRDGEGDVRTRHAACP 515
Query: 121 VALGNKW 127
V G KW
Sbjct: 516 VLTGTKW 522
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+F SL+ ++P+KG+A FW N H + D H+ CPV
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLHAALWPKKGTAAFWMNLHRDGEGDVRTRHAACPVLT 518
Query: 62 GNKW 65
G KW
Sbjct: 519 GTKW 522
>gi|281361323|ref|NP_652183.2| CG15864 [Drosophila melanogaster]
gi|272476864|gb|AAF54202.3| CG15864 [Drosophila melanogaster]
Length = 490
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 50/79 (63%), Gaps = 1/79 (1%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D+ Y + G++ +L L +V+ GGATIFP +N+TV P+KGSAVFWYN H +
Sbjct: 384 DFHEYTNTTRPKQGDRIATVLFY-LGEVDSGGATIFPMINITVTPKKGSAVFWYNLHNSG 442
Query: 109 LLDYRMYHSGCPVALGNKW 127
++ + HS CPV G+K+
Sbjct: 443 AMNLKSLHSACPVISGSKY 461
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 40/55 (72%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +V+ GGATIFP +N+TV P+KGSAVFW+N H + ++ HS CPV G+K+
Sbjct: 407 LGEVDSGGATIFPMINITVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGSKY 461
>gi|198466397|ref|XP_002135180.1| GA23908 [Drosophila pseudoobscura pseudoobscura]
gi|198150581|gb|EDY73807.1| GA23908 [Drosophila pseudoobscura pseudoobscura]
Length = 403
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 32/69 (46%), Positives = 41/69 (59%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
M+ +L +V GGA FP L TV+P KGSA+ WHN + N LDY + H CPV
Sbjct: 324 MDNLAKEEQQLNEVPHGGAISFPQLEFTVWPRKGSALVWHNLNNNLELDYRVAHISCPVI 383
Query: 61 LGNKWGKLL 69
+G+KW K L
Sbjct: 384 VGSKWSKFL 392
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 31/61 (50%), Positives = 39/61 (63%)
Query: 71 SGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKL 130
L +V GGA FP L TV+P KGSA+ W+N + N LDYR+ H CPV +G+KW K
Sbjct: 332 QQLNEVPHGGAISFPQLEFTVWPRKGSALVWHNLNNNLELDYRVAHISCPVIVGSKWSKF 391
Query: 131 L 131
L
Sbjct: 392 L 392
>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
[Drosophila melanogaster]
Length = 286
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 48/79 (60%), Gaps = 1/79 (1%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D++ + P GN+ + L++VE GGAT FP L++ V P+ G+ +FWYN H +
Sbjct: 176 DFFRDPNHYPAEEGNRIATAIFY-LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSL 234
Query: 109 LLDYRMYHSGCPVALGNKW 127
DYR H+GCPV G+KW
Sbjct: 235 DKDYRTKHAGCPVLKGSKW 253
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L++VE GGAT FP L++ V P+ G+ +FW+N H + DY H+GCPV
Sbjct: 190 NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 249
Query: 62 GNKW 65
G+KW
Sbjct: 250 GSKW 253
>gi|194751825|ref|XP_001958224.1| GF23629 [Drosophila ananassae]
gi|190625506|gb|EDV41030.1| GF23629 [Drosophila ananassae]
Length = 523
Score = 69.3 bits (168), Expect = 5e-10, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 45/68 (66%), Gaps = 1/68 (1%)
Query: 60 ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
ALG++ G ++ +DV GGAT+FP + +++ P KGS+VFWYN + + D R HS C
Sbjct: 436 ALGDRLGTIIFYA-SDVLQGGATVFPDIKVSITPRKGSSVFWYNLYDDGRPDIRSRHSVC 494
Query: 120 PVALGNKW 127
PV G++W
Sbjct: 495 PVINGDRW 502
Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 36/54 (66%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+DV GGAT+FP + +++ P KGS+VFW+N + + D HS CPV G++W
Sbjct: 449 SDVLQGGATVFPDIKVSITPRKGSSVFWYNLYDDGRPDIRSRHSVCPVINGDRW 502
>gi|195159299|ref|XP_002020519.1| GL13471 [Drosophila persimilis]
gi|194117288|gb|EDW39331.1| GL13471 [Drosophila persimilis]
Length = 238
Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 30/60 (50%), Positives = 41/60 (68%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLL 132
L+DV GGAT+F LN+TVFP+ G+ +FWYN + D R H+GCPV +G+KW + L
Sbjct: 176 LSDVPQGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKWSRFSL 235
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/60 (46%), Positives = 40/60 (66%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLL 70
L+DV GGAT+F LN+TVFP+ G+ +FW+N + D H+GCPV +G+KW + L
Sbjct: 176 LSDVPQGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKWSRFSL 235
>gi|194904100|ref|XP_001981000.1| GG23922 [Drosophila erecta]
gi|190652703|gb|EDV49958.1| GG23922 [Drosophila erecta]
Length = 490
Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY Y + G++ +L L +V GGATIFP +N+TV P+KGSAVFWYN H +
Sbjct: 384 DYHEYSNMTREKYGDRIATVLFY-LGEVHSGGATIFPRINITVTPKKGSAVFWYNLHNSG 442
Query: 109 LLDYRMYHSGCPVALGNKW 127
+ HS CPV G+K+
Sbjct: 443 AMHSETLHSACPVISGSKY 461
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +V GGATIFP +N+TV P+KGSAVFW+N H + + HS CPV G+K+
Sbjct: 407 LGEVHSGGATIFPRINITVTPKKGSAVFWYNLHNSGAMHSETLHSACPVISGSKY 461
>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
Length = 187
Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 48/79 (60%), Gaps = 1/79 (1%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D++ + P GN+ + L++VE GGAT FP L++ V P+ G+ +FWYN H +
Sbjct: 77 DFFRDPNHYPAEEGNRIATAIFY-LSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSL 135
Query: 109 LLDYRMYHSGCPVALGNKW 127
DYR H+GCPV G+KW
Sbjct: 136 DKDYRTKHAGCPVLKGSKW 154
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L++VE GGAT FP L++ V P+ G+ +FW+N H + DY H+GCPV
Sbjct: 91 NRIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLK 150
Query: 62 GNKW 65
G+KW
Sbjct: 151 GSKW 154
>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
Length = 542
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 33/65 (50%), Positives = 40/65 (61%), Gaps = 3/65 (4%)
Query: 66 GKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 122
G +L+ L DV LGGAT FP L L V P KGS + WYN H + D+R H+GCPV
Sbjct: 446 GNRMLTALFYINDVALGGATAFPFLQLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVI 505
Query: 123 LGNKW 127
G+KW
Sbjct: 506 KGSKW 510
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + + DV LGGAT FP L L V P KGS + W+N H + D+ H+GCPV
Sbjct: 447 NRMLTALFYINDVALGGATAFPFLQLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVIK 506
Query: 62 GNKW 65
G+KW
Sbjct: 507 GSKW 510
>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
rotundata]
Length = 550
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DVE GG T+F ++N++++P KGSA FW+N N D R H+ CPV
Sbjct: 455 GNRIATVLYY-MSDVEQGGGTVFTAINISLWPRKGSAAFWFNLKPNGEGDLRTRHAACPV 513
Query: 122 ALGNKW 127
G+KW
Sbjct: 514 LTGSKW 519
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GG T+F ++N++++P KGSA FW N N D H+ CPV
Sbjct: 456 NRIATVLYYMSDVEQGGGTVFTAINISLWPRKGSAAFWFNLKPNGEGDLRTRHAACPVLT 515
Query: 62 GNKW 65
G+KW
Sbjct: 516 GSKW 519
>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
Length = 415
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GG T+F ++N++++P KGSA FWHN N D+ H+ CPV
Sbjct: 321 NRIATVLYYMSDVEQGGGTVFTAINISLWPRKGSAAFWHNLKPNGEGDFKTRHAACPVLT 380
Query: 62 GNKW 65
G+KW
Sbjct: 381 GSKW 384
Score = 68.6 bits (166), Expect = 8e-10, Method: Composition-based stats.
Identities = 28/66 (42%), Positives = 44/66 (66%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DVE GG T+F ++N++++P KGSA FW+N N D++ H+ CPV
Sbjct: 320 GNRIATVLYY-MSDVEQGGGTVFTAINISLWPRKGSAAFWHNLKPNGEGDFKTRHAACPV 378
Query: 122 ALGNKW 127
G+KW
Sbjct: 379 LTGSKW 384
>gi|195159305|ref|XP_002020522.1| GL13469 [Drosophila persimilis]
gi|194117291|gb|EDW39334.1| GL13469 [Drosophila persimilis]
Length = 253
Score = 68.6 bits (166), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 42/61 (68%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLL 132
L+DV LGGAT+FP L+L+VFP+KG+ + WYN + HS CPV +G++WGK+ L
Sbjct: 159 LSDVPLGGATVFPLLDLSVFPKKGAVLMWYNLDHKGQGMEKTIHSACPVVVGSRWGKINL 218
Query: 133 S 133
Sbjct: 219 K 219
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 47/66 (71%), Gaps = 2/66 (3%)
Query: 10 ELTDVELGGATIFPSLNLTVFPEKGSAVFWHN-AHANTLLDYWMYHSGCPVALGNKWGKL 68
EL+DV LGGAT+FP L+L+VFP+KG+ + W+N H ++ + HS CPV +G++WGK+
Sbjct: 158 ELSDVPLGGATVFPLLDLSVFPKKGAVLMWYNLDHKGQGMEKTI-HSACPVVVGSRWGKI 216
Query: 69 LLSGLT 74
L T
Sbjct: 217 NLKRET 222
>gi|195452742|ref|XP_002073480.1| GK13123 [Drosophila willistoni]
gi|194169565|gb|EDW84466.1| GK13123 [Drosophila willistoni]
Length = 540
Score = 68.6 bits (166), Expect = 8e-10, Method: Composition-based stats.
Identities = 33/66 (50%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ LL L +V +GGAT FP L L V P KGS + WYN H + D+R H+GCPV
Sbjct: 439 GNRLLTALLY-LNEVPMGGATAFPYLKLAVPPVKGSLLVWYNLHRSLDPDFRTKHAGCPV 497
Query: 122 ALGNKW 127
+G+KW
Sbjct: 498 LMGSKW 503
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + + L +V +GGAT FP L L V P KGS + W+N H + D+ H+GCPV +
Sbjct: 440 NRLLTALLYLNEVPMGGATAFPYLKLAVPPVKGSLLVWYNLHRSLDPDFRTKHAGCPVLM 499
Query: 62 GNKW 65
G+KW
Sbjct: 500 GSKW 503
>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide precursor [Salmo
salar]
gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
Length = 545
Score = 68.6 bits (166), Expect = 8e-10, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L+ ++DV GGAT+F + V+P+KGSAVFWYN + DY H+ CPV
Sbjct: 458 GNRIATWLIY-MSDVPSGGATVFTDVGAAVWPKKGSAVFWYNLFPSGEGDYSTRHAACPV 516
Query: 122 ALGNKW 127
+GNKW
Sbjct: 517 LVGNKW 522
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + ++DV GGAT+F + V+P+KGSAVFW+N + DY H+ CPV +
Sbjct: 459 NRIATWLIYMSDVPSGGATVFTDVGAAVWPKKGSAVFWYNLFPSGEGDYSTRHAACPVLV 518
Query: 62 GNKW 65
GNKW
Sbjct: 519 GNKW 522
>gi|195575111|ref|XP_002105523.1| GD16991 [Drosophila simulans]
gi|194201450|gb|EDX15026.1| GD16991 [Drosophila simulans]
Length = 542
Score = 68.6 bits (166), Expect = 8e-10, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L +V GG T FP +NLTVFP+ GSA+FWYN N H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRINLTVFPQPGSALFWYNLDTNGNDHMGSLHTGCPVIVGSKW 507
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +V GG T FP +NLTVFP+ GSA+FW+N N H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRINLTVFPQPGSALFWYNLDTNGNDHMGSLHTGCPVIVGSKW 507
>gi|195572619|ref|XP_002104293.1| GD18524 [Drosophila simulans]
gi|194200220|gb|EDX13796.1| GD18524 [Drosophila simulans]
Length = 472
Score = 68.6 bits (166), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 49/79 (62%), Gaps = 1/79 (1%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D+ Y + G++ +L L +V+ GGATIFP +N+ V P+KGSAVFWYN H +
Sbjct: 366 DFHEYTNTTRPKQGDRIATVLFY-LGEVDSGGATIFPKINIAVTPKKGSAVFWYNLHNSG 424
Query: 109 LLDYRMYHSGCPVALGNKW 127
++ + HS CPV G+K+
Sbjct: 425 AMNLKSLHSACPVISGSKY 443
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 39/55 (70%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +V+ GGATIFP +N+ V P+KGSAVFW+N H + ++ HS CPV G+K+
Sbjct: 389 LGEVDSGGATIFPKINIAVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGSKY 443
>gi|405967005|gb|EKC32220.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 303
Score = 68.6 bits (166), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 52/100 (52%), Gaps = 8/100 (8%)
Query: 29 VFPEKGSAVFWHNAHA-NTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSL 87
+ P+ G W+ A W H+G +A W + L+DVE GGAT+FP L
Sbjct: 176 IDPDIGDEFLWNMTEAVQGPRALWRKHTGQRIAT---W----MFYLSDVEAGGATVFPKL 228
Query: 88 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V KG+A FWYN + +D R H+GCPV LG+KW
Sbjct: 229 EARVPVVKGAAAFWYNLTPSGKIDRRTQHAGCPVILGSKW 268
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DVE GGAT+FP L V KG+A FW+N + +D H+GCPV LG+KW
Sbjct: 214 LSDVEAGGATVFPKLEARVPVVKGAAAFWYNLTPSGKIDRRTQHAGCPVILGSKW 268
>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
Length = 520
Score = 68.2 bits (165), Expect = 9e-10, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 40/55 (72%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DVE GGAT+FP++ TV+P++G+AV WYN + D + H+ CPV +G+KW
Sbjct: 446 MSDVEQGGATVFPNIYKTVYPQRGTAVMWYNLKDDGQPDEQTLHAACPVLVGSKW 500
Score = 65.1 bits (157), Expect = 9e-09, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 39/55 (70%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP++ TV+P++G+AV W+N + D H+ CPV +G+KW
Sbjct: 446 MSDVEQGGATVFPNIYKTVYPQRGTAVMWYNLKDDGQPDEQTLHAACPVLVGSKW 500
>gi|195330778|ref|XP_002032080.1| GM23711 [Drosophila sechellia]
gi|194121023|gb|EDW43066.1| GM23711 [Drosophila sechellia]
Length = 490
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 49/79 (62%), Gaps = 1/79 (1%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D+ Y + G++ +L L +V+ GGATIFP +N+ V P+KGSAVFWYN H +
Sbjct: 384 DFHEYTNTTRPKQGDRIATVLFY-LGEVDSGGATIFPKINIAVTPKKGSAVFWYNLHNSG 442
Query: 109 LLDYRMYHSGCPVALGNKW 127
++ + HS CPV G+K+
Sbjct: 443 AMNLKSLHSACPVISGSKY 461
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 39/55 (70%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +V+ GGATIFP +N+ V P+KGSAVFW+N H + ++ HS CPV G+K+
Sbjct: 407 LGEVDSGGATIFPKINIAVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGSKY 461
>gi|198477148|ref|XP_002136736.1| GA29214 [Drosophila pseudoobscura pseudoobscura]
gi|198145041|gb|EDY71753.1| GA29214 [Drosophila pseudoobscura pseudoobscura]
Length = 520
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DV GGAT+F LN+TVFP+ G+ +FWYN + D R H+GCPV G+KW
Sbjct: 438 LSDVPQGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIGGSKW 492
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DV GGAT+F LN+TVFP+ G+ +FW+N + D H+GCPV G+KW
Sbjct: 438 LSDVPQGGATLFTELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIGGSKW 492
>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
Length = 493
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LTDVE GGAT+FP++ VFP++GSA+ WYN + + + H+ CPV +G+KW
Sbjct: 416 LTDVEQGGATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKW 470
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 29/67 (43%), Positives = 44/67 (65%), Gaps = 2/67 (2%)
Query: 1 MNRAEMSSV--ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP 58
MN +++V LTDVE GGAT+FP++ VFP++GSA+ W+N + + H+ CP
Sbjct: 404 MNGDRIATVLFYLTDVEQGGATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACP 463
Query: 59 VALGNKW 65
V +G+KW
Sbjct: 464 VLVGSKW 470
>gi|195055779|ref|XP_001994790.1| GH14110 [Drosophila grimshawi]
gi|193892553|gb|EDV91419.1| GH14110 [Drosophila grimshawi]
Length = 487
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 43/67 (64%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LGN+ +L ++DV+ GGAT+F SL ++P+KG+A FW N H + D R H+ CP
Sbjct: 394 LGNRIATMLFY-MSDVQQGGATVFTSLRTALWPKKGTAAFWMNLHRSGEGDARTRHAACP 452
Query: 121 VALGNKW 127
V G+KW
Sbjct: 453 VLTGSKW 459
Score = 62.0 bits (149), Expect = 9e-08, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV+ GGAT+F SL ++P+KG+A FW N H + D H+ CPV
Sbjct: 396 NRIATMLFYMSDVQQGGATVFTSLRTALWPKKGTAAFWMNLHRSGEGDARTRHAACPVLT 455
Query: 62 GNKW 65
G+KW
Sbjct: 456 GSKW 459
>gi|195159319|ref|XP_002020529.1| GL14044 [Drosophila persimilis]
gi|194117298|gb|EDW39341.1| GL14044 [Drosophila persimilis]
Length = 536
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DV GG T FP L L V PE+GS +FWYN H + D+R H+ CPV G+KW
Sbjct: 449 LSDVVAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 503
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+DV GG T FP L L V PE+GS +FW+N H + D+ H+ CPV
Sbjct: 440 NRLATAIYYLSDVVAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQ 499
Query: 62 GNKW 65
G+KW
Sbjct: 500 GSKW 503
>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
Length = 545
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 31/67 (46%), Positives = 41/67 (61%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ +L LTDVE GG T FP L + P+K +A FWYN HA+ D R H CP
Sbjct: 441 MGNRIATVLFY-LTDVEQGGGTAFPVLKQLLKPKKYAAAFWYNLHASGAGDVRTMHGACP 499
Query: 121 VALGNKW 127
+ +G+KW
Sbjct: 500 IIVGSKW 506
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR LTDVE GG T FP L + P+K +A FW+N HA+ D H CP+ +
Sbjct: 443 NRIATVLFYLTDVEQGGGTAFPVLKQLLKPKKYAAAFWYNLHASGAGDVRTMHGACPIIV 502
Query: 62 GNKW 65
G+KW
Sbjct: 503 GSKW 506
>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
Length = 541
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + V P KG+AVFWYN + DY H+ CPV
Sbjct: 454 GNRIATWLFY-MSDVAAGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPV 512
Query: 122 ALGNKW 127
+GNKW
Sbjct: 513 LVGNKW 518
Score = 65.1 bits (157), Expect = 9e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + V P KG+AVFW+N + DY H+ CPV +
Sbjct: 455 NRIATWLFYMSDVAAGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLV 514
Query: 62 GNKW 65
GNKW
Sbjct: 515 GNKW 518
>gi|198449648|ref|XP_001357666.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
gi|198130700|gb|EAL26801.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
Length = 536
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DV GG T FP L L V PE+GS +FWYN H + D+R H+ CPV G+KW
Sbjct: 449 LSDVVAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 503
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+DV GG T FP L L V PE+GS +FW+N H + D+ H+ CPV
Sbjct: 440 NRLATAIYYLSDVVAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQ 499
Query: 62 GNKW 65
G+KW
Sbjct: 500 GSKW 503
>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
Length = 522
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 31/66 (46%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+FP + V P KG+AVFWYN + DY H+ CPV
Sbjct: 435 GNRIATWLFY-MSDVAAGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPV 493
Query: 122 ALGNKW 127
+GNKW
Sbjct: 494 LVGNKW 499
Score = 65.1 bits (157), Expect = 9e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP + V P KG+AVFW+N + DY H+ CPV +
Sbjct: 436 NRIATWLFYMSDVAAGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLV 495
Query: 62 GNKW 65
GNKW
Sbjct: 496 GNKW 499
>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Danio rerio]
Length = 536
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/66 (45%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L ++DV GGAT+F + V+P+KG+AVFWYN + DY H+ CPV
Sbjct: 449 GNRIATWLFY-MSDVSAGGATVFTDVGAAVWPKKGTAVFWYNLFPSGEGDYSTRHAACPV 507
Query: 122 ALGNKW 127
+GNKW
Sbjct: 508 LVGNKW 513
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+F + V+P+KG+AVFW+N + DY H+ CPV +
Sbjct: 450 NRIATWLFYMSDVSAGGATVFTDVGAAVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLV 509
Query: 62 GNKW 65
GNKW
Sbjct: 510 GNKW 513
>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
Length = 528
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LTDVE GGAT+FP++ VFP++GSA+ WYN + + + H+ CPV +G+KW
Sbjct: 451 LTDVEQGGATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKW 505
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 29/67 (43%), Positives = 44/67 (65%), Gaps = 2/67 (2%)
Query: 1 MNRAEMSSV--ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP 58
MN +++V LTDVE GGAT+FP++ VFP++GSA+ W+N + + H+ CP
Sbjct: 439 MNGDRIATVLFYLTDVEQGGATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACP 498
Query: 59 VALGNKW 65
V +G+KW
Sbjct: 499 VLVGSKW 505
>gi|195110919|ref|XP_002000027.1| GI24860 [Drosophila mojavensis]
gi|193916621|gb|EDW15488.1| GI24860 [Drosophila mojavensis]
Length = 487
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 31/67 (46%), Positives = 43/67 (64%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LGN+ +L ++DVE GGAT+F SL+ + P+KG+A FW N H + D R H+ CP
Sbjct: 394 LGNRIATVLFY-MSDVEQGGATVFTSLHAVLKPKKGTAAFWMNLHRSGEGDVRTRHAACP 452
Query: 121 VALGNKW 127
V G+KW
Sbjct: 453 VLTGSKW 459
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+F SL+ + P+KG+A FW N H + D H+ CPV
Sbjct: 396 NRIATVLFYMSDVEQGGATVFTSLHAVLKPKKGTAAFWMNLHRSGEGDVRTRHAACPVLT 455
Query: 62 GNKW 65
G+KW
Sbjct: 456 GSKW 459
>gi|198471971|ref|XP_002133305.1| GA28042 [Drosophila pseudoobscura pseudoobscura]
gi|198139547|gb|EDY70707.1| GA28042 [Drosophila pseudoobscura pseudoobscura]
Length = 203
Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/55 (54%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+TDV LGGATIFP +NLT+ P+KGSA+FWYN H + H+ CP GN+W
Sbjct: 130 MTDVPLGGATIFPKINLTIQPKKGSALFWYNIHNDWEPHVLTRHAVCPTIEGNRW 184
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 42/62 (67%)
Query: 4 AEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGN 63
+++ + +TDV LGGATIFP +NLT+ P+KGSA+FW+N H + H+ CP GN
Sbjct: 123 SDIKANPMTDVPLGGATIFPKINLTIQPKKGSALFWYNIHNDWEPHVLTRHAVCPTIEGN 182
Query: 64 KW 65
+W
Sbjct: 183 RW 184
>gi|195145314|ref|XP_002013641.1| GL24244 [Drosophila persimilis]
gi|194102584|gb|EDW24627.1| GL24244 [Drosophila persimilis]
Length = 496
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DV LGG T FP+L L + P +GSA+ W+NA+ DYR H+ CPV LG +W
Sbjct: 422 LSDVRLGGFTSFPALGLGIRPNRGSALVWHNANNAGNCDYRALHAACPVLLGTRW 476
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DV LGG T FP+L L + P +GSA+ WHNA+ DY H+ CPV LG +W
Sbjct: 422 LSDVRLGGFTSFPALGLGIRPNRGSALVWHNANNAGNCDYRALHAACPVLLGTRW 476
>gi|156352054|ref|XP_001622587.1| predicted protein [Nematostella vectensis]
gi|156209158|gb|EDO30487.1| predicted protein [Nematostella vectensis]
Length = 531
Score = 67.8 bits (164), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 41/69 (59%), Gaps = 1/69 (1%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+ +GN+ +L ++DVE GGAT+F V P KG A FWYN H N D R H+
Sbjct: 441 LGIGNRIATVLYY-MSDVEAGGATVFVDAEAIVKPSKGDAAFWYNLHKNGKGDERTRHAA 499
Query: 119 CPVALGNKW 127
CP+ +G+KW
Sbjct: 500 CPIIVGSKW 508
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+F V P KG A FW+N H N D H+ CP+ +
Sbjct: 445 NRIATVLYYMSDVEAGGATVFVDAEAIVKPSKGDAAFWYNLHKNGKGDERTRHAACPIIV 504
Query: 62 GNKW 65
G+KW
Sbjct: 505 GSKW 508
>gi|195341556|ref|XP_002037372.1| GM12148 [Drosophila sechellia]
gi|194131488|gb|EDW53531.1| GM12148 [Drosophila sechellia]
Length = 542
Score = 67.8 bits (164), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/57 (54%), Positives = 38/57 (66%), Gaps = 4/57 (7%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYN--AHANTLLDYRMYHSGCPVALGNKW 127
L +V GG T FP LNLTVFP+ GSA+FWYN N +D H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMD--SLHTGCPVIVGSKW 507
Score = 65.5 bits (158), Expect = 6e-09, Method: Composition-based stats.
Identities = 30/57 (52%), Positives = 38/57 (66%), Gaps = 4/57 (7%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHN--AHANTLLDYWMYHSGCPVALGNKW 65
L +V GG T FP LNLTVFP+ GSA+FW+N N +D H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMD--SLHTGCPVIVGSKW 507
>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
Length = 386
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/55 (54%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DV GGATIFP L L+VFP+KGSA+ WYN D R HS CP +G++W
Sbjct: 311 LSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACPTVVGSRW 365
Score = 62.8 bits (151), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R + L+DV GGATIFP L L+VFP+KGSA+ W+N D HS CP +
Sbjct: 302 DRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACPTVV 361
Query: 62 GNKW 65
G++W
Sbjct: 362 GSRW 365
>gi|241598357|ref|XP_002404731.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215500462|gb|EEC09956.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 218
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 51/83 (61%), Gaps = 6/83 (7%)
Query: 45 NTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 104
N+ DY++ H G++ L++ +TDVE GGAT+FPSL + + P+KG A FW+N
Sbjct: 113 NSSADYYLQH-----GRGDRIATLMIY-MTDVEAGGATVFPSLGIRLTPKKGDAAFWWNL 166
Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
A+ + H+GCPV G+KW
Sbjct: 167 KASGEGERLTMHAGCPVLYGSKW 189
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 38/57 (66%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ +TDVE GGAT+FPSL + + P+KG A FW N A+ + H+GCPV G+KW
Sbjct: 133 IYMTDVEAGGATVFPSLGIRLTPKKGDAAFWWNLKASGEGERLTMHAGCPVLYGSKW 189
>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
Length = 529
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LTDVE GGAT+FP++ VFP+ G+AV WYN + D + H+ CPV +G+KW
Sbjct: 451 LTDVEQGGATVFPNIEKAVFPKSGTAVVWYNLRHDGNGDPQTLHAACPVIVGSKW 505
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+FP++ VFP+ G+AV W+N + D H+ CPV +G+KW
Sbjct: 451 LTDVEQGGATVFPNIEKAVFPKSGTAVVWYNLRHDGNGDPQTLHAACPVIVGSKW 505
>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
Length = 535
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/66 (45%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ LL +TDV+ GGAT+FP +++ V P+KG+AVFWYN + H+ CPV
Sbjct: 448 GNRIATFLLY-MTDVDAGGATVFPIIDVRVLPKKGTAVFWYNLRRSGKGIMETRHAACPV 506
Query: 122 ALGNKW 127
+G KW
Sbjct: 507 LVGTKW 512
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 40/64 (62%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + +TDV+ GGAT+FP +++ V P+KG+AVFW+N + H+ CPV +
Sbjct: 449 NRIATFLLYMTDVDAGGATVFPIIDVRVLPKKGTAVFWYNLRRSGKGIMETRHAACPVLV 508
Query: 62 GNKW 65
G KW
Sbjct: 509 GTKW 512
>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
Length = 487
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DVE GGAT+FP + VFP++GSA+ WYN + + H+GCPV +G+KW
Sbjct: 409 LNDVEQGGATVFPEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAGCPVIVGSKW 463
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GGAT+FP + VFP++GSA+ W+N + + H+GCPV +G+KW
Sbjct: 409 LNDVEQGGATVFPEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAGCPVIVGSKW 463
>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
Length = 581
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 30/68 (44%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ +L+ L++VE GGAT+FP + V P +GSA FWYN H N + H+ CP
Sbjct: 466 IGNRIATVLIY-LSNVEAGGATVFPKAGVRVEPRQGSAAFWYNMHRNGEGNKLSVHAACP 524
Query: 121 VALGNKWG 128
V +G+KW
Sbjct: 525 VLIGSKWA 532
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 33/91 (36%), Positives = 49/91 (53%), Gaps = 9/91 (9%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L++VE GGAT+FP + V P +GSA FW+N H N + H+ CPV +
Sbjct: 468 NRIATVLIYLSNVEAGGATVFPKAGVRVEPRQGSAAFWYNMHRNGEGNKLSVHAACPVLI 527
Query: 62 GNKW---------GKLLLSGLTDVELGGATI 83
G+KW G+ L LT++ + A +
Sbjct: 528 GSKWAANLWFREVGQELRHTLTEINIASAIL 558
>gi|198449641|ref|XP_002136935.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
gi|198130697|gb|EDY67493.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/75 (41%), Positives = 46/75 (61%), Gaps = 1/75 (1%)
Query: 53 YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
Y+ G L ++ LL L+DV LGGAT+FP L+L+VFP++G+ + WYN
Sbjct: 415 YYEGSNPELPDRVATLLFY-LSDVPLGGATVFPLLDLSVFPKRGAVLMWYNLDHKGQGIE 473
Query: 113 RMYHSGCPVALGNKW 127
+ HS CPV +G++W
Sbjct: 474 KTVHSACPVVVGSRW 488
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DV LGGAT+FP L+L+VFP++G+ + W+N HS CPV +G++W
Sbjct: 434 LSDVPLGGATVFPLLDLSVFPKRGAVLMWYNLDHKGQGIEKTVHSACPVVVGSRW 488
>gi|211938649|gb|ACJ13221.1| FI08532p [Drosophila melanogaster]
Length = 543
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 35/55 (63%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L +V GG T FP LNLTVFP+ GSA+FWYN H+GCPV +G+KW
Sbjct: 454 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 508
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +V GG T FP LNLTVFP+ GSA+FW+N H+GCPV +G+KW
Sbjct: 454 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 508
>gi|24651430|ref|NP_733378.1| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
gi|23172699|gb|AAF57061.2| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
Length = 542
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 35/55 (63%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L +V GG T FP LNLTVFP+ GSA+FWYN H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 507
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +V GG T FP LNLTVFP+ GSA+FW+N H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 507
>gi|20269814|gb|AAM18062.1|AF495540_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE2
[Drosophila melanogaster]
gi|19528175|gb|AAL90202.1| AT27756p [Drosophila melanogaster]
Length = 542
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 35/55 (63%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L +V GG T FP LNLTVFP+ GSA+FWYN H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 507
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +V GG T FP LNLTVFP+ GSA+FW+N H+GCPV +G+KW
Sbjct: 453 LNEVRQGGGTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 507
>gi|195159323|ref|XP_002020531.1| GL13463 [Drosophila persimilis]
gi|194117300|gb|EDW39343.1| GL13463 [Drosophila persimilis]
Length = 487
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/67 (43%), Positives = 42/67 (62%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LGN+ +L ++DVE GGAT+F +L ++P++G+A FW N H + D R H+ CP
Sbjct: 394 LGNRIATVLFY-MSDVEQGGATVFTTLRTALWPKRGTAAFWMNLHRDGEGDKRTQHAACP 452
Query: 121 VALGNKW 127
V G KW
Sbjct: 453 VLTGTKW 459
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+F +L ++P++G+A FW N H + D H+ CPV
Sbjct: 396 NRIATVLFYMSDVEQGGATVFTTLRTALWPKRGTAAFWMNLHRDGEGDKRTQHAACPVLT 455
Query: 62 GNKW 65
G KW
Sbjct: 456 GTKW 459
>gi|125772807|ref|XP_001357662.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
gi|54637394|gb|EAL26796.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/67 (43%), Positives = 42/67 (62%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LGN+ +L ++DVE GGAT+F +L ++P++G+A FW N H + D R H+ CP
Sbjct: 456 LGNRIATVLFY-MSDVEQGGATVFTTLRTALWPKRGTAAFWMNLHRDGEGDKRTQHAACP 514
Query: 121 VALGNKW 127
V G KW
Sbjct: 515 VLTGTKW 521
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+F +L ++P++G+A FW N H + D H+ CPV
Sbjct: 458 NRIATVLFYMSDVEQGGATVFTTLRTALWPKRGTAAFWMNLHRDGEGDKRTQHAACPVLT 517
Query: 62 GNKW 65
G KW
Sbjct: 518 GTKW 521
>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
Length = 547
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 32/61 (52%), Positives = 39/61 (63%), Gaps = 12/61 (19%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNA------HANTLLDYRMYHSGCPVALGNK 126
L +V GG T FP LNLTVFP+ GSA+FWYN H +TL H+GCPV +G+K
Sbjct: 458 LNEVRQGGGTHFPRLNLTVFPQPGSALFWYNLDTRGNDHTSTL------HTGCPVIVGSK 511
Query: 127 W 127
W
Sbjct: 512 W 512
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 31/61 (50%), Positives = 39/61 (63%), Gaps = 12/61 (19%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNA------HANTLLDYWMYHSGCPVALGNK 64
L +V GG T FP LNLTVFP+ GSA+FW+N H +TL H+GCPV +G+K
Sbjct: 458 LNEVRQGGGTHFPRLNLTVFPQPGSALFWYNLDTRGNDHTSTL------HTGCPVIVGSK 511
Query: 65 W 65
W
Sbjct: 512 W 512
>gi|198477150|ref|XP_002136737.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
gi|198145042|gb|EDY71754.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/59 (49%), Positives = 40/59 (67%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LL L+DV LGGAT+FP L+L+VFP+KG+ + WYN + HS CPV +G++W
Sbjct: 430 LLFYLSDVPLGGATVFPLLDLSVFPKKGAVLMWYNLDHKGQGMEKTIHSACPVVVGSRW 488
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 27/56 (48%), Positives = 41/56 (73%), Gaps = 2/56 (3%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNA-HANTLLDYWMYHSGCPVALGNKW 65
L+DV LGGAT+FP L+L+VFP+KG+ + W+N H ++ + HS CPV +G++W
Sbjct: 434 LSDVPLGGATVFPLLDLSVFPKKGAVLMWYNLDHKGQGMEKTI-HSACPVVVGSRW 488
>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
Length = 537
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 42/67 (62%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ +L L+DV GG T FP L + P+K +A FWYN HA+ D+R H GCP
Sbjct: 444 MGNRIATVLFY-LSDVAQGGGTAFPILKQLLKPKKYAAAFWYNLHASGKGDWRNLHGGCP 502
Query: 121 VALGNKW 127
+ +G+KW
Sbjct: 503 IIVGSKW 509
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DV GG T FP L + P+K +A FW+N HA+ D+ H GCP+ +
Sbjct: 446 NRIATVLFYLSDVAQGGGTAFPILKQLLKPKKYAAAFWYNLHASGKGDWRNLHGGCPIIV 505
Query: 62 GNKW 65
G+KW
Sbjct: 506 GSKW 509
>gi|194905381|ref|XP_001981186.1| GG11928 [Drosophila erecta]
gi|190655824|gb|EDV53056.1| GG11928 [Drosophila erecta]
Length = 543
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 35/55 (63%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L V GGAT FP LNLTVFP+ GSA+FWYN H+GCPV +G+KW
Sbjct: 454 LNGVRQGGATHFPLLNLTVFPQPGSALFWYNLDTKGNDQRSTMHTGCPVIVGSKW 508
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 30/64 (46%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R + L V GGAT FP LNLTVFP+ GSA+FW+N H+GCPV +
Sbjct: 445 DRIATTVFYLNGVRQGGATHFPLLNLTVFPQPGSALFWYNLDTKGNDQRSTMHTGCPVIV 504
Query: 62 GNKW 65
G+KW
Sbjct: 505 GSKW 508
>gi|67084101|gb|AAY66985.1| truncated prolyl 4-hydroxylase alpha subunit [Ixodes scapularis]
Length = 452
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 34/87 (39%), Positives = 54/87 (62%), Gaps = 7/87 (8%)
Query: 41 NAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF 100
+AH N+ DY++ ++ G++ L++ +TDV+ GGAT+FP L + + P+KG A F
Sbjct: 344 HAH-NSTADYYLQNNE-----GDRLATLMIY-MTDVKEGGATVFPRLGVRLVPKKGDAAF 396
Query: 101 WYNAHANTLLDYRMYHSGCPVALGNKW 127
W+N A+ D H+GCPV G+KW
Sbjct: 397 WWNLKASGEGDTLTMHAGCPVLYGSKW 423
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 27/57 (47%), Positives = 37/57 (64%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ +TDV+ GGAT+FP L + + P+KG A FW N A+ D H+GCPV G+KW
Sbjct: 367 IYMTDVKEGGATVFPRLGVRLVPKKGDAAFWWNLKASGEGDTLTMHAGCPVLYGSKW 423
>gi|195452744|ref|XP_002073481.1| GK14140 [Drosophila willistoni]
gi|194169566|gb|EDW84467.1| GK14140 [Drosophila willistoni]
Length = 454
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 38/55 (69%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GGAT+F +LNLTVFP+ G+A+FW+N D H+GCPV +G+KW
Sbjct: 380 LQDVPQGGATLFNNLNLTVFPKAGAALFWFNLDNAGDTDLFTVHTGCPVIVGSKW 434
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DV GGAT+F +LNLTVFP+ G+A+FW N D + H+GCPV +G+KW
Sbjct: 380 LQDVPQGGATLFNNLNLTVFPKAGAALFWFNLDNAGDTDLFTVHTGCPVIVGSKW 434
>gi|195109817|ref|XP_001999478.1| GI23043 [Drosophila mojavensis]
gi|193916072|gb|EDW14939.1| GI23043 [Drosophila mojavensis]
Length = 491
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/65 (46%), Positives = 41/65 (63%), Gaps = 1/65 (1%)
Query: 63 NKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 122
N+ +LL L +V++GG T FP+L L V P KGSA+ W+N + DYR H+ CP
Sbjct: 405 NRIATILLY-LNEVQMGGITSFPALGLGVQPIKGSALIWHNMNHEIECDYRTLHAACPTL 463
Query: 123 LGNKW 127
LGN+W
Sbjct: 464 LGNRW 468
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L +V++GG T FP+L L V P KGSA+ WHN + DY H+ CP L
Sbjct: 405 NRIATILLYLNEVQMGGITSFPALGLGVQPIKGSALIWHNMNHEIECDYRTLHAACPTLL 464
Query: 62 GNKW 65
GN+W
Sbjct: 465 GNRW 468
>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
Length = 520
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 28/59 (47%), Positives = 38/59 (64%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+L ++DVE GGAT+FP + V PEKG+ FWYN + D H+GCPV +G+KW
Sbjct: 439 MLFYMSDVEAGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKW 497
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP + V PEKG+ FW+N + D H+GCPV +G+KW
Sbjct: 443 MSDVEAGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKW 497
>gi|195379218|ref|XP_002048377.1| GJ13934 [Drosophila virilis]
gi|194155535|gb|EDW70719.1| GJ13934 [Drosophila virilis]
Length = 469
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 43/67 (64%), Gaps = 1/67 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ +DV+LGGATIFP L L V P++GSA+ WYN +A D H+ CPV
Sbjct: 386 GNRIATVIFYA-SDVQLGGATIFPRLQLVVKPKRGSALLWYNLNAAGKPDPLTRHAVCPV 444
Query: 122 ALGNKWG 128
+G++W
Sbjct: 445 VVGSRWA 451
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 38/55 (69%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
+DV+LGGATIFP L L V P++GSA+ W+N +A D H+ CPV +G++W
Sbjct: 397 SDVQLGGATIFPRLQLVVKPKRGSALLWYNLNAAGKPDPLTRHAVCPVVVGSRWA 451
>gi|195591300|ref|XP_002085380.1| GD14756 [Drosophila simulans]
gi|194197389|gb|EDX10965.1| GD14756 [Drosophila simulans]
Length = 477
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 31/61 (50%), Positives = 39/61 (63%), Gaps = 5/61 (8%)
Query: 68 LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+ +S L +VELGGAT+FP L ++VFP+KGS + WYN D R CPV GNKW
Sbjct: 421 IYISLLNNVELGGATVFPKLKISVFPQKGSCLIWYNTP-----DPRSEPLECPVLQGNKW 475
Query: 128 G 128
G
Sbjct: 476 G 476
Score = 59.7 bits (143), Expect = 3e-07, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 35/56 (62%), Gaps = 5/56 (8%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L +VELGGAT+FP L ++VFP+KGS + W+N D CPV GNKWG
Sbjct: 426 LNNVELGGATVFPKLKISVFPQKGSCLIWYNTP-----DPRSEPLECPVLQGNKWG 476
>gi|449668268|ref|XP_002154169.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 531
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 40/67 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+DVE GG T+F L L++ P+ G A FWHN + N Y H+ CPV
Sbjct: 456 NRIATLLIYLSDVERGGDTVFTRLGLSLKPKLGDAAFWHNLYKNGSGIYATEHASCPVVS 515
Query: 62 GNKWGKL 68
G+KWGK+
Sbjct: 516 GSKWGKI 522
Score = 65.5 bits (158), Expect = 7e-09, Method: Composition-based stats.
Identities = 31/69 (44%), Positives = 43/69 (62%), Gaps = 1/69 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ LL+ L+DVE GG T+F L L++ P+ G A FW+N + N Y H+ CPV
Sbjct: 455 GNRIATLLIY-LSDVERGGDTVFTRLGLSLKPKLGDAAFWHNLYKNGSGIYATEHASCPV 513
Query: 122 ALGNKWGKL 130
G+KWGK+
Sbjct: 514 VSGSKWGKI 522
>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
Length = 519
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 28/59 (47%), Positives = 38/59 (64%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+L ++DVE GGAT+FP + V PEKG+ FWYN + D H+GCPV +G+KW
Sbjct: 438 MLFYMSDVEAGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKW 496
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP + V PEKG+ FW+N + D H+GCPV +G+KW
Sbjct: 442 MSDVEAGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKW 496
>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
Length = 460
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 31/79 (39%), Positives = 48/79 (60%), Gaps = 1/79 (1%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D+++Y G + +L L DVEL G+T+FP LN+++ P+KGSAV WYN H +
Sbjct: 358 DFYVYSEPLRFLRGERIVTVLFY-LGDVELSGSTVFPFLNISITPKKGSAVMWYNLHNSG 416
Query: 109 LLDYRMYHSGCPVALGNKW 127
+ + H CPV +G+K+
Sbjct: 417 DVHQKTQHCACPVVVGSKY 435
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVEL G+T+FP LN+++ P+KGSAV W+N H + + H CPV +G+K+
Sbjct: 381 LGDVELSGSTVFPFLNISITPKKGSAVMWYNLHNSGDVHQKTQHCACPVVVGSKY 435
>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
Length = 150
Score = 66.2 bits (160), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 30/55 (54%), Positives = 34/55 (61%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DVE GG T+FP LNL + P K SAVFW+N + D R H CPV LG KW
Sbjct: 76 LNDVEAGGRTVFPRLNLVIEPIKNSAVFWHNLLDDGQQDDRTIHGACPVVLGRKW 130
Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 34/57 (59%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ L DVE GG T+FP LNL + P K SAVFWHN + D H CPV LG KW
Sbjct: 74 IYLNDVEAGGRTVFPRLNLVIEPIKNSAVFWHNLLDDGQQDDRTIHGACPVVLGRKW 130
>gi|195166679|ref|XP_002024162.1| GL22881 [Drosophila persimilis]
gi|194107517|gb|EDW29560.1| GL22881 [Drosophila persimilis]
Length = 288
Score = 66.2 bits (160), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 30/60 (50%), Positives = 40/60 (66%)
Query: 75 DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLLSG 134
D++ GGATIFP +N+TV P KGSA+ W+N + + D R HS CPV G++W K SG
Sbjct: 218 DLQQGGATIFPLINITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGDRWMKQNGSG 277
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/63 (47%), Positives = 39/63 (61%)
Query: 13 DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSG 72
D++ GGATIFP +N+TV P KGSA+ W N + + D HS CPV G++W K SG
Sbjct: 218 DLQQGGATIFPLINITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGDRWMKQNGSG 277
Query: 73 LTD 75
D
Sbjct: 278 SLD 280
>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
Length = 533
Score = 66.2 bits (160), Expect = 5e-09, Method: Composition-based stats.
Identities = 31/75 (41%), Positives = 48/75 (64%), Gaps = 4/75 (5%)
Query: 53 YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
+HS + LG++ +L LTDVE GGAT+F ++ +V+P+ G+A+FWYN + D
Sbjct: 435 HHS---IVLGDRIATVLFY-LTDVEQGGATVFGNVGYSVYPQAGTAIFWYNLDTDGNGDP 490
Query: 113 RMYHSGCPVALGNKW 127
H+ CPV +G+KW
Sbjct: 491 LTRHASCPVVVGSKW 505
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+F ++ +V+P+ G+A+FW+N + D H+ CPV +G+KW
Sbjct: 451 LTDVEQGGATVFGNVGYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPVVVGSKW 505
>gi|268562483|ref|XP_002638619.1| Hypothetical protein CBG05671 [Caenorhabditis briggsae]
Length = 520
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 6/111 (5%)
Query: 21 IFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHS-----GCPVALGNKWGKLLLSGLTD 75
+ P+LN+ + + + H D+ Y S G GN++G L+++ T
Sbjct: 377 LIPTLNIESSEDIVALSYIRGGHYAAHHDFLEYPSEKEWDGWMKDYGNRFGTLIMAFET- 435
Query: 76 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
ELGGATIFPSLN + P G A FW+NA NT + H GCP+ G K
Sbjct: 436 AELGGATIFPSLNAAIRPNTGDAFFWFNAMGNTKQEDLSDHGGCPIYEGKK 486
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 27/50 (54%), Positives = 30/50 (60%)
Query: 15 ELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
ELGGATIFPSLN + P G A FW NA NT + H GCP+ G K
Sbjct: 437 ELGGATIFPSLNAAIRPNTGDAFFWFNAMGNTKQEDLSDHGGCPIYEGKK 486
Score = 45.8 bits (107), Expect = 0.005, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L+ L + GG T+FP +N+ + P+ G V W N + D + H+ CP+
Sbjct: 170 GNRIATVLVI-LQIAKKGGTTVFPKININIRPKIGDVVVWLNTVPDGESDSQTLHAACPI 228
Query: 122 ALGNKWGKLL 131
G K G L
Sbjct: 229 KEGTKIGATL 238
Score = 45.1 bits (105), Expect = 0.009, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 32/68 (47%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L + GG T+FP +N+ + P+ G V W N + D H+ CP+
Sbjct: 171 NRIATVLVILQIAKKGGTTVFPKININIRPKIGDVVVWLNTVPDGESDSQTLHAACPIKE 230
Query: 62 GNKWGKLL 69
G K G L
Sbjct: 231 GTKIGATL 238
>gi|195440206|ref|XP_002067933.1| GK11220 [Drosophila willistoni]
gi|194164018|gb|EDW78919.1| GK11220 [Drosophila willistoni]
Length = 459
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 25/71 (35%), Positives = 46/71 (64%), Gaps = 1/71 (1%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+ LG++ ++ ++V LGGAT+FP L++ + P+KG+ + WYN + ++ D R H+
Sbjct: 371 LTLGDRLASVIFYA-SEVHLGGATVFPRLDVAITPKKGAGLVWYNTYDDSTHDQRSQHAV 429
Query: 119 CPVALGNKWGK 129
CP +G++W K
Sbjct: 430 CPTLMGSRWSK 440
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 21/56 (37%), Positives = 38/56 (67%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGK 67
++V LGGAT+FP L++ + P+KG+ + W+N + ++ D H+ CP +G++W K
Sbjct: 385 SEVHLGGATVFPRLDVAITPKKGAGLVWYNTYDDSTHDQRSQHAVCPTLMGSRWSK 440
>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
Length = 534
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 30/66 (45%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L+ L+ VELGGAT+F S + + P +GSA FWYN H N + H+ CPV
Sbjct: 447 GNRIATMLIY-LSSVELGGATVFSSAGVRIEPRQGSAAFWYNLHRNGNGNNLTRHAACPV 505
Query: 122 ALGNKW 127
+G+KW
Sbjct: 506 LIGSKW 511
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VELGGAT+F S + + P +GSA FW+N H N + H+ CPV +
Sbjct: 448 NRIATMLIYLSSVELGGATVFSSAGVRIEPRQGSAAFWYNLHRNGNGNNLTRHAACPVLI 507
Query: 62 GNKW 65
G+KW
Sbjct: 508 GSKW 511
>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
Length = 485
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 34/55 (61%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DVE GG T+FP + PE+G A+ WYN H N D R H CPV +G+KW
Sbjct: 408 LSDVEQGGDTVFPRIEQAFKPERGKALLWYNLHRNGTGDKRTEHGACPVLVGSKW 462
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 26/59 (44%), Positives = 36/59 (61%)
Query: 7 SSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++ EL+DVE GG T+FP + PE+G A+ W+N H N D H CPV +G+KW
Sbjct: 404 NTAELSDVEQGGDTVFPRIEQAFKPERGKALLWYNLHRNGTGDKRTEHGACPVLVGSKW 462
>gi|194751827|ref|XP_001958225.1| GF23630 [Drosophila ananassae]
gi|190625507|gb|EDV41031.1| GF23630 [Drosophila ananassae]
Length = 431
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 33/69 (47%), Positives = 42/69 (60%), Gaps = 1/69 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G G +L DV+ GGAT+FP L ++VFP+KGS + WYN + LD R HS CP
Sbjct: 360 IGGPVGAILFFISDDVQ-GGATVFPKLKVSVFPKKGSCLVWYNIKDDGRLDPRTTHSICP 418
Query: 121 VALGNKWGK 129
V GN GK
Sbjct: 419 VLEGNSLGK 427
Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats.
Identities = 27/57 (47%), Positives = 36/57 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGK 67
++D GGAT+FP L ++VFP+KGS + W+N + LD HS CPV GN GK
Sbjct: 371 ISDDVQGGATVFPKLKVSVFPKKGSCLVWYNIKDDGRLDPRTTHSICPVLEGNSLGK 427
>gi|195352180|ref|XP_002042592.1| GM14979 [Drosophila sechellia]
gi|194124476|gb|EDW46519.1| GM14979 [Drosophila sechellia]
Length = 461
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 32/60 (53%), Positives = 38/60 (63%), Gaps = 5/60 (8%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
LL L +VELGGAT+FP L ++VFP+KGS + WYN D R CPV GNKWG
Sbjct: 406 LLFFLNNVELGGATVFPKLKISVFPQKGSCLIWYNTP-----DPRSDPLECPVLQGNKWG 460
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 28/59 (47%), Positives = 37/59 (62%), Gaps = 11/59 (18%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNA---HANTLLDYWMYHSGCPVALGNKWG 66
L +VELGGAT+FP L ++VFP+KGS + W+N ++ L CPV GNKWG
Sbjct: 410 LNNVELGGATVFPKLKISVFPQKGSCLIWYNTPDPRSDPL--------ECPVLQGNKWG 460
>gi|442762205|gb|JAA73261.1| Putative prolyl 4-hydroxylase alpha subunit, partial [Ixodes
ricinus]
Length = 482
Score = 65.9 bits (159), Expect = 6e-09, Method: Composition-based stats.
Identities = 29/67 (43%), Positives = 44/67 (65%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G++ L++ ++DVE GGAT+FP L + + P+KG A FW+N AN + H+GCP
Sbjct: 388 VGDRVATLMIY-MSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCP 446
Query: 121 VALGNKW 127
V G+KW
Sbjct: 447 VLYGSKW 453
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R + ++DVE GGAT+FP L + + P+KG A FW N AN + H+GCPV
Sbjct: 390 DRVATLMIYMSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCPVLY 449
Query: 62 GNKW 65
G+KW
Sbjct: 450 GSKW 453
>gi|198418585|ref|XP_002122034.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1 (4-PH
alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 525
Score = 65.9 bits (159), Expect = 6e-09, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ +LL L DV+ GG T F + P KGSAVFWYN + + L D R H+ CP
Sbjct: 432 VGNRIATMLLY-LNDVKEGGRTAFIEPKIVAKPIKGSAVFWYNLYPSGLGDPRTRHASCP 490
Query: 121 VALGNKWG 128
V +GNKW
Sbjct: 491 VVIGNKWA 498
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 28/65 (43%), Positives = 36/65 (55%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DV+ GG T F + P KGSAVFW+N + + L D H+ CPV +
Sbjct: 434 NRIATMLLYLNDVKEGGRTAFIEPKIVAKPIKGSAVFWYNLYPSGLGDPRTRHASCPVVI 493
Query: 62 GNKWG 66
GNKW
Sbjct: 494 GNKWA 498
>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
Length = 502
Score = 65.9 bits (159), Expect = 6e-09, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 38/56 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
+TDV+ GGAT+FP+L +T FP+KGSA+ + N N D H+GCPV G+KW
Sbjct: 408 MTDVQQGGATVFPALRITNFPKKGSALIFRNLDNNISPDPSTLHAGCPVLFGSKWA 463
Score = 65.5 bits (158), Expect = 6e-09, Method: Composition-based stats.
Identities = 28/58 (48%), Positives = 39/58 (67%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
+ +TDV+ GGAT+FP+L +T FP+KGSA+ + N N D H+GCPV G+KW
Sbjct: 406 IYMTDVQQGGATVFPALRITNFPKKGSALIFRNLDNNISPDPSTLHAGCPVLFGSKWA 463
>gi|195390805|ref|XP_002054058.1| GJ23004 [Drosophila virilis]
gi|194152144|gb|EDW67578.1| GJ23004 [Drosophila virilis]
Length = 446
Score = 65.5 bits (158), Expect = 6e-09, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DVELGGAT FP LNLT+ PEKG+AV WHN + H+ CPV +G+K+
Sbjct: 368 LSDVELGGATTFPLLNLTISPEKGTAVLWHNLKDSGTPHPKTVHAACPVIVGSKY 422
Score = 65.5 bits (158), Expect = 8e-09, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DVELGGAT FP LNLT+ PEKG+AV W+N + + H+ CPV +G+K+
Sbjct: 368 LSDVELGGATTFPLLNLTISPEKGTAVLWHNLKDSGTPHPKTVHAACPVIVGSKY 422
>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
Length = 211
Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 68/142 (47%), Gaps = 14/142 (9%)
Query: 1 MNRAEM-SSVELTDVELGGATIFPSLNLTVFPEKGSA----VFWHNAHANTLLDY----- 50
MNR+++ S E++D+ +T P +LT EK A V + +L+Y
Sbjct: 59 MNRSKIGSQHEVSDIRTSSSTFLPEDDLTNRIEKRVAQIMNVPVEHGEGLHILNYKQGQE 118
Query: 51 ----WMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 106
+ Y A N L+ L DVE GG T FP +NL++ P KG AV++ ++
Sbjct: 119 YKAHYDYFRSKAKAANNPRISTLVLYLNDVEEGGETYFPHMNLSISPHKGMAVYFEYFYS 178
Query: 107 NTLLDYRMYHSGCPVALGNKWG 128
+ L++ R H G PV G KW
Sbjct: 179 DPLINERTLHGGSPVTSGEKWA 200
>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
Length = 549
Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DVE GGAT+FP+LN+ ++P+KGSA FWYN N + H+ CPV G+KW
Sbjct: 470 MSDVEAGGATVFPALNVALWPQKGSAAFWYNLFPNGEGNELTRHAACPVLTGSKW 524
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 39/55 (70%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DVE GGAT+FP+LN+ ++P+KGSA FW+N N + H+ CPV G+KW
Sbjct: 470 MSDVEAGGATVFPALNVALWPQKGSAAFWYNLFPNGEGNELTRHAACPVLTGSKW 524
>gi|443721482|gb|ELU10773.1| hypothetical protein CAPTEDRAFT_174752 [Capitella teleta]
Length = 525
Score = 65.5 bits (158), Expect = 8e-09, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 41/77 (53%), Gaps = 4/77 (5%)
Query: 54 HSGCPV---ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLL 110
HS P A GN+ + L DV GGAT+FP L + + P K A FWYN N +
Sbjct: 428 HSSLPEHVRASGNRLATFMFY-LNDVHAGGATVFPKLKVGIPPTKNGAAFWYNIGLNGDV 486
Query: 111 DYRMYHSGCPVALGNKW 127
D H+GCPV LG KW
Sbjct: 487 DPLTEHAGCPVLLGQKW 503
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L DV GGAT+FP L + + P K A FW+N N +D H+GCPV L
Sbjct: 440 NRLATFMFYLNDVHAGGATVFPKLKVGIPPTKNGAAFWYNIGLNGDVDPLTEHAGCPVLL 499
Query: 62 GNKW 65
G KW
Sbjct: 500 GQKW 503
>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
Length = 264
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 33/57 (57%), Positives = 39/57 (68%), Gaps = 1/57 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 128
L DVE GG T FP+LNLTV P KG+AV WY+A+ NT +D R H PVA G K+G
Sbjct: 188 LNDVEEGGGTRFPNLNLTVQPAKGNAVLWYSAYPNTTRMDSRTDHEAMPVAKGMKYG 244
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/57 (54%), Positives = 38/57 (66%), Gaps = 1/57 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKWG 66
L DVE GG T FP+LNLTV P KG+AV W++A+ NT +D H PVA G K+G
Sbjct: 188 LNDVEEGGGTRFPNLNLTVQPAKGNAVLWYSAYPNTTRMDSRTDHEAMPVAKGMKYG 244
>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
Length = 522
Score = 65.5 bits (158), Expect = 8e-09, Method: Composition-based stats.
Identities = 32/58 (55%), Positives = 35/58 (60%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LS LT VE GG T+F L L + KGSAVFWYN N D R H+ CPV GNKW
Sbjct: 437 LSYLTTVEQGGGTVFTELGLHIRSIKGSAVFWYNLLPNGSGDERTRHAACPVLRGNKW 494
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LT VE GG T+F L L + KGSAVFW+N N D H+ CPV GNKW
Sbjct: 440 LTTVEQGGGTVFTELGLHIRSIKGSAVFWYNLLPNGSGDERTRHAACPVLRGNKW 494
>gi|321461762|gb|EFX72791.1| hypothetical protein DAPPUDRAFT_308081 [Daphnia pulex]
Length = 561
Score = 65.5 bits (158), Expect = 8e-09, Method: Composition-based stats.
Identities = 28/59 (47%), Positives = 37/59 (62%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ L+DVE GGAT+FP +T +P KGSA FW+N + + D H CPV G+KW
Sbjct: 473 LMVYLSDVEAGGATVFPRAGVTCWPRKGSAAFWWNLYKSGEPDLTTRHGACPVLHGSKW 531
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 28/57 (49%), Positives = 35/57 (61%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L+DVE GGAT+FP +T +P KGSA FW N + + D H CPV G+KW
Sbjct: 475 VYLSDVEAGGATVFPRAGVTCWPRKGSAAFWWNLYKSGEPDLTTRHGACPVLHGSKW 531
>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
Length = 534
Score = 65.1 bits (157), Expect = 8e-09, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 43/69 (62%), Gaps = 1/69 (1%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+ LG++ +L LTDVE GGAT+F + V P+ G+A+FWYN + D R H+
Sbjct: 439 IDLGDRIATVLFY-LTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAA 497
Query: 119 CPVALGNKW 127
CPV +G+KW
Sbjct: 498 CPVIVGSKW 506
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+F + V P+ G+A+FW+N + D H+ CPV +G+KW
Sbjct: 452 LTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKW 506
>gi|198417608|ref|XP_002125299.1| PREDICTED: similar to prolyl-4-hydroxylase-alpha EFB CG31022-PA
[Ciona intestinalis]
Length = 471
Score = 65.1 bits (157), Expect = 8e-09, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 37/56 (66%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L++V+ GG+T F N+ P KGSAVFWYN + + LD R H+ CPV +GNKW
Sbjct: 389 LSEVQKGGSTAFFYPNIVAEPIKGSAVFWYNLYPSGALDKRTLHAACPVLIGNKWA 444
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 39/64 (60%)
Query: 3 RAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG 62
R + V L++V+ GG+T F N+ P KGSAVFW+N + + LD H+ CPV +G
Sbjct: 381 RIATALVYLSEVQKGGSTAFFYPNIVAEPIKGSAVFWYNLYPSGALDKRTLHAACPVLIG 440
Query: 63 NKWG 66
NKW
Sbjct: 441 NKWA 444
>gi|195577074|ref|XP_002078398.1| GD23422 [Drosophila simulans]
gi|194190407|gb|EDX03983.1| GD23422 [Drosophila simulans]
Length = 513
Score = 65.1 bits (157), Expect = 8e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+TD+ +GGATIFP L + P+KGSA+FWYN H N + H+ CP +G++W
Sbjct: 440 MTDIPVGGATIFPGAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 494
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+TD+ +GGATIFP L + P+KGSA+FW+N H N + H+ CP +G++W
Sbjct: 440 MTDIPVGGATIFPGAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 494
>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
Length = 534
Score = 65.1 bits (157), Expect = 8e-09, Method: Composition-based stats.
Identities = 40/126 (31%), Positives = 64/126 (50%), Gaps = 18/126 (14%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKL 68
V++T +L + F +N + G F H +DY+ Y S ++ K+
Sbjct: 392 VDMTGFDLADSEDFQVINYGI----GGHYFLH-------MDYFDYASSNYTGPRSRQSKV 440
Query: 69 L-------LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
L L L+DVE GGAT+F ++ +V+P+ G+A+FWYN + D H+ CPV
Sbjct: 441 LGDRIATVLFYLSDVEQGGATVFGNVGYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPV 500
Query: 122 ALGNKW 127
+G+KW
Sbjct: 501 IVGSKW 506
>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
Length = 534
Score = 65.1 bits (157), Expect = 9e-09, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 43/69 (62%), Gaps = 1/69 (1%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+ LG++ +L LTDVE GGAT+F + V P+ G+A+FWYN + D R H+
Sbjct: 439 IDLGDRIATVLFY-LTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAA 497
Query: 119 CPVALGNKW 127
CPV +G+KW
Sbjct: 498 CPVIVGSKW 506
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+F + V P+ G+A+FW+N + D H+ CPV +G+KW
Sbjct: 452 LTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKW 506
>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
Length = 537
Score = 65.1 bits (157), Expect = 9e-09, Method: Composition-based stats.
Identities = 30/66 (45%), Positives = 39/66 (59%), Gaps = 3/66 (4%)
Query: 65 WGKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
WG + + L +DVE GGAT+FP V+P KGSA FWYN + N + H+ CPV
Sbjct: 447 WGNRIATWLFYMSDVEAGGATVFPPTGAAVWPRKGSAAFWYNLYPNGKGNELTRHAACPV 506
Query: 122 ALGNKW 127
G+KW
Sbjct: 507 LSGSKW 512
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE GGAT+FP V+P KGSA FW+N + N + H+ CPV
Sbjct: 449 NRIATWLFYMSDVEAGGATVFPPTGAAVWPRKGSAAFWYNLYPNGKGNELTRHAACPVLS 508
Query: 62 GNKW 65
G+KW
Sbjct: 509 GSKW 512
>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
Length = 521
Score = 65.1 bits (157), Expect = 9e-09, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DVE GGAT+FP + VFP++GSA+ WYN + + H+ CPV +G+KW
Sbjct: 446 LNDVEQGGATVFPEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAACPVIVGSKW 500
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GGAT+FP + VFP++GSA+ W+N + + H+ CPV +G+KW
Sbjct: 446 LNDVEQGGATVFPEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAACPVIVGSKW 500
>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
Length = 541
Score = 65.1 bits (157), Expect = 9e-09, Method: Composition-based stats.
Identities = 28/59 (47%), Positives = 37/59 (62%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LS L+DVE GG T+F + TV+P+KG A FWYN + D H+ CPV +G+KW
Sbjct: 449 FLSYLSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKW 507
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DVE GG T+F + TV+P+KG A FW+N + D H+ CPV +G+KW
Sbjct: 453 LSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKW 507
>gi|195471732|ref|XP_002088156.1| GE14021 [Drosophila yakuba]
gi|194174257|gb|EDW87868.1| GE14021 [Drosophila yakuba]
Length = 265
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 46/78 (58%)
Query: 50 YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
Y+ HS + LL + ++DV +GGATIFP+ L + P+KGSA+FWYN H N
Sbjct: 169 YFQEHSDYVDIKLHPASSLLPTSISDVPVGGATIFPAAKLAIQPKKGSALFWYNLHNNGD 228
Query: 110 LDYRMYHSGCPVALGNKW 127
+ H+ CP +G++W
Sbjct: 229 PNPLTRHAVCPTIVGSRW 246
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 38/55 (69%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DV +GGATIFP+ L + P+KGSA+FW+N H N + H+ CP +G++W
Sbjct: 192 ISDVPVGGATIFPAAKLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 246
>gi|194765140|ref|XP_001964685.1| GF23318 [Drosophila ananassae]
gi|190614957|gb|EDV30481.1| GF23318 [Drosophila ananassae]
Length = 412
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/59 (47%), Positives = 37/59 (62%), Gaps = 4/59 (6%)
Query: 73 LTDVELGGATIFPSLN----LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DVE GGAT+FP ++ TV+P G+A WYN H + L D H CPV +G+KW
Sbjct: 327 LSDVEQGGATVFPGISADSAYTVYPRAGTAAMWYNLHTDGLGDPTTLHVACPVIVGSKW 385
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 4/66 (6%)
Query: 4 AEMSSVELTDVELGGATIFPSLN----LTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPV 59
A+ + +L+DVE GGAT+FP ++ TV+P G+A W+N H + L D H CPV
Sbjct: 320 ADAENFQLSDVEQGGATVFPGISADSAYTVYPRAGTAAMWYNLHTDGLGDPTTLHVACPV 379
Query: 60 ALGNKW 65
+G+KW
Sbjct: 380 IVGSKW 385
>gi|386771382|ref|NP_649044.3| CG18233 [Drosophila melanogaster]
gi|383291998|gb|AAF49254.3| CG18233 [Drosophila melanogaster]
Length = 515
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/67 (41%), Positives = 44/67 (65%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LG++ G ++ +V GG T+FP L + V P+KG+A+FW+NA ++ D R HS CP
Sbjct: 426 LGDRIGSIIFYA-GEVSQGGQTVFPDLKVAVEPKKGNALFWFNAFDDSTPDPRSLHSVCP 484
Query: 121 VALGNKW 127
V +G++W
Sbjct: 485 VLVGSRW 491
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/53 (45%), Positives = 35/53 (66%)
Query: 13 DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+V GG T+FP L + V P+KG+A+FW NA ++ D HS CPV +G++W
Sbjct: 439 EVSQGGQTVFPDLKVAVEPKKGNALFWFNAFDDSTPDPRSLHSVCPVLVGSRW 491
>gi|198466399|ref|XP_002135181.1| GA23909 [Drosophila pseudoobscura pseudoobscura]
gi|198150582|gb|EDY73808.1| GA23909 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/67 (40%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LG++ G ++ ++V GGATIFP + +TV P+KG+++FW+N ++ D R H+ CP
Sbjct: 439 LGDRTGSIIFYA-SEVPQGGATIFPDIQVTVTPQKGNSLFWFNTFDDSTPDPRSLHAICP 497
Query: 121 VALGNKW 127
V G++W
Sbjct: 498 VIAGSRW 504
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/54 (42%), Positives = 36/54 (66%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++V GGATIFP + +TV P+KG+++FW N ++ D H+ CPV G++W
Sbjct: 451 SEVPQGGATIFPDIQVTVTPQKGNSLFWFNTFDDSTPDPRSLHAICPVIAGSRW 504
>gi|195392288|ref|XP_002054791.1| GJ24631 [Drosophila virilis]
gi|194152877|gb|EDW68311.1| GJ24631 [Drosophila virilis]
Length = 499
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 30/65 (46%), Positives = 40/65 (61%), Gaps = 1/65 (1%)
Query: 63 NKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 122
N+ +LL LTDV+LGG T FP+L L V P GSA+ W+N + D R H+ CP+
Sbjct: 413 NRIATILLY-LTDVQLGGLTSFPALGLAVQPSPGSALIWHNMNNAAECDRRTLHAACPLL 471
Query: 123 LGNKW 127
LG +W
Sbjct: 472 LGTRW 476
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + LTDV+LGG T FP+L L V P GSA+ WHN + D H+ CP+ L
Sbjct: 413 NRIATILLYLTDVQLGGLTSFPALGLAVQPSPGSALIWHNMNNAAECDRRTLHAACPLLL 472
Query: 62 GNKW 65
G +W
Sbjct: 473 GTRW 476
>gi|195128347|ref|XP_002008625.1| GI13597 [Drosophila mojavensis]
gi|193920234|gb|EDW19101.1| GI13597 [Drosophila mojavensis]
Length = 457
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/54 (50%), Positives = 36/54 (66%)
Query: 75 DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
DV +GGATIFP L L V P +G+A+ WYN +A+ D H+ CPV LG++W
Sbjct: 385 DVAIGGATIFPKLRLLVQPRRGTALLWYNLNADGAADPLAKHAVCPVVLGSRWA 438
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 36/54 (66%)
Query: 13 DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
DV +GGATIFP L L V P +G+A+ W+N +A+ D H+ CPV LG++W
Sbjct: 385 DVAIGGATIFPKLRLLVQPRRGTALLWYNLNADGAADPLAKHAVCPVVLGSRWA 438
>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
Length = 235
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DV GGAT+F L L+VFP +GSAVFW N H + D H+ CPV
Sbjct: 132 GNRIATVLFY-MSDVAQGGATVFTELGLSVFPRRGSAVFWLNLHPSGEGDLATRHAACPV 190
Query: 122 ALGNKW 127
G+KW
Sbjct: 191 LRGSKW 196
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DV GGAT+F L L+VFP +GSAVFW N H + D H+ CPV G+KW
Sbjct: 142 MSDVAQGGATVFTELGLSVFPRRGSAVFWLNLHPSGEGDLATRHAACPVLRGSKW 196
>gi|161076739|ref|NP_001097101.1| CG34345 [Drosophila melanogaster]
gi|157400090|gb|ABV53635.1| CG34345 [Drosophila melanogaster]
Length = 504
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+TDV +GG TIFP L + P+KGSA+FWYN H N + H+ CP +G++W
Sbjct: 431 MTDVPVGGTTIFPGAQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGSRW 485
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+TDV +GG TIFP L + P+KGSA+FW+N H N + H+ CP +G++W
Sbjct: 431 MTDVPVGGTTIFPGAQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGSRW 485
>gi|92109908|gb|ABE73278.1| IP10618p [Drosophila melanogaster]
Length = 501
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+TDV +GG TIFP L + P+KGSA+FWYN H N + H+ CP +G++W
Sbjct: 428 MTDVPVGGTTIFPGAQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGSRW 482
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+TDV +GG TIFP L + P+KGSA+FW+N H N + H+ CP +G++W
Sbjct: 428 MTDVPVGGTTIFPGAQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGSRW 482
>gi|195591298|ref|XP_002085379.1| GD14755 [Drosophila simulans]
gi|194197388|gb|EDX10964.1| GD14755 [Drosophila simulans]
Length = 515
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/67 (41%), Positives = 44/67 (65%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LG++ G ++ +V GG T+FP L + V P+KG+A+FW+NA ++ D R HS CP
Sbjct: 426 LGDRIGSIIFYA-GEVSQGGQTVFPDLKVAVEPKKGNALFWFNAFDDSSPDPRTLHSVCP 484
Query: 121 VALGNKW 127
V +G++W
Sbjct: 485 VIVGSRW 491
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/53 (45%), Positives = 35/53 (66%)
Query: 13 DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+V GG T+FP L + V P+KG+A+FW NA ++ D HS CPV +G++W
Sbjct: 439 EVSQGGQTVFPDLKVAVEPKKGNALFWFNAFDDSSPDPRTLHSVCPVIVGSRW 491
>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
Length = 525
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 29/67 (43%), Positives = 41/67 (61%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ +L L+DV GG T FP L + + P+K +A FWYN HA+ + D R H CP
Sbjct: 413 MGNRIATVLFY-LSDVTQGGGTAFPHLRVLLKPKKYAAAFWYNLHASGVGDPRTQHGACP 471
Query: 121 VALGNKW 127
+ G+KW
Sbjct: 472 IISGSKW 478
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DV GG T FP L + + P+K +A FW+N HA+ + D H CP+
Sbjct: 415 NRIATVLFYLSDVTQGGGTAFPHLRVLLKPKKYAAAFWYNLHASGVGDPRTQHGACPIIS 474
Query: 62 GNKW 65
G+KW
Sbjct: 475 GSKW 478
>gi|195505253|ref|XP_002099424.1| GE23369 [Drosophila yakuba]
gi|194185525|gb|EDW99136.1| GE23369 [Drosophila yakuba]
Length = 164
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DVE GGAT+FP T++P G+A+ WYN H + D H+ CPV +G+KW
Sbjct: 88 LSDVEQGGATVFPMSGYTIYPRAGTALLWYNLHTDGHCDPSTLHAACPVMVGSKW 142
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 38/61 (62%)
Query: 5 EMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
E +L+DVE GGAT+FP T++P G+A+ W+N H + D H+ CPV +G+K
Sbjct: 82 EGEKFQLSDVEQGGATVFPMSGYTIYPRAGTALLWYNLHTDGHCDPSTLHAACPVMVGSK 141
Query: 65 W 65
W
Sbjct: 142 W 142
>gi|194765182|ref|XP_001964706.1| GF22908 [Drosophila ananassae]
gi|190614978|gb|EDV30502.1| GF22908 [Drosophila ananassae]
Length = 509
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/63 (44%), Positives = 42/63 (66%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R S L++V+ GG T+FP+LN+ + P KGS V WHN H + +D +H+GCPV +
Sbjct: 415 DRISTSMFYLSNVQQGGYTVFPNLNVFLPPVKGSMVLWHNLHYSLDVDARTWHAGCPVIV 474
Query: 62 GNK 64
G+K
Sbjct: 475 GSK 477
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 40/54 (74%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L++V+ GG T+FP+LN+ + P KGS V W+N H + +D R +H+GCPV +G+K
Sbjct: 424 LSNVQQGGYTVFPNLNVFLPPVKGSMVLWHNLHYSLDVDARTWHAGCPVIVGSK 477
>gi|195064500|ref|XP_001996577.1| GH12091 [Drosophila grimshawi]
gi|193895397|gb|EDV94263.1| GH12091 [Drosophila grimshawi]
Length = 521
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/67 (40%), Positives = 43/67 (64%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LG++ ++ +DV LGGAT+FP + + V P+KG+++ WYN + D R HS CP
Sbjct: 434 LGDRLASIIFYS-SDVPLGGATVFPDIQVAVQPQKGNSLLWYNLFDDGTPDPRSLHSVCP 492
Query: 121 VALGNKW 127
V +G++W
Sbjct: 493 VVVGSRW 499
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 23/54 (42%), Positives = 36/54 (66%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+DV LGGAT+FP + + V P+KG+++ W+N + D HS CPV +G++W
Sbjct: 446 SDVPLGGATVFPDIQVAVQPQKGNSLLWYNLFDDGTPDPRSLHSVCPVVVGSRW 499
>gi|443707037|gb|ELU02831.1| hypothetical protein CAPTEDRAFT_181697 [Capitella teleta]
Length = 538
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 26/51 (50%), Positives = 33/51 (64%)
Query: 77 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
E GGAT+FP L + ++PEKGS W+N N DYR H+GCP G+KW
Sbjct: 465 EAGGATVFPDLGVKLWPEKGSCAVWWNLMRNGEGDYRTKHAGCPTITGSKW 515
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 31/51 (60%)
Query: 15 ELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
E GGAT+FP L + ++PEKGS W N N DY H+GCP G+KW
Sbjct: 465 EAGGATVFPDLGVKLWPEKGSCAVWWNLMRNGEGDYRTKHAGCPTITGSKW 515
>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 286
Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 43/73 (58%)
Query: 55 SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
S +A G + L+ L +VE GGATIFP L L+V P+KGSAV++ ++ LD R
Sbjct: 200 SAVQMATGGQRVSTLVMYLNEVEDGGATIFPELGLSVLPKKGSAVYFEYTNSRGQLDPRT 259
Query: 115 YHSGCPVALGNKW 127
H G PV G KW
Sbjct: 260 LHGGAPVLRGEKW 272
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +VE GGATIFP L L+V P+KGSAV++ ++ LD H G PV G KW
Sbjct: 218 LNEVEDGGATIFPELGLSVLPKKGSAVYFEYTNSRGQLDPRTLHGGAPVLRGEKW 272
>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
Length = 495
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V ++ VE GGAT+FP + + P+KG AVFWHN + D+ H+GCPV
Sbjct: 409 NRIATVLVWMSQVESGGATVFPYVGARILPQKGDAVFWHNLLRSGDGDFRTRHAGCPVLS 468
Query: 62 GNKW 65
G KW
Sbjct: 469 GIKW 472
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L+ ++ VE GGAT+FP + + P+KG AVFW+N + D+R H+GCPV
Sbjct: 408 GNRIATVLV-WMSQVESGGATVFPYVGARILPQKGDAVFWHNLLRSGDGDFRTRHAGCPV 466
Query: 122 ALGNKW 127
G KW
Sbjct: 467 LSGIKW 472
>gi|390176836|ref|XP_003736216.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388858809|gb|EIM52289.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 567
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D++ H+ G++ ++ L DVE GGAT FP+L+L V E+G+ +FW+N T
Sbjct: 459 DFFSVHTPTSRLHGDRIATVIFY-LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGET 517
Query: 109 L-LDYRMYHSGCPVALGNK 126
LDYR H CPV +G K
Sbjct: 518 YDLDYRTLHGACPVIVGTK 536
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L DVE GGAT FP+L+L V E+G+ +FWHN T LDY H CPV +G K
Sbjct: 482 LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 536
>gi|198449518|ref|XP_002136915.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198130643|gb|EDY67473.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 543
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D++ H+ G++ ++ L DVE GGAT FP+L+L V E+G+ +FW+N T
Sbjct: 435 DFFSVHTPTSRLHGDRIATVIFY-LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGET 493
Query: 109 L-LDYRMYHSGCPVALGNK 126
LDYR H CPV +G K
Sbjct: 494 YDLDYRTLHGACPVIVGTK 512
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L DVE GGAT FP+L+L V E+G+ +FWHN T LDY H CPV +G K
Sbjct: 458 LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 512
>gi|198449506|ref|XP_002136910.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
gi|198130637|gb|EDY67468.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
Length = 543
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D++ H+ G++ ++ L DVE GGAT FP+L+L V E+G+ +FW+N T
Sbjct: 435 DFFSVHTPTSRLHGDRIATVIFY-LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGET 493
Query: 109 L-LDYRMYHSGCPVALGNK 126
LDYR H CPV +G K
Sbjct: 494 YDLDYRTLHGACPVIVGTK 512
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L DVE GGAT FP+L+L V E+G+ +FWHN T LDY H CPV +G K
Sbjct: 458 LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 512
>gi|195159160|ref|XP_002020450.1| GL13507 [Drosophila persimilis]
gi|194117219|gb|EDW39262.1| GL13507 [Drosophila persimilis]
Length = 543
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D++ H+ G++ ++ L DVE GGAT FP+L+L V E+G+ +FW+N T
Sbjct: 435 DFFSVHTPTSRLHGDRIATVIFY-LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGET 493
Query: 109 L-LDYRMYHSGCPVALGNK 126
LDYR H CPV +G K
Sbjct: 494 YDLDYRTLHGACPVIVGTK 512
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L DVE GGAT FP+L+L V E+G+ +FWHN T LDY H CPV +G K
Sbjct: 458 LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 512
>gi|195159148|ref|XP_002020444.1| GL13996 [Drosophila persimilis]
gi|194117213|gb|EDW39256.1| GL13996 [Drosophila persimilis]
Length = 559
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D++ H+ G++ ++ L DVE GGAT FP+L+L V E+G+ +FW+N T
Sbjct: 451 DFFSVHTPTSRLHGDRIATVIFY-LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGET 509
Query: 109 L-LDYRMYHSGCPVALGNK 126
LDYR H CPV +G K
Sbjct: 510 YDLDYRTLHGACPVIVGTK 528
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L DVE GGAT FP+L+L V E+G+ +FWHN T LDY H CPV +G K
Sbjct: 474 LNDVEHGGATAFPNLDLVVPTERGAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 528
>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
Length = 193
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/74 (41%), Positives = 44/74 (59%), Gaps = 1/74 (1%)
Query: 54 HSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR 113
S + LG++ +L LTDVE GGAT+F + V P+ G+A+FWYN + D R
Sbjct: 93 RSRYSIDLGDRIATVLFY-LTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPR 151
Query: 114 MYHSGCPVALGNKW 127
H+ CPV +G+KW
Sbjct: 152 TRHAACPVIVGSKW 165
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+F + V P+ G+A+FW+N + D H+ CPV +G+KW
Sbjct: 111 LTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKW 165
>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
Length = 521
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 53/110 (48%), Gaps = 8/110 (7%)
Query: 24 SLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL------GNKWGKLLLSGLTDVE 77
+LN+ + A + H +D W Y + L GN+ +L L+DV
Sbjct: 371 NLNMKYAEDHQFANYGIGGHYGQHMD-WFYQTTIDAGLISSPEMGNRIATVLFY-LSDVS 428
Query: 78 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
GG T FP L + P+K +A FW+N HA+ + D R H CP+ G+KW
Sbjct: 429 QGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 478
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DV GG T FP L + P+K +A FWHN HA+ + D H CP+
Sbjct: 415 NRIATVLFYLSDVSQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIA 474
Query: 62 GNKW 65
G+KW
Sbjct: 475 GSKW 478
>gi|195438148|ref|XP_002066999.1| GK24258 [Drosophila willistoni]
gi|194163084|gb|EDW77985.1| GK24258 [Drosophila willistoni]
Length = 217
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G++ +L L+DV GG TIFP N+TV P+KGSA+FW+N H + + + H CP+
Sbjct: 150 GDRLATILFY-LSDVAQGGHTIFPLANVTVQPKKGSALFWFNLHNDGEPNIKSLHGVCPI 208
Query: 122 ALGNKWGK 129
GN+W K
Sbjct: 209 IEGNRWSK 216
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 36/57 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGK 67
L+DV GG TIFP N+TV P+KGSA+FW N H + + H CP+ GN+W K
Sbjct: 160 LSDVAQGGHTIFPLANVTVQPKKGSALFWFNLHNDGEPNIKSLHGVCPIIEGNRWSK 216
>gi|341884171|gb|EGT40106.1| CBN-PHY-2 protein [Caenorhabditis brenneri]
Length = 607
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 30/66 (45%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++ ELGGAT+F L VFP K A+FWYN + D R H+ CPV
Sbjct: 505 GNRIATVLFY-MSQPELGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPV 563
Query: 122 ALGNKW 127
LG KW
Sbjct: 564 LLGVKW 569
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 34/64 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++ ELGGAT+F L VFP K A+FW+N + D H+ CPV L
Sbjct: 506 NRIATVLFYMSQPELGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLL 565
Query: 62 GNKW 65
G KW
Sbjct: 566 GVKW 569
>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
Length = 498
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 31/86 (36%), Positives = 50/86 (58%), Gaps = 1/86 (1%)
Query: 42 AHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFW 101
H + D Y + LG++ +L + +V+ GGAT FP +N++V P+KGSAV W
Sbjct: 386 GHYDVHYDSHNYSEANRLILGDRIATVLFY-VGEVDSGGATTFPYINVSVTPKKGSAVLW 444
Query: 102 YNAHANTLLDYRMYHSGCPVALGNKW 127
YN ++ + H+GCPV +G+K+
Sbjct: 445 YNLDNAGQMNPKAIHAGCPVIVGSKY 470
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/53 (45%), Positives = 37/53 (69%)
Query: 13 DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+V+ GGAT FP +N++V P+KGSAV W+N ++ H+GCPV +G+K+
Sbjct: 418 EVDSGGATTFPYINVSVTPKKGSAVLWYNLDNAGQMNPKAIHAGCPVIVGSKY 470
>gi|443697961|gb|ELT98195.1| hypothetical protein CAPTEDRAFT_181380 [Capitella teleta]
Length = 530
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 33/55 (60%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+++E GG T++P++ V P K S WYN N DYR YH+ CP+ G KW
Sbjct: 453 LSELEAGGYTVYPTVGAAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILYGYKW 507
Score = 58.9 bits (141), Expect = 7e-07, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 34/64 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+++E GG T++P++ V P K S W+N N DY YH+ CP+
Sbjct: 444 NRVATFICYLSELEAGGYTVYPTVGAAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILY 503
Query: 62 GNKW 65
G KW
Sbjct: 504 GYKW 507
>gi|195166677|ref|XP_002024161.1| GL22880 [Drosophila persimilis]
gi|194107516|gb|EDW29559.1| GL22880 [Drosophila persimilis]
Length = 507
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 44/67 (65%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LG++ G ++ ++V GG TIFP + +TV P+KG+++FW+N ++ D R H+ CP
Sbjct: 382 LGDRTGSIIFYA-SEVPQGGTTIFPDIQVTVTPQKGNSLFWFNTFDDSTPDPRSLHAICP 440
Query: 121 VALGNKW 127
V G++W
Sbjct: 441 VIAGSRW 447
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 35/54 (64%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++V GG TIFP + +TV P+KG+++FW N ++ D H+ CPV G++W
Sbjct: 394 SEVPQGGTTIFPDIQVTVTPQKGNSLFWFNTFDDSTPDPRSLHAICPVIAGSRW 447
>gi|449673565|ref|XP_002167120.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 571
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 30/69 (43%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+ +GN+ +L L+DVE GGAT+F VFP KG AVFW+N N + H+
Sbjct: 481 LGMGNRIATVLFY-LSDVEAGGATVFTVGKTAVFPSKGDAVFWFNLKRNGKGNPNTRHAA 539
Query: 119 CPVALGNKW 127
CPV +G KW
Sbjct: 540 CPVLVGQKW 548
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 34/64 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DVE GGAT+F VFP KG AVFW N N + H+ CPV +
Sbjct: 485 NRIATVLFYLSDVEAGGATVFTVGKTAVFPSKGDAVFWFNLKRNGKGNPNTRHAACPVLV 544
Query: 62 GNKW 65
G KW
Sbjct: 545 GQKW 548
>gi|427783867|gb|JAA57385.1| Putative prolyl 4-hydroxylase subunit alpha-1 [Rhipicephalus
pulchellus]
Length = 548
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 27/59 (45%), Positives = 40/59 (67%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ L+DVE GGAT+FP L + + P+KG+A FW+N +++ + H GCPV G+KW
Sbjct: 461 LMFYLSDVEEGGATVFPHLGVRLTPKKGNAAFWWNLNSDGEGEQLTKHGGCPVLYGSKW 519
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DVE GGAT+FP L + + P+KG+A FW N +++ + H GCPV G+KW
Sbjct: 465 LSDVEEGGATVFPHLGVRLTPKKGNAAFWWNLNSDGEGEQLTKHGGCPVLYGSKW 519
>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
Length = 547
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 28/67 (41%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ +L L+DV GG T FP L + P+K +A FW+N HA+ + D R H CP
Sbjct: 444 MGNRIATVLFY-LSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACP 502
Query: 121 VALGNKW 127
+ G+KW
Sbjct: 503 IIAGSKW 509
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DV GG T FP L + P+K +A FWHN HA+ + D H CP+
Sbjct: 446 NRIATVLFYLSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIA 505
Query: 62 GNKW 65
G+KW
Sbjct: 506 GSKW 509
>gi|195452728|ref|XP_002073474.1| GK14137 [Drosophila willistoni]
gi|194169559|gb|EDW84460.1| GK14137 [Drosophila willistoni]
Length = 536
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 29/63 (46%), Positives = 40/63 (63%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R S L+DVELGG T+F LN+ + P KG+ V WHN H + +D H+GCPV +
Sbjct: 442 DRISTSMFYLSDVELGGNTVFIKLNVFLPPIKGAMVMWHNLHYSLDVDRRTIHAGCPVLI 501
Query: 62 GNK 64
G+K
Sbjct: 502 GSK 504
Score = 62.4 bits (150), Expect = 5e-08, Method: Composition-based stats.
Identities = 27/54 (50%), Positives = 38/54 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DVELGG T+F LN+ + P KG+ V W+N H + +D R H+GCPV +G+K
Sbjct: 451 LSDVELGGNTVFIKLNVFLPPIKGAMVMWHNLHYSLDVDRRTIHAGCPVLIGSK 504
>gi|195069793|ref|XP_001997027.1| GH12976 [Drosophila grimshawi]
gi|193891496|gb|EDV90362.1| GH12976 [Drosophila grimshawi]
Length = 83
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DV+ GGAT+F SL ++P+KG+A FW N H + D R H+ CPV G+KW
Sbjct: 1 MSDVQQGGATVFTSLRTALWPKKGTAAFWMNLHRSGEGDARTRHAACPVLTGSKW 55
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DV+ GGAT+F SL ++P+KG+A FW N H + D H+ CPV G+KW
Sbjct: 1 MSDVQQGGATVFTSLRTALWPKKGTAAFWMNLHRSGEGDARTRHAACPVLTGSKW 55
>gi|350014318|dbj|GAA37183.1| prolyl 4-hydroxylase [Clonorchis sinensis]
Length = 595
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ LTDV+ GGAT+F +V P +G+A FWYN H + D R H CPV
Sbjct: 501 GNRIATIIFY-LTDVKAGGATVFNRFGASVKPVRGAAGFWYNLHPSGEGDLRTRHVACPV 559
Query: 122 ALGNKW 127
+G+KW
Sbjct: 560 LVGSKW 565
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR LTDV+ GGAT+F +V P +G+A FW+N H + D H CPV +
Sbjct: 502 NRIATIIFYLTDVKAGGATVFNRFGASVKPVRGAAGFWYNLHPSGEGDLRTRHVACPVLV 561
Query: 62 GNKW 65
G+KW
Sbjct: 562 GSKW 565
>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
Length = 546
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 28/67 (41%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ +L L+DV GG T FP L + P+K +A FW+N HA+ + D R H CP
Sbjct: 444 MGNRIAAVLFY-LSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACP 502
Query: 121 VALGNKW 127
+ G+KW
Sbjct: 503 IIAGSKW 509
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DV GG T FP L + P+K +A FWHN HA+ + D H CP+
Sbjct: 446 NRIAAVLFYLSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIA 505
Query: 62 GNKW 65
G+KW
Sbjct: 506 GSKW 509
>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
Length = 550
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 28/67 (41%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ +L L+DV GG T FP L + P+K +A FW+N HA+ + D R H CP
Sbjct: 441 MGNRIATVLFY-LSDVSQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACP 499
Query: 121 VALGNKW 127
+ G+KW
Sbjct: 500 IIAGSKW 506
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DV GG T FP L + P+K +A FWHN HA+ + D H CP+
Sbjct: 443 NRIATVLFYLSDVSQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIA 502
Query: 62 GNKW 65
G+KW
Sbjct: 503 GSKW 506
>gi|390352104|ref|XP_003727818.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Strongylocentrotus purpuratus]
Length = 121
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L L+DV GG T+F + PEKGSA+FWYN N +D R H+ CPV
Sbjct: 34 GNRIASMLFY-LSDVAKGGDTVFIDAGAKIKPEKGSAIFWYNLFKNGKVDERTKHASCPV 92
Query: 122 ALGNKW 127
G+KW
Sbjct: 93 ISGSKW 98
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DV GG T+F + PEKGSA+FW+N N +D H+ CPV G+KW
Sbjct: 44 LSDVAKGGDTVFIDAGAKIKPEKGSAIFWYNLFKNGKVDERTKHASCPVISGSKW 98
>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
Length = 532
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 35/55 (63%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+TDVE GGAT+F + V+PEKGSA W+N + D R H+ CPV G+KW
Sbjct: 457 MTDVEAGGATVFLDAGVKVYPEKGSAAVWHNLLPSGEGDMRTRHAACPVLTGSKW 511
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 34/55 (61%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+TDVE GGAT+F + V+PEKGSA WHN + D H+ CPV G+KW
Sbjct: 457 MTDVEAGGATVFLDAGVKVYPEKGSAAVWHNLLPSGEGDMRTRHAACPVLTGSKW 511
>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
Length = 517
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 35/55 (63%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LTDV+ GGAT+FP L L+ FP+KGSA+ + N D HS CPV GNKW
Sbjct: 441 LTDVQQGGATVFPFLRLSYFPKKGSALIFRNLDNAMSGDKDSTHSACPVLFGNKW 495
Score = 62.4 bits (150), Expect = 5e-08, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDV+ GGAT+FP L L+ FP+KGSA+ + N D HS CPV GNKW
Sbjct: 441 LTDVQQGGATVFPFLRLSYFPKKGSALIFRNLDNAMSGDKDSTHSACPVLFGNKW 495
>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
queenslandica]
Length = 525
Score = 62.8 bits (151), Expect = 4e-08, Method: Composition-based stats.
Identities = 28/66 (42%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ LL+ ++DVE GGAT+FP + + P K +A +W+N + DY H+GCPV
Sbjct: 438 GNRISTLLIY-MSDVEKGGATVFPGVGARLVPIKRAAAYWWNLKRSGDGDYSTRHAGCPV 496
Query: 122 ALGNKW 127
+G+KW
Sbjct: 497 LVGSKW 502
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + ++DVE GGAT+FP + + P K +A +W N + DY H+GCPV +
Sbjct: 439 NRISTLLIYMSDVEKGGATVFPGVGARLVPIKRAAAYWWNLKRSGDGDYSTRHAGCPVLV 498
Query: 62 GNKW 65
G+KW
Sbjct: 499 GSKW 502
>gi|195145084|ref|XP_002013526.1| GL24185 [Drosophila persimilis]
gi|194102469|gb|EDW24512.1| GL24185 [Drosophila persimilis]
Length = 229
Score = 62.8 bits (151), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 51/85 (60%), Gaps = 1/85 (1%)
Query: 43 HANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWY 102
H + D Y + LG++ +L + +V+ GGAT FP +N++V P+KGSAV WY
Sbjct: 118 HYDVHYDSHNYSEANRLILGDRIATVLFY-VGEVDSGGATTFPYINVSVTPKKGSAVLWY 176
Query: 103 NAHANTLLDYRMYHSGCPVALGNKW 127
N + ++ + H+GCPV +G+K+
Sbjct: 177 NLDNSGQMNPKAIHAGCPVIVGSKY 201
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 38/53 (71%)
Query: 13 DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+V+ GGAT FP +N++V P+KGSAV W+N + ++ H+GCPV +G+K+
Sbjct: 149 EVDSGGATTFPYINVSVTPKKGSAVLWYNLDNSGQMNPKAIHAGCPVIVGSKY 201
>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
Length = 215
Score = 62.8 bits (151), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 44/78 (56%)
Query: 50 YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
++ + + A N L+ L DVE GG T FP LNL+VFP KG AV++ ++N
Sbjct: 124 HFDFFADTSRASANNRISTLVMYLNDVEEGGETTFPMLNLSVFPSKGMAVYFEYFYSNHE 183
Query: 110 LDYRMYHSGCPVALGNKW 127
L+ R H+G PV G KW
Sbjct: 184 LNERTLHAGAPVRKGEKW 201
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+VFP KG AV++ ++N L+ H+G PV G KW
Sbjct: 147 LNDVEEGGETTFPMLNLSVFPSKGMAVYFEYFYSNHELNERTLHAGAPVRKGEKW 201
>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
Length = 541
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 28/67 (41%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ +L L+DV GG T FP L + P+K +A FW+N HA+ + D R H CP
Sbjct: 441 MGNRIATVLFY-LSDVAQGGGTAFPHLKQLLQPKKYAAAFWHNLHASGVGDLRTLHGACP 499
Query: 121 VALGNKW 127
+ G+KW
Sbjct: 500 IIAGSKW 506
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DV GG T FP L + P+K +A FWHN HA+ + D H CP+
Sbjct: 443 NRIATVLFYLSDVAQGGGTAFPHLKQLLQPKKYAAAFWHNLHASGVGDLRTLHGACPIIA 502
Query: 62 GNKW 65
G+KW
Sbjct: 503 GSKW 506
>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 591
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 33/55 (60%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV+ GGAT+FP + + KG A FWYN + D R H GCPV +G+KW
Sbjct: 507 LNDVKAGGATVFPEVKTRIPVAKGGAAFWYNVRPSGATDPRTLHGGCPVLVGSKW 561
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DV+ GGAT+FP + + KG A FW+N + D H GCPV +G+KW
Sbjct: 507 LNDVKAGGATVFPEVKTRIPVAKGGAAFWYNVRPSGATDPRTLHGGCPVLVGSKW 561
>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
Length = 144
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 28/59 (47%), Positives = 37/59 (62%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LS L+DVE GG T+F + TV+P+KG A FWYN + D H+ CPV +G+KW
Sbjct: 69 FLSYLSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKW 127
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DVE GG T+F + TV+P+KG A FW+N + D H+ CPV +G+KW
Sbjct: 73 LSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKW 127
>gi|195591296|ref|XP_002085378.1| GD14754 [Drosophila simulans]
gi|194197387|gb|EDX10963.1| GD14754 [Drosophila simulans]
Length = 508
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 33/73 (45%), Positives = 45/73 (61%), Gaps = 13/73 (17%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM------ 114
LG++ +L +TDV GGA FP+LNLT++P+KGSA+ W N LD+RM
Sbjct: 412 LGDRLTSILFF-MTDVVQGGAFAFPNLNLTIWPQKGSALVWRN------LDHRMQPNKDL 464
Query: 115 YHSGCPVALGNKW 127
H CPV +G+KW
Sbjct: 465 LHVSCPVVVGSKW 477
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+TDV GGA FP+LNLT++P+KGSA+ W N + + H CPV +G+KW
Sbjct: 423 MTDVVQGGAFAFPNLNLTIWPQKGSALVWRNLDHRMQPNKDLLHVSCPVVVGSKW 477
>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
Length = 534
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 30/66 (45%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L+ ++DVE GGAT+F L VFP K A+FWYN + D R H+ CPV
Sbjct: 437 GNRIATILIY-MSDVESGGATVFNHLGNAVFPSKYDALFWYNLRRDGEGDLRTRHAACPV 495
Query: 122 ALGNKW 127
G KW
Sbjct: 496 LTGIKW 501
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + ++DVE GGAT+F L VFP K A+FW+N + D H+ CPV
Sbjct: 438 NRIATILIYMSDVESGGATVFNHLGNAVFPSKYDALFWYNLRRDGEGDLRTRHAACPVLT 497
Query: 62 GNKW 65
G KW
Sbjct: 498 GIKW 501
>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
Length = 526
Score = 62.4 bits (150), Expect = 5e-08, Method: Composition-based stats.
Identities = 32/89 (35%), Positives = 46/89 (51%), Gaps = 4/89 (4%)
Query: 42 AHANTLLDYWMYHSGCPVA---LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSA 98
H D+ + P+A GN+ + L++VE GG+T+F + P KG A
Sbjct: 415 GHYEPHFDHSLDMENSPIASLGQGNRIATFMFY-LSEVEAGGSTVFIKTGVKTNPFKGGA 473
Query: 99 VFWYNAHANTLLDYRMYHSGCPVALGNKW 127
VFWYN + D+ H+GCPV +GNKW
Sbjct: 474 VFWYNLKKSGEGDWDSLHAGCPVLIGNKW 502
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L++VE GG+T+F + P KG AVFW+N + D+ H+GCPV +
Sbjct: 439 NRIATFMFYLSEVEAGGSTVFIKTGVKTNPFKGGAVFWYNLKKSGEGDWDSLHAGCPVLI 498
Query: 62 GNKW 65
GNKW
Sbjct: 499 GNKW 502
>gi|195575105|ref|XP_002105520.1| GD21524 [Drosophila simulans]
gi|194201447|gb|EDX15023.1| GD21524 [Drosophila simulans]
Length = 448
Score = 62.4 bits (150), Expect = 5e-08, Method: Composition-based stats.
Identities = 28/48 (58%), Positives = 32/48 (66%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
L+DV GGATIFP L L+VFP+KGSA+ WYN D R HS CP
Sbjct: 398 LSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACP 445
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 27/57 (47%), Positives = 34/57 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP 58
+R + L+DV GGATIFP L L+VFP+KGSA+ W+N D HS CP
Sbjct: 389 DRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWYNLDHKGDGDNRTAHSACP 445
>gi|194905424|ref|XP_001981193.1| GG11755 [Drosophila erecta]
gi|190655831|gb|EDV53063.1| GG11755 [Drosophila erecta]
Length = 527
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 28/63 (44%), Positives = 39/63 (61%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R S L+DVE GG T+FP LN+ + P G+ V WHN H + +D H+GCPV +
Sbjct: 433 DRISTSMFYLSDVEQGGYTVFPKLNVFLPPVSGALVMWHNLHRSLDVDARTLHAGCPVIV 492
Query: 62 GNK 64
G+K
Sbjct: 493 GSK 495
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 37/54 (68%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DVE GG T+FP LN+ + P G+ V W+N H + +D R H+GCPV +G+K
Sbjct: 442 LSDVEQGGYTVFPKLNVFLPPVSGALVMWHNLHRSLDVDARTLHAGCPVIVGSK 495
>gi|443705944|gb|ELU02240.1| hypothetical protein CAPTEDRAFT_227850 [Capitella teleta]
Length = 475
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 37/55 (67%)
Query: 79 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLLS 133
GGAT+FP L +++ +KG+A FW+N H + LD R H+ CPV G KW +++S
Sbjct: 317 GGATVFPLLPMSIPIQKGAAAFWFNLHPDGSLDRRTLHAACPVIRGTKWECVIVS 371
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 35/55 (63%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLS 71
GGAT+FP L +++ +KG+A FW N H + LD H+ CPV G KW +++S
Sbjct: 317 GGATVFPLLPMSIPIQKGAAAFWFNLHPDGSLDRRTLHAACPVIRGTKWECVIVS 371
>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
Length = 216
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPQLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
LL+ H G PV G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPQLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204
>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
Length = 536
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 36/92 (39%), Positives = 49/92 (53%), Gaps = 6/92 (6%)
Query: 38 FWHNAHANTLLDYWMYHSGCPV--ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEK 95
+ + H +T DY + P+ ALG++ +L L DV+ GG+T+FP L L V E+
Sbjct: 420 YGYGGHYDTHFDY--LNDSLPITQALGDRMATVLFY-LNDVKHGGSTVFPVLQLKVPSER 476
Query: 96 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
G + WYN H T LD R H CPV G K
Sbjct: 477 GKVLVWYNMHGETHDLDSRTLHGSCPVIDGAK 508
Score = 50.8 bits (120), Expect = 2e-04, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L DV+ GG+T+FP L L V E+G + W+N H T LD H CPV G K
Sbjct: 454 LNDVKHGGSTVFPVLQLKVPSERGKVLVWYNMHGETHDLDSRTLHGSCPVIDGAK 508
>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 62.4 bits (150), Expect = 7e-08, Method: Composition-based stats.
Identities = 36/92 (39%), Positives = 49/92 (53%), Gaps = 6/92 (6%)
Query: 38 FWHNAHANTLLDYWMYHSGCPV--ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEK 95
+ + H +T DY + P+ ALG++ +L L DV+ GG+T+FP L L V E+
Sbjct: 414 YGYGGHYDTHFDY--LNDSLPITQALGDRMATVLFY-LNDVKHGGSTVFPVLKLKVPSER 470
Query: 96 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
G + WYN H T LD R H CPV G K
Sbjct: 471 GKVLVWYNMHGETHDLDSRTLHGSCPVIDGAK 502
Score = 50.8 bits (120), Expect = 2e-04, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L DV+ GG+T+FP L L V E+G + W+N H T LD H CPV G K
Sbjct: 448 LNDVKHGGSTVFPVLKLKVPSERGKVLVWYNMHGETHDLDSRTLHGSCPVIDGAK 502
>gi|405970696|gb|EKC35577.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 171
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LTDVE GGAT+FP + V KG+A+FWYN N+ D R ++ CPV LG+K+
Sbjct: 90 LTDVEKGGATVFPEAKVRVPVTKGAALFWYNIKRNSEKDQRSLNADCPVILGSKF 144
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
LTDVE GGAT+FP + V KG+A+FW+N N+ D ++ CPV LG+K+
Sbjct: 90 LTDVEKGGATVFPEAKVRVPVTKGAALFWYNIKRNSEKDQRSLNADCPVILGSKF 144
>gi|112984520|ref|NP_001037195.1| prolyl 4-hydroxylase alpha subunit precursor [Bombyx mori]
gi|37543673|gb|AAM21932.1| prolyl 4-hydroxylase alpha subunit [Bombyx mori]
Length = 550
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DV GGAT+F L L++FP K +A FW N HA+ D H+ CPV
Sbjct: 447 GNRIATVLFY-MSDVAQGGATVFTELGLSLFPIKRAAAFWLNLHASGEGDLATRHAACPV 505
Query: 122 ALGNKW 127
G+KW
Sbjct: 506 LRGSKW 511
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+F L L++FP K +A FW N HA+ D H+ CPV
Sbjct: 448 NRIATVLFYMSDVAQGGATVFTELGLSLFPIKRAAAFWLNLHASGEGDLATRHAACPVLR 507
Query: 62 GNKW 65
G+KW
Sbjct: 508 GSKW 511
>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
Length = 216
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
LL+ H G PV G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204
>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
Length = 216
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ H +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFHQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ H + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFHQDQSLNELTLHGGAPVTKGEKW 204
>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
Length = 216
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
LL+ H G PV G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204
>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
Length = 216
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
LL+ H G PV G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204
>gi|241710335|ref|XP_002412046.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
gi|215505101|gb|EEC14595.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
Length = 65
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++DVE GGAT+FP L + + P+KG A FW+N AN + H+GCPV G+KW
Sbjct: 11 MSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCPVLYGSKW 65
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R + ++DVE GGAT+FP L + + P+KG A FW N AN + H+GCPV
Sbjct: 2 DRVATLMIYMSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCPVLY 61
Query: 62 GNKW 65
G+KW
Sbjct: 62 GSKW 65
>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
Length = 216
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
LL+ H G PV G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204
>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
Length = 216
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
LL+ H G PV G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204
>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
Length = 535
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 33/55 (60%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DVE GG T+F + + PEKG+A WYN H + D H+ CPV GNKW
Sbjct: 458 LSDVEAGGGTVFMTAGTKLRPEKGAAAVWYNLHPDGTGDDETKHAACPVLTGNKW 512
Score = 58.9 bits (141), Expect = 7e-07, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DVE GG T+F + + PEKG+A W+N H + D H+ CPV GNKW
Sbjct: 458 LSDVEAGGGTVFMTAGTKLRPEKGAAAVWYNLHPDGTGDDETKHAACPVLTGNKW 512
>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
Length = 542
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ LL +T E GGAT+F + TV P K A+FWYN + D R H+ CPV
Sbjct: 445 GNRLATLLFY-MTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPV 503
Query: 122 ALGNKW 127
+G+KW
Sbjct: 504 LIGSKW 509
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 34/64 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR +T E GGAT+F + TV P K A+FW+N + D H+ CPV +
Sbjct: 446 NRLATLLFYMTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLI 505
Query: 62 GNKW 65
G+KW
Sbjct: 506 GSKW 509
>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
Length = 541
Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ LL +T E GGAT+F + TV P K A+FWYN + D R H+ CPV
Sbjct: 444 GNRLATLLFY-MTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPV 502
Query: 122 ALGNKW 127
+G+KW
Sbjct: 503 LIGSKW 508
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 34/64 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR +T E GGAT+F + TV P K A+FW+N + D H+ CPV +
Sbjct: 445 NRLATLLFYMTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLI 504
Query: 62 GNKW 65
G+KW
Sbjct: 505 GSKW 508
>gi|194751829|ref|XP_001958226.1| GF23628 [Drosophila ananassae]
gi|190625508|gb|EDV41032.1| GF23628 [Drosophila ananassae]
Length = 484
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 42/67 (62%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LG++ ++ + DV GG T+FP + + V P+KGS++FW+N + D R HS CP
Sbjct: 396 LGDRIASIIFY-VGDVVHGGQTVFPDIQIAVKPQKGSSLFWFNTFDDATPDPRSLHSVCP 454
Query: 121 VALGNKW 127
V +G++W
Sbjct: 455 VLIGDRW 461
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 23/53 (43%), Positives = 33/53 (62%)
Query: 13 DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
DV GG T+FP + + V P+KGS++FW N + D HS CPV +G++W
Sbjct: 409 DVVHGGQTVFPDIQIAVKPQKGSSLFWFNTFDDATPDPRSLHSVCPVLIGDRW 461
>gi|195352176|ref|XP_002042590.1| GM14977 [Drosophila sechellia]
gi|194124474|gb|EDW46517.1| GM14977 [Drosophila sechellia]
Length = 485
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 45/76 (59%), Gaps = 13/76 (17%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM------ 114
LG++ +L + DV GGA FP+LNLT++P KGSA+ W N LD+RM
Sbjct: 413 LGDRLTSILFF-MNDVVQGGAFAFPNLNLTIWPHKGSALVWRN------LDHRMQPNKDL 465
Query: 115 YHSGCPVALGNKWGKL 130
H CPV +G+KW +L
Sbjct: 466 LHVSCPVVVGSKWSEL 481
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 26/58 (44%), Positives = 36/58 (62%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKL 68
+ DV GGA FP+LNLT++P KGSA+ W N + + H CPV +G+KW +L
Sbjct: 424 MNDVVQGGAFAFPNLNLTIWPHKGSALVWRNLDHRMQPNKDLLHVSCPVVVGSKWSEL 481
>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
Length = 216
Score = 62.0 bits (149), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
LL+ H G PV G KW
Sbjct: 186 LLNELTLHGGAPVTKGEKW 204
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV
Sbjct: 141 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTK 200
Query: 62 GNKW 65
G KW
Sbjct: 201 GEKW 204
>gi|21358309|ref|NP_651801.1| prolyl-4-hydroxylase-alpha SG2 [Drosophila melanogaster]
gi|20269808|gb|AAM18059.1|AF495537_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG2
[Drosophila melanogaster]
gi|10726875|gb|AAG22175.1| prolyl-4-hydroxylase-alpha SG2 [Drosophila melanogaster]
Length = 527
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 28/63 (44%), Positives = 39/63 (61%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R S L+DVE GG T+F LN+ + P KG+ V WHN H + +D H+GCPV +
Sbjct: 433 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIV 492
Query: 62 GNK 64
G+K
Sbjct: 493 GSK 495
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 37/54 (68%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DVE GG T+F LN+ + P KG+ V W+N H + +D R H+GCPV +G+K
Sbjct: 442 LSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSK 495
>gi|198449650|ref|XP_001357661.2| GA13747 [Drosophila pseudoobscura pseudoobscura]
gi|198130701|gb|EAL26795.2| GA13747 [Drosophila pseudoobscura pseudoobscura]
Length = 533
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 28/63 (44%), Positives = 40/63 (63%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R S L+DVE GG T+F LN+ + P KG+ V WHN H + +D +H+GCPV +
Sbjct: 439 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIV 498
Query: 62 GNK 64
G+K
Sbjct: 499 GSK 501
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 38/54 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DVE GG T+F LN+ + P KG+ V W+N H + +D R +H+GCPV +G+K
Sbjct: 448 LSDVEQGGYTVFTKLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIVGSK 501
>gi|195159321|ref|XP_002020530.1| GL13464 [Drosophila persimilis]
gi|194117299|gb|EDW39342.1| GL13464 [Drosophila persimilis]
Length = 533
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 28/63 (44%), Positives = 40/63 (63%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R S L+DVE GG T+F LN+ + P KG+ V WHN H + +D +H+GCPV +
Sbjct: 439 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIV 498
Query: 62 GNK 64
G+K
Sbjct: 499 GSK 501
Score = 60.8 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 38/54 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DVE GG T+F LN+ + P KG+ V W+N H + +D R +H+GCPV +G+K
Sbjct: 448 LSDVEQGGYTVFTKLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIVGSK 501
>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
Length = 455
Score = 61.6 bits (148), Expect = 9e-08, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 35/56 (62%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L DVE GG T+FP L + P KGSAV +YN +++ D R H GCPV +G KW
Sbjct: 380 LNDVEQGGKTVFPRLGIFRSPMKGSAVVFYNMNSSLQGDPRTEHGGCPVLVGTKWA 435
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 34/56 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L DVE GG T+FP L + P KGSAV ++N +++ D H GCPV +G KW
Sbjct: 380 LNDVEQGGKTVFPRLGIFRSPMKGSAVVFYNMNSSLQGDPRTEHGGCPVLVGTKWA 435
>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
Length = 584
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+DV+ GGAT+F + V P+KG+A FW+N N D R H+ CPV
Sbjct: 490 GNRIATIIFY-LSDVQAGGATVFNRIGTRVVPKKGAAGFWFNLLPNGEGDLRTRHAACPV 548
Query: 122 ALGNKW 127
G+KW
Sbjct: 549 LAGSKW 554
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DV+ GGAT+F + V P+KG+A FW N N D H+ CPV
Sbjct: 491 NRIATIIFYLSDVQAGGATVFNRIGTRVVPKKGAAGFWFNLLPNGEGDLRTRHAACPVLA 550
Query: 62 GNKW 65
G+KW
Sbjct: 551 GSKW 554
>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 550
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L LT+ E+GG T+F L V P K A+FWYN + + D R H+ CPV
Sbjct: 441 GNRLATVLFY-LTEPEIGGGTVFTELRTAVMPSKNGALFWYNLYRSGEGDLRTRHAACPV 499
Query: 122 ALGNKW 127
+G KW
Sbjct: 500 LVGIKW 505
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 34/64 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR LT+ E+GG T+F L V P K A+FW+N + + D H+ CPV +
Sbjct: 442 NRLATVLFYLTEPEIGGGTVFTELRTAVMPSKNGALFWYNLYRSGEGDLRTRHAACPVLV 501
Query: 62 GNKW 65
G KW
Sbjct: 502 GIKW 505
>gi|195452738|ref|XP_002073478.1| GK14139 [Drosophila willistoni]
gi|194169563|gb|EDW84464.1| GK14139 [Drosophila willistoni]
Length = 215
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 41/55 (74%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GGAT+F +++L+VFP+ G+A+FWYN + ++ + H+GCPV +G+KW
Sbjct: 141 LQDVPQGGATLFNNISLSVFPKAGAALFWYNLNNAGDTEWNVAHTGCPVIVGSKW 195
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 42/56 (75%)
Query: 10 ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
EL DV GGAT+F +++L+VFP+ G+A+FW+N + ++ + H+GCPV +G+KW
Sbjct: 140 ELQDVPQGGATLFNNISLSVFPKAGAALFWYNLNNAGDTEWNVAHTGCPVIVGSKW 195
>gi|195575115|ref|XP_002105525.1| GD21527 [Drosophila simulans]
gi|194201452|gb|EDX15028.1| GD21527 [Drosophila simulans]
Length = 495
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 30/87 (34%), Positives = 44/87 (50%), Gaps = 4/87 (4%)
Query: 45 NTLLDYWMYHSGCPVALGNKWGK----LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF 100
N + Y H +G +G+ + L+DV GG T FP L + P+K +A F
Sbjct: 365 NLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTLSDVAQGGGTAFPQLRTLLKPKKYAAAF 424
Query: 101 WYNAHANTLLDYRMYHSGCPVALGNKW 127
W+N HA+ + D R H CP+ G+KW
Sbjct: 425 WHNLHASGVGDVRTQHGACPIIAGSKW 451
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DV GG T FP L + P+K +A FWHN HA+ + D H CP+ G+KW
Sbjct: 397 LSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 451
>gi|195341560|ref|XP_002037374.1| GM12888 [Drosophila sechellia]
gi|194131490|gb|EDW53533.1| GM12888 [Drosophila sechellia]
Length = 501
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 30/87 (34%), Positives = 44/87 (50%), Gaps = 4/87 (4%)
Query: 45 NTLLDYWMYHSGCPVALGNKWGK----LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF 100
N + Y H +G +G+ + L+DV GG T FP L + P+K +A F
Sbjct: 371 NLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTLSDVAQGGGTAFPQLRTLLKPKKYAAAF 430
Query: 101 WYNAHANTLLDYRMYHSGCPVALGNKW 127
W+N HA+ + D R H CP+ G+KW
Sbjct: 431 WHNLHASGVGDVRTQHGACPIIAGSKW 457
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DV GG T FP L + P+K +A FWHN HA+ + D H CP+ G+KW
Sbjct: 403 LSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 457
>gi|195574593|ref|XP_002105269.1| GD21390 [Drosophila simulans]
gi|194201196|gb|EDX14772.1| GD21390 [Drosophila simulans]
Length = 478
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 27/60 (45%), Positives = 35/60 (58%)
Query: 10 ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLL 69
EL+DV++GG FP L P +GSA+ WHN + D + CPV LGN+WGK L
Sbjct: 382 ELSDVQMGGYASFPDLGFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWGKSL 441
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 26/59 (44%), Positives = 35/59 (59%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
L+DV++GG FP L P +GSA+ W+N + D R + CPV LGN+WGK L
Sbjct: 383 LSDVQMGGYASFPDLGFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWGKSL 441
>gi|195575095|ref|XP_002105515.1| GD21523 [Drosophila simulans]
gi|194201442|gb|EDX15018.1| GD21523 [Drosophila simulans]
Length = 527
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 28/63 (44%), Positives = 39/63 (61%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R S L+DVE GG T+F LN+ + P KG+ V WHN H + +D H+GCPV +
Sbjct: 433 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIV 492
Query: 62 GNK 64
G+K
Sbjct: 493 GSK 495
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 37/54 (68%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DVE GG T+F LN+ + P KG+ V W+N H + +D R H+GCPV +G+K
Sbjct: 442 LSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIVGSK 495
>gi|195505197|ref|XP_002099400.1| GE10884 [Drosophila yakuba]
gi|194185501|gb|EDW99112.1| GE10884 [Drosophila yakuba]
Length = 527
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 28/63 (44%), Positives = 39/63 (61%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R S L+DVE GG T+F LN+ + P KG+ V WHN H + +D H+GCPV +
Sbjct: 433 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIV 492
Query: 62 GNK 64
G+K
Sbjct: 493 GSK 495
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 37/54 (68%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DVE GG T+F LN+ + P KG+ V W+N H + +D R H+GCPV +G+K
Sbjct: 442 LSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIVGSK 495
>gi|198459366|ref|XP_002138685.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
gi|198136669|gb|EDY69243.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
Length = 448
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 35/56 (62%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L DVE GG T+FP L + P KGSAV +YN +++ D R H GCPV +G KW
Sbjct: 373 LNDVEQGGKTVFPRLGIFRSPMKGSAVVFYNLNSSLQGDPRTEHGGCPVLVGTKWA 428
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 34/56 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L DVE GG T+FP L + P KGSAV ++N +++ D H GCPV +G KW
Sbjct: 373 LNDVEQGGKTVFPRLGIFRSPMKGSAVVFYNLNSSLQGDPRTEHGGCPVLVGTKWA 428
>gi|221512810|ref|NP_649043.3| CG18234 [Drosophila melanogaster]
gi|66771545|gb|AAY55084.1| IP12246p [Drosophila melanogaster]
gi|220902636|gb|AAF49255.4| CG18234 [Drosophila melanogaster]
Length = 515
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 29/61 (47%), Positives = 38/61 (62%), Gaps = 12/61 (19%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM------YHSGCPVALGNK 126
+ DV GGA FP+LNLT++P+KGSA+ W N LD+RM H CPV +G+K
Sbjct: 424 MNDVAQGGALAFPNLNLTIWPQKGSALVWRN------LDHRMQPNQDLLHVSCPVVVGSK 477
Query: 127 W 127
W
Sbjct: 478 W 478
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ DV GGA FP+LNLT++P+KGSA+ W N + + H CPV +G+KW
Sbjct: 424 MNDVAQGGALAFPNLNLTIWPQKGSALVWRNLDHRMQPNQDLLHVSCPVVVGSKW 478
>gi|195441323|ref|XP_002068462.1| GK20483 [Drosophila willistoni]
gi|194164547|gb|EDW79448.1| GK20483 [Drosophila willistoni]
Length = 550
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 28/53 (52%), Positives = 34/53 (64%)
Query: 75 DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
DV GGAT FP L V P+KGSA+FWYN + D R HS CPV +G++W
Sbjct: 473 DVTEGGATNFPRNQLVVQPKKGSALFWYNKFDDGSPDPRSLHSICPVVVGSRW 525
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 26/53 (49%), Positives = 33/53 (62%)
Query: 13 DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
DV GGAT FP L V P+KGSA+FW+N + D HS CPV +G++W
Sbjct: 473 DVTEGGATNFPRNQLVVQPKKGSALFWYNKFDDGSPDPRSLHSICPVVVGSRW 525
>gi|195390825|ref|XP_002054068.1| GJ24233 [Drosophila virilis]
gi|194152154|gb|EDW67588.1| GJ24233 [Drosophila virilis]
Length = 533
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 28/55 (50%), Positives = 34/55 (61%), Gaps = 1/55 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
L DV GGAT+FP L+L V PE+G + WYN +T D R H GCPV +G K
Sbjct: 446 LNDVVRGGATVFPKLDLVVQPERGKVLHWYNMLPDTFDYDRRSLHGGCPVLIGEK 500
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVA 60
+R + + L DV GGAT+FP L+L V PE+G + W+N +T D H GCPV
Sbjct: 437 DRIATTLIYLNDVVRGGATVFPKLDLVVQPERGKVLHWYNMLPDTFDYDRRSLHGGCPVL 496
Query: 61 LGNK 64
+G K
Sbjct: 497 IGEK 500
>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 539
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++ E GGAT+F L VFP K A+FWYN + D R H+ CPV
Sbjct: 437 GNRIATVLFY-MSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPV 495
Query: 122 ALGNKW 127
LG KW
Sbjct: 496 LLGVKW 501
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 33/64 (51%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++ E GGAT+F L VFP K A+FW+N + D H+ CPV L
Sbjct: 438 NRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLL 497
Query: 62 GNKW 65
G KW
Sbjct: 498 GVKW 501
>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
Length = 539
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++ E GGAT+F L VFP K A+FWYN + D R H+ CPV
Sbjct: 437 GNRIATVLFY-MSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPV 495
Query: 122 ALGNKW 127
LG KW
Sbjct: 496 LLGVKW 501
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 33/64 (51%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++ E GGAT+F L VFP K A+FW+N + D H+ CPV L
Sbjct: 438 NRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLL 497
Query: 62 GNKW 65
G KW
Sbjct: 498 GVKW 501
>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
Length = 573
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++ E GGAT+F L VFP K A+FWYN + D R H+ CPV
Sbjct: 471 GNRIATVLFY-MSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPV 529
Query: 122 ALGNKW 127
LG KW
Sbjct: 530 LLGVKW 535
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 33/64 (51%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++ E GGAT+F L VFP K A+FW+N + D H+ CPV L
Sbjct: 472 NRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLL 531
Query: 62 GNKW 65
G KW
Sbjct: 532 GVKW 535
>gi|308451420|ref|XP_003088665.1| CRE-PHY-2 protein [Caenorhabditis remanei]
gi|308246199|gb|EFO90151.1| CRE-PHY-2 protein [Caenorhabditis remanei]
Length = 609
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++ E GGAT+F L VFP K A+FWYN + D R H+ CPV
Sbjct: 507 GNRIATVLFY-MSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPV 565
Query: 122 ALGNKW 127
LG KW
Sbjct: 566 LLGVKW 571
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 33/64 (51%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++ E GGAT+F L VFP K A+FW+N + D H+ CPV L
Sbjct: 508 NRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLL 567
Query: 62 GNKW 65
G KW
Sbjct: 568 GVKW 571
>gi|51490656|emb|CAF31507.1| prolyl 4-hydroxylase 2 precursor [Brugia malayi]
Length = 551
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L+ +T+ E+GG T+F L +V K +A+FWYN + +D R YH+ CPV
Sbjct: 438 GNRIATILIY-MTEPEIGGGTVFIDLKTSVSCTKNAALFWYNLMRSGAVDMRSYHAACPV 496
Query: 122 ALGNKW 127
G KW
Sbjct: 497 LTGTKW 502
Score = 55.1 bits (131), Expect = 8e-06, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + +T+ E+GG T+F L +V K +A+FW+N + +D YH+ CPV
Sbjct: 439 NRIATILIYMTEPEIGGGTVFIDLKTSVSCTKNAALFWYNLMRSGAVDMRSYHAACPVLT 498
Query: 62 GNKW 65
G KW
Sbjct: 499 GTKW 502
>gi|241598362|ref|XP_002404733.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
gi|215500464|gb|EEC09958.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
Length = 340
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G++ L++ +TDVE GG T+FP+L + + P+KG A FW+N A+ + H+GCPV
Sbjct: 143 GDRLATLMIY-MTDVEEGGTTVFPNLGIRLTPKKGDAAFWWNLKASGDGERLTTHAGCPV 201
Query: 122 ALGNKW 127
G+KW
Sbjct: 202 LYGSKW 207
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 37/57 (64%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ +TDVE GG T+FP+L + + P+KG A FW N A+ + H+GCPV G+KW
Sbjct: 151 IYMTDVEEGGTTVFPNLGIRLTPKKGDAAFWWNLKASGDGERLTTHAGCPVLYGSKW 207
>gi|195452736|ref|XP_002073477.1| GK14138 [Drosophila willistoni]
gi|194169562|gb|EDW84463.1| GK14138 [Drosophila willistoni]
Length = 518
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 39/55 (70%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GGAT+ +++L+VFP+ G+A+FWYN + ++ + H+ CPV +G+KW
Sbjct: 444 LQDVPQGGATLLNNISLSVFPKAGAALFWYNLNNAGDTEWNVAHTACPVIVGSKW 498
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 40/56 (71%)
Query: 10 ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
EL DV GGAT+ +++L+VFP+ G+A+FW+N + ++ + H+ CPV +G+KW
Sbjct: 443 ELQDVPQGGATLLNNISLSVFPKAGAALFWYNLNNAGDTEWNVAHTACPVIVGSKW 498
>gi|321458081|gb|EFX69155.1| hypothetical protein DAPPUDRAFT_228756 [Daphnia pulex]
Length = 570
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 27/59 (45%), Positives = 37/59 (62%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++ L DVE GGAT+F + V P+KG AVFW+N +++ D H GCPV G+KW
Sbjct: 474 FMTYLDDVEAGGATVFTHAGVVVRPKKGMAVFWWNLKSDSNGDTLTRHGGCPVLHGSKW 532
Score = 58.9 bits (141), Expect = 7e-07, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 34/55 (61%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GGAT+F + V P+KG AVFW N +++ D H GCPV G+KW
Sbjct: 478 LDDVEAGGATVFTHAGVVVRPKKGMAVFWWNLKSDSNGDTLTRHGGCPVLHGSKW 532
>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
Length = 541
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ LL +T E GGAT+F + TV P K A+FWYN + D R H+ CPV
Sbjct: 444 GNRLATLLFY-MTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPV 502
Query: 122 ALGNKW 127
G KW
Sbjct: 503 LTGTKW 508
Score = 52.4 bits (124), Expect = 5e-05, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 32/64 (50%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR +T E GGAT+F + TV P K A+FW+N + D H+ CPV
Sbjct: 445 NRLATLLFYMTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLT 504
Query: 62 GNKW 65
G KW
Sbjct: 505 GTKW 508
>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
Length = 300
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DVE GGAT+F ++N+ V P KG+ + WYN + L H+GCPV +G+KW
Sbjct: 226 LSDVERGGATVFTNINVRVLPRKGNVIIWYNYLPDGNLHPGTLHAGCPVLVGSKW 280
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 37/55 (67%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DVE GGAT+F ++N+ V P KG+ + W+N + L H+GCPV +G+KW
Sbjct: 226 LSDVERGGATVFTNINVRVLPRKGNVIIWYNYLPDGNLHPGTLHAGCPVLVGSKW 280
>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
Length = 541
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ LL +T E GGAT+F + TV P K A+FWYN + D R H+ CPV
Sbjct: 444 GNRLATLLFY-MTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPV 502
Query: 122 ALGNKW 127
G KW
Sbjct: 503 LTGTKW 508
Score = 52.4 bits (124), Expect = 5e-05, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 32/64 (50%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR +T E GGAT+F + TV P K A+FW+N + D H+ CPV
Sbjct: 445 NRLATLLFYMTQPESGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLT 504
Query: 62 GNKW 65
G KW
Sbjct: 505 GTKW 508
>gi|195505244|ref|XP_002099420.1| GE10895 [Drosophila yakuba]
gi|194185521|gb|EDW99132.1| GE10895 [Drosophila yakuba]
Length = 533
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 36/55 (65%), Gaps = 1/55 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
L +VE GGAT+FPS+NL V +KGSA+FW+N + D R +H CP+ G K
Sbjct: 444 LNNVEHGGATVFPSINLAVPTQKGSALFWHNLDGQSYDYDTRTFHGACPLISGTK 498
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L +VE GGAT+FPS+NL V +KGSA+FWHN + D +H CP+ G K
Sbjct: 444 LNNVEHGGATVFPSINLAVPTQKGSALFWHNLDGQSYDYDTRTFHGACPLISGTK 498
>gi|195113247|ref|XP_002001179.1| GI22114 [Drosophila mojavensis]
gi|193917773|gb|EDW16640.1| GI22114 [Drosophila mojavensis]
Length = 487
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 26/63 (41%), Positives = 38/63 (60%), Gaps = 3/63 (4%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM--YHSGCPVALGNKWGKL 130
L DV GGAT+FP + + V PE+G + WYN + + DY + YH CPV +G K G
Sbjct: 426 LNDVARGGATVFPDVEIAVHPERGKVIHWYNMNPKS-FDYELHSYHGACPVLIGQKIGNG 484
Query: 131 LLS 133
+++
Sbjct: 485 IIT 487
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 25/65 (38%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWM--YHSGCPVALGNKWG 66
+ L DV GGAT+FP + + V PE+G + W+N + + DY + YH CPV +G K G
Sbjct: 424 IYLNDVARGGATVFPDVEIAVHPERGKVIHWYNMNPKS-FDYELHSYHGACPVLIGQKIG 482
Query: 67 KLLLS 71
+++
Sbjct: 483 NGIIT 487
>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
Length = 211
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 63/145 (43%), Gaps = 14/145 (9%)
Query: 1 MNRAEM-SSVELTDVELGGATIFPSLNLTVFPEKGSAVFW----------HNAHANTLLD 49
+NR+++ S +++D+ + P LT EK A H H +
Sbjct: 59 VNRSKIGSDHDVSDIRTSSSAFLPDDELTGRIEKRLAQIMNVPVEHGEGIHILHYKPGQE 118
Query: 50 YWMYHS---GCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 106
Y +H A N L+ L DVE GG T FP +NLTV P KG AV++ +
Sbjct: 119 YKAHHDYFRSTSRAAKNPRISTLVLYLNDVEEGGETYFPEMNLTVSPHKGMAVYFEYFYN 178
Query: 107 NTLLDYRMYHSGCPVALGNKWGKLL 131
+ ++ R H G PV G KW +
Sbjct: 179 DPAINERTLHGGSPVTAGEKWAATM 203
>gi|221512814|ref|NP_649045.2| CG18231 [Drosophila melanogaster]
gi|220902637|gb|AAF49253.3| CG18231 [Drosophila melanogaster]
Length = 470
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 30/58 (51%), Positives = 37/58 (63%), Gaps = 5/58 (8%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
LL L +VELGGA +FP L ++VFP+KGS +FW+N LD R CPV GNK
Sbjct: 393 LLFFLNNVELGGAMVFPKLKISVFPQKGSCLFWHNT-----LDPRSEPLECPVLQGNK 445
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 28/54 (51%), Positives = 34/54 (62%), Gaps = 5/54 (9%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
L +VELGGA +FP L ++VFP+KGS +FWHN LD CPV GNK
Sbjct: 397 LNNVELGGAMVFPKLKISVFPQKGSCLFWHNT-----LDPRSEPLECPVLQGNK 445
>gi|66771505|gb|AAY55064.1| IP12044p [Drosophila melanogaster]
Length = 484
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 30/58 (51%), Positives = 37/58 (63%), Gaps = 5/58 (8%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
LL L +VELGGA +FP L ++VFP+KGS +FW+N LD R CPV GNK
Sbjct: 407 LLFFLNNVELGGAMVFPKLKISVFPQKGSCLFWHNT-----LDPRSEPLECPVLQGNK 459
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 28/54 (51%), Positives = 34/54 (62%), Gaps = 5/54 (9%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
L +VELGGA +FP L ++VFP+KGS +FWHN LD CPV GNK
Sbjct: 411 LNNVELGGAMVFPKLKISVFPQKGSCLFWHNT-----LDPRSEPLECPVLQGNK 459
>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
Length = 220
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 45/79 (56%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL++ P+KGSAV++ + +
Sbjct: 133 DYFAEHSRA--AENNRISTLVMY-LNDVEEGGETFFPKLNLSIAPKKGSAVYFEYFYNDK 189
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 190 SLNELTLHGGAPVIKGEKW 208
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 34/55 (61%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL++ P+KGSAV++ + + L+ H G PV G KW
Sbjct: 154 LNDVEEGGETFFPKLNLSIAPKKGSAVYFEYFYNDKSLNELTLHGGAPVIKGEKW 208
>gi|195503448|ref|XP_002098656.1| GE23815 [Drosophila yakuba]
gi|194184757|gb|EDW98368.1| GE23815 [Drosophila yakuba]
Length = 472
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 26/59 (44%), Positives = 35/59 (59%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
L+DV++GG FP L P +GSA+ W+N + D R + CPV LGN+WGK L
Sbjct: 377 LSDVQMGGYASFPDLGFGFKPSRGSALVWHNTDSAGNCDTRSLQATCPVLLGNQWGKWL 435
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 26/60 (43%), Positives = 35/60 (58%)
Query: 10 ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLL 69
+L+DV++GG FP L P +GSA+ WHN + D + CPV LGN+WGK L
Sbjct: 376 KLSDVQMGGYASFPDLGFGFKPSRGSALVWHNTDSAGNCDTRSLQATCPVLLGNQWGKWL 435
>gi|195338688|ref|XP_002035956.1| GM16188 [Drosophila sechellia]
gi|194129836|gb|EDW51879.1| GM16188 [Drosophila sechellia]
Length = 392
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+TDV +GG TIFP L + P+KGSA+FWYN H N + H+ CP +G++W
Sbjct: 319 MTDVLVGGGTIFPGAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 373
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+TDV +GG TIFP L + P+KGSA+FW+N H N + H+ CP +G++W
Sbjct: 319 MTDVLVGGGTIFPGAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 373
>gi|405965633|gb|EKC30995.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 617
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 26/60 (43%), Positives = 40/60 (66%), Gaps = 4/60 (6%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH--ANTLLDYRMYHSGCPVALGNKW 127
L+ +T+V+ GGAT+F + + +FP KG+A FWYN + + + D R H+ CPV +G KW
Sbjct: 537 LTYMTNVDAGGATVFTHIGVKLFPIKGAAAFWYNLYRSGDGIFDTR--HAACPVLVGQKW 594
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR +T+V+ GGAT+F + + +FP KG+A FW+N + + + H+ CPV +
Sbjct: 531 NRIATWLTYMTNVDAGGATVFTHIGVKLFPIKGAAAFWYNLYRSGDGIFDTRHAACPVLV 590
Query: 62 GNKW 65
G KW
Sbjct: 591 GQKW 594
>gi|194751823|ref|XP_001958223.1| GF23631 [Drosophila ananassae]
gi|190625505|gb|EDV41029.1| GF23631 [Drosophila ananassae]
Length = 502
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 32/89 (35%), Positives = 46/89 (51%), Gaps = 3/89 (3%)
Query: 42 AHANTLLDYWMYHSGCPVALGNKWGKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSA 98
H +DY P G+ G + + L +DV LGG T+F L L++ P+KGSA
Sbjct: 395 GHFGKHVDYVELAKRPPNFFGDLGGDRIATALLYASDVPLGGTTVFTKLKLSIEPKKGSA 454
Query: 99 VFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+ W+N + D HS CPV LG++W
Sbjct: 455 LIWFNLNNAGDPDPMSEHSACPVVLGSRW 483
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 25/54 (46%), Positives = 34/54 (62%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+DV LGG T+F L L++ P+KGSA+ W N + D HS CPV LG++W
Sbjct: 430 SDVPLGGTTVFTKLKLSIEPKKGSALIWFNLNNAGDPDPMSEHSACPVVLGSRW 483
>gi|443697959|gb|ELT98193.1| hypothetical protein CAPTEDRAFT_162820 [Capitella teleta]
Length = 347
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 33/55 (60%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+++E GG T++P++ V P K S WYN N DYR YH+ CP+ G KW
Sbjct: 270 LSELEAGGYTVYPTVGAAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILYGYKW 324
Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 34/65 (52%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
NR L+++E GG T++P++ V P K S W+N N DY YH+ CP+
Sbjct: 260 RNRVATFICYLSELEAGGYTVYPTVGAAVVPRKNSCALWYNLMRNGTGDYRTYHAACPIL 319
Query: 61 LGNKW 65
G KW
Sbjct: 320 YGYKW 324
>gi|291224083|ref|XP_002732036.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit-like [Saccoglossus
kowalevskii]
Length = 491
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 37/55 (67%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+++V+ GG T+FP ++ V P K +AVFWYN A+ D H+GCPV +G+KW
Sbjct: 415 MSEVKAGGYTVFPEVDAFVPPVKNAAVFWYNLKASGESDDLTRHAGCPVLIGSKW 469
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR +++V+ GG T+FP ++ V P K +AVFW+N A+ D H+GCPV +
Sbjct: 406 NRIATWMFYMSEVKAGGYTVFPEVDAFVPPVKNAAVFWYNLKASGESDDLTRHAGCPVLI 465
Query: 62 GNKW 65
G+KW
Sbjct: 466 GSKW 469
>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
G9842]
gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
G9842]
Length = 216
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A+ N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AVNNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|312092237|ref|XP_003147267.1| hypothetical protein LOAG_11701 [Loa loa]
Length = 553
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L+ +T+ E+GG T+F +L +V K +A+FWYN + +D R YH+ CPV
Sbjct: 437 GNRIATILIY-MTEPEIGGRTVFINLKASVPCTKNAALFWYNLMRSGAVDMRSYHAACPV 495
Query: 122 ALGNKW 127
G KW
Sbjct: 496 LTGTKW 501
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + +T+ E+GG T+F +L +V K +A+FW+N + +D YH+ CPV
Sbjct: 438 NRIATILIYMTEPEIGGRTVFINLKASVPCTKNAALFWYNLMRSGAVDMRSYHAACPVLT 497
Query: 62 GNKW 65
G KW
Sbjct: 498 GTKW 501
>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
Length = 248
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A+ N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 161 DYFAEHSRS--AVNNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 28/67 (41%), Positives = 38/67 (56%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ +L L+DV GG T FP L + P+K +A FW+N HA D R H CP
Sbjct: 439 MGNRIATVLFY-LSDVAQGGGTAFPYLKQHLRPKKYAAAFWHNLHAAGRGDARTQHGACP 497
Query: 121 VALGNKW 127
+ G+KW
Sbjct: 498 IIAGSKW 504
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 33/64 (51%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L+DV GG T FP L + P+K +A FWHN HA D H CP+
Sbjct: 441 NRIATVLFYLSDVAQGGGTAFPYLKQHLRPKKYAAAFWHNLHAAGRGDARTQHGACPIIA 500
Query: 62 GNKW 65
G+KW
Sbjct: 501 GSKW 504
>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Acyrthosiphon pisum]
Length = 552
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DV GGAT+FP L + + P KG+A W+N + + D R H+ CPV
Sbjct: 459 GNRIATVLFY-MSDVAQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPV 517
Query: 122 ALGNKW 127
G+KW
Sbjct: 518 LQGSKW 523
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP L + + P KG+A W N + + D H+ CPV
Sbjct: 460 NRIATVLFYMSDVAQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQ 519
Query: 62 GNKW 65
G+KW
Sbjct: 520 GSKW 523
>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|25012370|gb|AAN71294.1| RE09701p [Drosophila melanogaster]
Length = 301
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 39/63 (61%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R S L+DVE GG T+F LN+ + P KG+ V WHN H + +D H+GCPV +
Sbjct: 207 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIV 266
Query: 62 GNK 64
G+K
Sbjct: 267 GSK 269
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 26/54 (48%), Positives = 37/54 (68%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DVE GG T+F LN+ + P KG+ V W+N H + +D R H+GCPV +G+K
Sbjct: 216 LSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSK 269
>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
Ancestor']
gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
anthracis str. CDC 684]
gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CNEVA-9066]
gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A1055]
gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Western North America USA6153]
gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Kruger B]
gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Vollum]
gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Australia 94]
gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Ames]
gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. 'Ames Ancestor']
gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CDC 684]
gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
Length = 216
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|72000637|ref|NP_507251.2| Protein PHY-3 [Caenorhabditis elegans]
gi|30145726|emb|CAB04736.2| Protein PHY-3 [Caenorhabditis elegans]
Length = 318
Score = 59.3 bits (142), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 52/119 (43%), Gaps = 6/119 (5%)
Query: 13 DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPV-----ALGNKWGK 67
++++ P LNLTV + + H DY Y S GN+ G
Sbjct: 152 EIKMQAQKRIPGLNLTVAEHFSALSYLPGGHYAVHYDYLDYRSKQDYDWWMNKTGNRIGT 211
Query: 68 LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+ L E GG T+FPS+ TV G A FW+NA A+ + H GCP+ G K
Sbjct: 212 LIFV-LKPAEKGGGTVFPSIGSTVRANAGDAFFWFNAQADEEKEMLSNHGGCPIYEGRK 269
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 42/94 (44%), Gaps = 10/94 (10%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK-----W 65
L E GG T+FPS+ TV G A FW NA A+ + H GCP+ G K W
Sbjct: 216 LKPAEKGGGTVFPSIGSTVRANAGDAFFWFNAQADEEKEMLSNHGGCPIYEGRKVIATIW 275
Query: 66 GKLLLSGLTDVELGGATI-----FPSLNLTVFPE 94
+ + + G++I PSL+ PE
Sbjct: 276 IRAYNQRILPMAPAGSSIHASWLIPSLSNRFRPE 309
>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
Length = 216
Score = 59.3 bits (142), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
Length = 216
Score = 59.3 bits (142), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Acyrthosiphon pisum]
Length = 534
Score = 59.3 bits (142), Expect = 4e-07, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DV GGAT+FP L + + P KG+A W+N + + D R H+ CPV
Sbjct: 441 GNRIATVLFY-MSDVAQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPV 499
Query: 122 ALGNKW 127
G+KW
Sbjct: 500 LQGSKW 505
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DV GGAT+FP L + + P KG+A W N + + D H+ CPV
Sbjct: 442 NRIATVLFYMSDVAQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQ 501
Query: 62 GNKW 65
G+KW
Sbjct: 502 GSKW 505
>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
ATCC 10987]
Length = 216
Score = 59.3 bits (142), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|20177113|gb|AAM12259.1| RE23792p [Drosophila melanogaster]
gi|220948174|gb|ACL86630.1| PH4alphaSG2-PB [synthetic construct]
gi|220960438|gb|ACL92755.1| PH4alphaSG2-PB [synthetic construct]
Length = 301
Score = 59.3 bits (142), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 39/63 (61%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R S L+DVE GG T+F LN+ + P KG+ V WHN H + +D H+GCPV +
Sbjct: 207 DRISTSMFYLSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIV 266
Query: 62 GNK 64
G+K
Sbjct: 267 GSK 269
Score = 58.5 bits (140), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 26/54 (48%), Positives = 37/54 (68%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DVE GG T+F LN+ + P KG+ V W+N H + +D R H+GCPV +G+K
Sbjct: 216 LSDVEQGGYTVFTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSK 269
>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
Length = 216
Score = 59.3 bits (142), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
Length = 216
Score = 59.3 bits (142), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
Length = 513
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 35/56 (62%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L+DV GGAT+FP LN++V P +G A+ WYN + + H+ CP+ G+KW
Sbjct: 440 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWA 495
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 35/56 (62%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L+DV GGAT+FP LN++V P +G A+ W+N + + H+ CP+ G+KW
Sbjct: 440 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWA 495
>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
AH820]
gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH820]
gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
Length = 216
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
Length = 216
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
Length = 517
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 35/56 (62%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L+DV GGAT+FP LN++V P +G A+ WYN + + H+ CP+ G+KW
Sbjct: 444 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWA 499
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 35/56 (62%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L+DV GGAT+FP LN++V P +G A+ W+N + + H+ CP+ G+KW
Sbjct: 444 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWA 499
>gi|195379216|ref|XP_002048376.1| GJ13933 [Drosophila virilis]
gi|194155534|gb|EDW70718.1| GJ13933 [Drosophila virilis]
Length = 521
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 31/74 (41%), Positives = 44/74 (59%), Gaps = 2/74 (2%)
Query: 55 SGCPVA-LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR 113
SG VA LG + L+ +DV GGAT FP + + V P+KG+++FWYN + D R
Sbjct: 426 SGEAVAELGERIATLIFYA-SDVAQGGATNFPDIQVAVQPQKGNSLFWYNMFDDGTPDPR 484
Query: 114 MYHSGCPVALGNKW 127
HS CP +G++W
Sbjct: 485 SLHSVCPTIVGSRW 498
Score = 53.1 bits (126), Expect = 4e-05, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 34/54 (62%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+DV GGAT FP + + V P+KG+++FW+N + D HS CP +G++W
Sbjct: 445 SDVAQGGATNFPDIQVAVQPQKGNSLFWYNMFDDGTPDPRSLHSVCPTIVGSRW 498
>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
Length = 216
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
Length = 216
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVIY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 141 NRISTLVIYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 200
Query: 62 GNKW 65
G KW
Sbjct: 201 GEKW 204
>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
Length = 216
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
anthracis str. CI]
gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
anthracis str. CI]
Length = 216
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 141 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 200
Query: 62 GNKW 65
G KW
Sbjct: 201 GEKW 204
>gi|194871359|ref|XP_001972833.1| GG13662 [Drosophila erecta]
gi|190654616|gb|EDV51859.1| GG13662 [Drosophila erecta]
Length = 515
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 42/67 (62%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LG++ +++ +V GG T+FP + + V P+KG A+FW+N ++ D R HS CP
Sbjct: 426 LGDRLASIIIYA-GEVSQGGQTVFPDIKVAVEPKKGKALFWFNDFDDSSPDPRSLHSVCP 484
Query: 121 VALGNKW 127
V +G++W
Sbjct: 485 VIVGSRW 491
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 33/53 (62%)
Query: 13 DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+V GG T+FP + + V P+KG A+FW N ++ D HS CPV +G++W
Sbjct: 439 EVSQGGQTVFPDIKVAVEPKKGKALFWFNDFDDSSPDPRSLHSVCPVIVGSRW 491
>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
Length = 216
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
Length = 216
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 141 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 200
Query: 62 GNKW 65
G KW
Sbjct: 201 GEKW 204
>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus AH187]
gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
Q1]
gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus NC7401]
gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH187]
gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
Q1]
gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NC7401]
gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
Length = 216
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|195159303|ref|XP_002020521.1| GL13468 [Drosophila persimilis]
gi|194117290|gb|EDW39333.1| GL13468 [Drosophila persimilis]
Length = 415
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 35/56 (62%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L+DV GGAT+FP LN++V P +G A+ WYN + + H+ CP+ G+KW
Sbjct: 342 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWA 397
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 36/57 (63%)
Query: 10 ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
+L+DV GGAT+FP LN++V P +G A+ W+N + + H+ CP+ G+KW
Sbjct: 341 QLSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWA 397
>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
Length = 232
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216
Query: 62 GNKW 65
G KW
Sbjct: 217 GEKW 220
>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
konkukian str. 97-27]
gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
konkukian str. 97-27]
Length = 232
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216
Query: 62 GNKW 65
G KW
Sbjct: 217 GEKW 220
>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
Length = 216
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 141 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 200
Query: 62 GNKW 65
G KW
Sbjct: 201 GEKW 204
>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
Length = 232
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216
Query: 62 GNKW 65
G KW
Sbjct: 217 GEKW 220
>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Sterne]
gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
Length = 232
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216
Query: 62 GNKW 65
G KW
Sbjct: 217 GEKW 220
>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
Length = 216
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDR 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 141 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDRSLNELTLHGGAPVTK 200
Query: 62 GNKW 65
G KW
Sbjct: 201 GEKW 204
>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
Length = 232
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216
Query: 62 GNKW 65
G KW
Sbjct: 217 GEKW 220
>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
Length = 232
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216
Query: 62 GNKW 65
G KW
Sbjct: 217 GEKW 220
>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
Length = 232
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216
Query: 62 GNKW 65
G KW
Sbjct: 217 GEKW 220
>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
Hakam]
gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
Hakam]
Length = 232
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 145 DYFAEHSRS--AANNRISTLVIY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 157 NRISTLVIYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216
Query: 62 GNKW 65
G KW
Sbjct: 217 GEKW 220
>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
Length = 232
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 201
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 157 NRISTLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 216
Query: 62 GNKW 65
G KW
Sbjct: 217 GEKW 220
>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
Length = 254
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 167 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 223
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 224 SLNELTLHGGAPVTKGEKW 242
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 188 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 242
>gi|312385117|gb|EFR29691.1| hypothetical protein AND_01144 [Anopheles darlingi]
Length = 295
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 36/60 (60%)
Query: 68 LLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+S + E GG FP L L+V P+KG+A+FWYN H + D RM +S CP+ N+W
Sbjct: 200 FFMSDVDGAEGGGRIAFPYLGLSVLPQKGAALFWYNLHDSGRPDERMTYSICPLLADNRW 259
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 6 MSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
MS V+ E GG FP L L+V P+KG+A+FW+N H + D M +S CP+ N+W
Sbjct: 202 MSDVD--GAEGGGRIAFPYLGLSVLPQKGAALFWYNLHDSGRPDERMTYSICPLLADNRW 259
>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
Length = 248
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|198466393|ref|XP_001353986.2| GA18007 [Drosophila pseudoobscura pseudoobscura]
gi|198150579|gb|EAL29722.2| GA18007 [Drosophila pseudoobscura pseudoobscura]
Length = 455
Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats.
Identities = 24/59 (40%), Positives = 39/59 (66%), Gaps = 6/59 (10%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
+++V+ GG T+FP L+L V P+KGS + W N D+ + H+ CP+ +GN WG+L+
Sbjct: 387 MSNVQQGGETVFPFLSLRVKPQKGSLLLWRNT------DWSVLHNSCPLIIGNMWGELI 439
Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats.
Identities = 25/66 (37%), Positives = 42/66 (63%), Gaps = 6/66 (9%)
Query: 4 AEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGN 63
A + + +++V+ GG T+FP L+L V P+KGS + W N D+ + H+ CP+ +GN
Sbjct: 380 AATALIFMSNVQQGGETVFPFLSLRVKPQKGSLLLWRNT------DWSVLHNSCPLIIGN 433
Query: 64 KWGKLL 69
WG+L+
Sbjct: 434 MWGELI 439
>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
Length = 216
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 41/79 (51%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS + N L+ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSR---SAANNRISTLVXYLNDVEEGGETFFPKLNLSVHPRKGXAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGXAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|195341061|ref|XP_002037130.1| GM12749 [Drosophila sechellia]
gi|194131246|gb|EDW53289.1| GM12749 [Drosophila sechellia]
Length = 467
Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats.
Identities = 29/79 (36%), Positives = 38/79 (48%), Gaps = 3/79 (3%)
Query: 10 ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKL- 68
+L+DV++GG FP L P +GSA+ WHN + D + CPV LGN+WG
Sbjct: 382 KLSDVQMGGYASFPDLGFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWGSRE 441
Query: 69 --LLSGLTDVELGGATIFP 85
G GG FP
Sbjct: 442 WEFGKGFEGFRAGGNNAFP 460
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 33/56 (58%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L+DV++GG FP L P +GSA+ W+N + D R + CPV LGN+WG
Sbjct: 383 LSDVQMGGYASFPDLGFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWG 438
>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
Length = 216
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus thuringiensis HD-771]
gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-771]
gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
Length = 216
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
Length = 216
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
Length = 248
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
Length = 216
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
Length = 248
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
Length = 216
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
Length = 248
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|194871352|ref|XP_001972832.1| GG13663 [Drosophila erecta]
gi|190654615|gb|EDV51858.1| GG13663 [Drosophila erecta]
Length = 420
Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats.
Identities = 28/59 (47%), Positives = 37/59 (62%), Gaps = 5/59 (8%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
LL L +V+ GGAT+FP+L + VFP++GS +FW+ LD R CPV GNKW
Sbjct: 365 LLFFLDNVKQGGATVFPNLKIAVFPQRGSCLFWHKT-----LDTRNEPLECPVLQGNKW 418
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 5/55 (9%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +V+ GGAT+FP+L + VFP++GS +FWH LD CPV GNKW
Sbjct: 369 LDNVKQGGATVFPNLKIAVFPQRGSCLFWHKT-----LDTRNEPLECPVLQGNKW 418
>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
Length = 216
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
Length = 248
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
Length = 248
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
Length = 248
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
Length = 248
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
Length = 248
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
Length = 248
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
Length = 248
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 218 SLNELTLHGGAPVTKGEKW 236
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|341896599|gb|EGT52534.1| hypothetical protein CAEBREN_17302 [Caenorhabditis brenneri]
Length = 314
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ ++S + ++GGAT+FPS+ TV P G A W+N N +D +H GCP
Sbjct: 218 MGNRMTTFIIS-VEKADVGGATVFPSIKATVRPNPGDAFMWFNLLENQEIDNSAFHGGCP 276
Query: 121 VALGNK 126
V G K
Sbjct: 277 VLAGRK 282
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 32/57 (56%), Gaps = 2/57 (3%)
Query: 8 SVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
SVE DV GGAT+FPS+ TV P G A W N N +D +H GCPV G K
Sbjct: 228 SVEKADV--GGATVFPSIKATVRPNPGDAFMWFNLLENQEIDNSAFHGGCPVLAGRK 282
>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
Length = 232
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 201
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 166 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
Length = 578
Score = 58.9 bits (141), Expect = 7e-07, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 35/56 (62%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L+DV GGAT+FP LN++V P +G A+ WYN + + H+ CP+ G+KW
Sbjct: 505 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWA 560
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 35/56 (62%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L+DV GGAT+FP LN++V P +G A+ W+N + + H+ CP+ G+KW
Sbjct: 505 LSDVPQGGATLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWA 560
>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
Length = 232
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 201
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 166 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
Length = 232
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 145 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 201
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 202 SLNELTLHGGAPVTKGEKW 220
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 166 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|198428011|ref|XP_002120302.1| PREDICTED: similar to prolyl 4-hydroxylase alpha-2 subunit, partial
[Ciona intestinalis]
Length = 233
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 34/60 (56%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L+ +DVE GG T F + P KGSA+FWY+ + D R H+ CPV LGNKW
Sbjct: 145 FLTYFSDVEYGGNTPFVYQEVVAEPIKGSAIFWYDVFNDGSADERTEHAACPVVLGNKWA 204
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 31/56 (55%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
+DVE GG T F + P KGSA+FW++ + D H+ CPV LGNKW
Sbjct: 149 FSDVEYGGNTPFVYQEVVAEPIKGSAIFWYDVFNDGSADERTEHAACPVVLGNKWA 204
>gi|195452772|ref|XP_002073493.1| GK14149 [Drosophila willistoni]
gi|194169578|gb|EDW84479.1| GK14149 [Drosophila willistoni]
Length = 496
Score = 58.5 bits (140), Expect = 7e-07, Method: Composition-based stats.
Identities = 29/57 (50%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 128
L V+ GGAT+FPSLNL V EKG +FWYN + D H GCPV G K G
Sbjct: 416 LNTVKHGGATVFPSLNLKVPAEKGKVLFWYNLDGESFDFDENTEHGGCPVVDGIKLG 472
Score = 55.5 bits (132), Expect = 8e-06, Method: Composition-based stats.
Identities = 28/57 (49%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKWG 66
L V+ GGAT+FPSLNL V EKG +FW+N + D H GCPV G K G
Sbjct: 416 LNTVKHGGATVFPSLNLKVPAEKGKVLFWYNLDGESFDFDENTEHGGCPVVDGIKLG 472
>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
Length = 216
Score = 58.5 bits (140), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 141 NRISTLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 200
Query: 62 GNKW 65
G KW
Sbjct: 201 GEKW 204
>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
Length = 216
Score = 58.5 bits (140), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV
Sbjct: 141 NRISTLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTK 200
Query: 62 GNKW 65
G KW
Sbjct: 201 GEKW 204
>gi|195494568|ref|XP_002094893.1| GE19959 [Drosophila yakuba]
gi|194180994|gb|EDW94605.1| GE19959 [Drosophila yakuba]
Length = 486
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 23/53 (43%), Positives = 36/53 (67%)
Query: 75 DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+V GG T+FP L + V P++G+A+FW+N ++ D R HS CPV +G++W
Sbjct: 391 EVSQGGQTVFPDLKVVVEPKRGNALFWFNKLDDSSPDPRSLHSVCPVIVGSRW 443
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 34/53 (64%)
Query: 13 DVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+V GG T+FP L + V P++G+A+FW N ++ D HS CPV +G++W
Sbjct: 391 EVSQGGQTVFPDLKVVVEPKRGNALFWFNKLDDSSPDPRSLHSVCPVIVGSRW 443
>gi|195166671|ref|XP_002024158.1| GL22696 [Drosophila persimilis]
gi|194107513|gb|EDW29556.1| GL22696 [Drosophila persimilis]
Length = 491
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 24/59 (40%), Positives = 39/59 (66%), Gaps = 6/59 (10%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
+++V+ GG T+FP L+L V P+KGS + W N D+ + H+ CP+ +GN WG+L+
Sbjct: 423 MSNVQQGGETVFPFLSLRVKPQKGSLLLWRNT------DWSVLHNSCPLIIGNMWGELI 475
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 25/66 (37%), Positives = 42/66 (63%), Gaps = 6/66 (9%)
Query: 4 AEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGN 63
A + + +++V+ GG T+FP L+L V P+KGS + W N D+ + H+ CP+ +GN
Sbjct: 416 AATALIFMSNVQQGGETVFPFLSLRVKPQKGSLLLWRNT------DWSVLHNSCPLIIGN 469
Query: 64 KWGKLL 69
WG+L+
Sbjct: 470 MWGELI 475
>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
Length = 229
Score = 58.5 bits (140), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 142 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVNPRKGMAVYFEYFYQDQ 198
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 199 SLNELTLHGGAPVTKGEKW 217
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 163 LNDVEEGGETFFPKLNLSVNPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 217
>gi|194871344|ref|XP_001972830.1| GG13666 [Drosophila erecta]
gi|190654613|gb|EDV51856.1| GG13666 [Drosophila erecta]
Length = 539
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 35/54 (64%)
Query: 74 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+DV LGG T+FP L + V P+KG+A+ W+N + D HS CP+ LG++W
Sbjct: 467 SDVPLGGTTVFPKLKIAVQPKKGNALVWFNLNHAGEPDPLTEHSVCPIVLGSRW 520
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R + + +DV LGG T+FP L + V P+KG+A+ W N + D HS CP+ L
Sbjct: 457 DRIATAVLYASDVPLGGTTVFPKLKIAVQPKKGNALVWFNLNHAGEPDPLTEHSVCPIVL 516
Query: 62 GNKW 65
G++W
Sbjct: 517 GSRW 520
>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
Length = 311
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + G +++ L DV GG T FP + + P KGSAV++ +A+ LDYR H+G PV
Sbjct: 234 GQRIGTVVMY-LNDVPEGGETAFPDIGFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPV 292
Query: 122 ALGNKW 127
G+KW
Sbjct: 293 IRGDKW 298
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DV GG T FP + + P KGSAV++ +A+ LDY H+G PV G+KW
Sbjct: 244 LNDVPEGGETAFPDIGFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPVIRGDKW 298
>gi|405964866|gb|EKC30308.1| KRR1 small subunit processome component-like protein [Crassostrea
gigas]
Length = 885
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 32/55 (58%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+ D + GGAT+FP + + KG A FW+N + D R H GCPV +G+KW
Sbjct: 801 MNDAKAGGATVFPEVRTRIPVAKGGAAFWFNLRPSGATDPRTLHGGCPVLVGSKW 855
Score = 53.1 bits (126), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 30/55 (54%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ D + GGAT+FP + + KG A FW N + D H GCPV +G+KW
Sbjct: 801 MNDAKAGGATVFPEVRTRIPVAKGGAAFWFNLRPSGATDPRTLHGGCPVLVGSKW 855
>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
jacchus]
Length = 544
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F NL+V K +A+FW+N H + D H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPV 515
Query: 122 ALGNKW 127
+GNKW
Sbjct: 516 LVGNKW 521
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F NL+V K +A+FW N H + D H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANLSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLV 517
Query: 62 GNKW 65
GNKW
Sbjct: 518 GNKW 521
>gi|195391756|ref|XP_002054526.1| GJ24503 [Drosophila virilis]
gi|194152612|gb|EDW68046.1| GJ24503 [Drosophila virilis]
Length = 519
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 28/60 (46%), Positives = 36/60 (60%), Gaps = 5/60 (8%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA-----LGNKW 127
L+DV GG T+FP LN+ + P KG+ V WYN H + D R H+GCPV +GN W
Sbjct: 442 LSDVLNGGYTVFPKLNVFLKPVKGAMVSWYNLHRSLNKDSRTLHAGCPVIEGVKRIGNIW 501
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 5/69 (7%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA- 60
NR + L+DV GG T+FP LN+ + P KG+ V W+N H + D H+GCPV
Sbjct: 433 NRIATNMFYLSDVLNGGYTVFPKLNVFLKPVKGAMVSWYNLHRSLNKDSRTLHAGCPVIE 492
Query: 61 ----LGNKW 65
+GN W
Sbjct: 493 GVKRIGNIW 501
>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
B4264]
gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
B4264]
Length = 216
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
++ H G PV G KW
Sbjct: 186 SINELTLHGGAPVTKGEKW 204
Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + ++ H G PV G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSINELTLHGGAPVTKGEKW 204
>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
Length = 272
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/54 (50%), Positives = 34/54 (62%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DVE GG T+FP L L V+P+KGSA+++ N N D R H G PV G K
Sbjct: 206 LSDVEQGGGTVFPGLGLEVYPKKGSALWFLNTDINHQPDKRTLHGGAPVVRGTK 259
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/54 (48%), Positives = 33/54 (61%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
L+DVE GG T+FP L L V+P+KGSA+++ N N D H G PV G K
Sbjct: 206 LSDVEQGGGTVFPGLGLEVYPKKGSALWFLNTDINHQPDKRTLHGGAPVVRGTK 259
>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3, partial [Saimiri boliviensis boliviensis]
Length = 534
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 29/66 (43%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F NL+V K +A+FW+N H + D H+GCPV
Sbjct: 447 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPV 505
Query: 122 ALGNKW 127
+GNKW
Sbjct: 506 LVGNKW 511
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F NL+V K +A+FW N H + D H+GCPV +
Sbjct: 448 NRVATFMIYLSSVEAGGATAFIYANLSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLV 507
Query: 62 GNKW 65
GNKW
Sbjct: 508 GNKW 511
>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Alteromonas sp. S89]
Length = 294
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + G L++ L+DVE GGAT+FP + L V P+KG+A+F+ + LD + H G PV
Sbjct: 216 GQRVGTLIMY-LSDVESGGATVFPRVGLEVQPQKGAALFFSYVGEHGKLDLQSLHGGSPV 274
Query: 122 ALGNKW 127
G KW
Sbjct: 275 LAGEKW 280
Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 34/55 (61%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DVE GGAT+FP + L V P+KG+A+F+ + LD H G PV G KW
Sbjct: 226 LSDVESGGATVFPRVGLEVQPQKGAALFFSYVGEHGKLDLQSLHGGSPVLAGEKW 280
>gi|195110921|ref|XP_002000028.1| GI24861 [Drosophila mojavensis]
gi|193916622|gb|EDW15489.1| GI24861 [Drosophila mojavensis]
Length = 508
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 27/54 (50%), Positives = 35/54 (64%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DV GG T+FP LN+ + P KGS V W N H + D R+ H+GCPV G+K
Sbjct: 431 LSDVLQGGYTVFPFLNVFLRPVKGSLVIWPNVHRSLAPDSRVLHAGCPVLEGSK 484
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 28/63 (44%), Positives = 37/63 (58%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+DV GG T+FP LN+ + P KGS V W N H + D + H+GCPV
Sbjct: 422 NRITTNMFYLSDVLQGGYTVFPFLNVFLRPVKGSLVIWPNVHRSLAPDSRVLHAGCPVLE 481
Query: 62 GNK 64
G+K
Sbjct: 482 GSK 484
>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
Length = 248
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 161 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQ 217
Query: 109 LLDYRMYHSGCPVALGNKW 127
++ H G PV G KW
Sbjct: 218 SINELTLHGGAPVTKGEKW 236
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + ++ H G PV G KW
Sbjct: 182 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSINELTLHGGAPVTKGEKW 236
>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
Length = 212
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 34/56 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L DVE GG T FP LN +V P+KG AV++ + N L+ H G PV +G+KW
Sbjct: 146 LNDVEEGGETYFPKLNFSVSPQKGMAVYFEYFYDNQELNDLTLHGGAPVIIGDKWA 201
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 34/56 (60%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L DVE GG T FP LN +V P+KG AV++ + N L+ H G PV +G+KW
Sbjct: 146 LNDVEEGGETYFPKLNFSVSPQKGMAVYFEYFYDNQELNDLTLHGGAPVIIGDKWA 201
>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
Length = 498
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 27/56 (48%), Positives = 33/56 (58%), Gaps = 4/56 (7%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM--YHSGCPVALGNK 126
L DV GG TIFP + V PEKG + WYN ++N LDY + H CPV +G K
Sbjct: 410 LNDVTRGGGTIFPQVKQAVHPEKGKLILWYNMNSN--LDYELASLHGACPVLIGRK 463
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 26/58 (44%), Positives = 34/58 (58%), Gaps = 4/58 (6%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWM--YHSGCPVALGNK 64
+ L DV GG TIFP + V PEKG + W+N ++N LDY + H CPV +G K
Sbjct: 408 IYLNDVTRGGGTIFPQVKQAVHPEKGKLILWYNMNSN--LDYELASLHGACPVLIGRK 463
>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
Length = 523
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DVE GG+T FP + V P KG A FW+N + D H CPV LG+KW
Sbjct: 449 LNDVERGGSTAFPRAGVAVKPVKGGAAFWFNLKRSGKPDPLTLHGACPVLLGHKW 503
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 30/55 (54%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG+T FP + V P KG A FW N + D H CPV LG+KW
Sbjct: 449 LNDVERGGSTAFPRAGVAVKPVKGGAAFWFNLKRSGKPDPLTLHGACPVLLGHKW 503
>gi|195494570|ref|XP_002094894.1| GE19958 [Drosophila yakuba]
gi|194180995|gb|EDW94606.1| GE19958 [Drosophila yakuba]
Length = 498
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 28/61 (45%), Positives = 38/61 (62%), Gaps = 12/61 (19%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM------YHSGCPVALGNK 126
+ DV GGA +F LNLT++P+KGSA+ W N LD+RM H+ CPV +G+K
Sbjct: 425 MNDVAQGGAFVFLHLNLTIWPQKGSALVWRN------LDHRMQPNEDLLHASCPVIVGSK 478
Query: 127 W 127
W
Sbjct: 479 W 479
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ DV GGA +F LNLT++P+KGSA+ W N + + H+ CPV +G+KW
Sbjct: 425 MNDVAQGGAFVFLHLNLTIWPQKGSALVWRNLDHRMQPNEDLLHASCPVIVGSKW 479
>gi|194905305|ref|XP_001981170.1| GG11767 [Drosophila erecta]
gi|190655808|gb|EDV53040.1| GG11767 [Drosophila erecta]
Length = 536
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ + A+G++ +L L +V+ GGAT FP++NL V +KGSA+FW+N +
Sbjct: 424 DYFDSKTSYLEAVGDRIATVLFY-LNNVDHGGATAFPNINLAVPTQKGSALFWHNLDGKS 482
Query: 109 L-LDYRMYHSGCPVALGNK 126
D R +H CP+ G K
Sbjct: 483 YDYDTRTFHGACPLISGTK 501
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 34/55 (61%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L +V+ GGAT FP++NL V +KGSA+FWHN + D +H CP+ G K
Sbjct: 447 LNNVDHGGATAFPNINLAVPTQKGSALFWHNLDGKSYDYDTRTFHGACPLISGTK 501
>gi|194871364|ref|XP_001972834.1| GG13661 [Drosophila erecta]
gi|190654617|gb|EDV51860.1| GG13661 [Drosophila erecta]
Length = 506
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 37/56 (66%), Gaps = 1/56 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNA-HANTLLDYWMYHSGCPVALGNKW 65
+ DV LGGA +F + NLT+FP+KGSA+ W N H+ + + H CPV +G+KW
Sbjct: 425 MNDVALGGAFVFLNANLTIFPQKGSALVWRNLDHSLQPKEDLLQHLSCPVIVGSKW 480
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 43/68 (63%), Gaps = 2/68 (2%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA-HANTLLDYRMYHSGC 119
LG++ +L + DV LGGA +F + NLT+FP+KGSA+ W N H+ + + H C
Sbjct: 414 LGDRLTSILFF-MNDVALGGAFVFLNANLTIFPQKGSALVWRNLDHSLQPKEDLLQHLSC 472
Query: 120 PVALGNKW 127
PV +G+KW
Sbjct: 473 PVIVGSKW 480
>gi|21358233|ref|NP_651814.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
gi|20269810|gb|AAM18060.1|AF495538_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE3
[Drosophila melanogaster]
gi|15291443|gb|AAK92990.1| GH21465p [Drosophila melanogaster]
gi|23172714|gb|AAN14251.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
gi|220945610|gb|ACL85348.1| PH4alphaNE3-PA [synthetic construct]
gi|220955396|gb|ACL90241.1| PH4alphaNE3-PA [synthetic construct]
Length = 481
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKW 127
L D GGAT+FP LN+ V E+G +FWYN + +T ++ H+ CPV G+KW
Sbjct: 416 LNDAPYGGATVFPRLNVKVPAERGKVLFWYNLNGDTHDMEPTTMHAACPVFHGSKW 471
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKW 65
L D GGAT+FP LN+ V E+G +FW+N + +T ++ H+ CPV G+KW
Sbjct: 416 LNDAPYGGATVFPRLNVKVPAERGKVLFWYNLNGDTHDMEPTTMHAACPVFHGSKW 471
>gi|195061021|ref|XP_001995909.1| GH14207 [Drosophila grimshawi]
gi|193891701|gb|EDV90567.1| GH14207 [Drosophila grimshawi]
Length = 477
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 28/57 (49%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 128
L DV GGATIFP L L V PE+G + W+N T L+ R H CPV +G K G
Sbjct: 405 LNDVARGGATIFPDLELLVQPERGKVLHWHNMDLGTYDLEKRSLHGACPVIMGKKEG 461
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 28/57 (49%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKWG 66
L DV GGATIFP L L V PE+G + WHN T L+ H CPV +G K G
Sbjct: 405 LNDVARGGATIFPDLELLVQPERGKVLHWHNMDLGTYDLEKRSLHGACPVIMGKKEG 461
>gi|281362877|ref|NP_733393.3| CG31016, isoform B [Drosophila melanogaster]
gi|442621939|ref|NP_001263119.1| CG31016, isoform C [Drosophila melanogaster]
gi|272477249|gb|AAF57071.5| CG31016, isoform B [Drosophila melanogaster]
gi|440218076|gb|AGB96498.1| CG31016, isoform C [Drosophila melanogaster]
Length = 536
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
L +V+ GGAT+FP LNL V +KGSA+FW+N + D R +H CP+ G K
Sbjct: 447 LNNVDHGGATVFPKLNLAVPTQKGSALFWHNIDRKSYDYDTRTFHGACPLISGTK 501
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L +V+ GGAT+FP LNL V +KGSA+FWHN + D +H CP+ G K
Sbjct: 447 LNNVDHGGATVFPKLNLAVPTQKGSALFWHNIDRKSYDYDTRTFHGACPLISGTK 501
>gi|159884097|gb|ABX00727.1| IP12176p [Drosophila melanogaster]
Length = 538
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
L +V+ GGAT+FP LNL V +KGSA+FW+N + D R +H CP+ G K
Sbjct: 449 LNNVDHGGATVFPKLNLAVPTQKGSALFWHNIDRKSYDYDTRTFHGACPLISGTK 503
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L +V+ GGAT+FP LNL V +KGSA+FWHN + D +H CP+ G K
Sbjct: 449 LNNVDHGGATVFPKLNLAVPTQKGSALFWHNIDRKSYDYDTRTFHGACPLISGTK 503
>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
Length = 216
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 42/79 (53%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQGQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G PV G KW
Sbjct: 186 SLNELTLHGGAPVTKGEKW 204
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + L+ H G PV G KW
Sbjct: 150 LNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQGQSLNELTLHGGAPVTKGEKW 204
>gi|195494561|ref|XP_002094890.1| GE19962 [Drosophila yakuba]
gi|194180991|gb|EDW94602.1| GE19962 [Drosophila yakuba]
Length = 539
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 25/54 (46%), Positives = 35/54 (64%)
Query: 74 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+DV LGG T+F L L V P+KG+A+ W+N + + D HS CPV LG++W
Sbjct: 467 SDVPLGGTTVFSKLKLAVQPKKGNALVWFNLNHDGKPDPLTEHSVCPVVLGSRW 520
Score = 56.2 bits (134), Expect = 5e-06, Method: Composition-based stats.
Identities = 25/54 (46%), Positives = 34/54 (62%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+DV LGG T+F L L V P+KG+A+ W N + + D HS CPV LG++W
Sbjct: 467 SDVPLGGTTVFSKLKLAVQPKKGNALVWFNLNHDGKPDPLTEHSVCPVVLGSRW 520
>gi|195113263|ref|XP_002001187.1| GI10646 [Drosophila mojavensis]
gi|193917781|gb|EDW16648.1| GI10646 [Drosophila mojavensis]
Length = 471
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVEL GATIFP LNLT+ PEKG+A+ WHN + ++ CPV +K+
Sbjct: 393 LEDVELVGATIFPRLNLTIKPEKGTALLWHNLESCGSSHPKALYAACPVISSSKY 447
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DVEL GATIFP LNLT+ PEKG+A+ W+N + + ++ CPV +K+
Sbjct: 393 LEDVELVGATIFPRLNLTIKPEKGTALLWHNLESCGSSHPKALYAACPVISSSKY 447
>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
Length = 209
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FPSL+ +V P+KGSAV++ + +T L+ H+G PV
Sbjct: 134 NRISTLVMYLNDVEEGGETYFPSLHFSVTPKKGSAVYFEYFYNDTRLNELTLHAGHPVEA 193
Query: 62 GNKW 65
G KW
Sbjct: 194 GEKW 197
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 60 ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
++ N+ L++ L DVE GG T FPSL+ +V P+KGSAV++ + +T L+ H+G
Sbjct: 131 SMNNRISTLVMY-LNDVEEGGETYFPSLHFSVTPKKGSAVYFEYFYNDTRLNELTLHAGH 189
Query: 120 PVALGNKW 127
PV G KW
Sbjct: 190 PVEAGEKW 197
>gi|390178051|ref|XP_002137433.2| GA30144 [Drosophila pseudoobscura pseudoobscura]
gi|388859305|gb|EDY67991.2| GA30144 [Drosophila pseudoobscura pseudoobscura]
Length = 546
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 24/48 (50%), Positives = 32/48 (66%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
L+DV LGG T FP+L L + P +GSA+ W+NA+ DYR H+ CP
Sbjct: 423 LSDVRLGGFTSFPALGLGIRPNRGSALVWHNANNAGNCDYRALHAACP 470
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 24/48 (50%), Positives = 31/48 (64%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP 58
L+DV LGG T FP+L L + P +GSA+ WHNA+ DY H+ CP
Sbjct: 423 LSDVRLGGFTSFPALGLGIRPNRGSALVWHNANNAGNCDYRALHAACP 470
>gi|198449508|ref|XP_002136911.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
gi|198130638|gb|EDY67469.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
Length = 516
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 33/90 (36%), Positives = 44/90 (48%), Gaps = 2/90 (2%)
Query: 38 FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
+ + H + DY+ S A G++ +L L DV+ GG+T F L L V E+G
Sbjct: 402 YGYGGHYDIHFDYFNTSSPITKARGDRMATVLFY-LNDVKHGGSTAFTDLQLKVPSERGK 460
Query: 98 AVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
+FWYN T LD R H CPV G K
Sbjct: 461 VLFWYNMRGETHDLDSRTLHGACPVIDGTK 490
Score = 46.6 bits (109), Expect = 0.003, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L DV+ GG+T F L L V E+G +FW+N T LD H CPV G K
Sbjct: 436 LNDVKHGGSTAFTDLQLKVPSERGKVLFWYNMRGETHDLDSRTLHGACPVIDGTK 490
>gi|195159150|ref|XP_002020445.1| GL13509 [Drosophila persimilis]
gi|194117214|gb|EDW39257.1| GL13509 [Drosophila persimilis]
Length = 554
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 33/90 (36%), Positives = 44/90 (48%), Gaps = 2/90 (2%)
Query: 38 FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
+ + H + DY+ S A G++ +L L DV+ GG+T F L L V E+G
Sbjct: 440 YGYGGHYDIHFDYFNTSSPITKARGDRMATVLFY-LNDVKHGGSTAFTDLQLKVPSERGK 498
Query: 98 AVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
+FWYN T LD R H CPV G K
Sbjct: 499 VLFWYNMRGETHDLDSRTLHGACPVIDGTK 528
Score = 46.6 bits (109), Expect = 0.003, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L DV+ GG+T F L L V E+G +FW+N T LD H CPV G K
Sbjct: 474 LNDVKHGGSTAFTDLQLKVPSERGKVLFWYNMRGETHDLDSRTLHGACPVIDGTK 528
>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 215
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 42/78 (53%), Gaps = 1/78 (1%)
Query: 50 YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
Y + P A N+ L++ L DVE GG T+FP L+L V PE+GSA+++ +
Sbjct: 120 YDFFGPNSPSASNNRISTLIIY-LNDVEAGGETVFPLLDLEVKPERGSALYFEYFYRQQE 178
Query: 110 LDYRMYHSGCPVALGNKW 127
L+ HS PV G KW
Sbjct: 179 LNNLTLHSSVPVVRGEKW 196
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T+FP L+L V PE+GSA+++ + L+ HS PV
Sbjct: 133 NRISTLIIYLNDVEAGGETVFPLLDLEVKPERGSALYFEYFYRQQELNNLTLHSSVPVVR 192
Query: 62 GNKW 65
G KW
Sbjct: 193 GEKW 196
>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
Length = 217
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/56 (48%), Positives = 34/56 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L DVE GG T FP LN +V P+KGSAV++ + N L+ H G PV G+KW
Sbjct: 152 LNDVEEGGETYFPKLNFSVNPQKGSAVYFEYFYDNQDLNDLTLHGGAPVIKGSKWA 207
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/56 (48%), Positives = 34/56 (60%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L DVE GG T FP LN +V P+KGSAV++ + N L+ H G PV G+KW
Sbjct: 152 LNDVEEGGETYFPKLNFSVNPQKGSAVYFEYFYDNQDLNDLTLHGGAPVIKGSKWA 207
>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 296
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 42/80 (52%), Gaps = 1/80 (1%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 107
DY+ P G + G L+ L DV GG T+FP +++ P+KGSAV++ ++
Sbjct: 204 FDYFPQSKVVPEKGGQRVGTFLIY-LNDVPAGGETVFPKAGVSIVPKKGSAVYFQYGNSK 262
Query: 108 TLLDYRMYHSGCPVALGNKW 127
+D HS PV+ G KW
Sbjct: 263 GEVDRMSLHSSIPVSEGEKW 282
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DV GG T+FP +++ P+KGSAV++ ++ +D HS PV+ G KW
Sbjct: 228 LNDVPAGGETVFPKAGVSIVPKKGSAVYFQYGNSKGEVDRMSLHSSIPVSEGEKW 282
>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
Length = 229
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 42/79 (53%)
Query: 50 YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
++ Y S + N L+ L DVE GG T FP LN V P+KG AV++ + +T
Sbjct: 133 HYDYFSSGNSKVNNPRISTLVMYLNDVEAGGETYFPKLNFYVAPKKGMAVYFEYFYNDTT 192
Query: 110 LDYRMYHSGCPVALGNKWG 128
L+ H G PV +G+KW
Sbjct: 193 LNELTLHGGAPVVIGDKWA 211
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 34/56 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L DVE GG T FP LN V P+KG AV++ + +T L+ H G PV +G+KW
Sbjct: 156 LNDVEAGGETYFPKLNFYVAPKKGMAVYFEYFYNDTTLNELTLHGGAPVVIGDKWA 211
>gi|15808767|gb|AAL08490.1|AF369789_1 prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
volvulus]
Length = 571
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 41/67 (61%), Gaps = 2/67 (2%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPS-LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
GN+ L+ LT+ E+GG T+F S L ++V K +A+FWYN N +D R H+ CP
Sbjct: 449 GNRIATFLIY-LTEPEIGGRTVFTSNLKISVPCVKNAALFWYNLMRNGEVDTRSLHAACP 507
Query: 121 VALGNKW 127
VA G KW
Sbjct: 508 VATGIKW 514
Score = 52.0 bits (123), Expect = 7e-05, Method: Composition-based stats.
Identities = 27/65 (41%), Positives = 38/65 (58%), Gaps = 1/65 (1%)
Query: 2 NRAEMSSVELTDVELGGATIFPS-LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
NR + LT+ E+GG T+F S L ++V K +A+FW+N N +D H+ CPVA
Sbjct: 450 NRIATFLIYLTEPEIGGRTVFTSNLKISVPCVKNAALFWYNLMRNGEVDTRSLHAACPVA 509
Query: 61 LGNKW 65
G KW
Sbjct: 510 TGIKW 514
>gi|15808763|gb|AAL08488.1| prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
volvulus]
Length = 571
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 30/67 (44%), Positives = 41/67 (61%), Gaps = 2/67 (2%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPS-LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
GN+ L+ LT+ E+GG T+F S L ++V K +A+FWYN N +D R H+ CP
Sbjct: 449 GNRIATFLIY-LTEPEIGGRTVFTSNLKISVPCVKNAALFWYNLMRNGEVDTRSLHAACP 507
Query: 121 VALGNKW 127
VA G KW
Sbjct: 508 VATGIKW 514
Score = 52.0 bits (123), Expect = 7e-05, Method: Composition-based stats.
Identities = 27/65 (41%), Positives = 38/65 (58%), Gaps = 1/65 (1%)
Query: 2 NRAEMSSVELTDVELGGATIFPS-LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
NR + LT+ E+GG T+F S L ++V K +A+FW+N N +D H+ CPVA
Sbjct: 450 NRIATFLIYLTEPEIGGRTVFTSNLKISVPCVKNAALFWYNLMRNGEVDTRSLHAACPVA 509
Query: 61 LGNKW 65
G KW
Sbjct: 510 TGIKW 514
>gi|328713119|ref|XP_003244997.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
pisum]
Length = 487
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 33/74 (44%), Positives = 42/74 (56%), Gaps = 4/74 (5%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAH-ANTLLDYWMYHSGCPVALGNKWGK-- 67
LTDV+ G T FP LN+ EKGSA+ W+N H ++ L Y H CP+ GNKW K
Sbjct: 411 LTDVQNDGYTSFPMLNIIAPAEKGSALVWNNLHMSDGQLCYESLHGACPLLKGNKWSKYT 470
Query: 68 -LLLSGLTDVELGG 80
LL + L + GG
Sbjct: 471 ALLRAVLKNKSWGG 484
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 31/70 (44%), Positives = 39/70 (55%), Gaps = 2/70 (2%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH-ANTLLDYRMYHSGC 119
GN+ +L LTDV+ G T FP LN+ EKGSA+ W N H ++ L Y H C
Sbjct: 400 FGNRLVTVLFY-LTDVQNDGYTSFPMLNIIAPAEKGSALVWNNLHMSDGQLCYESLHGAC 458
Query: 120 PVALGNKWGK 129
P+ GNKW K
Sbjct: 459 PLLKGNKWSK 468
>gi|194906709|ref|XP_001981416.1| GG11627 [Drosophila erecta]
gi|190656054|gb|EDV53286.1| GG11627 [Drosophila erecta]
Length = 462
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 26/59 (44%), Positives = 33/59 (55%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
L DV++GG FP L P +GSA+ W+N D R + CPV LGN+WGK L
Sbjct: 367 LNDVQMGGYASFPDLGFGFKPVRGSALVWHNTDNCGNCDIRGLQATCPVLLGNQWGKWL 425
Score = 56.6 bits (135), Expect = 4e-06, Method: Composition-based stats.
Identities = 26/60 (43%), Positives = 33/60 (55%)
Query: 10 ELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLL 69
+L DV++GG FP L P +GSA+ WHN D + CPV LGN+WGK L
Sbjct: 366 KLNDVQMGGYASFPDLGFGFKPVRGSALVWHNTDNCGNCDIRGLQATCPVLLGNQWGKWL 425
>gi|443719426|gb|ELU09607.1| hypothetical protein CAPTEDRAFT_229373 [Capitella teleta]
Length = 576
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 35/55 (63%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ V+ GGAT+FP L++ + KG+A FW+N + + H+GCPV LG KW
Sbjct: 494 LSSVKAGGATVFPKLHVRIPAVKGAAAFWFNIRRSGDREPLTQHAGCPVLLGEKW 548
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 34/55 (61%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+ V+ GGAT+FP L++ + KG+A FW N + + H+GCPV LG KW
Sbjct: 494 LSSVKAGGATVFPKLHVRIPAVKGAAAFWFNIRRSGDREPLTQHAGCPVLLGEKW 548
>gi|291235355|ref|XP_002737609.1| PREDICTED: GI24860-like [Saccoglossus kowalevskii]
Length = 85
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 35/55 (63%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DV+ GG T F ++ V P K +AVFW+N + DYR H+ CPV LG+KW
Sbjct: 5 LSDVDAGGYTAFTAIKTIVKPIKNAAVFWWNLLPSGERDYRTLHAACPVLLGSKW 59
Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 26/58 (44%), Positives = 34/58 (58%)
Query: 8 SVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L+DV+ GG T F ++ V P K +AVFW N + DY H+ CPV LG+KW
Sbjct: 2 DVLLSDVDAGGYTAFTAIKTIVKPIKNAAVFWWNLLPSGERDYRTLHAACPVLLGSKW 59
>gi|194765184|ref|XP_001964707.1| GF22906 [Drosophila ananassae]
gi|190614979|gb|EDV30503.1| GF22906 [Drosophila ananassae]
Length = 708
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 26/63 (41%), Positives = 39/63 (61%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R S L++VE GG T+F LN+ + P +G+ V WHN H + +D H+GCPV +
Sbjct: 614 DRISTSMFYLSNVEQGGYTVFTKLNVFLPPIQGAMVMWHNLHRSLDVDARTLHAGCPVLV 673
Query: 62 GNK 64
G+K
Sbjct: 674 GSK 676
Score = 56.2 bits (134), Expect = 5e-06, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 37/54 (68%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L++VE GG T+F LN+ + P +G+ V W+N H + +D R H+GCPV +G+K
Sbjct: 623 LSNVEQGGYTVFTKLNVFLPPIQGAMVMWHNLHRSLDVDARTLHAGCPVLVGSK 676
>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
garnettii]
Length = 544
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 28/66 (42%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H N D H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRNGEGDSDTLHAGCPV 515
Query: 122 ALGNKW 127
+G+KW
Sbjct: 516 LVGDKW 521
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H N D H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRNGEGDSDTLHAGCPVLV 517
Query: 62 GNKW 65
G+KW
Sbjct: 518 GDKW 521
>gi|24666354|ref|NP_730347.1| CG32199 [Drosophila melanogaster]
gi|23093193|gb|AAF49251.3| CG32199 [Drosophila melanogaster]
Length = 509
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 34/54 (62%)
Query: 74 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+D+ LGG T+F L + V P+KGSA+ W+N + D HS CPV LG++W
Sbjct: 437 SDIPLGGTTVFTKLKIAVQPKKGSALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R + + +D+ LGG T+F L + V P+KGSA+ W N + D HS CPV L
Sbjct: 427 DRIATALIYASDIPLGGTTVFTKLKIAVQPKKGSALIWFNLNHAGEPDPLTEHSVCPVVL 486
Query: 62 GNKW 65
G++W
Sbjct: 487 GSRW 490
>gi|194905313|ref|XP_001981171.1| GG11766 [Drosophila erecta]
gi|190655809|gb|EDV53041.1| GG11766 [Drosophila erecta]
Length = 496
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKW 127
L D GGAT+FP +N+ V E+G +FWYN T +D + H+ CPV G+KW
Sbjct: 416 LNDAPNGGATVFPRINVKVPAERGKVLFWYNLDGETHDVDPKTVHAACPVFHGSKW 471
Score = 52.4 bits (124), Expect = 5e-05, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKW 65
L D GGAT+FP +N+ V E+G +FW+N T +D H+ CPV G+KW
Sbjct: 416 LNDAPNGGATVFPRINVKVPAERGKVLFWYNLDGETHDVDPKTVHAACPVFHGSKW 471
>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
Length = 220
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 39/64 (60%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T+FP L+L+VFP KG AV++ ++N L+ + H+G V
Sbjct: 143 NRISTLVMYLNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYSNQELNDFTLHAGTQVIH 202
Query: 62 GNKW 65
G KW
Sbjct: 203 GEKW 206
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 50 YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
Y + + N+ L++ L DVE GG T+FP L+L+VFP KG AV++ ++N
Sbjct: 130 YDFFAETSRASTNNRISTLVMY-LNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYSNQE 188
Query: 110 LDYRMYHSGCPVALGNKW 127
L+ H+G V G KW
Sbjct: 189 LNDFTLHAGTQVIHGEKW 206
>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
Length = 221
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 36/72 (50%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLL 70
L DVE GG T FP + + V P KG AV ++N N +D H G PV GNKW + L
Sbjct: 149 LCDVEEGGETTFPEVGIKVKPIKGDAVLFYNCKPNGDVDPLSLHQGDPVLKGNKWVAIKL 208
Query: 71 SGLTDVELGGAT 82
E G T
Sbjct: 209 INQKSKETGTPT 220
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/59 (47%), Positives = 33/59 (55%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+L L DVE GG T FP + + V P KG AV +YN N +D H G PV GNKW
Sbjct: 145 ILMYLCDVEEGGETTFPEVGIKVKPIKGDAVLFYNCKPNGDVDPLSLHQGDPVLKGNKW 203
>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
Length = 216
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 34/56 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L DVE GG T FP LN +V P+KG AV++ + + L+ H G PV +G+KW
Sbjct: 151 LNDVEQGGETYFPKLNFSVSPQKGMAVYFEYFYNDQNLNDLTLHGGAPVVMGDKWA 206
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 42/79 (53%)
Query: 50 YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
++ + S A N L+ L DVE GG T FP LN +V P+KG AV++ + +
Sbjct: 128 HFDFFSSTSRAASNPRISTLVMYLNDVEQGGETYFPKLNFSVSPQKGMAVYFEYFYNDQN 187
Query: 110 LDYRMYHSGCPVALGNKWG 128
L+ H G PV +G+KW
Sbjct: 188 LNDLTLHGGAPVVMGDKWA 206
>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
4-dioxygenase (proline 4-hydroxylase), alpha 1
polypeptide [Ciona intestinalis]
Length = 195
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+GN+ L+ +++VE GG+T+F + V P KGSAVFWYN + D R H+ CP
Sbjct: 102 VGNRIA-TFLTYMSNVEQGGSTVFLHPGIAVRPIKGSAVFWYNLLPSGAGDERTRHAACP 160
Query: 121 VALGNKW 127
V G KW
Sbjct: 161 VLTGVKW 167
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+++VE GG+T+F + V P KGSAVFW+N + D H+ CPV G KW
Sbjct: 113 MSNVEQGGSTVFLHPGIAVRPIKGSAVFWYNLLPSGAGDERTRHAACPVLTGVKW 167
>gi|402584932|gb|EJW78873.1| hypothetical protein WUBG_10221 [Wuchereria bancrofti]
Length = 187
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L+ +T E+GG T+F L ++ K +A+FWYN + +D R YH+ CPV
Sbjct: 69 GNRIATILIY-MTKPEIGGGTVFIDLKTSISCTKNAALFWYNLMRSGAVDIRSYHAACPV 127
Query: 122 ALGNKW 127
G KW
Sbjct: 128 LTGTKW 133
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + +T E+GG T+F L ++ K +A+FW+N + +D YH+ CPV
Sbjct: 70 NRIATILIYMTKPEIGGGTVFIDLKTSISCTKNAALFWYNLMRSGAVDIRSYHAACPVLT 129
Query: 62 GNKW 65
G KW
Sbjct: 130 GTKW 133
>gi|198449528|ref|XP_002136919.1| GA26870 [Drosophila pseudoobscura pseudoobscura]
gi|198130648|gb|EDY67477.1| GA26870 [Drosophila pseudoobscura pseudoobscura]
Length = 491
Score = 56.6 bits (135), Expect = 4e-06, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++V GG T+FP+L + V P+ G+A+ +YN + + D R H GCPV GNKW
Sbjct: 419 LSNVSQGGYTVFPNLEVAVKPQAGTALVFYNLLDSLVGDTRTRHFGCPVIDGNKW 473
Score = 52.0 bits (123), Expect = 7e-05, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L++V GG T+FP+L + V P+ G+A+ ++N + + D H GCPV
Sbjct: 410 NRLFTIQIFLSNVSQGGYTVFPNLEVAVKPQAGTALVFYNLLDSLVGDTRTRHFGCPVID 469
Query: 62 GNKW 65
GNKW
Sbjct: 470 GNKW 473
>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
Length = 216
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 42/79 (53%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G V G KW
Sbjct: 186 SLNELTLHGGASVTKGEKW 204
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G V G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGASVTKGEKW 204
>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
Length = 216
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 42/79 (53%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ HS A N+ L++ L DVE GG T FP LNL+V P KG AV++ + +
Sbjct: 129 DYFAEHSRS--AANNRISTLVMY-LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQ 185
Query: 109 LLDYRMYHSGCPVALGNKW 127
L+ H G V G KW
Sbjct: 186 SLNELTLHGGASVTKGEKW 204
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP LNL+V P KG AV++ + + L+ H G V G KW
Sbjct: 150 LNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGASVTKGEKW 204
>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3 [Papio anubis]
Length = 535
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F NL+V K +A+FW+N H + D H+GCPV
Sbjct: 448 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPV 506
Query: 122 ALGNKW 127
+G+KW
Sbjct: 507 LVGDKW 512
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F NL+V K +A+FW N H + D H+GCPV +
Sbjct: 449 NRVATFMIYLSSVEAGGATAFIYANLSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLV 508
Query: 62 GNKW 65
G+KW
Sbjct: 509 GDKW 512
>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Collimonas fungivorans Ter331]
gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
[Collimonas fungivorans Ter331]
Length = 289
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 39/69 (56%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+A G + L+ L DV+ GG TIFP L++ P KGSA+++ +A LD +H G
Sbjct: 209 LARGGQRTATLIIYLNDVDGGGETIFPRNGLSIVPAKGSAIYFSYTNAENQLDSLSFHGG 268
Query: 119 CPVALGNKW 127
PV G KW
Sbjct: 269 SPVIEGEKW 277
Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 34/57 (59%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ L DV+ GG TIFP L++ P KGSA+++ +A LD +H G PV G KW
Sbjct: 221 IYLNDVDGGGETIFPRNGLSIVPAKGSAIYFSYTNAENQLDSLSFHGGSPVIEGEKW 277
>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
C-169]
Length = 222
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 62/144 (43%), Gaps = 22/144 (15%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
+NR E +E + + T P N +G + + A + +H
Sbjct: 59 LNRGEDDVIERIEARIAKYTAIPKENG-----EGLQILHYQASEEYRPHFDYFHDNFNTQ 113
Query: 61 LGNKWGKLLLSGLTDVELGGATIFP------SLNLTVF-----------PEKGSAVFWYN 103
G + +L L+DVE GG T+FP ++ T F P+KG A+F+Y+
Sbjct: 114 NGGQRIATMLMYLSDVEDGGETVFPESSDKPNVGNTKFSQCAQAGAAAKPKKGDALFFYS 173
Query: 104 AHANTLLDYRMYHSGCPVALGNKW 127
+ +D + H+GCPV G+KW
Sbjct: 174 LTPDGRMDEKSLHAGCPVMKGDKW 197
>gi|195166673|ref|XP_002024159.1| GL22695 [Drosophila persimilis]
gi|194107514|gb|EDW29557.1| GL22695 [Drosophila persimilis]
Length = 360
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 37/67 (55%), Gaps = 8/67 (11%)
Query: 5 EMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
+ ++L++V GG FPSLNL V P+KGS + W YH CPV +GNK
Sbjct: 300 QFDDLQLSNVRQGGEIHFPSLNLRVTPQKGSLLVWPRTGD--------YHIECPVIVGNK 351
Query: 65 WGKLLLS 71
WGK +S
Sbjct: 352 WGKSEIS 358
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 34/61 (55%), Gaps = 8/61 (13%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLL 132
L++V GG FPSLNL V P+KGS + W YH CPV +GNKWGK +
Sbjct: 306 LSNVRQGGEIHFPSLNLRVTPQKGSLLVWPRTGD--------YHIECPVIVGNKWGKSEI 357
Query: 133 S 133
S
Sbjct: 358 S 358
>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
pallidum PN500]
Length = 251
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/59 (47%), Positives = 33/59 (55%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+L L DVE GG T FP +N+ V P KG AV +YN N +D H G PV G KW
Sbjct: 177 ILMYLADVEEGGETSFPEVNVKVKPIKGDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKW 235
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP +N+ V P KG AV ++N N +D H G PV G KW
Sbjct: 181 LADVEEGGETSFPEVNVKVKPIKGDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKW 235
>gi|386766694|ref|NP_651648.5| CG11828 [Drosophila melanogaster]
gi|383293009|gb|AAF56834.5| CG11828 [Drosophila melanogaster]
Length = 458
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 26/63 (41%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW-GKLL 131
L+DV++GG FP L P +GSA+ W+N + D R + CPV LGN+W K
Sbjct: 385 LSDVQMGGYASFPDLGFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWVAKKW 444
Query: 132 LSG 134
+SG
Sbjct: 445 ISG 447
Score = 55.5 bits (132), Expect = 8e-06, Method: Composition-based stats.
Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 1/71 (1%)
Query: 3 RAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG 62
R+ + L+DV++GG FP L P +GSA+ WHN + D + CPV LG
Sbjct: 377 RSASAMFYLSDVQMGGYASFPDLGFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLG 436
Query: 63 NKW-GKLLLSG 72
N+W K +SG
Sbjct: 437 NQWVAKKWISG 447
>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
Length = 210
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 35/56 (62%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L DVE GG T FP+L L+V P KG AV++ + + +L+ H G PV +G+KW
Sbjct: 144 LNDVEEGGDTYFPNLKLSVSPHKGMAVYFEYFYDDPMLNELTLHGGAPVTIGDKWA 199
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 35/56 (62%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L DVE GG T FP+L L+V P KG AV++ + + +L+ H G PV +G+KW
Sbjct: 144 LNDVEEGGDTYFPNLKLSVSPHKGMAVYFEYFYDDPMLNELTLHGGAPVTIGDKWA 199
>gi|195352184|ref|XP_002042594.1| GM14981 [Drosophila sechellia]
gi|194124478|gb|EDW46521.1| GM14981 [Drosophila sechellia]
Length = 539
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 34/54 (62%)
Query: 74 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+DV LGG T+F L + V P+KG+A+ W+N + D HS CPV LG++W
Sbjct: 467 SDVPLGGTTVFTKLKIAVKPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 520
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 33/54 (61%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+DV LGG T+F L + V P+KG+A+ W N + D HS CPV LG++W
Sbjct: 467 SDVPLGGTTVFTKLKIAVKPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 520
>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
Length = 249
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++DV GGAT+FP L + + P KG+A W+N + + D R H+ CPV
Sbjct: 156 GNRIATVLFY-MSDVAQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPV 214
Query: 122 ALGNKW 127
G+KW
Sbjct: 215 LQGSKW 220
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
++DV GGAT+FP L + + P KG+A W N + + D H+ CPV G+KW
Sbjct: 166 MSDVAQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKW 220
>gi|195055777|ref|XP_001994789.1| GH14121 [Drosophila grimshawi]
gi|193892552|gb|EDV91418.1| GH14121 [Drosophila grimshawi]
Length = 517
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 26/60 (43%), Positives = 36/60 (60%), Gaps = 5/60 (8%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA-----LGNKW 127
L+DV+ GG T+FP+ N+ V P KGS + WYN + D+R H+GC V +GN W
Sbjct: 440 LSDVQQGGYTVFPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIEGIKRIGNIW 499
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 39/69 (56%), Gaps = 5/69 (7%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA- 60
+R S + L+DV+ GG T+FP+ N+ V P KGS + W+N + D+ H+GC V
Sbjct: 431 DRISTSMIYLSDVQQGGYTVFPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIE 490
Query: 61 ----LGNKW 65
+GN W
Sbjct: 491 GIKRIGNIW 499
>gi|195069799|ref|XP_001997030.1| GH12979 [Drosophila grimshawi]
gi|193891499|gb|EDV90365.1| GH12979 [Drosophila grimshawi]
Length = 517
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 26/60 (43%), Positives = 36/60 (60%), Gaps = 5/60 (8%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA-----LGNKW 127
L+DV+ GG T+FP+ N+ V P KGS + WYN + D+R H+GC V +GN W
Sbjct: 440 LSDVQQGGYTVFPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIEGIKRIGNIW 499
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 39/69 (56%), Gaps = 5/69 (7%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA- 60
+R S + L+DV+ GG T+FP+ N+ V P KGS + W+N + D+ H+GC V
Sbjct: 431 DRISTSMIYLSDVQQGGYTVFPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIE 490
Query: 61 ----LGNKW 65
+GN W
Sbjct: 491 GIKRIGNIW 499
>gi|195591304|ref|XP_002085382.1| GD14758 [Drosophila simulans]
gi|194197391|gb|EDX10967.1| GD14758 [Drosophila simulans]
Length = 509
Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 34/54 (62%)
Query: 74 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+DV LGG T+F L + V P+KG+A+ W+N + D HS CPV LG++W
Sbjct: 437 SDVPLGGTTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 33/54 (61%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+DV LGG T+F L + V P+KG+A+ W N + D HS CPV LG++W
Sbjct: 437 SDVPLGGTTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490
>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 533
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 40/68 (58%), Gaps = 3/68 (4%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSL--NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
GN+ +LL L+DV+ GG T+F + + P KG+ VFWYN N + + H+GC
Sbjct: 444 GNRIATMLLY-LSDVDWGGRTVFTNTAPGVGTDPIKGAGVFWYNLLRNGKSNPKTQHAGC 502
Query: 120 PVALGNKW 127
PV LG KW
Sbjct: 503 PVVLGQKW 510
Score = 50.1 bits (118), Expect = 3e-04, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 36/66 (54%), Gaps = 2/66 (3%)
Query: 2 NRAEMSSVELTDVELGGATIFPSL--NLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPV 59
NR + L+DV+ GG T+F + + P KG+ VFW+N N + H+GCPV
Sbjct: 445 NRIATMLLYLSDVDWGGRTVFTNTAPGVGTDPIKGAGVFWYNLLRNGKSNPKTQHAGCPV 504
Query: 60 ALGNKW 65
LG KW
Sbjct: 505 VLGQKW 510
>gi|198466405|ref|XP_001353987.2| GA16752 [Drosophila pseudoobscura pseudoobscura]
gi|198150585|gb|EAL29723.2| GA16752 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 27/74 (36%), Positives = 42/74 (56%), Gaps = 4/74 (5%)
Query: 58 PVALGNKWGKLLLSGL----TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR 113
P +WG +L+ + +DV LGGA +FP L ++V P+KG+A+ W N + D
Sbjct: 418 PANYFTEWGGDVLATVLLYASDVRLGGAVVFPKLKISVEPKKGNALIWDNLNNAGNPDKL 477
Query: 114 MYHSGCPVALGNKW 127
H+ CPV +G+ W
Sbjct: 478 SKHAVCPVVMGSHW 491
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 23/54 (42%), Positives = 34/54 (62%)
Query: 12 TDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+DV LGGA +FP L ++V P+KG+A+ W N + D H+ CPV +G+ W
Sbjct: 438 SDVRLGGAVVFPKLKISVEPKKGNALIWDNLNNAGNPDKLSKHAVCPVVMGSHW 491
>gi|195069738|ref|XP_001997014.1| GH23597 [Drosophila grimshawi]
gi|193892024|gb|EDV90890.1| GH23597 [Drosophila grimshawi]
Length = 239
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 32/55 (58%), Gaps = 1/55 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
L DV GGATIFP L L V PE+G + W+N T L+ R H CPV +G K
Sbjct: 165 LNDVARGGATIFPDLELLVQPERGKVLHWHNMDLGTYDLEKRSLHGACPVIMGKK 219
Score = 55.1 bits (131), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L DV GGATIFP L L V PE+G + WHN T L+ H CPV +G K
Sbjct: 165 LNDVARGGATIFPDLELLVQPERGKVLHWHNMDLGTYDLEKRSLHGACPVIMGKK 219
>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
Length = 544
Score = 55.5 bits (132), Expect = 6e-06, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F NL+V + +A+FW+N H + D H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 515
Query: 122 ALGNKW 127
+G+KW
Sbjct: 516 LVGDKW 521
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F NL+V + +A+FW N H + D H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLV 517
Query: 62 GNKW 65
G+KW
Sbjct: 518 GDKW 521
>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
Length = 215
Score = 55.5 bits (132), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T+FP L+L+VFP KG AV++ + N ++ + H+G V
Sbjct: 138 NRISTLVMYLNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYRNQEVNEFTLHAGAQVIH 197
Query: 62 GNKW 65
G KW
Sbjct: 198 GEKW 201
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 1/78 (1%)
Query: 50 YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 109
Y + + N+ L++ L DVE GG T+FP L+L+VFP KG AV++ + N
Sbjct: 125 YDFFAETSRASTNNRISTLVMY-LNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYRNQE 183
Query: 110 LDYRMYHSGCPVALGNKW 127
++ H+G V G KW
Sbjct: 184 VNEFTLHAGAQVIHGEKW 201
>gi|195494565|ref|XP_002094892.1| GE19961 [Drosophila yakuba]
gi|194180993|gb|EDW94604.1| GE19961 [Drosophila yakuba]
Length = 441
Score = 55.5 bits (132), Expect = 6e-06, Method: Composition-based stats.
Identities = 28/60 (46%), Positives = 35/60 (58%), Gaps = 5/60 (8%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
LL L DV+ GGAT+FP L + VFP+KGS +FW + LD CPV N+WG
Sbjct: 386 LLFFLEDVKQGGATVFPKLKIAVFPQKGSCLFW-----QSTLDPGSKPLECPVLQRNRWG 440
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 5/56 (8%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L DV+ GGAT+FP L + VFP+KGS +FW + LD CPV N+WG
Sbjct: 390 LEDVKQGGATVFPKLKIAVFPQKGSCLFWQST-----LDPGSKPLECPVLQRNRWG 440
>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
Length = 528
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F NL+V + +A+FW+N H + D H+GCPV
Sbjct: 441 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 499
Query: 122 ALGNKW 127
+G+KW
Sbjct: 500 LVGDKW 505
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F NL+V + +A+FW N H + D H+GCPV +
Sbjct: 442 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLV 501
Query: 62 GNKW 65
G+KW
Sbjct: 502 GDKW 505
>gi|308508235|ref|XP_003116301.1| CRE-PHY-3 protein [Caenorhabditis remanei]
gi|308251245|gb|EFO95197.1| CRE-PHY-3 protein [Caenorhabditis remanei]
Length = 253
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 49/110 (44%), Gaps = 6/110 (5%)
Query: 22 FPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP-----VALGNKWGKLLLSGLTDV 76
P+LN T + + H + DY Y S + N+ G L+ L
Sbjct: 97 IPALNFTSAELFSALSYLPGGHYSVHYDYLTYRSEKEHDWWMKNMRNRIGTLIFV-LKTA 155
Query: 77 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
E GG T+FPS+ T+ G A FW+N A+ + R+ H GCP+ G K
Sbjct: 156 EKGGGTVFPSIGTTIRINPGDAFFWFNTQADESQEMRVNHGGCPIYEGRK 205
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 43/94 (45%), Gaps = 10/94 (10%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK-----W 65
L E GG T+FPS+ T+ G A FW N A+ + + H GCP+ G K W
Sbjct: 152 LKTAEKGGGTVFPSIGTTIRINPGDAFFWFNTQADESQEMRVNHGGCPIYEGRKVITTLW 211
Query: 66 GK-----LLLSGLTDVELGGATIFPSLNLTVFPE 94
+ +L TD + + + PS + T PE
Sbjct: 212 VRAKNQPILPMASTDSSIHASWLIPSFSSTFRPE 245
>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
Length = 219
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 39/75 (52%)
Query: 53 YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
Y + A N L+ L DVE GG T FP L L+V P KG AV++ +++ L+
Sbjct: 132 YFTSASKASKNNRISTLVMYLNDVEEGGETYFPKLGLSVSPTKGMAVYFEYFYSDAELND 191
Query: 113 RMYHSGCPVALGNKW 127
R H G PV G KW
Sbjct: 192 RTLHGGAPVIKGEKW 206
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP L L+V P KG AV++ +++ L+ H G PV G KW
Sbjct: 152 LNDVEEGGETYFPKLGLSVSPTKGMAVYFEYFYSDAELNDRTLHGGAPVIKGEKW 206
>gi|38454288|ref|NP_942070.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Rattus norvegicus]
gi|81870816|sp|Q6W3E9.1|P4HA3_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962768|gb|AAQ87605.1| collagen prolyl 4-hydroxylase alpha III subunit [Rattus norvegicus]
Length = 544
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 457 GNRAATLMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDDDTLHAGCPV 515
Query: 122 ALGNKW 127
+G+KW
Sbjct: 516 LVGDKW 521
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 28/64 (43%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NRA + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 458 NRAATLMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDDDTLHAGCPVLV 517
Query: 62 GNKW 65
G+KW
Sbjct: 518 GDKW 521
>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
Length = 278
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 2/90 (2%)
Query: 38 FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
+ H + DY +H +A G +L L++V GG T+FP + + P+KG+
Sbjct: 130 YEHGQKYDAHFDY--FHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQVCLKPKKGN 187
Query: 98 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
A+ ++N + + D H GCPV G KW
Sbjct: 188 ALLFFNLQQDAIPDPFSLHGGCPVIEGEKW 217
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L++V GG T+FP + + P+KG+A+ + N + + D + H GCPV G KW
Sbjct: 163 LSNVTKGGETVFPDAQVCLKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKW 217
>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
leucogenys]
Length = 544
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F NL+V + +A+FW+N H + D H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 515
Query: 122 ALGNKW 127
+G+KW
Sbjct: 516 LVGDKW 521
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F NL+V + +A+FW N H + D H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLV 517
Query: 62 GNKW 65
G+KW
Sbjct: 518 GDKW 521
>gi|195128345|ref|XP_002008624.1| GI13596 [Drosophila mojavensis]
gi|193920233|gb|EDW19100.1| GI13596 [Drosophila mojavensis]
Length = 527
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 23/49 (46%), Positives = 33/49 (67%)
Query: 79 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
GGAT+FP++ +TV P+KGSA+ WYN + + HS CPV G++W
Sbjct: 453 GGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLTLHSVCPVISGSRW 501
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/49 (44%), Positives = 33/49 (67%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GGAT+FP++ +TV P+KGSA+ W+N + + HS CPV G++W
Sbjct: 453 GGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLTLHSVCPVISGSRW 501
>gi|195128343|ref|XP_002008623.1| GI13594 [Drosophila mojavensis]
gi|193920232|gb|EDW19099.1| GI13594 [Drosophila mojavensis]
Length = 511
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 22/49 (44%), Positives = 33/49 (67%)
Query: 79 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
GGAT+FP++ +TV P+KGSA+ WYN + + H+ CPV G++W
Sbjct: 441 GGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLSLHTACPVISGSRW 489
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 21/49 (42%), Positives = 33/49 (67%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GGAT+FP++ +TV P+KGSA+ W+N + + H+ CPV G++W
Sbjct: 441 GGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLSLHTACPVISGSRW 489
>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
gorilla gorilla]
Length = 517
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F NL+V + +A+FW+N H + D H+GCPV
Sbjct: 430 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 488
Query: 122 ALGNKW 127
+G+KW
Sbjct: 489 LVGDKW 494
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F NL+V + +A+FW N H + D H+GCPV +
Sbjct: 431 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLV 490
Query: 62 GNKW 65
G+KW
Sbjct: 491 GDKW 494
>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
KWC4]
Length = 215
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 5/83 (6%)
Query: 49 DYWMYHS----GCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 104
+Y +H G P A N+ L++ L DVE GG T+FP L + + P++G+A+++
Sbjct: 115 EYKAHHDFFAPGSPAARNNRISTLIVY-LNDVEEGGETVFPLLGIAMKPKRGAALYFEYF 173
Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
+ N L+ HS PV G KW
Sbjct: 174 YRNQALNDLTLHSSVPVVRGEKW 196
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L DVE GG T+FP L + + P++G+A+++ + N L+ HS PV
Sbjct: 133 NRISTLIVYLNDVEEGGETVFPLLGIAMKPKRGAALYFEYFYRNQALNDLTLHSSVPVVR 192
Query: 62 GNKW 65
G KW
Sbjct: 193 GEKW 196
>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
Length = 219
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 39/75 (52%)
Query: 53 YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
Y + A N L+ L DVE GG T FP L L+V P KG AV++ +++ L+
Sbjct: 132 YFTSASKASKNNRISTLVMYLNDVEEGGETYFPKLGLSVSPTKGMAVYFEYFYSDAELND 191
Query: 113 RMYHSGCPVALGNKW 127
R H G PV G KW
Sbjct: 192 RTLHGGAPVIKGEKW 206
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP L L+V P KG AV++ +++ L+ H G PV G KW
Sbjct: 152 LNDVEEGGETYFPKLGLSVSPTKGMAVYFEYFYSDAELNDRTLHGGAPVIKGEKW 206
>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III, isoform CRA_b
[Homo sapiens]
gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
Length = 544
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F NL+V + +A+FW+N H + D H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 515
Query: 122 ALGNKW 127
+G+KW
Sbjct: 516 LVGDKW 521
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F NL+V + +A+FW N H + D H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLV 517
Query: 62 GNKW 65
G+KW
Sbjct: 518 GDKW 521
>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 309
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 44/107 (41%), Gaps = 30/107 (28%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN-------------------- 88
DY +H V G + +L L+DVE GG T+FP
Sbjct: 181 DY--FHDEVNVKNGGQRAATVLMYLSDVEEGGETVFPRGTPLGGAAPEKSGVTQGNACER 238
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L V P +G A+ ++N H N +D R H+GCPV G KW
Sbjct: 239 ALRGDPNVLAVKPRRGDALLFFNVHLNGEVDERARHAGCPVVRGTKW 285
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 33/83 (39%), Gaps = 28/83 (33%)
Query: 11 LTDVELGGATIFPSLN----------------------------LTVFPEKGSAVFWHNA 42
L+DVE GG T+FP L V P +G A+ + N
Sbjct: 203 LSDVEEGGETVFPRGTPLGGAAPEKSGVTQGNACERALRGDPNVLAVKPRRGDALLFFNV 262
Query: 43 HANTLLDYWMYHSGCPVALGNKW 65
H N +D H+GCPV G KW
Sbjct: 263 HLNGEVDERARHAGCPVVRGTKW 285
>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 217
Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 40/69 (57%)
Query: 60 ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
A+ N L+ L+DVE GG T FP L+ +V P+KG AV++ + + L+ H G
Sbjct: 138 AINNPRISTLVMYLSDVEQGGETYFPKLHFSVSPQKGMAVYFEYFYNDQTLNELTLHGGA 197
Query: 120 PVALGNKWG 128
PV +G+KW
Sbjct: 198 PVIVGDKWA 206
Score = 55.1 bits (131), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L+DVE GG T FP L+ +V P+KG AV++ + + L+ H G PV +G+KW
Sbjct: 151 LSDVEQGGETYFPKLHFSVSPQKGMAVYFEYFYNDQTLNELTLHGGAPVIVGDKWA 206
>gi|116496629|gb|AAI26171.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
Length = 544
Score = 55.5 bits (132), Expect = 8e-06, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F NL+V + +A+FW+N H + D H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 515
Query: 122 ALGNKW 127
+G+KW
Sbjct: 516 LVGDKW 521
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F NL+V + +A+FW N H + D H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLV 517
Query: 62 GNKW 65
G+KW
Sbjct: 518 GDKW 521
>gi|20177086|gb|AAM12247.1| AT28279p [Drosophila melanogaster]
Length = 509
Score = 55.5 bits (132), Expect = 8e-06, Method: Composition-based stats.
Identities = 23/54 (42%), Positives = 34/54 (62%)
Query: 74 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+D+ LGG T+F L + V P+KG+A+ W+N + D HS CPV LG++W
Sbjct: 437 SDIPLGGTTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490
Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 37/64 (57%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R + + +D+ LGG T+F L + V P+KG+A+ W N + D HS CPV L
Sbjct: 427 DRIATALIYASDIPLGGTTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVL 486
Query: 62 GNKW 65
G++W
Sbjct: 487 GSRW 490
>gi|195069797|ref|XP_001997029.1| GH12978 [Drosophila grimshawi]
gi|193891498|gb|EDV90364.1| GH12978 [Drosophila grimshawi]
Length = 518
Score = 55.5 bits (132), Expect = 8e-06, Method: Composition-based stats.
Identities = 26/63 (41%), Positives = 37/63 (58%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R + + L+DV+ GG T+FP+ N+ V P KG+ V WHN + D H+GCPV
Sbjct: 432 DRISTNMIYLSDVQQGGYTVFPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIE 491
Query: 62 GNK 64
G K
Sbjct: 492 GTK 494
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 25/54 (46%), Positives = 34/54 (62%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DV+ GG T+FP+ N+ V P KG+ V W+N + D R H+GCPV G K
Sbjct: 441 LSDVQQGGYTVFPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIEGTK 494
>gi|195069795|ref|XP_001997028.1| GH12977 [Drosophila grimshawi]
gi|193891497|gb|EDV90363.1| GH12977 [Drosophila grimshawi]
Length = 517
Score = 55.5 bits (132), Expect = 8e-06, Method: Composition-based stats.
Identities = 26/63 (41%), Positives = 37/63 (58%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R + + L+DV+ GG T+FP+ N+ V P KG+ V WHN + D H+GCPV
Sbjct: 431 DRISTNMIYLSDVQQGGYTVFPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIE 490
Query: 62 GNK 64
G K
Sbjct: 491 GTK 493
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 25/54 (46%), Positives = 34/54 (62%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DV+ GG T+FP+ N+ V P KG+ V W+N + D R H+GCPV G K
Sbjct: 440 LSDVQQGGYTVFPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIEGTK 493
>gi|195159146|ref|XP_002020443.1| GL13510 [Drosophila persimilis]
gi|194117212|gb|EDW39255.1| GL13510 [Drosophila persimilis]
Length = 527
Score = 55.1 bits (131), Expect = 8e-06, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 2/90 (2%)
Query: 38 FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
+ + H + DY+ + A G++ +L L D++ GG+T F L L V E+G
Sbjct: 413 YGYGGHYDIHFDYFNTSTPITKARGDRMATVLFY-LNDMKHGGSTAFTDLQLKVPSERGK 471
Query: 98 AVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
+FWYN T LD R H CPV G K
Sbjct: 472 VLFWYNMRGETHDLDSRTLHGACPVINGTK 501
Score = 45.4 bits (106), Expect = 0.007, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L D++ GG+T F L L V E+G +FW+N T LD H CPV G K
Sbjct: 447 LNDMKHGGSTAFTDLQLKVPSERGKVLFWYNMRGETHDLDSRTLHGACPVINGTK 501
>gi|195341584|ref|XP_002037386.1| GM12898 [Drosophila sechellia]
gi|194131502|gb|EDW53545.1| GM12898 [Drosophila sechellia]
Length = 536
Score = 55.1 bits (131), Expect = 9e-06, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L +V+ GGAT FP LNL V +KGSA+FWHN + D +H CP+ G K
Sbjct: 447 LNNVDHGGATAFPKLNLVVPTQKGSALFWHNLDRKSYDYDTCTFHGACPLISGTK 501
Score = 52.4 bits (124), Expect = 6e-05, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
L +V+ GGAT FP LNL V +KGSA+FW+N + D +H CP+ G K
Sbjct: 447 LNNVDHGGATAFPKLNLVVPTQKGSALFWHNLDRKSYDYDTCTFHGACPLISGTK 501
>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
Length = 219
Score = 55.1 bits (131), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 39/75 (52%)
Query: 53 YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
Y + A N L+ L DVE GG T FP L L++ P KG AV++ +++ L+
Sbjct: 132 YFTSASKASKNNRISTLVMYLNDVEEGGETYFPKLGLSISPTKGMAVYFEYFYSDAELND 191
Query: 113 RMYHSGCPVALGNKW 127
R H G PV G KW
Sbjct: 192 RTLHGGAPVIKGEKW 206
Score = 51.6 bits (122), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP L L++ P KG AV++ +++ L+ H G PV G KW
Sbjct: 152 LNDVEEGGETYFPKLGLSISPTKGMAVYFEYFYSDAELNDRTLHGGAPVIKGEKW 206
>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
Length = 269
Score = 55.1 bits (131), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 48/104 (46%), Gaps = 22/104 (21%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSL------------ 87
+++H N + +H GN+W +L+ LTD E GG T+FP +
Sbjct: 134 YDSHVN-----YFFHKEGSANGGNRWA-TVLTYLTDTEEGGETVFPKIPAPGGVNVGFSE 187
Query: 88 ----NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
NL V P KG A+ +++ N L+ R H CPV G K+
Sbjct: 188 CAKYNLAVKPRKGDAILFHSMKTNGQLEERSLHGACPVIKGEKF 231
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 32/71 (45%), Gaps = 16/71 (22%)
Query: 11 LTDVELGGATIFPSL----------------NLTVFPEKGSAVFWHNAHANTLLDYWMYH 54
LTD E GG T+FP + NL V P KG A+ +H+ N L+ H
Sbjct: 161 LTDTEEGGETVFPKIPAPGGVNVGFSECAKYNLAVKPRKGDAILFHSMKTNGQLEERSLH 220
Query: 55 SGCPVALGNKW 65
CPV G K+
Sbjct: 221 GACPVIKGEKF 231
>gi|194751835|ref|XP_001958229.1| GF10816 [Drosophila ananassae]
gi|190625511|gb|EDV41035.1| GF10816 [Drosophila ananassae]
Length = 419
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/59 (37%), Positives = 36/59 (61%), Gaps = 6/59 (10%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
L +V+ GG T+FPSLN+ + P G+ + W N+ + R++H CP+ +GN W K +
Sbjct: 367 LNNVKQGGETVFPSLNIKLKPRMGTMLLWLNS------EKRVWHHQCPLIVGNMWSKFM 419
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 34/59 (57%), Gaps = 6/59 (10%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLL 69
L +V+ GG T+FPSLN+ + P G+ + W N+ ++H CP+ +GN W K +
Sbjct: 367 LNNVKQGGETVFPSLNIKLKPRMGTMLLWLNSEKR------VWHHQCPLIVGNMWSKFM 419
>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
Length = 293
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 40/73 (54%)
Query: 55 SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
S + +G + LL L DV GGAT+FP+L L V P KG AV++ ++ + +D
Sbjct: 202 SEAQMVVGGQRVSTLLIYLNDVAQGGATVFPTLGLRVLPRKGMAVYFEYSNRDGQVDPLT 261
Query: 115 YHSGCPVALGNKW 127
H G PV G KW
Sbjct: 262 LHGGEPVEKGEKW 274
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DV GGAT+FP+L L V P KG AV++ ++ + +D H G PV G KW
Sbjct: 220 LNDVAQGGATVFPTLGLRVLPRKGMAVYFEYSNRDGQVDPLTLHGGEPVEKGEKW 274
>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
caballus]
Length = 548
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 461 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPV 519
Query: 122 ALGNKW 127
+G+KW
Sbjct: 520 LVGDKW 525
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 462 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLV 521
Query: 62 GNKW 65
G+KW
Sbjct: 522 GDKW 525
>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
Length = 492
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 28/55 (50%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L VE GG T FP L V P G AVFWYN + D H CPV G+KW
Sbjct: 420 LNQVEQGGYTAFPRLGAAVEPGHGDAVFWYNLLPDGESDNNTLHGACPVLQGSKW 474
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 31/64 (48%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R + L VE GG T FP L V P G AVFW+N + D H CPV
Sbjct: 411 DRIATFMIYLNQVEQGGYTAFPRLGAAVEPGHGDAVFWYNLLPDGESDNNTLHGACPVLQ 470
Query: 62 GNKW 65
G+KW
Sbjct: 471 GSKW 474
>gi|260787668|ref|XP_002588874.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
gi|229274045|gb|EEN44885.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
Length = 151
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 36/55 (65%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DVE GGAT+F LNLTV K SAV +H+ + + H+GCPV +G+KW
Sbjct: 74 LSDVEAGGATVFTRLNLTVPAVKNSAVLFHDLKRSLEFEKDSEHAGCPVLMGSKW 128
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DVE GGAT+F LNLTV K SAV +++ + + H+GCPV +G+KW
Sbjct: 74 LSDVEAGGATVFTRLNLTVPAVKNSAVLFHDLKRSLEFEKDSEHAGCPVLMGSKW 128
>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
Length = 572
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 485 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 543
Query: 122 ALGNKW 127
+G+KW
Sbjct: 544 LVGDKW 549
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 486 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 545
Query: 62 GNKW 65
G+KW
Sbjct: 546 GDKW 549
>gi|195341582|ref|XP_002037385.1| GM12897 [Drosophila sechellia]
gi|194131501|gb|EDW53544.1| GM12897 [Drosophila sechellia]
Length = 467
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 24/59 (40%), Positives = 37/59 (62%), Gaps = 1/59 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 131
L D GGAT+FP LN+ V E+G + WYN + ++ D H+ CPV G+K+G+++
Sbjct: 403 LNDAPYGGATVFPRLNVKVPAERGKVLLWYNLNGDS-QDVTTVHAVCPVFHGSKYGEIV 460
Score = 52.0 bits (123), Expect = 7e-05, Method: Composition-based stats.
Identities = 23/59 (38%), Positives = 37/59 (62%), Gaps = 1/59 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLL 69
L D GGAT+FP LN+ V E+G + W+N + ++ D H+ CPV G+K+G+++
Sbjct: 403 LNDAPYGGATVFPRLNVKVPAERGKVLLWYNLNGDS-QDVTTVHAVCPVFHGSKYGEIV 460
>gi|195159168|ref|XP_002020454.1| GL13504 [Drosophila persimilis]
gi|194117223|gb|EDW39266.1| GL13504 [Drosophila persimilis]
Length = 491
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 36/55 (65%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++V GG T+FP+L + V P+ G+A+ +YN + + D R H GCPV G+KW
Sbjct: 419 LSNVSQGGYTVFPNLEVAVKPQAGTALVFYNLLDSLVGDTRTRHFGCPVIDGDKW 473
Score = 50.4 bits (119), Expect = 2e-04, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 38/64 (59%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L++V GG T+FP+L + V P+ G+A+ ++N + + D H GCPV
Sbjct: 410 NRLFTIQIFLSNVSQGGYTVFPNLEVAVKPQAGTALVFYNLLDSLVGDTRTRHFGCPVID 469
Query: 62 GNKW 65
G+KW
Sbjct: 470 GDKW 473
>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
melanoleuca]
Length = 539
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 452 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 510
Query: 122 ALGNKW 127
+G+KW
Sbjct: 511 LVGDKW 516
Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 453 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 512
Query: 62 GNKW 65
G+KW
Sbjct: 513 GDKW 516
>gi|47204411|emb|CAF95476.1| unnamed protein product [Tetraodon nigroviridis]
Length = 284
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +++ L+ VE GG+T F N +V K +A+FW+N H N D H+GCPV
Sbjct: 197 GNRVATVMIY-LSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPV 255
Query: 122 ALGNKW 127
+G+KW
Sbjct: 256 LIGDKW 261
Score = 53.1 bits (126), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GG+T F N +V K +A+FW N H N D H+GCPV +
Sbjct: 198 NRVATVMIYLSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLI 257
Query: 62 GNKW 65
G+KW
Sbjct: 258 GDKW 261
>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
Length = 283
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T+F L+ P+KG+A+++H +A LD HS PV G KW
Sbjct: 215 LNDVEDGGETVFSKAGLSFVPKKGAAIYFHYGNAQGQLDRLSVHSSVPVRKGEKW 269
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 41/80 (51%), Gaps = 1/80 (1%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 107
D++ H G + G L+ L DVE GG T+F L+ P+KG+A++++ +A
Sbjct: 191 FDFFPPHMADASKGGQRVGTFLIY-LNDVEDGGETVFSKAGLSFVPKKGAAIYFHYGNAQ 249
Query: 108 TLLDYRMYHSGCPVALGNKW 127
LD HS PV G KW
Sbjct: 250 GQLDRLSVHSSVPVRKGEKW 269
>gi|194765144|ref|XP_001964687.1| GF22917 [Drosophila ananassae]
gi|190614959|gb|EDV30483.1| GF22917 [Drosophila ananassae]
Length = 529
Score = 54.7 bits (130), Expect = 1e-05, Method: Composition-based stats.
Identities = 32/86 (37%), Positives = 41/86 (47%), Gaps = 2/86 (2%)
Query: 42 AHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFW 101
AH DY+ V G++ +L L D GGAT+FP LN+ V EKG +FW
Sbjct: 415 AHYAQHHDYFNMSLKSTVLQGDRIATVLFY-LNDAPHGGATVFPMLNVKVPAEKGKILFW 473
Query: 102 YNAHANTL-LDYRMYHSGCPVALGNK 126
YN T D + H CP+ G K
Sbjct: 474 YNLKGETHDFDEKTLHGACPIFHGTK 499
Score = 48.9 bits (115), Expect = 7e-04, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L D GGAT+FP LN+ V EKG +FW+N T D H CP+ G K
Sbjct: 445 LNDAPHGGATVFPMLNVKVPAEKGKILFWYNLKGETHDFDEKTLHGACPIFHGTK 499
>gi|198452400|ref|XP_002137470.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
gi|198131917|gb|EDY68028.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
Length = 348
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 43/87 (49%), Gaps = 4/87 (4%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAV 99
H+ H + YW Y G++ ++ L DVE GG T+FP L L + P KGSA+
Sbjct: 241 HDPHVDYHDLYWEYEYH---PFGDRQASVVFY-LNDVEDGGETVFPKLQLVIPPTKGSAL 296
Query: 100 FWYNAHANTLLDYRMYHSGCPVALGNK 126
W+N D R H+ CPV G K
Sbjct: 297 MWHNLRPWGEGDPRTQHASCPVLSGYK 323
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 36/68 (52%), Gaps = 2/68 (2%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKL-- 68
L DVE GG T+FP L L + P KGSA+ WHN D H+ CPV G K +
Sbjct: 270 LNDVEDGGETVFPKLQLVIPPTKGSALMWHNLRPWGEGDPRTQHASCPVLSGYKQVAIQW 329
Query: 69 LLSGLTDV 76
+L G D+
Sbjct: 330 ILQGTRDL 337
>gi|195575139|ref|XP_002105537.1| GD21537 [Drosophila simulans]
gi|194201464|gb|EDX15040.1| GD21537 [Drosophila simulans]
Length = 536
Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats.
Identities = 30/79 (37%), Positives = 43/79 (54%), Gaps = 2/79 (2%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY+ + +G++ +L L +V+ GGAT FP LNL V +KGSA+FW+N +
Sbjct: 424 DYFNSKTSYLERVGDRMATVLFY-LNNVDHGGATAFPKLNLVVPTQKGSALFWHNLDRKS 482
Query: 109 L-LDYRMYHSGCPVALGNK 126
D R H CP+ G K
Sbjct: 483 YDYDTRTSHGACPLISGTK 501
Score = 53.1 bits (126), Expect = 4e-05, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 32/55 (58%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L +V+ GGAT FP LNL V +KGSA+FWHN + D H CP+ G K
Sbjct: 447 LNNVDHGGATAFPKLNLVVPTQKGSALFWHNLDRKSYDYDTRTSHGACPLISGTK 501
>gi|347966278|ref|XP_003435891.1| AGAP013377-PA [Anopheles gambiae str. PEST]
gi|333470133|gb|EGK97522.1| AGAP013377-PA [Anopheles gambiae str. PEST]
Length = 290
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 33/55 (60%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+DV GG TIFP L + + P+KG+A FWYN + +M +S CP+ KW
Sbjct: 209 LSDVTNGGYTIFPKLRVAIRPQKGTAAFWYNLKDTGEGNVQMKYSICPLQDDQKW 263
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DV GG TIFP L + + P+KG+A FW+N + M +S CP+ KW
Sbjct: 209 LSDVTNGGYTIFPKLRVAIRPQKGTAAFWYNLKDTGEGNVQMKYSICPLQDDQKW 263
>gi|410972729|ref|XP_003992809.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Felis catus]
Length = 533
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 446 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 504
Query: 122 ALGNKW 127
+G+KW
Sbjct: 505 LVGDKW 510
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 447 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 506
Query: 62 GNKW 65
G+KW
Sbjct: 507 GDKW 510
>gi|328718393|ref|XP_001945742.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Acyrthosiphon pisum]
Length = 511
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 30/70 (42%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH-ANTLLDYRMYHS 117
V GN+ +L LTDV+ G T FP LN+ EKG+A+ W N H +N Y H
Sbjct: 423 VKFGNRLVTVLFY-LTDVQNDGYTSFPMLNIIAPAEKGAALVWNNLHMSNGQKFYETLHG 481
Query: 118 GCPVALGNKW 127
CP+ GNKW
Sbjct: 482 SCPLLKGNKW 491
Score = 52.8 bits (125), Expect = 4e-05, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAH-ANTLLDYWMYHSGCPVALGNKW 65
LTDV+ G T FP LN+ EKG+A+ W+N H +N Y H CP+ GNKW
Sbjct: 436 LTDVQNDGYTSFPMLNIIAPAEKGAALVWNNLHMSNGQKFYETLHGSCPLLKGNKW 491
>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
latipes]
Length = 517
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GG+T F N +V K +A+FW+N H N D H+GCPV
Sbjct: 430 GNRVATFMIY-LSSVEAGGSTAFIYANFSVPVLKKAAIFWWNLHRNGRGDAETLHAGCPV 488
Query: 122 ALGNKW 127
+G+KW
Sbjct: 489 LIGDKW 494
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GG+T F N +V K +A+FW N H N D H+GCPV +
Sbjct: 431 NRVATFMIYLSSVEAGGSTAFIYANFSVPVLKKAAIFWWNLHRNGRGDAETLHAGCPVLI 490
Query: 62 GNKW 65
G+KW
Sbjct: 491 GDKW 494
>gi|73988166|ref|XP_851718.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Canis lupus
familiaris]
Length = 544
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 515
Query: 122 ALGNKW 127
+G+KW
Sbjct: 516 LVGDKW 521
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 517
Query: 62 GNKW 65
G+KW
Sbjct: 518 GDKW 521
>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
Length = 477
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 391 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 449
Query: 122 ALGNKW 127
+G+KW
Sbjct: 450 LVGDKW 455
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 392 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 451
Query: 62 GNKW 65
G+KW
Sbjct: 452 GDKW 455
>gi|324510827|gb|ADY44523.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 551
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L+ +T+ E+GG T+F + +V K +A+FWYN + +D R H+ CPV
Sbjct: 441 GNRVATILVY-MTEPEIGGGTVFTEVKTSVACTKNAALFWYNLLRSGEVDMRSRHAACPV 499
Query: 122 ALGNKW 127
G KW
Sbjct: 500 LTGVKW 505
Score = 50.1 bits (118), Expect = 3e-04, Method: Composition-based stats.
Identities = 23/64 (35%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V +T+ E+GG T+F + +V K +A+FW+N + +D H+ CPV
Sbjct: 442 NRVATILVYMTEPEIGGGTVFTEVKTSVACTKNAALFWYNLLRSGEVDMRSRHAACPVLT 501
Query: 62 GNKW 65
G KW
Sbjct: 502 GVKW 505
>gi|328718391|ref|XP_003246474.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Acyrthosiphon pisum]
Length = 514
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 30/70 (42%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH-ANTLLDYRMYHS 117
V GN+ +L LTDV+ G T FP LN+ EKG+A+ W N H +N Y H
Sbjct: 426 VKFGNRLVTVLFY-LTDVQNDGYTSFPMLNIIAPAEKGAALVWNNLHMSNGQKFYETLHG 484
Query: 118 GCPVALGNKW 127
CP+ GNKW
Sbjct: 485 SCPLLKGNKW 494
Score = 52.8 bits (125), Expect = 5e-05, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAH-ANTLLDYWMYHSGCPVALGNKW 65
LTDV+ G T FP LN+ EKG+A+ W+N H +N Y H CP+ GNKW
Sbjct: 439 LTDVQNDGYTSFPMLNIIAPAEKGAALVWNNLHMSNGQKFYETLHGSCPLLKGNKW 494
>gi|417402564|gb|JAA48127.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
Length = 544
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 515
Query: 122 ALGNKW 127
+G+KW
Sbjct: 516 LVGDKW 521
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 517
Query: 62 GNKW 65
G+KW
Sbjct: 518 GDKW 521
>gi|292621357|ref|XP_691737.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Danio rerio]
Length = 538
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 25/66 (37%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +++ L+ V+ GG+T F N +V + +A+FW+N H N + H+GCPV
Sbjct: 451 GNRVATIMIY-LSPVQAGGSTAFIYANFSVPVVQNAALFWWNLHKNGQGNVDTLHAGCPV 509
Query: 122 ALGNKW 127
+GNKW
Sbjct: 510 IVGNKW 515
Score = 52.8 bits (125), Expect = 4e-05, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ V+ GG+T F N +V + +A+FW N H N + H+GCPV +
Sbjct: 452 NRVATIMIYLSPVQAGGSTAFIYANFSVPVVQNAALFWWNLHKNGQGNVDTLHAGCPVIV 511
Query: 62 GNKW 65
GNKW
Sbjct: 512 GNKW 515
>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
Length = 211
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L DVE GG T FP L +V P KG AV++ +++ L+ + H G PV
Sbjct: 136 NRISTLVMYLNDVEQGGETFFPHLKFSVSPRKGMAVYFEYFYSDQTLNDFTLHGGAPVVE 195
Query: 62 GNKW 65
G KW
Sbjct: 196 GEKW 199
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 33/59 (55%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ L DVE GG T FP L +V P KG AV++ +++ L+ H G PV G KW
Sbjct: 141 LVMYLNDVEQGGETFFPHLKFSVSPRKGMAVYFEYFYSDQTLNDFTLHGGAPVVEGEKW 199
>gi|195145080|ref|XP_002013524.1| GL24183 [Drosophila persimilis]
gi|194102467|gb|EDW24510.1| GL24183 [Drosophila persimilis]
Length = 296
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 42/87 (48%), Gaps = 1/87 (1%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAV 99
H+ H + YW G++ ++ L DVE GG T+FP L L + P KGSA+
Sbjct: 173 HDPHVDYHDLYWHPQEYEYHPFGDRQASVVFY-LNDVEDGGETVFPKLQLVIPPTKGSAL 231
Query: 100 FWYNAHANTLLDYRMYHSGCPVALGNK 126
W+N D R H+ CPV G K
Sbjct: 232 MWHNLRPWGEGDPRTQHASCPVLSGYK 258
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 36/68 (52%), Gaps = 2/68 (2%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKL-- 68
L DVE GG T+FP L L + P KGSA+ WHN D H+ CPV G K +
Sbjct: 205 LNDVEDGGETVFPKLQLVIPPTKGSALMWHNLRPWGEGDPRTQHASCPVLSGYKQVAIQW 264
Query: 69 LLSGLTDV 76
+L G D+
Sbjct: 265 ILQGTRDL 272
>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
Length = 285
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +++ L+ VE GG+T F N +V K +A+FW+N H N D H+GCPV
Sbjct: 198 GNRVATVMIY-LSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPV 256
Query: 122 ALGNKW 127
+G+KW
Sbjct: 257 LIGDKW 262
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GG+T F N +V K +A+FW N H N D H+GCPV +
Sbjct: 199 NRVATVMIYLSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLI 258
Query: 62 GNKW 65
G+KW
Sbjct: 259 GDKW 262
>gi|198449520|ref|XP_002136916.1| GA26928 [Drosophila pseudoobscura pseudoobscura]
gi|198130644|gb|EDY67474.1| GA26928 [Drosophila pseudoobscura pseudoobscura]
Length = 532
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 60 ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSG 118
A+G++ +L L DV+ GG+T F L L V E+G +FWYN T LD R H
Sbjct: 433 AMGDRMATVLFY-LNDVKHGGSTAFTDLQLKVPSERGKVLFWYNMRGETHNLDRRTVHGS 491
Query: 119 CPVALGNK 126
CPV G K
Sbjct: 492 CPVIDGTK 499
Score = 46.2 bits (108), Expect = 0.004, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L DV+ GG+T F L L V E+G +FW+N T LD H CPV G K
Sbjct: 445 LNDVKHGGSTAFTDLQLKVPSERGKVLFWYNMRGETHNLDRRTVHGSCPVIDGTK 499
>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
rubripes]
Length = 540
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GG+T F N +V K +A+FW+N H N D H+GCPV
Sbjct: 453 GNRVATFMIY-LSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGEGDADTLHAGCPV 511
Query: 122 ALGNKW 127
+G+KW
Sbjct: 512 LIGDKW 517
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GG+T F N +V K +A+FW N H N D H+GCPV +
Sbjct: 454 NRVATFMIYLSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGEGDADTLHAGCPVLI 513
Query: 62 GNKW 65
G+KW
Sbjct: 514 GDKW 517
>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
Length = 542
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 455 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 513
Query: 122 ALGNKW 127
+G+KW
Sbjct: 514 LVGDKW 519
Score = 53.1 bits (126), Expect = 4e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 456 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 515
Query: 62 GNKW 65
G+KW
Sbjct: 516 GDKW 519
>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
griseus]
Length = 509
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 422 GNRVATFMIY-LSAVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 480
Query: 122 ALGNKW 127
+G+KW
Sbjct: 481 LVGDKW 486
Score = 53.1 bits (126), Expect = 4e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 423 NRVATFMIYLSAVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 482
Query: 62 GNKW 65
G+KW
Sbjct: 483 GDKW 486
>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
Length = 542
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 455 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 513
Query: 122 ALGNKW 127
+G+KW
Sbjct: 514 LVGDKW 519
Score = 53.1 bits (126), Expect = 4e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 456 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 515
Query: 62 GNKW 65
G+KW
Sbjct: 516 GDKW 519
>gi|198449504|ref|XP_002136909.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
gi|198130636|gb|EDY67467.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
Length = 527
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 30/90 (33%), Positives = 44/90 (48%), Gaps = 2/90 (2%)
Query: 38 FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
+ + H + DY+ + A G++ +L L D++ GG+T F L L V E+G
Sbjct: 413 YGYGGHYDIHFDYFNTSTPITKARGDRMATVLFY-LNDMKHGGSTAFTDLQLKVPSERGK 471
Query: 98 AVFWYNAHANTL-LDYRMYHSGCPVALGNK 126
+FWYN T +D R H CPV G K
Sbjct: 472 VLFWYNMRGETHDVDSRTLHGACPVINGTK 501
Score = 44.3 bits (103), Expect = 0.018, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNK 64
L D++ GG+T F L L V E+G +FW+N T +D H CPV G K
Sbjct: 447 LNDMKHGGSTAFTDLQLKVPSERGKVLFWYNMRGETHDVDSRTLHGACPVINGTK 501
>gi|348555277|ref|XP_003463450.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cavia porcellus]
Length = 584
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 497 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 555
Query: 122 ALGNKW 127
+G+KW
Sbjct: 556 LVGDKW 561
Score = 53.1 bits (126), Expect = 4e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 498 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 557
Query: 62 GNKW 65
G+KW
Sbjct: 558 GDKW 561
>gi|268569334|ref|XP_002648232.1| C. briggsae CBR-PHY-3 protein [Caenorhabditis briggsae]
Length = 246
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 7/78 (8%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D+WM + N+ G L+ + E GG T+FPS+ T+ ++G A FW+NA A+
Sbjct: 127 DWWMQ------MMRNRIGTLIFV-IKTAEKGGGTVFPSIGTTIKIDEGDAFFWFNAQADE 179
Query: 109 LLDYRMYHSGCPVALGNK 126
+ H GCP+ G K
Sbjct: 180 SQEMLSDHGGCPIYSGRK 197
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 30/54 (55%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
+ E GG T+FPS+ T+ ++G A FW NA A+ + H GCP+ G K
Sbjct: 144 IKTAEKGGGTVFPSIGTTIKIDEGDAFFWFNAQADESQEMLSDHGGCPIYSGRK 197
>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
niloticus]
Length = 517
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GG+T F N +V + +A+FW+N H N D H+GCPV
Sbjct: 430 GNRVATFMIY-LSPVEAGGSTAFIYANFSVPVVEKAAIFWWNLHRNGEGDDDTLHAGCPV 488
Query: 122 ALGNKW 127
+G+KW
Sbjct: 489 LIGDKW 494
Score = 53.1 bits (126), Expect = 4e-05, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GG+T F N +V + +A+FW N H N D H+GCPV +
Sbjct: 431 NRVATFMIYLSPVEAGGSTAFIYANFSVPVVEKAAIFWWNLHRNGEGDDDTLHAGCPVLI 490
Query: 62 GNKW 65
G+KW
Sbjct: 491 GDKW 494
>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
Length = 319
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + L++ L DV GGAT FP L L V P +G+AVF+ + LD R H+G PV
Sbjct: 240 GQRMATLVIY-LNDVPAGGATAFPKLGLRVNPVQGNAVFFAYLGEDGSLDERTLHAGLPV 298
Query: 122 ALGNKW 127
G KW
Sbjct: 299 EQGEKW 304
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DV GGAT FP L L V P +G+AVF+ + LD H+G PV G KW
Sbjct: 250 LNDVPAGGATAFPKLGLRVNPVQGNAVFFAYLGEDGSLDERTLHAGLPVEQGEKW 304
>gi|195452770|ref|XP_002073492.1| GK14148 [Drosophila willistoni]
gi|194169577|gb|EDW84478.1| GK14148 [Drosophila willistoni]
Length = 444
Score = 53.9 bits (128), Expect = 2e-05, Method: Composition-based stats.
Identities = 28/57 (49%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 128
L V+ GGAT+FP LNL V EKG +FWYN +L D H CPV G K G
Sbjct: 387 LNTVKHGGATVFPLLNLKVPAEKGKVLFWYNLDGESLDFDENTEHGVCPVVDGIKLG 443
Score = 50.4 bits (119), Expect = 2e-04, Method: Composition-based stats.
Identities = 27/57 (47%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKWG 66
L V+ GGAT+FP LNL V EKG +FW+N +L D H CPV G K G
Sbjct: 387 LNTVKHGGATVFPLLNLKVPAEKGKVLFWYNLDGESLDFDENTEHGVCPVVDGIKLG 443
>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
Length = 433
Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats.
Identities = 32/97 (32%), Positives = 45/97 (46%), Gaps = 21/97 (21%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN-----------------LT 90
DY+ + G + N +L L+DV+ GG T+FPS +T
Sbjct: 265 FDYFFHEGG----MKNNRIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHACAKNGIT 320
Query: 91 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P+KG A+ ++N LD H+GCPV LG KW
Sbjct: 321 VIPKKGDAILFWNMKVGGDLDGGSTHAGCPVVLGEKW 357
Score = 51.2 bits (121), Expect = 1e-04, Method: Composition-based stats.
Identities = 29/81 (35%), Positives = 38/81 (46%), Gaps = 17/81 (20%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLN-----------------LTVFPEKGSAVFWHNAHA 44
NR + L+DV+ GG T+FPS +TV P+KG A+ + N
Sbjct: 277 NRIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHACAKNGITVIPKKGDAILFWNMKV 336
Query: 45 NTLLDYWMYHSGCPVALGNKW 65
LD H+GCPV LG KW
Sbjct: 337 GGDLDGGSTHAGCPVVLGEKW 357
>gi|241044303|ref|XP_002407179.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215492129|gb|EEC01770.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 456
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 29/54 (53%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+DV LGG+T+FP V P+ GSA FWYN + D H C V G K
Sbjct: 378 LSDVNLGGSTVFPYAKAGVMPKMGSAAFWYNMREDGSYDRATLHGACSVLHGTK 431
Score = 51.2 bits (121), Expect = 2e-04, Method: Composition-based stats.
Identities = 23/54 (42%), Positives = 29/54 (53%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
L+DV LGG+T+FP V P+ GSA FW+N + D H C V G K
Sbjct: 378 LSDVNLGGSTVFPYAKAGVMPKMGSAAFWYNMREDGSYDRATLHGACSVLHGTK 431
>gi|444731524|gb|ELW71877.1| Prolyl 4-hydroxylase subunit alpha-3 [Tupaia chinensis]
Length = 562
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + + H+GCPV
Sbjct: 475 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGNSDTLHAGCPV 533
Query: 122 ALGNKW 127
+G+KW
Sbjct: 534 LVGDKW 539
Score = 52.8 bits (125), Expect = 5e-05, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + + H+GCPV +
Sbjct: 476 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGNSDTLHAGCPVLV 535
Query: 62 GNKW 65
G+KW
Sbjct: 536 GDKW 539
>gi|323445926|gb|EGB02303.1| hypothetical protein AURANDRAFT_39521 [Aureococcus anophagefferens]
Length = 239
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 6/61 (9%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA------LGNK 126
L+DV GGAT+FP+L L+V P +G + + N +T +D R H+G P+A + NK
Sbjct: 155 LSDVGAGGATVFPALGLSVAPRRGRLLVFANVADDTTVDARTVHAGEPIAGDTEKWIANK 214
Query: 127 W 127
W
Sbjct: 215 W 215
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 36/61 (59%), Gaps = 6/61 (9%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA------LGNK 64
L+DV GGAT+FP+L L+V P +G + + N +T +D H+G P+A + NK
Sbjct: 155 LSDVGAGGATVFPALGLSVAPRRGRLLVFANVADDTTVDARTVHAGEPIAGDTEKWIANK 214
Query: 65 W 65
W
Sbjct: 215 W 215
>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
Length = 283
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 37/67 (55%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G + L+ L V+ GGAT FP L L V P KG+AVF+ + +LD H+G P
Sbjct: 202 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGMLDDNTLHAGLP 261
Query: 121 VALGNKW 127
V G KW
Sbjct: 262 VERGEKW 268
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L V+ GGAT FP L L V P KG+AVF+ + +LD H+G PV G KW
Sbjct: 214 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGMLDDNTLHAGLPVERGEKW 268
>gi|301626782|ref|XP_002942567.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Xenopus
(Silurana) tropicalis]
Length = 716
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 35/55 (63%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ V+LGG+T F N + K +AVFW+N H N L D H+GCPV +G+KW
Sbjct: 639 LSHVDLGGSTAFVFANFSSPVVKNAAVFWWNLHRNGLGDEDTLHAGCPVIIGSKW 693
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 34/55 (61%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+ V+LGG+T F N + K +AVFW N H N L D H+GCPV +G+KW
Sbjct: 639 LSHVDLGGSTAFVFANFSSPVVKNAAVFWWNLHRNGLGDEDTLHAGCPVIIGSKW 693
>gi|335294484|ref|XP_003357239.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Sus scrofa]
Length = 545
Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 458 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 516
Query: 122 ALGNKW 127
+G+KW
Sbjct: 517 LVGDKW 522
Score = 52.8 bits (125), Expect = 5e-05, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 459 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 518
Query: 62 GNKW 65
G+KW
Sbjct: 519 GDKW 522
>gi|221484253|gb|EEE22549.1| prolyl 4-hydroxylase alpha subunit, putative [Toxoplasma gondii
GT1]
Length = 967
Score = 53.1 bits (126), Expect = 3e-05, Method: Composition-based stats.
Identities = 26/59 (44%), Positives = 33/59 (55%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+L L DV GG T FP L L + P KGSAV W N + +D R+ H+G P G K+
Sbjct: 593 ILVYLNDVTHGGETEFPHLGLKLSPTKGSAVMWRNVFESNQIDPRVLHAGLPTLAGQKY 651
Score = 50.8 bits (120), Expect = 2e-04, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 31/57 (54%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L DV GG T FP L L + P KGSAV W N + +D + H+G P G K+
Sbjct: 595 VYLNDVTHGGETEFPHLGLKLSPTKGSAVMWRNVFESNQIDPRVLHAGLPTLAGQKY 651
>gi|221505765|gb|EEE31410.1| prolyl 4-hydroxylase alpha subunit, putative [Toxoplasma gondii
VEG]
Length = 918
Score = 53.1 bits (126), Expect = 3e-05, Method: Composition-based stats.
Identities = 26/59 (44%), Positives = 33/59 (55%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+L L DV GG T FP L L + P KGSAV W N + +D R+ H+G P G K+
Sbjct: 593 ILVYLNDVTHGGETEFPHLGLKLSPTKGSAVMWRNVFESNQIDPRVLHAGLPTLAGQKY 651
Score = 50.8 bits (120), Expect = 2e-04, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 31/57 (54%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L DV GG T FP L L + P KGSAV W N + +D + H+G P G K+
Sbjct: 595 VYLNDVTHGGETEFPHLGLKLSPTKGSAVMWRNVFESNQIDPRVLHAGLPTLAGQKY 651
>gi|237838357|ref|XP_002368476.1| 2OG-Fe(II) oxygenase family protein [Toxoplasma gondii ME49]
gi|211966140|gb|EEB01336.1| 2OG-Fe(II) oxygenase family protein [Toxoplasma gondii ME49]
Length = 967
Score = 53.1 bits (126), Expect = 3e-05, Method: Composition-based stats.
Identities = 26/59 (44%), Positives = 33/59 (55%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+L L DV GG T FP L L + P KGSAV W N + +D R+ H+G P G K+
Sbjct: 593 ILVYLNDVAHGGETEFPHLGLKLSPTKGSAVMWRNVFESNQIDPRVLHAGLPTLAGQKY 651
Score = 50.8 bits (120), Expect = 2e-04, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 31/57 (54%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L DV GG T FP L L + P KGSAV W N + +D + H+G P G K+
Sbjct: 595 VYLNDVAHGGETEFPHLGLKLSPTKGSAVMWRNVFESNQIDPRVLHAGLPTLAGQKY 651
>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
Length = 205
Score = 53.1 bits (126), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNA-HANTLLDYWMYHSGCPVALGNKWG 66
L DVE GGAT FP +N++V P KG V +HN T ++ H G PV G KW
Sbjct: 138 LNDVEEGGATDFPKINVSVKPNKGDVVVFHNCIEGTTEINPQALHGGSPVVAGEKWA 194
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 35/60 (58%), Gaps = 1/60 (1%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA-HANTLLDYRMYHSGCPVALGNKWG 128
L+ L DVE GGAT FP +N++V P KG V ++N T ++ + H G PV G KW
Sbjct: 135 LAYLNDVEEGGATDFPKINVSVKPNKGDVVVFHNCIEGTTEINPQALHGGSPVVAGEKWA 194
>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
harrisii]
Length = 521
Score = 53.1 bits (126), Expect = 4e-05, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GG+T F N +V K +A+FW+N H + D H+GCPV
Sbjct: 434 GNRVATFMIY-LSSVEAGGSTAFIYANFSVPVVKNAALFWWNLHRSGQGDGDTLHAGCPV 492
Query: 122 ALGNKW 127
+G+KW
Sbjct: 493 LVGDKW 498
Score = 52.4 bits (124), Expect = 6e-05, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GG+T F N +V K +A+FW N H + D H+GCPV +
Sbjct: 435 NRVATFMIYLSSVEAGGSTAFIYANFSVPVVKNAALFWWNLHRSGQGDGDTLHAGCPVLV 494
Query: 62 GNKW 65
G+KW
Sbjct: 495 GDKW 498
>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
Length = 280
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 37/65 (56%), Gaps = 1/65 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + ++L LTDVE GG T FP + L V P+KG A+F+ N + D + H+G PV
Sbjct: 205 GQRLATIILY-LTDVEEGGGTSFPGIGLDVHPQKGGALFFRNTTPYGVPDRKTQHAGLPV 263
Query: 122 ALGNK 126
G K
Sbjct: 264 EKGTK 268
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 31/54 (57%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
LTDVE GG T FP + L V P+KG A+F+ N + D H+G PV G K
Sbjct: 215 LTDVEEGGGTSFPGIGLDVHPQKGGALFFRNTTPYGVPDRKTQHAGLPVEKGTK 268
>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
Length = 548
Score = 53.1 bits (126), Expect = 4e-05, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 29/55 (52%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L V+ GG T FP L V P G AVFWYN + + D H CPV G+KW
Sbjct: 452 LNPVKQGGFTAFPRLGAAVQPGYGDAVFWYNLQPSGVGDPLTLHGACPVLRGSKW 506
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 24/57 (42%), Positives = 30/57 (52%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ L V+ GG T FP L V P G AVFW+N + + D H CPV G+KW
Sbjct: 450 IYLNPVKQGGFTAFPRLGAAVQPGYGDAVFWYNLQPSGVGDPLTLHGACPVLRGSKW 506
>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
domestica]
Length = 559
Score = 53.1 bits (126), Expect = 4e-05, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GG+T F N +V K +A+FW+N H + D H+GCPV
Sbjct: 472 GNRVATFMIY-LSSVEAGGSTAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 530
Query: 122 ALGNKW 127
+G+KW
Sbjct: 531 LVGDKW 536
Score = 52.4 bits (124), Expect = 7e-05, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GG+T F N +V K +A+FW N H + D H+GCPV +
Sbjct: 473 NRVATFMIYLSSVEAGGSTAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 532
Query: 62 GNKW 65
G+KW
Sbjct: 533 GDKW 536
>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
Length = 211
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L DVE GG T FP+L + P+KG+AV++ + + L+ HSG PV
Sbjct: 133 NRISTLVVYLNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHSGEPVIQ 192
Query: 62 GNKW 65
G KW
Sbjct: 193 GEKW 196
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 58 PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
P + N+ L++ L DVE GG T FP+L + P+KG+AV++ + + L+ HS
Sbjct: 128 PSSSNNRISTLVVY-LNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHS 186
Query: 118 GCPVALGNKW 127
G PV G KW
Sbjct: 187 GEPVIQGEKW 196
>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
Length = 220
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 32/59 (54%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+L L DVE GG T FP + + V P +G AV +YN N +D H G PV G KW
Sbjct: 145 ILMYLCDVEEGGETSFPEVGIKVKPIRGDAVLFYNCKPNGDVDPLSLHQGDPVTKGTKW 203
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 30/55 (54%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP + + V P +G AV ++N N +D H G PV G KW
Sbjct: 149 LCDVEEGGETSFPEVGIKVKPIRGDAVLFYNCKPNGDVDPLSLHQGDPVTKGTKW 203
>gi|341893248|gb|EGT49183.1| hypothetical protein CAEBREN_01919 [Caenorhabditis brenneri]
Length = 305
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 53/112 (47%), Gaps = 6/112 (5%)
Query: 20 TIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHS-----GCPVALGNKWGKLLLSGLT 74
++ P+LN + + + + H DY +Y S G GN++ L++ T
Sbjct: 154 SLIPNLNFEIAEDIIALSYKSAGHYAPHHDYLVYPSENEWDGWMRNYGNRFATLIMVFET 213
Query: 75 DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
+ GGAT+FP L +V P G A FW+NA + + H GCP+ G K
Sbjct: 214 AIS-GGATVFPQLGASVKPNPGDAFFWFNAMGDGEQEDLSEHGGCPIYAGTK 264
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 26/48 (54%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
GGAT+FP L +V P G A FW NA + + H GCP+ G K
Sbjct: 217 GGATVFPQLGASVKPNPGDAFFWFNAMGDGEQEDLSEHGGCPIYAGTK 264
>gi|195505241|ref|XP_002099419.1| GE10893 [Drosophila yakuba]
gi|194185520|gb|EDW99131.1| GE10893 [Drosophila yakuba]
Length = 508
Score = 52.8 bits (125), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/56 (41%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKW 65
L D GGAT+FP +++ V E+G +FWHN + T ++ H+ CPV G+KW
Sbjct: 428 LNDAPNGGATVFPRIDVKVTAERGKVLFWHNLNGETHDVEPNTLHAACPVFQGSKW 483
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKW 127
L D GGAT+FP +++ V E+G +FW+N + T ++ H+ CPV G+KW
Sbjct: 428 LNDAPNGGATVFPRIDVKVTAERGKVLFWHNLNGETHDVEPNTLHAACPVFQGSKW 483
>gi|328707957|ref|XP_001947811.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
pisum]
Length = 507
Score = 52.8 bits (125), Expect = 5e-05, Method: Composition-based stats.
Identities = 29/71 (40%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Query: 58 PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT-LLDYRMYH 116
P GN+ +L +TDV G T+FP+LN KGSA+ W N N + +H
Sbjct: 412 PKLNGNRLVTILFY-MTDVPDDGYTVFPNLNYVAHCRKGSALVWLNLRLNNGSVHSGTFH 470
Query: 117 SGCPVALGNKW 127
GCPV GNKW
Sbjct: 471 GGCPVIKGNKW 481
Score = 51.2 bits (121), Expect = 1e-04, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANT-LLDYWMYHSGCPVALGNKW 65
+TDV G T+FP+LN KGSA+ W N N + +H GCPV GNKW
Sbjct: 426 MTDVPDDGYTVFPNLNYVAHCRKGSALVWLNLRLNNGSVHSGTFHGGCPVIKGNKW 481
>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
Length = 483
Score = 52.8 bits (125), Expect = 5e-05, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+ CPV
Sbjct: 396 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDSDTLHAACPV 454
Query: 122 ALGNKW 127
+G+KW
Sbjct: 455 LVGDKW 460
Score = 52.0 bits (123), Expect = 9e-05, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+ CPV +
Sbjct: 397 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDSDTLHAACPVLV 456
Query: 62 GNKW 65
G+KW
Sbjct: 457 GDKW 460
>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
Length = 283
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + L+L L VE GG T FP + LT+ P +G+A+++ +A D R H+G PV
Sbjct: 204 GQRIATLILY-LNQVEEGGDTTFPDIGLTIHPRRGAALYFEYVNALGQTDPRTLHAGMPV 262
Query: 122 ALGNKW 127
G KW
Sbjct: 263 ERGEKW 268
Score = 49.3 bits (116), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L VE GG T FP + LT+ P +G+A+++ +A D H+G PV G KW
Sbjct: 214 LNQVEEGGDTTFPDIGLTIHPRRGAALYFEYVNALGQTDPRTLHAGMPVERGEKW 268
>gi|308475644|ref|XP_003100040.1| hypothetical protein CRE_20852 [Caenorhabditis remanei]
gi|308266092|gb|EFP10045.1| hypothetical protein CRE_20852 [Caenorhabditis remanei]
Length = 300
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/112 (24%), Positives = 49/112 (43%), Gaps = 3/112 (2%)
Query: 18 GATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGL---T 74
+ + P+++ ++ + + + H DY Y G N++G + + +
Sbjct: 156 ASRLIPAIDFSISEDISALSYNPGGHYAVHYDYLEYEEGSDDEFMNEFGNRMATFIMVFK 215
Query: 75 DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
GG T+FPS V + G A W+++ N +D + H GCPV G K
Sbjct: 216 KATSGGGTLFPSFGTVVRADAGDAFLWFDSKENGEMDMNVEHGGCPVYDGQK 267
Score = 42.7 bits (99), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 25/48 (52%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
GG T+FPS V + G A W ++ N +D + H GCPV G K
Sbjct: 220 GGGTLFPSFGTVVRADAGDAFLWFDSKENGEMDMNVEHGGCPVYDGQK 267
>gi|328718395|ref|XP_003246475.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
pisum]
Length = 518
Score = 52.4 bits (124), Expect = 6e-05, Method: Composition-based stats.
Identities = 29/68 (42%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH-ANTLLDYRMYHSGC 119
GN+ +L LTDV+ G T FP LN+ EKG+A+ W N H +N Y H C
Sbjct: 428 FGNRLVTVLFY-LTDVQNDGYTSFPLLNIIAPAEKGAALVWNNLHMSNGQKFYESLHGSC 486
Query: 120 PVALGNKW 127
P+ GNKW
Sbjct: 487 PLLKGNKW 494
Score = 52.0 bits (123), Expect = 7e-05, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAH-ANTLLDYWMYHSGCPVALGNKW 65
LTDV+ G T FP LN+ EKG+A+ W+N H +N Y H CP+ GNKW
Sbjct: 439 LTDVQNDGYTSFPLLNIIAPAEKGAALVWNNLHMSNGQKFYESLHGSCPLLKGNKW 494
>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
Length = 297
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 39/69 (56%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+++G + L+ L E GGAT FP + L V P KG+AV++ + LD R H+G
Sbjct: 214 LSVGGQRIATLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGALDERTLHAG 273
Query: 119 CPVALGNKW 127
PVA G KW
Sbjct: 274 LPVAFGEKW 282
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 30/51 (58%)
Query: 15 ELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
E GGAT FP + L V P KG+AV++ + LD H+G PVA G KW
Sbjct: 232 EAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGALDERTLHAGLPVAFGEKW 282
>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
Length = 219
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D+++ +S A N+ L++ L VE GG T FP LNL+V P+KG AV++ + +
Sbjct: 132 DFFVENSAA--ASNNRMSTLVMY-LNHVEEGGETFFPKLNLSVSPKKGMAVYFEYFYQDE 188
Query: 109 LLDYRMYHSGCPVALGNKW 127
++ H G PV G KW
Sbjct: 189 SINKLTLHGGAPVIKGEKW 207
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L VE GG T FP LNL+V P+KG AV++ + + ++ H G PV G KW
Sbjct: 153 LNHVEEGGETFFPKLNLSVSPKKGMAVYFEYFYQDESINKLTLHGGAPVIKGEKW 207
>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
africana]
Length = 544
Score = 52.4 bits (124), Expect = 6e-05, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N ++ K +A+FW+N H + D H+GCPV
Sbjct: 457 GNRVATFMIY-LSAVEAGGATAFIYANFSMPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 515
Query: 122 ALGNKW 127
+G+KW
Sbjct: 516 LVGDKW 521
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N ++ K +A+FW N H + D H+GCPV +
Sbjct: 458 NRVATFMIYLSAVEAGGATAFIYANFSMPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 517
Query: 62 GNKW 65
G+KW
Sbjct: 518 GDKW 521
>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
Length = 219
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D+++ +S A N+ L++ L VE GG T FP LNL+V P+KG AV++ + +
Sbjct: 132 DFFVENSAA--ASNNRMSTLVMY-LNHVEEGGETFFPKLNLSVSPKKGMAVYFEYFYQDE 188
Query: 109 LLDYRMYHSGCPVALGNKW 127
++ H G PV G KW
Sbjct: 189 SINKLTLHGGAPVIKGEKW 207
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L VE GG T FP LNL+V P+KG AV++ + + ++ H G PV G KW
Sbjct: 153 LNHVEEGGETFFPKLNLSVSPKKGMAVYFEYFYQDESINKLTLHGGAPVIKGEKW 207
>gi|341902051|gb|EGT57986.1| hypothetical protein CAEBREN_04818 [Caenorhabditis brenneri]
Length = 314
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 48/113 (42%), Gaps = 5/113 (4%)
Query: 18 GATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP----VALGNKWGKLLLSGL 73
A P++N + + H + D+ + P + GN+ ++S +
Sbjct: 171 AARHIPAMNFKYSEQVLGLSYLPGGHYSAHHDFLEFQGEAPEHAHLEEGNRIATFIIS-V 229
Query: 74 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
++GG T+FP L TV G A W+N N +D YH GCPV G K
Sbjct: 230 EKADVGGGTVFPYLKTTVRANPGDAYIWFNMLENQEIDDMSYHGGCPVISGKK 282
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 29/57 (50%), Gaps = 2/57 (3%)
Query: 8 SVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
SVE DV GG T+FP L TV G A W N N +D YH GCPV G K
Sbjct: 228 SVEKADV--GGGTVFPYLKTTVRANPGDAYIWFNMLENQEIDDMSYHGGCPVISGKK 282
>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
Length = 288
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L V+ GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 219 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKW 273
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L V+ GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 219 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKW 273
>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
Length = 454
Score = 52.4 bits (124), Expect = 6e-05, Method: Composition-based stats.
Identities = 28/66 (42%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + G L++ L GGAT FP L L V P KG+AV++ ++ LD R H+G PV
Sbjct: 245 GQRVGTLVIY-LNSPLAGGATAFPKLGLEVAPVKGNAVYFSYRKSDGALDERTLHAGLPV 303
Query: 122 ALGNKW 127
G KW
Sbjct: 304 EAGEKW 309
Score = 49.3 bits (116), Expect = 6e-04, Method: Composition-based stats.
Identities = 23/49 (46%), Positives = 29/49 (59%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GGAT FP L L V P KG+AV++ ++ LD H+G PV G KW
Sbjct: 261 GGATAFPKLGLEVAPVKGNAVYFSYRKSDGALDERTLHAGLPVEAGEKW 309
>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
fasciculatum]
Length = 244
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 33/59 (55%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+L L++VE GG T FP + + V P KG AV +YN N +D H G PV G KW
Sbjct: 146 ILMYLSEVEEGGETSFPEVGVKVKPVKGDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKW 204
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L++VE GG T FP + + V P KG AV ++N N +D H G PV G KW
Sbjct: 150 LSEVEEGGETSFPEVGVKVKPVKGDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKW 204
>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
Length = 288
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L V+ GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 219 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKW 273
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L V+ GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 219 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKW 273
>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 289
Score = 52.4 bits (124), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 36/67 (53%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G + L+ L V+ GGAT FP L L V P KG+AVF+ + LD H+G P
Sbjct: 208 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 267
Query: 121 VALGNKW 127
V G KW
Sbjct: 268 VERGEKW 274
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L V+ GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 220 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274
>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
bacterium R229]
Length = 289
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 36/67 (53%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G + L+ L V+ GGAT FP L L V P KG+AVF+ + LD H+G P
Sbjct: 208 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 267
Query: 121 VALGNKW 127
V G KW
Sbjct: 268 VERGEKW 274
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L V+ GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 220 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274
>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
PSI07]
Length = 289
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 36/67 (53%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G + L+ L V+ GGAT FP L L V P KG+AVF+ + LD H+G P
Sbjct: 208 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 267
Query: 121 VALGNKW 127
V G KW
Sbjct: 268 VERGEKW 274
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L V+ GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 220 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274
>gi|195575137|ref|XP_002105536.1| GD21536 [Drosophila simulans]
gi|194201463|gb|EDX15039.1| GD21536 [Drosophila simulans]
Length = 465
Score = 52.0 bits (123), Expect = 7e-05, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L D GGAT+FP LN+ V E+G + WYN + +T D H+ CPV G+++
Sbjct: 403 LNDAPYGGATVFPHLNVKVPAERGKVLLWYNLNGDT-QDVTTVHAACPVFHGSEY 456
Score = 49.7 bits (117), Expect = 4e-04, Method: Composition-based stats.
Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+R + L D GGAT+FP LN+ V E+G + W+N + +T D H+ CPV
Sbjct: 394 DRMDTVMFYLNDAPYGGATVFPHLNVKVPAERGKVLLWYNLNGDT-QDVTTVHAACPVFH 452
Query: 62 GNKW 65
G+++
Sbjct: 453 GSEY 456
>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
CFBP2957]
gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CFBP2957]
Length = 289
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 36/67 (53%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G + L+ L V+ GGAT FP L L V P KG+AVF+ + LD H+G P
Sbjct: 208 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 267
Query: 121 VALGNKW 127
V G KW
Sbjct: 268 VERGEKW 274
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L V+ GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 220 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274
>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum IPO1609]
Length = 280
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 36/67 (53%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G + L+ L V+ GGAT FP L L V P KG+AVF+ + LD H+G P
Sbjct: 199 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 258
Query: 121 VALGNKW 127
V G KW
Sbjct: 259 VERGEKW 265
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L V+ GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 211 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 265
>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
Length = 288
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 36/67 (53%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G + L+ L V+ GGAT FP L L V P KG+AVF+ + LD H+G P
Sbjct: 207 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 266
Query: 121 VALGNKW 127
V G KW
Sbjct: 267 VERGEKW 273
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L V+ GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 219 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 273
>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CMR15]
Length = 289
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 36/67 (53%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G + L+ L V GGAT FP L L V P KG+AVF+ + LD + H+G P
Sbjct: 208 VGGQRVATLVIYLNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDKTLHAGLP 267
Query: 121 VALGNKW 127
V G KW
Sbjct: 268 VERGEKW 274
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 30/55 (54%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L V GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 220 LNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDKTLHAGLPVERGEKW 274
>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
Length = 289
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 36/67 (53%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G + L+ L V+ GGAT FP L L V P KG+AVF+ + LD H+G P
Sbjct: 208 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 267
Query: 121 VALGNKW 127
V G KW
Sbjct: 268 VERGEKW 274
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L V+ GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 220 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274
>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
Length = 292
Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 36/67 (53%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G + L+ L V+ GGAT FP L L V P KG+AVF+ + LD H+G P
Sbjct: 211 VGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 270
Query: 121 VALGNKW 127
V G KW
Sbjct: 271 VERGEKW 277
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L V+ GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 223 LNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 277
>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 331
Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G++ +LL L +VE GGAT FP+ V P+KG A+++ A+ +D H GC V
Sbjct: 247 GHRIATVLLY-LNEVEAGGATFFPNPGFEVRPQKGGALYFAYQQADGSMDPTSLHEGCAV 305
Query: 122 ALGNKW 127
G KW
Sbjct: 306 TQGEKW 311
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +VE GGAT FP+ V P+KG A+++ A+ +D H GC V G KW
Sbjct: 257 LNEVEAGGATFFPNPGFEVRPQKGGALYFAYQQADGSMDPTSLHEGCAVTQGEKW 311
>gi|260806889|ref|XP_002598316.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
gi|229283588|gb|EEN54328.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
Length = 531
Score = 52.0 bits (123), Expect = 9e-05, Method: Composition-based stats.
Identities = 27/66 (40%), Positives = 42/66 (63%), Gaps = 3/66 (4%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ LL L+DV+ GGAT+FP +++ V P +AVFW + ++ + H+GCPV
Sbjct: 450 GNRVVTFLLY-LSDVDAGGATVFPIVDVAV-PINSAAVFWSMERSGAVVPNSL-HAGCPV 506
Query: 122 ALGNKW 127
+G+KW
Sbjct: 507 LIGSKW 512
Score = 51.2 bits (121), Expect = 1e-04, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 40/64 (62%), Gaps = 2/64 (3%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+DV+ GGAT+FP +++ V P +AVFW + ++ + H+GCPV +
Sbjct: 451 NRVVTFLLYLSDVDAGGATVFPIVDVAV-PINSAAVFWSMERSGAVVPNSL-HAGCPVLI 508
Query: 62 GNKW 65
G+KW
Sbjct: 509 GSKW 512
>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
Length = 297
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 39/69 (56%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+++G + L+ L E GGAT FP + L V P KG+AV++ + LD R H+G
Sbjct: 214 LSVGGQRIATLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDERTLHAG 273
Query: 119 CPVALGNKW 127
PVA G KW
Sbjct: 274 LPVASGEKW 282
Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L E GGAT FP + L V P KG+AV++ + LD H+G PVA G KW
Sbjct: 228 LNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDERTLHAGLPVASGEKW 282
>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
Length = 294
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 42/83 (50%), Gaps = 7/83 (8%)
Query: 49 DYWMY-HSGCPVAL---GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 104
DY+ G P L G + G L++ L + E GGATIFP + L V P +G+AVF+
Sbjct: 203 DYFAPGEPGTPTILKRGGQRVGTLVIY-LNEPERGGATIFPEVPLQVVPRRGNAVFFSYE 261
Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
+ R H G PV G KW
Sbjct: 262 RPDP--STRTLHGGAPVLAGEKW 282
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 29/55 (52%), Gaps = 2/55 (3%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L + E GGATIFP + L V P +G+AVF+ + H G PV G KW
Sbjct: 230 LNEPERGGATIFPEVPLQVVPRRGNAVFFSYERPDP--STRTLHGGAPVLAGEKW 282
>gi|284035817|ref|YP_003385747.1| 2OG-Fe(II) oxygenase [Spirosoma linguale DSM 74]
gi|283815110|gb|ADB36948.1| 2OG-Fe(II) oxygenase [Spirosoma linguale DSM 74]
Length = 328
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 36/60 (60%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
+L L D +GG T FP LN+ V P++GSA+++ N N LL H+G P+A G K+
Sbjct: 258 MLVYLNDDFVGGETYFPELNMNVHPKRGSALYFLNRDDNNLLLLNSVHAGLPIAQGMKYA 317
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 35/58 (60%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
V L D +GG T FP LN+ V P++GSA+++ N N LL H+G P+A G K+
Sbjct: 260 VYLNDDFVGGETYFPELNMNVHPKRGSALYFLNRDDNNLLLLNSVHAGLPIAQGMKYA 317
>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
Length = 294
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 42/83 (50%), Gaps = 7/83 (8%)
Query: 49 DYWMY-HSGCPVAL---GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 104
DY+ G P L G + G L++ L + E GGATIFP + L V P +G+AVF+
Sbjct: 203 DYFAPGEPGTPTILKRGGQRVGTLVIY-LNEPERGGATIFPEVPLQVVPRRGNAVFFSYE 261
Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
+ R H G PV G KW
Sbjct: 262 RPDP--STRTLHGGAPVLAGEKW 282
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 29/55 (52%), Gaps = 2/55 (3%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L + E GGATIFP + L V P +G+AVF+ + H G PV G KW
Sbjct: 230 LNEPERGGATIFPEVPLQVVPRRGNAVFFSYERPDP--STRTLHGGAPVLAGEKW 282
>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
Length = 211
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L DVE GG T FP+L + P+KG+AV++ + + L+ HSG PV
Sbjct: 133 NRICTLVVYLNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHSGEPVIQ 192
Query: 62 GNKW 65
G KW
Sbjct: 193 GEKW 196
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 58 PVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
P + N+ L++ L DVE GG T FP+L + P+KG+AV++ + + L+ HS
Sbjct: 128 PSSSNNRICTLVVY-LNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHS 186
Query: 118 GCPVALGNKW 127
G PV G KW
Sbjct: 187 GEPVIQGEKW 196
>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
19424]
gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
taiwanensis LMG 19424]
Length = 296
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 39/69 (56%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+++G + L+ L E GGAT FP + L V P KG+AV++ + LD R H+G
Sbjct: 213 LSVGGQRIATLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDDRTLHAG 272
Query: 119 CPVALGNKW 127
PVA G KW
Sbjct: 273 LPVAAGEKW 281
Score = 48.9 bits (115), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 30/51 (58%)
Query: 15 ELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
E GGAT FP + L V P KG+AV++ + LD H+G PVA G KW
Sbjct: 231 EAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDDRTLHAGLPVAAGEKW 281
>gi|341902054|gb|EGT57989.1| hypothetical protein CAEBREN_08644 [Caenorhabditis brenneri]
Length = 314
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 49/113 (43%), Gaps = 5/113 (4%)
Query: 18 GATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP----VALGNKWGKLLLSGL 73
A + PSLN + + H + DY + +A GN+ +++ +
Sbjct: 171 AARLIPSLNFKLSEPIVGLSYMPGGHYSAHHDYLDFDDEAAKQEHLANGNRMVTFIIA-V 229
Query: 74 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
++GG T+FP + TV G A W+N N +D +H GCPV G K
Sbjct: 230 EKADVGGGTVFPEIQTTVRANPGDAYLWFNMLENQEMDDMSFHGGCPVLAGKK 282
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 30/63 (47%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + + ++GG T+FP + TV G A W N N +D +H GCPV
Sbjct: 220 NRMVTFIIAVEKADVGGGTVFPEIQTTVRANPGDAYLWFNMLENQEMDDMSFHGGCPVLA 279
Query: 62 GNK 64
G K
Sbjct: 280 GKK 282
>gi|341896531|gb|EGT52466.1| hypothetical protein CAEBREN_02034 [Caenorhabditis brenneri]
Length = 317
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 1/65 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +++ E+GG T+FP L TV E G A+ W+N N ++ + H+GCPV
Sbjct: 222 GNRMTTVIMV-FQKAEVGGGTMFPLLKTTVRAEPGDAIVWFNMRKNGEMEPLVEHAGCPV 280
Query: 122 ALGNK 126
G K
Sbjct: 281 RSGRK 285
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 32/63 (50%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + E+GG T+FP L TV E G A+ W N N ++ + H+GCPV
Sbjct: 223 NRMTTVIMVFQKAEVGGGTMFPLLKTTVRAEPGDAIVWFNMRKNGEMEPLVEHAGCPVRS 282
Query: 62 GNK 64
G K
Sbjct: 283 GRK 285
>gi|321466507|gb|EFX77502.1| hypothetical protein DAPPUDRAFT_25542 [Daphnia pulex]
Length = 92
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 37/60 (61%), Gaps = 1/60 (1%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA-NTLLDYRMYHSGCPVALGNKW 127
L+ L +VE GGAT+FP + V P KGSA+FWYN + N + H+ CP+ G+KW
Sbjct: 33 LIIYLNEVENGGATVFPIVKTRVKPVKGSALFWYNLNPDNGEGNPTTLHASCPILSGSKW 92
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 36/58 (62%), Gaps = 1/58 (1%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHA-NTLLDYWMYHSGCPVALGNKW 65
+ L +VE GGAT+FP + V P KGSA+FW+N + N + H+ CP+ G+KW
Sbjct: 35 IYLNEVENGGATVFPIVKTRVKPVKGSALFWYNLNPDNGEGNPTTLHASCPILSGSKW 92
>gi|428671901|gb|EKX72816.1| conserved hypothetical protein [Babesia equi]
Length = 234
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DVE GG T+FP+L L + P SAVFW N + +D RM H+G +G K+
Sbjct: 166 LNDVE-GGETVFPNLGLAIKPVGNSAVFWRNLNGENEMDERMIHAGTTPKVGTKY 219
Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 32/55 (58%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T+FP+L L + P SAVFW N + +D M H+G +G K+
Sbjct: 166 LNDVE-GGETVFPNLGLAIKPVGNSAVFWRNLNGENEMDERMIHAGTTPKVGTKY 219
>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 226
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 15/80 (18%)
Query: 2 NRAEMSSV--ELTDVELGGATIFPSLN-------------LTVFPEKGSAVFWHNAHANT 46
+R M++V L+DVE GG T+FP N L V PEKG + +++ +
Sbjct: 132 HRNRMATVFWYLSDVEKGGETVFPRFNGAQETSMKDCKTGLKVKPEKGKVIIFYSMTPDG 191
Query: 47 LLDYWMYHSGCPVALGNKWG 66
LD + H CPV G KW
Sbjct: 192 ALDEYSLHGACPVQKGTKWA 211
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 32/69 (46%), Gaps = 13/69 (18%)
Query: 73 LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
L+DVE GG T+FP N L V PEKG + +Y+ + LD H C
Sbjct: 143 LSDVEKGGETVFPRFNGAQETSMKDCKTGLKVKPEKGKVIIFYSMTPDGALDEYSLHGAC 202
Query: 120 PVALGNKWG 128
PV G KW
Sbjct: 203 PVQKGTKWA 211
>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 296
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 36/69 (52%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+A + L++ L D GG T+FP L L V P +G+A ++ N +D R H+
Sbjct: 209 IARSGQRVSTLVTYLNDAPEGGQTVFPQLGLAVSPIRGNACYFEYCDGNGRVDARSLHAS 268
Query: 119 CPVALGNKW 127
PV G+KW
Sbjct: 269 APVTRGDKW 277
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 29/55 (52%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L D GG T+FP L L V P +G+A ++ N +D H+ PV G+KW
Sbjct: 223 LNDAPEGGQTVFPQLGLAVSPIRGNACYFEYCDGNGRVDARSLHASAPVTRGDKW 277
>gi|328718387|ref|XP_001952104.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
pisum]
Length = 293
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 39/70 (55%), Gaps = 2/70 (2%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH-ANTLLDYRMYHS 117
V GN+ +L LTDV+ G T FP LN+ +KG+A+ W N H +N L Y H
Sbjct: 205 VEFGNRLVTVLFY-LTDVQNDGYTSFPLLNINAPVDKGAALVWNNLHMSNGQLFYESLHG 263
Query: 118 GCPVALGNKW 127
CP+ GNKW
Sbjct: 264 SCPLLKGNKW 273
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAH-ANTLLDYWMYHSGCPVALGNKW 65
LTDV+ G T FP LN+ +KG+A+ W+N H +N L Y H CP+ GNKW
Sbjct: 218 LTDVQNDGYTSFPLLNINAPVDKGAALVWNNLHMSNGQLFYESLHGSCPLLKGNKW 273
>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 267
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 40/79 (50%), Gaps = 14/79 (17%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA-------------HANT 108
GN+ +L+ L DV GGAT F ++ L V P G+A+FWYN +
Sbjct: 170 GNRLATMLIY-LADVAGGGATAFINMRLAVKPTLGTALFWYNLKPYDGPIVNESFWNQRR 228
Query: 109 LLDYRMYHSGCPVALGNKW 127
D R +H GCPV G+KW
Sbjct: 229 FGDPRTFHMGCPVLTGSKW 247
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 36/77 (46%), Gaps = 13/77 (16%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNA-------------HANTLL 48
NR + L DV GGAT F ++ L V P G+A+FW+N +
Sbjct: 171 NRLATMLIYLADVAGGGATAFINMRLAVKPTLGTALFWYNLKPYDGPIVNESFWNQRRFG 230
Query: 49 DYWMYHSGCPVALGNKW 65
D +H GCPV G+KW
Sbjct: 231 DPRTFHMGCPVLTGSKW 247
>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
Length = 297
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 47/106 (44%), Gaps = 24/106 (22%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
++AH DY +H + LG + +L LTDV+ GG T+FP+
Sbjct: 133 YDAH----FDY--FHDRNNLKLGGQRVATVLMYLTDVKKGGETVFPNAEGSHLQYKDETW 186
Query: 89 -------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L V P+KG A+ ++N H N D H CPV G KW
Sbjct: 187 SECSRSGLAVKPKKGDALLFFNLHVNATADTGSLHGSCPVIEGEKW 232
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 32/73 (43%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
LTDV+ GG T+FP+ L V P+KG A+ + N H N D
Sbjct: 160 LTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNATADTGS 219
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 220 LHGSCPVIEGEKW 232
>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
aries]
Length = 514
Score = 51.2 bits (121), Expect = 1e-04, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+ CPV
Sbjct: 427 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPV 485
Query: 122 ALGNKW 127
+G+KW
Sbjct: 486 LVGDKW 491
Score = 50.4 bits (119), Expect = 2e-04, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+ CPV +
Sbjct: 428 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLV 487
Query: 62 GNKW 65
G+KW
Sbjct: 488 GDKW 491
>gi|339236271|ref|XP_003379690.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
gi|316977627|gb|EFV60702.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
Length = 558
Score = 51.2 bits (121), Expect = 1e-04, Method: Composition-based stats.
Identities = 25/66 (37%), Positives = 33/66 (50%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++ E GG T+F S +TV P K A FW+N D H+ CPV
Sbjct: 469 GNRIATVLFY-ISQPEAGGGTVFTSHKITVEPSKYDAAFWFNVLQGGEPDMSTRHAACPV 527
Query: 122 ALGNKW 127
G KW
Sbjct: 528 LAGTKW 533
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 29/64 (45%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++ E GG T+F S +TV P K A FW N D H+ CPV
Sbjct: 470 NRIATVLFYISQPEAGGGTVFTSHKITVEPSKYDAAFWFNVLQGGEPDMSTRHAACPVLA 529
Query: 62 GNKW 65
G KW
Sbjct: 530 GTKW 533
>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
Length = 544
Score = 51.2 bits (121), Expect = 1e-04, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+ CPV
Sbjct: 457 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPV 515
Query: 122 ALGNKW 127
+G+KW
Sbjct: 516 LVGDKW 521
Score = 50.4 bits (119), Expect = 2e-04, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+ CPV +
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLV 517
Query: 62 GNKW 65
G+KW
Sbjct: 518 GDKW 521
>gi|407682954|ref|YP_006798128.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
'English Channel 673']
gi|407244565|gb|AFT73751.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
'English Channel 673']
Length = 376
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 29/48 (60%)
Query: 80 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
GAT FP L +T++PEKG V + N N L+ + YH G PV+ KW
Sbjct: 309 GATHFPKLGITIYPEKGKLVSFLNVDKNLALEKQSYHCGQPVSTNEKW 356
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 28/48 (58%)
Query: 18 GATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GAT FP L +T++PEKG V + N N L+ YH G PV+ KW
Sbjct: 309 GATHFPKLGITIYPEKGKLVSFLNVDKNLALEKQSYHCGQPVSTNEKW 356
>gi|406596009|ref|YP_006747139.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii ATCC
27126]
gi|406373330|gb|AFS36585.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii ATCC
27126]
Length = 376
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 29/48 (60%)
Query: 80 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
GAT FP L +T++PEKG V + N N L+ + YH G PV+ KW
Sbjct: 309 GATHFPKLGITIYPEKGKLVSFLNVDKNLALEKQSYHCGQPVSTNEKW 356
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 28/48 (58%)
Query: 18 GATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GAT FP L +T++PEKG V + N N L+ YH G PV+ KW
Sbjct: 309 GATHFPKLGITIYPEKGKLVSFLNVDKNLALEKQSYHCGQPVSTNEKW 356
>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
Length = 478
Score = 50.8 bits (120), Expect = 2e-04, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+ CPV
Sbjct: 391 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPV 449
Query: 122 ALGNKW 127
+G+KW
Sbjct: 450 LVGDKW 455
Score = 50.1 bits (118), Expect = 3e-04, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 35/64 (54%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+ CPV +
Sbjct: 392 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLV 451
Query: 62 GNKW 65
G+KW
Sbjct: 452 GDKW 455
>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
Length = 282
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 36/67 (53%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G + L+ L + GGAT FP + L V P KG+AV++ + LD R H+G P
Sbjct: 201 VGGQRTATLVIYLNTPQAGGATAFPRIGLEVAPVKGNAVYFSYLQPDGKLDERTLHAGLP 260
Query: 121 VALGNKW 127
V G KW
Sbjct: 261 VQSGEKW 267
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 30/55 (54%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L + GGAT FP + L V P KG+AV++ + LD H+G PV G KW
Sbjct: 213 LNTPQAGGATAFPRIGLEVAPVKGNAVYFSYLQPDGKLDERTLHAGLPVQSGEKW 267
>gi|281353153|gb|EFB28737.1| hypothetical protein PANDA_003344 [Ailuropoda melanoleuca]
Length = 456
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 392 GNRVATFMIY-LSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 450
Query: 122 ALGNKW 127
+G+KW
Sbjct: 451 LVGDKW 456
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 393 NRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 452
Query: 62 GNKW 65
G+KW
Sbjct: 453 GDKW 456
>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 299
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 56/138 (40%), Gaps = 26/138 (18%)
Query: 8 SVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGK 67
S+E + ++ AT+ P N F V +N Y ++ P G + +
Sbjct: 158 SLEQVEEKMAKATMIPRENGEAF-----NVLKYNVGQKYDCHYDVFD---PAEYGPQPSQ 209
Query: 68 LLLS---GLTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTL 109
+ S L+DVE GG T+FP N + V P +G A+ +Y+ H N
Sbjct: 210 RMASFLLYLSDVEEGGETMFPFENFQNMNIGFDYKKCIGMKVKPRQGDALLFYSMHPNGT 269
Query: 110 LDYRMYHSGCPVALGNKW 127
D H CPV G KW
Sbjct: 270 FDKSALHGSCPVIKGEKW 287
>gi|242003035|ref|XP_002436120.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215499456|gb|EEC08950.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 173
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 36/67 (53%), Gaps = 12/67 (17%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYN---AHANTLL---------DYRMYHSGCP 120
L+ VE GGAT+F L + V PE G A+FW+N N+L D R H CP
Sbjct: 77 LSSVERGGATLFKRLRVRVQPEAGMALFWHNLPPGSTNSLPSCCVHRSVGDERTEHGACP 136
Query: 121 VALGNKW 127
V +G+KW
Sbjct: 137 VLVGSKW 143
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 36/69 (52%), Gaps = 12/69 (17%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHN---AHANTLL---------DYWMYHSG 56
V L+ VE GGAT+F L + V PE G A+FWHN N+L D H
Sbjct: 75 VYLSSVERGGATLFKRLRVRVQPEAGMALFWHNLPPGSTNSLPSCCVHRSVGDERTEHGA 134
Query: 57 CPVALGNKW 65
CPV +G+KW
Sbjct: 135 CPVLVGSKW 143
>gi|341893180|gb|EGT49115.1| CBN-PHY-4 protein [Caenorhabditis brenneri]
Length = 282
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L+ L + GG T+FP +NL + P+ G + W N ++ D + H+ CP+
Sbjct: 180 GNRIATVLVI-LQIAKKGGTTVFPKINLNIRPKAGDVIVWLNTLSSGESDPQTLHAACPI 238
Query: 122 ALGNKWGKLL 131
GNK G L
Sbjct: 239 KEGNKIGATL 248
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 34/68 (50%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L + GG T+FP +NL + P+ G + W N ++ D H+ CP+
Sbjct: 181 NRIATVLVILQIAKKGGTTVFPKINLNIRPKAGDVIVWLNTLSSGESDPQTLHAACPIKE 240
Query: 62 GNKWGKLL 69
GNK G L
Sbjct: 241 GNKIGATL 248
>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Ralstonia solanacearum GMI1000]
Length = 289
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 35/67 (52%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+G + L+ L V GGAT FP L L V P KG+AVF+ + LD H+G P
Sbjct: 208 VGGQRVATLVIYLNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLP 267
Query: 121 VALGNKW 127
V G KW
Sbjct: 268 VERGEKW 274
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 30/55 (54%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L V GGAT FP L L V P KG+AVF+ + LD H+G PV G KW
Sbjct: 220 LNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274
>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 303
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 39/90 (43%), Gaps = 21/90 (23%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLN---------------------LTVFPEKGS 97
+A G +L L+DVE GG T+FPS + V P KG
Sbjct: 152 IARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGD 211
Query: 98 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
A+ +++ H N + D H GCPV G KW
Sbjct: 212 ALLFFSLHPNAIPDTSSLHGGCPVIEGEKW 241
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 33/76 (43%), Gaps = 21/76 (27%)
Query: 11 LTDVELGGATIFPSLN---------------------LTVFPEKGSAVFWHNAHANTLLD 49
L+DVE GG T+FPS + V P KG A+ + + H N + D
Sbjct: 166 LSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAIPD 225
Query: 50 YWMYHSGCPVALGNKW 65
H GCPV G KW
Sbjct: 226 TSSLHGGCPVIEGEKW 241
>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
Length = 311
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 45/103 (43%), Gaps = 28/103 (27%)
Query: 53 YHSGCPVALGNKWGKLLLSGLTDVELGGATIFP------------------------SLN 88
+H V G + +L L+DVE GG T+FP S N
Sbjct: 173 FHDEHNVKNGGQRVATILMYLSDVEEGGETVFPLGTPLGGRDPEKSGVTGDNACELASQN 232
Query: 89 ----LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L V P +G A+ ++NAH + +D + H+GCPV G KW
Sbjct: 233 DPRVLAVKPRRGDALLFFNAHLSGEMDEKANHAGCPVNRGTKW 275
Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 28/83 (33%)
Query: 11 LTDVELGGATIFP------------------------SLN----LTVFPEKGSAVFWHNA 42
L+DVE GG T+FP S N L V P +G A+ + NA
Sbjct: 193 LSDVEEGGETVFPLGTPLGGRDPEKSGVTGDNACELASQNDPRVLAVKPRRGDALLFFNA 252
Query: 43 HANTLLDYWMYHSGCPVALGNKW 65
H + +D H+GCPV G KW
Sbjct: 253 HLSGEMDEKANHAGCPVNRGTKW 275
>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
Length = 273
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 44/95 (46%), Gaps = 18/95 (18%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSL----------------NLTVF 92
DY+ + G GN+W +LL LT+ E GG T+FP + NL V
Sbjct: 139 DYFFHKDGSSNG-GNRWATVLLY-LTETEEGGETVFPKIPAPNGINVGFSECAKYNLAVK 196
Query: 93 PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
P KG A+ +++ L+ R H CPV G K+
Sbjct: 197 PHKGDALLFHSMKPTGELEERSMHGACPVIRGEKF 231
Score = 41.6 bits (96), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 31/71 (43%), Gaps = 16/71 (22%)
Query: 11 LTDVELGGATIFPSL----------------NLTVFPEKGSAVFWHNAHANTLLDYWMYH 54
LT+ E GG T+FP + NL V P KG A+ +H+ L+ H
Sbjct: 161 LTETEEGGETVFPKIPAPNGINVGFSECAKYNLAVKPHKGDALLFHSMKPTGELEERSMH 220
Query: 55 SGCPVALGNKW 65
CPV G K+
Sbjct: 221 GACPVIRGEKF 231
>gi|301104296|ref|XP_002901233.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101167|gb|EEY59219.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 535
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 19/118 (16%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLL 70
L+DV GG T+FP L PE + H N+ DY A+G +L
Sbjct: 321 LSDVPRGGQTVFP---LAEMPEGLPTEYQHPP--NSAQDYE--------AIG---AELFE 364
Query: 71 SGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
G ++++ S L +P KG AV +Y+ N LD + H GCPV G KWG
Sbjct: 365 PGSWEMDMVRKC---STKLASYPSKGGAVLFYSQKPNGELDPKSLHGGCPVLEGTKWG 419
>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
Length = 287
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 40/79 (50%), Gaps = 10/79 (12%)
Query: 54 HSGCPVAL---GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANT 108
G P L G + G L++ L E GG T FP +NL V P KG+AVF + AH +T
Sbjct: 202 QPGTPTILKRGGQRVGTLVMY-LNTPERGGGTTFPDVNLEVAPIKGNAVFFSYERAHPST 260
Query: 109 LLDYRMYHSGCPVALGNKW 127
R H G PV G KW
Sbjct: 261 ----RSLHGGAPVLAGEKW 275
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 30/57 (52%), Gaps = 6/57 (10%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
L E GG T FP +NL V P KG+AVF + AH +T H G PV G KW
Sbjct: 223 LNTPERGGGTTFPDVNLEVAPIKGNAVFFSYERAHPSTR----SLHGGAPVLAGEKW 275
>gi|255085194|ref|XP_002505028.1| predicted protein [Micromonas sp. RCC299]
gi|226520297|gb|ACO66286.1| predicted protein [Micromonas sp. RCC299]
Length = 439
Score = 50.4 bits (119), Expect = 2e-04, Method: Composition-based stats.
Identities = 23/59 (38%), Positives = 31/59 (52%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L+ L + E GG T FP L P+ G A+ W+N N D++ H+G PV G KW
Sbjct: 342 LAYLVEPERGGRTYFPKLRAGFEPKVGDALLWWNVDENGAEDFKTLHAGEPVEAGAKWA 400
Score = 45.8 bits (107), Expect = 0.005, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 29/64 (45%)
Query: 3 RAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG 62
RA L + E GG T FP L P+ G A+ W N N D+ H+G PV G
Sbjct: 337 RAVTCLAYLVEPERGGRTYFPKLRAGFEPKVGDALLWWNVDENGAEDFKTLHAGEPVEAG 396
Query: 63 NKWG 66
KW
Sbjct: 397 AKWA 400
>gi|363729586|ref|XP_417248.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Gallus gallus]
Length = 542
Score = 50.4 bits (119), Expect = 2e-04, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +++ L+ VE GG+T F N +V K +A+FW+N N D H+GCPV
Sbjct: 455 GNRIATVMIY-LSAVEAGGSTAFIYANFSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPV 513
Query: 122 ALGNKW 127
G+KW
Sbjct: 514 LAGDKW 519
Score = 49.3 bits (116), Expect = 5e-04, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 34/64 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GG+T F N +V K +A+FW N N D H+GCPV
Sbjct: 456 NRIATVMIYLSAVEAGGSTAFIYANFSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLA 515
Query: 62 GNKW 65
G+KW
Sbjct: 516 GDKW 519
>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
Length = 297
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 42/98 (42%), Gaps = 20/98 (20%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------L 89
DY +H + LG + +L LTDV GG T+FP+ L
Sbjct: 137 FDY--FHDRNNLKLGGQRVATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGL 194
Query: 90 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P+KG A+ ++N H N D H CPV G KW
Sbjct: 195 AVKPKKGDALLFFNLHVNATADTGSLHGSCPVIEGEKW 232
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 31/73 (42%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
LTDV GG T+FP+ L V P+KG A+ + N H N D
Sbjct: 160 LTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNATADTGS 219
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 220 LHGSCPVIEGEKW 232
>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
Length = 285
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 35/69 (50%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
A G + L+ L+DVE GG T+FP L V +G A+++ + LD H G
Sbjct: 205 TARGGQRVATLIVYLSDVEGGGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGG 264
Query: 119 CPVALGNKW 127
PV G+KW
Sbjct: 265 APVTSGDKW 273
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 31/57 (54%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L+DVE GG T+FP L V +G A+++ + LD H G PV G+KW
Sbjct: 217 VYLSDVEGGGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKW 273
>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 58/144 (40%), Gaps = 28/144 (19%)
Query: 4 AEMSSVELTDVE--LGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
+E + L +VE + AT+ P N F V +N Y ++ P
Sbjct: 137 SEDKTQSLAEVEEKMARATMIPRQNGEAF-----NVLRYNPGQKYDCHYDVFD---PAEY 188
Query: 62 GNKWGKLLLS---GLTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYN 103
G + + + S L+DVE GG T+FP N L V P +G A+ +Y+
Sbjct: 189 GPQPSQRMASFLLYLSDVEEGGETMFPFENFQNMNTGYNYKDCIGLKVKPRQGDALLFYS 248
Query: 104 AHANTLLDYRMYHSGCPVALGNKW 127
H N D H CPV G KW
Sbjct: 249 MHPNGTFDKTALHGSCPVIKGEKW 272
>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
Length = 282
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 35/69 (50%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
A G + L+ L+DVE GG T+FP L V +G A+++ + LD H G
Sbjct: 202 TARGGQRVATLIVYLSDVEGGGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGG 261
Query: 119 CPVALGNKW 127
PV G+KW
Sbjct: 262 APVTSGDKW 270
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 31/57 (54%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L+DVE GG T+FP L V +G A+++ + LD H G PV G+KW
Sbjct: 214 VYLSDVEGGGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKW 270
>gi|326914688|ref|XP_003203656.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Meleagris
gallopavo]
Length = 539
Score = 50.1 bits (118), Expect = 3e-04, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +++ L+ VE GG+T F N +V K +A+FW+N N D H+GCPV
Sbjct: 452 GNRIATVMIY-LSAVEAGGSTAFIYANFSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPV 510
Query: 122 ALGNKW 127
G+KW
Sbjct: 511 LAGDKW 516
Score = 48.9 bits (115), Expect = 7e-04, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 34/64 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GG+T F N +V K +A+FW N N D H+GCPV
Sbjct: 453 NRIATVMIYLSAVEAGGSTAFIYANFSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLA 512
Query: 62 GNKW 65
G+KW
Sbjct: 513 GDKW 516
>gi|308467521|ref|XP_003096008.1| CRE-PHY-4 protein [Caenorhabditis remanei]
gi|308244157|gb|EFO88109.1| CRE-PHY-4 protein [Caenorhabditis remanei]
Length = 198
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 1/72 (1%)
Query: 60 ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
A GN+ +L+ L + GG T+FP +NL + P+ G V W N + D + H+ C
Sbjct: 101 ARGNRIATVLVI-LQIAKKGGTTVFPKINLNIRPKAGDVVVWLNTLPSGESDSQTLHAAC 159
Query: 120 PVALGNKWGKLL 131
P+ G K G L
Sbjct: 160 PIKEGTKIGATL 171
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 32/68 (47%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L + GG T+FP +NL + P+ G V W N + D H+ CP+
Sbjct: 104 NRIATVLVILQIAKKGGTTVFPKINLNIRPKAGDVVVWLNTLPSGESDSQTLHAACPIKE 163
Query: 62 GNKWGKLL 69
G K G L
Sbjct: 164 GTKIGATL 171
>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
Length = 285
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 35/69 (50%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
V G + L+ L DV GG T+FP ++V P +G AV++ + LD H+G
Sbjct: 205 VRTGGQRVATLIVYLNDVPSGGETVFPEAGISVVPRRGDAVYFRYMNRLRQLDPATLHAG 264
Query: 119 CPVALGNKW 127
PV G KW
Sbjct: 265 APVRDGEKW 273
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 31/57 (54%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L DV GG T+FP ++V P +G AV++ + LD H+G PV G KW
Sbjct: 217 VYLNDVPSGGETVFPEAGISVVPRRGDAVYFRYMNRLRQLDPATLHAGAPVRDGEKW 273
>gi|325920649|ref|ZP_08182559.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
gi|325548839|gb|EGD19783.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
Length = 422
Score = 50.1 bits (118), Expect = 3e-04, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 33/64 (51%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + V L DV+ GG T FP + V P GS V + N A+ D H+G PV
Sbjct: 345 NRLRTACVYLNDVDAGGETEFPVAGIRVQPRAGSVVCFDNLQADGCPDPDSLHAGLPVTT 404
Query: 62 GNKW 65
G+KW
Sbjct: 405 GSKW 408
Score = 45.8 bits (107), Expect = 0.006, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 29/55 (52%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV+ GG T FP + V P GS V + N A+ D H+G PV G+KW
Sbjct: 354 LNDVDAGGETEFPVAGIRVQPRAGSVVCFDNLQADGCPDPDSLHAGLPVTTGSKW 408
>gi|156352046|ref|XP_001622583.1| predicted protein [Nematostella vectensis]
gi|156209154|gb|EDO30483.1| predicted protein [Nematostella vectensis]
Length = 497
Score = 50.1 bits (118), Expect = 3e-04, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 33/57 (57%), Gaps = 3/57 (5%)
Query: 71 SGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+ L V+ GG+ +F L+V P+KGSAVFW N + H+ CPV +G+KW
Sbjct: 423 NALNFVDAGGSNVFLDSELSVSPQKGSAVFWLNMRRS---GKETLHAACPVIVGHKW 476
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 9/58 (15%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHN---AHANTLLDYWMYHSGCPVALGNKW 65
L V+ GG+ +F L+V P+KGSAVFW N + TL H+ CPV +G+KW
Sbjct: 425 LNFVDAGGSNVFLDSELSVSPQKGSAVFWLNMRRSGKETL------HAACPVIVGHKW 476
>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
Length = 404
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV
Sbjct: 317 GNRVATFMIY-LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPV 375
Query: 122 ALGNKW 127
+G+KW
Sbjct: 376 LVGDKW 381
Score = 48.5 bits (114), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 36/64 (56%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +
Sbjct: 318 NRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLV 377
Query: 62 GNKW 65
G+KW
Sbjct: 378 GDKW 381
>gi|224013908|ref|XP_002296618.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968970|gb|EED87314.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 601
Score = 50.1 bits (118), Expect = 3e-04, Method: Composition-based stats.
Identities = 26/58 (44%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN--TLLDYRMYHSGCPVALGNKWG 128
L DVE GG T FP L+LTV P++G+A+ W + + D R H PVA G K+G
Sbjct: 525 LNDVEEGGGTRFPLLDLTVQPKRGNAILWASVRDDDPEEKDIRTDHEALPVAKGMKYG 582
Score = 46.6 bits (109), Expect = 0.004, Method: Composition-based stats.
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHAN--TLLDYWMYHSGCPVALGNKWG 66
L DVE GG T FP L+LTV P++G+A+ W + + D H PVA G K+G
Sbjct: 525 LNDVEEGGGTRFPLLDLTVQPKRGNAILWASVRDDDPEEKDIRTDHEALPVAKGMKYG 582
>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
Length = 296
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 40/94 (42%), Gaps = 16/94 (17%)
Query: 50 YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLT----------------VFP 93
Y +H ALG +L L+DV GG T+FPS T V P
Sbjct: 144 YDYFHDKYNQALGGHRIATVLMYLSDVVKGGETVFPSSEDTTVKDDSWSDCAKKGIAVKP 203
Query: 94 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
KG A+ +Y+ H + D H GCPV G KW
Sbjct: 204 RKGDALLFYSLHPDATPDESSLHGGCPVIEGEKW 237
Score = 43.9 bits (102), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 32/71 (45%), Gaps = 16/71 (22%)
Query: 11 LTDVELGGATIFPSLNLT----------------VFPEKGSAVFWHNAHANTLLDYWMYH 54
L+DV GG T+FPS T V P KG A+ +++ H + D H
Sbjct: 167 LSDVVKGGETVFPSSEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYSLHPDATPDESSLH 226
Query: 55 SGCPVALGNKW 65
GCPV G KW
Sbjct: 227 GGCPVIEGEKW 237
>gi|341893923|gb|EGT49858.1| CBN-PHY-3 protein [Caenorhabditis brenneri]
Length = 275
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 39/78 (50%), Gaps = 7/78 (8%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
D+WM + + N+ G L+ L E GG T+FPS+ T+ A FW+NA A+
Sbjct: 157 DWWMKN------MRNRIGTLIFV-LKTAEKGGGTVFPSIGTTIRINPRDAFFWFNAQADE 209
Query: 109 LLDYRMYHSGCPVALGNK 126
+ H GCP+ G K
Sbjct: 210 EKEMLADHGGCPIYEGRK 227
Score = 44.7 bits (104), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 27/54 (50%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
L E GG T+FPS+ T+ A FW NA A+ + H GCP+ G K
Sbjct: 174 LKTAEKGGGTVFPSIGTTIRINPRDAFFWFNAQADEEKEMLADHGGCPIYEGRK 227
>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
Length = 281
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
LGN+ L+ L+DV GGAT+FP N+TV K + + + + + D H+GCP
Sbjct: 193 LGNRMATFLMY-LSDVTAGGATVFPVANVTVPVVKNAGLLFMDLLRSGRGDVNSLHAGCP 251
Query: 121 VALGNKW 127
V +G+KW
Sbjct: 252 VVIGSKW 258
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+DV GGAT+FP N+TV K + + + + + D H+GCPV +G+KW
Sbjct: 204 LSDVTAGGATVFPVANVTVPVVKNAGLLFMDLLRSGRGDVNSLHAGCPVVIGSKW 258
>gi|260806885|ref|XP_002598314.1| hypothetical protein BRAFLDRAFT_204780 [Branchiostoma floridae]
gi|229283586|gb|EEN54326.1| hypothetical protein BRAFLDRAFT_204780 [Branchiostoma floridae]
Length = 282
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L++VE GGAT+F N+ V K SAV + N + + H+GCPV +G+KW
Sbjct: 205 LSEVEAGGATVFTEANIAVPVVKNSAVLFENTNKALVRSRASVHAGCPVLIGSKW 259
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 33/55 (60%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++VE GGAT+F N+ V K SAV + N + + H+GCPV +G+KW
Sbjct: 205 LSEVEAGGATVFTEANIAVPVVKNSAVLFENTNKALVRSRASVHAGCPVLIGSKW 259
>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 324
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 56/149 (37%), Gaps = 31/149 (20%)
Query: 4 AEMSSVELTDV------ELGGATIFPSLNLTVFPEKGSAVFWHNAHA-NTLLDYWMYHSG 56
A M S+E+ D+ +L T P N E + + N DY +H
Sbjct: 119 ANMDSLEIDDIVSNVEAKLAAWTFLPEEN----GESMQILHYENGQKYEPHFDY--FHDQ 172
Query: 57 CPVALGNKWGKLLLSGLTDVELGGATIFP------------------SLNLTVFPEKGSA 98
+ LG +L L++VE GG T+FP V P KG A
Sbjct: 173 ANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDA 232
Query: 99 VFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+ ++N H N D H CPV G KW
Sbjct: 233 LLFFNLHPNATTDSNSLHGSCPVVEGEKW 261
>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
Length = 300
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L DV GG TIFP+ L+V ++G AV++ + LD H G PV G+KW
Sbjct: 232 VYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVRAGDKW 288
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 37/73 (50%)
Query: 55 SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
S A G + L+ L DV GG TIFP+ L+V ++G AV++ + LD
Sbjct: 216 SAVHTARGGQRVATLVVYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLT 275
Query: 115 YHSGCPVALGNKW 127
H G PV G+KW
Sbjct: 276 LHGGAPVRAGDKW 288
>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
Length = 308
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + LL+ L+DVE GG TIFP N
Sbjct: 195 YEPHFDYFLDEFNTKNG-----GQRMATLLMY-LSDVEEGGETIFPDANVNVSSLPWYNE 248
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P+ G A+ +++ + LD H GCPV GNKW
Sbjct: 249 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKW 295
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG TIFP N L+V P+ G A+ + + + LD
Sbjct: 222 LSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDATLDPL 281
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 282 SLHGGCPVIRGNKW 295
>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 50/107 (46%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
++AH + LD + +G G + LL+ L+DVE GG T+FP+
Sbjct: 175 YDAHYDYFLDEFNTKNG-----GQRIATLLMY-LSDVEEGGETVFPATKANFSSVPWWNE 228
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P+ G A+ +++ + LD H GCPV GNKW
Sbjct: 229 LSECGKKGLSVKPKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKW 275
Score = 43.5 bits (101), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ L+V P+ G A+ + + + LD
Sbjct: 202 LSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSVKPKMGDALLFWSMRPDATLDPS 261
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 262 SLHGGCPVIKGNKW 275
>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 296
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 56/108 (51%), Gaps = 5/108 (4%)
Query: 25 LNLTVFPEKGSAVFWHNAHANTL--LDYWMYHSGCPVALGNKWGK---LLLSGLTDVELG 79
+NL V +G V + A A ++ D+ + + A + G+ L+S L +VE G
Sbjct: 169 MNLPVENGEGLQVLCYPAGAQSMPHFDFLVPSNAANKASLARSGQRVSTLVSYLNEVEEG 228
Query: 80 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
G TIFP +V P +GSAV++ ++ +D+ H+G PV G KW
Sbjct: 229 GETIFPECGWSVPPRRGSAVYFEYCNSLGQVDHASLHAGGPVLHGEKW 276
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +VE GG TIFP +V P +GSAV++ ++ +D+ H+G PV G KW
Sbjct: 222 LNEVEEGGETIFPECGWSVPPRRGSAVYFEYCNSLGQVDHASLHAGGPVLHGEKW 276
>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
Length = 180
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + LL+ L+DVE GG TIFP N
Sbjct: 67 YEPHFDYFLDEFNTKNG-----GQRMATLLMH-LSDVEEGGETIFPDANVNDSSLPWYNE 120
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P+ G A+ +++ + LD H GCPV GNKW
Sbjct: 121 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKW 167
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 35/76 (46%), Gaps = 19/76 (25%)
Query: 9 VELTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLD 49
+ L+DVE GG TIFP N L+V P+ G A+ + + + LD
Sbjct: 92 MHLSDVEEGGETIFPDANVNDSSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDATLD 151
Query: 50 YWMYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 152 PLSLHGGCPVIRGNKW 167
>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
Length = 307
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + LL+ L+DVE GG TIFP N
Sbjct: 194 YEPHFDYFLDEFNTKNG-----GQRMATLLMY-LSDVEEGGETIFPDANVNASSLPWYNE 247
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P+ G A+ +++ + LD H GCPV GNKW
Sbjct: 248 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKW 294
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG TIFP N L+V P+ G A+ + + + LD
Sbjct: 221 LSDVEEGGETIFPDANVNASSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDATLDPL 280
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 281 SLHGGCPVIRGNKW 294
>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
Length = 310
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 47/107 (43%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + LL+ L+DVE GG TIFP N
Sbjct: 197 YEPHFDYFLDEYNTKNG-----GQRMATLLMY-LSDVEEGGETIFPDANVNSSSLPWYNE 250
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L V P+ G A+ +++ + LD H GCPV GNKW
Sbjct: 251 LSECARKGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKW 297
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 33/74 (44%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG TIFP N L V P+ G A+ + + + LD
Sbjct: 224 LSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDPL 283
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 284 SLHGGCPVIKGNKW 297
>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
gi|194706408|gb|ACF87288.1| unknown [Zea mays]
gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
Length = 217
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 42/98 (42%), Gaps = 20/98 (20%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------L 89
DY +H + LG + +L LTDV GG T+FP+ L
Sbjct: 57 FDY--FHDRNNLKLGGQRVATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGL 114
Query: 90 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P+KG A+ ++N H N D H CPV G KW
Sbjct: 115 AVKPKKGDALLFFNLHVNATADTGSLHGSCPVIEGEKW 152
Score = 44.7 bits (104), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 31/73 (42%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
LTDV GG T+FP+ L V P+KG A+ + N H N D
Sbjct: 80 LTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNATADTGS 139
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 140 LHGSCPVIEGEKW 152
>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
Length = 309
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 39/76 (51%), Gaps = 8/76 (10%)
Query: 56 GCPVAL--GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLD 111
G P L G + ++ L D E GG T FP ++L V P +G+AVF + H +T
Sbjct: 226 GTPTILRRGGQRVATIVIYLNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERPHPST--- 282
Query: 112 YRMYHSGCPVALGNKW 127
R H G PV G+KW
Sbjct: 283 -RTLHGGAPVVAGDKW 297
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 31/57 (54%), Gaps = 6/57 (10%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
L D E GG T FP ++L V P +G+AVF + H +T H G PV G+KW
Sbjct: 245 LNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERPHPSTR----TLHGGAPVVAGDKW 297
>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
Length = 188
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 49/107 (45%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + +L+ L+DVE GG TIFP+
Sbjct: 75 YEPHYDYFLDEFNTKNG-----GQRIATVLMY-LSDVEEGGETIFPAAKANFSSVPWYND 128
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P++G A+ +++ + LD H GCPV GNKW
Sbjct: 129 LSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKW 175
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG TIFP+ L+V P++G A+ + + + LD
Sbjct: 102 LSDVEEGGETIFPAAKANFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPS 161
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 162 SLHGGCPVIRGNKW 175
>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
Length = 296
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + G LL+ L DVE GG T+FP + +V P++G A+++ + L D H+ P+
Sbjct: 215 GQRMGTLLMY-LKDVEGGGETVFPQIGWSVAPQRGHALYFEYGNRFGLCDPSSLHASTPL 273
Query: 122 ALGNKW 127
+G+KW
Sbjct: 274 RVGDKW 279
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T+FP + +V P++G A+++ + L D H+ P+ +G+KW
Sbjct: 225 LKDVEGGGETVFPQIGWSVAPQRGHALYFEYGNRFGLCDPSSLHASTPLRVGDKW 279
>gi|26352077|dbj|BAC39675.1| unnamed protein product [Mus musculus]
Length = 383
Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 34/55 (61%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ VE GGAT F N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 306 LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 360
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L+ VE GGAT F N +V K +A+FW N H + D H+GCPV +G+KW
Sbjct: 306 LSSVEAGGATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 360
>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 289
Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + +L+ LTDVE GG T+FP+
Sbjct: 176 YEPHYDYFLDEFNTKNG-----GQRIATVLMY-LTDVEEGGETVFPAAKGNFSNVPWYNE 229
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++ P++G A+ +++ + LD H GCPV GNKW
Sbjct: 230 LSDCGKKGLSIKPKRGDALLFWSMKPDATLDASSLHGGCPVIKGNKW 276
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
LTDVE GG T+FP+ L++ P++G A+ + + + LD
Sbjct: 203 LTDVEEGGETVFPAAKGNFSNVPWYNELSDCGKKGLSIKPKRGDALLFWSMKPDATLDAS 262
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 263 SLHGGCPVIKGNKW 276
>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 296
Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 56/108 (51%), Gaps = 5/108 (4%)
Query: 25 LNLTVFPEKGSAVFWHNAHANTL--LDYWMYHSGCPVALGNKWGK---LLLSGLTDVELG 79
+NL V +G V + A A +L D+ + + A + G+ L++ L +VE G
Sbjct: 169 MNLPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLVAYLNEVEEG 228
Query: 80 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
G T+FP +V P++G AV++ ++ +D+ H+G PV G KW
Sbjct: 229 GETVFPETGWSVSPQRGGAVYFEYCNSLGQVDHASLHAGAPVLSGEKW 276
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +VE GG T+FP +V P++G AV++ ++ +D+ H+G PV G KW
Sbjct: 222 LNEVEEGGETVFPETGWSVSPQRGGAVYFEYCNSLGQVDHASLHAGAPVLSGEKW 276
>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
Length = 277
Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 38/76 (50%), Gaps = 4/76 (5%)
Query: 54 HSGCPVAL--GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLD 111
G P L G + L+ L + E GGAT+FP + L V P +G+AVF+ AN
Sbjct: 192 EPGTPAILQRGGQRVATLIMYLNEPEQGGATVFPDIGLQVTPRRGTAVFFSYPAANPASL 251
Query: 112 YRMYHSGCPVALGNKW 127
R H G PV G KW
Sbjct: 252 TR--HGGEPVKAGEKW 265
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 30/55 (54%), Gaps = 2/55 (3%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L + E GGAT+FP + L V P +G+AVF+ AN H G PV G KW
Sbjct: 213 LNEPEQGGATVFPDIGLQVTPRRGTAVFFSYPAANPA--SLTRHGGEPVKAGEKW 265
>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
Length = 298
Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 39/76 (51%), Gaps = 8/76 (10%)
Query: 56 GCPVAL--GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLD 111
G P L G + ++ L D E GG T FP ++L V P +G+AVF + H +T
Sbjct: 215 GTPTILRRGGQRVATIVIYLNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERPHPST--- 271
Query: 112 YRMYHSGCPVALGNKW 127
R H G PV G+KW
Sbjct: 272 -RTLHGGAPVVAGDKW 286
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 31/57 (54%), Gaps = 6/57 (10%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
L D E GG T FP ++L V P +G+AVF + H +T H G PV G+KW
Sbjct: 234 LNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERPHPSTR----TLHGGAPVVAGDKW 286
>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
Length = 180
Score = 48.9 bits (115), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + LL+ L+DVE GG TIFP N
Sbjct: 67 YEPHFDYFLDEFNTKNG-----GQRMATLLMY-LSDVEEGGETIFPDANVNVSSLPWYNE 120
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P+ G A+ +++ + LD H GCPV GNKW
Sbjct: 121 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKW 167
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG TIFP N L+V P+ G A+ + + + LD
Sbjct: 94 LSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDATLDPL 153
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 154 SLHGGCPVIRGNKW 167
>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
Length = 307
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + LL+ L+DVE GG TIFP N
Sbjct: 194 YEPHFDYFLDEFNTKNG-----GQRIATLLMY-LSDVEEGGETIFPDANVNASSLPWYNE 247
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P+ G A+ +++ + LD H GCPV GNKW
Sbjct: 248 LSDCAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKW 294
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG TIFP N L+V P+ G A+ + + + LD
Sbjct: 221 LSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDATLDPL 280
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 281 SLHGGCPVIKGNKW 294
>gi|21107513|gb|AAM36222.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 273
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 32/64 (50%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L DV GG T FP + V P G+ V + N HA+ D H+G PV
Sbjct: 196 NRQRTVCVYLNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 255
Query: 62 GNKW 65
G+KW
Sbjct: 256 GSKW 259
Score = 44.7 bits (104), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 29/55 (52%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GG T FP + V P G+ V + N HA+ D H+G PV G+KW
Sbjct: 205 LNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 259
>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
Length = 307
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + LL+ L+DVE GG TIFP N
Sbjct: 194 YEPHFDYFLDEFNTKNG-----GQRIATLLMY-LSDVEEGGETIFPDANVNASSLPWYNE 247
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P+ G A+ +++ + LD H GCPV GNKW
Sbjct: 248 LSDCAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKW 294
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG TIFP N L+V P+ G A+ + + + LD
Sbjct: 221 LSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDATLDPL 280
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 281 SLHGGCPVIKGNKW 294
>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
Length = 300
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L DV GG TIFP+ L+V ++G AV++ + LD H G PV G+KW
Sbjct: 232 VYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVHAGDKW 288
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 37/73 (50%)
Query: 55 SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
S A G + L+ L DV GG TIFP+ L+V ++G AV++ + LD
Sbjct: 216 SAVHTARGGQRVATLVVYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLT 275
Query: 115 YHSGCPVALGNKW 127
H G PV G+KW
Sbjct: 276 LHGGAPVHAGDKW 288
>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
Length = 313
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 31/57 (54%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L DV GG TIFP L+V ++G AV++ + LD H G PV G+KW
Sbjct: 245 VYLNDVPDGGETIFPEAGLSVAAQQGGAVYFRYMNGRRQLDPLTLHGGAPVLSGDKW 301
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 35/69 (50%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
A G + L+ L DV GG TIFP L+V ++G AV++ + LD H G
Sbjct: 233 TARGGQRVATLVVYLNDVPDGGETIFPEAGLSVAAQQGGAVYFRYMNGRRQLDPLTLHGG 292
Query: 119 CPVALGNKW 127
PV G+KW
Sbjct: 293 APVLSGDKW 301
>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 318
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 48/107 (44%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + +L+ L+DVE GG TIFP N
Sbjct: 205 YEPHFDYFLDEFNTKNG-----GQRMATILMY-LSDVEEGGETIFPDANVNSSSLPWHNE 258
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L V P+ G A+ +++ + + LD H GCPV GNKW
Sbjct: 259 LSECARKGLAVKPKMGDALLFWSMNPDATLDPLSLHGGCPVIRGNKW 305
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG TIFP N L V P+ G A+ + + + + LD
Sbjct: 232 LSDVEEGGETIFPDANVNSSSLPWHNELSECARKGLAVKPKMGDALLFWSMNPDATLDPL 291
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 292 SLHGGCPVIRGNKW 305
>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
Length = 302
Score = 48.5 bits (114), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + G LL+ L DVE GG T+FP + +V P +G A ++ + + D H+ P+
Sbjct: 215 GQRVGTLLMY-LNDVESGGETLFPQVGCSVVPRRGQAFYFEYGNGSGRSDPASLHASSPI 273
Query: 122 ALGNKW 127
G+KW
Sbjct: 274 GSGDKW 279
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 30/55 (54%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T+FP + +V P +G A ++ + + D H+ P+ G+KW
Sbjct: 225 LNDVESGGETLFPQVGCSVVPRRGQAFYFEYGNGSGRSDPASLHASSPIGSGDKW 279
>gi|78046960|ref|YP_363135.1| hypothetical protein XCV1404 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78035390|emb|CAJ23035.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 418
Score = 48.5 bits (114), Expect = 8e-04, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 32/64 (50%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L DV GG T FP + V P G+ V + N HA+ D H+G PV
Sbjct: 341 NRQRTVCVYLNDVGAGGETEFPVAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 400
Query: 62 GNKW 65
G+KW
Sbjct: 401 GSKW 404
Score = 45.4 bits (106), Expect = 0.008, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 29/55 (52%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GG T FP + V P G+ V + N HA+ D H+G PV G+KW
Sbjct: 350 LNDVGAGGETEFPVAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 404
>gi|346724248|ref|YP_004850917.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648995|gb|AEO41619.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 418
Score = 48.5 bits (114), Expect = 8e-04, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 32/64 (50%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L DV GG T FP + V P G+ V + N HA+ D H+G PV
Sbjct: 341 NRQRTVCVYLNDVGAGGETEFPVAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 400
Query: 62 GNKW 65
G+KW
Sbjct: 401 GSKW 404
Score = 45.4 bits (106), Expect = 0.008, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 29/55 (52%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GG T FP + V P G+ V + N HA+ D H+G PV G+KW
Sbjct: 350 LNDVGAGGETEFPVAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 404
>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 328
Score = 48.5 bits (114), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 33/73 (45%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSL------------------NLTVFPEKGSAVFWHNAHANTLLDYWM 52
LTDVE GG TIFP L V P KG A+F+ + H N D
Sbjct: 183 LTDVEEGGETIFPDAEAGANPGGGDDASSCAAGKLAVKPRKGDALFFRSLHHNGTSDAMS 242
Query: 53 YHSGCPVALGNKW 65
H+GCPV G K+
Sbjct: 243 SHAGCPVVKGVKF 255
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 40/98 (40%), Gaps = 18/98 (18%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSL------------------NL 89
DY+ G + +L LTDVE GG TIFP L
Sbjct: 158 FDYFQDEFNQKREKGGQRVATVLMYLTDVEEGGETIFPDAEAGANPGGGDDASSCAAGKL 217
Query: 90 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P KG A+F+ + H N D H+GCPV G K+
Sbjct: 218 AVKPRKGDALFFRSLHHNGTSDAMSSHAGCPVVKGVKF 255
>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 294
Score = 48.5 bits (114), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 58/146 (39%), Gaps = 25/146 (17%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
+ RAE +E + + T P + F V + H Y +H V
Sbjct: 84 LGRAEDDVIEAIEARIAAWTHVPESHGEGF-----QVLRYEKHQEYRAHYDYFHDKFNVK 138
Query: 61 L---GNKWGKLLLSGLTDVELGGATIFPSLN----------------LTVFPEKGSAVFW 101
G + G +L+ L+DVE GG T+FP L V P KG A+F+
Sbjct: 139 REKGGQRMGTVLMY-LSDVEEGGETVFPKFEDGTPAGSEASECARNKLAVRPRKGDALFF 197
Query: 102 YNAHANTLLDYRMYHSGCPVALGNKW 127
+ + + D H+GCPV G K+
Sbjct: 198 RSLRHDGVPDTFSEHAGCPVIRGVKF 223
>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
Length = 321
Score = 48.5 bits (114), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 39/84 (46%), Gaps = 19/84 (22%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLN------------------LTVFPEKGSAVFWYN 103
G++ +L+ L++V++GG TIFP V P KGSAV +++
Sbjct: 175 GDRVATVLMY-LSNVKMGGETIFPDCEARLSQPKDETWSDCAEQGFAVKPAKGSAVLFFS 233
Query: 104 AHANTLLDYRMYHSGCPVALGNKW 127
H N LD H CPV G KW
Sbjct: 234 LHPNATLDTDSLHGSCPVIEGEKW 257
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 32/73 (43%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
L++V++GG TIFP V P KGSAV + + H N LD
Sbjct: 185 LSNVKMGGETIFPDCEARLSQPKDETWSDCAEQGFAVKPAKGSAVLFFSLHPNATLDTDS 244
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 245 LHGSCPVIEGEKW 257
>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
Length = 307
Score = 48.5 bits (114), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 30/55 (54%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DV GG TIFP L+V ++G AV++ + LD H G PV G+KW
Sbjct: 241 LNDVPDGGETIFPEAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVRAGDKW 295
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 35/68 (51%)
Query: 60 ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
A G + L+ L DV GG TIFP L+V ++G AV++ + LD H G
Sbjct: 228 ARGGQRVATLVIYLNDVPDGGETIFPEAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGA 287
Query: 120 PVALGNKW 127
PV G+KW
Sbjct: 288 PVRAGDKW 295
>gi|390989336|ref|ZP_10259634.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
gi|372555840|emb|CCF66609.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
Length = 228
Score = 48.5 bits (114), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 32/64 (50%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L DV GG T FP + V P G+ V + N HA+ D H+G PV
Sbjct: 151 NRQRTVCVYLNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 210
Query: 62 GNKW 65
G+KW
Sbjct: 211 GSKW 214
Score = 44.3 bits (103), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 34/66 (51%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ + + L DV GG T FP + V P G+ V + N HA+ D H+G PV
Sbjct: 150 GNRQ-RTVCVYLNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPV 208
Query: 122 ALGNKW 127
G+KW
Sbjct: 209 TAGSKW 214
>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
Length = 279
Score = 48.5 bits (114), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + G L+ L DV+ GGAT FP+LN + P+KG A+F+ N N + H+G PV
Sbjct: 204 GQRVGTFLVY-LCDVDAGGATRFPALNFEIRPKKGMALFFANTLPNGEGNPLTLHAGVPV 262
Query: 122 ALGNKW 127
G K+
Sbjct: 263 VSGVKY 268
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 34/57 (59%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L DV+ GGAT FP+LN + P+KG A+F+ N N + H+G PV G K+
Sbjct: 212 VYLCDVDAGGATRFPALNFEIRPKKGMALFFANTLPNGEGNPLTLHAGVPVVSGVKY 268
>gi|268572523|ref|XP_002641343.1| C. briggsae CBR-DPY-18 protein [Caenorhabditis briggsae]
gi|94442971|emb|CAJ98658.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 559
Score = 48.5 bits (114), Expect = 8e-04, Method: Composition-based stats.
Identities = 23/66 (34%), Positives = 34/66 (51%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++ GG T+F + TV P K A+FWYN + + H+ CPV
Sbjct: 440 GNRIATVLFY-MSQPSHGGGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAACPV 498
Query: 122 ALGNKW 127
+G KW
Sbjct: 499 LVGIKW 504
Score = 45.4 bits (106), Expect = 0.008, Method: Composition-based stats.
Identities = 19/49 (38%), Positives = 27/49 (55%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GG T+F + TV P K A+FW+N + + H+ CPV +G KW
Sbjct: 456 GGGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 504
>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
Length = 225
Score = 48.5 bits (114), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 19/78 (24%)
Query: 69 LLSGLTDVELGGATIFPS-------------------LNLTVFPEKGSAVFWYNAHANTL 109
LL LTDVE GG T+FP ++V P++G A+ +++ +
Sbjct: 135 LLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQ 194
Query: 110 LDYRMYHSGCPVALGNKW 127
LD+ H GCPV G+KW
Sbjct: 195 LDHSSLHGGCPVIKGDKW 212
Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPS-------------------LNLTVFPEKGSAVFWHNAHANTLLDYW 51
LTDVE GG T+FP ++V P++G A+ + + + LD+
Sbjct: 139 LTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQLDHS 198
Query: 52 MYHSGCPVALGNKW 65
H GCPV G+KW
Sbjct: 199 SLHGGCPVIKGDKW 212
>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
Length = 213
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 19/78 (24%)
Query: 69 LLSGLTDVELGGATIFPS-------------------LNLTVFPEKGSAVFWYNAHANTL 109
LL LTDVE GG T+FP ++V P++G A+ +++ +
Sbjct: 123 LLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQ 182
Query: 110 LDYRMYHSGCPVALGNKW 127
LD+ H GCPV G+KW
Sbjct: 183 LDHSSLHGGCPVIKGDKW 200
Score = 44.7 bits (104), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPS-------------------LNLTVFPEKGSAVFWHNAHANTLLDYW 51
LTDVE GG T+FP ++V P++G A+ + + + LD+
Sbjct: 127 LTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQLDHS 186
Query: 52 MYHSGCPVALGNKW 65
H GCPV G+KW
Sbjct: 187 SLHGGCPVIKGDKW 200
>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
Length = 290
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + +L+ LTDVE GG T+FP+
Sbjct: 177 YEPHYDYFLDDFNTKNG-----GQRIATVLMY-LTDVEEGGETVFPAAKGNFSSVPWWNE 230
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++ P++G A+ +++ + LD H GCPV GNKW
Sbjct: 231 LSECGKKGLSIKPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGNKW 277
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
LTDVE GG T+FP+ L++ P++G A+ + + + LD
Sbjct: 204 LTDVEEGGETVFPAAKGNFSSVPWWNELSECGKKGLSIKPKRGDALLFWSMKPDATLDPS 263
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 264 SLHGGCPVIKGNKW 277
>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
Length = 180
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 37/78 (47%), Gaps = 19/78 (24%)
Query: 69 LLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTL 109
LL L+DVE GG TIFP N L+V P+ G A+ +++ +
Sbjct: 90 LLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDAT 149
Query: 110 LDYRMYHSGCPVALGNKW 127
LD H GCPV GNKW
Sbjct: 150 LDPLSLHGGCPVIKGNKW 167
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG TIFP N L+V P+ G A+ + + + LD
Sbjct: 94 LSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDATLDPL 153
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 154 SLHGGCPVIKGNKW 167
>gi|224006556|ref|XP_002292238.1| hypothetical protein THAPSDRAFT_269421 [Thalassiosira pseudonana
CCMP1335]
gi|220971880|gb|EED90213.1| hypothetical protein THAPSDRAFT_269421 [Thalassiosira pseudonana
CCMP1335]
Length = 348
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 17/99 (17%)
Query: 45 NTLLDYWMYHSGCPVALGNKWGKLL---LSGLTDVELGGATIF---PS----------LN 88
L ++ + V N+ G+ L L L DVE GG T F P+ LN
Sbjct: 225 QALAPHYDANRSADVEDKNRGGQTLATLLVYLNDVENGGTTRFGKLPAADDKRQVKGELN 284
Query: 89 LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L + P+KG A+ ++ A N + D R H GCP A+ KW
Sbjct: 285 LNIIPKKGDALLFFPADKNGMFDERTEHEGCP-AVDEKW 322
Score = 43.1 bits (100), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 34/70 (48%), Gaps = 14/70 (20%)
Query: 9 VELTDVELGGATIF---PS----------LNLTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
V L DVE GG T F P+ LNL + P+KG A+ + A N + D H
Sbjct: 254 VYLNDVENGGTTRFGKLPAADDKRQVKGELNLNIIPKKGDALLFFPADKNGMFDERTEHE 313
Query: 56 GCPVALGNKW 65
GCP A+ KW
Sbjct: 314 GCP-AVDEKW 322
>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 286
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 31/55 (56%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DV GG T+FP+ L+V + G AV++ +A LD H G PV G+KW
Sbjct: 220 LNDVADGGETVFPTAGLSVAAQAGGAVYFRYMNAERQLDPSTLHGGAPVLAGDKW 274
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + L++ L DV GG T+FP+ L+V + G AV++ +A LD H G PV
Sbjct: 210 GQRVATLIIY-LNDVADGGETVFPTAGLSVAAQAGGAVYFRYMNAERQLDPSTLHGGAPV 268
Query: 122 ALGNKW 127
G+KW
Sbjct: 269 LAGDKW 274
>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
C-169]
Length = 327
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 33/75 (44%), Gaps = 20/75 (26%)
Query: 73 LTDVELGGATIFPS--------LNLTVF------------PEKGSAVFWYNAHANTLLDY 112
LTDVE GG TIFP L +T F P G A+ +Y+ H N D
Sbjct: 237 LTDVEEGGETIFPLEGRYGPDLLKMTGFNYKSCTTGFKYKPRMGDALMFYSMHPNGTFDK 296
Query: 113 RMYHSGCPVALGNKW 127
H GCPV G KW
Sbjct: 297 HALHGGCPVMAGEKW 311
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 33/75 (44%), Gaps = 20/75 (26%)
Query: 11 LTDVELGGATIFPS--------LNLTVF------------PEKGSAVFWHNAHANTLLDY 50
LTDVE GG TIFP L +T F P G A+ +++ H N D
Sbjct: 237 LTDVEEGGETIFPLEGRYGPDLLKMTGFNYKSCTTGFKYKPRMGDALMFYSMHPNGTFDK 296
Query: 51 WMYHSGCPVALGNKW 65
H GCPV G KW
Sbjct: 297 HALHGGCPVMAGEKW 311
>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
gi|255647110|gb|ACU24023.1| unknown [Glycine max]
Length = 289
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + +L+ LTDVE GG T+FP+
Sbjct: 176 YEPHYDYFLDDFNTKNG-----GQRIATVLMY-LTDVEEGGETVFPAAKGNFSFVPWWNE 229
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++ P++G A+ +++ + LD H GCPV GNKW
Sbjct: 230 LFECGKKGLSIKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 276
Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
LTDVE GG T+FP+ L++ P++G A+ + + + LD
Sbjct: 203 LTDVEEGGETVFPAAKGNFSFVPWWNELFECGKKGLSIKPKRGDALLFWSMKPDASLDPS 262
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 263 SLHGGCPVIKGNKW 276
>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
from Gallus gallus gi|212530 [Arabidopsis thaliana]
gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 287
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 19/78 (24%)
Query: 69 LLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTL 109
+L L+DVE GG T+FP+ N L+V P G A+ +++ +
Sbjct: 197 MLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDAT 256
Query: 110 LDYRMYHSGCPVALGNKW 127
LD H GCPV GNKW
Sbjct: 257 LDPTSLHGGCPVIRGNKW 274
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ N L+V P G A+ + + + LD
Sbjct: 201 LSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPT 260
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 261 SLHGGCPVIRGNKW 274
>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 19/78 (24%)
Query: 69 LLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTL 109
+L L+DVE GG T+FP+ N L+V P G A+ +++ +
Sbjct: 197 MLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDAT 256
Query: 110 LDYRMYHSGCPVALGNKW 127
LD H GCPV GNKW
Sbjct: 257 LDPTSLHGGCPVIRGNKW 274
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ N L+V P G A+ + + + LD
Sbjct: 201 LSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPT 260
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 261 SLHGGCPVIRGNKW 274
>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
Length = 268
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + G L+ L++V+ GG+T FP+LN + P KGSA+++ N + ++ H+G PV
Sbjct: 192 GQRVGTFLMY-LSEVDSGGSTRFPNLNFEIRPNKGSALYFANTNLKAEIEPLTLHAGMPV 250
Query: 122 ALGNKW 127
G K+
Sbjct: 251 TEGVKY 256
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 35/55 (63%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L++V+ GG+T FP+LN + P KGSA+++ N + ++ H+G PV G K+
Sbjct: 202 LSEVDSGGSTRFPNLNFEIRPNKGSALYFANTNLKAEIEPLTLHAGMPVTEGVKY 256
>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 287
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 19/78 (24%)
Query: 69 LLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTL 109
+L L+DVE GG T+FP+ N L+V P G A+ +++ +
Sbjct: 197 MLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDAT 256
Query: 110 LDYRMYHSGCPVALGNKW 127
LD H GCPV GNKW
Sbjct: 257 LDPTSLHGGCPVIRGNKW 274
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ N L+V P G A+ + + + LD
Sbjct: 201 LSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPT 260
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 261 SLHGGCPVIRGNKW 274
>gi|397620797|gb|EJK65916.1| hypothetical protein THAOC_13183 [Thalassiosira oceanica]
Length = 520
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 24/55 (43%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DVE GG T FP ++L + P++G AV + A D R H G P AL +KW
Sbjct: 438 LNDVEEGGTTYFPEVDLDIRPKRGMAVIHFPADVEMREDKRTLHQGSP-ALDDKW 491
Score = 43.1 bits (100), Expect = 0.041, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP ++L + P++G AV A D H G P AL +KW
Sbjct: 438 LNDVEEGGTTYFPEVDLDIRPKRGMAVIHFPADVEMREDKRTLHQGSP-ALDDKW 491
>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
Length = 299
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 40/79 (50%), Gaps = 10/79 (12%)
Query: 54 HSGCPVAL---GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANT 108
G P L G + G L++ L E GG T FP +++ V P++G+AVF + AH T
Sbjct: 214 EPGTPTILKRGGQRVGTLVMY-LNTPEKGGGTTFPDVHVEVAPQRGNAVFFSYERAHPAT 272
Query: 109 LLDYRMYHSGCPVALGNKW 127
R H G PV G KW
Sbjct: 273 ----RTLHGGAPVIAGEKW 287
Score = 43.5 bits (101), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 30/57 (52%), Gaps = 6/57 (10%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
L E GG T FP +++ V P++G+AVF + AH T H G PV G KW
Sbjct: 235 LNTPEKGGGTTFPDVHVEVAPQRGNAVFFSYERAHPATR----TLHGGAPVIAGEKW 287
>gi|313242424|emb|CBY34571.1| unnamed protein product [Oikopleura dioica]
Length = 503
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 31/57 (54%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+ L + + GG T+F + P SAVFWYN + DYR H+ CPV +G K
Sbjct: 414 LTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 470
Score = 42.0 bits (97), Expect = 0.075, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 30/63 (47%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L + + GG T+F + P SAVFW+N + DY H+ CPV +
Sbjct: 408 NRIATWLTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLI 467
Query: 62 GNK 64
G K
Sbjct: 468 GQK 470
>gi|313241587|emb|CBY33829.1| unnamed protein product [Oikopleura dioica]
Length = 541
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 31/57 (54%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+ L + + GG T+F + P SAVFWYN + DYR H+ CPV +G K
Sbjct: 452 LTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 508
Score = 42.0 bits (97), Expect = 0.075, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 30/63 (47%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L + + GG T+F + P SAVFW+N + DY H+ CPV +
Sbjct: 446 NRIATWLTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLI 505
Query: 62 GNK 64
G K
Sbjct: 506 GQK 508
>gi|313229343|emb|CBY23930.1| unnamed protein product [Oikopleura dioica]
Length = 542
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 31/57 (54%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+ L + + GG T+F + P SAVFWYN + DYR H+ CPV +G K
Sbjct: 453 LTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 509
Score = 42.0 bits (97), Expect = 0.075, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 30/63 (47%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L + + GG T+F + P SAVFW+N + DY H+ CPV +
Sbjct: 447 NRIATWLTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLI 506
Query: 62 GNK 64
G K
Sbjct: 507 GQK 509
>gi|313213106|emb|CBY36968.1| unnamed protein product [Oikopleura dioica]
Length = 541
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 23/57 (40%), Positives = 31/57 (54%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L+ L + + GG T+F + P SAVFWYN + DYR H+ CPV +G K
Sbjct: 452 LTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 508
Score = 42.0 bits (97), Expect = 0.075, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 30/63 (47%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR L + + GG T+F + P SAVFW+N + DY H+ CPV +
Sbjct: 446 NRIATWLTYLNEPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLI 505
Query: 62 GNK 64
G K
Sbjct: 506 GQK 508
>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + +L+ L+DVE GG T+FP+
Sbjct: 174 YEPHFDYFLDEYNTKNG-----GQRIATVLMY-LSDVEEGGETVFPAAKGNFSSVPWWNE 227
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P++G A+ +++ + LD H GCPV GNKW
Sbjct: 228 LSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 274
Score = 44.7 bits (104), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ L+V P++G A+ + + + LD
Sbjct: 201 LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPS 260
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 261 SLHGGCPVIKGNKW 274
>gi|397617249|gb|EJK64350.1| hypothetical protein THAOC_14927, partial [Thalassiosira oceanica]
Length = 620
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 29/65 (44%), Positives = 37/65 (56%), Gaps = 4/65 (6%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNA--HANTLLDYRMYHSGCPV--ALGNKWG 128
L DVE GG T FP L++TV P+KGSA+ W + A DYR H PV L N
Sbjct: 517 LNDVEEGGGTSFPLLDVTVQPKKGSALLWPSVLDAAPESKDYRTDHEALPVIKGLNNASQ 576
Query: 129 KLLLS 133
+L+L+
Sbjct: 577 QLILN 581
Score = 45.1 bits (105), Expect = 0.010, Method: Composition-based stats.
Identities = 28/65 (43%), Positives = 36/65 (55%), Gaps = 4/65 (6%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNA--HANTLLDYWMYHSGCPV--ALGNKWG 66
L DVE GG T FP L++TV P+KGSA+ W + A DY H PV L N
Sbjct: 517 LNDVEEGGGTSFPLLDVTVQPKKGSALLWPSVLDAAPESKDYRTDHEALPVIKGLNNASQ 576
Query: 67 KLLLS 71
+L+L+
Sbjct: 577 QLILN 581
>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
Length = 286
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + ++L L +VE GG T FP + L + P +GSA+++ + D + H+G PV
Sbjct: 207 GQRIATVILY-LNEVEQGGDTTFPDIGLAIHPRRGSALYFEYVNELGQSDPKTLHAGTPV 265
Query: 122 ALGNKW 127
G KW
Sbjct: 266 EKGEKW 271
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 30/55 (54%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L +VE GG T FP + L + P +GSA+++ + D H+G PV G KW
Sbjct: 217 LNEVEQGGDTTFPDIGLAIHPRRGSALYFEYVNELGQSDPKTLHAGTPVEKGEKW 271
>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + +L+ L+DVE GG T+FP+
Sbjct: 174 YEPHFDYFLDEYNTKNG-----GQRIATVLMY-LSDVEEGGETVFPAAKGNFSSVPWWNE 227
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P++G A+ +++ + LD H GCPV GNKW
Sbjct: 228 LSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 274
Score = 44.7 bits (104), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ L+V P++G A+ + + + LD
Sbjct: 201 LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPS 260
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 261 SLHGGCPVIKGNKW 274
>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
Length = 293
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 28/49 (57%)
Query: 79 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
GGAT FP + L V P KG+AV + + LD R H+G PV G KW
Sbjct: 230 GGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAGEKW 278
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 27/49 (55%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GGAT FP + L V P KG+AV + + LD H+G PV G KW
Sbjct: 230 GGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAGEKW 278
>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 290
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 49/108 (45%), Gaps = 25/108 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + +L+ L+DVE GG T+FP+ N
Sbjct: 177 YEPHYDYFLDEFNTKNG-----GQRIATVLMY-LSDVEEGGETVFPAANANFSSVPWWND 230
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L+V P+ G A+ +++ + LD H GCPV GNKW
Sbjct: 231 LSQCARKGLSVKPKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKWS 278
Score = 45.1 bits (105), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 35/75 (46%), Gaps = 19/75 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ N L+V P+ G A+ + + + LD
Sbjct: 204 LSDVEEGGETVFPAANANFSSVPWWNDLSQCARKGLSVKPKMGDALLFWSMRPDATLDPS 263
Query: 52 MYHSGCPVALGNKWG 66
H GCPV GNKW
Sbjct: 264 SLHGGCPVIKGNKWS 278
>gi|77748579|ref|NP_641686.2| hypothetical protein XAC1351 [Xanthomonas axonopodis pv. citri str.
306]
Length = 418
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 32/64 (50%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L DV GG T FP + V P G+ V + N HA+ D H+G PV
Sbjct: 341 NRQRTVCVYLNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 400
Query: 62 GNKW 65
G+KW
Sbjct: 401 GSKW 404
Score = 44.7 bits (104), Expect = 0.013, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 29/55 (52%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GG T FP + V P G+ V + N HA+ D H+G PV G+KW
Sbjct: 350 LNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 404
>gi|418521653|ref|ZP_13087695.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410702188|gb|EKQ60697.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 418
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 32/64 (50%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L DV GG T FP + V P G+ V + N HA+ D H+G PV
Sbjct: 341 NRQRTVCVYLNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 400
Query: 62 GNKW 65
G+KW
Sbjct: 401 GSKW 404
Score = 44.7 bits (104), Expect = 0.013, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 29/55 (52%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GG T FP + V P G+ V + N HA+ D H+G PV G+KW
Sbjct: 350 LNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 404
>gi|381173085|ref|ZP_09882194.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380686458|emb|CCG38681.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 418
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 26/64 (40%), Positives = 32/64 (50%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L DV GG T FP + V P G+ V + N HA+ D H+G PV
Sbjct: 341 NRQRTVCVYLNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 400
Query: 62 GNKW 65
G+KW
Sbjct: 401 GSKW 404
Score = 44.7 bits (104), Expect = 0.013, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 29/55 (52%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GG T FP + V P G+ V + N HA+ D H+G PV G+KW
Sbjct: 350 LNDVGAGGDTEFPIAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 404
>gi|323453438|gb|EGB09310.1| hypothetical protein AURANDRAFT_63401 [Aureococcus anophagefferens]
Length = 572
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 3/52 (5%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWH--NAHANTLLDYW-MYHSGCPVALGNKW 65
GG T FP+ N+ V P KG AV++H + +++ W HSGCPV G KW
Sbjct: 494 GGGTAFPNANVHVVPAKGQAVYFHFRGPEPDGVVEDWHTEHSGCPVRSGEKW 545
Score = 46.2 bits (108), Expect = 0.004, Method: Composition-based stats.
Identities = 26/75 (34%), Positives = 36/75 (48%), Gaps = 5/75 (6%)
Query: 56 GCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWY---NAHANTLLDY 112
G P G + +L+ GG T FP+ N+ V P KG AV+++ + D+
Sbjct: 473 GSPFVAGGRLATVLM--YCAAADGGGTAFPNANVHVVPAKGQAVYFHFRGPEPDGVVEDW 530
Query: 113 RMYHSGCPVALGNKW 127
HSGCPV G KW
Sbjct: 531 HTEHSGCPVRSGEKW 545
>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
Length = 480
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 32/56 (57%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNK 64
+ L +VE GG T+FP L + +KG AV W+N + + ++Y H G PV G K
Sbjct: 211 IYLNEVEQGGDTVFPELAIGFKAKKGMAVIWNNINPDGSVNYQTLHQGMPVQKGEK 266
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 31/54 (57%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
L +VE GG T+FP L + +KG AV W N + + ++Y+ H G PV G K
Sbjct: 213 LNEVEQGGDTVFPELAIGFKAKKGMAVIWNNINPDGSVNYQTLHQGMPVQKGEK 266
>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
Length = 288
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 50/107 (46%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ AH + LD + +G G + LL+ L+DVE GG T+FP+ N
Sbjct: 175 YEAHFDYFLDEFNTKNG-----GQRTATLLMY-LSDVEEGGETVFPAANMNISAVPWWNE 228
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L++ P+ G+A+ +++ + LD H CPV GNKW
Sbjct: 229 LSECAKQGLSLKPKMGNALLFWSTRPDATLDPSSLHGSCPVIRGNKW 275
Score = 42.7 bits (99), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ N L++ P+ G+A+ + + + LD
Sbjct: 202 LSDVEEGGETVFPAANMNISAVPWWNELSECAKQGLSLKPKMGNALLFWSTRPDATLDPS 261
Query: 52 MYHSGCPVALGNKW 65
H CPV GNKW
Sbjct: 262 SLHGSCPVIRGNKW 275
>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 272
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 32/70 (45%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
L+ VE GG T+FP N L V P +G A+F+YN N +D H
Sbjct: 191 LSSVEEGGETMFPFENGRNMNGRYDYETCIGLRVKPRQGDAIFFYNLLPNRTIDQTSLHG 250
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 251 SCPVIKGEKW 260
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 32/70 (45%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
L+ VE GG T+FP N L V P +G A+F++N N +D H
Sbjct: 191 LSSVEEGGETMFPFENGRNMNGRYDYETCIGLRVKPRQGDAIFFYNLLPNRTIDQTSLHG 250
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 251 SCPVIKGEKW 260
>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
Length = 307
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 47/107 (43%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + LL+ L+DVE GG TIFP N
Sbjct: 194 YEPHFDYFLDEFNTKNG-----GQRIATLLMY-LSDVEEGGETIFPDANVNASSLPWYNE 247
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P+ G A+ +++ LD H GCPV GNKW
Sbjct: 248 LSDCAKRGLSVKPKMGDALLFWSMKPGATLDPLSLHGGCPVIKGNKW 294
Score = 45.1 bits (105), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 33/74 (44%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG TIFP N L+V P+ G A+ + + LD
Sbjct: 221 LSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPGATLDPL 280
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 281 SLHGGCPVIKGNKW 294
>gi|313217217|emb|CBY38368.1| unnamed protein product [Oikopleura dioica]
gi|313239835|emb|CBY17758.1| unnamed protein product [Oikopleura dioica]
Length = 521
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 31/58 (53%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ +++ +GG T+F + P + SAVFWYN + D H+ CPV G KW
Sbjct: 436 LAYMSEPNMGGGTVFIQAGIQARPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGTKW 493
Score = 43.5 bits (101), Expect = 0.025, Method: Composition-based stats.
Identities = 19/55 (34%), Positives = 29/55 (52%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+++ +GG T+F + P + SAVFW+N + D H+ CPV G KW
Sbjct: 439 MSEPNMGGGTVFIQAGIQARPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGTKW 493
>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 296
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + G LL+ L DVE GG T+FP + ++ P++G A+++ + + D H+ P+
Sbjct: 215 GQRMGTLLMY-LKDVEGGGETVFPQVGWSIVPQRGHALYFEYGNRYGMCDPSSLHASTPL 273
Query: 122 ALGNKW 127
G+KW
Sbjct: 274 RTGDKW 279
Score = 44.7 bits (104), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 32/55 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T+FP + ++ P++G A+++ + + D H+ P+ G+KW
Sbjct: 225 LKDVEGGGETVFPQVGWSIVPQRGHALYFEYGNRYGMCDPSSLHASTPLRTGDKW 279
>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 315
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 47/107 (43%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + +L+ L+D+E GG TIFP N
Sbjct: 202 YEPHFDYFLDEFNTKNG-----GQRMATILMY-LSDIEEGGETIFPDANVNSSSLPWYNE 255
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L V P+ G A+ +++ + LD H GCPV GNKW
Sbjct: 256 LSECARKGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKW 302
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 33/74 (44%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+D+E GG TIFP N L V P+ G A+ + + + LD
Sbjct: 229 LSDIEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDPL 288
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 289 SLHGGCPVIKGNKW 302
>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
Length = 293
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 28/49 (57%)
Query: 79 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
GGAT FP + L V P KG+AV + + LD R H+G PV G KW
Sbjct: 230 GGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAGEKW 278
Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 27/49 (55%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GGAT FP + L V P KG+AV + + LD H+G PV G KW
Sbjct: 230 GGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAGEKW 278
>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
Length = 297
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + G LL+ L DVE GG T+FP + +V P +G A+++ + + D H+ P+
Sbjct: 215 GQRVGTLLMY-LNDVEGGGETVFPQVGCSVVPRRGQALYFEYCNRAGVCDPASLHASTPL 273
Query: 122 ALGNKW 127
G KW
Sbjct: 274 RSGEKW 279
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 30/55 (54%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T+FP + +V P +G A+++ + + D H+ P+ G KW
Sbjct: 225 LNDVEGGGETVFPQVGCSVVPRRGQALYFEYCNRAGVCDPASLHASTPLRSGEKW 279
>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
Length = 256
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 43/98 (43%), Gaps = 21/98 (21%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNL------------------- 89
DY +H G + +L L+DVE GG T+FPS +
Sbjct: 149 DY--FHDKVNTKNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGV 206
Query: 90 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+V P KG A+ +++ + LD H GCPV GNKW
Sbjct: 207 SVKPRKGDALLFWSMSPDAELDPFSLHGGCPVIKGNKW 244
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLNL-------------------TVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FPS + +V P KG A+ + + + LD +
Sbjct: 171 LSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGVSVKPRKGDALLFWSMSPDAELDPF 230
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 231 SLHGGCPVIKGNKW 244
>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 316
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 39/98 (39%), Gaps = 20/98 (20%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFP------------------SLNL 89
DY +H + LG +L L++VE GG T+FP
Sbjct: 158 FDY--FHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTECAKQGY 215
Query: 90 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P KG A+ ++N H N D H CPV G KW
Sbjct: 216 AVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKW 253
Score = 42.0 bits (97), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 29/73 (39%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFP------------------SLNLTVFPEKGSAVFWHNAHANTLLDYWM 52
L++VE GG T+FP V P KG A+ + N H N D
Sbjct: 181 LSNVEKGGETVFPMWKGKTTQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNS 240
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 241 LHGSCPVVEGEKW 253
>gi|397567288|gb|EJK45500.1| hypothetical protein THAOC_35883, partial [Thalassiosira oceanica]
Length = 664
Score = 47.8 bits (112), Expect = 0.001, Method: Composition-based stats.
Identities = 27/58 (46%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNA--HANTLLDYRMYHSGCPVALGNKWG 128
L DVE GG T FP L++TV P+KGSA+ W + A D R H PV G K+G
Sbjct: 413 LNDVEEGGGTHFPLLDITVQPKKGSALLWPSVLDEAPESKDGRTDHEALPVIKGLKYG 470
Score = 45.1 bits (105), Expect = 0.011, Method: Composition-based stats.
Identities = 26/60 (43%), Positives = 33/60 (55%), Gaps = 2/60 (3%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNA--HANTLLDYWMYHSGCPVALGNKWG 66
+ L DVE GG T FP L++TV P+KGSA+ W + A D H PV G K+G
Sbjct: 411 IYLNDVEEGGGTHFPLLDITVQPKKGSALLWPSVLDEAPESKDGRTDHEALPVIKGLKYG 470
>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
Length = 316
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 39/98 (39%), Gaps = 20/98 (20%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFP------------------SLNL 89
DY +H + LG +L L++VE GG T+FP
Sbjct: 158 FDY--FHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGY 215
Query: 90 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P KG A+ ++N H N D H CPV G KW
Sbjct: 216 AVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKW 253
Score = 42.0 bits (97), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 29/73 (39%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFP------------------SLNLTVFPEKGSAVFWHNAHANTLLDYWM 52
L++VE GG T+FP V P KG A+ + N H N D
Sbjct: 181 LSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNS 240
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 241 LHGSCPVVEGEKW 253
>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 196
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 33/66 (50%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ G L+L L VE GG T FP L L V P G VF+ N + H+G PV
Sbjct: 120 GNRQGTLILY-LNAVEEGGETAFPELGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPV 178
Query: 122 ALGNKW 127
G KW
Sbjct: 179 RKGEKW 184
Score = 43.5 bits (101), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 29/64 (45%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L VE GG T FP L L V P G VF+ N + H+G PV
Sbjct: 121 NRQGTLILYLNAVEEGGETAFPELGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRK 180
Query: 62 GNKW 65
G KW
Sbjct: 181 GEKW 184
>gi|449284064|gb|EMC90646.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Columba livia]
Length = 174
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L++ L+ V GG+T F NL+V K +A+FW+N N D H+GCPV
Sbjct: 87 GNRIATLMIY-LSAVGAGGSTAFVHANLSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPV 145
Query: 122 ALGNKW 127
G+KW
Sbjct: 146 LAGDKW 151
Score = 44.7 bits (104), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 34/64 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ V GG+T F NL+V K +A+FW N N D H+GCPV
Sbjct: 88 NRIATLMIYLSAVGAGGSTAFVHANLSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLA 147
Query: 62 GNKW 65
G+KW
Sbjct: 148 GDKW 151
>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
Length = 256
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 43/98 (43%), Gaps = 21/98 (21%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNL------------------- 89
DY +H G + +L L+DVE GG T+FPS +
Sbjct: 149 DY--FHDKVNTKNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGV 206
Query: 90 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+V P KG A+ +++ + LD H GCPV GNKW
Sbjct: 207 SVKPRKGDALLFWSMSPDAELDPFSLHGGCPVIKGNKW 244
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLNL-------------------TVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FPS + +V P KG A+ + + + LD +
Sbjct: 171 LSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGVSVKPRKGDALLFWSMSPDAELDPF 230
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 231 SLHGGCPVIKGNKW 244
>gi|298712929|emb|CBJ26831.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 294
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 32/55 (58%), Gaps = 1/55 (1%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV GG T FP L + V P KG AV ++ A + +LD R+ H+ P A+ KW
Sbjct: 219 LNDVPSGGCTAFPQLGMKVQPRKGMAVVFFPATLDGVLDSRLLHAAEP-AIDTKW 272
Score = 43.5 bits (101), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L DV GG T FP L + V P KG AV + A + +LD + H+ P A+ KW
Sbjct: 217 VYLNDVPSGGCTAFPQLGMKVQPRKGMAVVFFPATLDGVLDSRLLHAAEP-AIDTKW 272
>gi|37912909|gb|AAR05245.1| conserved hypothetical protein [uncultured marine proteobacterium
ANT32C12]
Length = 186
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-LDYWMYHSGCPVALGNKWG 66
L DVE GGAT FP ++++V P KG V +HN T ++ H G PV G KW
Sbjct: 119 LNDVEDGGATDFPDIHVSVKPNKGDVVVFHNCKDGTSDINPNSLHGGSPVISGEKWA 175
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 128
L+ L DVE GGAT FP ++++V P KG V ++N T ++ H G PV G KW
Sbjct: 116 LAYLNDVEDGGATDFPDIHVSVKPNKGDVVVFHNCKDGTSDINPNSLHGGSPVISGEKWA 175
>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 316
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 39/98 (39%), Gaps = 20/98 (20%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFP------------------SLNL 89
DY +H + LG +L L++VE GG T+FP
Sbjct: 158 FDY--FHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGY 215
Query: 90 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P KG A+ ++N H N D H CPV G KW
Sbjct: 216 AVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKW 253
Score = 42.0 bits (97), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 29/73 (39%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFP------------------SLNLTVFPEKGSAVFWHNAHANTLLDYWM 52
L++VE GG T+FP V P KG A+ + N H N D
Sbjct: 181 LSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNS 240
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 241 LHGSCPVVEGEKW 253
>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 274
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 32/70 (45%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
L+ VE GG T+FP N L V P +G A+F+YN N +D H
Sbjct: 193 LSSVEEGGETMFPFENGRNMNGRYDYEKCVGLKVKPRQGDAIFFYNLFPNGTIDQTSLHG 252
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 253 SCPVIKGEKW 262
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 32/70 (45%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
L+ VE GG T+FP N L V P +G A+F++N N +D H
Sbjct: 193 LSSVEEGGETMFPFENGRNMNGRYDYEKCVGLKVKPRQGDAIFFYNLFPNGTIDQTSLHG 252
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 253 SCPVIKGEKW 262
>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 332
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 39/98 (39%), Gaps = 20/98 (20%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFP------------------SLNL 89
DY +H + LG +L L++VE GG T+FP
Sbjct: 174 FDY--FHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGY 231
Query: 90 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P KG A+ ++N H N D H CPV G KW
Sbjct: 232 AVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKW 269
Score = 42.4 bits (98), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 29/73 (39%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFP------------------SLNLTVFPEKGSAVFWHNAHANTLLDYWM 52
L++VE GG T+FP V P KG A+ + N H N D
Sbjct: 197 LSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNS 256
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 257 LHGSCPVVEGEKW 269
>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 683
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 32/96 (33%), Positives = 44/96 (45%), Gaps = 23/96 (23%)
Query: 11 LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
L+DVE GG T+FP+ L+V P+ G A+ + + + LD H G
Sbjct: 467 LSDVEEGGETVFPNAKPNINSVSKYPEKGLSVKPKMGDALLFRSMKPDGTLDTSSLHGGS 526
Query: 58 PVALGNKWG----------KLLLSGLTDVELGGATI 83
PV GNKW K+L + L V+ GG TI
Sbjct: 527 PVIRGNKWASTKWLHLTEFKVLGTALPVVDDGGTTI 562
Score = 46.2 bits (108), Expect = 0.005, Method: Composition-based stats.
Identities = 28/85 (32%), Positives = 38/85 (44%), Gaps = 13/85 (15%)
Query: 11 LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
L+DVE GG T+FP L+V P+ G A+ + + + LD H G
Sbjct: 237 LSDVEEGGETVFPDAQPNFTSVSKCSGDGLSVKPKMGDALLFWSMKPDGTLDTSSLHGGS 296
Query: 58 PVALGNKWGKLLLSGLTDVELGGAT 82
PV GNKW L + +L G T
Sbjct: 297 PVIRGNKWASTKWLHLRECKLSGTT 321
Score = 45.4 bits (106), Expect = 0.007, Method: Composition-based stats.
Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 13/73 (17%)
Query: 69 LLSGLTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMY 115
+L L+DVE GG T+FP L+V P+ G A+ +++ + LD
Sbjct: 233 VLMYLSDVEEGGETVFPDAQPNFTSVSKCSGDGLSVKPKMGDALLFWSMKPDGTLDTSSL 292
Query: 116 HSGCPVALGNKWG 128
H G PV GNKW
Sbjct: 293 HGGSPVIRGNKWA 305
Score = 44.3 bits (103), Expect = 0.017, Method: Composition-based stats.
Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 13/73 (17%)
Query: 69 LLSGLTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMY 115
+L L+DVE GG T+FP+ L+V P+ G A+ + + + LD
Sbjct: 463 VLMYLSDVEEGGETVFPNAKPNINSVSKYPEKGLSVKPKMGDALLFRSMKPDGTLDTSSL 522
Query: 116 HSGCPVALGNKWG 128
H G PV GNKW
Sbjct: 523 HGGSPVIRGNKWA 535
>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
Length = 283
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 39/95 (41%), Gaps = 17/95 (17%)
Query: 50 YWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN-----------------LTVF 92
Y +H ALG +L L+D GG T+FPS + V
Sbjct: 130 YDYFHDKYNQALGGHRIATVLMYLSDAVKGGETVFPSSEEDTTVKDDSWSDCAKKGIAVK 189
Query: 93 PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
P KG A+ +Y+ H + D H GCPV G KW
Sbjct: 190 PRKGDALLFYSLHPDATPDESSLHGGCPVIEGEKW 224
Score = 42.0 bits (97), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 31/72 (43%), Gaps = 17/72 (23%)
Query: 11 LTDVELGGATIFPSLN-----------------LTVFPEKGSAVFWHNAHANTLLDYWMY 53
L+D GG T+FPS + V P KG A+ +++ H + D
Sbjct: 153 LSDAVKGGETVFPSSEEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYSLHPDATPDESSL 212
Query: 54 HSGCPVALGNKW 65
H GCPV G KW
Sbjct: 213 HGGCPVIEGEKW 224
>gi|321466285|gb|EFX77281.1| hypothetical protein DAPPUDRAFT_106233 [Daphnia pulex]
Length = 128
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 35/66 (53%), Gaps = 3/66 (4%)
Query: 65 WGKLLLSGL---TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
WG + + L +DVE G AT+FP V+P KGSA F YN + N + H+ PV
Sbjct: 38 WGNRIATCLFYMSDVEAGSATVFPPTGAAVWPRKGSAAFCYNLYPNDKGNEFTRHATFPV 97
Query: 122 ALGNKW 127
+KW
Sbjct: 98 IFLSKW 103
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR ++DVE G AT+FP V+P KGSA F +N + N + + H+ PV
Sbjct: 40 NRIATCLFYMSDVEAGSATVFPPTGAAVWPRKGSAAFCYNLYPNDKGNEFTRHATFPVIF 99
Query: 62 GNKW 65
+KW
Sbjct: 100 LSKW 103
>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
gi|255641119|gb|ACU20838.1| unknown [Glycine max]
Length = 297
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 38/90 (42%), Gaps = 21/90 (23%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLN---------------------LTVFPEKGS 97
+A G +L LTDV GG T+FPS + V P +G
Sbjct: 147 IARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAVKPRRGD 206
Query: 98 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
A+ +++ H N D H+GCPV G KW
Sbjct: 207 ALLFFSLHTNATPDTSSLHAGCPVIEGEKW 236
Score = 44.7 bits (104), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 32/76 (42%), Gaps = 21/76 (27%)
Query: 11 LTDVELGGATIFPSLN---------------------LTVFPEKGSAVFWHNAHANTLLD 49
LTDV GG T+FPS + V P +G A+ + + H N D
Sbjct: 161 LTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAVKPRRGDALLFFSLHTNATPD 220
Query: 50 YWMYHSGCPVALGNKW 65
H+GCPV G KW
Sbjct: 221 TSSLHAGCPVIEGEKW 236
>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
Arabidopsis thaliana
gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
thaliana]
gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 291
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 73 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYR 113
L+DV+ GG T+FP+ L+V P+K A+ ++N + LD
Sbjct: 205 LSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASLDPS 264
Query: 114 MYHSGCPVALGNKW 127
H GCPV GNKW
Sbjct: 265 SLHGGCPVVKGNKW 278
Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DV+ GG T+FP+ L+V P+K A+ + N + LD
Sbjct: 205 LSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASLDPS 264
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 265 SLHGGCPVVKGNKW 278
>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 263
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 43/97 (44%), Gaps = 19/97 (19%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN-----------------LT 90
DY +H ALG +L L+DV GG T+FPS +
Sbjct: 108 FDY--FHDKNNQALGGHRIATVLMYLSDVVKGGETVFPSSEDRGGPKDDSWSACGKTGVA 165
Query: 91 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P KG A+ +++ H + + D H+GCPV G KW
Sbjct: 166 VKPRKGDALLFFSLHPSAVPDESSLHTGCPVIEGEKW 202
Score = 42.7 bits (99), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 33/72 (45%), Gaps = 17/72 (23%)
Query: 11 LTDVELGGATIFPSLN-----------------LTVFPEKGSAVFWHNAHANTLLDYWMY 53
L+DV GG T+FPS + V P KG A+ + + H + + D
Sbjct: 131 LSDVVKGGETVFPSSEDRGGPKDDSWSACGKTGVAVKPRKGDALLFFSLHPSAVPDESSL 190
Query: 54 HSGCPVALGNKW 65
H+GCPV G KW
Sbjct: 191 HTGCPVIEGEKW 202
>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 306
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 34/73 (46%), Gaps = 18/73 (24%)
Query: 73 LTDVELGGATIFPSLNL------------------TVFPEKGSAVFWYNAHANTLLDYRM 114
L+DV++GG T+FP+ + V P KG AV +++ H N D
Sbjct: 166 LSDVKMGGETVFPNSDAKTLQPKDDTQSECARRGYAVKPVKGDAVLFFSLHPNGTTDRDS 225
Query: 115 YHSGCPVALGNKW 127
H GCPV G KW
Sbjct: 226 LHGGCPVIEGEKW 238
Score = 45.1 bits (105), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 33/73 (45%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSLNL------------------TVFPEKGSAVFWHNAHANTLLDYWM 52
L+DV++GG T+FP+ + V P KG AV + + H N D
Sbjct: 166 LSDVKMGGETVFPNSDAKTLQPKDDTQSECARRGYAVKPVKGDAVLFFSLHPNGTTDRDS 225
Query: 53 YHSGCPVALGNKW 65
H GCPV G KW
Sbjct: 226 LHGGCPVIEGEKW 238
>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 289
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 49/107 (45%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ AH + LD + +G G + LL+ L+DVE GG T+FP+
Sbjct: 176 YEAHYDYFLDEFNTKNG-----GQRTATLLMY-LSDVEEGGETVFPAAKANISNVPSWNE 229
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P+ G+A+ +++ + LD H CPV GNKW
Sbjct: 230 LSECARQGLSVKPKMGNALLFWSTRPDATLDPASLHGSCPVIRGNKW 276
Score = 42.4 bits (98), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ L+V P+ G+A+ + + + LD
Sbjct: 203 LSDVEEGGETVFPAAKANISNVPSWNELSECARQGLSVKPKMGNALLFWSTRPDATLDPA 262
Query: 52 MYHSGCPVALGNKW 65
H CPV GNKW
Sbjct: 263 SLHGSCPVIRGNKW 276
>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
Length = 302
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 7/68 (10%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLDYRMYHSGC 119
G + G L++ L E GG T FP ++L V P++G+AVF + H +T R H G
Sbjct: 228 GQRVGTLVMY-LNTPEKGGGTTFPDVHLEVAPQRGNAVFFSYERPHPST----RTLHGGA 282
Query: 120 PVALGNKW 127
PV G KW
Sbjct: 283 PVIAGEKW 290
Score = 43.5 bits (101), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 30/57 (52%), Gaps = 6/57 (10%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
L E GG T FP ++L V P++G+AVF + H +T H G PV G KW
Sbjct: 238 LNTPEKGGGTTFPDVHLEVAPQRGNAVFFSYERPHPSTR----TLHGGAPVIAGEKW 290
>gi|332140647|ref|YP_004426385.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
gi|327550669|gb|AEA97387.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
Length = 376
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 34/66 (51%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + K L L + GG T+FP L++ + P+ G+ V + N+ N L YH+G P
Sbjct: 287 GKQRIKTALVYLNTISEGGETLFPKLDIRIAPKSGTMVVFSNSDENGKLLLNSYHAGAPT 346
Query: 122 ALGNKW 127
NKW
Sbjct: 347 VSENKW 352
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 34/64 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
R + + V L + GG T+FP L++ + P+ G+ V + N+ N L YH+G P
Sbjct: 289 QRIKTALVYLNTISEGGETLFPKLDIRIAPKSGTMVVFSNSDENGKLLLNSYHAGAPTVS 348
Query: 62 GNKW 65
NKW
Sbjct: 349 ENKW 352
>gi|219126281|ref|XP_002183389.1| hypothetical protein PHATRDRAFT_48891 [Phaeodactylum tricornutum
CCAP 1055/1]
gi|217405145|gb|EEC45089.1| hypothetical protein PHATRDRAFT_48891 [Phaeodactylum tricornutum
CCAP 1055/1]
Length = 427
Score = 47.4 bits (111), Expect = 0.002, Method: Composition-based stats.
Identities = 27/58 (46%), Positives = 32/58 (55%), Gaps = 2/58 (3%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN--TLLDYRMYHSGCPVALGNKWG 128
L+DVE GG T FP L LTV P+KG AV W + + D R H PV G K+G
Sbjct: 352 LSDVEEGGGTNFPKLGLTVTPKKGRAVLWPSVLDDEPNQKDARSDHQALPVIKGVKYG 409
Score = 44.3 bits (103), Expect = 0.015, Method: Composition-based stats.
Identities = 26/58 (44%), Positives = 31/58 (53%), Gaps = 2/58 (3%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHAN--TLLDYWMYHSGCPVALGNKWG 66
L+DVE GG T FP L LTV P+KG AV W + + D H PV G K+G
Sbjct: 352 LSDVEEGGGTNFPKLGLTVTPKKGRAVLWPSVLDDEPNQKDARSDHQALPVIKGVKYG 409
>gi|347527209|ref|YP_004833956.1| putative oxygenase [Sphingobium sp. SYK-6]
gi|345135890|dbj|BAK65499.1| putative oxygenase [Sphingobium sp. SYK-6]
Length = 229
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 42/88 (47%), Gaps = 12/88 (13%)
Query: 52 MYHSGCPVALGN--KWGKLLLSG----------LTDVELGGATIFPSLNLTVFPEKGSAV 99
YH C GN W + +SG L DVE GG T FP L LTV P G+ +
Sbjct: 125 QYHLHCDYFPGNVHYWPAMRVSGGQRCWTAMAYLCDVEEGGETHFPRLGLTVPPRAGTLL 184
Query: 100 FWYNAHANTLLDYRMYHSGCPVALGNKW 127
W N +A+ ++ H+ PV+ G K+
Sbjct: 185 MWNNMNADGSPNHDTLHAALPVSRGVKY 212
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 33/55 (60%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T FP L LTV P G+ + W+N +A+ ++ H+ PV+ G K+
Sbjct: 158 LCDVEEGGETHFPRLGLTVPPRAGTLLMWNNMNADGSPNHDTLHAALPVSRGVKY 212
>gi|410860761|ref|YP_006975995.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii AltDE1]
gi|410818023|gb|AFV84640.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii AltDE1]
Length = 376
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 34/66 (51%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + K L L + GG T+FP L++ + P+ G+ V + N+ N L YH+G P
Sbjct: 287 GKQRIKTALVYLNTINEGGETLFPKLDIRIAPKSGTMVVFSNSDENGKLLLNSYHAGAPT 346
Query: 122 ALGNKW 127
NKW
Sbjct: 347 VSENKW 352
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 34/64 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
R + + V L + GG T+FP L++ + P+ G+ V + N+ N L YH+G P
Sbjct: 289 QRIKTALVYLNTINEGGETLFPKLDIRIAPKSGTMVVFSNSDENGKLLLNSYHAGAPTVS 348
Query: 62 GNKW 65
NKW
Sbjct: 349 ENKW 352
>gi|308497208|ref|XP_003110791.1| CRE-DPY-18 protein [Caenorhabditis remanei]
gi|308242671|gb|EFO86623.1| CRE-DPY-18 protein [Caenorhabditis remanei]
Length = 559
Score = 47.4 bits (111), Expect = 0.002, Method: Composition-based stats.
Identities = 23/66 (34%), Positives = 33/66 (50%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++ GG T+F + TV P K A+FWYN + H+ CPV
Sbjct: 440 GNRIATVLFY-MSQPSHGGGTVFTEVKSTVLPTKNDALFWYNLFKQGDGNPDTRHAACPV 498
Query: 122 ALGNKW 127
+G KW
Sbjct: 499 LVGIKW 504
Score = 43.9 bits (102), Expect = 0.019, Method: Composition-based stats.
Identities = 19/49 (38%), Positives = 26/49 (53%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GG T+F + TV P K A+FW+N + H+ CPV +G KW
Sbjct: 456 GGGTVFTEVKSTVLPTKNDALFWYNLFKQGDGNPDTRHAACPVLVGIKW 504
>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 291
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 73 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYR 113
L+DV+ GG T+FP+ L+V P+K A+ ++N + LD
Sbjct: 205 LSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASLDPS 264
Query: 114 MYHSGCPVALGNKW 127
H GCPV GNKW
Sbjct: 265 SLHGGCPVVKGNKW 278
Score = 44.7 bits (104), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DV+ GG T+FP+ L+V P+K A+ + N + LD
Sbjct: 205 LSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASLDPS 264
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 265 SLHGGCPVVKGNKW 278
>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
Length = 313
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 18/83 (21%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSL-----------------NLTVFPEKGSAVFWYNA 104
GN++ +L+ L VE GG T+FP++ +L P KGSAV +++
Sbjct: 145 GNRYATVLMY-LNTVEEGGETVFPNIPAPGGDNGPTFTECARRHLAAKPTKGSAVLFHSI 203
Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
+ L+ R H+ CPV G KW
Sbjct: 204 KPSGDLERRSLHTACPVVKGEKW 226
Score = 43.1 bits (100), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 33/72 (45%), Gaps = 17/72 (23%)
Query: 11 LTDVELGGATIFPSL-----------------NLTVFPEKGSAVFWHNAHANTLLDYWMY 53
L VE GG T+FP++ +L P KGSAV +H+ + L+
Sbjct: 155 LNTVEEGGETVFPNIPAPGGDNGPTFTECARRHLAAKPTKGSAVLFHSIKPSGDLERRSL 214
Query: 54 HSGCPVALGNKW 65
H+ CPV G KW
Sbjct: 215 HTACPVVKGEKW 226
>gi|114799222|ref|YP_760562.1| 2OG-Fe(II) oxygenase [Hyphomonas neptunium ATCC 15444]
gi|114739396|gb|ABI77521.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Hyphomonas neptunium
ATCC 15444]
Length = 298
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 32/57 (56%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ L DVE GG T F L T+ PEKG AV W+N + + ++ + H G V G K+
Sbjct: 219 IYLNDVEEGGGTRFRRLEKTIMPEKGKAVIWNNLNPDGSVNPYTIHHGMKVRSGAKY 275
Score = 44.7 bits (104), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 34/75 (45%), Gaps = 2/75 (2%)
Query: 53 YHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 112
Y C W ++ L DVE GG T F L T+ PEKG AV W N + + ++
Sbjct: 203 YQVHCQFTGQRTWTFMIY--LNDVEEGGGTRFRRLEKTIMPEKGKAVIWNNLNPDGSVNP 260
Query: 113 RMYHSGCPVALGNKW 127
H G V G K+
Sbjct: 261 YTIHHGMKVRSGAKY 275
>gi|37496185|emb|CAE47803.1| Prolyl 4-hydroxylase alpha subunit [Sus scrofa]
Length = 263
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 26/31 (83%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYN 103
++DV GGAT+FP + +V+P+KG+AVFWYN
Sbjct: 226 MSDVSAGGATVFPEVGASVWPKKGTAVFWYN 256
Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 26/31 (83%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHN 41
++DV GGAT+FP + +V+P+KG+AVFW+N
Sbjct: 226 MSDVSAGGATVFPEVGASVWPKKGTAVFWYN 256
>gi|313243209|emb|CBY39868.1| unnamed protein product [Oikopleura dioica]
Length = 430
Score = 47.4 bits (111), Expect = 0.002, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 31/58 (53%)
Query: 70 LSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+ +++ +GG T+F + P + SAVFWYN + D H+ CPV G KW
Sbjct: 345 LAYMSEPNMGGGTVFIQAGIQARPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGTKW 402
Score = 43.1 bits (100), Expect = 0.035, Method: Composition-based stats.
Identities = 19/55 (34%), Positives = 29/55 (52%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+++ +GG T+F + P + SAVFW+N + D H+ CPV G KW
Sbjct: 348 MSEPNMGGGTVFIQAGIQARPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGTKW 402
>gi|156083831|ref|XP_001609399.1| prolyl 4-hydroxylase alpha-related protein PH4 [Babesia bovis T2Bo]
gi|154796650|gb|EDO05831.1| prolyl 4-hydroxylase alpha-related protein PH4, putative [Babesia
bovis]
Length = 241
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 35/57 (61%), Gaps = 3/57 (5%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG-CPVALGNKWG 128
L DV+ GG T+FP L + V P SA++W N + + D+RM H+G CP + G K+
Sbjct: 174 LNDVD-GGETVFPELKIAVRPVGNSALYWKNTTEDGVADFRMVHAGVCPKS-GTKYA 228
Score = 44.7 bits (104), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 34/57 (59%), Gaps = 3/57 (5%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSG-CPVALGNKWG 66
L DV+ GG T+FP L + V P SA++W N + + D+ M H+G CP + G K+
Sbjct: 174 LNDVD-GGETVFPELKIAVRPVGNSALYWKNTTEDGVADFRMVHAGVCPKS-GTKYA 228
>gi|607947|gb|AAA62207.1| prolyl 4-hydroxylase alpha subunit [Caenorhabditis elegans]
Length = 558
Score = 47.0 bits (110), Expect = 0.002, Method: Composition-based stats.
Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++ GG T+F T+ P K A+FWYN + + H+ CPV
Sbjct: 439 GNRIATVLFY-MSQPSHGGGTVFTEAKSTILPTKNDALFWYNLYKQGDGNPDTRHAACPV 497
Query: 122 ALGNKW 127
+G KW
Sbjct: 498 LVGIKW 503
Score = 43.9 bits (102), Expect = 0.023, Method: Composition-based stats.
Identities = 18/49 (36%), Positives = 26/49 (53%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GG T+F T+ P K A+FW+N + + H+ CPV +G KW
Sbjct: 455 GGGTVFTEAKSTILPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 503
>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
Length = 297
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 41/98 (41%), Gaps = 20/98 (20%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------L 89
DY +H V G + +L LTDV+ GG T+FP+ L
Sbjct: 137 FDY--FHDKNNVKHGGQRFATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGL 194
Query: 90 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P+KG A+ ++ H N D H CPV G KW
Sbjct: 195 AVKPKKGDALLFFGLHLNATTDTSSLHGSCPVIEGEKW 232
Score = 42.7 bits (99), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 31/73 (42%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
LTDV+ GG T+FP+ L V P+KG A+ + H N D
Sbjct: 160 LTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFGLHLNATTDTSS 219
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 220 LHGSCPVIEGEKW 232
>gi|17552840|ref|NP_499464.1| Protein DPY-18 [Caenorhabditis elegans]
gi|20455505|sp|Q10576.2|P4HA1_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; AltName: Full=Protein dumpy-18; Flags:
Precursor
gi|3881011|emb|CAA21045.1| Protein DPY-18 [Caenorhabditis elegans]
gi|6900013|emb|CAB71298.1| prolyl 4-hydroxylase alpha subunit 1 [Caenorhabditis elegans]
Length = 559
Score = 47.0 bits (110), Expect = 0.002, Method: Composition-based stats.
Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L ++ GG T+F T+ P K A+FWYN + + H+ CPV
Sbjct: 440 GNRIATVLFY-MSQPSHGGGTVFTEAKSTILPTKNDALFWYNLYKQGDGNPDTRHAACPV 498
Query: 122 ALGNKW 127
+G KW
Sbjct: 499 LVGIKW 504
Score = 43.9 bits (102), Expect = 0.023, Method: Composition-based stats.
Identities = 18/49 (36%), Positives = 26/49 (53%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GG T+F T+ P K A+FW+N + + H+ CPV +G KW
Sbjct: 456 GGGTVFTEAKSTILPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 504
>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
Length = 298
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 43/111 (38%), Gaps = 23/111 (20%)
Query: 38 FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNL-------- 89
+ H + DY +H + G +L L++V GG T+FP +
Sbjct: 129 YEHGQKYDAHFDY--FHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSE 186
Query: 90 -------------TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P KG A+ ++N H + + D H GCPV G KW
Sbjct: 187 NEEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKW 237
Score = 41.6 bits (96), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 31/76 (40%), Gaps = 21/76 (27%)
Query: 11 LTDVELGGATIFPSLNL---------------------TVFPEKGSAVFWHNAHANTLLD 49
L++V GG T+FP + V P KG A+ + N H + + D
Sbjct: 162 LSNVTKGGETVFPDAEIPSRRVLSENEEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPD 221
Query: 50 YWMYHSGCPVALGNKW 65
H GCPV G KW
Sbjct: 222 PLSLHGGCPVIEGEKW 237
>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
Length = 1062
Score = 47.0 bits (110), Expect = 0.002, Method: Composition-based stats.
Identities = 26/77 (33%), Positives = 35/77 (45%), Gaps = 18/77 (23%)
Query: 69 LLSGLTDVELGGATIFPSL------------------NLTVFPEKGSAVFWYNAHANTLL 110
+L LTDV+ GG T+FP+ L V P+KG A+ +++ H N
Sbjct: 158 VLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNATT 217
Query: 111 DYRMYHSGCPVALGNKW 127
D H CPV G KW
Sbjct: 218 DPASLHGSCPVIEGEKW 234
Score = 44.7 bits (104), Expect = 0.011, Method: Composition-based stats.
Identities = 25/73 (34%), Positives = 32/73 (43%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSL------------------NLTVFPEKGSAVFWHNAHANTLLDYWM 52
LTDV+ GG T+FP+ L V P+KG A+ + + H N D
Sbjct: 162 LTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNATTDPAS 221
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 222 LHGSCPVIEGEKW 234
>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
Length = 303
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 7/68 (10%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLDYRMYHSGC 119
G + G L++ L + GG T FP ++L V P++G+AVF + H +T R H G
Sbjct: 229 GQRVGTLVMY-LNTPDKGGGTTFPDVHLEVAPQRGNAVFFSYERPHPST----RTLHGGA 283
Query: 120 PVALGNKW 127
PV G+KW
Sbjct: 284 PVIAGDKW 291
Score = 43.1 bits (100), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 29/51 (56%), Gaps = 6/51 (11%)
Query: 17 GGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
GG T FP ++L V P++G+AVF + H +T H G PV G+KW
Sbjct: 245 GGGTTFPDVHLEVAPQRGNAVFFSYERPHPSTR----TLHGGAPVIAGDKW 291
>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 298
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 43/111 (38%), Gaps = 23/111 (20%)
Query: 38 FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNL-------- 89
+ H + DY +H + G +L L++V GG T+FP +
Sbjct: 129 YEHGQKYDAHFDY--FHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSE 186
Query: 90 -------------TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P KG A+ ++N H + + D H GCPV G KW
Sbjct: 187 NKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKW 237
Score = 41.6 bits (96), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 31/76 (40%), Gaps = 21/76 (27%)
Query: 11 LTDVELGGATIFPSLNL---------------------TVFPEKGSAVFWHNAHANTLLD 49
L++V GG T+FP + V P KG A+ + N H + + D
Sbjct: 162 LSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPD 221
Query: 50 YWMYHSGCPVALGNKW 65
H GCPV G KW
Sbjct: 222 PLSLHGGCPVIEGEKW 237
>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 294
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 37/90 (41%), Gaps = 21/90 (23%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLN---------------------LTVFPEKGS 97
+A G +L LT+VE GG T+FP + V P KG
Sbjct: 144 IARGGHRLATVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKKGIPVKPRKGD 203
Query: 98 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
A+ +Y+ H N D H GCPV G KW
Sbjct: 204 ALLFYSLHPNATPDPLSLHGGCPVIQGEKW 233
Score = 45.1 bits (105), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 32/76 (42%), Gaps = 21/76 (27%)
Query: 11 LTDVELGGATIFPSLN---------------------LTVFPEKGSAVFWHNAHANTLLD 49
LT+VE GG T+FP + V P KG A+ +++ H N D
Sbjct: 158 LTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKKGIPVKPRKGDALLFYSLHPNATPD 217
Query: 50 YWMYHSGCPVALGNKW 65
H GCPV G KW
Sbjct: 218 PLSLHGGCPVIQGEKW 233
>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
Length = 302
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 39/89 (43%), Gaps = 19/89 (21%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAV 99
V G + G +L L+DVE GG T+FP L+V P+ G A+
Sbjct: 202 VKNGGQRGATMLMYLSDVEKGGETVFPQSKVNSSSVPWWDELSECGRSGLSVRPKMGDAL 261
Query: 100 FWYNAHANTLLDYRMYHSGCPVALGNKWG 128
+++ + LD H CPV GNKW
Sbjct: 262 LFWSVKPDASLDPSSLHGSCPVIQGNKWS 290
Score = 40.8 bits (94), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 32/75 (42%), Gaps = 19/75 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP L+V P+ G A+ + + + LD
Sbjct: 216 LSDVEKGGETVFPQSKVNSSSVPWWDELSECGRSGLSVRPKMGDALLFWSVKPDASLDPS 275
Query: 52 MYHSGCPVALGNKWG 66
H CPV GNKW
Sbjct: 276 SLHGSCPVIQGNKWS 290
>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
++AH + +D + G G + LL+ L+DVE GG T+FP+
Sbjct: 172 YDAHYDYFVDEYNIKKG-----GQRMATLLMY-LSDVEEGGETVFPAAKGNFSSVPWWNE 225
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P+ G A+ +++ + LD H CPV GNKW
Sbjct: 226 LSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKW 272
Score = 40.8 bits (94), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 33/74 (44%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ L+V P+ G A+ + + + LD
Sbjct: 199 LSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPT 258
Query: 52 MYHSGCPVALGNKW 65
H CPV GNKW
Sbjct: 259 SLHGACPVIRGNKW 272
>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
++AH + +D + G G + LL+ L+DVE GG T+FP+
Sbjct: 172 YDAHYDYFVDEYNIKKG-----GQRMATLLMY-LSDVEEGGETVFPAAKGNFSSVPWWNE 225
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P+ G A+ +++ + LD H CPV GNKW
Sbjct: 226 LSKCGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKW 272
Score = 40.8 bits (94), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 33/74 (44%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ L+V P+ G A+ + + + LD
Sbjct: 199 LSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVKPKMGDALLFWSMKPDATLDPT 258
Query: 52 MYHSGCPVALGNKW 65
H CPV GNKW
Sbjct: 259 SLHGACPVIRGNKW 272
>gi|193209070|ref|NP_001123049.1| Protein PHY-4, isoform b [Caenorhabditis elegans]
gi|172051527|emb|CAQ35068.1| Protein PHY-4, isoform b [Caenorhabditis elegans]
Length = 282
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L+ L + GG T+FP LNL + P+ G + W N + + + H+ CP+
Sbjct: 183 GNRIATVLVI-LQIAKKGGTTVFPRLNLNIRPKAGDVIVWLNTLSTGESNSQTLHAACPI 241
Query: 122 ALGNKWGKLL 131
G K G L
Sbjct: 242 HEGTKIGATL 251
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 32/68 (47%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L + GG T+FP LNL + P+ G + W N + + H+ CP+
Sbjct: 184 NRIATVLVILQIAKKGGTTVFPRLNLNIRPKAGDVIVWLNTLSTGESNSQTLHAACPIHE 243
Query: 62 GNKWGKLL 69
G K G L
Sbjct: 244 GTKIGATL 251
>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 293
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 40/98 (40%), Gaps = 20/98 (20%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------L 89
DY +H LG +L LTDV+ GG T+FP+ L
Sbjct: 133 FDY--FHDKNNQKLGGHRVATVLMYLTDVKKGGETVFPNAEGRHLQHKDETWSECARSGL 190
Query: 90 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P KG A+ +++ H N D H CPV G KW
Sbjct: 191 AVKPRKGDALLFFSLHINATTDPSSLHGSCPVIEGEKW 228
Score = 43.1 bits (100), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 31/73 (42%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
LTDV+ GG T+FP+ L V P KG A+ + + H N D
Sbjct: 156 LTDVKKGGETVFPNAEGRHLQHKDETWSECARSGLAVKPRKGDALLFFSLHINATTDPSS 215
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 216 LHGSCPVIEGEKW 228
>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 319
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 41/98 (41%), Gaps = 20/98 (20%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------L 89
DY +H +G +L L+DVE GG TIFP+
Sbjct: 159 FDY--FHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGY 216
Query: 90 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P+KG A+ +++ H + D + H CPV G KW
Sbjct: 217 AVKPQKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKW 254
Score = 42.4 bits (98), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 35/85 (41%), Gaps = 18/85 (21%)
Query: 11 LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
L+DVE GG TIFP+ V P+KG A+ + + H + D
Sbjct: 182 LSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGYAVKPQKGDALLFFSLHLDASTDTKS 241
Query: 53 YHSGCPVALGNKWGKLLLSGLTDVE 77
H CPV G KW ++D E
Sbjct: 242 LHGSCPVIEGEKWSATKWIHVSDFE 266
>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 311
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 47/121 (38%), Gaps = 21/121 (17%)
Query: 28 TVFPEKG--SAVFWHNAHANTLLDYWMY-HSGCPVALGNKWGKLLLSGLTDVELGGATIF 84
T PE+ S H H ++ Y H LG +L L++VE GG T+F
Sbjct: 126 TFLPEENGESMQILHYEHGQKYEPHFDYFHDKANQELGGHRVATVLMYLSNVEKGGETVF 185
Query: 85 PSLN------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 126
P+ V PEKG A+ +++ H + D H CPV G K
Sbjct: 186 PNAEGKLSQPKEDSWSDCAKGGYAVKPEKGDALLFFSLHPDATTDSDSLHGSCPVIEGEK 245
Query: 127 W 127
W
Sbjct: 246 W 246
Score = 40.8 bits (94), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 31/73 (42%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
L++VE GG T+FP+ V PEKG A+ + + H + D
Sbjct: 174 LSNVEKGGETVFPNAEGKLSQPKEDSWSDCAKGGYAVKPEKGDALLFFSLHPDATTDSDS 233
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 234 LHGSCPVIEGEKW 246
>gi|449485593|ref|XP_004175686.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3 [Taeniopygia guttata]
Length = 567
Score = 47.0 bits (110), Expect = 0.003, Method: Composition-based stats.
Identities = 25/66 (37%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +++ L+ VE GG+T N +V K +A+FW+N N D H+GCPV
Sbjct: 480 GNRNATVMIY-LSAVEAGGSTALIYTNFSVPVVKNAALFWWNLRRNGNGDGDTLHAGCPV 538
Query: 122 ALGNKW 127
G+KW
Sbjct: 539 LAGDKW 544
Score = 46.2 bits (108), Expect = 0.004, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 33/64 (51%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L+ VE GG+T N +V K +A+FW N N D H+GCPV
Sbjct: 481 NRNATVMIYLSAVEAGGSTALIYTNFSVPVVKNAALFWWNLRRNGNGDGDTLHAGCPVLA 540
Query: 62 GNKW 65
G+KW
Sbjct: 541 GDKW 544
>gi|341890029|gb|EGT45964.1| hypothetical protein CAEBREN_03684 [Caenorhabditis brenneri]
Length = 293
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 32/63 (50%), Gaps = 1/63 (1%)
Query: 64 KWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 123
+W + L+S + +GG T+FP L + G A+ W+N N L+ H GCPV
Sbjct: 201 EWQRFLIS-IKAANVGGGTVFPYLGTVARTKPGDAILWFNMKQNRELEPLSLHGGCPVRS 259
Query: 124 GNK 126
G K
Sbjct: 260 GEK 262
Score = 44.3 bits (103), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 32/73 (43%), Gaps = 5/73 (6%)
Query: 16 LGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTD 75
+GG T+FP L + G A+ W N N L+ H GCPV G K +
Sbjct: 214 VGGGTVFPYLGTVARTKPGDAILWFNMKQNRELEPLSLHGGCPVRSGEK-----IVAAIS 268
Query: 76 VELGGATIFPSLN 88
VE G +F + +
Sbjct: 269 VERKGQELFKTFD 281
>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 39/90 (43%), Gaps = 21/90 (23%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLN---------------------LTVFPEKGS 97
+A G +L LTDVE GG T+FPS + V P +G
Sbjct: 150 IARGGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKPRRGD 209
Query: 98 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
A+ +++ + + D H+GCPV G KW
Sbjct: 210 ALLFFSLYPTAVPDTSSIHAGCPVIEGEKW 239
Score = 43.9 bits (102), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 33/76 (43%), Gaps = 21/76 (27%)
Query: 11 LTDVELGGATIFPSLN---------------------LTVFPEKGSAVFWHNAHANTLLD 49
LTDVE GG T+FPS + V P +G A+ + + + + D
Sbjct: 164 LTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKPRRGDALLFFSLYPTAVPD 223
Query: 50 YWMYHSGCPVALGNKW 65
H+GCPV G KW
Sbjct: 224 TSSIHAGCPVIEGEKW 239
>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 347
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
LTDVE GG T+FP N L V P KG + +Y+ N +D H
Sbjct: 267 LTDVEEGGETMFPYENGDNMNIGYDYEQCIGLKVKPRKGDGLLFYSLMVNGTIDPTSLHG 326
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 327 SCPVVRGEKW 336
Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
LTDVE GG T+FP N L V P KG + +++ N +D H
Sbjct: 267 LTDVEEGGETMFPYENGDNMNIGYDYEQCIGLKVKPRKGDGLLFYSLMVNGTIDPTSLHG 326
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 327 SCPVVRGEKW 336
>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
Length = 244
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 46/95 (48%), Gaps = 18/95 (18%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSL----------------NLTVF 92
DY+ + G GN++ +L+ L D E GG T+FP++ +L
Sbjct: 95 DYFFHKDGIANG-GNRYATVLMY-LVDTEEGGETVFPNIAAPGGENVGFSECARYHLAAK 152
Query: 93 PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
P+KG+A+ +++ L+ + H+ CPV G KW
Sbjct: 153 PKKGTAILFHSIKPTGELERKSLHTACPVIKGIKW 187
Score = 42.7 bits (99), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 16/71 (22%)
Query: 11 LTDVELGGATIFPSL----------------NLTVFPEKGSAVFWHNAHANTLLDYWMYH 54
L D E GG T+FP++ +L P+KG+A+ +H+ L+ H
Sbjct: 117 LVDTEEGGETVFPNIAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERKSLH 176
Query: 55 SGCPVALGNKW 65
+ CPV G KW
Sbjct: 177 TACPVIKGIKW 187
>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 249
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 20/86 (23%)
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 101
+G + LL+ L+DVE GG T+FP+ L+V P+ G A+ +
Sbjct: 153 IGQRMATLLMY-LSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVKPKMGDALLF 211
Query: 102 YNAHANTLLDYRMYHSGCPVALGNKW 127
++ +T LD H CPV GNKW
Sbjct: 212 WSMKPDTTLDPTSLHGACPVIRGNKW 237
Score = 42.7 bits (99), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ L+V P+ G A+ + + +T LD
Sbjct: 164 LSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVKPKMGDALLFWSMKPDTTLDPT 223
Query: 52 MYHSGCPVALGNKW 65
H CPV GNKW
Sbjct: 224 SLHGACPVIRGNKW 237
>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 296
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + G LL+ L DVE GG T+FP +V P +G A+++ + L D H+ P+
Sbjct: 215 GQRVGTLLMY-LNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPL 273
Query: 122 ALGNKW 127
+G KW
Sbjct: 274 RVGEKW 279
Score = 44.7 bits (104), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 30/55 (54%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T+FP +V P +G A+++ + L D H+ P+ +G KW
Sbjct: 225 LNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLRVGEKW 279
>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 288
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
L+DVE GG T+FP N L V P KG + +Y+ N +D H
Sbjct: 206 LSDVEEGGETMFPFENGSNMGTGYDYKQCIGLKVKPRKGDGLLFYSVFPNGTIDQTSLHG 265
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 266 SCPVTKGEKW 275
Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
L+DVE GG T+FP N L V P KG + +++ N +D H
Sbjct: 206 LSDVEEGGETMFPFENGSNMGTGYDYKQCIGLKVKPRKGDGLLFYSVFPNGTIDQTSLHG 265
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 266 SCPVTKGEKW 275
>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Glycine max]
Length = 297
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 39/86 (45%), Gaps = 17/86 (19%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNL-----------------TVFPEKGSAVFW 101
+A G +L LTDV GG T+FP+ L V P +G A+ +
Sbjct: 151 IARGGHRVATVLMYLTDVTKGGETVFPNAELKSSETKEDLSECAQKGIAVKPRRGDALLF 210
Query: 102 YNAHANTLLDYRMYHSGCPVALGNKW 127
++ + N + D H+GCPV G KW
Sbjct: 211 FSLYPNAIPDTMSLHAGCPVIEGEKW 236
Score = 44.3 bits (103), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 33/72 (45%), Gaps = 17/72 (23%)
Query: 11 LTDVELGGATIFPSLNL-----------------TVFPEKGSAVFWHNAHANTLLDYWMY 53
LTDV GG T+FP+ L V P +G A+ + + + N + D
Sbjct: 165 LTDVTKGGETVFPNAELKSSETKEDLSECAQKGIAVKPRRGDALLFFSLYPNAIPDTMSL 224
Query: 54 HSGCPVALGNKW 65
H+GCPV G KW
Sbjct: 225 HAGCPVIEGEKW 236
>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
Length = 288
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
L+DVE GG T+FP N L V P KG + +Y+ N +D H
Sbjct: 206 LSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYSVFPNGTIDQTSLHG 265
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 266 SCPVTKGEKW 275
Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
L+DVE GG T+FP N L V P KG + +++ N +D H
Sbjct: 206 LSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYSVFPNGTIDQTSLHG 265
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 266 SCPVTKGEKW 275
>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 288
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
L+DVE GG T+FP N L V P KG + +Y+ N +D H
Sbjct: 206 LSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYSVFPNGTIDQTSLHG 265
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 266 SCPVTKGEKW 275
Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
L+DVE GG T+FP N L V P KG + +++ N +D H
Sbjct: 206 LSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYSVFPNGTIDQTSLHG 265
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 266 SCPVTKGEKW 275
>gi|303276456|ref|XP_003057522.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461874|gb|EEH59167.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 231
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 31/60 (51%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
LL+ L + GG T FP L+L P G AV W+N + D R H+G PV G K+
Sbjct: 125 LLAYLREPTRGGRTTFPKLHLGFDPRVGDAVLWWNVDDDGKEDPRTTHAGEPVEAGEKFA 184
Score = 39.3 bits (90), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 26/56 (46%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L + GG T FP L+L P G AV W N + D H+G PV G K+
Sbjct: 129 LREPTRGGRTTFPKLHLGFDPRVGDAVLWWNVDDDGKEDPRTTHAGEPVEAGEKFA 184
>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 295
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
LTDVE GG T+FP N L V P KG + +Y+ N +D H
Sbjct: 215 LTDVEEGGETMFPYENGENMDIGYDYEQCIGLKVKPRKGDGLLFYSLMVNGTIDLTSLHG 274
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 275 SCPVIKGEKW 284
Score = 44.7 bits (104), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
LTDVE GG T+FP N L V P KG + +++ N +D H
Sbjct: 215 LTDVEEGGETMFPYENGENMDIGYDYEQCIGLKVKPRKGDGLLFYSLMVNGTIDLTSLHG 274
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 275 SCPVIKGEKW 284
>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 73 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYR 113
L+DVE GG T+FPS L+V P+ G A+ +++ + LD
Sbjct: 180 LSDVEEGGETVFPSAKGNISSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDASLDPS 239
Query: 114 MYHSGCPVALGNKW 127
H GCPV GNKW
Sbjct: 240 SLHGGCPVIRGNKW 253
Score = 43.9 bits (102), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FPS L+V P+ G A+ + + + LD
Sbjct: 180 LSDVEEGGETVFPSAKGNISSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDASLDPS 239
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 240 SLHGGCPVIRGNKW 253
>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
Length = 269
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 34/68 (50%), Gaps = 13/68 (19%)
Query: 73 LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
L+DVE GG TIFPS+ L V P KG A+ +++A + +D H GC
Sbjct: 190 LSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDAILFWSAALDGNVDSNSLHGGC 249
Query: 120 PVALGNKW 127
V G KW
Sbjct: 250 SVLRGEKW 257
Score = 43.9 bits (102), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 33/68 (48%), Gaps = 13/68 (19%)
Query: 11 LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
L+DVE GG TIFPS+ L V P KG A+ + +A + +D H GC
Sbjct: 190 LSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDAILFWSAALDGNVDSNSLHGGC 249
Query: 58 PVALGNKW 65
V G KW
Sbjct: 250 SVLRGEKW 257
>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
Length = 294
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
LTDVE GG T+FP N L V P KG + +Y+ N +D H
Sbjct: 214 LTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDRTSLHG 273
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 274 SCPVIKGEKW 283
Score = 44.7 bits (104), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
LTDVE GG T+FP N L V P KG + +++ N +D H
Sbjct: 214 LTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDRTSLHG 273
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 274 SCPVIKGEKW 283
>gi|119595340|gb|EAW74934.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III, isoform CRA_a
[Homo sapiens]
Length = 657
Score = 46.6 bits (109), Expect = 0.003, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 35/60 (58%), Gaps = 1/60 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ ++ L+ VE GGAT F NL+V + +A+FW+N H + D H+GCPV
Sbjct: 489 GNRVATFMIY-LSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 547
Score = 45.8 bits (107), Expect = 0.006, Method: Composition-based stats.
Identities = 24/58 (41%), Positives = 32/58 (55%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPV 59
NR + L+ VE GGAT F NL+V + +A+FW N H + D H+GCPV
Sbjct: 490 NRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPV 547
>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
Length = 344
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 62/147 (42%), Gaps = 28/147 (19%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLN---LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
+NR S + + + T+ P+ N L V + H DY+ + +G
Sbjct: 121 LNRGHDSVIADIEARIAKWTLMPAGNGEGLQVLK------YEHGQEYEGHYDYFFHKAGT 174
Query: 58 PVALGNKWGKLLLSGLTDVELGGATIFPSLN-----------------LTVFPEKGSAVF 100
GN++ +L+ L DVE GG T FP++ L P+KG+AV
Sbjct: 175 ANG-GNRYLTVLMY-LNDVEEGGETCFPNIPSPNGDNGPEFSECARKVLAAKPKKGNAVL 232
Query: 101 WYNAHANTLLDYRMYHSGCPVALGNKW 127
+++ L+ R H+ CPV G KW
Sbjct: 233 FHSIKPTGELERRSLHTACPVIKGVKW 259
>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
Length = 294
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
LTDVE GG T+FP N L V P KG + +Y+ N +D H
Sbjct: 214 LTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDRTSLHG 273
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 274 SCPVIKGEKW 283
Score = 44.7 bits (104), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
LTDVE GG T+FP N L V P KG + +++ N +D H
Sbjct: 214 LTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDRTSLHG 273
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 274 SCPVIKGEKW 283
>gi|378706224|gb|AFC35025.1| hypothetical protein OtV6_117c [Ostreococcus tauri virus RT-2011]
Length = 194
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 51/127 (40%), Gaps = 17/127 (13%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
+NR V LTD L L + + E G ++ H +T D
Sbjct: 74 VNRVTQRCVSLTDRPLINCE---KLQVLRYKEGG----FYKPHQDTFSD----------V 116
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
GNK ++ L D GG T+FP+L KG A+F++ L+ + H G P
Sbjct: 117 KGNKRMYTIILALNDDYEGGETVFPNLRRKYKLNKGDALFFHTLDNYELMTSKALHGGAP 176
Query: 121 VALGNKW 127
V G KW
Sbjct: 177 VKSGEKW 183
>gi|221482398|gb|EEE20746.1| prolyl 4-hydroxylase alpha subunit, putative [Toxoplasma gondii
GT1]
gi|221504447|gb|EEE30120.1| prolyl 4-hydroxylase alpha subunit, putative [Toxoplasma gondii
VEG]
Length = 401
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 40/80 (50%), Gaps = 6/80 (7%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
Y+ HS G K +L L DVE GG T F +L V P KG+ V W N++ T
Sbjct: 307 QYFKLHSD-----GGFRPKTILLYLNDVEAGGETSFENLGFRVAPMKGAGVLWNNSYPGT 361
Query: 109 -LLDYRMYHSGCPVALGNKW 127
+D R+ H+G P G K+
Sbjct: 362 NEIDPRLIHAGLPPEKGVKF 381
Score = 43.1 bits (100), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 1/56 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANT-LLDYWMYHSGCPVALGNKW 65
L DVE GG T F +L V P KG+ V W+N++ T +D + H+G P G K+
Sbjct: 326 LNDVEAGGETSFENLGFRVAPMKGAGVLWNNSYPGTNEIDPRLIHAGLPPEKGVKF 381
>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 281
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 65/146 (44%), Gaps = 27/146 (18%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLN---LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
++R E V+ + + T+ P N L V + + +AH DY+ + G
Sbjct: 46 LDRGEDEIVKRVEERIAAWTLMPVGNGEGLQVLRYQKEQKY--DAH----WDYFFHKDGI 99
Query: 58 PVALGNKWGKLLLSGLTDVELGGATIFPSL----------------NLTVFPEKGSAVFW 101
GN++ +L+ L D E GG T+FP++ +L P+KG+A+ +
Sbjct: 100 TNG-GNRYATVLMY-LVDTEEGGETVFPNVAAPGGENVGFSECARYHLAAKPKKGTAILF 157
Query: 102 YNAHANTLLDYRMYHSGCPVALGNKW 127
++ L+ + H+ CPV G KW
Sbjct: 158 HSIKPTGELERKSLHTACPVIRGIKW 183
>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
IL144]
gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Rubrivivax gelatinosus IL144]
Length = 279
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 33/60 (55%), Gaps = 4/60 (6%)
Query: 79 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW--GKLLLSGVF 136
GGAT FP + L V P KG+AVF+ ++ L R H G PV G KW K + GVF
Sbjct: 221 GGATTFPEVGLEVQPVKGNAVFF--SYERPLASTRTLHGGAPVLDGEKWVATKWMREGVF 278
Score = 42.4 bits (98), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GGAT FP + L V P KG+AVF+ ++ L H G PV G KW
Sbjct: 221 GGATTFPEVGLEVQPVKGNAVFF--SYERPLASTRTLHGGAPVLDGEKW 267
>gi|325929527|ref|ZP_08190641.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas perforans 91-118]
gi|325540037|gb|EGD11665.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas perforans 91-118]
Length = 418
Score = 46.6 bits (109), Expect = 0.004, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 31/64 (48%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L DV G T FP + V P G+ V + N HA+ D H+G PV
Sbjct: 341 NRQRTVCVYLNDVGAAGETEFPVAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTA 400
Query: 62 GNKW 65
G+KW
Sbjct: 401 GSKW 404
Score = 43.1 bits (100), Expect = 0.040, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 28/55 (50%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L DV G T FP + V P G+ V + N HA+ D H+G PV G+KW
Sbjct: 350 LNDVGAAGETEFPVAGVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKW 404
>gi|237841319|ref|XP_002369957.1| 2OG-Fe(II) oxygenase family protein, putative [Toxoplasma gondii
ME49]
gi|211967621|gb|EEB02817.1| 2OG-Fe(II) oxygenase family protein, putative [Toxoplasma gondii
ME49]
Length = 401
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 40/80 (50%), Gaps = 6/80 (7%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
Y+ HS G K +L L DVE GG T F +L V P KG+ V W N++ T
Sbjct: 307 QYFKLHSD-----GGFRPKTILLYLNDVEAGGETSFENLGFRVAPMKGAGVLWNNSYPGT 361
Query: 109 -LLDYRMYHSGCPVALGNKW 127
+D R+ H+G P G K+
Sbjct: 362 NEIDPRLIHAGLPPEKGVKF 381
Score = 43.1 bits (100), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 1/56 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANT-LLDYWMYHSGCPVALGNKW 65
L DVE GG T F +L V P KG+ V W+N++ T +D + H+G P G K+
Sbjct: 326 LNDVEAGGETSFENLGFRVAPMKGAGVLWNNSYPGTNEIDPRLIHAGLPPEKGVKF 381
>gi|193209068|ref|NP_001123048.1| Protein PHY-4, isoform a [Caenorhabditis elegans]
gi|172051526|emb|CAQ35067.1| Protein PHY-4, isoform a [Caenorhabditis elegans]
Length = 278
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ +L+ L + GG T+FP LNL + P+ G + W N + + + H+ CP+
Sbjct: 183 GNRIATVLVI-LQIAKKGGTTVFPRLNLNIRPKAGDVIVWLNTLSTGESNSQTLHAACPI 241
Query: 122 ALGNKWGKLL 131
G K G L
Sbjct: 242 HEGTKIGATL 251
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 32/68 (47%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR V L + GG T+FP LNL + P+ G + W N + + H+ CP+
Sbjct: 184 NRIATVLVILQIAKKGGTTVFPRLNLNIRPKAGDVIVWLNTLSTGESNSQTLHAACPIHE 243
Query: 62 GNKWGKLL 69
G K G L
Sbjct: 244 GTKIGATL 251
>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
Length = 280
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 48/120 (40%), Gaps = 25/120 (20%)
Query: 33 KGSAVFWHNAHANTLLDYWM-------YHSGCPVALGNKWGKLLLS---GLTDVELGGAT 82
K + + H+ +L Y + Y + P G + + + S LTDVE GG T
Sbjct: 148 KATMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQRVASFLLYLTDVEEGGET 207
Query: 83 IFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+FP N L V P KG + +Y+ N +D H CPV G KW
Sbjct: 208 MFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDPTSLHGSCPVIKGEKW 267
Score = 44.3 bits (103), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
LTDVE GG T+FP N L V P KG + +++ N +D H
Sbjct: 198 LTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDPTSLHG 257
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 258 SCPVIKGEKW 267
>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
Length = 264
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 34/68 (50%), Gaps = 13/68 (19%)
Query: 73 LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
L+DVE GG TIFPS+ L V P KG A+ +++A + +D H GC
Sbjct: 189 LSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDAILFWSAALDGNVDSNSLHGGC 248
Query: 120 PVALGNKW 127
V G KW
Sbjct: 249 SVLRGEKW 256
Score = 43.9 bits (102), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 33/68 (48%), Gaps = 13/68 (19%)
Query: 11 LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
L+DVE GG TIFPS+ L V P KG A+ + +A + +D H GC
Sbjct: 189 LSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDAILFWSAALDGNVDSNSLHGGC 248
Query: 58 PVALGNKW 65
V G KW
Sbjct: 249 SVLRGEKW 256
>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
Length = 246
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 56/149 (37%), Gaps = 31/149 (20%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLN----LTVFPEKGSAVFWHNAHANTLLDYWMYHSG 56
+ R + ++ + + T P N + EKG ++AH Y +H
Sbjct: 46 LERRQDETITRIEKRIAAWTFLPEENGEPIQILHYEKGQK---YDAH------YDYFHDK 96
Query: 57 CPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------LTVFPEKGSA 98
+G +L L+DV+ GG T+FP V P KG A
Sbjct: 97 NNQRVGGHRMATVLMYLSDVKKGGETVFPDAEGKLLQVKDDTWSDCARSGYAVKPRKGDA 156
Query: 99 VFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+ +++ H N D H+ CPV G KW
Sbjct: 157 LLFFSCHPNATTDPNSLHASCPVIEGEKW 185
>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
Length = 310
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 48/120 (40%), Gaps = 25/120 (20%)
Query: 33 KGSAVFWHNAHANTLLDYWM-------YHSGCPVALGNKWGKLLLS---GLTDVELGGAT 82
K + + H+ +L Y + Y + P G + + + S LTDVE GG T
Sbjct: 178 KATMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQRVASFLLYLTDVEEGGET 237
Query: 83 IFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+FP N L V P KG + +Y+ N +D H CPV G KW
Sbjct: 238 MFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDPTSLHGSCPVIKGEKW 297
Score = 44.3 bits (103), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
LTDVE GG T+FP N L V P KG + +++ N +D H
Sbjct: 228 LTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDPTSLHG 287
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 288 SCPVIKGEKW 297
>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
Length = 224
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/60 (46%), Positives = 33/60 (55%), Gaps = 4/60 (6%)
Query: 79 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW--GKLLLSGVF 136
GGAT FP + L V P KG+AVF+ ++ L R H G PV G KW K L GVF
Sbjct: 166 GGATTFPEVGLEVQPIKGNAVFF--SYERPLASTRTLHGGAPVLDGEKWVATKWLREGVF 223
Score = 42.0 bits (97), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GGAT FP + L V P KG+AVF+ ++ L H G PV G KW
Sbjct: 166 GGATTFPEVGLEVQPIKGNAVFF--SYERPLASTRTLHGGAPVLDGEKW 212
>gi|404253277|ref|ZP_10957245.1| putative oxygenase [Sphingomonas sp. PAMC 26621]
Length = 226
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 42/90 (46%), Gaps = 8/90 (8%)
Query: 40 HNAHANTLLDYW--MYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
HN + N YW M SG W ++ L DVE GGAT FP + + + P++G
Sbjct: 129 HNDYFNEAQPYWPKMIESGGQ----RTWTAMIY--LNDVEEGGATWFPLVGVRIAPKRGL 182
Query: 98 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+ W N A+ + H G PV G K+
Sbjct: 183 LIAWNNMRADGSPNPDTLHEGMPVTAGTKY 212
Score = 43.1 bits (100), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 32/57 (56%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ L DVE GGAT FP + + + P++G + W+N A+ + H G PV G K+
Sbjct: 156 IYLNDVEEGGATWFPLVGVRIAPKRGLLIAWNNMRADGSPNPDTLHEGMPVTAGTKY 212
>gi|395492951|ref|ZP_10424530.1| putative oxygenase [Sphingomonas sp. PAMC 26617]
Length = 226
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 42/90 (46%), Gaps = 8/90 (8%)
Query: 40 HNAHANTLLDYW--MYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGS 97
HN + N YW M SG W ++ L DVE GGAT FP + + + P++G
Sbjct: 129 HNDYFNEAQPYWPKMIESGGQ----RTWTAMIY--LNDVEEGGATWFPLVGVRIAPKRGL 182
Query: 98 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+ W N A+ + H G PV G K+
Sbjct: 183 LIAWNNMRADGSPNPDTLHEGMPVTAGTKY 212
Score = 43.1 bits (100), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 32/57 (56%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
+ L DVE GGAT FP + + + P++G + W+N A+ + H G PV G K+
Sbjct: 156 IYLNDVEEGGATWFPLVGVRIAPKRGLLIAWNNMRADGSPNPDTLHEGMPVTAGTKY 212
>gi|401397395|ref|XP_003880042.1| hypothetical protein NCLIV_004830 [Neospora caninum Liverpool]
gi|325114451|emb|CBZ50007.1| hypothetical protein NCLIV_004830 [Neospora caninum Liverpool]
Length = 308
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 35/62 (56%), Gaps = 1/62 (1%)
Query: 67 KLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT-LLDYRMYHSGCPVALGN 125
K +L L DVE GG T F +L V P KG+ V W N++ T +D R+ H+G P G
Sbjct: 227 KTVLLYLNDVEAGGETSFETLGFRVAPMKGAGVVWSNSYPGTHEIDPRLLHAGLPPERGV 286
Query: 126 KW 127
K+
Sbjct: 287 KF 288
Score = 42.7 bits (99), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANT-LLDYWMYHSGCPVALGNKW 65
L DVE GG T F +L V P KG+ V W N++ T +D + H+G P G K+
Sbjct: 233 LNDVEAGGETSFETLGFRVAPMKGAGVVWSNSYPGTHEIDPRLLHAGLPPERGVKF 288
>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
Length = 303
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 37/68 (54%), Gaps = 7/68 (10%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLDYRMYHSGC 119
G + G L++ L E GG T FP +++ V P++G+AVF + H +T R H G
Sbjct: 229 GQRVGTLVMY-LNTPEKGGGTTFPDVHVEVAPQRGNAVFFSYERPHPST----RTLHGGA 283
Query: 120 PVALGNKW 127
PV G KW
Sbjct: 284 PVLAGEKW 291
Score = 42.4 bits (98), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 6/57 (10%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
L E GG T FP +++ V P++G+AVF + H +T H G PV G KW
Sbjct: 239 LNTPEKGGGTTFPDVHVEVAPQRGNAVFFSYERPHPSTR----TLHGGAPVLAGEKW 291
>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
Length = 215
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 33/81 (40%), Gaps = 26/81 (32%)
Query: 11 LTDVELGGATIFPSLN--------------------------LTVFPEKGSAVFWHNAHA 44
L+DVE GG T+FP L V P +G A+ +HN H
Sbjct: 126 LSDVESGGETVFPKHGAWIEPDEMDVRGRSSSKDSSKCAKGALHVKPRRGDALLFHNCHL 185
Query: 45 NTLLDYWMYHSGCPVALGNKW 65
N D H+GCPV G KW
Sbjct: 186 NGREDPTSLHAGCPVLRGEKW 206
Score = 44.3 bits (103), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 33/81 (40%), Gaps = 26/81 (32%)
Query: 73 LTDVELGGATIFPSLN--------------------------LTVFPEKGSAVFWYNAHA 106
L+DVE GG T+FP L V P +G A+ ++N H
Sbjct: 126 LSDVESGGETVFPKHGAWIEPDEMDVRGRSSSKDSSKCAKGALHVKPRRGDALLFHNCHL 185
Query: 107 NTLLDYRMYHSGCPVALGNKW 127
N D H+GCPV G KW
Sbjct: 186 NGREDPTSLHAGCPVLRGEKW 206
>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
Length = 282
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 33/69 (47%), Gaps = 14/69 (20%)
Query: 73 LTDVELGGATIFP---SLNLTVF-----------PEKGSAVFWYNAHANTLLDYRMYHSG 118
L+DVE GG T+FP LN+ F P +G + +Y+ N +D+ H
Sbjct: 201 LSDVEEGGETMFPFEHDLNINTFNSRKCIGLKVKPRRGDGLLFYSVFPNGTIDWTSMHGS 260
Query: 119 CPVALGNKW 127
CPV G KW
Sbjct: 261 CPVIEGEKW 269
Score = 43.9 bits (102), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 33/69 (47%), Gaps = 14/69 (20%)
Query: 11 LTDVELGGATIFP---SLNLTVF-----------PEKGSAVFWHNAHANTLLDYWMYHSG 56
L+DVE GG T+FP LN+ F P +G + +++ N +D+ H
Sbjct: 201 LSDVEEGGETMFPFEHDLNINTFNSRKCIGLKVKPRRGDGLLFYSVFPNGTIDWTSMHGS 260
Query: 57 CPVALGNKW 65
CPV G KW
Sbjct: 261 CPVIEGEKW 269
>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
Length = 305
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 31/57 (54%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
V L+DV GG T+FP+ L V +G A+++ + + LD H G PV G KW
Sbjct: 237 VYLSDVAGGGETVFPNAGLAVMARQGGAIYFRYLNGHRQLDPLTLHGGAPVTNGEKW 293
Score = 45.1 bits (105), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 35/69 (50%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+ G + L+ L+DV GG T+FP+ L V +G A+++ + + LD H G
Sbjct: 225 TSRGGQRVATLIVYLSDVAGGGETVFPNAGLAVMARQGGAIYFRYLNGHRQLDPLTLHGG 284
Query: 119 CPVALGNKW 127
PV G KW
Sbjct: 285 APVTNGEKW 293
>gi|145345880|ref|XP_001417427.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577654|gb|ABO95720.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 378
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 42/85 (49%), Gaps = 6/85 (7%)
Query: 49 DYWMYHS-GCPVALGNKWG----KLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYN 103
+Y+ H G P+A+ G +L L DV+ GG T F L + V P+KG A+ ++
Sbjct: 232 EYFKAHEDGFPIAVAADKGYQRRATILVYLNDVDEGGETRFEHLGIEVAPKKGKALVFFP 291
Query: 104 AHANTLLDYRMYHSGCPVALG-NKW 127
+ A + D R H+ P G KW
Sbjct: 292 SSAACMPDARTLHTATPAKEGHEKW 316
Score = 38.1 bits (87), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 1/58 (1%)
Query: 9 VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG-NKW 65
V L DV+ GG T F L + V P+KG A+ + + A + D H+ P G KW
Sbjct: 259 VYLNDVDEGGETRFEHLGIEVAPKKGKALVFFPSSAACMPDARTLHTATPAKEGHEKW 316
>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 280
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 49/108 (45%), Gaps = 25/108 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + +L+ L+DVE GG T+FP+
Sbjct: 167 YEPHYDYFLDEFNTKNG-----GQRVATVLMY-LSDVEEGGETVFPAAKANFSSVPWWND 220
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L++ P+ G A+ +++ + LD H GCPV +GNKW
Sbjct: 221 LSECARKGLSLKPKMGDALLFWSMRPDATLDASSLHGGCPVIVGNKWS 268
Score = 43.5 bits (101), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 19/75 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ L++ P+ G A+ + + + LD
Sbjct: 194 LSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMRPDATLDAS 253
Query: 52 MYHSGCPVALGNKWG 66
H GCPV +GNKW
Sbjct: 254 SLHGGCPVIVGNKWS 268
>gi|167524906|ref|XP_001746788.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774568|gb|EDQ88195.1| predicted protein [Monosiga brevicollis MX1]
Length = 321
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 28/56 (50%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
L DV GG T F L L V P+KG AV WY+ + + R H PV G K+
Sbjct: 246 LNDVPAGGETSFTRLGLAVKPKKGKAVLWYSVTNDLEPEERTDHEARPVKQGTKYA 301
Score = 41.2 bits (95), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 27/56 (48%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
L DV GG T F L L V P+KG AV W++ + + H PV G K+
Sbjct: 246 LNDVPAGGETSFTRLGLAVKPKKGKAVLWYSVTNDLEPEERTDHEARPVKQGTKYA 301
>gi|341878860|gb|EGT34795.1| hypothetical protein CAEBREN_10065 [Caenorhabditis brenneri]
Length = 163
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 35/69 (50%), Gaps = 1/69 (1%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
+ GN+ +L ++ GG T+F + TV P K A+FWYN + + H+
Sbjct: 41 LGTGNRIATVLFY-MSQPSHGGGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAA 99
Query: 119 CPVALGNKW 127
CPV +G KW
Sbjct: 100 CPVLVGIKW 108
Score = 42.7 bits (99), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 27/49 (55%)
Query: 17 GGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
GG T+F + TV P K A+FW+N + + H+ CPV +G KW
Sbjct: 60 GGGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 108
>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 298
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 43/111 (38%), Gaps = 23/111 (20%)
Query: 38 FWHNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN--------- 88
+ H + DY +H + G +L L++V GG T+FP
Sbjct: 129 YEHGQKYDAHFDY--FHDKVNIVRGGHRIATVLMYLSNVTKGGETVFPDAEVPSCRVLSE 186
Query: 89 ------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+ V P KG A+ ++N H + + D H GCPV G KW
Sbjct: 187 NKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKW 237
Score = 41.2 bits (95), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 31/76 (40%), Gaps = 21/76 (27%)
Query: 11 LTDVELGGATIFPSLN---------------------LTVFPEKGSAVFWHNAHANTLLD 49
L++V GG T+FP + V P KG A+ + N H + + D
Sbjct: 162 LSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPD 221
Query: 50 YWMYHSGCPVALGNKW 65
H GCPV G KW
Sbjct: 222 PLSLHGGCPVIEGEKW 237
>gi|448930555|gb|AGE54119.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus IL-5-2s1]
gi|448934686|gb|AGE58238.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus NY-2B]
Length = 230
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 59/128 (46%), Gaps = 10/128 (7%)
Query: 1 MNRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
+++ + + EL D + + ++ V K ++H+ + D CP
Sbjct: 103 IDKIQNKTRELLDSKRHCIDKYKFEDVQVARYKPGEYYYHHYDGDDCDD------ACP-- 154
Query: 61 LGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR-MYHSGC 119
++ L+ L + GG TIFP L V P+KG AVF++ A+ +T Y+ H+G
Sbjct: 155 -KDQRLATLMVYLKEPNEGGKTIFPILKTQVVPKKGKAVFFWVANPSTRKLYKETLHAGL 213
Query: 120 PVALGNKW 127
PV G K+
Sbjct: 214 PVKSGEKF 221
>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 541
Score = 46.2 bits (108), Expect = 0.005, Method: Composition-based stats.
Identities = 32/97 (32%), Positives = 46/97 (47%), Gaps = 21/97 (21%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFP------SLNLTVFPE------- 94
DY+ + SG N+ +LL L+DVE GG T+FP S N +++ E
Sbjct: 360 FDYFFHKSG---MRNNRIATVLLY-LSDVEEGGETVFPNTDVPTSRNRSMYSECGNGGKA 415
Query: 95 ----KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
KG A+ +++ LD H+GCPV G KW
Sbjct: 416 LKARKGDALLFWSMKPGGELDAGSSHAGCPVIKGEKW 452
Score = 42.0 bits (97), Expect = 0.088, Method: Composition-based stats.
Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 17/81 (20%)
Query: 2 NRAEMSSVELTDVELGGATIFP------SLNLTVFPE-----------KGSAVFWHNAHA 44
NR + L+DVE GG T+FP S N +++ E KG A+ + +
Sbjct: 372 NRIATVLLYLSDVEEGGETVFPNTDVPTSRNRSMYSECGNGGKALKARKGDALLFWSMKP 431
Query: 45 NTLLDYWMYHSGCPVALGNKW 65
LD H+GCPV G KW
Sbjct: 432 GGELDAGSSHAGCPVIKGEKW 452
>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 292
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 29/55 (52%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DV GG TIFP ++V +G AV++ + LD H G PV G+KW
Sbjct: 226 LNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNGRRQLDPLTLHGGAPVLSGDKW 280
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 34/69 (49%)
Query: 59 VALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 118
A G + L+ L DV GG TIFP ++V +G AV++ + LD H G
Sbjct: 212 TAQGGQRVATLVIYLNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNGRRQLDPLTLHGG 271
Query: 119 CPVALGNKW 127
PV G+KW
Sbjct: 272 APVLSGDKW 280
>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 297
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 31/68 (45%), Gaps = 13/68 (19%)
Query: 73 LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
LTDV GG T+FP N L V P KG A+ +Y+ N +D H C
Sbjct: 217 LTDVPEGGETMFPYENGFNRDGNVEDCIGLRVRPRKGDALLFYSLLPNGTIDQTSAHGSC 276
Query: 120 PVALGNKW 127
PV G KW
Sbjct: 277 PVIKGEKW 284
Score = 43.9 bits (102), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 31/68 (45%), Gaps = 13/68 (19%)
Query: 11 LTDVELGGATIFPSLN-------------LTVFPEKGSAVFWHNAHANTLLDYWMYHSGC 57
LTDV GG T+FP N L V P KG A+ +++ N +D H C
Sbjct: 217 LTDVPEGGETMFPYENGFNRDGNVEDCIGLRVRPRKGDALLFYSLLPNGTIDQTSAHGSC 276
Query: 58 PVALGNKW 65
PV G KW
Sbjct: 277 PVIKGEKW 284
>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
Length = 288
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + +L+ L+DVE GG T+FP+
Sbjct: 175 YEPHYDYFLDDYNTKNG-----GQRMATVLMY-LSDVEEGGETVFPAAKGNFSSVPWWNE 228
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P+ G A+ +++ + LD H GCPV GNKW
Sbjct: 229 LSXCGKEGLSVKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 275
Score = 42.7 bits (99), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ L+V P+ G A+ + + + LD
Sbjct: 202 LSDVEEGGETVFPAAKGNFSSVPWWNELSXCGKEGLSVKPKMGDALLFWSMKPDASLDPS 261
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 262 SLHGGCPVIKGNKW 275
>gi|429859077|gb|ELA33870.1| 2og-fe oxygenase family protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 294
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 27/42 (64%)
Query: 87 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 128
+ L V P+KG+A+FW N H N DYR+ H+G PV G K G
Sbjct: 235 MGLLVKPKKGNAIFWNNMHMNGSGDYRVVHAGLPVHSGQKIG 276
Score = 44.3 bits (103), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 27/42 (64%)
Query: 25 LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKWG 66
+ L V P+KG+A+FW+N H N DY + H+G PV G K G
Sbjct: 235 MGLLVKPKKGNAIFWNNMHMNGSGDYRVVHAGLPVHSGQKIG 276
>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 25/107 (23%)
Query: 40 HNAHANTLLDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN----------- 88
+ H + LD + +G G + +L+ L+DVE GG T+FP+
Sbjct: 175 YEPHYDYFLDDYNTKNG-----GQRMATVLMY-LSDVEEGGETVFPAAKGNFSSVPWWNE 228
Query: 89 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
L+V P+ G A+ +++ + LD H GCPV GNKW
Sbjct: 229 LSDCGKEGLSVKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 275
Score = 42.7 bits (99), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ L+V P+ G A+ + + + LD
Sbjct: 202 LSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKEGLSVKPKMGDALLFWSMKPDASLDPS 261
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 262 SLHGGCPVIKGNKW 275
>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 292
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 29/55 (52%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DV GG TIFP ++V +G AV++ + LD H G PV G+KW
Sbjct: 226 LNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNDRRQLDPLTLHGGAPVLAGDKW 280
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 35/73 (47%)
Query: 55 SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
S A G + L+ L DV GG TIFP ++V +G AV++ + LD
Sbjct: 208 SAVHTAQGGQRVATLVIYLNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNDRRQLDPLT 267
Query: 115 YHSGCPVALGNKW 127
H G PV G+KW
Sbjct: 268 LHGGAPVLAGDKW 280
>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 318
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 40/98 (40%), Gaps = 20/98 (20%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLT----------------- 90
DY +H +G +L L+DVE GG TIFP+
Sbjct: 158 FDY--FHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAKAKLLQPKDESWSECAHKGY 215
Query: 91 -VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P KG A+ +++ H + D + H CPV G KW
Sbjct: 216 AVKPRKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKW 253
Score = 41.2 bits (95), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 30/73 (41%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSLNLT------------------VFPEKGSAVFWHNAHANTLLDYWM 52
L+DVE GG TIFP+ V P KG A+ + + H + D
Sbjct: 181 LSDVEKGGETIFPNAKAKLLQPKDESWSECAHKGYAVKPRKGDALLFFSLHLDASTDNKS 240
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 241 LHGSCPVIEGEKW 253
>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
Length = 295
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 29/55 (52%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DV GG TIFP ++V +G AV++ + LD H G PV G+KW
Sbjct: 229 LNDVPDGGETIFPEAGISVAARQGGAVYFRYMNGQRQLDPLTLHGGAPVLGGDKW 283
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 35/73 (47%)
Query: 55 SGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 114
S A G + L+ L DV GG TIFP ++V +G AV++ + LD
Sbjct: 211 SAVHTAQGGQRVATLVIYLNDVPDGGETIFPEAGISVAARQGGAVYFRYMNGQRQLDPLT 270
Query: 115 YHSGCPVALGNKW 127
H G PV G+KW
Sbjct: 271 LHGGAPVLGGDKW 283
>gi|194871369|ref|XP_001972835.1| GG15736 [Drosophila erecta]
gi|190654618|gb|EDV51861.1| GG15736 [Drosophila erecta]
Length = 476
Score = 45.8 bits (107), Expect = 0.006, Method: Composition-based stats.
Identities = 33/108 (30%), Positives = 50/108 (46%), Gaps = 20/108 (18%)
Query: 25 LNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALG-----NKWGKLLLSGLTDVELG 79
L++T FP KG+ + +++Y + + P NK L+ L + E G
Sbjct: 377 LDITGFPSKGNQML-------EVINYGIAGNYNPDDTARPRKQNKANALIF--LDNAERG 427
Query: 80 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
G +FPS +L V P KGS + W N + + YH CP+ GN W
Sbjct: 428 GEIVFPSRHLKVRPRKGSMLVWMNLERSVI-----YHQ-CPILKGNMW 469
Score = 44.7 bits (104), Expect = 0.011, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 6/55 (10%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L + E GG +FPS +L V P KGS + W N + + YH CP+ GN W
Sbjct: 421 LDNAERGGEIVFPSRHLKVRPRKGSMLVWMNLERSVI-----YHQ-CPILKGNMW 469
>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
Length = 286
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 39/77 (50%), Gaps = 10/77 (12%)
Query: 56 GCPVAL---GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLL 110
G P L G + G L++ L + E GG T FP ++L V P++G VF + H +T
Sbjct: 203 GTPTILKRGGQRVGTLVMY-LGEPEKGGGTTFPDVHLEVAPKRGHGVFFSYERPHPST-- 259
Query: 111 DYRMYHSGCPVALGNKW 127
R H G PV G KW
Sbjct: 260 --RTLHGGAPVLAGEKW 274
Score = 41.2 bits (95), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 6/57 (10%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVF--WHNAHANTLLDYWMYHSGCPVALGNKW 65
L + E GG T FP ++L V P++G VF + H +T H G PV G KW
Sbjct: 222 LGEPEKGGGTTFPDVHLEVAPKRGHGVFFSYERPHPSTR----TLHGGAPVLAGEKW 274
>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
Length = 284
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 32/70 (45%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFP---------------SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
L+DVE GG T+FP + L V P++G + +Y+ N +D H
Sbjct: 202 LSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQRGDGLLFYSVFPNGTIDRTSLHG 261
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 262 SCPVIAGEKW 271
Score = 43.9 bits (102), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 32/70 (45%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFP---------------SLNLTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
L+DVE GG T+FP + L V P++G + +++ N +D H
Sbjct: 202 LSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQRGDGLLFYSVFPNGTIDRTSLHG 261
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 262 SCPVIAGEKW 271
>gi|156403977|ref|XP_001640184.1| predicted protein [Nematostella vectensis]
gi|156227317|gb|EDO48121.1| predicted protein [Nematostella vectensis]
Length = 414
Score = 45.8 bits (107), Expect = 0.006, Method: Composition-based stats.
Identities = 29/88 (32%), Positives = 34/88 (38%), Gaps = 33/88 (37%)
Query: 73 LTDVELGGATIFP---------------------------SLNLTVFPEKGSAVFWYNAH 105
L DVE GG T FP NL + P+K SA+ WYN +
Sbjct: 299 LNDVEEGGETAFPVADNETFSIEAWAGITKYRCDLSRHCHKANLYIKPKKRSAIMWYNHY 358
Query: 106 ANTL------LDYRMYHSGCPVALGNKW 127
+ L LD YH GC V G KW
Sbjct: 359 TDELTGWFGGLDPMTYHGGCDVVRGRKW 386
Score = 43.5 bits (101), Expect = 0.025, Method: Composition-based stats.
Identities = 28/88 (31%), Positives = 34/88 (38%), Gaps = 33/88 (37%)
Query: 11 LTDVELGGATIFP---------------------------SLNLTVFPEKGSAVFWHNAH 43
L DVE GG T FP NL + P+K SA+ W+N +
Sbjct: 299 LNDVEEGGETAFPVADNETFSIEAWAGITKYRCDLSRHCHKANLYIKPKKRSAIMWYNHY 358
Query: 44 ANTL------LDYWMYHSGCPVALGNKW 65
+ L LD YH GC V G KW
Sbjct: 359 TDELTGWFGGLDPMTYHGGCDVVRGRKW 386
>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 296
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 35/66 (53%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
G + G LL+ L DVE GG T+FP +V P +G A+++ + L D H+ P+
Sbjct: 215 GQRVGTLLMY-LNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPL 273
Query: 122 ALGNKW 127
G KW
Sbjct: 274 RAGEKW 279
Score = 43.5 bits (101), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 29/55 (52%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVALGNKW 65
L DVE GG T+FP +V P +G A+++ + L D H+ P+ G KW
Sbjct: 225 LNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLRAGEKW 279
>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
Length = 276
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 32/70 (45%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFP---------------SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
L+DVE GG T+FP + L V P++G + +Y+ N +D H
Sbjct: 194 LSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQRGDGLLFYSVFPNGTIDRTSLHG 253
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 254 SCPVIAGEKW 263
Score = 43.5 bits (101), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 32/70 (45%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFP---------------SLNLTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
L+DVE GG T+FP + L V P++G + +++ N +D H
Sbjct: 194 LSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQRGDGLLFYSVFPNGTIDRTSLHG 253
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 254 SCPVIAGEKW 263
>gi|397613716|gb|EJK62387.1| hypothetical protein THAOC_17003 [Thalassiosira oceanica]
Length = 578
Score = 45.8 bits (107), Expect = 0.006, Method: Composition-based stats.
Identities = 22/52 (42%), Positives = 30/52 (57%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 120
+L+ L DV GGAT FP LN+ V P KG A+ ++ A + +D H CP
Sbjct: 447 VLTYLNDVSKGGATAFPRLNVEVRPRKGMALVFFPATVDGHVDGNALHEACP 498
Score = 41.6 bits (96), Expect = 0.12, Method: Composition-based stats.
Identities = 21/48 (43%), Positives = 26/48 (54%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP 58
L DV GGAT FP LN+ V P KG A+ + A + +D H CP
Sbjct: 451 LNDVSKGGATAFPRLNVEVRPRKGMALVFFPATVDGHVDGNALHEACP 498
>gi|224006045|ref|XP_002291983.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220972502|gb|EED90834.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 168
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 35/61 (57%)
Query: 60 ALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 119
A G + +L+ L DV GGAT FP+LN+ V P +G AV ++ + + LLD H+
Sbjct: 106 ANGGQRTITVLTYLNDVPRGGATSFPNLNIQVQPRRGMAVVFFPSTLDGLLDKMALHAAM 165
Query: 120 P 120
P
Sbjct: 166 P 166
Score = 43.1 bits (100), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 28/48 (58%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCP 58
L DV GGAT FP+LN+ V P +G AV + + + LLD H+ P
Sbjct: 119 LNDVPRGGATSFPNLNIQVQPRRGMAVVFFPSTLDGLLDKMALHAAMP 166
>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
expressed [Oryza sativa Japonica Group]
gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
Length = 299
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 33/73 (45%), Gaps = 18/73 (24%)
Query: 73 LTDVELGGATIFPSL------------------NLTVFPEKGSAVFWYNAHANTLLDYRM 114
LTDV+ GG T+FP+ L V P+KG A+ +++ H N D
Sbjct: 162 LTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNATTDPAS 221
Query: 115 YHSGCPVALGNKW 127
H CPV G KW
Sbjct: 222 LHGSCPVIEGEKW 234
Score = 43.5 bits (101), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 32/73 (43%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSL------------------NLTVFPEKGSAVFWHNAHANTLLDYWM 52
LTDV+ GG T+FP+ L V P+KG A+ + + H N D
Sbjct: 162 LTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNATTDPAS 221
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 222 LHGSCPVIEGEKW 234
>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
Length = 487
Score = 45.8 bits (107), Expect = 0.006, Method: Composition-based stats.
Identities = 31/98 (31%), Positives = 41/98 (41%), Gaps = 20/98 (20%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------L 89
DY +H ALG +L L+ VE GG T+FP+ L
Sbjct: 159 FDY--FHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGL 216
Query: 90 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P KG AV +++ H + + D H CPV G KW
Sbjct: 217 AVKPVKGDAVLFFSLHIDGVPDPLSLHGSCPVIEGEKW 254
Score = 40.4 bits (93), Expect = 0.26, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 31/73 (42%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
L+ VE GG T+FP+ L V P KG AV + + H + + D
Sbjct: 182 LSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDAVLFFSLHIDGVPDPLS 241
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 242 LHGSCPVIEGEKW 254
>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 306
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 43/99 (43%), Gaps = 21/99 (21%)
Query: 48 LDYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN------------------- 88
DY +H G + LL L+DVE GG T+FPS
Sbjct: 197 FDY--FHDDFNTKNGGQRIATLLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRG 254
Query: 89 LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
++V P+ G A+ +++ + LD H GCPV G+KW
Sbjct: 255 ISVKPKMGDALLFWSMRPDGTLDPTSLHGGCPVIKGDKW 293
Score = 42.4 bits (98), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 34/74 (45%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FPS ++V P+ G A+ + + + LD
Sbjct: 220 LSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALLFWSMRPDGTLDPT 279
Query: 52 MYHSGCPVALGNKW 65
H GCPV G+KW
Sbjct: 280 SLHGGCPVIKGDKW 293
>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 156
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 36/74 (48%), Gaps = 19/74 (25%)
Query: 73 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYR 113
L+DVE GG T+FP+ L++ P+ G A+ +++ + LD
Sbjct: 70 LSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMRPDATLDAS 129
Query: 114 MYHSGCPVALGNKW 127
H GCPV +GNKW
Sbjct: 130 SLHGGCPVIVGNKW 143
Score = 43.1 bits (100), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ L++ P+ G A+ + + + LD
Sbjct: 70 LSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMRPDATLDAS 129
Query: 52 MYHSGCPVALGNKW 65
H GCPV +GNKW
Sbjct: 130 SLHGGCPVIVGNKW 143
>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
Length = 286
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 11/78 (14%)
Query: 52 MYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-- 109
M SG VA L+ L +V GG T+FP + +++ P++GSA ++ A+ N+L
Sbjct: 205 MTQSGQRVAT-------LVMYLNEVTEGGETVFPDVGISITPKRGSAAYF--AYCNSLGQ 255
Query: 110 LDYRMYHSGCPVALGNKW 127
+D H G PV G KW
Sbjct: 256 VDPATLHGGAPVLTGEKW 273
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 35/57 (61%), Gaps = 4/57 (7%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL--LDYWMYHSGCPVALGNKW 65
L +V GG T+FP + +++ P++GSA ++ A+ N+L +D H G PV G KW
Sbjct: 219 LNEVTEGGETVFPDVGISITPKRGSAAYF--AYCNSLGQVDPATLHGGAPVLTGEKW 273
>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 248
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 32/66 (48%), Gaps = 1/66 (1%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 121
GN+ L+L L VE GG T FP L L V P G VF+ N + H+G PV
Sbjct: 172 GNRQATLILY-LNAVEEGGETAFPELGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPV 230
Query: 122 ALGNKW 127
G KW
Sbjct: 231 RKGEKW 236
Score = 44.3 bits (103), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 29/64 (45%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVAL 61
NR + L VE GG T FP L L V P G VF+ N + H+G PV
Sbjct: 173 NRQATLILYLNAVEEGGETAFPELGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRK 232
Query: 62 GNKW 65
G KW
Sbjct: 233 GEKW 236
>gi|405955421|gb|EKC22545.1| Prolyl 4-hydroxylase subunit alpha-2 [Crassostrea gigas]
Length = 173
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 55/127 (43%), Gaps = 20/127 (15%)
Query: 3 RAEMSS--VELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHSGCPVA 60
R M++ V L DVE GG T FP L + V P KG A+ W+N + + + H+ V
Sbjct: 44 RDRMATILVYLQDVEDGGETKFPELGIWVKPRKGRALVWNNMNEDGKCEALSVHNAAKVN 103
Query: 61 LGNKW--------------GKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 106
G+K+ GK V G P +N + EKGS ++ +
Sbjct: 104 KGHKYILQRWYYYKNFYSLGKRPPEPELPVRASGT---PRVNCDEY-EKGSCRWYDEWNY 159
Query: 107 NTLLDYR 113
L+DYR
Sbjct: 160 EHLVDYR 166
Score = 41.6 bits (96), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 31/59 (52%)
Query: 69 LLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
+L L DVE GG T FP L + V P KG A+ W N + + + H+ V G+K+
Sbjct: 50 ILVYLQDVEDGGETKFPELGIWVKPRKGRALVWNNMNEDGKCEALSVHNAAKVNKGHKY 108
>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 48/117 (41%), Gaps = 20/117 (17%)
Query: 31 PEKGSAV-FWHNAHANTLLDYWMY-HSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLN 88
PE G A+ H + ++ Y H ALG +L L++VE GG TIFP+
Sbjct: 116 PENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMYLSNVEKGGETIFPNAE 175
Query: 89 ------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
V P KG A+ +++ H ++ D H CPV G KW
Sbjct: 176 GKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTDSDSLHGSCPVIEGQKW 232
Score = 38.9 bits (89), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 31/73 (42%), Gaps = 18/73 (24%)
Query: 11 LTDVELGGATIFPSLN------------------LTVFPEKGSAVFWHNAHANTLLDYWM 52
L++VE GG TIFP+ V P KG A+ + + H ++ D
Sbjct: 160 LSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTDSDS 219
Query: 53 YHSGCPVALGNKW 65
H CPV G KW
Sbjct: 220 LHGSCPVIEGQKW 232
>gi|224013910|ref|XP_002296619.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968971|gb|EED87315.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 541
Score = 45.4 bits (106), Expect = 0.007, Method: Composition-based stats.
Identities = 32/82 (39%), Positives = 41/82 (50%), Gaps = 5/82 (6%)
Query: 49 DYWMYHSGCPVALGNKWGKLLLSGLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT 108
DY Y P G + L L L DVE GG T FP L++T+ P+KG+AV W + +
Sbjct: 444 DYIEYQEDMPC--GVRMLTLFLY-LNDVEEGGGTHFPLLDITIQPKKGNAVIWPSVLDDK 500
Query: 109 --LLDYRMYHSGCPVALGNKWG 128
D R H PV G K+G
Sbjct: 501 PETKDPRTDHEALPVINGIKYG 522
Score = 42.0 bits (97), Expect = 0.082, Method: Composition-based stats.
Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 2/58 (3%)
Query: 11 LTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANT--LLDYWMYHSGCPVALGNKWG 66
L DVE GG T FP L++T+ P+KG+AV W + + D H PV G K+G
Sbjct: 465 LNDVEEGGGTHFPLLDITIQPKKGNAVIWPSVLDDKPETKDPRTDHEALPVINGIKYG 522
>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
gi|194697650|gb|ACF82909.1| unknown [Zea mays]
gi|194708468|gb|ACF88318.1| unknown [Zea mays]
gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
Length = 308
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 18/83 (21%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFP-----------------SLNLTVFPEKGSAVFWYNA 104
G+++ +LL LTDV GG T+FP + V P KG A+ ++N
Sbjct: 166 GHRYATVLLY-LTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNL 224
Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
+ + D H GCPV G KW
Sbjct: 225 NPDGTTDSVSLHGGCPVIKGEKW 247
Score = 41.6 bits (96), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 30/72 (41%), Gaps = 17/72 (23%)
Query: 11 LTDVELGGATIFP-----------------SLNLTVFPEKGSAVFWHNAHANTLLDYWMY 53
LTDV GG T+FP + V P KG A+ + N + + D
Sbjct: 176 LTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNLNPDGTTDSVSL 235
Query: 54 HSGCPVALGNKW 65
H GCPV G KW
Sbjct: 236 HGGCPVIKGEKW 247
>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 294
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
LTDVE GG T+FP N L V P +G + +Y+ N +D H
Sbjct: 209 LTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQGDGLLFYSVFPNGTIDPTSLHG 268
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 269 SCPVIKGQKW 278
Score = 43.5 bits (101), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
LTDVE GG T+FP N L V P +G + +++ N +D H
Sbjct: 209 LTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQGDGLLFYSVFPNGTIDPTSLHG 268
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 269 SCPVIKGQKW 278
>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1-like [Cucumis sativus]
Length = 294
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
LTDVE GG T+FP N L V P +G + +Y+ N +D H
Sbjct: 209 LTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQGDGLLFYSVFPNGTIDPTSLHG 268
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 269 SCPVIKGQKW 278
Score = 43.5 bits (101), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
LTDVE GG T+FP N L V P +G + +++ N +D H
Sbjct: 209 LTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQGDGLLFYSVFPNGTIDPTSLHG 268
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 269 SCPVIKGQKW 278
>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
Length = 294
Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 40/93 (43%), Gaps = 18/93 (19%)
Query: 53 YHSGCPVALGNKWGKLLLS---GLTDVELGGATIFPSLN---------------LTVFPE 94
Y + PV G + + + S LT+VE GG T+FP N L V P
Sbjct: 191 YDAFDPVQYGPQKSQRVASFLLYLTNVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPR 250
Query: 95 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 127
KG + +Y+ N +D H CPV G KW
Sbjct: 251 KGDGLLFYSLMVNGTIDRTSLHGSCPVIKGEKW 283
Score = 42.7 bits (99), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
LT+VE GG T+FP N L V P KG + +++ N +D H
Sbjct: 214 LTNVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTIDRTSLHG 273
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 274 SCPVIKGEKW 283
>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 287
Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 19/78 (24%)
Query: 69 LLSGLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYNAHANTL 109
LL L+DVE GG T+FP+ L+V P G A+ +++ +
Sbjct: 197 LLMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSVKPNMGDALLFWSMKPDAT 256
Query: 110 LDYRMYHSGCPVALGNKW 127
LD H GCPV GNKW
Sbjct: 257 LDPSSLHGGCPVINGNKW 274
Score = 42.0 bits (97), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 33/74 (44%), Gaps = 19/74 (25%)
Query: 11 LTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWHNAHANTLLDYW 51
L+DVE GG T+FP+ L+V P G A+ + + + LD
Sbjct: 201 LSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSVKPNMGDALLFWSMKPDATLDPS 260
Query: 52 MYHSGCPVALGNKW 65
H GCPV GNKW
Sbjct: 261 SLHGGCPVINGNKW 274
>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 295
Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 18/83 (21%)
Query: 62 GNKWGKLLLSGLTDVELGGATIFP-----------------SLNLTVFPEKGSAVFWYNA 104
G+++ +LL LTDV GG T+FP + V P KG A+ ++N
Sbjct: 153 GHRYATVLLY-LTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNL 211
Query: 105 HANTLLDYRMYHSGCPVALGNKW 127
+ + D H GCPV G KW
Sbjct: 212 NPDGTTDSVSLHGGCPVIKGEKW 234
Score = 41.6 bits (96), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 30/72 (41%), Gaps = 17/72 (23%)
Query: 11 LTDVELGGATIFP-----------------SLNLTVFPEKGSAVFWHNAHANTLLDYWMY 53
LTDV GG T+FP + V P KG A+ + N + + D
Sbjct: 163 LTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNLNPDGTTDSVSL 222
Query: 54 HSGCPVALGNKW 65
H GCPV G KW
Sbjct: 223 HGGCPVIKGEKW 234
>gi|66359518|ref|XP_626937.1| prolyl 4-hydroxylase alpha subunit [Cryptosporidium parvum Iowa II]
gi|46228084|gb|EAK88983.1| prolyl 4-hydroxylase alpha subunit [Cryptosporidium parvum Iowa II]
Length = 717
Score = 45.1 bits (105), Expect = 0.008, Method: Composition-based stats.
Identities = 23/54 (42%), Positives = 29/54 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
NR SV L+DVE GG FP + + P+KG AV W N + LDY H+
Sbjct: 578 NRTHTISVFLSDVENGGELDFPYAGIKIKPKKGFAVVWPNIDSQGKLDYTTVHA 631
Score = 40.4 bits (93), Expect = 0.27, Method: Composition-based stats.
Identities = 19/45 (42%), Positives = 25/45 (55%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
L+DVE GG FP + + P+KG AV W N + LDY H+
Sbjct: 587 LSDVENGGELDFPYAGIKIKPKKGFAVVWPNIDSQGKLDYTTVHA 631
>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
Length = 292
Score = 45.1 bits (105), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 32/70 (45%), Gaps = 15/70 (21%)
Query: 73 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
L+DVE GG T+FP N L V P++G A+ +Y+ N D H
Sbjct: 210 LSDVEEGGETMFPFENYEHMNENYDYKECIGLKVKPKQGDALLFYSMFPNGTFDKTALHG 269
Query: 118 GCPVALGNKW 127
CPV G KW
Sbjct: 270 SCPVIKGEKW 279
Score = 43.1 bits (100), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 32/70 (45%), Gaps = 15/70 (21%)
Query: 11 LTDVELGGATIFPSLN---------------LTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
L+DVE GG T+FP N L V P++G A+ +++ N D H
Sbjct: 210 LSDVEEGGETMFPFENYEHMNENYDYKECIGLKVKPKQGDALLFYSMFPNGTFDKTALHG 269
Query: 56 GCPVALGNKW 65
CPV G KW
Sbjct: 270 SCPVIKGEKW 279
>gi|195452740|ref|XP_002073479.1| GK13122 [Drosophila willistoni]
gi|194169564|gb|EDW84465.1| GK13122 [Drosophila willistoni]
Length = 494
Score = 45.1 bits (105), Expect = 0.009, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 30/59 (50%), Gaps = 5/59 (8%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-----LDYRMYHSGCPVALGNK 126
L ++GGA +FP L L + GS ++W + N DYR H+ CPV LG +
Sbjct: 433 LNPAQMGGAMVFPQLQLGINVPHGSLLYWRTCNGNEFSMDSESDYRSRHAVCPVVLGEQ 491
Score = 43.1 bits (100), Expect = 0.033, Method: Composition-based stats.
Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 5/63 (7%)
Query: 7 SSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTL-----LDYWMYHSGCPVAL 61
+ V L ++GGA +FP L L + GS ++W + N DY H+ CPV L
Sbjct: 429 ARVMLNPAQMGGAMVFPQLQLGINVPHGSLLYWRTCNGNEFSMDSESDYRSRHAVCPVVL 488
Query: 62 GNK 64
G +
Sbjct: 489 GEQ 491
>gi|67598816|ref|XP_666240.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54657195|gb|EAL36009.1| hypothetical protein Chro.30441 [Cryptosporidium hominis]
Length = 718
Score = 45.1 bits (105), Expect = 0.009, Method: Composition-based stats.
Identities = 23/54 (42%), Positives = 29/54 (53%)
Query: 2 NRAEMSSVELTDVELGGATIFPSLNLTVFPEKGSAVFWHNAHANTLLDYWMYHS 55
NR SV L+DVE GG FP + + P+KG AV W N + LDY H+
Sbjct: 579 NRTHTISVFLSDVENGGELDFPYAGIKIKPKKGLAVVWPNIDSQGKLDYTTVHA 632
Score = 40.0 bits (92), Expect = 0.27, Method: Composition-based stats.
Identities = 19/45 (42%), Positives = 25/45 (55%)
Query: 73 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 117
L+DVE GG FP + + P+KG AV W N + LDY H+
Sbjct: 588 LSDVENGGELDFPYAGIKIKPKKGLAVVWPNIDSQGKLDYTTVHA 632
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.137 0.445
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,188,443,347
Number of Sequences: 23463169
Number of extensions: 86208845
Number of successful extensions: 166759
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1222
Number of HSP's successfully gapped in prelim test: 324
Number of HSP's that attempted gapping in prelim test: 163012
Number of HSP's gapped (non-prelim): 3144
length of query: 136
length of database: 8,064,228,071
effective HSP length: 101
effective length of query: 35
effective length of database: 9,989,415,298
effective search space: 349629535430
effective search space used: 349629535430
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 71 (32.0 bits)