BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy6259
(227 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
Length = 514
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 95/227 (41%), Positives = 131/227 (57%), Gaps = 16/227 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
YP C+G+ P + L+C YE FL+I PLK++E+ DP +V HD I + EI+
Sbjct: 272 YPSLCRGDDQRPAKELAKLRCRYEHNRTPFLRISPLKLQEVNHDPMIVMYHDVISNKEID 331
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
II +SK + R V + + TR S +L + HP + + R +DMTNL +
Sbjct: 332 AIISISKPLMHRSMVGDDHEKAVSKTRTSSNAWLDDVM---HPVVRTLSQRTEDMTNLAM 388
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---------RLASFMFYLTDVELGG 173
ER LQ+ NYG+GGHY H D +EG R+A+ M+YL+DV +GG
Sbjct: 389 TAAER----LQVGNYGIGGHYLPHYDYAVAEEGKEVYPSIGKGNRIATVMYYLSDVAIGG 444
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
AT+FP L L VFP+KGSA+FWYN HAN +D+R H CPV +G+KW
Sbjct: 445 ATVFPQLGLGVFPQKGSAIFWYNLHANGTVDHRTLHGACPVFVGSKW 491
>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
Length = 476
Score = 176 bits (445), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 89/230 (38%), Positives = 135/230 (58%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y + C+G +S+P +++ NLKC Y FLKI PLK EE YLDPR+V H+ IYD E
Sbjct: 223 ERYEMLCRGEVSIPREVEKNLKCRYVDRGIPFLKIAPLKEEEAYLDPRIVVYHNVIYDEE 282
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I +++ + +R V NY G + R+SK +L +H + + R++ MT
Sbjct: 283 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHKHVAAVSKRVEHMT 339
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
++ I E LQ+ NYG+GGHY+ H D ++E R+A+ ++Y++DVE
Sbjct: 340 SMSIETAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 395
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F ++N++++P KGSA FWYN N D++ H+ CPV G+KW
Sbjct: 396 QGGGTVFTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKW 445
>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
Length = 553
Score = 174 bits (442), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 91/230 (39%), Positives = 133/230 (57%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G P +++S L C Y + ++ FL+IGPLK+EE YL P +V HD + D E
Sbjct: 303 KLYEQLCRGEQQPPIELRSQLVCRYTTNSSPFLRIGPLKLEEAYLRPYIVIYHDVMSDRE 362
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I RI ++ + R V NY G+ + + R+SK +L + + I R++DMT
Sbjct: 363 IERIKHYARPRFRRATVQNYKTGELEFANYRISKSAWLKD---AEDEMIRTISQRVEDMT 419
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GGHY+ H D R+E R+A+ +FY++DV
Sbjct: 420 GLTMETAEE----LQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRIATVLFYMSDVT 475
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FPSLNL ++P KG+A FW+N HA+ DY H+ CPV G KW
Sbjct: 476 QGGATVFPSLNLALWPRKGTAAFWFNLHASGRGDYATRHAACPVLTGTKW 525
>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
vitripennis]
Length = 556
Score = 173 bits (439), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 89/230 (38%), Positives = 132/230 (57%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y + C+G + +P I+ L+C Y FLKI P K EE YLDPR+V HD IYD E
Sbjct: 303 ERYEMLCRGEIKMPLSIQKELRCRYVDRGIPFLKIAPFKEEEAYLDPRIVIYHDVIYDDE 362
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I +++ + +R V NY G+ + R+SK +L +H + + R++ MT
Sbjct: 363 IETIKRMAQPRFKRATVQNYKTGELEIANYRISKSAWLQEH---EHKHVRAVSQRVEHMT 419
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
++ I E LQ+ NYG+GGHY+ H D R+E R+A+ ++Y++DVE
Sbjct: 420 SMSIETAE----ELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRIATVLYYMSDVE 475
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F +N++++P+KGSA FWYN N DY+ H+ CPV G+KW
Sbjct: 476 QGGGTVFTKINISLWPKKGSAAFWYNLKPNGEGDYKTRHAACPVLTGSKW 525
>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
Length = 536
Score = 173 bits (438), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 92/230 (40%), Positives = 130/230 (56%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y C+G +S+P + S LKCFY S N FLKI P KVEE + P + D + DSE
Sbjct: 286 EFYEQLCRGEISLPVEKASKLKCFYLSRNQPFLKIAPFKVEEAHHRPDIFIFRDVLADSE 345
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I +++ + +R V N G+ R+SK +L E +H + + R+ DMT
Sbjct: 346 IATIKRMAQPRFKRATVQNTDTGELEIAQYRISKSAWLKEE---EHKHIADVSQRVSDMT 402
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GGHY+ H D RDE R+A+ +FY++DVE
Sbjct: 403 GLTMSTAEE----LQVVNYGIGGHYEPHFDFARRDERNAFKSLGTGNRIATVLFYMSDVE 458
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FPS+ ++++P+KGSA FWYN H + D H+ CPV G+KW
Sbjct: 459 QGGATVFPSIQVSLWPQKGSAAFWYNLHPSGDGDKMTRHAACPVLTGSKW 508
>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
Length = 415
Score = 172 bits (437), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 134/230 (58%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y + C+G +++P +++ NLKC Y FLKI P K EE YLDPR+V H+ IYD E
Sbjct: 162 ERYEMLCRGEVTIPPEVQKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDDE 221
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I +++ + +R V NY G + R+SK +L +H + + R++ MT
Sbjct: 222 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHKHVAAVSRRVEHMT 278
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
++ + E LQ+ NYG+GGHY+ H D ++E R+A+ ++Y++DVE
Sbjct: 279 SMTVDTAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 334
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F ++N+ ++P+KGSA FWYN N D++ H+ CPV G+KW
Sbjct: 335 QGGGTVFTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKW 384
>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
Length = 537
Score = 172 bits (437), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 134/230 (58%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y + C+G +++P +++ NLKC Y FLKI P K EE YLDPR+V H+ IYD E
Sbjct: 284 ERYEMLCRGEVTIPPEVQKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDDE 343
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I +++ + +R V NY G + R+SK +L +H + + R++ MT
Sbjct: 344 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHKHVAAVSRRVEHMT 400
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
++ + E LQ+ NYG+GGHY+ H D ++E R+A+ ++Y++DVE
Sbjct: 401 SMTVDTAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 456
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F ++N+ ++P+KGSA FWYN N D++ H+ CPV G+KW
Sbjct: 457 QGGGTVFTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKW 506
>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
rotundata]
Length = 550
Score = 172 bits (437), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 89/230 (38%), Positives = 133/230 (57%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y + C+G +S+P +I+ NLKC Y FLKI P K EE YLDPR+V H+ IYD E
Sbjct: 297 ERYEMLCRGEVSIPPEIQKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVIYHNVIYDEE 356
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I +++ + +R V NY G + R+SK +L +H + + R++ MT
Sbjct: 357 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHKHVAAVSKRVEHMT 413
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
+L + E LQ+ NYG+GGHY+ H D ++E R+A+ ++Y++DVE
Sbjct: 414 SLNVETAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 469
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F ++N++++P KGSA FW+N N D R H+ CPV G+KW
Sbjct: 470 QGGGTVFTAINISLWPRKGSAAFWFNLKPNGEGDLRTRHAACPVLTGSKW 519
>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
impatiens]
Length = 557
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 134/230 (58%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y + C+G +S+P +I+ NL C Y FLKI P K EE YLDPR+V H+ IYD E
Sbjct: 304 ERYEMLCRGEVSIPPEIQKNLVCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDEE 363
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I +++ + +R V NY G + R+SK +L +H + + R++ MT
Sbjct: 364 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHEHVAAVSRRVEHMT 420
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
++ + E LQ+ NYG+GGHY+ H D ++E R+A+ ++Y++DVE
Sbjct: 421 SMTVDTAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 476
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F ++N++++P+KGSA FWYN N D++ H+ CPV G+KW
Sbjct: 477 QGGGTVFTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKW 526
>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
terrestris]
Length = 557
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 134/230 (58%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y + C+G +S+P +I+ NL C Y FLKI P K EE YLDPR+V H+ IYD E
Sbjct: 304 ERYEMLCRGEVSIPPEIQKNLVCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDEE 363
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I +++ + +R V NY G + R+SK +L +H + + R++ MT
Sbjct: 364 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHEHVAAVSRRVEHMT 420
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
++ + E LQ+ NYG+GGHY+ H D ++E R+A+ ++Y++DVE
Sbjct: 421 SMTVDTAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 476
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F ++N++++P+KGSA FWYN N D++ H+ CPV G+KW
Sbjct: 477 QGGGTVFTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKW 526
>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
Length = 415
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 134/230 (58%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y + C+G +S+P +++ NLKC Y FLKI P K EE YLDPR+V H+ IYD E
Sbjct: 162 ERYEMLCRGEVSIPLEVEKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVFYHNVIYDEE 221
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I +++ + +R V NY G + R+SK +L +H + + R++ MT
Sbjct: 222 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHKHVAAVSKRVEHMT 278
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
++ + E LQ+ NYG+GGHY+ H D ++E R+A+ ++Y++DVE
Sbjct: 279 SMSVETAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 334
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F ++N++++P KGSA FWYN N D++ H+ CPV G+KW
Sbjct: 335 QGGGTVFTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKW 384
>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
Length = 415
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 134/230 (58%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y + C+G +S+P +++ NLKC Y FLKI P K EE YLDPR+V H+ IYD E
Sbjct: 162 ERYEMLCRGEVSIPPEVEKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDEE 221
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I +++ + +R V NY G + R+SK +L +H + + R++ MT
Sbjct: 222 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHKHVAAVSKRVEHMT 278
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
++ + E LQ+ NYG+GGHY+ H D ++E R+A+ ++Y++DVE
Sbjct: 279 SMSVETAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 334
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F ++N++++P KGSA FW+N N D++ H+ CPV G+KW
Sbjct: 335 QGGGTVFTAINISLWPRKGSAAFWHNLKPNGEGDFKTRHAACPVLTGSKW 384
>gi|157114985|ref|XP_001658091.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108877086|gb|EAT41311.1| AAEL007038-PA [Aedes aegypti]
Length = 545
Score = 167 bits (424), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 90/230 (39%), Positives = 130/230 (56%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G S LKC Y + + FLKI PLK+EE L P +V HD I ++E
Sbjct: 296 KLYEQLCRGEAERSVAETSKLKCRYVTNKSPFLKIAPLKLEEANLKPYIVIYHDVISEAE 355
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ + L+K + R V NY G+ + R+SK +L +HP++ I R++DMT
Sbjct: 356 MELVKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLKDH---EHPYIKAIGERVEDMT 412
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GGHY+ H D R+E R+A+ +FY++DV
Sbjct: 413 GLTMSTAEE----LQVVNYGIGGHYEPHFDFARREETNAFKSLGTGNRIATVLFYMSDVT 468
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FPSL L ++P+KG+A FW+N HA+ DY H+ CPV G KW
Sbjct: 469 QGGATVFPSLRLALWPKKGAAAFWFNLHASGQGDYSTRHAACPVLTGTKW 518
>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 509
Score = 167 bits (422), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 93/225 (41%), Positives = 133/225 (59%), Gaps = 14/225 (6%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
+ L C+G+ P S+L C Y + ++FL++ PLK E L LDP + HD D EI+
Sbjct: 265 HELLCRGDYQRPASETSHLYCRYHTGTSSFLRLAPLKEEVLNLDPFITVYHDVASDREIS 324
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
++IEL+K ++ R + + G+ + R S+ +L GD + + R+ DMT +
Sbjct: 325 KLIELAKSRISRATIRDDGEPQVSNARTSQNAWLDA---GDDRVVTTLDRRVGDMTGGL- 380
Query: 123 GREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW---RLASFMFYLTDVELGGAT 175
R++ Y+ LQ+NNYG+GGHY H D A P GL R+A+ MFYL+DVE+GGAT
Sbjct: 381 -RQQSYE-MLQVNNYGVGGHYVAHHDWAMEAVPY-AGLRVGNRIATVMFYLSDVEIGGAT 437
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+FP L L VFP KGSA+ WYN + N D R H+ CPV G+KW
Sbjct: 438 VFPQLGLAVFPRKGSAILWYNLYRNGKGDRRTLHAACPVLSGSKW 482
>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
Length = 516
Score = 166 bits (421), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 91/226 (40%), Positives = 130/226 (57%), Gaps = 14/226 (6%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G+ P SNL C Y + FL++ PLK E + LDP V HDA D+EI
Sbjct: 275 LYEPLCRGDHQRPPSETSNLYCRYHMSTSPFLRLAPLKQEVVNLDPFVAVYHDAASDAEI 334
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
N++IEL + ++ R V + +R S+ +L DHP + + R +DM
Sbjct: 335 NKVIELGRPQINRSMVGDAAKKEVSKSRTSQNSWL---TDYDHPVVAALSRRTKDM---A 388
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVELGGA 174
+G +E LQ+NNYG+GGHY H D + R+E + R+A+ MFYL+DVE GGA
Sbjct: 389 LGLDETAYESLQVNNYGIGGHYLPHYDWS-REENPYPELNTGNRIATLMFYLSDVEEGGA 447
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T+FP L + VFP+KG+A+FWYN A+ D + H CPV +G+KW
Sbjct: 448 TVFPHLGVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPVLIGSKW 493
>gi|195505207|ref|XP_002099404.1| GE23380 [Drosophila yakuba]
gi|194185505|gb|EDW99116.1| GE23380 [Drosophila yakuba]
Length = 540
Score = 166 bits (420), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 90/227 (39%), Positives = 134/227 (59%), Gaps = 15/227 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G L + NL+C+ + ++ P K+E+L LDP V +H+ ++DSE
Sbjct: 290 KLYTQVCRGELHQTPREQRNLRCWLTHQGVPYYRLAPFKIEQLNLDPYVAYVHEVLWDSE 349
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I+E KG ++R V G++ + R S+ +L+ + +P+L KI+ R++D+T L
Sbjct: 350 IDMIMEHGKGNMKRSMVGQSGNSTTTEIRTSQNTWLW---YDANPWLAKIKQRLEDVTGL 406
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W---RLASFMFYLTDVELGG 173
E PLQ+ NYG+GG Y+ H D D+G W RLA+ +FYL DV LGG
Sbjct: 407 STESAE----PLQLVNYGIGGQYEPHFDFM-EDDGQKVFGWKGNRLATALFYLNDVALGG 461
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
AT FP L L V P KGS + WYN H++T D+R H+GCPV G+KW
Sbjct: 462 ATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 508
>gi|195341548|ref|XP_002037368.1| GM12149 [Drosophila sechellia]
gi|194131484|gb|EDW53527.1| GM12149 [Drosophila sechellia]
Length = 537
Score = 166 bits (419), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 89/226 (39%), Positives = 131/226 (57%), Gaps = 13/226 (5%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G L + NL+C+ + + P K+E+L +DP V +H+ ++DSE
Sbjct: 287 KLYTQVCRGELHQSPREQRNLRCWLSHQGVLYYHLSPFKIEQLNIDPYVAYVHEVLWDSE 346
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ IIE KG +ER KV ++ + R+S+ +L+ + +P+L KI+ R++D+T L
Sbjct: 347 IDTIIEHGKGNMERSKVGQIENSTTTEVRISRNTWLW---YDANPWLSKIKQRLEDVTGL 403
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL---W---RLASFMFYLTDVELGGA 174
E PLQ+ NYG+GG Y+ H D D W RL + +FYL DV LGGA
Sbjct: 404 STESAE----PLQLVNYGIGGQYEPHFDFVEDDGKTVFSWKGNRLLTALFYLNDVALGGA 459
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP L L V P KGS + WYN H++T D+R H+GCPV G+KW
Sbjct: 460 TAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 505
>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
Length = 545
Score = 165 bits (417), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 131/229 (57%), Gaps = 17/229 (7%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G E + L+C Y + ++ FLKI PLK+EE +L+P +V H+ + D+EI
Sbjct: 297 LYEQLCRGEAHRAEADLAKLRCRYVTNSSPFLKIAPLKLEEAHLEPYIVIYHEVMSDAEI 356
Query: 62 NRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
I L+K + R V NY G+ + R+SK +L E +H + + R++DMT
Sbjct: 357 EVIKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLKDE---EHSVVRTVGQRVEDMTG 413
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
L + E LQ+ NYG+GGHY+ H D R+E R+A+ +FY++DV
Sbjct: 414 LTMTTAEE----LQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRIATVLFYMSDVSQ 469
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FPS+ + + P+KG+A FWYN HA+ DY H+ CPV G KW
Sbjct: 470 GGATVFPSIRVALRPKKGTAAFWYNLHASGHGDYATRHAACPVLTGTKW 518
>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
Length = 509
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/224 (37%), Positives = 136/224 (60%), Gaps = 11/224 (4%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E+Y C+ +S+PE S LKCFY++ N+ FL+I P KVE+ +LDP ++ H+ + D E
Sbjct: 275 EVYKKLCRAEISLPEAKSSKLKCFYQNSNHPFLRIAPFKVEQAHLDPDILIFHNVLSDCE 334
Query: 61 INRIIELSKGKVERGKVVN-YGDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L++ ++ N + + + R+SKV +L + +H L + R+ MT
Sbjct: 335 IETMKQLAQSRLVTAVFENPHSKQLELFPFRISKVAWLEDQ---EHQHLAVVAQRVAHMT 391
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGL-WRLASFMFYLTDVELGGATI 176
L + E + Q+ NYG+GGHY+ H D + D + R+ + +FYL+DVE GGAT+
Sbjct: 392 GLTLSTAEEF----QVVNYGIGGHYEPHFDFQSTVDPAIGSRIETVLFYLSDVEQGGATV 447
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP + ++V+P+KGSAV W+N H + D R H+GCPV +G+KW
Sbjct: 448 FPEIQVSVWPQKGSAVVWFNLHPSGDGDQRTKHAGCPVLIGSKW 491
>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
Length = 515
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/224 (37%), Positives = 133/224 (59%), Gaps = 11/224 (4%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E+Y C+ +S+PE S LKCFY++ N+ FL+I P KVE+ +LDP ++ H+ + D E
Sbjct: 281 EVYKKLCRAEISLPEAKSSKLKCFYQNSNHPFLRIAPFKVEQAHLDPDILIFHNVLSDCE 340
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L++ ++ N R+SKV +L + +H L + R+ MT
Sbjct: 341 IETMKQLAQSRLVTAVFENPHSKQLELFPFRISKVAWLEDQ---EHQHLAVVAQRVAHMT 397
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGL-WRLASFMFYLTDVELGGATI 176
L + E + Q+ NYG+GGHY+ H D + D + R+ + +FYL+DVE GGAT+
Sbjct: 398 GLTLSTAEEF----QVVNYGIGGHYEPHFDFQSTVDPAIGSRIETVLFYLSDVEQGGATV 453
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP + ++V+P+KGSAV W+N H + D R H+GCPV +G+KW
Sbjct: 454 FPEIQVSVWPQKGSAVVWFNLHPSGDGDQRTKHAGCPVLIGSKW 497
>gi|24651424|ref|NP_733376.1| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
gi|23172697|gb|AAF57059.2| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
gi|66772443|gb|AAY55533.1| IP03659p [Drosophila melanogaster]
gi|220951214|gb|ACL88150.1| PH4alphaSG1-PA [synthetic construct]
gi|220959938|gb|ACL92512.1| PH4alphaSG1-PA [synthetic construct]
Length = 540
Score = 164 bits (415), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 89/227 (39%), Positives = 134/227 (59%), Gaps = 15/227 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G L + NL+C+ + ++ P K+E+L +DP V +H+ ++DSE
Sbjct: 290 KLYTQVCRGELHQSPREQRNLRCWLYHQGVPYYRLSPFKIEQLNVDPYVAYVHEVLWDSE 349
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I+E KG +ER KV ++ + R+S+ +L+ + +P+L KI+ R++D+T L
Sbjct: 350 IDTIMEHGKGNMERSKVGQSENSTTSEVRISRNTWLW---YDANPWLSKIKQRLEDVTGL 406
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W---RLASFMFYLTDVELGG 173
E PLQ+ NYG+GG Y+ H D D+G W RL + +FYL DV LGG
Sbjct: 407 STESAE----PLQLVNYGIGGQYEPHFDFV-EDDGQSVFSWKGNRLLTALFYLNDVALGG 461
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
AT FP L L V P KGS + WYN H++T D+R H+GCPV G+KW
Sbjct: 462 ATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 508
>gi|66772331|gb|AAY55477.1| IP03959p [Drosophila melanogaster]
gi|66772361|gb|AAY55492.1| IP03859p [Drosophila melanogaster]
Length = 541
Score = 164 bits (415), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 89/227 (39%), Positives = 134/227 (59%), Gaps = 15/227 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G L + NL+C+ + ++ P K+E+L +DP V +H+ ++DSE
Sbjct: 291 KLYTQVCRGELHQSPREQRNLRCWLYHQGVPYYRLSPFKIEQLNVDPYVAYVHEVLWDSE 350
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I+E KG +ER KV ++ + R+S+ +L+ + +P+L KI+ R++D+T L
Sbjct: 351 IDTIMEHGKGNMERSKVGQSENSTTSEVRISRNTWLW---YDANPWLSKIKQRLEDVTGL 407
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W---RLASFMFYLTDVELGG 173
E PLQ+ NYG+GG Y+ H D D+G W RL + +FYL DV LGG
Sbjct: 408 STESAE----PLQLVNYGIGGQYEPHFDFV-EDDGQSVFSWKGNRLLTALFYLNDVALGG 462
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
AT FP L L V P KGS + WYN H++T D+R H+GCPV G+KW
Sbjct: 463 ATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 509
>gi|20269816|gb|AAM18063.1|AF495541_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG1
[Drosophila melanogaster]
Length = 540
Score = 164 bits (414), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 89/227 (39%), Positives = 134/227 (59%), Gaps = 15/227 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G L + NL+C+ + ++ P K+E+L +DP V +H+ ++DSE
Sbjct: 290 KLYTEVCRGELHQSPREQRNLRCWLSHQGVPYYRLFPFKIEQLNIDPYVAYVHEVLWDSE 349
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I+E KG +ER KV ++ + R+S+ +L+ + +P+L KI+ R++D+T L
Sbjct: 350 IDTIMEHGKGNMERSKVGQSENSTTSEVRISRNTWLW---YDANPWLSKIKQRLEDVTGL 406
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W---RLASFMFYLTDVELGG 173
E PLQ+ NYG+GG Y+ H D D+G W RL + +FYL DV LGG
Sbjct: 407 STESAE----PLQLVNYGIGGQYEPHFDFV-EDDGQSVFSWKGNRLLTALFYLNDVALGG 461
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
AT FP L L V P KGS + WYN H++T D+R H+GCPV G+KW
Sbjct: 462 ATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 508
>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
Length = 534
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 84/229 (36%), Positives = 129/229 (56%), Gaps = 17/229 (7%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+ + + E +K+ LKC Y + FL + +K EE +LDPR+V HD + D EI
Sbjct: 289 MYEKLCRNEVGLSEKMKAKLKCRYVDFGRPFLMLAKVKEEEAFLDPRIVLYHDVLSDREI 348
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
I +L+ + +R V N G R+SK +L DHP++ K+ R++D+T
Sbjct: 349 KTIQQLAVPRFKRATVQNSETGKLEVAHYRISKSAWLED---VDHPYVAKVSQRVEDITG 405
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
L + E LQ+ NYG+GGHY+ H D ++E R+A+ +FY++DV
Sbjct: 406 LNMATAE----SLQVVNYGIGGHYEPHFDFARKEEKNAFQSLGTGNRIATILFYMSDVSQ 461
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + ++++P+KG+A FWYN N DY H+ CPV G+KW
Sbjct: 462 GGATVFPGIKVSLWPKKGTAAFWYNLRKNGEGDYLTRHAACPVLTGSKW 510
>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
Length = 542
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 85/225 (37%), Positives = 126/225 (56%), Gaps = 12/225 (5%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G L + L+C + N F ++ P KVE+L LDP V H+AI SE
Sbjct: 293 QLYKRVCRGELRQSPRQQRKLRCLFSHQNVAFYRLAPFKVEQLNLDPYVAYFHEAINSSE 352
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ +IIE G +ER +V + + R S +L+ + ++P+L KI+ R++D+T L
Sbjct: 353 MEQIIEKGLGSMERSRVGQSQNATTSEIRTSANTWLW---YNENPWLSKIKQRLEDITGL 409
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGAT 175
E PLQ+ NYG+GG Y+ H D + ++ R+ + +FY+ DV LGGAT
Sbjct: 410 STESAE----PLQLVNYGIGGQYEPHFDFVEEPQKVFGWKGNRMLTALFYINDVALGGAT 465
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP L L V P KGS + WYN H + D+R H+GCPV G+KW
Sbjct: 466 AFPFLQLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVIKGSKW 510
>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
Length = 549
Score = 160 bits (405), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 88/231 (38%), Positives = 128/231 (55%), Gaps = 19/231 (8%)
Query: 1 EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
E+Y C G++ P +++ L+C Y + + FL + PLKVEEL DP +V HD IY S
Sbjct: 288 ELYRHTCNGHIRPTPSELR-QLRCGYMTETHPFLLLAPLKVEELSHDPLLVLFHDVIYQS 346
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
EI+ ++ L+K K+ R V + ++ + R S+ FL P+ H L I R+ DMT+
Sbjct: 347 EIDTLMRLAKNKIHRATVTGHNSSVVSNARTSQFTFL-PKT--RHKVLRTIDQRVADMTD 403
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD----------ATPRDEGLWRLASFMFYLTDV 169
L + Y Q+ NYG+GGHY H D E R+ + +FYL+DV
Sbjct: 404 LHL----EYAEDHQLANYGIGGHYAQHMDWFYPITFETKQVSNPEMGNRIGTVLFYLSDV 459
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGAT FP+L + P+K +A FWYN HA+ + D R H CP+ +G+KW
Sbjct: 460 EQGGATAFPALKQLLRPKKHAAAFWYNLHASGVGDARTMHGACPIIVGSKW 510
>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
Length = 550
Score = 159 bits (402), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y L C G+ + +S+L+C Y + + FL I PLK EEL+ DP +V HD IY SE
Sbjct: 284 QAYSLTCSGHWRLTPKEQSHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSE 343
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I +L++ ++ R V + +++ + R S+ F+ H L I R+ DMTNL
Sbjct: 344 IDVIRKLTENRLMRATVTGHNESLVSNVRTSQFTFIPASA---HKVLSTIDQRVADMTNL 400
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVE 170
+ +Y Q NYG+GGHY H D T D GL R+A+ +FYL+DV
Sbjct: 401 NM----KYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGLVSSPEMGNRIATVLFYLSDVS 456
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L + P+K +A FW+N HA+ + D R H CP+ G+KW
Sbjct: 457 QGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 506
>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
Length = 525
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 124/227 (54%), Gaps = 20/227 (8%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G KS L C Y S N+ FL++ PLK E L LDP +V HD I SEI
Sbjct: 289 YERGCRGQFPT----KSKLHCVYNSTNSPFLRLAPLKTELLALDPYMVLYHDVITPSEIR 344
Query: 63 RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ L+ ++R V N G V TR SKV +L + +P ++ RI DMT
Sbjct: 345 ELQYLAVPTLKRATVFNQKMGRNTVVKTRTSKVTWLTDSL---NPLTVRLNRRISDMTGF 401
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW---RLASFMFYLTDVELGG 173
+ E LQ+ NYGLGGHYDLH D +D R+A+ +FYLTDVE GG
Sbjct: 402 DLYGSEM----LQVMNYGLGGHYDLHFDYFNATIAKDLTKLNGDRIATVLFYLTDVEQGG 457
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
AT+FP++ +FP+KG+AV WYN N D + H+ CPV +G+KW
Sbjct: 458 ATVFPNIKQAIFPKKGTAVMWYNLRHNNDGDPQTLHAACPVIVGSKW 504
>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 520
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 89/228 (39%), Positives = 134/228 (58%), Gaps = 16/228 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G+ P ++ S L C YE+ FL++ PLK+E + L+P +V H+A+ D E
Sbjct: 278 KLYEKLCRGDYERPGEVTSQLFCRYETSATPFLRLAPLKLEVVNLEPLIVVYHEAVSDRE 337
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDH--PFLYKIQTRIQDMT 118
I ++IEL++ ++R V GDT ++SK+ F + P + + R +DM
Sbjct: 338 IAKLIELARPLIKRSAV---GDT--RSEQISKIRISQNAWFENEHDPIVETLNQRARDMA 392
Query: 119 NLVIGREERYKGPLQINNYGLGG----HYDLHCDATP-RDEGLW-RLASFMFYLTDVELG 172
G E LQ+NNYGLGG HYD A P ++G+ R+A+ MFYL+DV+ G
Sbjct: 393 G---GLNEPSYELLQVNNYGLGGFYSIHYDWSTSANPFPNKGMGNRIATLMFYLSDVQEG 449
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G+T+FP LNL V P KG+A+FWYN H N + + H+ CPV +G+KW
Sbjct: 450 GSTVFPRLNLAVRPRKGTAIFWYNLHRNGKGNKKTLHAACPVLIGSKW 497
>gi|170029530|ref|XP_001842645.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
gi|167863229|gb|EDS26612.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
Length = 522
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 89/225 (39%), Positives = 130/225 (57%), Gaps = 16/225 (7%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G + D S L+C ++ FL++ PLKVEE+ L+P + H I D EI
Sbjct: 273 LYEPLCRGEVHRFADELSKLRCRLDTKTTPFLRLAPLKVEEVSLEPPIYLYHKVISDEEI 332
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
+++IEL K ++ R V + R+S+ +L E+ P L +Q R DM+
Sbjct: 333 DKLIELGKARLNRATV----GQMVSQVRISQNVWLSEEV---DPLLGVLQRRTYDMSR-- 383
Query: 122 IGREERYKGPLQINNYGLGGHYDLH--CDAT----PRDEGLWRLASFMFYLTDVELGGAT 175
G + +Q+NNYG+GGH H CD+ P+ RLA+ M+YL+DVE+GG T
Sbjct: 384 -GLSMQGFDMVQVNNYGIGGHNIPHYDCDSEYPPFPQFNMGNRLATLMYYLSDVEVGGGT 442
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+FP L+L VFP KGSA+FW+N H N +D RM H+GCP +G+KW
Sbjct: 443 VFPRLSLGVFPIKGSAIFWHNVHHNGNVDERMLHAGCPTLIGSKW 487
>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
Length = 521
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y L C G+ + + +L+C Y + + FL I PLK EEL+ DP +V HD IY SE
Sbjct: 256 QAYSLTCSGHWRLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSE 315
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I +L++ +++R V + +++ + R S+ F+ H L I R+ DMTNL
Sbjct: 316 IDVIRKLTENRLKRATVTGHNESVVSNVRTSQFTFI---PVSAHKVLSTIDQRVADMTNL 372
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVE 170
+ +Y Q NYG+GGHY H D T D GL R+A+ +FYL+DV
Sbjct: 373 NM----KYAEDHQFANYGIGGHYGQHMDWFYQTTIDAGLISSPEMGNRIATVLFYLSDVS 428
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L + P+K +A FW+N HA+ + D R H CP+ G+KW
Sbjct: 429 QGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 478
>gi|194905436|ref|XP_001981196.1| GG11753 [Drosophila erecta]
gi|190655834|gb|EDV53066.1| GG11753 [Drosophila erecta]
Length = 550
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 92/230 (40%), Positives = 132/230 (57%), Gaps = 21/230 (9%)
Query: 3 YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y + C+G L P D++S L+C Y + FL++GPLK+EE + DP +V HDA+YD EI
Sbjct: 302 YEMLCRGELKPSPSDLRS-LRCRYVTNGVPFLRLGPLKLEEAHADPYIVIFHDAMYDGEI 360
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL-YPEIFGDHPFLYKIQTRIQDMT 118
+ I +++ + R V N G + R+SK +L PE H + + R DMT
Sbjct: 361 DLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTPE----HRVIETVVQRTADMT 416
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVE 170
L + E LQ+ NYG+GGHY+ H D ++E GL R+A+ +FY++DVE
Sbjct: 417 GLDMDSAEE----LQVVNYGIGGHYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVE 472
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F SL+ +FP+KG+A FW N H + D R H+ CPV G KW
Sbjct: 473 QGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKW 522
>gi|194905397|ref|XP_001981189.1| GG11929 [Drosophila erecta]
gi|190655827|gb|EDV53059.1| GG11929 [Drosophila erecta]
Length = 538
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 83/225 (36%), Positives = 129/225 (57%), Gaps = 12/225 (5%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G L + NL+C+ + ++ P K E+L LDP V +H ++DSE
Sbjct: 289 QLYTQLCRGELHQSPREQRNLRCWLSHQGVPYYRLSPFKFEQLNLDPYVALVHHVLWDSE 348
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ I++ +G +ER KV ++ D R S+ +L+ ++ +P+L +I+ R++D+T L
Sbjct: 349 MEMIMQHGRGSMERSKVGQSENSKIADRRTSQNTWLWYDV---NPWLSRIKQRLEDVTGL 405
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGAT 175
E PLQ+ NYG+GG Y+ H D E ++ RL + +FY+ DV LGGAT
Sbjct: 406 STESAE----PLQLLNYGIGGQYEPHFDFVEDAEKIFGWQDDRLMTAIFYINDVALGGAT 461
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP L L V PEKGS + W N H++ DYR H+GCP+ G+KW
Sbjct: 462 AFPFLRLAVPPEKGSLLMWNNLHSSLHKDYRSKHAGCPILQGSKW 506
>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
Length = 550
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 133/229 (58%), Gaps = 19/229 (8%)
Query: 3 YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y + C+G L P D++S L+C Y + FL++GPLK+EE++ DP +V HDA+YDSEI
Sbjct: 302 YEMLCRGELKPSPSDLRS-LRCRYVTNRVPFLRLGPLKLEEVHADPYIVIYHDAMYDSEI 360
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+ I +++ + R V N G + R+SK +L + + + + R DMT
Sbjct: 361 DLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTQ---EDRVIETVVQRTADMTG 417
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVEL 171
L + E LQ+ NYG+GGHY+ H D ++E GL R+A+ +FY++DVE
Sbjct: 418 LDMDSAEE----LQVVNYGIGGHYEPHFDFARKEEERAFEGLNLGNRIATVLFYMSDVEQ 473
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F SL+ +FP+KG+A FW N H + D R H+ CPV G KW
Sbjct: 474 GGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKW 522
>gi|195452742|ref|XP_002073480.1| GK13123 [Drosophila willistoni]
gi|194169565|gb|EDW84466.1| GK13123 [Drosophila willistoni]
Length = 540
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/225 (37%), Positives = 126/225 (56%), Gaps = 12/225 (5%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G L + L+CFY F ++GP KVE+L LDP V H+ I D E
Sbjct: 286 DLYQRVCRGELRQSPRQQRKLRCFYSDRGVAFYRLGPFKVEQLNLDPYVAYFHNVISDDE 345
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ +IE G+V+R +V G++ + R S+ +L+ + P+L ++ R++D+T L
Sbjct: 346 TDDLIEHGMGQVKRSRVGTVGNSTVSEVRTSQNTWLW---YEQQPWLKNLKLRLEDITGL 402
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGAT 175
+ E PLQ+ NYG+GGHY+ H D + RL + + YL +V +GGAT
Sbjct: 403 GMESAE----PLQLVNYGIGGHYEPHYDFVEDKVTTFGWKGNRLLTALLYLNEVPMGGAT 458
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP L L V P KGS + WYN H + D+R H+GCPV +G+KW
Sbjct: 459 AFPYLKLAVPPVKGSLLVWYNLHRSLDPDFRTKHAGCPVLMGSKW 503
>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
[Drosophila melanogaster]
gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
Length = 550
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 133/229 (58%), Gaps = 19/229 (8%)
Query: 3 YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y + C+G L P D++S L+C Y + FL++GPLK+EE++ DP +V HDA+YDSEI
Sbjct: 302 YEMLCRGELKPSPSDLRS-LRCRYVTNRVPFLRLGPLKLEEVHADPYIVIYHDAMYDSEI 360
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+ I +++ + R V N G + R+SK +L + + + + R DMT
Sbjct: 361 DLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTQ---EDRVIETVVQRTADMTG 417
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVEL 171
L + E LQ+ NYG+GGHY+ H D ++E GL R+A+ +FY++DVE
Sbjct: 418 LDMDSAEE----LQVVNYGIGGHYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVEQ 473
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F SL+ +FP+KG+A FW N H + D R H+ CPV G KW
Sbjct: 474 GGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKW 522
>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
Length = 550
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 131/229 (57%), Gaps = 19/229 (8%)
Query: 3 YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y + C+G L P D++S L+C Y + FL++GPLK+EE++ DP +V HDA+YDSEI
Sbjct: 302 YEMLCRGELKPSPSDLRS-LRCRYVTNRVPFLRLGPLKLEEVHADPYIVIYHDAMYDSEI 360
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+ I +++ + R V N G + R+SK +L + + + + R DMT
Sbjct: 361 DLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTQ---EDRVIETVVQRTADMTG 417
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
L + E LQ+ NYG+GGHY+ H D ++E R+A+ +FY++DVE
Sbjct: 418 LDMDSAEE----LQVVNYGIGGHYEPHFDFARKEEERAFEGINLGNRIATVLFYMSDVEQ 473
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F SL+ +FP+KG+A FW N H + D R H+ CPV G KW
Sbjct: 474 GGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKW 522
>gi|195391754|ref|XP_002054525.1| GJ24502 [Drosophila virilis]
gi|194152611|gb|EDW68045.1| GJ24502 [Drosophila virilis]
Length = 487
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/231 (38%), Positives = 133/231 (57%), Gaps = 19/231 (8%)
Query: 1 EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y + C+G L P +++ L+C Y + N FL++ PLK+EE Y+DP +V HDA+YDS
Sbjct: 237 KAYEMLCRGELKPSPSELRP-LRCRYVNNNVAFLRLAPLKLEEAYMDPYIVIYHDAMYDS 295
Query: 60 EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
EI I +++ + R V N G + R+SK +L +H + + R DM
Sbjct: 296 EIEIIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKT---AEHRVIGTVVQRTADM 352
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDV 169
T L + E LQ+ NYG+GGHY+ H D R+E GL R+A+ +FY++DV
Sbjct: 353 TGLDMDSAEE----LQVVNYGIGGHYEPHFDFARREEKRAFEGLNLGNRIATMLFYMSDV 408
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGAT+F SL+ ++P+KG+A FW N H + D R H+ CPV G+KW
Sbjct: 409 EQGGATVFTSLHAALWPKKGTAAFWMNLHRSGEGDVRTRHAACPVLTGSKW 459
>gi|195110919|ref|XP_002000027.1| GI24860 [Drosophila mojavensis]
gi|193916621|gb|EDW15488.1| GI24860 [Drosophila mojavensis]
Length = 487
Score = 157 bits (398), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 88/230 (38%), Positives = 130/230 (56%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y + C+G L + + L+C Y S N FL++ PLK+EE +LDP +V HDA++DSE
Sbjct: 237 KAYEMLCRGELKLSPSVLRPLRCRYVSNNVPFLRLAPLKLEEAFLDPYIVIYHDAMFDSE 296
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +++ + R V N G + R+SK +L +H + + R DMT
Sbjct: 297 IEVLKRMARPRFRRATVQNAVTGALETANYRISKSAWLKT---AEHRVIGTVVQRTADMT 353
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVE 170
L + E LQ+ NYG+GGHY+ H D R+E GL R+A+ +FY++DVE
Sbjct: 354 GLDMDSAEE----LQVVNYGIGGHYEPHFDFARREEIRAFEGLNLGNRIATVLFYMSDVE 409
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F SL+ + P+KG+A FW N H + D R H+ CPV G+KW
Sbjct: 410 QGGATVFTSLHAVLKPKKGTAAFWMNLHRSGEGDVRTRHAACPVLTGSKW 459
>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
Length = 522
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 86/218 (39%), Positives = 126/218 (57%), Gaps = 11/218 (5%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G + S+LKC Y S + F KIGP K+EE++L P++V HD + D+EI +
Sbjct: 289 CRGEIQRNVSETSHLKCRYVSNLSAFSKIGPFKLEEMHLKPKIVIFHDVLSDTEIELLKR 348
Query: 67 LSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
L+K +ER + N G R+SK + +P+ + H + I R+ DMT L +
Sbjct: 349 LAKPILERATIANQQTGKAERSKDRVSKSSW-FPDEY--HSTIRTITKRVADMTGLSMDT 405
Query: 125 EERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL 182
E LQ+ NYGLGG YD H D + + + R+A+ +FY++DV +GGAT+FP L +
Sbjct: 406 AEE----LQVVNYGLGGQYDPHFDFFHWGKLKEVNRIATVLFYMSDVSIGGATVFPKLGV 461
Query: 183 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T+ KG+A FWYN H++ LDY H CPV +G KW
Sbjct: 462 TLEARKGTAAFWYNLHSSGELDYSTLHGACPVLIGEKW 499
>gi|195452726|ref|XP_002073473.1| GK14136 [Drosophila willistoni]
gi|194169558|gb|EDW84459.1| GK14136 [Drosophila willistoni]
Length = 550
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 133/229 (58%), Gaps = 19/229 (8%)
Query: 3 YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y + C+G L P D++ L+C Y + N FL++GPLK+EE ++DP +V HDA+YDSE+
Sbjct: 302 YEMLCRGELKPSPADLRP-LRCRYVTNNVPFLRLGPLKLEEAHMDPYIVIYHDAMYDSEM 360
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+ I +++ + R V N G + R+SK +L E + + + R DMT
Sbjct: 361 DLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTE---EDQVIGTVVQRTADMTG 417
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVEL 171
L + E LQ+ NYG+GGHY+ H D R+E GL R+A+ +FY++DVE
Sbjct: 418 LDMDSAEE----LQVVNYGIGGHYEPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQ 473
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F SL+ ++P+KG+A FW N H + D R H+ CPV G KW
Sbjct: 474 GGATVFTSLHAALWPKKGTAAFWMNLHRDGEGDVRTRHAACPVLTGTKW 522
>gi|195055779|ref|XP_001994790.1| GH14110 [Drosophila grimshawi]
gi|193892553|gb|EDV91419.1| GH14110 [Drosophila grimshawi]
Length = 487
Score = 157 bits (396), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 89/230 (38%), Positives = 131/230 (56%), Gaps = 21/230 (9%)
Query: 3 YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y + C+G L P +I+ L+C Y + N FL++ PLK+EE ++DP +V HDA+YDSEI
Sbjct: 239 YEMLCRGELKPSPAEIRP-LRCRYVNNNVDFLRLAPLKLEEAFMDPYIVIYHDAMYDSEI 297
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL-YPEIFGDHPFLYKIQTRIQDMT 118
+ +++ + R V N G + R+SK +L PE H + + R DMT
Sbjct: 298 EVLKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTPE----HEIIGTVVQRTADMT 353
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GGHY+ H D R+E L R+A+ +FY++DV+
Sbjct: 354 GLDMDSAEE----LQVVNYGIGGHYEPHFDFARREEKLAFEGLNLGNRIATMLFYMSDVQ 409
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F SL ++P+KG+A FW N H + D R H+ CPV G+KW
Sbjct: 410 QGGATVFTSLRTALWPKKGTAAFWMNLHRSGEGDARTRHAACPVLTGSKW 459
>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
Length = 546
Score = 157 bits (396), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y L C G+ + + +L+C Y + + FL I PLK EEL+ DP +V HD IY SE
Sbjct: 287 QAYSLTCSGHWRLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSE 346
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I +L++ ++ R + ++ +++ + R S+ F+ H L I R+ DMTNL
Sbjct: 347 IDVIRKLTENRLMRATITSHNESVVSNVRTSQFTFI---PVTAHKVLSTIDQRVADMTNL 403
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVE 170
+ +Y Q NYG+GGHY H D T D GL R+A+ +FYL+DV
Sbjct: 404 NM----KYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGLVSSPEMGNRIAAVLFYLSDVA 459
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L + P+K +A FW+N HA+ + D R H CP+ G+KW
Sbjct: 460 QGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 509
>gi|194765194|ref|XP_001964712.1| GF22904 [Drosophila ananassae]
gi|190614984|gb|EDV30508.1| GF22904 [Drosophila ananassae]
Length = 547
Score = 156 bits (395), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 93/229 (40%), Positives = 132/229 (57%), Gaps = 19/229 (8%)
Query: 3 YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y + C+G L P D++ L+C Y + N FL++GPLK+EE + +P +V HDA+YDSEI
Sbjct: 299 YEMLCRGELKPSPADLRP-LRCRYVTNNVPFLRLGPLKLEEAHQEPYIVIYHDAMYDSEI 357
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
I +++ + R V N G + R+SK +L E DH +Q R DMT
Sbjct: 358 ELIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTE--EDHVIGTVVQ-RTADMTG 414
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVEL 171
L + E LQ+ NYG+GGHY+ H D ++E GL R+A+ +FY++DVE
Sbjct: 415 LDMDSAEE----LQVVNYGIGGHYEPHFDFARKEEKRAFEGLNLGNRIATVLFYMSDVEQ 470
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F SL+ +FP+KG+A FW N H + D R H+ CPV G KW
Sbjct: 471 GGATVFTSLHTALFPKKGTAAFWMNLHRDGEGDVRTRHAACPVLTGTKW 519
>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
Length = 547
Score = 156 bits (395), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y L C G+ + + +L+C Y + + FL I PLK EEL+ DP +V HD IY SE
Sbjct: 287 QAYSLTCSGHWRLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSE 346
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I +L++ ++ R + ++ +++ + R S+ F+ H L I R+ DMTNL
Sbjct: 347 IDVIRKLTENRLMRATITSHNESVVSNVRTSQFTFI---PVTAHKVLSTIDQRVADMTNL 403
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVE 170
+ +Y Q NYG+GGHY H D T D GL R+A+ +FYL+DV
Sbjct: 404 NM----KYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGLVSSPEMGNRIATVLFYLSDVA 459
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L + P+K +A FW+N HA+ + D R H CP+ G+KW
Sbjct: 460 QGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 509
>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 156 bits (395), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 86/225 (38%), Positives = 126/225 (56%), Gaps = 12/225 (5%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E+Y C+G L + L+C+ + + ++ P KVE+L DP V HD + D E
Sbjct: 300 ELYQRVCRGELRQSPKEQRYLRCWLSHQDVPYQRLSPFKVEQLSGDPYVAYFHDVLSDKE 359
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+IIE KG+V R ++ G++ D R S+ +L+ + ++P+L I+ R++D+T L
Sbjct: 360 SEQIIEHGKGQVTRSEIGQTGNSTVSDIRTSQNTWLW---YENNPWLADIKQRLEDITGL 416
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGAT 175
E PLQ+ NYG+GG Y+ H D E + RL + +FYL DV LGGAT
Sbjct: 417 STDTAE----PLQLVNYGIGGQYEPHFDFMDDAEKNFGWKGNRLLTALFYLNDVPLGGAT 472
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP L+L V P KGS + WYN H + D+R H+GCPV G+KW
Sbjct: 473 AFPFLHLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLKGSKW 517
>gi|125772807|ref|XP_001357662.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
gi|54637394|gb|EAL26796.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 155 bits (392), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 87/228 (38%), Positives = 128/228 (56%), Gaps = 17/228 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G L +L+C Y + N FL++GPLK+EE + DP +V HDA+YDSE++
Sbjct: 301 YEMLCRGELKPSPTYMRSLRCRYVTNNVPFLRLGPLKLEEAHKDPYIVIYHDAMYDSEMD 360
Query: 63 RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I +++ + R V N G + R+SK +L E + + K+ R DMT L
Sbjct: 361 LIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTE---EDSVIAKVVQRTADMTGL 417
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVELG 172
+ E LQ+ NYG+GGHY H D R+E GL R+A+ +FY++DVE G
Sbjct: 418 DMESAEE----LQVVNYGIGGHYAPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQG 473
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT+F +L ++P++G+A FW N H + D R H+ CPV G KW
Sbjct: 474 GATVFTTLRTALWPKRGTAAFWMNLHRDGEGDKRTQHAACPVLTGTKW 521
>gi|195159323|ref|XP_002020531.1| GL13463 [Drosophila persimilis]
gi|194117300|gb|EDW39343.1| GL13463 [Drosophila persimilis]
Length = 487
Score = 155 bits (391), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 87/228 (38%), Positives = 128/228 (56%), Gaps = 17/228 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G L +L+C Y + N FL++GPLK+EE + DP +V HDA+YDSE++
Sbjct: 239 YEMLCRGELKPSPTYMRSLRCRYVTNNVPFLRLGPLKLEEAHKDPYIVIYHDAMYDSEMD 298
Query: 63 RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I +++ + R V N G + R+SK +L E + + K+ R DMT L
Sbjct: 299 LIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTE---EDSVIAKVVQRTADMTGL 355
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVELG 172
+ E LQ+ NYG+GGHY H D R+E GL R+A+ +FY++DVE G
Sbjct: 356 DMESAEE----LQVVNYGIGGHYAPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQG 411
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT+F +L ++P++G+A FW N H + D R H+ CPV G KW
Sbjct: 412 GATVFTTLRTALWPKRGTAAFWMNLHRDGEGDKRTQHAACPVLTGTKW 459
>gi|112984520|ref|NP_001037195.1| prolyl 4-hydroxylase alpha subunit precursor [Bombyx mori]
gi|37543673|gb|AAM21932.1| prolyl 4-hydroxylase alpha subunit [Bombyx mori]
Length = 550
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 83/228 (36%), Positives = 129/228 (56%), Gaps = 15/228 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G + +P +I LKC+Y + + FLK+ P+KVE++Y+ P + H+ + D E
Sbjct: 291 KVYESLCRGEMEIPHEITKRLKCWYVTDTHPFLKLAPIKVEQMYVKPDIFMFHEVMTDDE 350
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I + +K + +R V + G+ R+SK +L E + P + +I R+ DMT
Sbjct: 351 IEFIKKRAKPRFKRAVVHDPKTGELTPAHYRISKSSWLRDE---ESPVIARITQRVTDMT 407
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE------GLWRLASFMFYLTDVELG 172
L + E LQ+ NYG+GGHY+ H D + E G R+A+ +FY++DV G
Sbjct: 408 GLSMLHAEE----LQVVNYGIGGHYEPHFDFARKRENPFTKFGGNRIATVLFYMSDVAQG 463
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT+F L L++FP K +A FW N HA+ D H+ CPV G+KW
Sbjct: 464 GATVFTELGLSLFPIKRAAAFWLNLHASGEGDLATRHAACPVLRGSKW 511
>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
Length = 549
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 85/225 (37%), Positives = 126/225 (56%), Gaps = 12/225 (5%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E+Y C+G L + L+C+ + + ++ P KVE+L DP V HD + D E
Sbjct: 300 ELYQRVCRGELRQSPKEQRYLRCWLSHQDVPYQRLSPFKVEQLSGDPYVAYFHDVLSDKE 359
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+IIE KG+V R ++ G++ + R S+ +L+ + ++P+L I+ R++D+T L
Sbjct: 360 SEQIIEHGKGQVTRSEIGQTGNSTVSEIRTSQNTWLW---YENNPWLADIKQRLEDITGL 416
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGAT 175
E PLQ+ NYG+GG Y+ H D E + RL + +FYL DV LGGAT
Sbjct: 417 STDTAE----PLQLVNYGIGGQYEPHFDFMDDAEKNFGWKGNRLLTALFYLNDVPLGGAT 472
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP L+L V P KGS + WYN H + D+R H+GCPV G+KW
Sbjct: 473 AFPFLHLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLKGSKW 517
>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
Length = 545
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 85/228 (37%), Positives = 131/228 (57%), Gaps = 15/228 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G + ++ L+C Y + N + I P+K+EE L PR+V HD I D E
Sbjct: 300 DVYEQLCRGEKLMDPKLEGRLRCRYVTNNVPYFYIQPIKMEEALLKPRIVVYHDIISDEE 359
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I L++ + ER V G+ + R++K +L E +H ++ I R+ D+T
Sbjct: 360 IETIKRLAQPRFERATVQKKESGEREFSRYRIAKSAWLKHE---EHDYVSDINFRVGDIT 416
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W--RLASFMFYLTDVELG 172
L + E LQ+ NYG+GGHY+ H D + E W R+A+++FY++DVE G
Sbjct: 417 GLDMATSE----DLQVCNYGIGGHYEPHYDYARKGEVQQDFGWGGRIATWLFYMSDVEAG 472
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT+FP LNL+++P+KGSA FW+N + N + H+GCPV G+KW
Sbjct: 473 GATVFPKLNLSLWPQKGSAAFWFNLYPNGEGNEMTQHAGCPVLTGSKW 520
>gi|312383453|gb|EFR28539.1| hypothetical protein AND_03427 [Anopheles darlingi]
Length = 341
Score = 153 bits (387), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 87/253 (34%), Positives = 129/253 (50%), Gaps = 40/253 (15%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G P +++S L C Y + + FL++ PLK+EE Y P +V HD + D E
Sbjct: 68 KLYEQLCRGEQEPPIELRSQLVCRYATNRSPFLRLAPLKLEEAYRQPDIVIYHDVMSDRE 127
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I ++ + R V NY G+ + + R+SK +L +H + + R++DMT
Sbjct: 128 IELIKHYARPRFRRATVQNYKTGELEFANYRISKSAWLKD---TEHEVIRTVNQRVEDMT 184
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFY----- 165
L + E LQ+ NYG+GGHY+ H D R+E R+A+ +FY
Sbjct: 185 GLTMATAEE----LQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRIATVLFYVSDLC 240
Query: 166 ------------------LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 207
++DV GGAT+FPSLNL + P KG+A FW+N HA+ DY
Sbjct: 241 LCHTSHTNADFRFLSVGQMSDVTQGGATVFPSLNLALRPRKGTAAFWHNLHASGNGDYAT 300
Query: 208 YHSGCPVALGNKW 220
H+ CPV G KW
Sbjct: 301 RHAACPVLTGTKW 313
>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
Length = 513
Score = 153 bits (387), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 91/227 (40%), Positives = 126/227 (55%), Gaps = 21/227 (9%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + S L C Y S N+ FL++ PLK+E L LDP +V HDAI EI
Sbjct: 279 YEKGCRGQYAPA--TSSRLHCVYNSTNSAFLRLAPLKMELLQLDPYMVLYHDAISPREIE 336
Query: 63 RIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMT 118
+ L+ +++R KVV+ + V R SKV +L GD + F ++ RI+DM+
Sbjct: 337 DLQFLAMPRLKRAKVVDQVTHRNMMVKERTSKVTWL-----GDATNAFTMRLNKRIEDMS 391
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGG 173
+ E LQ+ NYGLGGHY H D + R G R+A+ MFYL+DVE GG
Sbjct: 392 GFTMYGSEM----LQVMNYGLGGHYASHYDFLNATSKTRLNGD-RIATVMFYLSDVEQGG 446
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
AT+FP + VFP++G+A+ WYN N D H+ CPV +G+KW
Sbjct: 447 ATVFPKIQKAVFPQRGTAIIWYNLKENGDFDTNTIHAACPVIVGSKW 493
>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
Length = 545
Score = 152 bits (385), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 87/231 (37%), Positives = 126/231 (54%), Gaps = 19/231 (8%)
Query: 1 EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
++Y C G++ P +++ L+C Y + + FL + PLKVEEL DP +V HD IY S
Sbjct: 284 DLYRYTCNGHIKPTPAELR-QLRCGYMTETHPFLLLAPLKVEELSHDPLLVLYHDVIYQS 342
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
EI+ + +L+K K+ R V ++ + R S+ F+ P+ H L I R+ DMT+
Sbjct: 343 EIDTLAKLTKNKIHRATVTGNNASVVSNARTSQFTFI-PKT--RHKVLRTIDQRVADMTD 399
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD----------ATPRDEGLWRLASFMFYLTDV 169
L + E + Q+ NYG+GGHY H D E R+A+ +FYLTDV
Sbjct: 400 LNMVFAEDH----QLANYGIGGHYAQHMDWFSPNAFETKQVANSEMGNRIATVLFYLTDV 455
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GG T FP L + P+K +A FWYN HA+ D R H CP+ +G+KW
Sbjct: 456 EQGGGTAFPVLKQLLKPKKYAAAFWYNLHASGAGDVRTMHGACPIIVGSKW 506
>gi|195505190|ref|XP_002099397.1| GE10881 [Drosophila yakuba]
gi|194185498|gb|EDW99109.1| GE10881 [Drosophila yakuba]
Length = 487
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 129/229 (56%), Gaps = 19/229 (8%)
Query: 3 YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y + C+G L P +++ L+C Y + FL++GPLK+EE + DP +V HDA+YDSEI
Sbjct: 239 YEMLCRGELKPSPSELRP-LRCRYVTNGVPFLRLGPLKLEEAHADPYIVIYHDAMYDSEI 297
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+ I +++ + R V N G + R+SK +L + + + R DMT
Sbjct: 298 DVIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTH---EDRVIGTVVQRTADMTG 354
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVEL 171
L + E LQ+ NYG+GGHY+ H D ++E GL R+A+ +FY++DVE
Sbjct: 355 LDMESAEE----LQVVNYGIGGHYEPHFDFARKEEERAFEGLNLGNRIATVLFYMSDVEQ 410
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F SL+ +FP KG+A FW N H + D R H+ CPV G KW
Sbjct: 411 GGATVFTSLHTALFPRKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKW 459
>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
Length = 520
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 85/226 (37%), Positives = 129/226 (57%), Gaps = 16/226 (7%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y + C+G P S L C Y S FL + PLK+E + L+P +V HD + +EI
Sbjct: 284 LYEMGCRG--MYPASTDSKLVCRYNSTTTPFLTLAPLKMEIVGLNPYMVIYHDVLSSAEI 341
Query: 62 NRIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+ + E++ ++R V + G V TR SKV + +P+ + + ++ RI DMT
Sbjct: 342 DEMKEMATPSLKRATVYKASLGKNEVVKTRTSKVAW-FPDSY--NSLTLRLNARIHDMTG 398
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW--RLASFMFYLTDVELGGA 174
+ E LQ+ NYGLGGHYD H D AT + L R+A+ +FY++DVE GGA
Sbjct: 399 FDLSGSEM----LQLMNYGLGGHYDKHYDFFNATEKSSSLTGDRIATVLFYMSDVEQGGA 454
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T+FP++ TV+P++G+AV WYN + D + H+ CPV +G+KW
Sbjct: 455 TVFPNIYKTVYPQRGTAVMWYNLKDDGQPDEQTLHAACPVLVGSKW 500
>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
Length = 541
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 84/226 (37%), Positives = 126/226 (55%), Gaps = 15/226 (6%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + I++ L+C Y + N + I P+K+E L PR+V H+ + D EI
Sbjct: 298 YERLCRGEKLMDPKIEARLRCRYVTNNVPYFFIQPIKMELASLKPRLVIYHNVVTDEEIE 357
Query: 63 RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+L++ ++ R V N G + R++K FL +H + K+ RI D+T L
Sbjct: 358 TAKKLAQSRLRRSTVQNSLTGASEPTKYRIAKAAFLQN---SEHDHIVKMTRRIGDVTGL 414
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W--RLASFMFYLTDVELGGA 174
+ E LQ+ NYG+GGHY+ H D + E W R+A++MFY++DVE GGA
Sbjct: 415 DMTTAEE----LQVCNYGIGGHYEPHYDHARKGEVQKDFGWGNRIATWMFYMSDVEAGGA 470
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T+FP +NL ++P+KGSA FW+N H N D H+ CPV G+KW
Sbjct: 471 TVFPQINLALWPQKGSAAFWFNLHPNGEGDDLTQHAACPVLTGSKW 516
>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
Length = 549
Score = 150 bits (379), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 84/229 (36%), Positives = 130/229 (56%), Gaps = 18/229 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + I+ +L+C Y + N + I PLK+EE +L P +V HD I+D EI
Sbjct: 303 YEKLCRGEKLMDPKIEGHLRCRYVTNNEPYFFIQPLKMEEAFLKPLLVIYHDVIFDEEIE 362
Query: 63 RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ +L+ + +R V+N G R+SK FL + +H + K+ R+ +T L
Sbjct: 363 TVKKLAHPRFKRTTVMNSATGKLETAKYRISKAAFLKNK---EHHHVLKMSRRVGAITGL 419
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL-------WR--LASFMFYLTDVEL 171
+ E LQ+ NYG+GGHY+ H D ++E + WR +A+++FY++DVE
Sbjct: 420 DMSTAE----DLQVCNYGIGGHYEPHFDYARKNETIGFNKDSGWRNRIATWLFYMSDVEA 475
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP+LN+ ++P+KGSA FWYN N + H+ CPV G+KW
Sbjct: 476 GGATVFPALNVALWPQKGSAAFWYNLFPNGEGNELTRHAACPVLTGSKW 524
>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
Length = 537
Score = 150 bits (379), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 88/228 (38%), Positives = 123/228 (53%), Gaps = 15/228 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
EIY C G + + NL+C Y S + FL + PLKVEEL +P +V HD IY SE
Sbjct: 289 EIYRYTCNGYIKKTPPEERNLRCGYMSETHPFLLLAPLKVEELNRNPLLVLYHDVIYQSE 348
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ + +L++ + ER VV + R S+ F+ H L I R+ DMTNL
Sbjct: 349 IDVLNKLNRKRYERAGVVINSTSTVSKKRTSQHIFIAA---TRHKVLRTIDQRVADMTNL 405
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD--------ATPRDEGLWRLASFMFYLTDVELG 172
+ +Y Q+ +YG+GGHY H D + DE R+A+ +FYL+DV G
Sbjct: 406 NM----QYAEDHQLADYGIGGHYSQHFDWFGNSDLANSKCDEMGNRIATVLFYLSDVAQG 461
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G T FP L + P+K +A FWYN HA+ D+R H GCP+ +G+KW
Sbjct: 462 GGTAFPILKQLLKPKKYAAAFWYNLHASGKGDWRNLHGGCPIIVGSKW 509
>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
domestica]
Length = 534
Score = 150 bits (378), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 130/230 (56%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
E+Y C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 289 EVYEALCRGEGIKLTPQRRKRLFCRYHDSNKTPQLLIAPFKEEDEWDSPHIVRYYDVLSD 348
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I E+SK K+ R V + G I V R+SK +L + D P + ++ R+Q
Sbjct: 349 EEIEKIKEISKPKLSRATVRDPKTGHLIVVSYRISKSSWLKED---DDPIIAQVNRRMQY 405
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ++NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 406 ITGLSVKTAEL----LQVSNYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP ++P+KG++VFWYN + DYR H+ CPV +G+KW
Sbjct: 462 AGGATVFPDFGAAIWPKKGTSVFWYNLFRSGECDYRTRHAACPVLVGSKW 511
>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
Length = 525
Score = 150 bits (378), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 85/228 (37%), Positives = 120/228 (52%), Gaps = 17/228 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y L C G+ + +L+C Y + FL I PLK EEL DP ++ HD IY SEI+
Sbjct: 258 YMLTCSGHFRPTPREQRDLRCGYMDETHPFLWIAPLKAEELSRDPLLILYHDVIYQSEID 317
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
I +L+ K++R + + +++ + R S+ FL + L I R+ DMTN +
Sbjct: 318 TIRKLTTNKLKRATITSTNESVVSNVRTSQFTFL---PVTEDKVLATIDRRVADMTNFNM 374
Query: 123 GREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVELG 172
RY Q NYG+GGHY H D D GL R+A+ +FYL+DV G
Sbjct: 375 ----RYAEDHQFANYGIGGHYGQHMDWFYQPSFDAGLVSSPEMGNRIATVLFYLSDVTQG 430
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G T FP L + + P+K +A FWYN HA+ + D R H CP+ G+KW
Sbjct: 431 GGTAFPHLRVLLKPKKYAAAFWYNLHASGVGDPRTQHGACPIISGSKW 478
>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
Length = 545
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 132/230 (57%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y C+G + I+ L+C Y + N +L I P+K+EE + P +V H+ I D E
Sbjct: 298 ENYEKLCRGEKLMDPKIEGRLRCRYVTNNVPYLYIQPVKMEEAFHKPLIVIYHNVINDDE 357
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + ++++ + +R V N G+ + R+SK +L E +H ++K+ R+ D+T
Sbjct: 358 IETVKKMAQPRFKRATVQNSVTGNLEPANYRISKSAWLKSE---EHDHVFKVTRRVGDVT 414
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL------W--RLASFMFYLTDVE 170
L + E LQ+ NYG+GGHY+ H D ++E W R+A+++FY+++VE
Sbjct: 415 GLDMATAE----DLQVVNYGIGGHYEPHFDYARKEEVNAFKDLGWGNRVATWLFYMSEVE 470
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP LNL ++P+KGSA FWYN H N + H+ CPV G+KW
Sbjct: 471 AGGATVFPKLNLALWPQKGSAAFWYNLHPNGEGNELTRHAACPVLTGSKW 520
>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 149 bits (377), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 120/230 (52%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y L C G+ + + +L+C Y + + FL + PLK EEL DP +V HD IY SE
Sbjct: 282 EAYRLTCSGHSRLTAREERHLRCGYMTETHPFLLLAPLKAEELSHDPLLVLYHDVIYQSE 341
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I +L+ ++ R V + + R S++ F+ +H L I R+ DMTNL
Sbjct: 342 IDVIRQLTTNRMARAMVTLTNQSTVSNVRTSQITFIAK---TEHEVLQTIDRRVADMTNL 398
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVE 170
+ E + Q NYG+GGHY H D T D GL R+A+ +FYL+DV
Sbjct: 399 NMDYAEDH----QFANYGIGGHYGQHMDWFTETTFDNGLVSSTEMGNRIATVLFYLSDVA 454
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L + P+K +A FW+N HA D R H CP+ G+KW
Sbjct: 455 QGGGTAFPYLKQHLRPKKYAAAFWHNLHAAGRGDARTQHGACPIIAGSKW 504
>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
Length = 316
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 86/226 (38%), Positives = 122/226 (53%), Gaps = 17/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G P S L C Y + FL + PLK+E + LDP +V HD + EI
Sbjct: 78 YQIGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIK 135
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ ++ ++R V + G V TR SKV + +P+ G +P ++ RI DMT
Sbjct: 136 ELQGMATPSLKRATVYQASSGRNEVVKTRTSKVAW-FPD--GYNPLTVRLNARISDMTGF 192
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGA 174
+ E LQ+ NYGLGGHYD H D + R+A+ +FYLTDVE GGA
Sbjct: 193 NLYGSEM----LQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGA 248
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T+FP++ VFP++GS V WYN N +D + H+ CPV +G+KW
Sbjct: 249 TVFPNIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKW 294
>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
Length = 511
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 89/223 (39%), Positives = 125/223 (56%), Gaps = 16/223 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y L C+G VP+ SNL C Y+ + FL++ PLK+E + L+P +V HDA+ EI+
Sbjct: 277 YSLGCRGQF-VPQ---SNLHCEYKMKTSPFLRLAPLKMEIVLLNPFIVVFHDALSPQEID 332
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
+ L++ ++R V G + R SK +L ++ + +I+ R+ DMT L +
Sbjct: 333 YLQNLARPLLKRTTVHVNGKYVSRRVRTSKGAWLERDL---NNLTRRIERRVVDMTELSM 389
Query: 123 GREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGGATIF 177
E Y I NYGLGGHY H D E R+A+ +FYL+DVE GGAT+F
Sbjct: 390 QGSEAY----NIMNYGLGGHYAAHYDFFNTTKQQTSETGDRIATVLFYLSDVEQGGATVF 445
Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
P+L L V PE+G A+FWYN N D R H GCPV +G+KW
Sbjct: 446 PNLKLAVSPERGMALFWYNLLDNGTGDTRTLHGGCPVLVGSKW 488
>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
Length = 525
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 86/226 (38%), Positives = 122/226 (53%), Gaps = 17/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G P S L C Y + FL + PLK+E + LDP +V HD + EI
Sbjct: 287 YQMGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIT 344
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ ++ ++R V + G V TR SKV + +P+ G +P ++ RI DMT
Sbjct: 345 ELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAW-FPD--GYNPLTVRLNARISDMTGF 401
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGA 174
+ E LQ+ NYGLGGHYD H D + R+A+ +FYLTDVE GGA
Sbjct: 402 NLYGSEM----LQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGA 457
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T+FP++ VFP++GS V WYN N +D + H+ CPV +G+KW
Sbjct: 458 TVFPNIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKW 503
>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
kowalevskii]
Length = 533
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 82/231 (35%), Positives = 126/231 (54%), Gaps = 18/231 (7%)
Query: 1 EIYPLACQG-NLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
E Y C+G + + + LKC YN FL + P K E ++ P+++ HDAI +
Sbjct: 286 EAYEALCRGEQVKMSPQRQKKLKCRLRDYNRPFLILQPAKEEVVFDKPKLIIFHDAILTN 345
Query: 60 EINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
EI ++ L+ ++ R + N G+ + + R+SK +L + D ++++ RI+
Sbjct: 346 EIRKVKALASPRLRRATIQNSVTGNLEFAEYRISKSAWLSED---DGDVVHRLNHRIEQY 402
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE--------GLWRLASFMFYLTDV 169
T L + E LQ+ NYGLGGHY+ H D ++E R+A+F+FY++DV
Sbjct: 403 TGLTMDTAEE----LQVANYGLGGHYEPHFDFARKEEINAFKSLNTGNRIATFLFYMSDV 458
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGAT+FP + + PEKGSA FWYN N DY H+ CPV +G+KW
Sbjct: 459 EAGGATVFPQVGARLIPEKGSAAFWYNLLKNGEGDYSTRHAACPVLVGSKW 509
>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Loxodonta africana]
Length = 534
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+V+ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N GD V R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEVVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
melanogaster]
gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
Length = 525
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 86/226 (38%), Positives = 122/226 (53%), Gaps = 17/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G P S L C Y + FL + PLK+E + LDP +V HD + EI
Sbjct: 287 YQIGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIK 344
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ ++ ++R V + G V TR SKV + +P+ G +P ++ RI DMT
Sbjct: 345 ELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAW-FPD--GYNPLTVRLNARISDMTGF 401
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGA 174
+ E LQ+ NYGLGGHYD H D + R+A+ +FYLTDVE GGA
Sbjct: 402 NLYGSEM----LQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGA 457
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T+FP++ VFP++GS V WYN N +D + H+ CPV +G+KW
Sbjct: 458 TVFPNIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKW 503
>gi|195341560|ref|XP_002037374.1| GM12888 [Drosophila sechellia]
gi|194131490|gb|EDW53533.1| GM12888 [Drosophila sechellia]
Length = 501
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 79/220 (35%), Positives = 118/220 (53%), Gaps = 18/220 (8%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y L C G+ + + +L+C Y + + FL I PLK EEL+ DP +V HD IY SE
Sbjct: 256 QAYSLTCSGHWRLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSE 315
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I +L+K ++ R + ++ +++ + R S++ F+ H L I R+ DMTNL
Sbjct: 316 IDVIRKLTKNRLMRATITSHNESVVSNVRTSQITFI---PVTAHKVLSTIDQRVADMTNL 372
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSL 180
+ +Y Q NYG+GGHY H D W + L+DV GG T FP L
Sbjct: 373 NM----KYAEDHQFANYGIGGHYGQHMD--------W---FYQTTLSDVAQGGGTAFPQL 417
Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ P+K +A FW+N HA+ + D R H CP+ G+KW
Sbjct: 418 RTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 457
>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N GD V R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N GD V R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
Length = 511
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N GD V R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N GD V R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
[Oryctolagus cuniculus]
Length = 534
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N GD V R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
troglodytes]
gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
troglodytes]
gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
sapiens]
gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
sapiens]
gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N GD V R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
garnettii]
Length = 534
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N GD V R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
Length = 534
Score = 147 bits (371), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 130/230 (56%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I+ + +L+K ++ R + N GD V R+SK +L ++P + ++ RIQD+T
Sbjct: 349 IDIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRLNMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 147 bits (370), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N GD V R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|157114983|ref|XP_001658090.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108877085|gb|EAT41310.1| AAEL007032-PA, partial [Aedes aegypti]
Length = 448
Score = 147 bits (370), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 84/220 (38%), Positives = 121/220 (55%), Gaps = 24/220 (10%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G +NL+C YES N++FLKI P K+EE LDP +V H+AI D EI
Sbjct: 244 LYEPLCRGEYQRTPAQVANLRCRYESKNSSFLKIAPFKLEEASLDPLIVIYHNAISDKEI 303
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
++II++SK ++R V G++ + + + D + + R +DMT
Sbjct: 304 DQIIQVSKPMLKRSMV---GESFSKEVSNERTNY-------DFELVKVLSLRTEDMT--- 350
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN 181
G + + LQ+NNYG+GG Y H D +E + +DVE GGAT+FP +
Sbjct: 351 -GLDRKSYESLQVNNYGIGGFYLPHFDWVRTNEPI----------SDVEQGGATVFPQIG 399
Query: 182 LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+ VFP+KGSA+FWYN + D R H CPV LG+KWG
Sbjct: 400 VGVFPKKGSAIFWYNLLPDGTGDERTLHGACPVLLGSKWG 439
>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
Length = 528
Score = 147 bits (370), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 84/229 (36%), Positives = 129/229 (56%), Gaps = 19/229 (8%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + ++ L+C Y + N F I P+K+EE L P +V H I+D+EI+
Sbjct: 285 YEKLCRGEKLLDPKVEGRLRCRYVTNNVPFFFIQPVKMEEALLKPLLVIYHGVIFDAEID 344
Query: 63 RIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ +L++ + +R V + G ++ V R++K FL +H + K+ R+ D+T L
Sbjct: 345 VVKKLAQPRFKRTGVTDRDTGRSMPVQYRIAKAAFLKD---SEHNLIVKMSRRVGDITGL 401
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDAT-------PRDEGLW--RLASFMFYLTDVEL 171
+ E LQ+ NYG+GGHY H D PRD W R+A+++FY++DVE
Sbjct: 402 DMAASE----DLQVCNYGIGGHYVPHFDYARQGEIHGPRDLD-WGNRIATWLFYMSDVEA 456
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP++ ++P+KGSA FWYN N D H+GCPV G+KW
Sbjct: 457 GGATVFPAVGAALWPQKGSAAFWYNLRPNGNGDEDTLHAGCPVLTGSKW 505
>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
Length = 534
Score = 146 bits (369), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N GD V R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEVVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVL 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
Length = 525
Score = 146 bits (369), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 123/230 (53%), Gaps = 25/230 (10%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G P S L C Y + FL + PLK+E + L+P +V HD + EI
Sbjct: 287 YQVGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLEPYMVLYHDVLSPKEIT 344
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ ++ ++R V + G V TR SKV + +P+ G +P ++ RI DMT
Sbjct: 345 ELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAW-FPD--GYNPLTVRLNARISDMTGF 401
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----------ATPRDEGLWRLASFMFYLTDVE 170
+ E LQ+ NYGLGGHYD H D A D R+A+ +FYLTDVE
Sbjct: 402 NLYGSEM----LQLMNYGLGGHYDQHYDFFNNTNSNMTAMSGD----RIATVLFYLTDVE 453
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP++ VFP++GS V WYN N +D + H+ CPV +G+KW
Sbjct: 454 QGGATVFPNIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKW 503
>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
Length = 496
Score = 146 bits (368), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 87/209 (41%), Positives = 119/209 (56%), Gaps = 16/209 (7%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
SNL C Y+ + FL + P+K+E L+P ++ HD + EI+ + +L++ +ER VV
Sbjct: 275 SNLYCVYKFGTSPFLLLAPIKMEIRLLNPFIIVFHDVLSPREIDELQKLARPLLERTTVV 334
Query: 79 NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYK-IQTRIQDMTNLVIGREERYKGPLQINNY 137
+ R SK + I DH L K I+ RI DM L + RY P Q+ NY
Sbjct: 335 KFKKYEKDSRRTSKGTW----IERDHNNLTKRIERRITDMVELDL----RYSEPFQVMNY 386
Query: 138 GLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSA 191
GLGGHY H D A ++E R+A+ +FYLTDVE GGAT+F LN V P++G+A
Sbjct: 387 GLGGHYAAHEDFLGDTWADKKEEDD-RIATVLFYLTDVEQGGATVFTILNQAVSPKRGTA 445
Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+FWYN H N D R H GCPV +G+KW
Sbjct: 446 LFWYNLHRNGTGDTRTLHGGCPVLVGSKW 474
>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
Length = 528
Score = 146 bits (368), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 122/230 (53%), Gaps = 25/230 (10%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G + D S L C Y + FL + PLK+E + LDP +V HD + EI
Sbjct: 290 YQMGCRGQFAPSAD--SKLHCLYNRTTSPFLMLAPLKMELVGLDPYMVLYHDVLSAKEIK 347
Query: 63 RIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ ++ ++R V G V TR SKV + +P+ G P ++ RI DMT
Sbjct: 348 ELQGMATPGLKRATVFQAASGRNEVVRTRTSKVAW-FPD--GYSPLTVRLNARITDMTGF 404
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----------ATPRDEGLWRLASFMFYLTDVE 170
+ E LQ+ NYGLGGHYD H D A D R+A+ +FYLTDVE
Sbjct: 405 NLHGSEM----LQLMNYGLGGHYDQHYDYFNTINSNLTAMSGD----RIATVLFYLTDVE 456
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP++ VFP++GS + WYN + +D + H+ CPV +G+KW
Sbjct: 457 QGGATVFPNIRKAVFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGSKW 506
>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
aries]
Length = 534
Score = 146 bits (368), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N GD V R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVL 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
(Silurana) tropicalis]
gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
Length = 527
Score = 146 bits (368), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 86/232 (37%), Positives = 133/232 (57%), Gaps = 17/232 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y + N + +L + P+KVE+ + PR+V+ +A+ D
Sbjct: 290 DVYEALCRGEGVKMNPRRQRRLFCRYHNGNRSPYLILSPVKVEDEWDSPRIVRYLNALSD 349
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I EL+K K+ R V + G + R+SK +L D P + ++ R+Q
Sbjct: 350 EEIAKIKELAKPKLARATVRDPKTGVLSVANYRVSKSAWLEE---NDDPVIARVNLRMQA 406
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D L RLA+F+ Y++DVE
Sbjct: 407 ITGLTVDTAEL----LQVANYGMGGQYEPHFDFSRRPFDSNLKTDGNRLATFLNYMSDVE 462
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G+KWGK
Sbjct: 463 AGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWGK 514
>gi|195575115|ref|XP_002105525.1| GD21527 [Drosophila simulans]
gi|194201452|gb|EDX15028.1| GD21527 [Drosophila simulans]
Length = 495
Score = 146 bits (368), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 79/220 (35%), Positives = 117/220 (53%), Gaps = 18/220 (8%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y L C G+ + + +L+C Y + + FL I PLK EEL+ DP +V HD IY SE
Sbjct: 250 QAYSLTCSGHWQLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSE 309
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I +L+K ++ R + ++ +++ + R S+ F+ H L I R+ DMTNL
Sbjct: 310 IDVIRKLTKNRLMRATITSHNESVVSNVRTSQFTFI---PVTAHKVLSTIDQRVADMTNL 366
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSL 180
+ +Y Q NYG+GGHY H D W + L+DV GG T FP L
Sbjct: 367 NM----KYAEDHQFANYGIGGHYGQHMD--------W---FYQTTLSDVAQGGGTAFPQL 411
Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ P+K +A FW+N HA+ + D R H CP+ G+KW
Sbjct: 412 RTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 451
>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
Length = 525
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 120/230 (52%), Gaps = 25/230 (10%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G P L C Y + FL + PLK+E + LDP +V HD + EI
Sbjct: 287 YQMGCRGQF--PPSADGKLYCLYNRTTSAFLMLAPLKMELVGLDPYMVLYHDVLSAKEIK 344
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ ++ + R V + G V TR SKV + +P+ + +P ++ RI DMT
Sbjct: 345 ELQGMATPGLTRATVFQASSGRNEVVKTRTSKVAW-FPDSY--NPLTVRLNARIADMTGF 401
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----------ATPRDEGLWRLASFMFYLTDVE 170
+ E LQ+ NYGLGGHYD H D A D R+A+ +FYLTDVE
Sbjct: 402 NLYGSEM----LQLMNYGLGGHYDQHYDFFNTINSNLTAMSGD----RIATVLFYLTDVE 453
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP++ VFP++GS + WYN N D + H+ CPV +G+KW
Sbjct: 454 QGGATVFPNIRKAVFPQRGSVIMWYNLQDNGQTDNKTLHAACPVIVGSKW 503
>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
Length = 490
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 84/220 (38%), Positives = 125/220 (56%), Gaps = 16/220 (7%)
Query: 3 YPLACQGNLSVPEDIKSN-LKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y C+G + + L C Y + N L P+K EEL+ +P++++ HD I D+E
Sbjct: 262 YEALCRGEVDERTSKRQRALSCRYSTGGGNPRLMYAPVKEEELWDEPKIIRYHDVISDTE 321
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I + ++++ ++ R + G + D R S+ FL E G + +I RI D+T L
Sbjct: 322 IETLKDIARPELTRSQT---GWGVISDIRTSQSVFL--EEVGT---VARISQRIADITGL 373
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSL 180
+ E+ L + NYG+GG Y H D DE R A+F+ Y++DVE+GGAT+F ++
Sbjct: 374 SVESAEK----LHVQNYGIGGRYTPHFDTG--DEVNERTATFLIYMSDVEVGGATVFTNV 427
Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ V PEKGSAVFWYN H N LD + H+GCPV +GNKW
Sbjct: 428 GVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKW 467
>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
rubripes]
Length = 538
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 86/232 (37%), Positives = 128/232 (55%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
E Y C+G L + E +S L C Y+ N N L + P+K E+ + P +V+ D + +
Sbjct: 291 EAYEALCRGEGLQMNEARRSRLFCRYQDGNRNPHLLLKPIKEEDEWDSPNIVRYLDFLSN 350
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I EL+K K+ R V + G R+SK +L E + P + ++ RI+D
Sbjct: 351 EEIEKIKELAKPKLARATVRDPKSGVLTTASYRVSKSAWLEGE---EDPIIARVNQRIED 407
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + +DE R+A+F+ Y++D
Sbjct: 408 LTGLTVKTAEL----LQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSD 463
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP ++P KG+AVFWYN + DYR H+ CPV +GNKW
Sbjct: 464 VEAGGATVFPDFGAAIWPRKGTAVFWYNLFKSGEGDYRTRHAACPVLVGNKW 515
>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Cricetulus griseus]
Length = 534
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 128/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N G+ V R+SK +L + P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGNLETVHYRISKSAWLSGY---EDPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
Length = 519
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/234 (34%), Positives = 131/234 (55%), Gaps = 23/234 (9%)
Query: 2 IYPLACQGNLS-----VPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDA 55
+Y L CQGN P +K +LKC Y + NN L + P+++E+++ P++ +H+
Sbjct: 271 VYELLCQGNQPEIFNITPSRVK-HLKCRYFTNNNHPRLLLAPIRLEQVFDKPKLWVLHNI 329
Query: 56 IYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
+ D E+ I +L++ ++ N G + R+SK +LY + +H + +++ R
Sbjct: 330 LSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSYRISKNAWLY---YWEHRLINRVKQR 386
Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYL 166
++D T L + E PLQ+ NYG+GGHY+ H D +DE R+A+ +FY+
Sbjct: 387 VEDATGLTMETAE----PLQVINYGIGGHYEPHFDCATKDEEFALDPNEGDRIATMLFYM 442
Query: 167 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+DVE GGAT+FP + V PEKG+ FWYN + D H+GCPV +G+KW
Sbjct: 443 SDVEAGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKW 496
>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 539
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 89/231 (38%), Positives = 123/231 (53%), Gaps = 19/231 (8%)
Query: 1 EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y C+G + V E KS L+C+ + + FLKI P+KVE L DP V + I DS
Sbjct: 279 DAYEALCRGEIPPVEEKWKSKLRCYLKR-DKPFLKIAPIKVEILRFDPLAVLFKNVISDS 337
Query: 60 EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
EI I EL+ K++R V N G+ + R+SK +L ++ P + ++ RI+D
Sbjct: 338 EIEVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLKGDL---DPVIDRVNRRIEDF 394
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDV 169
T L E LQ+ NYGLGGHYD H D ++E R+A+ +FY++
Sbjct: 395 TGLNQATSEE----LQVANYGLGGHYDPHFDFARKEEKNAFKTLNTGNRIATVLFYMSQP 450
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGAT+F L VFP K A+FWYN + D R H+ CPV LG KW
Sbjct: 451 ERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKW 501
>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
Length = 508
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 84/220 (38%), Positives = 125/220 (56%), Gaps = 16/220 (7%)
Query: 3 YPLACQGNLSVPEDIKSN-LKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y C+G + + L C Y + N L P+K EEL+ +P++++ HD I D+E
Sbjct: 280 YEALCRGEVDERTSKRQRALSCRYSTGGGNPRLMYAPVKEEELWDEPKIIRYHDVISDTE 339
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I + ++++ ++ R + G + D R S+ FL E G + +I RI D+T L
Sbjct: 340 IETLKDIARPELTRSQT---GWGVISDIRTSQSVFL--EEVGT---VARISQRIADITGL 391
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSL 180
+ E+ L + NYG+GG Y H D DE R A+F+ Y++DVE+GGAT+F ++
Sbjct: 392 SVESAEK----LHVQNYGIGGRYTPHFDTG--DEVNERTATFLIYMSDVEVGGATVFTNV 445
Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ V PEKGSAVFWYN H N LD + H+GCPV +GNKW
Sbjct: 446 GVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKW 485
>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
Length = 493
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 87/228 (38%), Positives = 126/228 (55%), Gaps = 19/228 (8%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G P L C Y S N+ FL++ PLK+E + LDP +V HD I EI+
Sbjct: 252 YERGCRGLFPSPSK-DGRLHCVYNSTNSAFLRLAPLKMELVGLDPYMVLYHDVISALEIS 310
Query: 63 RIIELSKGKVERGKVVNYG--DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
++ +++ ++R V + V TR SKV + +P+ F + ++ RI DMTN
Sbjct: 311 QLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAW-FPDTFNE--LTERLNRRIADMTNF 367
Query: 121 -VIGREERYKGPLQINNYGLGGHYDLHCD-------ATPRDEGLWRLASFMFYLTDVELG 172
++G E LQ NYGLGGHYD H D A R+A+ +FYLTDVE G
Sbjct: 368 DLLGSEM-----LQAMNYGLGGHYDKHYDFFNASTAANLTQMNGDRIATVLFYLTDVEQG 422
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT+FP++ VFP++GSA+ WYN + + + H+ CPV +G+KW
Sbjct: 423 GATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKW 470
>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
Length = 507
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 128/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 262 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 321
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N G+ V R+SK +L + P + +I RIQD+T
Sbjct: 322 IEIVKDLAKPRLRRATISNPITGNLETVHYRISKSAWLSGY---EDPVVSRINMRIQDLT 378
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 379 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVS 434
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 435 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 484
>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
Length = 528
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 87/228 (38%), Positives = 128/228 (56%), Gaps = 19/228 (8%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G P L C Y S N+ FL++ PLK+E + LDP +V HD I EI+
Sbjct: 287 YERGCRGLFPSPSK-DGRLHCVYNSTNSAFLRLAPLKMELVGLDPYMVLYHDVISAPEIS 345
Query: 63 RIIELSKGKVERGKVVNYG--DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
++ +++ ++R V + V TR SKV + +P+ F + ++ RI DMTN
Sbjct: 346 QLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAW-FPDTFNE--LTERLNRRIADMTNF 402
Query: 121 -VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW---RLASFMFYLTDVELG 172
++G E LQ NYGLGGHYD H D +T + R+A+ +FYLTDVE G
Sbjct: 403 DLLGSEM-----LQAMNYGLGGHYDKHYDFFNASTATNLTQMNGDRIATVLFYLTDVEQG 457
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT+FP++ VFP++GSA+ WYN + + + H+ CPV +G+KW
Sbjct: 458 GATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKW 505
>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Cavia porcellus]
Length = 533
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
E+Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 EVYESLCRGEGIKLTPQRRKRLFCRYHHGNRAPELLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L E D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEE---DDPVVARVNRRMQQ 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
Length = 511
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/222 (36%), Positives = 123/222 (55%), Gaps = 18/222 (8%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
+ + C+G +S L C Y+S + FL++ P+K+E L LDP VV HD + EI+
Sbjct: 279 FEIGCRGQYVQ----QSGLMCTYKSKSPAFLRLAPIKMEVLVLDPLVVIFHDVLSSREID 334
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
+ E+++ +ER VV Y + R+S ++ + + ++I+ RI DM +L +
Sbjct: 335 GLQEIARPHLERSMVVKYRANVQGKHRISAGTWVERKY---NNLTWRIERRIADMVDLNL 391
Query: 123 GREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATIFP 178
E P + NYG+GG Y H D T D RLA+ +FY+ DVE GGAT+FP
Sbjct: 392 EGSE----PFYVINYGIGGQYKAHWDFFGADTVEDN---RLATVLFYMNDVEQGGATVFP 444
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L TV ++G+A+FWYN N +D R H GCP+ +G+KW
Sbjct: 445 RLGQTVRAKRGNALFWYNMQHNGTVDDRTLHGGCPILVGSKW 486
>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
gallus]
Length = 536
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C +Y+ N +GP+K E+ + PR+V+ D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + EL+K ++ R + N G R+SK +L + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLRRATISNPITGALETAHYRISKSAWLSGY---ESPVVSRINTRIQDLT 407
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 463
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513
>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
[Rattus norvegicus]
Length = 534
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N G V R+SK +L + P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGY---EDPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
Length = 534
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N G V R+SK +L + P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGY---EDPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
Length = 561
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N G V R+SK +L + P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGY---EDPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide [Mus musculus]
gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
musculus]
Length = 534
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N G V R+SK +L + P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGY---EDPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Loxodonta africana]
Length = 534
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+V+ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
gallus]
Length = 536
Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C +Y+ N +GP+K E+ + PR+V+ D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + EL+K ++ R V + G R+SK +L + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 407
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 463
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513
>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Meleagris gallopavo]
Length = 536
Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C +Y+ N +GP+K E+ + PR+V+ D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + EL+K ++ R + N G R+SK +L + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLRRATISNPITGALETAHYRISKSAWLSGY---ESPVVSRINTRIQDLT 407
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513
>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
Length = 541
Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 122/230 (53%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+IY C G++ + +L+C Y + + FL + PLKVEEL +P +V HD IY SE
Sbjct: 284 DIYRFTCSGHIKKTAREERHLRCGYLTETHPFLNLAPLKVEELNHNPLLVLYHDVIYQSE 343
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I L++ ++ R V+ + R S+ F+ P+ H L I R+ DM+NL
Sbjct: 344 IDVIRNLTENEISRATVIGAKGSEVSKVRTSQFTFI-PKT--RHKVLQTIDQRVADMSNL 400
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRD----------EGLWRLASFMFYLTDVE 170
+ E + Q NYG+GGHY H D +D E R+A+ +FYL+DV
Sbjct: 401 NMDYAELH----QFANYGIGGHYAQHNDWFGQDAFDNELVSSPEMGNRIATVLFYLSDVA 456
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L + P+K +A FW+N HA+ + D R H CP+ G+KW
Sbjct: 457 QGGGTAFPHLKQLLQPKKYAAAFWHNLHASGVGDLRTLHGACPIIAGSKW 506
>gi|281348666|gb|EFB24250.1| hypothetical protein PANDA_000722 [Ailuropoda melanoleuca]
Length = 505
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 86/236 (36%), Positives = 129/236 (54%), Gaps = 19/236 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
+IY C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 277 DIYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 336
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 337 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 393
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + ++E R+A+F+ Y++D
Sbjct: 394 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSD 449
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KWGK L
Sbjct: 450 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWGKWL 505
>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
Length = 538
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 86/232 (37%), Positives = 130/232 (56%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
EIY C+G + + + +S L C Y N N L + P+K E+ + P +V+ +A+ D
Sbjct: 291 EIYEGLCRGEGVKMTSERRSRLYCRYHDGNRNPRLLLQPMKEEDEWDSPHIVRYLNALSD 350
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
SEI +I EL+K ++ R V + G + R+SK +L E + P + ++ RI+D
Sbjct: 351 SEIEKIKELAKPRLARATVRDPKTGVLTTANYRVSKSAWLEGE---EDPVIERVNQRIED 407
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L E LQI NYG+GG Y+ H D + +DE R+A+F+ Y++D
Sbjct: 408 ITGLTTQTAEL----LQIANYGVGGQYEPHFDFSRKDEPDAFKTLGTGNRVATFLNYMSD 463
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 464 VEAGGATVFPDFGAAIYPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 515
>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
gallus]
Length = 536
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C +Y+ N +GP+K E+ + PR+V+ D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + EL+K ++ R + N G R+SK +L + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLRRATISNPITGALETAHYRISKSAWLSGY---ESPVVSRINTRIQDLT 407
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513
>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Sarcophilus harrisii]
Length = 534
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 128/230 (55%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 289 DVYEALCRGEGIKLTPRRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVLSD 348
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI EL+K K+ R V + G + R+SK +L GD P + ++ R+
Sbjct: 349 EEIERIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEE---GDDPVIAQLNRRMHY 405
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 406 ITGLSVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP T++P+KG++VFWYN + DYR H+ CPV +G+KW
Sbjct: 462 AGGATVFPDFGATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 511
>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
Length = 539
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 87/231 (37%), Positives = 124/231 (53%), Gaps = 19/231 (8%)
Query: 1 EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y C+G + V K+ L+C+ + + FLK+ P+KVE L DP V + I+DS
Sbjct: 279 DAYEALCRGEIPPVEPKWKNKLRCYLKR-DKPFLKLAPIKVEILRFDPLAVLFKNVIHDS 337
Query: 60 EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
EI I EL+ K++R V N G+ + R+SK +L ++ P + ++ RI+D
Sbjct: 338 EIEVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLKGDL---DPVIDRVNRRIEDF 394
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDV 169
TNL E LQ+ NYGLGGHYD H D ++E R+A+ +FY++
Sbjct: 395 TNLNQATSEE----LQVANYGLGGHYDPHFDFARKEEKNAFKTLNTGNRIATVLFYMSQP 450
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGAT+F L VFP K A+FWYN + D R H+ CPV LG KW
Sbjct: 451 ERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKW 501
>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
catus]
gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
catus]
Length = 533
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 127/230 (55%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
+IY C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 288 DIYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|198418585|ref|XP_002122034.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1 (4-PH
alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 525
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 89/230 (38%), Positives = 124/230 (53%), Gaps = 18/230 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCF-YESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y CQG +P + NL+C+ Y + N+ L+I P+KVEEL P +V+ +D I + +I
Sbjct: 276 YNQICQGKFKLPHKVSKNLRCYLYTNKNDPRLRIKPVKVEELCNSPHIVQFYDVINNDDI 335
Query: 62 NRIIELSKGKVERGKVVNYGDT-IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I ++SK + R V +T I D R SKV + D + K+ TRI +MT L
Sbjct: 336 ETIKKMSKKHLSRALVTGPNNTGIVEDIRTSKVAWFKK---NDFTAVKKLYTRISEMTGL 392
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDAT------PRDEGLW---RLASFMFYLTDVEL 171
EE ++ LQ+ NYGL G Y H D T R++G R+A+ + YL DV+
Sbjct: 393 ---SEETFED-LQVANYGLAGEYQPHFDYTEDPSIYKREDGAEVGNRIATMLLYLNDVKE 448
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
GG T F + P KGSAVFWYN + + L D R H+ CPV +GNKW
Sbjct: 449 GGRTAFIEPKIVAKPIKGSAVFWYNLYPSGLGDPRTRHASCPVVIGNKWA 498
>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
Length = 537
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 83/228 (36%), Positives = 125/228 (54%), Gaps = 17/228 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + I+ +L+C Y + N F I P+K+EE L P +V HD + D EI
Sbjct: 292 YEKLCRGEKLMDPKIEGHLRCRYITNNVPFFFIQPIKMEEALLKPMIVVYHDVMSDDEIE 351
Query: 63 RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ +++K + +R + N G+ + R+SK +L E +H + K+ R+ D+T L
Sbjct: 352 TVKKMAKPRFKRATIRNSKTGELEPANYRISKSAWLKSE---EHDHILKVTRRVGDITGL 408
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDEGL----W--RLASFMFYLTDVELG 172
+ E LQ+ NYG+GGHY+ H D T E W R+A+++FY++DVE G
Sbjct: 409 DMSTAE----DLQVVNYGIGGHYEPHFDYARTETTEAFKELGWGNRIATWLFYMSDVEAG 464
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT+FP V+P KGSA FWYN + N + H+ CPV G+KW
Sbjct: 465 GATVFPPTGAAVWPRKGSAAFWYNLYPNGKGNELTRHAACPVLSGSKW 512
>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
garnettii]
Length = 534
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
Length = 541
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 117/230 (50%), Gaps = 18/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+IY C+ + V + S L C+Y+ + FL++ P KVE L +P V D I D E
Sbjct: 287 DIYEALCRNEIPVSIKVTSKLYCYYK-MDRPFLRLAPFKVEILRFNPLAVLFRDVITDEE 345
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I L+ ++ R V N G+ R SK +L E +H +++I RI MT
Sbjct: 346 ITMIQMLATPRLRRATVQNSITGELETASYRTSKSAWLKDE---EHEVVHRINKRIDLMT 402
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
NL E+ LQ+ NYG+GGHYD H D R+E RLA+ +FY+T E
Sbjct: 403 NL----EQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPE 458
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F + TV P K A+FWYN + D R H+ CPV G KW
Sbjct: 459 SGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLTGTKW 508
>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
Length = 488
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 243 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 302
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 303 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 359
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 360 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 415
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 416 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 465
>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
Length = 534
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
caballus]
Length = 302
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 57 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 116
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 117 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 173
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 174 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 229
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 230 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 279
>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
melanoleuca]
Length = 534
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
jacchus]
Length = 534
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1 [Nomascus leucogenys]
Length = 502
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 257 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 316
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 317 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 373
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 374 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 429
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 430 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 479
>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
porcellus]
Length = 534
Score = 142 bits (359), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
[Oryctolagus cuniculus]
Length = 534
Score = 142 bits (359), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
sapiens]
gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
troglodytes]
gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I variant [Homo
sapiens]
gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
sapiens]
gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
sapiens]
gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 142 bits (359), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 142 bits (359), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
lupus familiaris]
Length = 534
Score = 142 bits (359), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
gallus]
Length = 536
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C +Y+ N +GP+K E+ + PR+V+ D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + EL+K ++ R V + G R+SK +L + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 407
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513
>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
[Papio anubis]
Length = 379
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 134 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 193
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 194 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 250
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 251 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 306
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 307 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 356
>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Meleagris gallopavo]
Length = 536
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C +Y+ N +GP+K E+ + PR+V+ D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + EL+K ++ R V + G R+SK +L + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 407
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513
>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1
Length = 516
Score = 142 bits (359), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C +Y+ N +GP+K E+ + PR+V+ D I D E
Sbjct: 271 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 330
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + EL+K ++ R V + G R+SK +L + P + +I TRIQD+T
Sbjct: 331 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 387
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 388 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 443
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 444 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 493
>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
Length = 534
Score = 142 bits (359), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L + P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
Length = 541
Score = 142 bits (359), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 117/230 (50%), Gaps = 18/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+IY C+ + V + S L C+Y+ + FL++ P KVE L +P V D I D E
Sbjct: 287 DIYEALCRNEIPVSIKVTSKLYCYYK-MDRPFLRLAPFKVEILRFNPLAVLFRDVITDEE 345
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ I L+ ++ R V N G+ R SK +L E +H +++I RI MT
Sbjct: 346 VTMIQMLATPRLRRATVQNSITGELETASYRTSKSAWLKDE---EHEVVHRINKRIDLMT 402
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
NL E+ LQ+ NYG+GGHYD H D R+E RLA+ +FY+T E
Sbjct: 403 NL----EQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPE 458
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F + TV P K A+FWYN + D R H+ CPV G KW
Sbjct: 459 SGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLTGTKW 508
>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
gallus]
Length = 489
Score = 142 bits (359), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C +Y+ N +GP+K E+ + PR+V+ D I D E
Sbjct: 244 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 303
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + EL+K ++ R V + G R+SK +L + P + +I TRIQD+T
Sbjct: 304 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 360
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 361 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 416
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 417 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 466
>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
musculus]
Length = 534
Score = 142 bits (359), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L + P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
[Rattus norvegicus]
Length = 534
Score = 142 bits (358), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L + P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Cricetulus griseus]
Length = 534
Score = 142 bits (358), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L + P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
lupus familiaris]
Length = 533
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 127/230 (55%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
Length = 532
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 127/230 (55%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
harrisii]
Length = 385
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C Y N N + P K E+ + PR+V+ H+ I D+E
Sbjct: 140 YEMLCRGEGLKMTPQRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHEIISDAE 199
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L + P + +I RIQD+T
Sbjct: 200 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 256
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 257 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 312
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 313 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 362
>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
Length = 536
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C +Y+ N +GP+K E+ + PR+V+ D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + EL+K ++ R V + G R+SK +L + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 407
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513
>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
guttata]
Length = 536
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C +Y+ N +GP+K E+ + PR+V+ D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + EL+K ++ R V + G R+SK +L + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 407
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513
>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
Length = 534
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVL 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
gallopavo]
Length = 535
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 85/231 (36%), Positives = 127/231 (54%), Gaps = 19/231 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
+IY C+G + + + L C Y + N N L I P K E+ + P +V+ +D + D
Sbjct: 290 DIYEALCRGEGVKMTPRRQKRLFCRYHNGNRNPHLVIAPFKEEDEWDSPHIVRYYDVMSD 349
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I +L+K K+ R V + G R+SK +L + D P + K+ R+Q
Sbjct: 350 EEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAKVNQRMQQ 406
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPRDEGLWRLASFMFYLTDV 169
+T L + E LQ+ NYG+GG Y+ H D +T + EG RLA+F+ Y++DV
Sbjct: 407 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSTLKSEGN-RLATFLNYMSDV 461
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 462 EAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 512
>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
latipes]
Length = 532
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 130/230 (56%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
E Y C+G L + E +S L C +++ + L + P+K E+ + +P +V+ + + D
Sbjct: 287 ETYEALCRGEGLQLTEARRSRLFCRYHDGKRSPRLLLKPIKEEDEWDNPHIVRYLNILSD 346
Query: 59 SEINRIIELSKGKVERGKVVNYGDTIYVDT--RLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I EL+K ++ R V + + R+SK +L E D P + ++ RIQD
Sbjct: 347 QEIEKIKELAKPRLARATVRDPKTGVLTTAPYRVSKSAWLEGE---DDPVIDRVNQRIQD 403
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D L RLA+F+ Y++DVE
Sbjct: 404 ITGLTVETAEL----LQVANYGVGGQYEPHFDFSRRPFDSNLKVDGNRLATFLNYMSDVE 459
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP +++P KG+AVFWYN + DYR H+ CPV +G+KW
Sbjct: 460 AGGATVFPDFGASIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 509
>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 1 [Oryctolagus
cuniculus]
Length = 533
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHDGNGAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + +I R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARINRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
aries]
Length = 534
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVL 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
Length = 529
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 89/230 (38%), Positives = 125/230 (54%), Gaps = 22/230 (9%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y C+G P+++K L C Y S + FL++ PLK+E + LDP +V HD I SE
Sbjct: 288 QAYERGCRGQY--PQNLK--LYCVYNSTTSAFLRLAPLKMELISLDPYMVIYHDVISPSE 343
Query: 61 INRIIELSKGKVERGKVVNYGDTI--YVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I+ + L+ ++R V N V TR SKV +L + + ++ RI DMT
Sbjct: 344 ISELQSLAVPGLKRATVFNQQSMRNHVVKTRTSKVTWLLDTL---NQLTIRLNRRITDMT 400
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD--------ATPRDEGLWRLASFMFYLTDVE 170
+ E LQ+ NYGLGGHYD H D R G R+A+ +FYLTDVE
Sbjct: 401 GFDMYGSEM----LQVMNYGLGGHYDKHYDYFNSSVAADLTRLNGD-RIATVLFYLTDVE 455
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP++ VFP+ G+AV WYN + D + H+ CPV +G+KW
Sbjct: 456 QGGATVFPNIEKAVFPKSGTAVVWYNLRHDGNGDPQTLHAACPVIVGSKW 505
>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
Length = 540
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 129/230 (56%), Gaps = 19/230 (8%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+ L + L+C N P ++EEL+LDP V+++HD I E
Sbjct: 286 MYQQVCREELKPEPATQRKLRCRLHRGNGLRSSYQPYRLEELHLDPYVIQVHDIISAEET 345
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDT--RLSK-VYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ +L++ +++R V + ++ ++ T R+S+ +F Y E HP + ++ +++++
Sbjct: 346 IVLQQLARPELQRSMVYSLSNSEHISTNFRISQGTFFEYHE----HPIMQRMSQHLENIS 401
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE--------GLWRLASFMFYLTDVE 170
L + E+ LQ+ NYG+GGHY+ H D+ + R+A+ ++YL++VE
Sbjct: 402 GLDMRSAEQ----LQVANYGIGGHYEPHMDSFSENHNYGINTYMSTNRVATGIYYLSNVE 457
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L L V PE+GS +FWYN H + LDYR H+GCPV +G+KW
Sbjct: 458 AGGGTAFPFLPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLMGSKW 507
>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Monodelphis domestica]
Length = 537
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C Y N N + P K E+ + PR+V+ H+ I D+E
Sbjct: 292 YEMLCRGEGLKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHEIISDAE 351
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L + P + +I RIQD+T
Sbjct: 352 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 408
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 409 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 464
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 465 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 514
>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
Length = 534
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 85/231 (36%), Positives = 126/231 (54%), Gaps = 19/231 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
+IY C+G + + + L C Y N N L I P K E+ + P +V+ +D + D
Sbjct: 289 DIYEALCRGEGVKMTPRRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSD 348
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I +L+K K+ R V + G R+SK +L + D P + K+ R+Q
Sbjct: 349 EEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAKVNQRMQQ 405
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPRDEGLWRLASFMFYLTDV 169
+T L + E LQ+ NYG+GG Y+ H D +T + EG RLA+F+ Y++DV
Sbjct: 406 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSTLKSEGN-RLATFLNYMSDV 460
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 EAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 511
>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
Length = 244
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 126/228 (55%), Gaps = 19/228 (8%)
Query: 5 LACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
+ C+G + + + L C Y N N + P K E+ + PR+++ HD I D+EI
Sbjct: 1 MLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIE 60
Query: 63 RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ +L+K ++ R V + G R+SK +L ++P + +I RIQD+T L
Sbjct: 61 IVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLTGL 117
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELG 172
+ E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV G
Sbjct: 118 DVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAG 173
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 174 GATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 221
>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
Length = 538
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 125/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
+IY C+G + + + L C Y N N L I P K E+ + P +V+ +D + D
Sbjct: 292 DIYEALCRGEGVKMTPQRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSD 351
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I +L+K K+ R V + G R+SK +L + D P + K+ R+Q
Sbjct: 352 EEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAKVNQRMQQ 408
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + +DE R+A+F+ Y++D
Sbjct: 409 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSD 464
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 465 VEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 516
>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
musculus]
Length = 593
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 348 DVYESLCRGEGVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 407
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 408 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 464
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 465 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 520
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 521 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 570
>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
occidentalis]
Length = 525
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G ++ NL C Y N+ ++ + P K+E ++ P + HD + D EI
Sbjct: 281 MYERLCRGEPVEKPFLRKNLHCTYFHNNHPYMILQPSKLEVIHERPYLALFHDIMSDDEI 340
Query: 62 NRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+IELS +++R V N G+ + R+SK +L DH + ++ R + +T
Sbjct: 341 QTVIELSAPRLKRATVQNAKSGELEVANYRISKSAWLKNH---DHEVVERLSFRFEYLTG 397
Query: 120 LV-IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + EE LQ+ NYG+GGHY+ H D RDE R+A+++ Y++DV+
Sbjct: 398 LTHLTAEE-----LQVVNYGIGGHYEAHFDFARRDEKDAFKQLGTGNRIATWINYMSDVK 452
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L LTV+PEKGSA FW+N H + D H+ CPV G+KW
Sbjct: 453 AGGATVFPRLGLTVWPEKGSAAFWWNLHRSGEGDILTRHAACPVLAGSKW 502
>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
Length = 533
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLPIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
musculus]
gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
Length = 535
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 290 DVYESLCRGEGVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 349
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 350 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 406
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 407 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 462
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 463 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
troglodytes]
gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
troglodytes]
gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
troglodytes]
gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
paniscus]
gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
paniscus]
gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
paniscus]
gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 533
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
+IY C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DIYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Monodelphis domestica]
Length = 537
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C Y N N + P K E+ + PR+V+ H+ I D+E
Sbjct: 292 YEMLCRGEGLKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHEIISDAE 351
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N G R+SK +L + P + +I RIQD+T
Sbjct: 352 IEIVKDLAKPRLRRATISNPITGVLETAHYRISKSAWLSGY---EDPVVSRINMRIQDLT 408
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 409 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 464
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 465 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 514
>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Loxodonta africana]
Length = 534
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 127/230 (55%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 289 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 348
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI +++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 349 EEIERIKQIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAQVNRRMQH 405
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 406 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 461
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 462 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 511
>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
anatinus]
Length = 493
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+V+ H+ I D+E
Sbjct: 248 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRYHEIISDAE 307
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L + P + +I RIQD+T
Sbjct: 308 IETVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 364
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 365 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 420
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 421 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 470
>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 615
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 129/232 (55%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
E Y + C+G + + +S L C +Y++ N L + P+K ++ + P +V+ D I D
Sbjct: 368 EKYEMLCRGEGIKMTPRRQSRLFCRYYDNNRNPSLLLAPVKQQDEWDRPYIVRYLDIISD 427
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
+EI R+ +L+K ++ R + N G R+SK +L D P + KI RI+
Sbjct: 428 AEIERVKQLAKPRLRRATISNPITGVLETASYRISKSAWLTEY---DDPMIEKINDRIEG 484
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++D
Sbjct: 485 VTGLEMDTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSD 540
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
V GGAT+FP + V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 541 VSAGGATVFPDVGAAVWPQKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 592
>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
Length = 526
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 125/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 281 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 340
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + L+K ++ R V + G R+SK +L + P + +I RIQD+T
Sbjct: 341 IEIVKYLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 397
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 398 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 453
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 454 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 503
>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
Length = 535
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 122/229 (53%), Gaps = 18/229 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G P + S + C Y NN L + P+K EE+Y D +V HD D E+
Sbjct: 291 YKRLCKGLDVKPREKMSQVVCRYRHNNNPRLLLSPIKEEEVYRDANMVLFHDIASDKEMK 350
Query: 63 RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I L+ K+ R V + G I+ R++K +L DH + ++Q RI+ +T L
Sbjct: 351 IIKSLAIPKLFRATVHDPTTGKLIHAKYRITKTAWLDDR---DHLVVDRVQNRIKAVTGL 407
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---------RLASFMFYLTDVEL 171
+ + LQ+ NYG+GGHYD H D + RD+ R+A+F+ Y+TDV+
Sbjct: 408 DLDSAD----ALQVANYGIGGHYDPHYDFSTRDDDDTSETEKRDGNRIATFLLYMTDVDA 463
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP +++ V P+KG+AVFWYN + H+ CPV +G KW
Sbjct: 464 GGATVFPIIDVRVLPKKGTAVFWYNLRRSGKGIMETRHAACPVLVGTKW 512
>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
Length = 533
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
Length = 541
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y C+G L + + +L C Y + N + F IGP+K E+ + PR+++ H+ I + E
Sbjct: 296 YEKLCRGEGLRMTPRRQKHLFCRYFNGNRHPFYTIGPVKQEDEWDRPRIIRYHEIITEQE 355
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I +I ELSK ++ R + N G R+SK +L +HP + +I RI+D+T
Sbjct: 356 IEKIKELSKPRLRRATISNPITGVLETAHYRISKSAWLAAY---EHPVVDRINQRIEDIT 412
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 413 GLNVKTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVA 468
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + V P KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 469 AGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 518
>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
leucogenys]
Length = 556
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 311 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 370
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 371 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 427
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 428 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 483
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 484 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 533
>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
abelii]
gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 533
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
leucogenys]
gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
leucogenys]
Length = 535
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 290 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 349
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 350 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 406
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 407 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 462
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 463 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
boliviensis boliviensis]
gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
boliviensis boliviensis]
Length = 533
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
Length = 538
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 125/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
+IY C+G + + + L C Y N N L I P K E+ + P +V+ +D + D
Sbjct: 291 DIYEALCRGEGVKMTPRRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSD 350
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I +L+K K+ R V + G R+SK +L + D P + K+ R+Q
Sbjct: 351 EEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAKVNQRMQQ 407
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + +DE R+A+F+ Y++D
Sbjct: 408 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSD 463
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 464 VEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 515
>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
Length = 539
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 82/224 (36%), Positives = 114/224 (50%), Gaps = 15/224 (6%)
Query: 4 PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
P C G P +K L C Y FL++ P+K E L +DP VV +HD + E
Sbjct: 289 PPCCSGRCEGPRKLK-RLYCVYNCATAAFLRLAPIKTEILSIDPFVVLLHDMVSPKEAAL 347
Query: 64 IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
I SK + + VN + V R SK +L + + K+ R+ D T L +
Sbjct: 348 IRSSSKSTIFPSETVNAANDFVVSKFRTSKSVWLDRDA---NEATVKLTQRLADATGLDV 404
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
E + Q+ NYG+GG ++ H D T D + R+A+ +FYL DV GGAT
Sbjct: 405 KHSEHF----QVINYGIGGVFESHFDTTLEDTNRFVGGFIDRIATTLFYLNDVPQGGATH 460
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP LN+TVFP G+A+FWYN +L R H+GCPV +G+KW
Sbjct: 461 FPGLNITVFPRLGAALFWYNLDTQGMLQVRTMHTGCPVIVGSKW 504
>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
gorilla]
Length = 565
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 320 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 379
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 380 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 436
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 437 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 492
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 493 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 542
>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
Length = 533
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
Length = 536
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 82/226 (36%), Positives = 125/226 (55%), Gaps = 16/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + + L+C Y N+ + ++ PLK+EE LDP VV HD + ++I
Sbjct: 286 YEKVCRGEVEPSPAQQRPLRCRYSQGNHPYRQLAPLKMEEHSLDPFVVTYHDMLSPNKIA 345
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
++ E++ + R V + G R+SK +L + HP + K+ + D T L
Sbjct: 346 QLREMAVPHMRRSTVNPLPGGQNKKSSFRVSKNAWL---AYETHPTMGKMLRDLSDTTGL 402
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ Y LQ+ NYG+GGHY+ H D P +EG R+A+ ++YL++VE GGA
Sbjct: 403 DMT----YCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEEGN-RIATAIYYLSEVEQGGA 457
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP LN V P+ G+ +FWYN H ++ +DYR H+GCPV G+KW
Sbjct: 458 TAFPFLNFAVRPQLGNVLFWYNLHRSSDMDYRTKHAGCPVLKGSKW 503
>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_c
[Homo sapiens]
Length = 565
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 320 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 379
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 380 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 436
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 437 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 492
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 493 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 542
>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
Length = 487
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 242 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 301
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 302 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 358
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 359 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 414
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 415 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 464
>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
Length = 535
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 290 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 349
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 350 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 406
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 407 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 462
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 463 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 575
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 330 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 389
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 390 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 446
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 447 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 502
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 503 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 552
>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
taurus]
gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
Length = 533
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
Length = 504
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 259 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 318
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 319 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 375
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 376 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 431
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 432 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 481
>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Ovis aries]
Length = 487
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 242 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 301
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 302 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 358
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 359 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 414
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 415 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 464
>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_d
[Homo sapiens]
Length = 488
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 243 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 302
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 303 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 359
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 360 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 415
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 416 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 465
>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
Length = 542
Score = 139 bits (351), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 117/230 (50%), Gaps = 18/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+ + V S L C+Y+ + FL++ P KVE L P V D I D E
Sbjct: 288 DMYEALCRNEVPVSVKATSKLYCYYK-MDRPFLRLAPFKVEILRFSPLAVFFRDVITDEE 346
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ I L+ ++ R V N G+ R SK +L E +H +++I RI MT
Sbjct: 347 VTIIQMLATPRLRRATVQNSITGELETASYRTSKSAWLKDE---EHEIVHRINRRIDLMT 403
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
NL E+ LQ+ NYG+GGHYD H D R+E RLA+ +FY+T E
Sbjct: 404 NL----EQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPE 459
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F + TV P K A+FWYN + D R H+ CPV +G+KW
Sbjct: 460 SGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLIGSKW 509
>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
Length = 535
Score = 139 bits (351), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 126/232 (54%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + +DE R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|281361323|ref|NP_652183.2| CG15864 [Drosophila melanogaster]
gi|272476864|gb|AAF54202.3| CG15864 [Drosophila melanogaster]
Length = 490
Score = 139 bits (351), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 86/221 (38%), Positives = 119/221 (53%), Gaps = 22/221 (9%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
L+ + N +V S L C Y + F +I PLK+EEL LDP +V HD IYD+EI+ +
Sbjct: 258 LSTKQNCAVVVQKPSRLHCRYNTTTTPFTRIAPLKMEELGLDPYMVVFHDVIYDTEIDGM 317
Query: 65 IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
+ S N+G ++ + S+V D L R+ DMT G
Sbjct: 318 LNSS----------NFGLSLTDSGQKSEVRTSKDSYIVDAKTL---NERVTDMT----GF 360
Query: 125 EERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
P + NYGLGGHY LH D T R + R+A+ +FYL +V+ GGATIFP
Sbjct: 361 SMEMSDPFSLINYGLGGHYMLHYDFHEYTNTTRPKQGDRIATVLFYLGEVDSGGATIFPM 420
Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+N+TV P+KGSAVFWYN H + ++ + HS CPV G+K+
Sbjct: 421 INITVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGSKY 461
>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
guttata]
Length = 539
Score = 139 bits (351), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 84/231 (36%), Positives = 126/231 (54%), Gaps = 19/231 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
+IY C+G + + + L C Y N N L I P K E+ + P +V+ +D + D
Sbjct: 294 DIYEALCRGEGVKMTPRRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSD 353
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I +L+K ++ R V + G R+SK +L + D P + K+ R+Q
Sbjct: 354 EEIEKIKQLAKPRLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAKVNQRMQH 410
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPRDEGLWRLASFMFYLTDV 169
+T L + E LQ+ NYG+GG Y+ H D +T + EG RLA+F+ Y++DV
Sbjct: 411 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSTLKSEGN-RLATFLNYMSDV 465
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 466 EAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 516
>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
Length = 541
Score = 139 bits (351), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 117/230 (50%), Gaps = 18/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+ + V S L C+Y+ + FL++ P KVE L P V D I D E
Sbjct: 287 DMYEALCRNEVPVSVKATSKLYCYYK-MDRPFLRLAPFKVEILRFSPLAVFFRDVITDEE 345
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ I L+ ++ R V N G+ R SK +L E +H +++I RI MT
Sbjct: 346 VTIIQMLATPRLRRATVQNSITGELETASYRTSKSAWLKDE---EHEIVHRINRRIDLMT 402
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
NL E+ LQ+ NYG+GGHYD H D R+E RLA+ +FY+T E
Sbjct: 403 NL----EQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPE 458
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F + TV P K A+FWYN + D R H+ CPV +G+KW
Sbjct: 459 SGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLIGSKW 508
>gi|240974259|ref|XP_002401836.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215491070|gb|EEC00711.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 490
Score = 139 bits (351), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 83/231 (35%), Positives = 127/231 (54%), Gaps = 19/231 (8%)
Query: 2 IYPLACQG-NLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
IY C+G VP K +L C Y + + FL + P K E ++ PR+V HD +
Sbjct: 244 IYERLCRGEKFPVPPLYKDKDLTCQYRTNGSPFLLLQPAKEEVMFPKPRIVIYHDVMSKH 303
Query: 60 EINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
E++ + L++ +++R V NY G+ + R+SK +L E +H + ++ RI+ +
Sbjct: 304 EMDVVKLLAQPRLKRATVQNYKSGELEVANYRISKSAWLRNE---EHGVIARVTRRIEHI 360
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDV 169
T L E LQ+ NYG+GGHY+ H D R+E R+A+++ Y++DV
Sbjct: 361 TGLSADTAEE----LQVVNYGIGGHYEPHFDFARREEKNAFQSLGTGNRIATWLNYMSDV 416
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L LT++PEKG+A FWYN H + D H+ CPV G+KW
Sbjct: 417 PAGGATVFPQLRLTLWPEKGAAAFWYNLHRSGEGDMLTRHAACPVLAGSKW 467
>gi|156352054|ref|XP_001622587.1| predicted protein [Nematostella vectensis]
gi|156209158|gb|EDO30487.1| predicted protein [Nematostella vectensis]
Length = 531
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 118/230 (51%), Gaps = 12/230 (5%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y C+G + + L+C+Y+ + +I PLKVEEL+ DP + + D +YDSE
Sbjct: 281 EAYERLCRGISYRSNEEAAKLRCYYDFTRHPMFRIRPLKVEELHSDPPIWMLRDVMYDSE 340
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLY-PEIFGDHPFLYKIQTRIQDM 117
I I + K+ R V N G+ + D R+SK +L P + L ++ R +
Sbjct: 341 IEYIKRTATPKLRRATVTNLKTGELEFADYRISKSGWLEDPRDDNEEKILNRVNRRTSII 400
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVE 170
T L R LQI NYG GHY+ H D AT + R+A+ ++Y++DVE
Sbjct: 401 TGL--DTTPRSAEALQIVNYGAAGHYEPHFDHATEAVSSILKLGIGNRIATVLYYMSDVE 458
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F V P KG A FWYN H N D R H+ CP+ +G+KW
Sbjct: 459 AGGATVFVDAEAIVKPSKGDAAFWYNLHKNGKGDERTRHAACPIIVGSKW 508
>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
Length = 454
Score = 139 bits (351), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 81/230 (35%), Positives = 125/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 209 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 268
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ +L+K ++ R V + G R+SK +L + P + +I RIQD+T
Sbjct: 269 NEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 325
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 326 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 381
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 382 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 431
>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
Length = 487
Score = 139 bits (350), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 85/223 (38%), Positives = 123/223 (55%), Gaps = 18/223 (8%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G D K L C Y + ++ FL++ PLK+E + LDP +V HD I +EI + E
Sbjct: 250 CRGEFPALTDAK--LYCIYNTTSSPFLRLAPLKMELIGLDPYMVLYHDVISPNEIAELQE 307
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMTNLVIGR 124
++K +++R +V Y T D +LSK F D + ++ RI DMTN V+
Sbjct: 308 MAKPQLKRARV--YNSTKNTD-QLSKTRTAKLAWFLDTFNQLTERLNQRIMDMTNFVLNG 364
Query: 125 EERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVELGGATIF 177
E LQ+ NYGLGG+Y H D +G R+A+ +FYL DVE GGAT+F
Sbjct: 365 SEM----LQVMNYGLGGYYVKHFDYFNTTKGPHITQINGDRIATVLFYLNDVEQGGATVF 420
Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
P + VFP++GSA+ WYN + + H+GCPV +G+KW
Sbjct: 421 PEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAGCPVIVGSKW 463
>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
garnettii]
Length = 538
Score = 139 bits (350), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
E+Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 293 EVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 352
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 353 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNHRMQH 409
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL R+A+F+ Y++DVE
Sbjct: 410 ITGLSVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVATFLNYMSDVE 465
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 466 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 515
>gi|195572621|ref|XP_002104294.1| GD18523 [Drosophila simulans]
gi|194200221|gb|EDX13797.1| GD18523 [Drosophila simulans]
Length = 490
Score = 139 bits (350), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 120/227 (52%), Gaps = 34/227 (14%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
LA + N + S L C Y + F +I PLK+EEL LDP +V HD +YD+EI+ +
Sbjct: 258 LATKQNCTAVVQKPSRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEIDGM 317
Query: 65 IELSKGKVE------RGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ S + + +V D+ VD++ + R+ DMT
Sbjct: 318 LNSSNFGISESVSGLKSEVRTSKDSHIVDSK-------------------TLNERVTDMT 358
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGG 173
L + + P + NYGLGGH+ LH D T R + R+A+ +FYL +V+ GG
Sbjct: 359 GLSMEMSD----PFSLINYGLGGHFILHHDFHEYTNTTRLKQGDRIATVLFYLGEVDSGG 414
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
ATIFP LN+TV P+KGSAVFWYN H + ++ + HS CPV G+K+
Sbjct: 415 ATIFPMLNITVTPKKGSAVFWYNLHNSGAVNSKTLHSACPVISGSKY 461
>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
Length = 573
Score = 139 bits (350), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 91/249 (36%), Positives = 126/249 (50%), Gaps = 37/249 (14%)
Query: 1 EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y C+G + V + K+ L+C+ + + FLKI P+KVE L DP V + I DS
Sbjct: 295 DAYEALCRGEIPPVEKKWKNKLRCYLKR-DKPFLKIAPIKVEILRFDPLAVLFKNVISDS 353
Query: 60 EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
EI I EL+ K++R V N G+ + R+SK +L ++ HP + ++ RI+D
Sbjct: 354 EIKVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLKGDL---HPVIERVNRRIEDF 410
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------------------- 157
T L G E LQ+ NYGLGGHYD H D A + GL
Sbjct: 411 TGLYQGTSEE----LQVANYGLGGHYDPHFDFARIANYGLGGHYEPHYDMSLKEEKNAFK 466
Query: 158 ------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 211
R+A+ +FY++ E GGAT+F L VFP K A+FWYN + D R H+
Sbjct: 467 TLNTGNRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAA 526
Query: 212 CPVALGNKW 220
CPV LG KW
Sbjct: 527 CPVLLGVKW 535
>gi|405965633|gb|EKC30995.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 617
Score = 139 bits (350), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 84/232 (36%), Positives = 124/232 (53%), Gaps = 21/232 (9%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y C+G + +K LKC Y NN L + P K EE+YL+P +V HD + D E
Sbjct: 372 QTYESLCRGEDTHDYKLKHKLKCRYVHKNNPRLLLKPAKEEEVYLNPWIVIYHDVVSDKE 431
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I+ I ++ + R V N G + R+SK +L GD P ++ + RI D+T
Sbjct: 432 IDTIKRIATPLLSRATVHNPRTGKLETAEYRVSKSAWLKD---GDDPVIHNVNNRISDIT 488
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQI NYGLGG Y+ H D R+E R+A+++ Y+T+V+
Sbjct: 489 GLSMATAEE----LQIANYGLGGQYEPHFDFARREETEAFRDLGSGNRIATWLTYMTNVD 544
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAH--ANTLLDYRMYHSGCPVALGNKW 220
GGAT+F + + +FP KG+A FWYN + + + D R H+ CPV +G KW
Sbjct: 545 AGGATVFTHIGVKLFPIKGAAAFWYNLYRSGDGIFDTR--HAACPVLVGQKW 594
>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Cricetulus griseus]
Length = 533
Score = 139 bits (350), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 84/224 (37%), Positives = 123/224 (54%), Gaps = 17/224 (7%)
Query: 7 CQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
C+G + + + L C Y N L I P K E+ + P +V+ +D + D EI RI
Sbjct: 294 CRGEGVKLTPQRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERI 353
Query: 65 IELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
E++K K+ R V + G R+SK +L + D P + ++ R+Q +T L +
Sbjct: 354 KEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQHITGLTV 410
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVELGGATI 176
E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE GGAT+
Sbjct: 411 KTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATV 466
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 467 FPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
Length = 521
Score = 139 bits (350), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 82/223 (36%), Positives = 121/223 (54%), Gaps = 18/223 (8%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G D K L C Y + ++ FL++ PLK+E + LDP +V HD I +EI + E
Sbjct: 287 CRGEFPALTDAK--LYCIYNTTSSPFLRLAPLKMELIGLDPYMVLYHDVISPNEIAELQE 344
Query: 67 LSKGKVERGKVVNYGDTI--YVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
++K +++R V N +V TR +KV + + ++ RI DMTN V+
Sbjct: 345 MAKPELKRATVYNSTKNTNQFVKTRTAKVAWFLDTF---NQLTERLNQRIMDMTNFVLNG 401
Query: 125 EERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLW--RLASFMFYLTDVELGGATIF 177
E LQ+ NYGLGG+Y H D P + R+A+ +FYL DVE GGAT+F
Sbjct: 402 SEM----LQVMNYGLGGYYVKHFDYFNTTTNPHISQINGDRIATVLFYLNDVEQGGATVF 457
Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
P + VFP++GSA+ WYN + + H+ CPV +G+KW
Sbjct: 458 PEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAACPVIVGSKW 500
>gi|195330780|ref|XP_002032081.1| GM23710 [Drosophila sechellia]
gi|194121024|gb|EDW43067.1| GM23710 [Drosophila sechellia]
Length = 490
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 120/227 (52%), Gaps = 34/227 (14%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
LA + N + S L C Y + F +I PLK+EEL LDP +V HD +YD+EI+ +
Sbjct: 258 LATKQNCTAVIQKPSRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEIDGM 317
Query: 65 IELSKGKVE------RGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ S + + +V D+ VD++ + R+ DMT
Sbjct: 318 LNSSNFGISESVSGLKSEVRTSKDSHIVDSK-------------------TLNERVTDMT 358
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGG 173
L + + P + NYGLGGH+ LH D T R + R+A+ +FYL +V+ GG
Sbjct: 359 GLSMEMSD----PFSLINYGLGGHFILHHDFHEYTNTTRLKRGDRIATVLFYLGEVDSGG 414
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
ATIFP LN+TV P+KGSAVFWYN H + ++ + HS CPV G+K+
Sbjct: 415 ATIFPMLNITVTPKKGSAVFWYNLHNSGAVNSKTLHSACPVISGSKY 461
>gi|345326417|ref|XP_001510155.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Ornithorhynchus anatinus]
Length = 888
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 126/232 (54%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 641 DVYEGLCRGEGVKLTPRRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVLSD 700
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I EL+K K+ R V + G + R+SK +L E D P + ++ R+Q
Sbjct: 701 EEIEKIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEE---DDPVVAQVNRRMQY 757
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + +DE R+A+F+ Y++D
Sbjct: 758 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSD 813
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 814 VEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 865
>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
Length = 520
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 79/233 (33%), Positives = 131/233 (56%), Gaps = 21/233 (9%)
Query: 2 IYPLACQGNLSVPEDIKSN----LKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAI 56
+Y L CQ + +I S+ LKC Y + NN L + P+++E+++ P++ +H+ +
Sbjct: 272 VYELLCQADQPEIFNITSSRVKHLKCRYFTNNNHPRLLLAPIRLEQVFDKPKLWVLHNIL 331
Query: 57 YDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRI 114
D E+ I +L++ ++ R +V + G+ R+SK +LY +H + ++ R+
Sbjct: 332 TDPEMEVIKKLAQPRLRRARVESPTTGEGELASYRISKSAWLYD---WEHRVIRRVNQRV 388
Query: 115 QDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLT 167
+D+T L + E LQ+ NYG+GGHY+ H D +DE R+A+ +FY++
Sbjct: 389 EDVTGLTMETAEL----LQVVNYGIGGHYEPHFDCATKDEEFALDPNEGDRIATMLFYMS 444
Query: 168 DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
DVE GGAT+FP + V PEKG+ FWYN + D H+GCPV +G+KW
Sbjct: 445 DVEAGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKW 497
>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
catus]
Length = 535
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 126/232 (54%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
+IY C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 288 DIYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + ++E R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 550
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 85/229 (37%), Positives = 119/229 (51%), Gaps = 18/229 (7%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
IY C+ + V S L C+Y+ + FL++ P KVE L +P V D I D E
Sbjct: 285 IYEALCRNEVPVSIKAISQLYCYYK-MDRPFLRLAPFKVEILRFNPLAVLFVDIISDEEA 343
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
I +++ +++R V N G+ R+SK +L GDH + +I RI+ MTN
Sbjct: 344 KMIQQIATPRLKRATVQNSKTGELETAAYRISKSAWLKG---GDHELIDRINRRIELMTN 400
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
L+ E LQI NYG+GGHYD H D ++E RLA+ +FYLT+ E+
Sbjct: 401 LIQETSEE----LQIANYGVGGHYDPHFDFARKEEPKAFESLGTGNRLATVLFYLTEPEI 456
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F L V P K A+FWYN + + D R H+ CPV +G KW
Sbjct: 457 GGGTVFTELRTAVMPSKNGALFWYNLYRSGEGDLRTRHAACPVLVGIKW 505
>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 523
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 79/224 (35%), Positives = 124/224 (55%), Gaps = 19/224 (8%)
Query: 8 QGNLSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
QG L P + S L C ++ ++ + IGP+K E+ + P +V+ HD + E+ + E
Sbjct: 285 QGALMTPRRL-SRLFCRYFNNHGHPNYLIGPVKQEDEWDSPYIVRYHDVASEKEMETVKE 343
Query: 67 LSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
L+K ++ R V + G R+SK +L +HP + +I RI+D+T L +
Sbjct: 344 LAKPRLRRATVHDPQTGKLTTAQYRVSKSAWLGSH---EHPIVDRINQRIEDITGLDVST 400
Query: 125 EERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELGGATI 176
E LQ+ NYG+GG Y+ H D +DE R+A+++ Y++DV+ GG T+
Sbjct: 401 AE----DLQVANYGVGGQYEPHFDFGRKDEADAFEELGTGNRIATWLLYMSDVQAGGNTV 456
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
F + V+P+KG+AVFWYN H + DYR H+ CPV +GNKW
Sbjct: 457 FTDIGAVVWPKKGTAVFWYNLHRSGEGDYRTRHAACPVLVGNKW 500
>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
araneus]
Length = 533
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C ++ + L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGHGAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTTASYRVSKSSWLEET---DDPVVARVNLRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510
>gi|195330778|ref|XP_002032080.1| GM23711 [Drosophila sechellia]
gi|194121023|gb|EDW43066.1| GM23711 [Drosophila sechellia]
Length = 490
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 82/227 (36%), Positives = 117/227 (51%), Gaps = 34/227 (14%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
LA + N + S L C Y + F +I PLK+EEL LDP +V HD +YD+EI+ +
Sbjct: 258 LATKQNCTAVVQKPSRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEIDGM 317
Query: 65 IELSKGKV------ERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ S + ++ +V D+ VD + + R+ DMT
Sbjct: 318 LNSSNFVLSLTDSGQKSEVRTSKDSYIVDAK-------------------SLNERVTDMT 358
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGG 173
G P + NYGLGGHY LH D T R + R+A+ +FYL +V+ GG
Sbjct: 359 ----GFSMEMSDPFSLINYGLGGHYMLHYDFHEYTNTTRPKQGDRIATVLFYLGEVDSGG 414
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
ATIFP +N+ V P+KGSAVFWYN H + ++ + HS CPV G+K+
Sbjct: 415 ATIFPKINIAVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGSKY 461
>gi|195572619|ref|XP_002104293.1| GD18524 [Drosophila simulans]
gi|194200220|gb|EDX13796.1| GD18524 [Drosophila simulans]
Length = 472
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/221 (38%), Positives = 117/221 (52%), Gaps = 22/221 (9%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
LA + N + S L C Y + F +I PLK+EEL LDP +V HD +YD+EI+ +
Sbjct: 240 LATKQNCTAVVQKPSRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEIDGM 299
Query: 65 IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
+ S N+G ++ + S+V D L R+ DMT G
Sbjct: 300 LNSS----------NFGLSLTDSGQKSEVRTSKDSYIVDSESL---NERVTDMT----GF 342
Query: 125 EERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
P + NYGLGGHY LH D T R + R+A+ +FYL +V+ GGATIFP
Sbjct: 343 SMEMSDPFSLINYGLGGHYMLHYDFHEYTNTTRPKQGDRIATVLFYLGEVDSGGATIFPK 402
Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+N+ V P+KGSAVFWYN H + ++ + HS CPV G+K+
Sbjct: 403 INIAVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGSKY 443
>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
laevis]
gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
Length = 533
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 130/230 (56%), Gaps = 17/230 (7%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N N L +GP+K+E+ + PR+V+ D + D
Sbjct: 288 DVYEALCRGEGVKMNPRRQKRLFCRYHDGNRNPRLILGPIKMEDEWDSPRIVRYLDVLSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I EL+K ++ R V + G + R+SK +L E + D P + ++ +R+Q
Sbjct: 348 EEIEKIKELAKPRLARATVRDPKTGVLTVANYRVSKSAWL--EEY-DDPVIGRVNSRMQA 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR--DEGLW----RLASFMFYLTDVE 170
+T L E LQ+ NYG+GG Y+ H D + R D L RLA+++ Y++DVE
Sbjct: 405 ITGLTKDTAEL----LQVANYGMGGQYEPHFDFSRRPFDSNLKTEGNRLATYLNYMSDVE 460
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP ++P KG+AVFWYN + DYR H+ CPV +G+KW
Sbjct: 461 AGGATVFPDFGAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 510
>gi|301754231|ref|XP_002912939.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Ailuropoda
melanoleuca]
Length = 535
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 126/232 (54%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
+IY C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 288 DIYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + ++E R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|195341558|ref|XP_002037373.1| GM12146 [Drosophila sechellia]
gi|194131489|gb|EDW53532.1| GM12146 [Drosophila sechellia]
Length = 485
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 82/225 (36%), Positives = 116/225 (51%), Gaps = 15/225 (6%)
Query: 4 PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
P C G P +K L C Y FL++ P+K E L +DP V+ HD + +E
Sbjct: 269 PPCCSGRCEGPRKLK-RLYCVYNCVTAPFLRLAPIKTEILSIDPFVILFHDMVSPTEGAL 327
Query: 64 IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
I SK ++ + VN + V R SK + + + K+ R+ + T L +
Sbjct: 328 IRSSSKNQILPSETVNAANEFEVAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 384
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
E P Q+ NYG+GG ++ H D + DE + RLA+ +FYL DV GGAT
Sbjct: 385 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 440
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
FP LN+TVFP+ G+ + WYN H LL R H+GCPV +G+KWG
Sbjct: 441 FPGLNITVFPKFGTVLMWYNLHTEGLLHVRTMHTGCPVIVGSKWG 485
>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 531
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 128/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y C+G L + +S L C +Y++ + IGP+K E+ + P +V+ HD + + E
Sbjct: 286 YEQLCRGEGLKMTARRQSQLFCRYYDNGRHPKYVIGPVKQEDEWDRPHIVRYHDILSNRE 345
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ + EL+K ++ R V + G R+SK +L +HP + +I RI+D+T
Sbjct: 346 METVKELAKPRLRRATVHDPQTGQLTTAPYRVSKSAWLGA---FEHPVVDRINQRIEDIT 402
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++ Y+++V+
Sbjct: 403 GLDVSTAE----DLQVANYGVGGQYEPHYDFGRKDEPDAFKELGTGNRIATWLLYMSEVQ 458
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F + +V P+KGSAVFWYN H + DYR H+ CPV LGNKW
Sbjct: 459 AGGATVFTDIGASVSPKKGSAVFWYNLHPSGDGDYRTRHAACPVLLGNKW 508
>gi|195591302|ref|XP_002085381.1| GD14757 [Drosophila simulans]
gi|194197390|gb|EDX10966.1| GD14757 [Drosophila simulans]
Length = 525
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 85/225 (37%), Positives = 122/225 (54%), Gaps = 21/225 (9%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
+ L C+G K+NL C Y+S NTFL++ PLK+EE+ LDP + H+ +YDSEI+
Sbjct: 291 FELGCRGLYRQ----KTNLVCRYKSTANTFLRLAPLKLEEISLDPFIAMYHEVLYDSEIH 346
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
+ S V G T DT ++ + + +I RI DMT
Sbjct: 347 ELKGQSMNMV-NGYASERNGTEIRDTVARYDWWSNTSLVRE-----RINQRIIDMTEFNF 400
Query: 123 GREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGAT 175
++E+ LQI NYG+G ++ H D TP L RLAS +FY ++V GGAT
Sbjct: 401 SKDEK----LQITNYGVGTYFQPHFDYSSDGFETPNITTLGDRLASILFYASEVPQGGAT 456
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+FP +N+TVFP+KGS ++W+N H + D R HS CPV G++W
Sbjct: 457 VFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSKHSVCPVINGDRW 501
>gi|348557542|ref|XP_003464578.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Cavia porcellus]
Length = 535
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 123/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
E+Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 EVYESLCRGEGIKLTPQRRKRLFCRYHHGNRAPELLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L E D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEE---DDPVVARVNRRMQQ 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + E R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSHERDAFKRLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
carolinensis]
Length = 542
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C +Y+ N + P+K E+ + PR+V+ + I D E
Sbjct: 297 YEMLCRGEGLKMTPRRQKKLFCRYYDGNRNPKYILRPVKQEDEWDRPRIVRFVEIISDEE 356
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + EL+K ++ R V + G R+SK +L ++P + +I TRIQD+T
Sbjct: 357 IETVKELAKPRLSRATVHDPQTGKLTTAHYRVSKSAWLSGY---ENPIVARINTRIQDLT 413
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 414 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 469
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 470 AGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 519
>gi|226874889|ref|NP_001152881.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Bos
taurus]
gi|296485624|tpg|DAA27739.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Bos taurus]
Length = 535
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 125/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + +DE R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Sarcophilus harrisii]
Length = 536
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 126/232 (54%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 289 DVYEALCRGEGIKLTPRRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVLSD 348
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI EL+K K+ R V + G + R+SK +L GD P + ++ R+
Sbjct: 349 EEIERIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEE---GDDPVIAQLNRRMHY 405
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + + E R+A+F+ Y++D
Sbjct: 406 ITGLSVKTAEL----LQVANYGMGGQYEPHFDFSRKGEQDAFKHLGTGNRVATFLNYMSD 461
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP T++P+KG++VFWYN + DYR H+ CPV +G+KW
Sbjct: 462 VEAGGATVFPDFGATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 513
>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Acyrthosiphon pisum]
Length = 552
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 82/231 (35%), Positives = 124/231 (53%), Gaps = 18/231 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
E Y + C+ + I S L+C Y + N N L I PLK EE + PR++ D +YD+
Sbjct: 300 ERYHMLCRNENLMSIQISSQLRCRYTNNNRNPLLLIAPLKEEEAFFSPRIILYRDVLYDN 359
Query: 60 EINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
EI I +++ +++R V NY G+ + D R+SK +L + + + R++ M
Sbjct: 360 EIEVIKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEH---EDVVVANVAKRVEVM 416
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-------GLW-RLASFMFYLTDV 169
T L E LQ+ NYG+GGHYD H D +E G R+A+ +FY++DV
Sbjct: 417 TGLTTETAEE----LQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDV 472
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L + + P KG+A W+N + + D R H+ CPV G+KW
Sbjct: 473 AQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKW 523
>gi|426229219|ref|XP_004008688.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Ovis aries]
Length = 535
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 125/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + +DE R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Acyrthosiphon pisum]
Length = 534
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 82/231 (35%), Positives = 124/231 (53%), Gaps = 18/231 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
E Y + C+ + I S L+C Y + N N L I PLK EE + PR++ D +YD+
Sbjct: 282 ERYHMLCRNENLMSIQISSQLRCRYTNNNRNPLLLIAPLKEEEAFFSPRIILYRDVLYDN 341
Query: 60 EINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
EI I +++ +++R V NY G+ + D R+SK +L + + + R++ M
Sbjct: 342 EIEVIKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEH---EDVVVANVAKRVEVM 398
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-------GLW-RLASFMFYLTDV 169
T L E LQ+ NYG+GGHYD H D +E G R+A+ +FY++DV
Sbjct: 399 TGLTTETAEE----LQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDV 454
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP L + + P KG+A W+N + + D R H+ CPV G+KW
Sbjct: 455 AQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKW 505
>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
Length = 537
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 82/226 (36%), Positives = 123/226 (54%), Gaps = 16/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G ++ + NL+C N+ F + PLK+EE LDP VV HD + +I
Sbjct: 287 YEKVCRGEVNPTPRQERNLRCRLSQGNHPFRLLAPLKLEEHNLDPYVVTYHDMLSAQKIR 346
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ +++ ++ R V + G R+SK +L + HP + + ++D T L
Sbjct: 347 DLRQMAVPRMRRSTVNPLPGGQNKKSAFRVSKNAWL---AYESHPTMEGMLRDLKDATGL 403
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ Y LQ+ NYG+GGHY+ H D P +EG R+A+ +FYL+DVE GGA
Sbjct: 404 ----DTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGN-RIATAIFYLSDVEQGGA 458
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP L+ V P+ G+ +FWYN H + +DYR H+GCPV G+KW
Sbjct: 459 TAFPFLDFAVKPQLGNVLFWYNLHRSLDMDYRTKHAGCPVLKGSKW 504
>gi|195575113|ref|XP_002105524.1| GD16980 [Drosophila simulans]
gi|194201451|gb|EDX15027.1| GD16980 [Drosophila simulans]
Length = 518
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 81/224 (36%), Positives = 117/224 (52%), Gaps = 15/224 (6%)
Query: 4 PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
P C G P+ +K L C Y FL++ P+K E L +DP V+ +HD + +E
Sbjct: 268 PHCCSGRCERPQKLK-RLYCVYNCITAPFLRLAPIKTEILSVDPFVILLHDMVSPTEGAL 326
Query: 64 IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
I SK ++ + VN + V R SK + + + K+ R+ + T L +
Sbjct: 327 IRSSSKNQILPSETVNAANEFEVAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 383
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
E P Q+ NYG+GG ++ H D + DE + RLA+ +FYL DV GGAT
Sbjct: 384 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 439
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP LN+TVFP+ G+ + WYN H LL R H+GCPV +G+KW
Sbjct: 440 FPGLNITVFPKFGTVLMWYNLHTEGLLHVRTMHTGCPVIVGSKW 483
>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
Length = 533
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 121/226 (53%), Gaps = 16/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + + L+C Y + + + PLK+EE LDP VV HD + +I
Sbjct: 283 YEKVCRGEVGPSAAQQRRLRCRYARGRHAYRLLAPLKLEEHSLDPLVVSYHDMLSPQQIG 342
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ ++ ++R V ++ G + R+SK +L + HP + ++ + D T L
Sbjct: 343 ELRAMAVPHMQRSTVNPLSGGQRMKSAFRVSKNAWL---PYSTHPMMGRMLRDVGDATGL 399
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ Y LQ+ NYG+GGHY+ H D P EG R+A+ +FYL+DVE GGA
Sbjct: 400 ----DMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGN-RIATAIFYLSDVEQGGA 454
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP LN V P+ G+ +FWYN H ++ DYR H+GCPV G+KW
Sbjct: 455 TAFPFLNFAVRPQLGNILFWYNLHRSSDEDYRTKHAGCPVLKGSKW 500
>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
Length = 386
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 80/206 (38%), Positives = 115/206 (55%), Gaps = 12/206 (5%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y++ ++ FL++ PLK+E L LDP +V HD + D +I I L+KGK+ R V
Sbjct: 168 AKLYCLYKTTSSYFLRLAPLKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTV 227
Query: 79 NYGDTIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
+ D R +K +L ++ + ++ QDMTN I + P Q+ NY
Sbjct: 228 SKDGNYTEDPDRTTKGTWLVE----NNALIQRLSQLTQDMTNFDIHDAD----PFQVLNY 279
Query: 138 GLGGHYDLHCD---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
G+GG Y +H D D R+A+ +FYL+DV GGATIFP L L+VFP+KGSA+ W
Sbjct: 280 GIGGFYGIHFDFLEDAELDNFSDRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLW 339
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
YN D R HS CP +G++W
Sbjct: 340 YNLDHKGDGDNRTAHSACPTVVGSRW 365
>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
Length = 473
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 82/206 (39%), Positives = 114/206 (55%), Gaps = 12/206 (5%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y + + FL++ PLK+E L LDP +V HD + D +I I L+KG + R V
Sbjct: 255 AKLHCLYNTTASYFLRLAPLKMELLSLDPYMVLFHDVVSDKDITSIRNLAKGGLVRAVTV 314
Query: 79 NYGDTIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
+ D R +K +L + + ++ QDMTNL I R P Q+ NY
Sbjct: 315 TKDGSYEEDPARTTKGTWL----VENSKLIQRLSQLAQDMTNLDI----RDADPFQVLNY 366
Query: 138 GLGGHYDLHCDATPRDE-GLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
G+GG+Y H D E G + R+A+ +FYL+DV GGATIFP L L+VFP+KGSA+ W
Sbjct: 367 GIGGYYGTHFDFLADTEMGNFSNRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLW 426
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
YN D R HS CP +G++W
Sbjct: 427 YNLDHKGDGDNRTAHSACPTIVGSRW 452
>gi|66770649|gb|AAY54636.1| IP12415p [Drosophila melanogaster]
gi|66772017|gb|AAY55320.1| IP12615p [Drosophila melanogaster]
Length = 512
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 117/210 (55%), Gaps = 17/210 (8%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
K+NL C Y+S NTFL++ PLK+EE+ LDP + H+ +YDSEI + S V G
Sbjct: 294 KTNLVCRYKSTANTFLRLAPLKLEEISLDPFMAMYHEVLYDSEIRELKGQSMNMV-NGYA 352
Query: 78 VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
T DT + ++ + + +I RI DMT ++E+ LQI NY
Sbjct: 353 SQRNGTEIRDTVVRYDWWSNTSLVRE-----RINQRIIDMTGFNFLKDEK----LQIANY 403
Query: 138 GLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
GLG ++ H D TP L RLAS +FY ++V GGAT+FP +N+TVFP+KGS
Sbjct: 404 GLGTYFQPHFDYSSDGFETPNITTLGDRLASILFYASEVPQGGATVFPEINVTVFPQKGS 463
Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
++W+N H + D R HS CPV G++W
Sbjct: 464 MLYWFNLHDDGKPDIRSLHSVCPVLNGDRW 493
>gi|195499025|ref|XP_002096772.1| GE25857 [Drosophila yakuba]
gi|194182873|gb|EDW96484.1| GE25857 [Drosophila yakuba]
Length = 490
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 88/228 (38%), Positives = 116/228 (50%), Gaps = 36/228 (15%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI--- 61
LA N +V S L C Y S F +I PLK+EEL LDP +V HD IYD EI
Sbjct: 258 LATVQNCTVVVQKPSRLHCRYNSTTTPFTRIAPLKMEELSLDPYMVVFHDVIYDREIELM 317
Query: 62 ----NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
N I+ L+ E +V D+ V+++ + R+ DM
Sbjct: 318 LNSSNFILSLTDSGQE-SEVRASKDSYIVESK-------------------TLNDRVTDM 357
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELG 172
T L + P + NYG+GGHY LH D T R + R+A+ +FYL +V+ G
Sbjct: 358 TGLSM----ELSDPFSLINYGIGGHYMLHYDYHKYTNTTRAKYGDRIATLLFYLGEVDSG 413
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GATIFP +N+TV P+KGSAVFWYN H + L HS CPV G+K+
Sbjct: 414 GATIFPRINITVTPKKGSAVFWYNLHNSGALHLETLHSACPVISGSKY 461
>gi|194871348|ref|XP_001972831.1| GG13664 [Drosophila erecta]
gi|190654614|gb|EDV51857.1| GG13664 [Drosophila erecta]
Length = 520
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 116/210 (55%), Gaps = 17/210 (8%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
++NL C Y+S NTFL++ PLK EE+ LDP + H+ +YDSEI+ + KGK G +
Sbjct: 302 RTNLVCRYKSTANTFLRLAPLKFEEISLDPFIAVYHEVLYDSEIHAL----KGK--SGNM 355
Query: 78 VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
VN T + Y +I RI DMT ++E+ LQI NY
Sbjct: 356 VNGYARQRNGTEIRDTVARYDWWSDTSLTRERINQRIIDMTGFNFTKDEK----LQIANY 411
Query: 138 GLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
G+G +++ H D TP L RLAS +FY +V GGAT+FP +N+TVFP+KGS
Sbjct: 412 GVGTYFEPHFDYSSDGFETPEVTTLGDRLASIIFYAGEVLQGGATVFPEINVTVFPQKGS 471
Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
++W+N H + D R HS CPV G++W
Sbjct: 472 MLYWFNLHDDGRPDIRSQHSACPVVNGDRW 501
>gi|66771935|gb|AAY55279.1| IP12715p [Drosophila melanogaster]
Length = 451
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 117/210 (55%), Gaps = 17/210 (8%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
K+NL C Y+S NTFL++ PLK+EE+ LDP + H+ +YDSEI + S V G
Sbjct: 233 KTNLVCRYKSTANTFLRLAPLKLEEISLDPFMAMYHEVLYDSEIRELKGQSMNMV-NGYA 291
Query: 78 VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
T DT + ++ + + +I RI DMT ++E+ LQI NY
Sbjct: 292 SQRNGTEIRDTVVRYDWWSNTSLVRE-----RINQRIIDMTGFNFLKDEK----LQIANY 342
Query: 138 GLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
GLG ++ H D TP L RLAS +FY ++V GGAT+FP +N+TVFP+KGS
Sbjct: 343 GLGTYFQPHFDYSSDGFETPNITTLGDRLASILFYASEVPQGGATVFPEINVTVFPQKGS 402
Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
++W+N H + D R HS CPV G++W
Sbjct: 403 MLYWFNLHDDGKPDIRSLHSVCPVLNGDRW 432
>gi|221512818|ref|NP_730346.2| CG32201 [Drosophila melanogaster]
gi|220902638|gb|AAN11679.2| CG32201 [Drosophila melanogaster]
Length = 520
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 117/210 (55%), Gaps = 17/210 (8%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
K+NL C Y+S NTFL++ PLK+EE+ LDP + H+ +YDSEI + S V G
Sbjct: 302 KTNLVCRYKSTANTFLRLAPLKLEEISLDPFMAMYHEVLYDSEIRELKGQSMNMV-NGYA 360
Query: 78 VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
T DT + ++ + + +I RI DMT ++E+ LQI NY
Sbjct: 361 SQRNGTEIRDTVVRYDWWSNTSLVRE-----RINQRIIDMTGFNFLKDEK----LQIANY 411
Query: 138 GLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
GLG ++ H D TP L RLAS +FY ++V GGAT+FP +N+TVFP+KGS
Sbjct: 412 GLGTYFQPHFDYSSDGFETPNITTLGDRLASILFYASEVPQGGATVFPEINVTVFPQKGS 471
Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
++W+N H + D R HS CPV G++W
Sbjct: 472 MLYWFNLHDDGKPDIRSLHSVCPVLNGDRW 501
>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
Length = 235
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 76/201 (37%), Positives = 116/201 (57%), Gaps = 15/201 (7%)
Query: 21 LKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
L C Y + N L P+K EEL+ +P++++ HD I D+EI + ++++ ++ R +
Sbjct: 26 LSCRYSTGGGNPRLMYAPVKEEELWDEPKIIRYHDVISDTEIETLKDIARPELTRSQT-- 83
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
G + + R S+ FL + + +I RI D+T L + E+ L + NYG+
Sbjct: 84 -GWGVISEIRTSQSVFL-----DEVGTVARISQRIADITGLSVESAEK----LHVQNYGI 133
Query: 140 GGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 199
GG Y H DA R A+F+ Y++DVE+GGAT+F ++ + V PEKGSAVFW N H
Sbjct: 134 GGRYTPHFDAGGDVNE--RTATFLIYMSDVEVGGATVFTNVGVAVKPEKGSAVFWNNLHK 191
Query: 200 NTLLDYRMYHSGCPVALGNKW 220
N LD + H+GCPV +GNKW
Sbjct: 192 NGELDLKTKHAGCPVLVGNKW 212
>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 535
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGIKMTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
Length = 534
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 89/230 (38%), Positives = 118/230 (51%), Gaps = 18/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G V +S + C Y + FLK+ P+KVE L P VV I D E
Sbjct: 280 DVYEALCRGEQKVNVTAQSEVYC-YLKMDRPFLKLAPIKVEILRFSPLVVLFKQVISDYE 338
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I I +L+ K++R V N GD Y + R+SK +L DHP + +I RI MT
Sbjct: 339 IEVIEKLAIPKLKRATVQNARTGDLEYANYRISKSAWLKG---TDHPAIDRINKRIDLMT 395
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW-------RLASFMFYLTDVE 170
NL + LQ NYG+GGHYD H D A D + R+A+ + Y++DVE
Sbjct: 396 NL----NQETAEELQAQNYGIGGHYDPHFDFARKEDINAFKTLNTGNRIATILIYMSDVE 451
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F L VFP K A+FWYN + D R H+ CPV G KW
Sbjct: 452 SGGATVFNHLGNAVFPSKYDALFWYNLRRDGEGDLRTRHAACPVLTGIKW 501
>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
Length = 509
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 80/206 (38%), Positives = 115/206 (55%), Gaps = 12/206 (5%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y++ ++ FL++ PLK+E L LDP +V HD + D +I I L+KGK+ R V
Sbjct: 291 AKLYCLYKTTSSYFLRLAPLKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTV 350
Query: 79 NYGDTIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
+ D R +K +L ++ + ++ QDMTN I + P Q+ NY
Sbjct: 351 SKDGNYTEDPDRTTKGTWL----VENNALIQRLSQLTQDMTNFDIHDAD----PFQVLNY 402
Query: 138 GLGGHYDLHCD---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
G+GG Y +H D D R+A+ +FYL+DV GGATIFP L L+VFP+KGSA+ W
Sbjct: 403 GIGGFYGIHFDFLEDAELDNFSDRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLW 462
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
YN D R HS CP +G++W
Sbjct: 463 YNLDHKGDGDNRTAHSACPTVVGSRW 488
>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
musculus]
gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
musculus]
Length = 537
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 290 DVYESLCRGEGVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 349
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 350 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 406
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++D
Sbjct: 407 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSD 462
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 463 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514
>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_c [Rattus norvegicus]
Length = 506
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 259 DVYESLCRGEGIKMTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 318
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 319 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 375
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++D
Sbjct: 376 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRVATFLNYMSD 431
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 432 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 483
>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
Length = 543
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y C+G L + + L C +Y N +GP++ E+ + PR+V+ D I + E
Sbjct: 298 YEKLCRGEGLKMTPRREKKLFCRYYNGNGNPNYILGPVRQEDEWDRPRIVRFLDIISNEE 357
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I ++ ELSK ++ R + N G R+SK +L ++P + +I RIQD+T
Sbjct: 358 IEKVKELSKPRLRRATISNPITGVLETAHYRISKSAWLSGY---ENPVVARINQRIQDLT 414
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 415 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVA 470
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 471 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 520
>gi|195145314|ref|XP_002013641.1| GL24244 [Drosophila persimilis]
gi|194102584|gb|EDW24627.1| GL24244 [Drosophila persimilis]
Length = 496
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 76/229 (33%), Positives = 125/229 (54%), Gaps = 28/229 (12%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
++ CQG +P ++S+L+C Y + + FL++ PL++E L DP V H+ + +E
Sbjct: 266 VHQRNCQGRSRLP--VQSSLRCHYSAEGSAFLRLAPLRMELLSRDPLVALYHEVVSAAEQ 323
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDH-PFLYKIQTRIQDMTNL 120
++ LS+ +++R + Y D I F + + P + ++ R++D+T L
Sbjct: 324 RHLMLLSESQLQRQRGHQY-DKIRT--------FASASVAANATPTVEQLHRRLEDITGL 374
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDAT---------PRDEGLWRLASFMFYLTDVEL 171
+ E PL+I NYG+GG Y +H D P++ +RLA+ + YL+DV L
Sbjct: 375 DLAESE----PLRILNYGIGGQYYIHVDCEQPQTHVEPYPKE---YRLATVLLYLSDVRL 427
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP+L L + P +GSA+ W+NA+ DYR H+ CPV LG +W
Sbjct: 428 GGFTSFPALGLGIRPNRGSALVWHNANNAGNCDYRALHAACPVLLGTRW 476
>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
musculus]
Length = 506
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 259 DVYESLCRGEGVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 318
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 319 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 375
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++D
Sbjct: 376 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSD 431
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 432 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 483
>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
troglodytes]
gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
troglodytes]
gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
paniscus]
gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
paniscus]
gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 535
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
+IY C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DIYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
Length = 538
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 122/226 (53%), Gaps = 16/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G +S + L+C Y + + + PLK+EE LDP VV HD + +I
Sbjct: 288 YEKVCRGEVSASAAQQRPLRCRYARGQHAYRVLAPLKLEEHSLDPLVVSYHDMLSPQQII 347
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ +++ ++R V + + R+SK +L + HP + ++ + D T L
Sbjct: 348 ELRQMAVPHMKRSTVNPLPGRQSKKSAFRVSKNAWLE---YDTHPMMGRMLRDLSDATGL 404
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ Y LQ+ NYG+GGHY+ H D P +EG R+A+ +FYL+DVE GGA
Sbjct: 405 DMT----YCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGN-RIATAIFYLSDVEQGGA 459
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP LN V P+ G+ +FWYN H + +DYR H+GCPV G+KW
Sbjct: 460 TAFPFLNFAVRPQLGNILFWYNLHRSLDMDYRTKHAGCPVLKGSKW 505
>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
Length = 539
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 81/229 (35%), Positives = 123/229 (53%), Gaps = 18/229 (7%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+ L L+C + N P K+EEL+LDP ++++HD I +
Sbjct: 286 LYQQVCREELRPAPAALRELRCRLFAGNGRKSTYAPYKLEELHLDPYIIQVHDVISARDT 345
Query: 62 NRIIELSKGKVERGKVVNYG--DTIYVDTRLSK-VYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ L++ +++R +V + + I + R S+ F Y DHP + K+ + +++
Sbjct: 346 AELQHLARPELQRSQVYSRTGHEHISANFRTSQGTTFEYT----DHPIMQKMSHHVAEIS 401
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPR--DEGLW-----RLASFMFYLTDVEL 171
G + R PLQI NYG+GGHY+ H D+ P D L RLA+ ++YL++VE
Sbjct: 402 ----GLDMRSAEPLQIANYGIGGHYEPHMDSFPDSYDYSLNMYKTNRLATGIYYLSNVEA 457
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L L V PE+GS +FWYN H + DYR H+ CPV G+KW
Sbjct: 458 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDADYRTKHAACPVLQGSKW 506
>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
Length = 534
Score = 136 bits (343), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 121/226 (53%), Gaps = 16/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + + L+C Y + + + PLK+EE LDP VV HD + I
Sbjct: 284 YEKVCRGEVGASAAQQRPLRCRYTRGEHAYRLLAPLKLEEHSLDPLVVTFHDMLSQHRIA 343
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ E++ ++R V + G R+SK +L + HP + ++ + D T L
Sbjct: 344 ELREMAVPHMQRSTVNPLPGGQRRKSAFRVSKNAWL---PYSTHPTMGRMLRDVSDATGL 400
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ E+ LQ+ NYG+GGHY+ H D P EG R+A+ +FYL+DVE GGA
Sbjct: 401 DMTFCEQ----LQVANYGVGGHYEPHWDFFRDSRHYPAAEGN-RIATAIFYLSDVEQGGA 455
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP LN V P+ G+ +FWYN H ++ +D+R H+GCPV G+KW
Sbjct: 456 TAFPFLNFAVRPQLGNILFWYNLHRSSDMDFRTKHAGCPVLKGSKW 501
>gi|195109817|ref|XP_001999478.1| GI23043 [Drosophila mojavensis]
gi|193916072|gb|EDW14939.1| GI23043 [Drosophila mojavensis]
Length = 491
Score = 136 bits (343), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/222 (33%), Positives = 120/222 (54%), Gaps = 20/222 (9%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G +P + +L+C Y + + FL++ PLK+E+L +DP V H+AI+D+E+ IIE
Sbjct: 259 CRGQRQLP--VSDSLRCRYSAEGSPFLRLAPLKLEQLSIDPYVALCHNAIHDNELEYIIE 316
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
S+ ++R +V+ G + + L G ++ R++DM+ +
Sbjct: 317 QSRPYLKRA-LVDQGVVHEKRVTMDAAFDLNASTHG-----RTLRQRLEDMSGFDLSN-- 368
Query: 127 RYKGPLQINNYGLGGHYDLHCDA-TPRDEGLW-------RLASFMFYLTDVELGGATIFP 178
G L + NYG+GGHY +H D D + R+A+ + YL +V++GG T FP
Sbjct: 369 --SGQLAVLNYGIGGHYSMHFDCWFSSDSAAYEAYIRSNRIATILLYLNEVQMGGITSFP 426
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+L L V P KGSA+ W+N + DYR H+ CP LGN+W
Sbjct: 427 ALGLGVQPIKGSALIWHNMNHEIECDYRTLHAACPTLLGNRW 468
>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
Length = 242
Score = 136 bits (343), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 72/213 (33%), Positives = 123/213 (57%), Gaps = 17/213 (7%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
+ L+C N P ++EEL+LDP V+++HD I E + +L++ +++R V
Sbjct: 4 QRKLRCRLHRGNGLRSSYQPYRLEELHLDPYVIQVHDIISAEETIVLQQLARPELQRSMV 63
Query: 78 VNYGDTIYVDT--RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
+ ++ ++ T R+S+ F + +HP + ++ +++++ L + E+ LQ+
Sbjct: 64 YSLSNSEHISTNFRISQGTFFE---YHEHPIMQRMSQHLENISGLDMRSAEQ----LQVA 116
Query: 136 NYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
NYG+GGHY+ H D+ + R+A+ ++YL++VE GG T FP L L V PE
Sbjct: 117 NYGIGGHYEPHMDSFSENHNYGINTYMSTNRVATGIYYLSNVEAGGGTAFPFLPLLVEPE 176
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+GS +FWYN H + LDYR H+GCPV +G+KW
Sbjct: 177 RGSLLFWYNLHRSGDLDYRTKHAGCPVLMGSKW 209
>gi|195110923|ref|XP_002000029.1| GI22757 [Drosophila mojavensis]
gi|193916623|gb|EDW15490.1| GI22757 [Drosophila mojavensis]
Length = 535
Score = 136 bits (343), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 123/228 (53%), Gaps = 17/228 (7%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
IY C+ L + L+C Y S + L K+EEL+ DP ++++H+ I E
Sbjct: 285 IYQQVCREELMPTAAAQRELRCRYFSGHGRSLNYLAYKLEELHRDPYIIQLHEVIGAHES 344
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
++ L++ ++R +V + G T F Y E HP + K+ Q MT
Sbjct: 345 VQLQHLARPVLQRSEVYSPTNGSTAATFRTSQGTVFEYDE----HPIIEKLS---QHMT- 396
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPR--DEGLWR-----LASFMFYLTDVELG 172
L+ G + + PLQI NYG+GGHY+ H D+ P D L R +A+ +FYL++VE G
Sbjct: 397 LISGLDMGFAEPLQIANYGIGGHYEPHMDSFPESFDYSLQRFKTNRIATGIFYLSNVEAG 456
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT FP L L V PE+GS +FWYN H + DYR H+GCPV G+KW
Sbjct: 457 GATAFPFLPLLVKPEQGSLLFWYNLHRSGDADYRTKHAGCPVLQGSKW 504
>gi|195452744|ref|XP_002073481.1| GK14140 [Drosophila willistoni]
gi|194169566|gb|EDW84467.1| GK14140 [Drosophila willistoni]
Length = 454
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 80/227 (35%), Positives = 123/227 (54%), Gaps = 16/227 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++ P C GN V + + L C Y + + FL+I P+K+E L L+P +V HD I SE
Sbjct: 217 DVLPYCCSGNCEVDREFQ--LFCLYNTKDAYFLRIAPVKMEILSLNPYIVLCHDVILPSE 274
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ S ++E + ++ + ++ R SK +L ++ I+D++
Sbjct: 275 QEFLKTQSSKRLEGARALDQVKNEVVFNFIRTSKATWLKK---NSDNVTRRLSHWIEDVS 331
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
NL + Y QI NYG+GG ++ H D +DE W R+A+F+FYL DV GG
Sbjct: 332 NLDSNIGDLY----QIINYGVGGLFEAHSDTMRKDEDRWKVLYDRIATFIFYLQDVPQGG 387
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
AT+F +LNLTVFP+ G+A+FW+N D H+GCPV +G+KW
Sbjct: 388 ATLFNNLNLTVFPKAGAALFWFNLDNAGDTDLFTVHTGCPVIVGSKW 434
>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
mulatta]
Length = 535
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERHTFKHLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|195352182|ref|XP_002042593.1| GM14980 [Drosophila sechellia]
gi|194124477|gb|EDW46520.1| GM14980 [Drosophila sechellia]
Length = 520
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 84/225 (37%), Positives = 123/225 (54%), Gaps = 21/225 (9%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
+ L C+G K+NL C ++S NTFL++ PLK+EE+ LDP + H+ +YDSEI+
Sbjct: 291 FELGCRGLYRQ----KTNLVCRFKSTANTFLRLAPLKLEEISLDPFIAMYHEVLYDSEIH 346
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
+ S V G T DT + ++ + + +I RI DMT
Sbjct: 347 ELKGQSMNMV-NGYASERNGTEIRDTVVRYDWWSNISLVRE-----RINQRIIDMTEFNF 400
Query: 123 GREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGAT 175
++E+ LQI NYG+G ++ H D TP L RLAS +FY ++V GGAT
Sbjct: 401 SKDEK----LQIANYGVGTYFQPHFDYSSDGFETPNITTLGDRLASILFYASEVPQGGAT 456
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+FP +N+TVFP+KGS ++W+N H + D R HS CPV G++W
Sbjct: 457 VFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSKHSVCPVINGDRW 501
>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
niloticus]
Length = 536
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 80/232 (34%), Positives = 128/232 (55%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
E Y C+G + + E +S L C +++ N L + P+K E+ + P +V+ D + D
Sbjct: 289 ESYEALCRGEGIQMTEARRSRLFCRYHDGKRNPHLLLKPVKEEDEWDSPHIVRYLDLLSD 348
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I EL+K ++ R V + G + R+SK +L E + P + ++ RI+
Sbjct: 349 EEIEKIKELAKPRLARATVRDPKTGVLTTANYRVSKSAWLEGE---EDPVIDRVNQRIEA 405
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + +DE R+A+F+ Y++D
Sbjct: 406 ITGLTVETAEL----LQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSD 461
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP ++P KG++VFWYN + DYR H+ CPV +G+KW
Sbjct: 462 VEAGGATVFPDFGAAIWPRKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 513
>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
precursor (4-PH alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 527
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 85/233 (36%), Positives = 123/233 (52%), Gaps = 19/233 (8%)
Query: 1 EIYPLACQGNLSVPEDIK-SNLKCFYES-YNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
E + C+G ++ + + L+C+ + N L I P+KVEEL P +V+ HD + D
Sbjct: 271 ETFFKLCRGEQTLTKKKQHKKLRCYLSTNMGNPKLLIRPVKVEELSKSPDIVQFHDVLSD 330
Query: 59 SEINRIIELSKGKVERGKVVNYGDTIYVDT--RLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
+ IN I +L+K ++ R DT R++K+ +L + D P + KI RI D
Sbjct: 331 TVINEIKKLAKPQLFRAIHAGSDDTDLQKAPYRITKLAWLLDD---DGPEVAKITERISD 387
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL--------WRLASFMFYLTD 168
+T L + E +Q+ NYG+GG Y H D DE R+A+F+ YL+D
Sbjct: 388 ITGLTLNTSEE----IQVANYGVGGEYPPHFDIPTTDEERDDLKSQDGERIATFLIYLSD 443
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
VE+GG T F + ++ P KGSAVFWYN + D R YH CPVA GNKW
Sbjct: 444 VEVGGRTAFVNAGVSAKPIKGSAVFWYNVFPSGEPDLRTYHGACPVAFGNKWA 496
>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
boliviensis boliviensis]
gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
boliviensis boliviensis]
Length = 535
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDAFKHLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_f
[Homo sapiens]
Length = 567
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 320 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 379
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 380 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 436
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++D
Sbjct: 437 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 492
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 493 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 544
>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
leucogenys]
Length = 537
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 290 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 349
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 350 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 406
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++D
Sbjct: 407 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 462
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 463 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514
>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
Length = 535
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
abelii]
Length = 535
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
leucogenys]
Length = 558
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 311 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 370
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 371 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 427
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++D
Sbjct: 428 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 483
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 484 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 535
>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
Length = 581
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 89/236 (37%), Positives = 125/236 (52%), Gaps = 25/236 (10%)
Query: 3 YPLACQGNLS--VPEDIKSN--LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
Y C+GN++ +D+K N L C Y+ Y N L PL VE L L P +V H+ + +
Sbjct: 305 YKELCRGNVNQKTGDDVKLNNQLNC-YQDYRNPRLLFSPLNVEVLSLQPYIVIYHNLLTN 363
Query: 59 SEINRIIELSKGKVERGKVVNYGDTIYVDT---RLSKVYFLYPEIFGDHPFLYKIQTRIQ 115
SE+ + L+ ++R VV D Y + R+SK +L E DHP + +I T I
Sbjct: 364 SEVVLLKTLASPLLKRAVVVGKPDKEYGEETTYRISKTAWLDKE---DHPAVKRITTLIG 420
Query: 116 DMTNLVIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLW--------RLASFMFY 165
D +IG PLQI NYG+GGHY+ H D + E L R+A+ + Y
Sbjct: 421 D----IIGLTSETAEPLQIANYGIGGHYEPHLDFIESEDKEALSEYTSRIGNRIATVLIY 476
Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
L++VE GGAT+FP + V P +GSA FWYN H N + H+ CPV +G+KW
Sbjct: 477 LSNVEAGGATVFPKAGVRVEPRQGSAAFWYNMHRNGEGNKLSVHAACPVLIGSKWA 532
>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Danio rerio]
Length = 536
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y C+G + + +S L C Y + N N L + P+K E+ + PR+V+ H+ I DSE
Sbjct: 291 YERLCRGEGIKLTPRRQSRLFCRYSNNNRNPRLLLAPVKQEDEWDRPRIVRYHEIISDSE 350
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + E++K ++ R + N G R+SK +L +H + +I RI+D+T
Sbjct: 351 IETVKEMAKPRLRRATISNPITGVLETAPYRISKSAWLSG---YEHSTIERINQRIEDVT 407
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 408 GLEMDTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F + V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 464 AGGATVFTDVGAAVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513
>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 577
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 330 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 389
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 390 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 446
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++D
Sbjct: 447 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 502
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 503 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 554
>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 532
Score = 135 bits (341), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 81/229 (35%), Positives = 123/229 (53%), Gaps = 14/229 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y C+G + S L+C Y + F K+ P+K+EE L P VV + D + D +
Sbjct: 280 ENYKRLCRGEQLRTPKMDSQLRCRYYTGETGFFKLQPIKLEEFNLKPYVVVLRDLLQDRD 339
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+N +I +K ++E+ K + D +R S +L E D P ++ +Q + L
Sbjct: 340 LNDMIAFAKPRLEQSKTLCAADKDGPPSRTSSNTWLNDE---DAPVAARVNQYLQSLLGL 396
Query: 121 --VIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLW--RLASFMFYLTDVEL 171
+ R+E K Q+ NYG+GGHY H D TP + R+A+ M Y++DVE
Sbjct: 397 GTLFSRDEAEK--YQLANYGIGGHYVPHHDYFEEFQTPSKGNRFGNRVATLMIYMSDVEE 454
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FPSL + V P+KG AVFW+N ++ + +H+GCPV G+KW
Sbjct: 455 GGATVFPSLGVRVSPKKGDAVFWWNIMSSWEGEMLTWHAGCPVLYGSKW 503
>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 548
Score = 135 bits (341), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 81/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + +S L C +Y++ +N + P+K ++ + P +V+ D I D E
Sbjct: 303 YEMLCRGEGIKMTPRRQSRLFCRYYDNNHNPKYVLSPVKQQDEWDRPYIVRYIDIISDKE 362
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N G R+SK +L +HP + I RI+D+T
Sbjct: 363 IETVKKLAKPRLRRATISNPITGVLETASYRISKSAWL---TGYEHPVIEIINQRIEDLT 419
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 420 GLEMDTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVA 475
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + V+P+KG+AVFWYN AN DY H+ CPV +GNKW
Sbjct: 476 AGGATVFPDVGAAVWPQKGTAVFWYNLFANGEGDYSTRHAACPVLVGNKW 525
>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
Length = 537
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 125/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 290 DVYESLCRGEGVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 349
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 350 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 406
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW-------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + DE + R+A+F+ Y++D
Sbjct: 407 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSDDEDAFKRLGTGNRVATFLNYMSD 462
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 463 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514
>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 3 [Oryctolagus
cuniculus]
Length = 535
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHDGNGAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + +I R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARINRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + +E R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRNNERDAFKRLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
Length = 249
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 80/227 (35%), Positives = 122/227 (53%), Gaps = 18/227 (7%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
+ C+ + I S L+C Y + N N L I PLK EE + PR++ D +YD+EI
Sbjct: 1 MLCRNENLMSIQISSQLRCRYTNNNRNPLLLIAPLKEEEAFFSPRIILYRDVLYDNEIEV 60
Query: 64 IIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
I +++ +++R V NY G+ + D R+SK +L + + + R++ MT L
Sbjct: 61 IKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEH---EDVVVANVAKRVEVMTGLT 117
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDE-------GLW-RLASFMFYLTDVELGG 173
E LQ+ NYG+GGHYD H D +E G R+A+ +FY++DV GG
Sbjct: 118 TETAEE----LQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGG 173
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
AT+FP L + + P KG+A W+N + + D R H+ CPV G+KW
Sbjct: 174 ATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKW 220
>gi|198449641|ref|XP_002136935.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
gi|198130697|gb|EDY67493.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/208 (36%), Positives = 118/208 (56%), Gaps = 16/208 (7%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y + FL++ PLK+E L LDP VV HD + D E++ + +++ + R
Sbjct: 291 SRLYCLYNTTATAFLRLAPLKMELLSLDPYVVLYHDVLADREMSLLKSMAQKDLVRASTY 350
Query: 79 NYGDTIYVD--TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
+ D + + R +K +L P H + ++ +DMTNL + R E + Q+ N
Sbjct: 351 DVMDKKHSEDPNRTTKARWLDPS----HSLIRRMGILTEDMTNLDLERLEDF----QVLN 402
Query: 137 YGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
YG+GGH D+H D + P E R+A+ +FYL+DV LGGAT+FP L+L+VFP++G+ +
Sbjct: 403 YGIGGHDDIHPDYYEGSNP--ELPDRVATLLFYLSDVPLGGATVFPLLDLSVFPKRGAVL 460
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
WYN + HS CPV +G++W
Sbjct: 461 MWYNLDHKGQGIEKTVHSACPVVVGSRW 488
>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Cricetulus griseus]
Length = 535
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 121/226 (53%), Gaps = 19/226 (8%)
Query: 7 CQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
C+G + + + L C Y N L I P K E+ + P +V+ +D + D EI RI
Sbjct: 294 CRGEGVKLTPQRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERI 353
Query: 65 IELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
E++K K+ R V + G R+SK +L + D P + ++ R+Q +T L +
Sbjct: 354 KEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQHITGLTV 410
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELGGA 174
E LQ+ NYG+GG Y+ H D + DE R+A+F+ Y++DVE GGA
Sbjct: 411 KTAEL----LQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGA 466
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 467 TVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
Length = 511
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 82/237 (34%), Positives = 129/237 (54%), Gaps = 26/237 (10%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y C+G L + +S L C +Y++ + IGP+K E+ + PR+V+ HD + + E
Sbjct: 261 YEQLCRGEGLRMTPQRQSGLFCRYYDNGRHPKYVIGPVKQEDEWDHPRIVRYHDVLSNRE 320
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ ++ EL++ ++ R V + G R+SK +L +HP + +I RI+D+T
Sbjct: 321 MEKVKELARPRLRRATVHDPRTGQLTTAPYRVSKSAWLGA---FEHPIVDQINQRIEDIT 377
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFY----- 165
L + E LQ+ NYG+GG Y+ H D +DE R+A+++ Y
Sbjct: 378 GLDVSTAE----DLQVANYGVGGQYEPHFDFGQKDEPDAFEELGTGNRIATWLLYVSAAV 433
Query: 166 --LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
++DV+ GGAT+F + +V P+KGSAVFWYN + DYR H+ CPV LGNKW
Sbjct: 434 LRMSDVQAGGATVFTDIGASVLPQKGSAVFWYNLRPSGDGDYRTRHAACPVLLGNKW 490
>gi|194904100|ref|XP_001981000.1| GG23922 [Drosophila erecta]
gi|190652703|gb|EDV49958.1| GG23922 [Drosophila erecta]
Length = 490
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 116/227 (51%), Gaps = 34/227 (14%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
LA N + S L C Y S F +I PLK+EEL DP +V HD IYDSEI+ +
Sbjct: 258 LATVQNCTAVVQKPSRLHCRYNSSTTPFTRIAPLKMEELSSDPYMVVYHDVIYDSEIDLM 317
Query: 65 IELSKGKV------ERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ S + ++ +V D+ VD++ + R+ DMT
Sbjct: 318 LNASNFSLSLTNSGQKSEVRASKDSYIVDSK-------------------TLNDRVTDMT 358
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGG 173
L + + P + NYG+GGHY LH D R++ R+A+ +FYL +V GG
Sbjct: 359 GLSMEMSD----PFSMINYGIGGHYMLHYDYHEYSNMTREKYGDRIATVLFYLGEVHSGG 414
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
ATIFP +N+TV P+KGSAVFWYN H + + HS CPV G+K+
Sbjct: 415 ATIFPRINITVTPKKGSAVFWYNLHNSGAMHSETLHSACPVISGSKY 461
>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
Length = 509
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 80/206 (38%), Positives = 112/206 (54%), Gaps = 12/206 (5%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y + + FL++ PLK+E L LDP VV HD + D +I I L+KG + R V
Sbjct: 291 AKLHCLYNTTASHFLRLAPLKMELLSLDPYVVLFHDVVSDQDILSIRNLAKGGLARAVTV 350
Query: 79 NY-GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
G+ R +K +L + + ++ QDMTN + R P Q+ NY
Sbjct: 351 TQDGNDKEDPARTTKGTWL----VENSKLIQRLSQLSQDMTNFDV----RDADPFQVLNY 402
Query: 138 GLGGHYDLHCDATPRDE-GLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
G+GG Y H D E G + R+A+ +FYL+DV GGAT FP L L+VFPEKG+A+ W
Sbjct: 403 GIGGFYGTHFDFLEDTEMGHFSDRIATAVFYLSDVPQGGATTFPDLGLSVFPEKGAALLW 462
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
YN + D R HS CP +G++W
Sbjct: 463 YNLDHKGVGDNRTAHSACPTIVGSRW 488
>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
carolinensis]
Length = 554
Score = 135 bits (339), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 126/232 (54%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
EIY C+G + + + L C Y + N N L I P K E+ + P +V+ ++ + D
Sbjct: 307 EIYEALCRGEGVKMTPRRQKRLFCRYHNGNQNPHLLIAPFKEEDEWDSPHIVRYYNVLSD 366
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I EL+K K+ R V + G + R+SK +L E D + K+ R++
Sbjct: 367 EEIEKIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEE---DDLVVAKVNQRMEH 423
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + ++E R+A+F+ Y++D
Sbjct: 424 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKEEPDAFKRLGTGNRVATFLNYMSD 479
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 480 VEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 531
>gi|431892682|gb|ELK03115.1| Prolyl 4-hydroxylase subunit alpha-2 [Pteropus alecto]
Length = 629
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 86/252 (34%), Positives = 128/252 (50%), Gaps = 39/252 (15%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 294 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 353
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EINRI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 354 EEINRIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 410
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
+T L + E LQ+ NYG+GG Y+ H D + P D GL
Sbjct: 411 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDEQD 466
Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
R+A+F+ Y++DVE GGAT+FP L ++P+KG+AVFWYN + DYR
Sbjct: 467 VFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 526
Query: 209 HSGCPVALGNKW 220
H+ CPV +G KW
Sbjct: 527 HAACPVLVGCKW 538
>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 555
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 130/230 (56%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + +S L C +Y++ +N + P+K ++ + P +V+ D I ++E
Sbjct: 305 YEMLCRGEGVRMTSRRQSRLFCRYYDNKHNPRFVLAPVKQQDEWDRPYIVRYIDIISEAE 364
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+++I +L+K ++ R + N G R+SK +L + P + KI RI+D+T
Sbjct: 365 MDKIKQLAKPRLRRATISNPVTGVLETAPYRISKSAWL---TAYEDPVVEKINQRIEDLT 421
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 422 GLEMDTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 477
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + +V P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 478 AGGATVFPDVGASVGPQKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 527
>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide precursor [Salmo
salar]
gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
Length = 545
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/230 (35%), Positives = 125/230 (54%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYNNTFLKI-GPLKVEELYLDPRVVKIHDAIYDSE 60
Y C+G + + +S + C Y N L + GP+K E+ + PR+++ HD + +SE
Sbjct: 300 YEQLCRGEGIKMTPRRQSRMFCRYSDNNRHPLYVLGPVKQEDEWDRPRIIRYHDVLSNSE 359
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I ++ EL+K ++ R + N G R+SK +L + P + KI RI+D+T
Sbjct: 360 IEKVKELAKPRLRRATISNPITGVLETAHYRISKSAWL---TAYEDPVVDKINQRIEDIT 416
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++ Y++DV
Sbjct: 417 GLNVKTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLIYMSDVP 472
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F + V+P+KGSAVFWYN + DY H+ CPV +GNKW
Sbjct: 473 SGGATVFTDVGAAVWPKKGSAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 522
>gi|20269814|gb|AAM18062.1|AF495540_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE2
[Drosophila melanogaster]
gi|19528175|gb|AAL90202.1| AT27756p [Drosophila melanogaster]
Length = 542
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 84/227 (37%), Positives = 113/227 (49%), Gaps = 15/227 (6%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+ P C G VP ++ SNL C Y + FL++ P+K E L +DP VV +HD I E
Sbjct: 288 VLPPCCSGRCQVPRNL-SNLYCVYNHVTSPFLQLAPIKTEILSIDPFVVLLHDMISQKES 346
Query: 62 NRIIELSKGKVERGKVVN---YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I SK + + D VDT + Y F D KI R+ D T
Sbjct: 347 TLIRTSSKEHMLPSATTDPDASDDETQVDTYRTSKSVWYSSDFNDTT--KKITERLGDAT 404
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
L + E Y Q+ NYGLGG ++ H D ++ + R+A+ +FYL +V GG
Sbjct: 405 GLDMNSTEFY----QVINYGLGGFFETHLDMLLSEKNRFNGTSDRIATTLFYLNEVRQGG 460
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP LNLTVFP+ GSA+FWYN H+GCPV +G+KW
Sbjct: 461 GTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 507
>gi|78706702|ref|NP_001027154.1| CG18749 [Drosophila melanogaster]
gi|21429852|gb|AAM50604.1| GH05783p [Drosophila melanogaster]
gi|23175900|gb|AAN14309.1| CG18749 [Drosophila melanogaster]
gi|220956638|gb|ACL90862.1| CG18749-PB [synthetic construct]
Length = 491
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 82/224 (36%), Positives = 118/224 (52%), Gaps = 34/224 (15%)
Query: 8 QGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIEL 67
Q +V + L C Y + F +I PLK+EEL LDP +V HD IYD+EI+ ++
Sbjct: 262 QNCTAVVQKPSKKLHCRYNTSTTPFTRIAPLKMEELGLDPYMVVFHDVIYDTEIDGMLNS 321
Query: 68 SK-GKVE-----RGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
S G E + +V D+ VD + + R+ DMT L
Sbjct: 322 SDFGLSESVSGLKSEVRTSKDSHIVDAK-------------------TLNERVTDMTGLS 362
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGGATI 176
+ + P + NYGLGGH+ LH D T R + R+A+ +FYL +V+ GGAT+
Sbjct: 363 MEMSD----PFSLINYGLGGHFILHHDFHEYTNTTRLKQGDRIATVLFYLREVDSGGATV 418
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP LN+TV P+KGSAVFWYN H + ++ + H+ CPV G+K+
Sbjct: 419 FPMLNITVMPKKGSAVFWYNLHNSGAVNSKTLHTACPVISGSKY 462
>gi|211938649|gb|ACJ13221.1| FI08532p [Drosophila melanogaster]
Length = 543
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 84/227 (37%), Positives = 113/227 (49%), Gaps = 15/227 (6%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+ P C G VP ++ SNL C Y + FL++ P+K E L +DP VV +HD I E
Sbjct: 289 VLPPCCSGRCQVPRNL-SNLYCVYNHVTSPFLQLAPIKTEILSIDPFVVLLHDMISQKES 347
Query: 62 NRIIELSKGKVERGKVVN---YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I SK + + D VDT + Y F D KI R+ D T
Sbjct: 348 TLIRTSSKEHMLPSATTDPDASDDETQVDTYRTSKSVWYSSDFNDTT--KKITERLGDAT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
L + E Y Q+ NYGLGG ++ H D ++ + R+A+ +FYL +V GG
Sbjct: 406 GLDMNSTEFY----QVINYGLGGFFETHLDMLLSEKNRFNGTSDRIATTLFYLNEVRQGG 461
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP LNLTVFP+ GSA+FWYN H+GCPV +G+KW
Sbjct: 462 GTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 508
>gi|24651430|ref|NP_733378.1| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
gi|23172699|gb|AAF57061.2| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
Length = 542
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 84/227 (37%), Positives = 113/227 (49%), Gaps = 15/227 (6%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+ P C G VP ++ SNL C Y + FL++ P+K E L +DP VV +HD I E
Sbjct: 288 VLPPCCSGRCQVPRNL-SNLYCVYNHVTSPFLQLAPIKTEILSIDPFVVLLHDMISQKES 346
Query: 62 NRIIELSKGKVERGKVVN---YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I SK + + D VDT + Y F D KI R+ D T
Sbjct: 347 TLIRTSSKEHMLPSATTDPDASDDETQVDTYRTSKSVWYSSDFNDTT--KKITERLGDAT 404
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
L + E Y Q+ NYGLGG ++ H D ++ + R+A+ +FYL +V GG
Sbjct: 405 GLDMNSTEFY----QVINYGLGGFFETHLDMLLSEKNRFNGTSDRIATTLFYLNEVRQGG 460
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP LNLTVFP+ GSA+FWYN H+GCPV +G+KW
Sbjct: 461 GTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 507
>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Loxodonta africana]
Length = 536
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/232 (34%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 289 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 348
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI +++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 349 EEIERIKQIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAQVNRRMQH 405
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + E R+A+F+ Y++D
Sbjct: 406 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSHEQDAFKRLGTGNRVATFLNYMSD 461
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 462 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 513
>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
Length = 522
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 85/252 (33%), Positives = 127/252 (50%), Gaps = 41/252 (16%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y C+G L + + +L C Y + N + F IGP+K E+ + PR+++ H+ I + E
Sbjct: 255 YEKLCRGEGLKMTPRRQKHLFCRYFNGNRHPFYTIGPVKQEDEWDRPRIIRYHEIITEQE 314
Query: 61 INRIIELSKGKVERGKVVN------------------------YGDTIYVDTRLSKVYFL 96
I +I ELSK ++ R + N G R+SK +L
Sbjct: 315 IEKIKELSKPRLRRATISNPITGVLETAHYRISKRRATVHDPQTGKLTTAQYRVSKSAWL 374
Query: 97 YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL 156
+HP + +I RI+D+T L + E LQ+ NYG+GG Y+ H D +DE
Sbjct: 375 AAY---EHPVVDRINQRIEDITGLNVKTAEE----LQVANYGVGGQYEPHFDFGRKDEPD 427
Query: 157 W--------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
R+A+++FY++DV GGAT+FP + V P KG+AVFWYN + DY
Sbjct: 428 AFKELGTGNRIATWLFYMSDVAAGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTR 487
Query: 209 HSGCPVALGNKW 220
H+ CPV +GNKW
Sbjct: 488 HAACPVLVGNKW 499
>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/208 (36%), Positives = 111/208 (53%), Gaps = 9/208 (4%)
Query: 15 EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
E L C Y FL++ PLK+E L P VV HD + DSEI I+E+++ ++ R
Sbjct: 287 EQSPKALHCCYNFTTTPFLRLAPLKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMAR 346
Query: 75 GKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
V + TR + +L + +I R++DM+ L + ER +Q+
Sbjct: 347 TSTVAQPNRTSSPTRTAMGAWLK---RSSNALTRRIARRVRDMSGLQLEGSER----MQV 399
Query: 135 NNYGLGGHYDLHCDATPRDEGLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
NYG+GGHY H D + + RLA+ +FYLTDVE GGAT+F V P +G+A+
Sbjct: 400 INYGIGGHYVPHKDWFTQHPEVMGNRLATVLFYLTDVEQGGATMFNKAEHKVLPRRGTAL 459
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FWYN H + D+ H+ CP+ +G+KW
Sbjct: 460 FWYNLHTDGEGDWSTTHAACPIIVGSKW 487
>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
Length = 535
Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/208 (36%), Positives = 111/208 (53%), Gaps = 9/208 (4%)
Query: 15 EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
E L C Y FL++ PLK+E L P VV HD + DSEI I+E+++ ++ R
Sbjct: 312 EQSPKALHCCYNFTTTPFLRLAPLKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMAR 371
Query: 75 GKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
V + TR + +L + +I R++DM+ L + ER +Q+
Sbjct: 372 TSTVAQPNRTSSPTRTALGAWLK---RSSNALTRRIARRVRDMSGLQLEGSER----MQV 424
Query: 135 NNYGLGGHYDLHCDATPRDEGLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
NYG+GGHY H D + + RLA+ +FYLTDVE GGAT+F V P +G+A+
Sbjct: 425 INYGIGGHYVPHKDWFTQHPEVMGNRLATVLFYLTDVEQGGATMFNKAEHKVLPRRGTAL 484
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FWYN H + D+ H+ CP+ +G+KW
Sbjct: 485 FWYNLHTDGEGDWSTTHAACPIIVGSKW 512
>gi|195438148|ref|XP_002066999.1| GK24258 [Drosophila willistoni]
gi|194163084|gb|EDW77985.1| GK24258 [Drosophila willistoni]
Length = 217
Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 82/224 (36%), Positives = 120/224 (53%), Gaps = 16/224 (7%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
L C+G+L P + NL C Y S FL++ P K EE+ LDP ++ H+AIYD+EI+
Sbjct: 3 LGCRGHLKAPSN--RNLFCSYNSTTTPFLRLAPFKTEEISLDPFILLFHNAIYDNEISYF 60
Query: 65 IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
++ + + NY T R+ +V E GD + R++D++ L G
Sbjct: 61 TKVKRKDMREAHTDNY-TTPNEQYRIMQVKVY--EGIGDK-MDKTLLERVKDISGLSAGN 116
Query: 125 EERYKGPLQINNYGLGGHYDLHCD-----ATPR-DEGLWRLASFMFYLTDVELGGATIFP 178
K L NYGLG ++ H D +P +E RLA+ +FYL+DV GG TIFP
Sbjct: 117 ----KSELAAGNYGLGSYFPEHSDYRDIKVSPELNETGDRLATILFYLSDVAQGGHTIFP 172
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
N+TV P+KGSA+FW+N H + + + H CP+ GN+W K
Sbjct: 173 LANVTVQPKKGSALFWFNLHNDGEPNIKSLHGVCPIIEGNRWSK 216
>gi|195505216|ref|XP_002099408.1| GE23378 [Drosophila yakuba]
gi|194185509|gb|EDW99120.1| GE23378 [Drosophila yakuba]
Length = 546
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 82/231 (35%), Positives = 115/231 (49%), Gaps = 22/231 (9%)
Query: 4 PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
P C G P +K L C Y FL++ P+K E L +DP +V +HD + E
Sbjct: 289 PPCCSGCCEGPRKLK-RLYCVYNGVTAPFLRLAPIKTEILSIDPFIVLLHDMVSVEEGAL 347
Query: 64 IIELSKGKVERGKVVNYGDTIYVDT--------RLSKVYFLYPEIFGDHPFLYKIQTRIQ 115
+ SK + + D+ R SK +L + + K+ R+
Sbjct: 348 LRTFSKNMISPSETAELSDSEEKSIFEFEVGSFRTSKSVWLDNDA---NEATLKLTQRLG 404
Query: 116 DMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDV 169
D T L I E P Q+ NYG+GG ++ H D + +DE + RLA+ +FYL DV
Sbjct: 405 DATGLDISHSE----PFQVINYGIGGIFESHFDTSLQDENRFLDGYMDRLATTLFYLNDV 460
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT FP LN+TVFP+ G+A+FWYN LL R H+GCPV +G+KW
Sbjct: 461 PQGGATHFPGLNITVFPKFGTALFWYNLDTKGLLRLRTMHTGCPVIVGSKW 511
>gi|432109537|gb|ELK33711.1| Prolyl 4-hydroxylase subunit alpha-2 [Myotis davidii]
Length = 555
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 85/252 (33%), Positives = 127/252 (50%), Gaps = 39/252 (15%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPKRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIQRIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
+T L + E LQ+ NYG+GG Y+ H D + P D GL
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDEQD 460
Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
R+A+F+ Y++DVE GGAT+FP L ++P+KG+AVFWYN + DYR
Sbjct: 461 VFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 520
Query: 209 HSGCPVALGNKW 220
H+ CPV +G KW
Sbjct: 521 HAACPVLVGCKW 532
>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
garnettii]
Length = 540
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 123/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
E+Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 293 EVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 352
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 353 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNHRMQH 409
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + E R+A+F+ Y++D
Sbjct: 410 ITGLSVKTAEL----LQVANYGVGGQYEPHFDFSRNHERDAFKRLGTGNRVATFLNYMSD 465
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 466 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 517
>gi|351706369|gb|EHB09288.1| Prolyl 4-hydroxylase subunit alpha-2 [Heterocephalus glaber]
Length = 535
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
E+Y C+G + + + L C Y N L I P K E+ + P +V+ ++ + D
Sbjct: 288 EVYESLCRGEGVKLTPQRQKRLFCRYHHGNRAPELLIAPFKEEDEWDSPHIVRYYNVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI+RI EL+K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIDRIKELAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQY 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + E R+A+F+ Y++D
Sbjct: 405 ITGLTVQTAEL----LQVANYGMGGQYEPHFDFSRNHERDAFKRLGTGNRVATFLNYMSD 460
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512
>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 594
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 127/230 (55%), Gaps = 19/230 (8%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y C+G + + +S L C +Y++ + IGP+K E+ + P +V+ H+ + + +
Sbjct: 349 YEQLCRGQGIKLTPRRQSRLFCRYYDNNRHPRYVIGPVKQEDEWDSPHIVRYHNIVSEKD 408
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ ++ EL+K ++ R + N G R+SK +L +HP + KI I+D+T
Sbjct: 409 MEKVKELAKPRLRRATISNPVTGVLETAHYRISKSAWLGAY---EHPVVDKINQLIEDVT 465
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYGLGG Y+ H D +DE R+A+++ Y+TDV+
Sbjct: 466 GLNVKTAE----DLQVANYGLGGQYEPHFDFGRKDEPDAFEELGTGNRIATWLLYMTDVQ 521
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F + V P+KG+AVFWYN + + DYR H+ CPV LGNKW
Sbjct: 522 AGGATVFTDIGAAVKPKKGTAVFWYNLYPSGEGDYRTRHAACPVLLGNKW 571
>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
Length = 235
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 71/200 (35%), Positives = 115/200 (57%), Gaps = 15/200 (7%)
Query: 29 NNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYV 86
N+ FL++ P+++E LY +P ++ +D + D EI+ I +++ + R V + G+ +
Sbjct: 4 NHPFLRLAPVRMEYLYRNPDIIVFNDVLSDYEIDYIKRIAQPRFRRATVHDPATGELVPA 63
Query: 87 DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH 146
R+SK +L E + + ++ R+ D+T L + E LQ+ NYG+GGHYD H
Sbjct: 64 HYRISKSAWLKDE---ESAVVARVSRRVADITGLSMTTAEE----LQVVNYGIGGHYDPH 116
Query: 147 CDATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 200
D ++E + R+A+ +FY++DV GGAT+F L L+VFP +GSAVFW N H +
Sbjct: 117 FDFARKEENAFEKFNGNRIATVLFYMSDVAQGGATVFTELGLSVFPRRGSAVFWLNLHPS 176
Query: 201 TLLDYRMYHSGCPVALGNKW 220
D H+ CPV G+KW
Sbjct: 177 GEGDLATRHAACPVLRGSKW 196
>gi|198477150|ref|XP_002136737.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
gi|198145042|gb|EDY71754.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 116/206 (56%), Gaps = 12/206 (5%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y + FL++ PLK+E L LDP VV HD + D E++ + +++ + R
Sbjct: 291 SRLYCLYNTTATAFLRLAPLKMELLSLDPYVVLYHDVLADREMSLLKLMAQRDLVRAVTY 350
Query: 79 NYGDTIYVD--TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
N + + + R +K +L P H + ++ +DM+NL + R E + Q+ N
Sbjct: 351 NATEKKHSEDPNRTTKAGWLDPS----HNLIRRMGILTEDMSNLDLERSEDF----QVLN 402
Query: 137 YGLGGHYDLHCD--ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
YG+GGHY +H D E R+A+ +FYL+DV LGGAT+FP L+L+VFP+KG+ + W
Sbjct: 403 YGIGGHYAVHPDFFEGSNPELPDRVATLLFYLSDVPLGGATVFPLLDLSVFPKKGAVLMW 462
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
YN + HS CPV +G++W
Sbjct: 463 YNLDHKGQGMEKTIHSACPVVVGSRW 488
>gi|66770643|gb|AAY54633.1| IP12395p [Drosophila melanogaster]
Length = 538
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 115/224 (51%), Gaps = 15/224 (6%)
Query: 4 PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
P C G P + + L C Y FL++ P+K E L +DP V+ +HD + E
Sbjct: 288 PPCCSGRCEGPRKL-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGAL 346
Query: 64 IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
I SK ++ + VN + + R SK + + + K+ R+ + T L +
Sbjct: 347 IRSSSKNQILPSETVNAANEFEIAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 403
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
E P Q+ NYG+GG ++ H D + DE + RLA+ +FYL DV GGAT
Sbjct: 404 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 459
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP LN+TVFP+ G+ + WYN H +L R H+GCPV +G+KW
Sbjct: 460 FPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 503
>gi|449673565|ref|XP_002167120.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 571
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 78/228 (34%), Positives = 121/228 (53%), Gaps = 18/228 (7%)
Query: 3 YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y C+G + + + ++ +KC+Y S + LK+ P KVE +++DP + + + I + +I
Sbjct: 329 YEQLCRGEVRPLTKKEQAKMKCWY-SAKDPVLKLKPQKVERVWVDPEIFILRNIISEKQI 387
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
N I E + + R + + G + D R+SK +L + FL ++ R Q T
Sbjct: 388 NLIKEAASPMLRRATIQDPITGKLRHADYRISKSAWLSTNKYN---FLQALEARTQATTG 444
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVELG 172
L + Y LQ+ NYGLGGHY+ H D + +E + R+A+ +FYL+DVE G
Sbjct: 445 LDLS----YAEQLQVANYGLGGHYEPHFDHSRENEDRFTDLGMGNRIATVLFYLSDVEAG 500
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT+F VFP KG AVFW+N N + H+ CPV +G KW
Sbjct: 501 GATVFTVGKTAVFPSKGDAVFWFNLKRNGKGNPNTRHAACPVLVGQKW 548
>gi|116008130|ref|NP_001036777.1| CG31524, isoform B [Drosophila melanogaster]
gi|113194860|gb|ABI31221.1| CG31524, isoform B [Drosophila melanogaster]
Length = 535
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 115/224 (51%), Gaps = 15/224 (6%)
Query: 4 PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
P C G P + + L C Y FL++ P+K E L +DP V+ +HD + E
Sbjct: 285 PPCCSGRCEGPRKL-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGAL 343
Query: 64 IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
I SK ++ + VN + + R SK + + + K+ R+ + T L +
Sbjct: 344 IRSSSKNQILPSETVNAANEFEIAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 400
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
E P Q+ NYG+GG ++ H D + DE + RLA+ +FYL DV GGAT
Sbjct: 401 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 456
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP LN+TVFP+ G+ + WYN H +L R H+GCPV +G+KW
Sbjct: 457 FPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 500
>gi|116008537|ref|NP_733379.2| CG31524, isoform A [Drosophila melanogaster]
gi|113194861|gb|AAN14239.2| CG31524, isoform A [Drosophila melanogaster]
Length = 536
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 115/224 (51%), Gaps = 15/224 (6%)
Query: 4 PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
P C G P + + L C Y FL++ P+K E L +DP V+ +HD + E
Sbjct: 286 PPCCSGRCEGPRKL-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGAL 344
Query: 64 IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
I SK ++ + VN + + R SK + + + K+ R+ + T L +
Sbjct: 345 IRSSSKNQILPSETVNAANEFEIAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 401
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
E P Q+ NYG+GG ++ H D + DE + RLA+ +FYL DV GGAT
Sbjct: 402 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 457
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP LN+TVFP+ G+ + WYN H +L R H+GCPV +G+KW
Sbjct: 458 FPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 501
>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
adhaerens]
Length = 495
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 81/207 (39%), Positives = 118/207 (57%), Gaps = 15/207 (7%)
Query: 21 LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
LKC+Y S + L + P+ VEE+ LDP +V +D I D +I I ++S K K N+
Sbjct: 274 LKCYY-SNQSPLLYLAPIPVEEISLDPFIVIYYDIINDHQIETIKKISPSK--SNKSPNH 330
Query: 81 GDTIY-VDTRLSKVYFLYPEIFGDH---PFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
+ + ++V + + P + KI Q++T+L + Y LQ+ N
Sbjct: 331 AMLCSGIKSEATQVSIFCCSTWLEDAYDPVVEKISRLTQELTHLDVN----YAEDLQVAN 386
Query: 137 YGLGGHYDLHCDAT---PRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
YG+GGHY H D+T P D L RLA+ MFYL++VE+GGATIFP L + V P+KGSA+F
Sbjct: 387 YGIGGHYVPHYDSTIIAPEDP-LQRLATMMFYLSNVEIGGATIFPRLGVAVRPQKGSALF 445
Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKW 220
W N N L + + H+ CPV +G+KW
Sbjct: 446 WINLKRNGLTNRQTLHAACPVVIGSKW 472
>gi|261245137|gb|ACX54875.1| FI12021p [Drosophila melanogaster]
Length = 538
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 115/224 (51%), Gaps = 15/224 (6%)
Query: 4 PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
P C G P + + L C Y FL++ P+K E L +DP V+ +HD + E
Sbjct: 288 PPCCSGRCEGPRKL-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGAL 346
Query: 64 IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
I SK ++ + VN + + R SK + + + K+ R+ + T L +
Sbjct: 347 IRSSSKNQILPSETVNAANEFEIAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 403
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
E P Q+ NYG+GG ++ H D + DE + RLA+ +FYL DV GGAT
Sbjct: 404 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 459
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP LN+TVFP+ G+ + WYN H +L R H+GCPV +G+KW
Sbjct: 460 FPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 503
>gi|66771513|gb|AAY55068.1| IP12095p [Drosophila melanogaster]
Length = 538
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 115/224 (51%), Gaps = 15/224 (6%)
Query: 4 PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
P C G P + + L C Y FL++ P+K E L +DP V+ +HD + E
Sbjct: 288 PPCCSGRCEGPRKL-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGAL 346
Query: 64 IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
I SK ++ + VN + + R SK + + + K+ R+ + T L +
Sbjct: 347 IRSSSKNQILPSETVNAANEFEIAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 403
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
E P Q+ NYG+GG ++ H D + DE + RLA+ +FYL DV GGAT
Sbjct: 404 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 459
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP LN+TVFP+ G+ + WYN H +L R H+GCPV +G+KW
Sbjct: 460 FPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 503
>gi|195390805|ref|XP_002054058.1| GJ23004 [Drosophila virilis]
gi|194152144|gb|EDW67578.1| GJ23004 [Drosophila virilis]
Length = 446
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 83/216 (38%), Positives = 121/216 (56%), Gaps = 17/216 (7%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C +L P S+L C Y ++ FL+I PLK+EEL +DP VV H+ IYDSEI
Sbjct: 222 CAASLQRP----SHLHCRYNNWTTPFLRIAPLKMEELSIDPFVVLYHNVIYDSEIEWF-- 275
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
L++ +++YG + R K F+ E + I+ R+ DM+ L + +
Sbjct: 276 LTQSFDYTPALLDYGG--FSAHRSGKNVFIELE---KGELVKTIEMRVTDMSGLSMEGSD 330
Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLW--RLASFMFYLTDVELGGATIFPSLNLTV 184
L + NYG+GGHY H D+ +E R+A+ +FYL+DVELGGAT FP LNLT+
Sbjct: 331 ----DLSLINYGIGGHYIPHHDSFSEEENKTEDRIATALFYLSDVELGGATTFPLLNLTI 386
Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
PEKG+AV W+N + + H+ CPV +G+K+
Sbjct: 387 SPEKGTAVLWHNLKDSGTPHPKTVHAACPVIVGSKY 422
>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
Length = 535
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 81/229 (35%), Positives = 116/229 (50%), Gaps = 17/229 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y C+G +P L C Y ++ F I PL+ E + DP + H + D +
Sbjct: 292 QTYEALCRGEDVIPIKDAHKLTCQYRVWHPMF-TINPLREETMNFDPWIAVYHQLMSDKD 350
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I+ I L+ ++ R VVN G+ + R+SK +L E +HP + KI R +T
Sbjct: 351 IDDIKALATPRLARATVVNSVTGELEFAKYRISKSGWLKDE---EHPTVAKISNRCSALT 407
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE----GLWR---LASFMFYLTDVEL 171
NL + E LQI NYG+GGHY+ H D + E WR + + +FYL+DVE
Sbjct: 408 NLSLSTVEE----LQIANYGIGGHYEPHFDYSRLAEVTSFDHWRGNRILTVIFYLSDVEA 463
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F + + PEKG+A WYN H + D H+ CPV GNKW
Sbjct: 464 GGGTVFMTAGTKLRPEKGAAAVWYNLHPDGTGDDETKHAACPVLTGNKW 512
>gi|442747091|gb|JAA65705.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 533
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 123/230 (53%), Gaps = 15/230 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y C+G + S L+C Y + F K+ P+K+EE L P VV + D + D +
Sbjct: 280 ENYKRLCRGEQLRTPKMDSQLRCRYYTGETGFFKLQPIKLEEYNLKPYVVVLRDLLQDRD 339
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+N +I +K ++E+ K + D R S +L + D P ++ +Q + L
Sbjct: 340 LNDMIAFAKPRLEQSKTLCAADKDGPPPRTSSNTWLDDD---DAPVAARVNQYLQSLLGL 396
Query: 121 --VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLW--RLASFMFYLTDVE 170
+ G++E K Q+ NYG+GGHY H D + + L+ R+A+ M Y++DVE
Sbjct: 397 GTLYGKDEAEK--YQLANYGIGGHYVPHHDYLEESLTSSKKHRLFGDRVATLMIYMSDVE 454
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FPSL + V P KG AVFW+N ++ D +H+GCPV G+KW
Sbjct: 455 EGGATVFPSLGVRVSPRKGDAVFWWNIKSSWEGDVLTWHAGCPVLYGSKW 504
>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
Length = 533
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 83/231 (35%), Positives = 124/231 (53%), Gaps = 19/231 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
E Y C+G + + + L C Y + N N L I P K E+ + P +V+ ++ + D
Sbjct: 288 ETYEALCRGEGVKLTPRRQKGLFCRYHNGNRNPHLIIAPFKEEDEWDSPHIVRYYEVLSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I EL+K K+ R V + G + R+SK +L E D + ++ R++
Sbjct: 348 EEIEKIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEE---DDLVVARVNHRMEQ 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR-------DEGLWRLASFMFYLTDV 169
+T L E LQ+ NYG+GG Y+ H D + R EG RLA+F+ Y++DV
Sbjct: 405 ITGLTTKTAEL----LQVANYGMGGQYEPHFDFSRRPFDITLKTEGN-RLATFLNYMSDV 459
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGAT+FP ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 460 EAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 510
>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 132 bits (333), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 122/226 (53%), Gaps = 16/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + + L+C Y ++ + + PLK+EE LDP VV HD + +I
Sbjct: 284 YEKVCRGEVGPSPRQERPLRCRYSLGSHPYRHLAPLKLEEHSLDPFVVTYHDMLSPRKIA 343
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ ++ ++ R V + G R+SK +L + HP + + + + D T L
Sbjct: 344 DLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWL---AYDSHPTMGGMLSDLSDATGL 400
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ E+ LQ+ NYG+GGHY+ H D P +EG R+A+ +FYL+DVE GGA
Sbjct: 401 DMTFCEQ----LQVANYGVGGHYEPHWDFFRDPDHYPAEEGN-RMATAIFYLSDVEQGGA 455
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP LN V P+ G+ +FWYN H + +DYR H+GCPV G+KW
Sbjct: 456 TAFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKGSKW 501
>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 526
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 78/232 (33%), Positives = 126/232 (54%), Gaps = 19/232 (8%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
E Y C+G + + + L C +++ + L + P K E+ + PR+V+ HD I D
Sbjct: 279 EKYEKLCRGEGVKMTSRRQKRLFCRYFDGKKDPLLILSPTKQEDEWDKPRIVRYHDIISD 338
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI+++ EL+K ++ R + N G R++K +L + P + ++ RI+
Sbjct: 339 EEISKVKELAKPRLRRATISNPITGVLETAQYRITKSAWLSGY---EDPVVARLNRRIEG 395
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
+T L + E LQ+ NYG+GG Y+ H D + E R+A+++FY++D
Sbjct: 396 VTGLDMSTAEE----LQVANYGIGGQYEPHFDFLRKYEPDAFKKLGTGNRVATWLFYMSD 451
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE GGAT+FP + V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 452 VEAGGATVFPEVGAAVYPKKGTAVFWYNLLESGEGDYSTRHAACPVLVGNKW 503
>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
garnettii]
Length = 544
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 82/222 (36%), Positives = 123/222 (55%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P++ E ++L+P V HD + DSE +I
Sbjct: 305 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPIRKEVIHLEPFVALYHDFVSDSEAQKIR 364
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
EL++ ++R V + + VD R+SK +L + P L + RI +T L + +
Sbjct: 365 ELAEPWLQRSVVASGEKQLQVDYRISKSAWLKDTV---DPMLVTLDHRIAALTGLDV--Q 419
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 420 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 479
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H N D H+GCPV +G+KW
Sbjct: 480 YANFSVPVVKNAALFWWNLHRNGEGDSDTLHAGCPVLVGDKW 521
>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
Length = 534
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 122/226 (53%), Gaps = 16/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + + L+C Y ++ + + PLK+EE LDP VV HD + +I
Sbjct: 284 YEKVCRGEVGPSPRQERPLRCRYSLGSHPYRHLAPLKLEEHSLDPFVVTYHDMLSPRKIA 343
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ ++ ++ R V + G R+SK +L + HP + + + + D T L
Sbjct: 344 DLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWL---AYDSHPTMGGMLSDLSDATGL 400
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ E+ LQ+ NYG+GGHY+ H D P +EG R+A+ +FYL+DVE GGA
Sbjct: 401 DMTFCEQ----LQVANYGVGGHYEPHWDFFRDPDHYPAEEGN-RMATAIFYLSDVEQGGA 455
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP LN V P+ G+ +FWYN H + +DYR H+GCPV G+KW
Sbjct: 456 TAFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKGSKW 501
>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
Length = 523
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 80/229 (34%), Positives = 118/229 (51%), Gaps = 19/229 (8%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G + + + + LKC+Y++ + + + P+K+E+ +P + HD + D EI I E
Sbjct: 279 CRGERLLNDKLLAELKCWYDTRHQFYFLLMPIKIEQHSFEPAIYTFHDVLSDEEIETIKE 338
Query: 67 LSKGKVERGKV---VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
L+K + R V + G + + R SK +L PE G HP L ++ RI +T L
Sbjct: 339 LAKPLLARSMVQGKLGVGHEVS-NVRTSKTAWL-PE--GLHPLLNRLSRRIGLITGLKTD 394
Query: 124 REERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------------RLASFMFYLTDVEL 171
LQ+ NYG+GGHY H D +D+ + R+A+FMFYL DVE
Sbjct: 395 PIRDEAELLQVANYGIGGHYSPHHDYLMKDKADFEYMHHRELQAGDRIATFMFYLNDVER 454
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG+T FP + V P KG A FW+N + D H CPV LG+KW
Sbjct: 455 GGSTAFPRAGVAVKPVKGGAAFWFNLKRSGKPDPLTLHGACPVLLGHKW 503
>gi|312092237|ref|XP_003147267.1| hypothetical protein LOAG_11701 [Loa loa]
Length = 553
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 20/230 (8%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y C+ + V +S L C+Y+ + +L++ P+KVE +Y +P V HD + D E
Sbjct: 282 DTYQALCRQEMPVNIKAQSRLYCYYK-MDRPYLRLAPIKVEIVYQNPLAVLFHDIMSDEE 340
Query: 61 INRIIE-LSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
+RIIE L+ K++R V V G+ R+SK +L +H + +I R+
Sbjct: 341 -SRIIEMLAVPKLDRATVHNVETGNLETASYRISKSAWLRS---TEHEVVNRINRRLDLA 396
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVE 170
TNL I E LQ+ NYG+GGHY+ H D + RDE + R+A+ + Y+T+ E
Sbjct: 397 TNLEIATAEE----LQVQNYGIGGHYEPHLDCS-RDEDAFERTGTGNRIATILIYMTEPE 451
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+GG T+F +L +V K +A+FWYN + +D R YH+ CPV G KW
Sbjct: 452 IGGRTVFINLKASVPCTKNAALFWYNLMRSGAVDMRSYHAACPVLTGTKW 501
>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
Length = 212
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 82/197 (41%), Positives = 113/197 (57%), Gaps = 20/197 (10%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVY 94
I P K+EE LDP +V H+AI D EI +II++SK ++R V + R S+
Sbjct: 1 IAPFKLEEASLDPLIVIYHNAISDKEIEQIIQVSKPMLKRSMVGESFSKEVSNERTSQNA 60
Query: 95 FLYPEIFGDHPF-LYKIQT-RIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP- 151
+L D+ F L K+ + R +DMT L + + LQ+NNYG+GG Y H D
Sbjct: 61 WL-----ADYDFELVKVLSLRTEDMTGL----DRKSYESLQVNNYGIGGFYLPHFDWVRT 111
Query: 152 -------RDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLL 203
+D GL R+A+ M+YL+DVE GGAT+FP + + VFP+KGSA+FWYN +
Sbjct: 112 NGTEEPYKDMGLGNRIATLMYYLSDVEQGGATVFPQIGVGVFPKKGSAIFWYNLLPDGTG 171
Query: 204 DYRMYHSGCPVALGNKW 220
D R H CPV LG+KW
Sbjct: 172 DERTLHGACPVLLGSKW 188
>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
Length = 537
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 124/226 (54%), Gaps = 18/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G V ++ L+C Y N+ + + PLK+EE LDP V HD + +I+
Sbjct: 289 YEKVCRGE--VHPIVRQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDILSPGKIS 346
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
++ E++ ++ R V + G R+SK +L + HP + + ++D T L
Sbjct: 347 QLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWL---AYESHPTMVGMLRDLKDATGL 403
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ + LQ+ NYG+GGHY+ H D P +EG R+A+ +FYL++VE GGA
Sbjct: 404 ----DTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGN-RIATAIFYLSEVEQGGA 458
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP L++ V P+ G+ +FWYN H + DYR H+GCPV G+KW
Sbjct: 459 TAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504
>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
Length = 467
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 124/226 (54%), Gaps = 18/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G V ++ L+C Y N+ + + PLK+EE LDP V HD + +I+
Sbjct: 219 YEKVCRGE--VHPIVRQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDILSPGKIS 276
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
++ E++ ++ R V + G R+SK +L + HP + + ++D T L
Sbjct: 277 QLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWL---AYESHPTMVGMLRDLKDATGL 333
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ + LQ+ NYG+GGHY+ H D P +EG R+A+ +FYL++VE GGA
Sbjct: 334 ----DTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGN-RIATAIFYLSEVEQGGA 388
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP L++ V P+ G+ +FWYN H + DYR H+GCPV G+KW
Sbjct: 389 TAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 434
>gi|291387302|ref|XP_002710242.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 2 [Oryctolagus
cuniculus]
gi|217273039|gb|ACK28132.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Oryctolagus cuniculus]
Length = 555
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 85/252 (33%), Positives = 126/252 (50%), Gaps = 39/252 (15%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHDGNGAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + +I R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARINRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
+T L + E LQ+ NYG+GG Y+ H D + P D GL
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNNERD 460
Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
R+A+F+ Y++DVE GGAT+FP L ++P+KG+AVFWYN + DYR
Sbjct: 461 AFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 520
Query: 209 HSGCPVALGNKW 220
H+ CPV +G KW
Sbjct: 521 HAACPVLVGCKW 532
>gi|390459659|ref|XP_002806656.2| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-2 [Callithrix jacchus]
Length = 579
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 84/252 (33%), Positives = 127/252 (50%), Gaps = 39/252 (15%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N + L I P K E+ + P +V+ +D + D
Sbjct: 312 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRASQLLIAPFKEEDEWDSPHIVRYYDVMSD 371
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 372 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 428
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
+T L + E LQ+ NYG+GG Y+ H D + P D GL
Sbjct: 429 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERD 484
Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
R+A+F+ Y++DVE GGAT+FP L ++P+KG+AVFWYN + DYR
Sbjct: 485 AFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGXGDYRTR 544
Query: 209 HSGCPVALGNKW 220
H+ CPV +G KW
Sbjct: 545 HAACPVLVGCKW 556
>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
Length = 498
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 79/223 (35%), Positives = 116/223 (52%), Gaps = 20/223 (8%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C G P S L C Y S F +I PLK+EEL DP +V HD +Y+SEI+
Sbjct: 263 YKRGCNGVFRAP----SYLHCRYNSTTTAFARIAPLKMEELSHDPYMVLFHDVVYESEID 318
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
++ ++ K G Y R SK + D + + R+ DMT L +
Sbjct: 319 FLLNATQLKASL-----VGQYQYSPVRTSKEQHFVE--YNDTAVVKTLHRRLNDMTGLDM 371
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGATIF 177
+ L + NYG+GGHYD+H D+ E R+A+ +FY+ +V+ GGAT F
Sbjct: 372 IESD----ALTLINYGMGGHYDVHYDSHNYSEANRLILGDRIATVLFYVGEVDSGGATTF 427
Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
P +N++V P+KGSAV WYN ++ + H+GCPV +G+K+
Sbjct: 428 PYINVSVTPKKGSAVLWYNLDNAGQMNPKAIHAGCPVIVGSKY 470
>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
leucogenys]
Length = 544
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 81/224 (36%), Positives = 124/224 (55%), Gaps = 18/224 (8%)
Query: 5 LACQGNL-SVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
L CQ L +P +L C YE+ +N +L + P++ E ++L+P + HD + DSE +
Sbjct: 308 LGCQPTLYQIP-----SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQK 362
Query: 64 IIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
I EL++ ++R V + + V+ R+SK +L + P L + RI +T L +
Sbjct: 363 IRELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV- 418
Query: 124 REERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATI 176
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT
Sbjct: 419 -RPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATA 477
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
F NL+V + +A+FW+N H + D H+GCPV +G+KW
Sbjct: 478 FIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKW 521
>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
[Drosophila melanogaster]
Length = 286
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 124/226 (54%), Gaps = 18/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G V ++ L+C Y N+ + + PLK+EE LDP V HD + +I+
Sbjct: 38 YEKVCRGE--VHPIVRQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDILSPGKIS 95
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
++ E++ ++ R V + G R+SK +L + HP + + ++D T L
Sbjct: 96 QLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWL---AYESHPTMVGMLRDLKDATGL 152
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ + LQ+ NYG+GGHY+ H D P +EG R+A+ +FYL++VE GGA
Sbjct: 153 ----DTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGN-RIATAIFYLSEVEQGGA 207
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP L++ V P+ G+ +FWYN H + DYR H+GCPV G+KW
Sbjct: 208 TAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 253
>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
jacchus]
Length = 544
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 78/208 (37%), Positives = 117/208 (56%), Gaps = 12/208 (5%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ +N +L + P++ E L+L+P + HD + DSE +I E ++ ++R V +
Sbjct: 319 SLYCSYETNSNPYLVLQPIQKEILHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVAS 378
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ V+ R+SK +L + P L + RI +T L + Y LQ+ NYG+
Sbjct: 379 GEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 433
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F NL+V K +A+
Sbjct: 434 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVKNAAL 493
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FW+N H + D H+GCPV +GNKW
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGNKW 521
>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3, partial [Saimiri boliviensis boliviensis]
Length = 534
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 79/208 (37%), Positives = 117/208 (56%), Gaps = 12/208 (5%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE +N +L + P++ E L+L+P + HD + DSE +I EL++ ++R V +
Sbjct: 309 SLYCSYEINSNPYLLLQPIQKEVLHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 368
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ V+ R+SK +L + P L + RI +T L + Y LQ+ NYG+
Sbjct: 369 GEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 423
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F NL+V K +A+
Sbjct: 424 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVKNAAL 483
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FW+N H + D H+GCPV +GNKW
Sbjct: 484 FWWNLHRSGEGDSDTLHAGCPVLVGNKW 511
>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
Length = 537
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 123/226 (54%), Gaps = 18/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G V + L+C Y N+ + + PLK+EE LDP V HD + +I+
Sbjct: 289 YEKVCRGE--VHPIARQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDMLSPRKIS 346
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
++ E++ ++ R V + G R+SK +L + HP + + ++D T L
Sbjct: 347 QLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWL---AYESHPTMVGMLRDLKDATGL 403
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ + LQ+ NYG+GGHY+ H D P +EG R+A+ +FYL++VE GGA
Sbjct: 404 ----DTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGN-RIATAIFYLSEVEQGGA 458
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP L++ V P+ G+ +FWYN H + DYR H+GCPV G+KW
Sbjct: 459 TAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504
>gi|167045848|gb|ABZ10515.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Callithrix jacchus]
Length = 555
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 84/252 (33%), Positives = 127/252 (50%), Gaps = 39/252 (15%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N + L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRASQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
+T L + E LQ+ NYG+GG Y+ H D + P D GL
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERD 460
Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
R+A+F+ Y++DVE GGAT+FP L ++P+KG+AVFWYN + DYR
Sbjct: 461 AFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 520
Query: 209 HSGCPVALGNKW 220
H+ CPV +G KW
Sbjct: 521 HAACPVLVGCKW 532
>gi|197215651|gb|ACH53042.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Otolemur garnettii]
Length = 555
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 85/252 (33%), Positives = 126/252 (50%), Gaps = 39/252 (15%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
E+Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 EVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNHRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
+T L + E LQ+ NYG+GG Y+ H D + P D GL
Sbjct: 405 ITGLSVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVATFLNYNHERD 460
Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
R+A+F+ Y++DVE GGAT+FP L ++P+KG+AVFWYN + DYR
Sbjct: 461 AFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 520
Query: 209 HSGCPVALGNKW 220
H+ CPV +G KW
Sbjct: 521 HAACPVLVGCKW 532
>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
Length = 537
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 123/226 (54%), Gaps = 18/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G V + L+C Y N+ + + PLK+EE LDP V HD + +I+
Sbjct: 289 YEKVCRGE--VHPIARQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDMLNPRKIS 346
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
++ E++ ++ R V + G R+SK +L + HP + + ++D T L
Sbjct: 347 QLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWL---AYESHPTMVGMLRDLKDATGL 403
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ + LQ+ NYG+GGHY+ H D P +EG R+A+ +FYL++VE GGA
Sbjct: 404 ----DTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGN-RIATAIFYLSEVEQGGA 458
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP L++ V P+ G+ +FWYN H + DYR H+GCPV G+KW
Sbjct: 459 TAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504
>gi|170649696|gb|ACB21278.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Callicebus moloch]
Length = 555
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 84/252 (33%), Positives = 126/252 (50%), Gaps = 39/252 (15%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
+T L + E LQ+ NYG+GG Y+ H D + P D GL
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERD 460
Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
R+A+F+ Y++DVE GGAT+FP L ++P+KG+AVFWYN + DYR
Sbjct: 461 AFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 520
Query: 209 HSGCPVALGNKW 220
H+ CPV +G KW
Sbjct: 521 HAACPVLVGCKW 532
>gi|281183175|ref|NP_001162504.1| prolyl 4-hydroxylase subunit alpha-2 [Papio anubis]
gi|159461520|gb|ABW96795.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase, alpha
polypeptide II, isoform 1 (predicted) [Papio anubis]
Length = 578
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 84/252 (33%), Positives = 126/252 (50%), Gaps = 39/252 (15%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 311 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 370
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 371 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 427
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
+T L + E LQ+ NYG+GG Y+ H D + P D GL
Sbjct: 428 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERH 483
Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
R+A+F+ Y++DVE GGAT+FP L ++P+KG+AVFWYN + DYR
Sbjct: 484 TFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 543
Query: 209 HSGCPVALGNKW 220
H+ CPV +G KW
Sbjct: 544 HAACPVLVGCKW 555
>gi|195390825|ref|XP_002054068.1| GJ24233 [Drosophila virilis]
gi|194152154|gb|EDW67588.1| GJ24233 [Drosophila virilis]
Length = 533
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 83/215 (38%), Positives = 118/215 (54%), Gaps = 23/215 (10%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C+Y++ + FL++ P K+E L DP + HD IY SEI +I + + ++R V
Sbjct: 295 TRLTCYYKTNPSEFLRLAPFKLELLSKDPYIAVFHDVIYASEIAELIRIGEPMLKRTAVQ 354
Query: 79 NYGDTIYVDTRLSK------VYFLYPEIFG-DHPFLYKIQTRIQDMTNLVI-GREERYKG 130
N T VDT +SK + L + + +++IQ RI+DMT L+I G E+
Sbjct: 355 NI--TQNVDTYISKDRTATGSWILNGNLTKLERNMIWRIQRRIEDMTGLLITGFSEQ--- 409
Query: 131 PLQINNYGLGGHYDLH-----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVF 185
LQ+ NY GGHY H C + P D R+A+ + YL DV GGAT+FP L+L V
Sbjct: 410 DLQLLNYVFGGHYQSHYDFFNCPSFPHD----RIATTLIYLNDVVRGGATVFPKLDLVVQ 465
Query: 186 PEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
PE+G + WYN +T D R H GCPV +G K
Sbjct: 466 PERGKVLHWYNMLPDTFDYDRRSLHGGCPVLIGEK 500
>gi|195341556|ref|XP_002037372.1| GM12148 [Drosophila sechellia]
gi|194131488|gb|EDW53531.1| GM12148 [Drosophila sechellia]
Length = 542
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 84/227 (37%), Positives = 114/227 (50%), Gaps = 15/227 (6%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+ P C G +VP ++ S L C Y + FL++ P+K E L +DP VV +HD I E
Sbjct: 288 VLPPCCSGRCAVPRNLNS-LYCVYNHVTSPFLQLAPIKTEILSVDPFVVLLHDMISQKES 346
Query: 62 NRIIELSKGKVERGKVVN--YGDT-IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I SK + + DT VDT + Y F D KI R+ D T
Sbjct: 347 TLIRNSSKEHMLPSATTDPDASDTETQVDTYRTSKSVWYSSDFNDTT--KKITERLGDAT 404
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
L + E Y Q+ NYGLGG ++ H D ++ + R+A+ +FYL +V GG
Sbjct: 405 GLDMNFTEFY----QVINYGLGGFFETHLDMLLSEKNRFNGTRDRIATTLFYLNEVRQGG 460
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP LNLTVFP+ GSA+FWYN H+GCPV +G+KW
Sbjct: 461 GTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMDSLHTGCPVIVGSKW 507
>gi|443712762|gb|ELU05926.1| hypothetical protein CAPTEDRAFT_153364 [Capitella teleta]
Length = 491
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 87/232 (37%), Positives = 119/232 (51%), Gaps = 20/232 (8%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELY-LDPRVVKIHDAIYDSEI 61
Y C+G++ V E KS L C Y + L P+ EE++ +DP V +D I D+E
Sbjct: 240 YQELCRGDMIVEESKKSLLYCRYAKGRDIPL---PIYKEEVHNVDPHVAIFYDVISDAEA 296
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT--N 119
+ II + + RG V N D R+SKV +L+ + + K+ RI D+T N
Sbjct: 297 DHIIRHAFPGMFRGLVGNSTLRQSSDQRISKVGWLFDNV---DTLIKKLSARIGDVTGLN 353
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----------RLASFMFYLTD 168
V +Q+ NYG+GG Y+ H D E L R+++F+FYL+
Sbjct: 354 TVYTPVRSPVEAMQVVNYGIGGQYEPHLDFYEDPEMLKNVNPSLQDTGDRISTFLFYLSR 413
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
V LGGAT+FP LN+ V P K A FWYNA N D R H+GCPV LG KW
Sbjct: 414 VHLGGATVFPKLNVRVPPVKNGAAFWYNARPNGEHDKRTLHAGCPVVLGEKW 465
>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
Length = 460
Score = 130 bits (326), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 77/223 (34%), Positives = 123/223 (55%), Gaps = 21/223 (9%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
L + N S + ++L C Y S + FL I PLK+EE+ DP +V HD IY++EIN +
Sbjct: 227 LGAKRNCSAKFRLPNHLHCRYNSSTSPFLHIAPLKMEEISTDPYMVVYHDVIYENEINWL 286
Query: 65 IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP--FLYKIQTRIQDMTNLVI 122
++ S ++ ++ ++++S + FG + + I+ RI+DMT L +
Sbjct: 287 LDNS----------DFRTSLVGESQISTLRTSQDMPFGANSGEVMRNIEKRIKDMTGLSM 336
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGATIF 177
E + + NYG+GG Y +H D E L R+ + +FYL DVEL G+T+F
Sbjct: 337 DLSEDF----MLINYGIGGTYKMHYDFYVYSEPLRFLRGERIVTVLFYLGDVELSGSTVF 392
Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
P LN+++ P+KGSAV WYN H + + + H CPV +G+K+
Sbjct: 393 PFLNISITPKKGSAVMWYNLHNSGDVHQKTQHCACPVVVGSKY 435
>gi|195575111|ref|XP_002105523.1| GD16991 [Drosophila simulans]
gi|194201450|gb|EDX15026.1| GD16991 [Drosophila simulans]
Length = 542
Score = 129 bits (325), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 115/227 (50%), Gaps = 15/227 (6%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+ P C G +VP ++ S+L C Y + FL++ P+K E L +DP V+ +HD I E
Sbjct: 288 VLPPCCSGRCAVPRNL-SSLYCVYNHVTSPFLQLAPIKTEILSVDPFVLLLHDMISQKES 346
Query: 62 NRIIELSKGKVERGKVVN--YGDT-IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I SK + + DT VDT + Y F D KI R+ D T
Sbjct: 347 TLIRNSSKEHMLPSATTDPDSSDTETQVDTYRTSKSVWYSSDFNDTT--KKITERLGDAT 404
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
L E Y Q+ NYGLGG ++ H D ++ + R+A+ +FYL +V GG
Sbjct: 405 GLDTNFTEFY----QVINYGLGGFFETHLDMLLSEKNRFNGTRDRIATTLFYLNEVRQGG 460
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP +NLTVFP+ GSA+FWYN N H+GCPV +G+KW
Sbjct: 461 GTYFPRINLTVFPQPGSALFWYNLDTNGNDHMGSLHTGCPVIVGSKW 507
>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
Length = 544
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 76/208 (36%), Positives = 118/208 (56%), Gaps = 12/208 (5%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ +N +L + P++ E ++L+P + HD + DSE +I EL++ ++R V +
Sbjct: 319 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 378
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ V+ R+SK +L + P L + RI +T L + Y LQ+ NYG+
Sbjct: 379 GEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 433
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F NL+V + +A+
Sbjct: 434 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAAL 493
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FW+N H + D H+GCPV +G+KW
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKW 521
>gi|195159305|ref|XP_002020522.1| GL13469 [Drosophila persimilis]
gi|194117291|gb|EDW39334.1| GL13469 [Drosophila persimilis]
Length = 253
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 113/209 (54%), Gaps = 24/209 (11%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y + FL++ PLK+E L LDP VV HD + D E++ + +++ + R
Sbjct: 32 SRLYCLYNTTATAFLRLAPLKMELLSLDPYVVLYHDVLADREMSLLKLMAQRDLVRAVTY 91
Query: 79 NYGDTIYVD--TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
N + + + R +K +L P H + ++ +DM+NL + R E + Q+ N
Sbjct: 92 NATEKKHSEDPNRTTKAGWLDPS----HNLIRRMGILTEDMSNLDLERSEDF----QVLN 143
Query: 137 YGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYN 196
YG+GGHY +H D F L+DV LGGAT+FP L+L+VFP+KG+ + WYN
Sbjct: 144 YGIGGHYAVHPD--------------FFELSDVPLGGATVFPLLDLSVFPKKGAVLMWYN 189
Query: 197 AHANTLLDYRMYHSGCPVALGNKWGKLLL 225
+ HS CPV +G++WGK+ L
Sbjct: 190 LDHKGQGMEKTIHSACPVVVGSRWGKINL 218
>gi|350014318|dbj|GAA37183.1| prolyl 4-hydroxylase [Clonorchis sinensis]
Length = 595
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 84/229 (36%), Positives = 119/229 (51%), Gaps = 17/229 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
EIY C+G P + C Y + + KIGP+K E LY DPR+V +D I+ SE
Sbjct: 345 EIYQALCRGEQLFPPPPDDQVYCRY-YIPHPYYKIGPVKEEVLYPDPRIVMWYDVIHPSE 403
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ RI EL+ ++ R V N G R SK +L G +++ RI +T
Sbjct: 404 VGRIQELALPRLRRATVKNPVTGKLENAYYRTSKSAWLQD---GLDEVTHRLNQRIHALT 460
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVEL 171
L + E LQ+ NYG+GG+Y H D R++ + R+A+ +FYLTDV+
Sbjct: 461 GLAMETAE----DLQVGNYGIGGYYAPHFDFGRKREKDAFEVENGNRIATIIFYLTDVKA 516
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F +V P +G+A FWYN H + D R H CPV +G+KW
Sbjct: 517 GGATVFNRFGASVKPVRGAAGFWYNLHPSGEGDLRTRHVACPVLVGSKW 565
>gi|229368743|gb|ACQ63024.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Dasypus novemcinctus]
Length = 556
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 84/252 (33%), Positives = 126/252 (50%), Gaps = 39/252 (15%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 289 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDIMSD 348
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L D P + ++ R++
Sbjct: 349 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEE---NDDPVVAQVNRRMEH 405
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
+T L + E LQ+ NYG+GG Y+ H D + P D GL
Sbjct: 406 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNHEQD 461
Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
R+A+F+ Y++DVE GGAT+FP L ++P+KG+AVFWYN + DYR
Sbjct: 462 VFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 521
Query: 209 HSGCPVALGNKW 220
H+ CPV +G KW
Sbjct: 522 HAACPVLVGCKW 533
>gi|195064500|ref|XP_001996577.1| GH12091 [Drosophila grimshawi]
gi|193895397|gb|EDV94263.1| GH12091 [Drosophila grimshawi]
Length = 521
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 124/224 (55%), Gaps = 20/224 (8%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
+ C+G+ P+ + NL C Y FL++ PLK+EE+ DP VV H+ IYDSEI +
Sbjct: 288 VGCRGHF--PK--RHNLSCRYNFTTTPFLRLAPLKLEEINHDPYVVMYHNVIYDSEIEEM 343
Query: 65 IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
LS +++ G + Y T+++ + + + PFL ++ RI DMT G
Sbjct: 344 KRLSP-QMQNGYIHGYKAN---QTKVTDIAARVNWLVENTPFLERMNQRITDMT----GF 395
Query: 125 EERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW----RLASFMFYLTDVELGGATI 176
+ + +Q+ N+G+G +++ H D R E + RLAS +FY +DV LGGAT+
Sbjct: 396 DLKEFPSVQVANFGIGNNFEAHYDYIFGKRVRKEDVGDLGDRLASIIFYSSDVPLGGATV 455
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP + + V P+KG+++ WYN + D R HS CPV +G++W
Sbjct: 456 FPDIQVAVQPQKGNSLLWYNLFDDGTPDPRSLHSVCPVVVGSRW 499
>gi|55925444|ref|NP_001007286.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Danio rerio]
gi|49900294|gb|AAH76508.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide 2 [Danio rerio]
gi|182891794|gb|AAI65288.1| P4ha2 protein [Danio rerio]
Length = 514
Score = 129 bits (324), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 117/224 (52%), Gaps = 25/224 (11%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
E Y C+G + + +S L C Y N N L + P+K E+ + P +V+ +A+ D
Sbjct: 289 EAYEALCRGEGVKMTTKRQSRLFCRYRDGNRNPRLLLKPMKEEDEWDSPHIVRFLEALSD 348
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I E++ K+ R V + G R+SK +L E D P + ++ RI+D
Sbjct: 349 EEIQKIKEIATPKLARATVRDPKTGVLTVAHYRVSKSAWLEGE---DDPVIARVNQRIED 405
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
+T L + E LQ+ NYG+GG Y+ H D + ++DVE GGAT+
Sbjct: 406 ITGLTVDTAEL----LQVANYGVGGQYEPHFDFS--------------RMSDVEAGGATV 447
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP +V+P KG+AVFWYN + DYR H+ CPV +G+KW
Sbjct: 448 FPDFGASVWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 491
>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3 [Papio anubis]
Length = 535
Score = 129 bits (324), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 76/208 (36%), Positives = 117/208 (56%), Gaps = 12/208 (5%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ +N +L + P++ E ++L+P + HD + DSE +I E ++ ++R V +
Sbjct: 310 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVAS 369
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ V+ R+SK +L + P L + RI +T L + Y LQ+ NYG+
Sbjct: 370 GEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 424
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F NL+V K +A+
Sbjct: 425 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVKNAAL 484
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FW+N H + D H+GCPV +G+KW
Sbjct: 485 FWWNLHRSGEGDSDTLHAGCPVLVGDKW 512
>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
Length = 517
Score = 129 bits (324), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 86/222 (38%), Positives = 113/222 (50%), Gaps = 20/222 (9%)
Query: 6 ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
C+G P+ L C+YE + FL+I P KVE L P V +D + DSEI +
Sbjct: 287 CCRGEYKPPK----GLSCYYEYGADPFLRIAPFKVELLNRSPYVAAYYDVLNDSEIEELK 342
Query: 66 ELSKGKVERGKVVNYG---DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL-- 120
+S ++ R + N+ D VD + V+ I L I R DMT+L
Sbjct: 343 LMSSPQIRRSLLYNHTLDIDQADVDRTSNSVFMEETGI----TLLETISQRAADMTDLYV 398
Query: 121 -VIGREERYKGPLQINNYGLGGHYDLHCDATPRD-EGLWRLASFMFYLTDVELGGATIFP 178
I E+ LQ+ NYGLGG Y HCD + E RLA+ +FYLTDV+ GGAT+FP
Sbjct: 399 TAISSED-----LQVINYGLGGQYTPHCDYFDENAENGDRLATVLFYLTDVQQGGATVFP 453
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L L+ FP+KGSA+ + N D HS CPV GNKW
Sbjct: 454 FLRLSYFPKKGSALIFRNLDNAMSGDKDSTHSACPVLFGNKW 495
>gi|116496629|gb|AAI26171.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
Length = 544
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/208 (36%), Positives = 119/208 (57%), Gaps = 12/208 (5%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ +N +L + P++ E ++L+P + HD + DSE +I EL++ ++R V +
Sbjct: 319 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 378
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ V+ R+SK +L + +P L + RI +T L + Y LQ+ NYG+
Sbjct: 379 GEKQLQVEYRISKSAWLKDTV---NPKLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 433
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F NL+V + +A+
Sbjct: 434 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAAL 493
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FW+N H + D H+GCPV +G+KW
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKW 521
>gi|195145084|ref|XP_002013526.1| GL24185 [Drosophila persimilis]
gi|194102469|gb|EDW24512.1| GL24185 [Drosophila persimilis]
Length = 229
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 113/207 (54%), Gaps = 16/207 (7%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y S F +I PLK+EEL DP +V HD +Y+SEI+ ++ ++ K
Sbjct: 6 SYLHCRYNSTTTAFARIAPLKMEELSHDPYMVLFHDVVYESEIDFLLNATQLKASL---- 61
Query: 79 NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
G Y R SK + D + + R+ DMT L + + L + NYG
Sbjct: 62 -VGQYQYSPVRTSKEQHFVE--YNDTAVVKTLHRRLNDMTGLDMIESDT----LTLINYG 114
Query: 139 LGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
+GGHYD+H D+ E R+A+ +FY+ +V+ GGAT FP +N++V P+KGSAV
Sbjct: 115 MGGHYDVHYDSHNYSEANRLILGDRIATVLFYVGEVDSGGATTFPYINVSVTPKKGSAVL 174
Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKW 220
WYN + ++ + H+GCPV +G+K+
Sbjct: 175 WYNLDNSGQMNPKAIHAGCPVIVGSKY 201
>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
Length = 534
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 77/231 (33%), Positives = 118/231 (51%), Gaps = 24/231 (10%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
+ L+C G P + + L CFY FL++ PLK E++ LDP VV H+ + EI+
Sbjct: 287 FKLSCNG----PHESSTRLHCFYNFTTTPFLRLAPLKTEQIGLDPYVVLYHEVLSAREIS 342
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
+I + ++ +V R +K ++L E + +I RI DMT +
Sbjct: 343 MLISKAAQNMKNTRVHRETKPKTNRGRTAKGHWLKKE---SNELTRRITRRIVDMTGFDL 399
Query: 123 GREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEGLW----RLASFMFYLTDV 169
E + Q+ NYG+GGHY LH D PR R+A+ +FYL+DV
Sbjct: 400 ADSEDF----QVINYGIGGHYFLHMDYFDYASSNYTGPRSRQSKVLGDRIATVLFYLSDV 455
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGAT+F ++ +V+P+ G+A+FWYN + D H+ CPV +G+KW
Sbjct: 456 EQGGATVFGNVGYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPVIVGSKW 506
>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
gorilla gorilla]
Length = 517
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/208 (36%), Positives = 118/208 (56%), Gaps = 12/208 (5%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ +N +L + P++ E ++L+P + HD + DSE +I EL++ ++R V +
Sbjct: 292 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 351
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ V+ R+SK +L + P L + RI +T L + Y LQ+ NYG+
Sbjct: 352 GEKQLQVEYRISKSAWLKDTV---DPKLVALNHRIAALTGLDV--RPPYAEYLQVVNYGI 406
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F NL+V + +A+
Sbjct: 407 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAAL 466
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FW+N H + D H+GCPV +G+KW
Sbjct: 467 FWWNLHRSGEGDSDTLHAGCPVLVGDKW 494
>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
Length = 537
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 79/226 (34%), Positives = 123/226 (54%), Gaps = 18/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G V + L+C Y ++ + + PLK+EE LDP V HD + +I+
Sbjct: 289 YEKVCRGE--VHPIARQELRCRYSRGSHPYRYLAPLKLEEHSLDPYVATYHDMLSPRKIS 346
Query: 63 RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
++ E++ ++ R V + G R+SK +L + HP + + +++ T L
Sbjct: 347 QLREMAVPRMRRSTVNPLPGGQHKKSAFRVSKNAWL---AYESHPTMVGMLRDLKEATGL 403
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ Y LQ+ NYG+GGHY+ H D P +EG R+A+ +FYL++VE GGA
Sbjct: 404 ----DTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPEEEGN-RIATAIFYLSEVEQGGA 458
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP L++ V P+ G+ +FWYN H + DYR H+GCPV G+KW
Sbjct: 459 TAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504
>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III, isoform CRA_b
[Homo sapiens]
gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
Length = 544
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/208 (36%), Positives = 118/208 (56%), Gaps = 12/208 (5%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ +N +L + P++ E ++L+P + HD + DSE +I EL++ ++R V +
Sbjct: 319 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 378
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ V+ R+SK +L + P L + RI +T L + Y LQ+ NYG+
Sbjct: 379 GEKQLQVEYRISKSAWLKDTV---DPKLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 433
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F NL+V + +A+
Sbjct: 434 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAAL 493
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FW+N H + D H+GCPV +G+KW
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKW 521
>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
Length = 528
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/208 (36%), Positives = 118/208 (56%), Gaps = 12/208 (5%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ +N +L + P++ E ++L+P + HD + DSE +I EL++ ++R V +
Sbjct: 303 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 362
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ V+ R+SK +L + P L + RI +T L + Y LQ+ NYG+
Sbjct: 363 GEKQLQVEYRISKSAWLKDTV---DPKLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 417
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F NL+V + +A+
Sbjct: 418 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAAL 477
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FW+N H + D H+GCPV +G+KW
Sbjct: 478 FWWNLHRSGEGDSDTLHAGCPVLVGDKW 505
>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
Length = 502
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 80/224 (35%), Positives = 117/224 (52%), Gaps = 18/224 (8%)
Query: 6 ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
C+G P+ L C+Y+S + FL + P KVE L P V HD +YD EI +
Sbjct: 250 CCRGEYEHPK----GLSCYYDSKDEPFLFLAPFKVEILNNLPFVAIYHDVLYDREIEELK 305
Query: 66 ELSKGKVERGKVVNYGD--TIYVDTRLSKVYFLYPEIFGDHPFLYKI-QTRIQDMTNLVI 122
L+ + R + +Y + V+ R S FL + +L I + R+ DMT+L +
Sbjct: 306 RLAVPTITRSTIYDYDKEGNVPVNFRTSNSVFL----LNNASYLVDILRQRVADMTHLNV 361
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGATIF 177
+ LQ+ NYGLGG+Y H D +DE R+ + + Y+TDV+ GGAT+F
Sbjct: 362 FKNS--SDDLQVMNYGLGGYYRYHFDFFGKDESPNKLLGDRIITVLIYMTDVQQGGATVF 419
Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
P+L +T FP+KGSA+ + N N D H+GCPV G+KW
Sbjct: 420 PALRITNFPKKGSALIFRNLDNNISPDPSTLHAGCPVLFGSKWA 463
>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
Length = 532
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 80/229 (34%), Positives = 116/229 (50%), Gaps = 17/229 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y C+G VP L C Y ++ F I PL+ E L+P + H + D E
Sbjct: 291 QTYEALCRGEDVVPVKDPHKLTCQYRFWHPMFY-INPLREETASLEPWIAVYHQLMNDHE 349
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I RI E++ ++ R V N G + R+SK +L E + P + +I R +T
Sbjct: 350 IERIKEMATPRLARATVHNSATGQLEHAKYRISKSGWLRDE---EDPLIARISERCSALT 406
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----WR---LASFMFYLTDVEL 171
NL + E LQ+ NYG+GG Y+ H D + R E WR + + ++Y+TDVE
Sbjct: 407 NLSLTTVEE----LQVVNYGIGGQYEPHFDFSRRSEPTAFEKWRGNRILTVIYYMTDVEA 462
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F + V+PEKGSA W+N + D R H+ CPV G+KW
Sbjct: 463 GGATVFLDAGVKVYPEKGSAAVWHNLLPSGEGDMRTRHAACPVLTGSKW 511
>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
Length = 595
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 80/252 (31%), Positives = 125/252 (49%), Gaps = 41/252 (16%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + +S L C +Y+S + + P+K ++ + P +V+ D I D E
Sbjct: 328 YEMLCRGEGIRLTPRRQSRLFCRYYDSKRHPRYILSPVKQQDEWDRPYIVRYLDIISDKE 387
Query: 61 INRIIELSKGKVERGKVVN------------------------YGDTIYVDTRLSKVYFL 96
I + +L+K ++ R + N G R+SK +L
Sbjct: 388 IELVKQLAKPRLRRATISNPITGVLETASYRISKRRATVHDPQTGKLTTAQYRVSKSAWL 447
Query: 97 YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL 156
+HP + I RI+D+T L + E LQ+ NYG+GG Y+ H D +DE
Sbjct: 448 ---TGYEHPVIETINQRIEDLTGLEVDTAEE----LQVANYGVGGQYEPHFDFGRKDEPD 500
Query: 157 W--------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
R+A+++FY++DV GGAT+FP + V+P+KGSAVFWYN + DY
Sbjct: 501 AFKELGTGNRIATWLFYMSDVAAGGATVFPDVGAAVWPQKGSAVFWYNLFTSGEGDYSTR 560
Query: 209 HSGCPVALGNKW 220
H+ CPV +GNKW
Sbjct: 561 HAACPVLVGNKW 572
>gi|195452730|ref|XP_002073475.1| GK13125 [Drosophila willistoni]
gi|194169560|gb|EDW84461.1| GK13125 [Drosophila willistoni]
Length = 539
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 78/231 (33%), Positives = 120/231 (51%), Gaps = 23/231 (9%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G + L+C + L L++EEL+ DP VV++H+ + ++
Sbjct: 287 LYEQVCRGETRPSAKSQRELRC---RLQRSRLSYEVLELEELHQDPFVVQVHNIVSQKDM 343
Query: 62 NRIIELSKGKVERGKVV----NYGDTIYVDTRLSK-VYFLYPEIFGDHPFLYKIQTRIQD 116
N + ++++ ++R +V N +T+ R SK F Y E H + + + D
Sbjct: 344 NLLQKIARPNIQRSQVYAQDHNANETVAAAYRTSKGATFEYFE----HRSMELLSRHVAD 399
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDA-------TPRDEGLWRLASFMFYLTDV 169
++ L + E LQI NYG+GGHY+ H D P D R+A+ ++YL++V
Sbjct: 400 LSGLDMNSAEL----LQIANYGIGGHYEPHWDCFPDHHVYLPDDRDGNRIATGIYYLSEV 455
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GG T FP L L V PE+GS VFWYN H + DYR H+ CPV G+KW
Sbjct: 456 EAGGGTAFPFLPLLVTPERGSLVFWYNLHRSGDQDYRTKHAACPVLQGSKW 506
>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
Length = 537
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 121/226 (53%), Gaps = 18/226 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G V + L+C Y ++ + + PLK+EE LDP V HD + +I+
Sbjct: 289 YEEVCRGE--VQPIARQELRCRYSRGSHPYRILAPLKLEEHSLDPYVASFHDMLSPRKIS 346
Query: 63 RIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
++ E++ +++R V G R+SK +L E HP + + ++D T L
Sbjct: 347 QLREMAVPRMQRSTVNPRPGGQHKKSAFRVSKNAWLAYEA---HPTMAGMLRDLKDATGL 403
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
+ + LQ+ NYG+GGHY+ H D P EG R+A+ +FYL++VE GGA
Sbjct: 404 ----DTTFCEQLQVANYGVGGHYEPHWDFFRDPSHYPAAEGN-RIATAIFYLSEVEQGGA 458
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP L+ V P+ G+ +FWYN H + DYR H+GCPV G+KW
Sbjct: 459 TAFPFLDFAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504
>gi|195575105|ref|XP_002105520.1| GD21524 [Drosophila simulans]
gi|194201447|gb|EDX15023.1| GD21524 [Drosophila simulans]
Length = 448
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/201 (37%), Positives = 107/201 (53%), Gaps = 12/201 (5%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y + + FL++ PLK+E L LDP +V HD + D +I I ++KG++ R V
Sbjct: 255 TKLYCLYNTTASYFLRLAPLKMELLSLDPYMVLFHDVVSDKDIVSIRNMAKGRLARAVTV 314
Query: 79 NYGDTIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
+ D R +K +L + + ++ QDMTN I + P Q+ NY
Sbjct: 315 SKDGNYTEDPDRTTKGTWL----VENSKLIQRLSQLTQDMTNFEIHDAD----PFQVLNY 366
Query: 138 GLGGHYDLHCD---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
G+GG Y +H D D R+A+ +FYL+DV GGATIFP L L+VFP+KGSA+ W
Sbjct: 367 GIGGFYGIHLDFLGEAELDNFSDRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLW 426
Query: 195 YNAHANTLLDYRMYHSGCPVA 215
YN D R HS CP
Sbjct: 427 YNLDHKGDGDNRTAHSACPTV 447
>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
Length = 584
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 85/229 (37%), Positives = 120/229 (52%), Gaps = 17/229 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E+Y C+ P +L C Y + + F KIGP+K E L DPR+V +D I+ SE
Sbjct: 334 ELYESLCRNENPFPTVPSHHLTCRYYT-PHAFFKIGPVKEETLNPDPRIVMWYDLIFPSE 392
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDT--RLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I +I EL+ ++ R V N I R SK +L P + +I RI+ +T
Sbjct: 393 IEKIKELATPRLRRATVKNPVTGILEIAFYRTSKSAWL-PHSMSE--ITDQISQRIRAVT 449
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVEL 171
L + E LQ+ NYGLGGHY H D R++ + R+A+ +FYL+DV+
Sbjct: 450 GLSLETAE----DLQVGNYGLGGHYAPHFDFGRKREKDAFEVKNGNRIATIIFYLSDVQA 505
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+F + V P+KG+A FW+N N D R H+ CPV G+KW
Sbjct: 506 GGATVFNRIGTRVVPKKGAAGFWFNLLPNGEGDLRTRHAACPVLAGSKW 554
>gi|297301157|ref|XP_001103971.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Macaca
mulatta]
Length = 512
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/222 (34%), Positives = 116/222 (52%), Gaps = 25/222 (11%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFP 178
L + E LQ+ NYG+GG Y+ H D ++DV GGAT+FP
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFD--------------FARMSDVSAGGATVFP 447
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 448 EVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 489
>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
Length = 522
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 81/227 (35%), Positives = 123/227 (54%), Gaps = 14/227 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCF-YESYNNTFL-KIGPLKVEELYLDPRVVKIHDAIYD 58
E+ L ++ ++ +L+CF ++ + + F ++GP KVEE+ P VV+ D + D
Sbjct: 275 ELCQLGYNNEHTIRDNNDDSLRCFLFKGHEDDFFSQLGPWKVEEIAKQPYVVRFFDILND 334
Query: 59 SEINRIIELSKGKVERGKVVNYG--DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
+EIN + L + K+ R V + + D R+SK +L E D + K RI
Sbjct: 335 NEINSLERLGEEKLARATVFDPATHKLVNADYRVSKSAWLKDE---DSDTVEKYNRRISR 391
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGG 173
+T L + Y LQ++NYG+GG Y+ H D + R+ ++ R+A+++ YLT VE GG
Sbjct: 392 LTGLDL----EYAEQLQMSNYGIGGQYEPHYDYSRREWDIYNNRRIATWLSYLTTVEQGG 447
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T+F L L + KGSAVFWYN N D R H+ CPV GNKW
Sbjct: 448 GTVFTELGLHIRSIKGSAVFWYNLLPNGSGDERTRHAACPVLRGNKW 494
>gi|195591296|ref|XP_002085378.1| GD14754 [Drosophila simulans]
gi|194197387|gb|EDX10963.1| GD14754 [Drosophila simulans]
Length = 508
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 80/220 (36%), Positives = 115/220 (52%), Gaps = 31/220 (14%)
Query: 11 LSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKG 70
LS +L C YE + FL+I PLKVE L L P +V HD IYDSEI+++ +S
Sbjct: 279 LSSVSQTSQHLSCHYEQNTSEFLRIAPLKVETLSLKPHIVLYHDVIYDSEISKVKNISLP 338
Query: 71 KVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKG 130
++ + D + + +L+ +I DH + RI+DMT G + +
Sbjct: 339 SLKSP--LRIIDAVDYNLKLA-------QIRDDHQ--SPLSLRIKDMT----GEDVQEDS 383
Query: 131 PLQINNYGLGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTVFP 186
QI+NYG+ G + H D + RL S +F++TDV GGA FP+LNLT++P
Sbjct: 384 DFQIDNYGICGFRNFHTDNIEMQDQTAELGDRLTSILFFMTDVVQGGAFAFPNLNLTIWP 443
Query: 187 EKGSAVFWYNAHANTLLDYRM------YHSGCPVALGNKW 220
+KGSA+ W N LD+RM H CPV +G+KW
Sbjct: 444 QKGSALVWRN------LDHRMQPNKDLLHVSCPVVVGSKW 477
>gi|221512810|ref|NP_649043.3| CG18234 [Drosophila melanogaster]
gi|66771545|gb|AAY55084.1| IP12246p [Drosophila melanogaster]
gi|220902636|gb|AAF49255.4| CG18234 [Drosophila melanogaster]
Length = 515
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 108/206 (52%), Gaps = 19/206 (9%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+L C YE + FL+I PLKVE L L P +V HD IYDSEI+++ +S ++ +
Sbjct: 288 QHLSCHYEKNTSEFLRIAPLKVETLSLKPHIVLYHDVIYDSEISKVKNISLPSLKSPLRI 347
Query: 79 NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
Y +D L + +I DH + RI+DMT G + + QI+NYG
Sbjct: 348 LYA----IDYNLK-----FAKIREDHQ--SPLSLRIKDMT----GEDVQEDTDFQIDNYG 392
Query: 139 LGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
+ G + H D + RL S MF++ DV GGA FP+LNLT++P+KGSA+ W
Sbjct: 393 ICGFRNFHTDNIELQDQTAELGDRLTSIMFFMNDVAQGGALAFPNLNLTIWPQKGSALVW 452
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
N + + H CPV +G+KW
Sbjct: 453 RNLDHRMQPNQDLLHVSCPVVVGSKW 478
>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
caballus]
Length = 548
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 125/228 (54%), Gaps = 13/228 (5%)
Query: 1 EIYPLACQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y CQ S P + +L C YE+ ++ FL + P++ E ++L+P VV HD + DS
Sbjct: 303 DTYEGLCQTLGSQPTHYQIPSLYCSYETNSSPFLLLQPVRKEVIHLEPYVVLYHDFVSDS 362
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
E +I L++ ++R V + + V+ R+SK +L + P L + RI +T
Sbjct: 363 EAQKIRGLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPMLVTLDHRIAALTG 419
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
L + + Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE G
Sbjct: 420 LDV--QPPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAG 477
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT F N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 478 GATAFIYANFSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKW 525
>gi|348555277|ref|XP_003463450.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cavia porcellus]
Length = 584
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/208 (36%), Positives = 117/208 (56%), Gaps = 12/208 (5%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ ++ +L + P++ E ++L+P V HD + D E +I EL++ ++R V +
Sbjct: 359 SLYCSYETNSSPYLLLQPVRKEVIHLEPYVALYHDFVSDPEAQKIRELAEPWLQRSVVAS 418
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
G + V+ R+SK +L + P L + RI +T L + Y LQ+ NYG+
Sbjct: 419 GGKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 473
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F N +V K +A+
Sbjct: 474 GGHYEPHFDHATSPSSPLFRMKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAAL 533
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FW+N H + D H+GCPV +G+KW
Sbjct: 534 FWWNLHRSGEGDGDTLHAGCPVLVGDKW 561
>gi|184185444|gb|ACC68850.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Rhinolophus ferrumequinum]
Length = 555
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 83/252 (32%), Positives = 126/252 (50%), Gaps = 39/252 (15%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N T L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI +I E++K K+ R V + G R+SK +L + P + ++ R+Q
Sbjct: 348 EEIEKIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEET---EDPVVARLNLRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
+T L + E LQ+ NYG+GG Y+ H D + P D GL
Sbjct: 405 ITGLSVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDNGLKTEGNRLATFLNYNDEHD 460
Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
R+A+F+ Y++DVE GGAT+FP L ++P+KG+AVFWYN + DYR
Sbjct: 461 VFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 520
Query: 209 HSGCPVALGNKW 220
H+ CPV +G KW
Sbjct: 521 HAACPVLVGCKW 532
>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
Length = 534
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 80/235 (34%), Positives = 121/235 (51%), Gaps = 23/235 (9%)
Query: 1 EIYPLACQGNLSVP----EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAI 56
+ Y C+G V + + ++L C Y+ + L P+ VE + L P ++ H+ +
Sbjct: 285 DFYKKLCRGGPKVKAGDNKMVSNHLTC-YQLRQHARLLFSPINVEVISLQPYILIYHNLL 343
Query: 57 YDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRI 114
D E+ + L+ ++R V N G Y R+SK +L + DHP + +I T I
Sbjct: 344 NDLEVEALKTLAAPMLQRATVHNKDTGKLEYATYRISKSAWLNDD---DHPLVRRISTLI 400
Query: 115 QDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL-----W----RLASFMFY 165
+D+T L + E LQI NYG+GGHY+ H D G W R+A+ + Y
Sbjct: 401 EDVTGLTMESAE----ALQIANYGIGGHYEPHFDHADVRSGTDVFKTWKGGNRIATMLIY 456
Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L+ VELGGAT+F S + + P +GSA FWYN H N + H+ CPV +G+KW
Sbjct: 457 LSSVELGGATVFSSAGVRIEPRQGSAAFWYNLHRNGNGNNLTRHAACPVLIGSKW 511
>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
queenslandica]
Length = 525
Score = 126 bits (316), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 74/230 (32%), Positives = 121/230 (52%), Gaps = 18/230 (7%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFY-ESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+Y C+ +P + L C+Y + N L + P+K E ++ P++ +D + D E
Sbjct: 280 VYEKLCREPAPIPSHLHKKLICYYFNNKRNPRLILSPIKTEVAFVKPKIYIFYDIVTDRE 339
Query: 61 INRIIELSKGKVERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLY--KIQTRIQDM 117
I R+ EL+ K+ R V G+ ++ R+SK +L D P Y +I RI+D+
Sbjct: 340 IERLKELANPKLNRATVHGENGELLHATYRISKSGWLSG---SDDPLGYVDRIDQRIEDV 396
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVE 170
T L + E+ LQ+ NYG+GG Y+ H D E + R+++ + Y++DVE
Sbjct: 397 TGLTMSTAEQ----LQVVNYGIGGQYEPHYDFARTGEDTFTSLGSGNRISTLLIYMSDVE 452
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT+FP + + P K +A +W+N + DY H+GCPV +G+KW
Sbjct: 453 KGGATVFPGVGARLVPIKRAAAYWWNLKRSGDGDYSTRHAGCPVLVGSKW 502
>gi|390178051|ref|XP_002137433.2| GA30144 [Drosophila pseudoobscura pseudoobscura]
gi|388859305|gb|EDY67991.2| GA30144 [Drosophila pseudoobscura pseudoobscura]
Length = 546
Score = 126 bits (316), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 71/218 (32%), Positives = 116/218 (53%), Gaps = 20/218 (9%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
++ CQG +P ++S+L+C Y + + FL++ PL++E L DP V H+ + +E
Sbjct: 267 VHQRNCQGRSRLP--VQSSLRCHYSAEGSAFLRLAPLRMELLSRDPLVAVYHEVVSAAEQ 324
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
++ LS+ +++R + Y D I S P + ++ R++D+T L
Sbjct: 325 RHLMLLSESQLQRQRGHQY-DKIRTFASASVAANATPTV-------EQLHRRLEDITGLD 376
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL------WRLASFMFYLTDVELGGAT 175
+ E PL+I NYG+GG Y +H D + +RLA+ + YL+DV LGG T
Sbjct: 377 LAESE----PLRILNYGIGGQYYIHVDCEQPQTHVEPYPKEYRLATVLLYLSDVRLGGFT 432
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 213
FP+L L + P +GSA+ W+NA+ DYR H+ CP
Sbjct: 433 SFPALGLGIRPNRGSALVWHNANNAGNCDYRALHAACP 470
>gi|194765172|ref|XP_001964701.1| GF23326 [Drosophila ananassae]
gi|190614973|gb|EDV30497.1| GF23326 [Drosophila ananassae]
Length = 885
Score = 125 bits (315), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 77/219 (35%), Positives = 109/219 (49%), Gaps = 22/219 (10%)
Query: 4 PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
P C G + K +L C Y + + FL++ P+K E L DP + HD +Y E+ R
Sbjct: 660 PRCCNGRCEIAR--KFSLYCLYNTKTSPFLRLAPIKTELLSKDPYIAIFHDVVYPKELTR 717
Query: 64 IIELSKGKVERGKVVNYGDTIY-VDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
I K + +NY Y VD+ R SK ++ + + +I + D T L
Sbjct: 718 IRTACKSHLIASTTINYTSNAYSVDSYRTSKSVWIPTD---SNNLTQRITNLVGDATGLE 774
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN 181
+ E + Q+ NYG+GG ++ H D + L+DVE GGATIF LN
Sbjct: 775 MTTSEMF----QVINYGIGGLFEAHMDPVLSNA-----------LSDVEQGGATIFTKLN 819
Query: 182 LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
LTVFP+ GSA+FWYN D R H+GCPV +G+KW
Sbjct: 820 LTVFPQSGSALFWYNLDNWGNEDKRTEHAGCPVIVGSKW 858
Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 77/170 (45%), Gaps = 13/170 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E++ L C G + I+ NL CFY++ + L I P+K E L +DP + HD I E
Sbjct: 288 EVFSLCCNGKCQKDKKIQ-NLYCFYDTKTSNALIIAPVKKEILSVDPYIALFHDVISQKE 346
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ +SK + ++ + + R+SK + Y + D + R+
Sbjct: 347 QKILQSVSKIHLMASTTIHNNNKAVKNYRISKSVW-YASDYND------VTKRLTTFMEQ 399
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFY 165
G + + Q+ NYGLGG +D H D D+ + R+A+ +FY
Sbjct: 400 ATGYDMKSSELFQVINYGLGGRFDGHEDYLLTDKTRFNGTSDRIATTLFY 449
>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
africana]
Length = 544
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/222 (36%), Positives = 121/222 (54%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ +N +L + P + E ++L+P VV HD + D E +I
Sbjct: 305 CQTLGSQPTHYQIPSLYCSYETNSNPYLLLQPFRKEVIHLEPYVVLYHDFVNDMEAQKIK 364
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L++ ++R V + + VD R+SK +L + P L + RI +T L + +
Sbjct: 365 GLAEPWLQRSVVASGEKQLQVDYRISKSAWLKDSV---DPMLVTLDHRIAALTGLDV--Q 419
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 420 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSAVEAGGATAFI 479
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N ++ K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 480 YANFSMPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 521
>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
Length = 533
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 77/231 (33%), Positives = 118/231 (51%), Gaps = 24/231 (10%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
+ +C G L P + L CFY FL++ PLK E++ L P VV H+ + EI+
Sbjct: 286 FKTSCNGLLEKP----TRLHCFYNFTTTPFLRLAPLKTEQIGLKPYVVLYHEVLSAREIS 341
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
++ + ++ +V + R +K Y+L E + +I RI DMT +
Sbjct: 342 MLMGKAAQNMKNTRVQSEKAVNTNRERTAKGYWLKKE---SNEMTRRITRRIVDMTGFDL 398
Query: 123 GREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW---------RLASFMFYLTDV 169
E + Q+ NYG+GGHY LH D A+ G R+A+ +FYLTDV
Sbjct: 399 ADSEDF----QVINYGIGGHYSLHFDYFGFASSNYTGERSHHSIVLGDRIATVLFYLTDV 454
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGAT+F ++ +V+P+ G+A+FWYN + D H+ CPV +G+KW
Sbjct: 455 EQGGATVFGNVGYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPVVVGSKW 505
>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
Length = 534
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 113/221 (51%), Gaps = 21/221 (9%)
Query: 14 PEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVE 73
P + + L CFY FL++ PLK+E++ LDP VV H+ + EI+ +I + ++
Sbjct: 293 PLESSTRLHCFYNFTTTPFLRLAPLKIEQIGLDPYVVLYHEVLSAREISMLIGKAAQNMK 352
Query: 74 RGKV-VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
+V G R +K ++ E + I RI DMT + E +
Sbjct: 353 NTRVHKEQGVPKKNRGRTAKGFWFKKE---SNELTKGITRRIMDMTGFDLADSEGF---- 405
Query: 133 QINNYGLGGHYDLHCD--------ATPRDEGLW-----RLASFMFYLTDVELGGATIFPS 179
Q+ NYG+GGHY LH D T G R+A+ +FYLTDVE GGAT+F
Sbjct: 406 QVINYGIGGHYLLHMDYFDFASSNHTDTRSGYSMDLGDRIATVLFYLTDVEQGGATVFAD 465
Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ +V+P+ G+A+FWYN N D R H+ CPV +G+KW
Sbjct: 466 VGYSVYPQAGTAIFWYNLDTNGKGDPRTRHAACPVIVGSKW 506
>gi|195159297|ref|XP_002020518.1| GL13472 [Drosophila persimilis]
gi|194117287|gb|EDW39330.1| GL13472 [Drosophila persimilis]
Length = 526
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 79/236 (33%), Positives = 113/236 (47%), Gaps = 23/236 (9%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y L C G+ + + +L+C Y + + FL + PLK EEL DP +V HD IY SE
Sbjct: 253 EAYRLTCSGHSRLTAREQRHLRCGYMTETHPFLLLAPLKAEELSHDPLLVLYHDVIYQSE 312
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I +L+ ++ R V + + R S++ F+ +H L I R+ DMTNL
Sbjct: 313 IDVIRQLTTNRMARAMVTLTNQSTVSNVRTSQITFIAK---TEHEVLQTIDRRVADMTNL 369
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVE 170
+ Y Q NYG+GGHY H D T D GL R+A+ +FY +
Sbjct: 370 NMD----YAEDHQFANYGIGGHYGQHMDWFTETTFDNGLVSSTEMGNRIATVLFYNISLN 425
Query: 171 ------LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ A P L + +K +A FW+N HA D R H CP+ G+KW
Sbjct: 426 SSRMWLMSAALTCPYLKQHLRLKKYAAAFWHNLHAAGRGDARTQHGACPIIAGSKW 481
>gi|390176836|ref|XP_003736216.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388858809|gb|EIM52289.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 567
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 78/212 (36%), Positives = 112/212 (52%), Gaps = 16/212 (7%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y + FL++ PL++EEL LDP +V H+ + D E+ R+ +S + R +V
Sbjct: 330 ARLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARVF 389
Query: 79 NYG---DTIYVDTRLSKVYFLYPEIFG-DHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
+ G I +V P++ D + +IQ R+ D+T LV+ R +Q
Sbjct: 390 DSGIRKPKISPARTADEVQIPNPKLVAEDIQLVERIQKRMTDLTGLVLTSMRR----IQF 445
Query: 135 NNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
YG GG Y H D T R G R+A+ +FYL DVE GGAT FP+L+L V E+
Sbjct: 446 LKYGFGGIYVPHHDFFSVHTPTSRLHGD-RIATVIFYLNDVEHGGATAFPNLDLVVPTER 504
Query: 189 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
G+ +FW+N T LDYR H CPV +G K
Sbjct: 505 GAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 536
>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
Length = 534
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 77/222 (34%), Positives = 114/222 (51%), Gaps = 23/222 (10%)
Query: 14 PEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVE 73
P + + L CFY FL++ PLK+E++ LDP VV H+ + EI+ +I + ++
Sbjct: 293 PLESSTRLHCFYNFTTTPFLRLAPLKIEQIGLDPYVVLYHEVLSAREISMLIGKATQNMK 352
Query: 74 RGKV-VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
+V G R +K ++ E + I RI DMT + E +
Sbjct: 353 NTRVHKEQGVPKKNRGRTAKGFWFKKE---SNELTKGITRRIMDMTGFDLADSEGF---- 405
Query: 133 QINNYGLGGHYDLHCD--------------ATPRDEGLWRLASFMFYLTDVELGGATIFP 178
Q+ NYG+GGHY LH D + D G R+A+ +FYLTDVE GGAT+F
Sbjct: 406 QVINYGIGGHYLLHMDYFDFASSNHTDTRSSYSMDLGD-RIATVLFYLTDVEQGGATVFA 464
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ +V+P+ G+A+FWYN N D R H+ CPV +G+KW
Sbjct: 465 DVGYSVYPQAGTAIFWYNLDTNGKGDPRTKHAACPVIVGSKW 506
>gi|198449518|ref|XP_002136915.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198130643|gb|EDY67473.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 543
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 78/212 (36%), Positives = 112/212 (52%), Gaps = 16/212 (7%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y + FL++ PL++EEL LDP +V H+ + D E+ R+ +S + R +V
Sbjct: 306 ARLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARVF 365
Query: 79 NYG---DTIYVDTRLSKVYFLYPEIFG-DHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
+ G I +V P++ D + +IQ R+ D+T LV+ R +Q
Sbjct: 366 DSGIRKPKISPARTADEVQIPNPKLVAEDIQLVERIQKRMTDLTGLVLTSMRR----IQF 421
Query: 135 NNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
YG GG Y H D T R G R+A+ +FYL DVE GGAT FP+L+L V E+
Sbjct: 422 LKYGFGGIYVPHHDFFSVHTPTSRLHGD-RIATVIFYLNDVEHGGATAFPNLDLVVPTER 480
Query: 189 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
G+ +FW+N T LDYR H CPV +G K
Sbjct: 481 GAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 512
>gi|195591298|ref|XP_002085379.1| GD14755 [Drosophila simulans]
gi|194197388|gb|EDX10964.1| GD14755 [Drosophila simulans]
Length = 515
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 84/226 (37%), Positives = 124/226 (54%), Gaps = 29/226 (12%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
L CQG P+ KSNL C Y S N FL++ PLK+EE+ DP +V H+ I D EI +
Sbjct: 285 LGCQG--LFPK--KSNLVCRYNSSTNAFLQLAPLKMEEVSRDPYIVLFHEMISDKEIEEM 340
Query: 65 IELSKGKVERGKVVNYGDTIYVDTR--LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
KG++ + G T D++ +S+VY++ E F +I RI DMT +
Sbjct: 341 ----KGEITE---MENGWTSLGDSKEIVSRVYWIRKE----SSFSKRINQRISDMTGFKL 389
Query: 123 GREERYKGPLQINNYGLGG----HYDLHCDATPR---DEGLW-RLASFMFYLTDVELGGA 174
E + +Q+ N+G+GG HYD + D + L R+ S +FY +V GG
Sbjct: 390 ---EEFPA-IQLANFGVGGYFKPHYDYYTDRLKEVDVNNTLGDRIGSIIFYAGEVSQGGQ 445
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T+FP L + V P+KG+A+FW+NA ++ D R HS CPV +G++W
Sbjct: 446 TVFPDLKVAVEPKKGNALFWFNAFDDSSPDPRTLHSVCPVIVGSRW 491
>gi|195166681|ref|XP_002024163.1| GL22882 [Drosophila persimilis]
gi|194107518|gb|EDW29561.1| GL22882 [Drosophila persimilis]
Length = 534
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 79/227 (34%), Positives = 118/227 (51%), Gaps = 23/227 (10%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G ++NL C Y FL++ PLK+EE+ DP +V H+ + D EI
Sbjct: 296 YEIGCRGLFPK----RTNLVCRYNFTTTPFLRLAPLKMEEVNHDPYIVMYHEVLSDREIE 351
Query: 63 RIIELSKGKVERGKVVN-YGDTIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ KG+ G++ N + D + T++ + + ++ RI DMTN
Sbjct: 352 EM----KGR--SGQMSNGWADQKEANSTKIRDIVCRHTWWREQSAIKERVNRRISDMTNF 405
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGG 173
+E LQ+ NYGLG H+ H D TP L RL S +FY +DV GG
Sbjct: 406 DFPPQE----DLQVANYGLGTHFKPHYDYTSDGYETPDVLTLGDRLGSIIFYASDVPQGG 461
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
AT+FP +++FP KGS+VFWYN + + +D R HS CPV +G++W
Sbjct: 462 ATVFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGDRW 508
>gi|195392288|ref|XP_002054791.1| GJ24631 [Drosophila virilis]
gi|194152877|gb|EDW68311.1| GJ24631 [Drosophila virilis]
Length = 499
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/228 (32%), Positives = 121/228 (53%), Gaps = 32/228 (14%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G+ S+P + S+L+C Y + + FL++ PLK+E+L LDP +V HD + +E I++
Sbjct: 267 CRGH-SLPL-VSSSLRCRYNTASAPFLRLAPLKLEQLSLDPYMVLYHDVVQANEREHIMQ 324
Query: 67 LSKGKVER---GKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
L+K + R G + ++ S + D +++ R++DM+ G
Sbjct: 325 LAKPHLRRALVGAARAHSQRFAMNAGFS---------YNDSRQGQRLRQRLEDMS----G 371
Query: 124 REERYKGPLQINNYGLGGHYDLHCD-----------ATPRDEGLWRLASFMFYLTDVELG 172
+ G L + NYG+GG Y +H D A+ +D R+A+ + YLTDV+LG
Sbjct: 372 FDLTNSGQLAVLNYGIGGQYYMHYDCWFSQDDAAQVASIKDN---RIATILLYLTDVQLG 428
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G T FP+L L V P GSA+ W+N + D R H+ CP+ LG +W
Sbjct: 429 GLTSFPALGLAVQPSPGSALIWHNMNNAAECDRRTLHAACPLLLGTRW 476
>gi|194905381|ref|XP_001981186.1| GG11928 [Drosophila erecta]
gi|190655824|gb|EDV53056.1| GG11928 [Drosophila erecta]
Length = 543
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 114/228 (50%), Gaps = 17/228 (7%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
++P C V ++ + L C Y + FL++ P+K E L LDP V+ +HD + E
Sbjct: 289 LHPPCCSARCEVVRNL-TRLYCVYNRVTSPFLQLAPIKTEILSLDPFVLLLHDMVRQKES 347
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDT----RLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
I SK + + ++ N + D R SK + Y F D KI R+ D
Sbjct: 348 TLIRASSKEHLLQSEITNTDASSSEDNVAIFRTSKSVW-YSSDFND--TTKKITERLADA 404
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELG 172
T L + E + Q+ NYGLGG + H D D+ + R+A+ +FYL V G
Sbjct: 405 TGLDMHFTEYF----QVINYGLGGFFATHLDMLLSDKTRFNGTSDRIATTVFYLNGVRQG 460
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT FP LNLTVFP+ GSA+FWYN H+GCPV +G+KW
Sbjct: 461 GATHFPLLNLTVFPQPGSALFWYNLDTKGNDQRSTMHTGCPVIVGSKW 508
>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
Length = 409
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 79/231 (34%), Positives = 118/231 (51%), Gaps = 24/231 (10%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G L+ + NL+C + L P K+EEL+LDP VV++H I +
Sbjct: 158 MYEQVCRGELAPLPSKQRNLRC---RLRKSRLGYAPFKLEELHLDPLVVQLHQVIGSKDS 214
Query: 62 NRIIELSKGKVERGKVV----NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR-IQD 116
+ + + ++ +++R V N G T F Y K+ +R + D
Sbjct: 215 DSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAAT-----KLLSRHVGD 269
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDV 169
+ L + Y LQ+ NYG+GGHY+ H D+ P + EG R+A+ ++YL DV
Sbjct: 270 FSGLNMD----YAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHGNRMATGIYYLADV 325
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GG T FP L L V PE+GS +FWYN H + D+R H+ CPV G+KW
Sbjct: 326 EAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 376
>gi|195061021|ref|XP_001995909.1| GH14207 [Drosophila grimshawi]
gi|193891701|gb|EDV90567.1| GH14207 [Drosophila grimshawi]
Length = 477
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 77/207 (37%), Positives = 112/207 (54%), Gaps = 10/207 (4%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y+ ++ FL++ PLK+E L +DP VV H+AIYDSEI+ + L + ++ R ++
Sbjct: 261 SRLICNYKMDSSPFLRLAPLKMEMLSMDPYVVVFHEAIYDSEIDELRRLCESRLSRTEIA 320
Query: 79 NYGDTIYVDTRLSKVYFLYPEIFGDH-PFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
G + + S V+ ++ L +I+ R+ DM+ L+I + +Q Y
Sbjct: 321 KQGKNKSIRSS-SGVWIFELDLNRQQLELLERIRRRVADMSGLLIDFNSQ---EVQYMEY 376
Query: 138 GLGGHYDLHCD--ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWY 195
GGHY H D P E R+A+ +FYL DV GGATIFP L L V PE+G + W+
Sbjct: 377 VFGGHYYPHWDFKGIPHLED--RIATVLFYLNDVARGGATIFPDLELLVQPERGKVLHWH 434
Query: 196 NAHANTL-LDYRMYHSGCPVALGNKWG 221
N T L+ R H CPV +G K G
Sbjct: 435 NMDLGTYDLEKRSLHGACPVIMGKKEG 461
>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
Length = 535
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 79/231 (34%), Positives = 119/231 (51%), Gaps = 24/231 (10%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G L+ + NL+C + L P K+EEL+LDP VV++H I +
Sbjct: 284 MYEQVCRGELAPLPSKQRNLRC---RLRKSRLGYAPFKLEELHLDPLVVQLHQVIGSKDS 340
Query: 62 NRIIELSKGKVERGKVV----NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR-IQD 116
+ + + ++ +++R V N G T F Y K+ +R + D
Sbjct: 341 DSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAAT-----KLLSRHVGD 395
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDV 169
+ L + Y LQ+ NYG+GGHY+ H D+ P + EG R+A+ ++YL+DV
Sbjct: 396 FSGLNMD----YAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHGNRMATGIYYLSDV 451
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GG T FP L L V PE+GS +FWYN H + D+R H+ CPV G+KW
Sbjct: 452 EAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
>gi|386771382|ref|NP_649044.3| CG18233 [Drosophila melanogaster]
gi|383291998|gb|AAF49254.3| CG18233 [Drosophila melanogaster]
Length = 515
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 79/213 (37%), Positives = 117/213 (54%), Gaps = 25/213 (11%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
KSNL C Y S N FLK+ PLK+EE+ DP +V H+ I D +I + KG++
Sbjct: 294 KSNLVCRYNSSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKDIEEM----KGEITE--- 346
Query: 78 VNYGDTIYVDTR--LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
+ G T D + +S+VY++ E F +I RI DMT + E + +Q+
Sbjct: 347 MENGWTSLGDPKEIVSRVYWIRKE----SSFSKRINQRISDMTGFKL---EEFPA-IQLA 398
Query: 136 NYGLGG----HYDLHCDATPR---DEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
N+G+GG HYD + D + L R+ S +FY +V GG T+FP L + V P+
Sbjct: 399 NFGVGGYFKPHYDFYTDRLKEVDVNNTLGDRIGSIIFYAGEVSQGGQTVFPDLKVAVEPK 458
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
KG+A+FW+NA ++ D R HS CPV +G++W
Sbjct: 459 KGNALFWFNAFDDSTPDPRSLHSVCPVLVGSRW 491
>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
Length = 535
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 79/231 (34%), Positives = 118/231 (51%), Gaps = 24/231 (10%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G L+ + NL+C + L P K+EEL+LDP VV++H I +
Sbjct: 284 MYEQVCRGELAPLPSKQRNLRC---RLRKSRLGYAPFKLEELHLDPLVVQLHQVIGSKDS 340
Query: 62 NRIIELSKGKVERGKVV----NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR-IQD 116
+ + + ++ +++R V N G T F Y K+ +R + D
Sbjct: 341 DSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAAT-----KLLSRHVGD 395
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDV 169
+ L + Y LQ+ NYG+GGHY+ H D+ P + EG R+A+ ++YL DV
Sbjct: 396 FSGLNMD----YAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHGNRMATGIYYLADV 451
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GG T FP L L V PE+GS +FWYN H + D+R H+ CPV G+KW
Sbjct: 452 EAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
>gi|198449648|ref|XP_001357666.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
gi|198130700|gb|EAL26801.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
Length = 536
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/229 (32%), Positives = 115/229 (50%), Gaps = 21/229 (9%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G L+ + +L+C + P K+EEL+ DP +V++HD + E
Sbjct: 286 MYEQVCRGELTPSPTAQRHLRCRLQRRR---FDYAPFKLEELHADPPIVQVHDMVSQRES 342
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDT--RLSK-VYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ ++ +++R V N R S+ F Y + Y R+
Sbjct: 343 LFLQNAARPRIQRSTVYNQAGAGTTAAAFRTSQGASFNYSQ--------YATTQRLSQHV 394
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPR-----DEGLW--RLASFMFYLTDVEL 171
+ G + Y LQI NYG+GGHY+ H D+ P ++ L+ RLA+ ++YL+DV
Sbjct: 395 ADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEHHEYPEDDLYGNRLATAIYYLSDVVA 454
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L L V PE+GS +FWYN H + D+R H+ CPV G+KW
Sbjct: 455 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 503
>gi|195113247|ref|XP_002001179.1| GI22114 [Drosophila mojavensis]
gi|193917773|gb|EDW16640.1| GI22114 [Drosophila mojavensis]
Length = 487
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 76/213 (35%), Positives = 114/213 (53%), Gaps = 17/213 (7%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y++ + FL + P K+E L DP +V HD IY+SEI + +SK ++R VV
Sbjct: 277 TRLVCSYKTKPSKFLYLAPFKMELLSEDPYMVVFHDVIYESEIEHLNRISKPFLQRATVV 336
Query: 79 ---NYGDTIYVDTRLSKVYFLYPEIFG--DHPFLYKIQTRIQDMTNLVIGREERYKGPLQ 133
N DT+ + R + FLY + D + +I R++DM++L I + +
Sbjct: 337 VEDNSEDTL-IKFRTANGAFLYRDKISPKDVQLVERIFQRMRDMSDLQINDD-----AFE 390
Query: 134 INNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
Y GGHYD+H D + + R A+F+ YL DV GGAT+FP + + V PE+G
Sbjct: 391 YLKYDFGGHYDIHADYFNYTDDQFTDDRFATFVIYLNDVARGGATVFPDVEIAVHPERGK 450
Query: 191 AVFWYNAHANTLLDYRM--YHSGCPVALGNKWG 221
+ WYN + + DY + YH CPV +G K G
Sbjct: 451 VIHWYNMNPKS-FDYELHSYHGACPVLIGQKIG 482
>gi|195159319|ref|XP_002020529.1| GL14044 [Drosophila persimilis]
gi|194117298|gb|EDW39341.1| GL14044 [Drosophila persimilis]
Length = 536
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/229 (32%), Positives = 115/229 (50%), Gaps = 21/229 (9%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G L+ + +L+C + P K+EEL+ DP +V++HD + E
Sbjct: 286 MYEQVCRGELTPSPTAQRHLRCRLQRRR---FDYAPFKLEELHADPPIVQVHDMVSQRES 342
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDT--RLSK-VYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ ++ +++R V N R S+ F Y + Y R+
Sbjct: 343 LFLQNAARPRIQRSTVYNQAGAGTTAAAFRTSQGASFNYSQ--------YATTQRLSQHV 394
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPR-----DEGLW--RLASFMFYLTDVEL 171
+ G + Y LQI NYG+GGHY+ H D+ P ++ L+ RLA+ ++YL+DV
Sbjct: 395 ADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEHHEYPEDDLYGNRLATAIYYLSDVVA 454
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L L V PE+GS +FWYN H + D+R H+ CPV G+KW
Sbjct: 455 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 503
>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
Length = 536
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 114/214 (53%), Gaps = 22/214 (10%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y + FL++ PL++EEL LDP +V H+ + D+EI ++ +++ + K +
Sbjct: 304 SRLHCRYNATTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEIAKVERVAEPLL---KSI 360
Query: 79 NYGDTIYVDTRLSKVYFLYPEIFGDH-------PFLYKIQTRIQDMTNLVIGREERYKGP 131
G+ +++ SKV D P + +I RI DMT L+I R +
Sbjct: 361 GVGEMD--NSKKSKVRTALGAWIPDENMHISGWPVIQRIVRRIHDMTGLIIKRGQ----V 414
Query: 132 LQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFP 186
+Q+ YG GGHYD H D + P + L R+A+ +FYL DV+ GG+T+FP L L V
Sbjct: 415 VQLIKYGYGGHYDTHFDYLNDSLPITQALGDRMATVLFYLNDVKHGGSTVFPVLQLKVPS 474
Query: 187 EKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
E+G + WYN H T LD R H CPV G K
Sbjct: 475 ERGKVLVWYNMHGETHDLDSRTLHGSCPVIDGAK 508
>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
Length = 535
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 117/230 (50%), Gaps = 22/230 (9%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G L+ + +L+C + L P K+EEL+LDP VV++H I ++
Sbjct: 284 MYEQVCRGELAPLSSKQRSLRC---RLRKSRLGYAPFKLEELHLDPLVVQLHQVIGSNDS 340
Query: 62 NRIIELSKGKVERGKVV----NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
+ + ++ +++R V N G T F Y + + + D
Sbjct: 341 ESLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSR----NAATKLLSHHVGDF 396
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDVE 170
+ L + Y LQ+ NYG+GGHY+ H D+ P + EG R+A+ ++YL+DVE
Sbjct: 397 SGLNMD----YAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHGNRIATGIYYLSDVE 452
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L L V PEKGS +FWYN H + D+R H+ CPV G+KW
Sbjct: 453 AGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
>gi|198428011|ref|XP_002120302.1| PREDICTED: similar to prolyl 4-hydroxylase alpha-2 subunit, partial
[Ciona intestinalis]
Length = 233
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 115/219 (52%), Gaps = 21/219 (9%)
Query: 18 KSNLKC---FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
KSNLK F+ + N L I P+K EEL P VV+ +D + D + II L+ + R
Sbjct: 3 KSNLKLKCYFHNGWKNPRLLIQPIKSEELCDSPHVVRFYDVLSDRDSEEIIRLAAPLMFR 62
Query: 75 GKVVNYGDTIYVD----TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKG 130
V GD ++ R+ K +L + P + TR+ D+T L +G E
Sbjct: 63 SGVT--GDDGAINDNPMERVGKNAWL-----DNSPVVNNFMTRVADITGLNVGAEIY--- 112
Query: 131 PLQINNYGLGGHYDLHCDATPRDEGLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
LQ+ NYG+GGH+D H D T E + R+A+F+ Y +DVE GG T F + P K
Sbjct: 113 -LQVANYGIGGHFDPHIDETGGYENIMERRIATFLTYFSDVEYGGNTPFVYQEVVAEPIK 171
Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW-GKLLLS 226
GSA+FWY+ + D R H+ CPV LGNKW G L L+
Sbjct: 172 GSAIFWYDVFNDGSADERTEHAACPVVLGNKWAGNLWLT 210
>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
Length = 534
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 81/239 (33%), Positives = 119/239 (49%), Gaps = 39/239 (16%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
+ L+C G P + + L CFY FL++ PLK E++ LDP VV H+ + EI+
Sbjct: 286 FKLSCNG----PLESSTRLHCFYNFTTTPFLRLAPLKTEQIGLDPYVVLYHEVLSAREIS 341
Query: 63 RII-----ELSKGKVERGKVV---NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRI 114
+I + K+ + + V N G R +K ++L E + +I RI
Sbjct: 342 MLIGKAAQNMKNTKIHKERAVPKKNRG-------RTAKGFWLKKE---SNELTKRITRRI 391
Query: 115 QDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW---------RLAS 161
DMT + E + Q+ NYG+GGHY LH D A+ R+A+
Sbjct: 392 MDMTGFDLADSEGF----QVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDLGDRIAT 447
Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+FYLTDVE GGAT+F + V P+ G+A+FWYN + D R H+ CPV +G+KW
Sbjct: 448 VLFYLTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKW 506
>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
Length = 534
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 79/236 (33%), Positives = 118/236 (50%), Gaps = 33/236 (13%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
+ L+C G P + + L CFY FL++ PLK E++ LDP VV H+ + EI+
Sbjct: 286 FKLSCNG----PLESSTRLHCFYNFTTTPFLRLAPLKTEQIGLDPYVVLYHEVLSAREIS 341
Query: 63 RII-----ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
+I + K+ + + V + R +K ++L E + +I RI DM
Sbjct: 342 MLIGKAAQNMKNTKIHKERAVPKKNR----GRTAKGFWLKKE---SNELTKRITRRIMDM 394
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW---------RLASFMF 164
T + E + Q+ NYG+GGHY LH D A+ R+A+ +F
Sbjct: 395 TGFDLADSEGF----QVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDLGDRIATVLF 450
Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
YLTDVE GGAT+F + V P+ G+A+FWYN + D R H+ CPV +G+KW
Sbjct: 451 YLTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKW 506
>gi|195069738|ref|XP_001997014.1| GH23597 [Drosophila grimshawi]
gi|193892024|gb|EDV90890.1| GH23597 [Drosophila grimshawi]
Length = 239
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 111/205 (54%), Gaps = 10/205 (4%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y+ ++ FL++ PLK+E L +DP VV H+AIYDSEI+ + L + ++ R ++
Sbjct: 21 SRLICNYKMDSSPFLRLAPLKMEMLSMDPYVVVFHEAIYDSEIDELRRLCESRLSRTEIA 80
Query: 79 NYGDTIYVDTRLSKVYFLYPEIFGDH-PFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
G + + S V+ ++ L +I+ R+ DM+ L+I + +Q Y
Sbjct: 81 KQGKNKSIRSS-SGVWIFELDLNRQQLELLERIRRRVADMSGLLIDFNSQ---EVQYMEY 136
Query: 138 GLGGHYDLHCD--ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWY 195
GGHY H D P E R+A+ +FYL DV GGATIFP L L V PE+G + W+
Sbjct: 137 VFGGHYYPHWDFKGIPHLED--RIATVLFYLNDVARGGATIFPDLELLVQPERGKVLHWH 194
Query: 196 NAHANTL-LDYRMYHSGCPVALGNK 219
N T L+ R H CPV +G K
Sbjct: 195 NMDLGTYDLEKRSLHGACPVIMGKK 219
>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
harrisii]
Length = 521
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 79/228 (34%), Positives = 121/228 (53%), Gaps = 13/228 (5%)
Query: 1 EIYPLACQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y CQ S P + +L C YE+ + +L + P++ E L+L+P +V HD + DS
Sbjct: 276 DTYEGLCQTLGSQPTHYQIPSLYCAYETNGSPYLLLQPVRKEVLHLEPYIVLYHDFVSDS 335
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
E +I + ++R V + V+ R+SK +L + P L + RI +T
Sbjct: 336 EAQKIRGFAAPWLQRSVVASGEKQQQVEYRISKSAWLKDTV---DPILVSLDRRIAALTG 392
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
L + + Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE G
Sbjct: 393 LNV--QPPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSGNRVATFMIYLSSVEAG 450
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G+T F N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 451 GSTAFIYANFSVPVVKNAALFWWNLHRSGQGDGDTLHAGCPVLVGDKW 498
>gi|344274276|ref|XP_003408943.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3
[Loxodonta africana]
Length = 516
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 74/224 (33%), Positives = 115/224 (51%), Gaps = 25/224 (11%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+V+ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKG--PLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
L + E + P G G R+A+++FY++DV GGAT+
Sbjct: 406 GLDVSTAEELQKDEPDAFKELGTGN----------------RIATWLFYMSDVSAGGATV 449
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 450 FPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 493
>gi|195159160|ref|XP_002020450.1| GL13507 [Drosophila persimilis]
gi|194117219|gb|EDW39262.1| GL13507 [Drosophila persimilis]
Length = 543
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 78/212 (36%), Positives = 111/212 (52%), Gaps = 16/212 (7%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y + FL++ PL++EEL LDP +V H + D E+ R+ +S + R +V
Sbjct: 306 ARLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHSVLSDEEMARLENMSTPLLHRARVF 365
Query: 79 NYG---DTIYVDTRLSKVYFLYPEIFGDHPFLYK-IQTRIQDMTNLVIGREERYKGPLQI 134
+ G I +V P++ + L + IQ RI D+T L++ R +Q
Sbjct: 366 DSGIRKPKISPARTADEVQIPNPKLVAEDIQLVECIQKRITDLTGLMLTSMRR----IQF 421
Query: 135 NNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
YG GG Y H D T R G R+A+ +FYL DVE GGAT FP+L+L V E+
Sbjct: 422 LKYGFGGIYVPHHDFFSVHTPTSRLHGD-RIATVIFYLNDVEHGGATAFPNLDLVVPTER 480
Query: 189 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
G+ +FW+N T LDYR H CPV +G K
Sbjct: 481 GAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 512
>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
Length = 535
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 118/230 (51%), Gaps = 22/230 (9%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G L+ + +L+C + L P K+EEL+LDP VV++H I +
Sbjct: 284 MYEQVCRGELAPLPSKQRDLRC---RLWRSRLGYAPFKLEELHLDPPVVQLHQVIGSKDA 340
Query: 62 NRIIELSKGKVERGKV---VNYGDTIYVDTRLSK-VYFLYPEIFGDHPFLYKIQTRIQDM 117
+ ++ +++R V GD+ R S+ F Y + + + D
Sbjct: 341 ESLQRTARPRIKRSTVYSLAGNGDSTAAAFRTSQGASFNYSR----NAATKLLSHHVGDF 396
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDVE 170
+ L + Y LQ+ NYG+GGHY+ H D+ P + EG R+A+ ++YL+DVE
Sbjct: 397 SGLNM----EYAEDLQVANYGIGGHYEPHWDSFPDNHVYQEGDLHGNRIATAIYYLSDVE 452
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L L V PE+GS +FWYN H + D+R H+ CPV G+KW
Sbjct: 453 AGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
>gi|198466403|ref|XP_002135183.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
gi|198150584|gb|EDY73810.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 122 bits (307), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 78/227 (34%), Positives = 117/227 (51%), Gaps = 23/227 (10%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G ++ L C Y FL++ PLK+EE+ DP +V H+ + D EI
Sbjct: 296 YEIGCRGLFPK----RTKLVCRYNFTTTPFLRLAPLKMEEVNHDPYIVMYHEVLSDREIE 351
Query: 63 RIIELSKGKVERGKVVN-YGDTIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ KG+ G++ N + D + T++ + + ++ RI DMTN
Sbjct: 352 EM----KGR--SGQMSNGWADQKEANSTKIRDIVCRHTWWREQSAIKERVNRRISDMTNF 405
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGG 173
+E LQ+ NYGLG H+ H D TP L RL S +FY +DV GG
Sbjct: 406 DFPPQE----DLQVANYGLGTHFKPHYDYTSDGYETPDVLTLGDRLGSIIFYASDVPQGG 461
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
AT+FP +++FP KGS+VFWYN + + +D R HS CPV +G++W
Sbjct: 462 ATVFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGDRW 508
>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
melanoleuca]
Length = 539
Score = 122 bits (307), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P++ E ++L+P VV HD + D E +I
Sbjct: 300 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDGEAQKIR 359
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L++ ++R V + + V+ R+SK +L + P L + RI +T L + +
Sbjct: 360 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPLLVTLDHRIGALTGLDV--Q 414
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 415 PPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 474
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 475 YANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 516
>gi|195494568|ref|XP_002094893.1| GE19959 [Drosophila yakuba]
gi|194180994|gb|EDW94605.1| GE19959 [Drosophila yakuba]
Length = 486
Score = 122 bits (307), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 117/214 (54%), Gaps = 25/214 (11%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
K+NL C Y S N FLK+ PLK+EE+ DP +V H+ I D EI + KG + +
Sbjct: 246 KTNLVCRYNSSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKEIEEM----KGDI---RE 298
Query: 78 VNYGDTIYVDTR--LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
+ G T D + +S VY++ E F +I RI DMT + E + +Q+
Sbjct: 299 MENGWTGLEDPKEIVSSVYWIREET----SFSKRINQRISDMTGFKL---EEFVA-IQLA 350
Query: 136 NYGLGGHYDLHCDA-TPRDEGLW-------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
N+G+GG++ H D T R G+ R+AS +FY +V GG T+FP L + V P+
Sbjct: 351 NFGVGGYFKPHFDYYTERLRGVDANNTLGDRIASIIFYAGEVSQGGQTVFPDLKVVVEPK 410
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+G+A+FW+N ++ D R HS CPV +G++W
Sbjct: 411 RGNALFWFNKLDDSSPDPRSLHSVCPVIVGSRWS 444
>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
Length = 542
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 120/222 (54%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P + E ++L P + HD + D E +I
Sbjct: 303 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIR 362
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
EL++ ++R V + + V+ R+SK +L + P L + RI +T L I +
Sbjct: 363 ELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPMLVTLDHRIAALTGLDI--Q 417
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 418 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 477
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 478 YGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 519
>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
Length = 547
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 78/227 (34%), Positives = 109/227 (48%), Gaps = 15/227 (6%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
++P C G V ++ + L C Y + FL++ P+K E L +DP V+ HD I E
Sbjct: 293 LFPPCCSGRCEVSRNL-TGLYCVYNHVTSPFLQLAPIKTEILSIDPFVLLFHDMISQKES 351
Query: 62 NRIIELSKGKVERGKVVNY---GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I SK + + G +V T + Y D +I R+ D T
Sbjct: 352 TLIRSSSKEHMLPSATTDVDASGSEDHVATFRTSKSVWYSSTSNDTT--KRITERLGDAT 409
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
L + E + Q+ NYGLGG ++ H D D + RLA+ +FYL +V GG
Sbjct: 410 GLDMNFTEYF----QVINYGLGGFFETHLDMLLSDRSRFNGTRDRLATTLFYLNEVRQGG 465
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP LNLTVFP+ GSA+FWYN H+GCPV +G+KW
Sbjct: 466 GTHFPRLNLTVFPQPGSALFWYNLDTRGNDHTSTLHTGCPVIVGSKW 512
>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
Length = 542
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 120/222 (54%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P + E ++L P + HD + D E +I
Sbjct: 303 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIR 362
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
EL++ ++R V + + V+ R+SK +L + P L + RI +T L I +
Sbjct: 363 ELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPMLVTLDHRIAALTGLDI--Q 417
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 418 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 477
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 478 YGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 519
>gi|194751825|ref|XP_001958224.1| GF23629 [Drosophila ananassae]
gi|190625506|gb|EDV41030.1| GF23629 [Drosophila ananassae]
Length = 523
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 76/210 (36%), Positives = 111/210 (52%), Gaps = 17/210 (8%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
K NL C Y FL++ PLK+EE+ LDP VV H+ +Y++EI + + S G ++ G
Sbjct: 303 KMNLFCRYNFTTTPFLRLAPLKLEEINLDPYVVMYHEVLYETEIEELKKQS-GHMKNGYA 361
Query: 78 VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
T+Y V + + P +I RI+DMT L + LQ+ NY
Sbjct: 362 DQKNGTMY-----RAVVARHSWWSDESPTRERINRRIRDMTGLDFPITD----TLQVANY 412
Query: 138 GLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
G G ++ H D TP + L RL + +FY +DV GGAT+FP + +++ P KGS
Sbjct: 413 GCGTYFKPHFDYTSDGYETPNADALGDRLGTIIFYASDVLQGGATVFPDIKVSITPRKGS 472
Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+VFWYN + + D R HS CPV G++W
Sbjct: 473 SVFWYNLYDDGRPDIRSRHSVCPVINGDRW 502
>gi|326923465|ref|XP_003207956.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 3
[Meleagris gallopavo]
Length = 518
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 75/224 (33%), Positives = 115/224 (51%), Gaps = 25/224 (11%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C +Y+ N +GP+K E+ + PR+V+ D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + EL+K ++ R V + G R+SK +L + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 407
Query: 119 NLVIGREERYKG--PLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
L + E + P G G R+A+++FY++DV GGAT+
Sbjct: 408 GLDVSTAEELQKDEPDAFKELGTGN----------------RIATWLFYMSDVSAGGATV 451
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP + +V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 452 FPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 495
>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
Length = 404
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 80/228 (35%), Positives = 122/228 (53%), Gaps = 13/228 (5%)
Query: 1 EIYPLACQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y CQ S P + +L C YE+ ++ +L + P + E ++L P + HD + D
Sbjct: 159 DTYEGLCQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDE 218
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
E +I EL++ ++R V + + V+ R+SK +L + P L + RI +T
Sbjct: 219 EAQKIRELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPMLVTLDHRIAALTG 275
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
L I + Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE G
Sbjct: 276 LDI--QPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAG 333
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT F N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 334 GATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 381
>gi|395820528|ref|XP_003783616.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Otolemur
garnettii]
Length = 516
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 73/224 (32%), Positives = 115/224 (51%), Gaps = 25/224 (11%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKG--PLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
L + E + P G G R+A+++FY++DV GGAT+
Sbjct: 406 GLDVSTAEELQKDEPDAFKELGTGN----------------RIATWLFYMSDVSAGGATV 449
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 450 FPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 493
>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
griseus]
Length = 509
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 123/228 (53%), Gaps = 13/228 (5%)
Query: 1 EIYPLACQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y CQ S P ++ L C YE+ ++ +L + P + E ++L P V HD + D+
Sbjct: 264 DTYEGLCQTLGSQPTHYQNPRLYCSYETNSSPYLLLQPARKEVIHLRPFVALYHDFVSDA 323
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
E +I EL++ ++R V + + V+ R+SK +L + P L + RI +T
Sbjct: 324 EAQKIRELAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPMLGTLDHRIAALTG 380
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
L I + Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE G
Sbjct: 381 LDI--QPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSAVEAG 438
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT F N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 439 GATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 486
>gi|291404186|ref|XP_002718473.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 3
[Oryctolagus cuniculus]
Length = 516
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 73/224 (32%), Positives = 115/224 (51%), Gaps = 25/224 (11%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKG--PLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
L + E + P G G R+A+++FY++DV GGAT+
Sbjct: 406 GLDVSTAEELQKDEPDAFKELGTGN----------------RIATWLFYMSDVSAGGATV 449
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 450 FPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 493
>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
musculus]
gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_d [Rattus norvegicus]
Length = 189
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/173 (40%), Positives = 100/173 (57%), Gaps = 15/173 (8%)
Query: 56 IYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
+ D EI RI E++K K+ R V + G R+SK +L + D P + ++ R
Sbjct: 1 MSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRR 57
Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLT 167
+Q +T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++
Sbjct: 58 MQHITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMS 113
Query: 168 DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 114 DVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 166
>gi|217272851|ref|NP_001136068.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Homo
sapiens]
gi|114631189|ref|XP_001140871.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 10 [Pan
troglodytes]
Length = 516
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 73/224 (32%), Positives = 115/224 (51%), Gaps = 25/224 (11%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKG--PLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
L + E + P G G R+A+++FY++DV GGAT+
Sbjct: 406 GLDVSTAEELQKDEPDAFKELGTGN----------------RIATWLFYMSDVSAGGATV 449
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 450 FPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 493
>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
domestica]
Length = 559
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 79/228 (34%), Positives = 121/228 (53%), Gaps = 13/228 (5%)
Query: 1 EIYPLACQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y CQ S P + +L C YE+ + +L + P++ E L+L+P +V HD + DS
Sbjct: 314 DTYEGLCQTLGSQPTHYQIPSLYCAYETNASPYLLLQPVRKEVLHLEPYIVLYHDFVSDS 373
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
E +I + ++R V + V+ R+SK +L + P L + RI +T
Sbjct: 374 EAQKIRGFAAPWLQRSVVASGEKQQQVEYRISKSAWLKDTV---DPMLVSLDHRIAALTG 430
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
L + + Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE G
Sbjct: 431 LNV--QPPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSGNRVATFMIYLSSVEAG 488
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G+T F N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 489 GSTAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 536
>gi|442747045|gb|JAA65682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 538
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/234 (30%), Positives = 116/234 (49%), Gaps = 17/234 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y C+G L + S L+C Y + F + P+K+EE+ L P ++ +HD + D +
Sbjct: 279 QSYKRLCRGELLRSPKMDSQLRCRYYKGQDGFFSLQPIKLEEINLKPYIIVMHDVVQDKD 338
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I ++ ++ ++ER + + R S +L + + P ++ + ++ + +
Sbjct: 339 IKDLMAYAEPRLERSTTYTGSEMVPSPVRTSSTAWLNED---EAPIAVRMNSYLRALLGM 395
Query: 121 VIGREERYKGPLQINNYGLGG----HYD-----LHCDATPRDEGLW-----RLASFMFYL 166
Q+ NYG GG H+D LH + D L RLA+ M Y+
Sbjct: 396 GTSDTNEEAEAYQLANYGTGGQFLPHHDFLQDSLHSYNSSADYYLQYGTGDRLATLMIYM 455
Query: 167 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
TDVE GGAT+FPSL + + P+KG A FW+N A+ D H+GCPV G+KW
Sbjct: 456 TDVEEGGATVFPSLGIRLTPKKGDAAFWWNLKASGEGDRLTTHAGCPVLYGSKW 509
>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/214 (35%), Positives = 114/214 (53%), Gaps = 22/214 (10%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y + FL++ PL++EEL LDP +V H+ + D+EI ++ +++ + K +
Sbjct: 298 SRLHCRYNATTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEIAKVERVAEPLL---KSI 354
Query: 79 NYGDTIYVDTRLSKVYFLYPEIFGDH-------PFLYKIQTRIQDMTNLVIGREERYKGP 131
G+ +++ SKV D P + +I RI DMT L+I ++
Sbjct: 355 GVGEMD--NSKKSKVRTALGAWIPDKNMHISGWPVIQRIVRRIHDMTGLII----KHGQV 408
Query: 132 LQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFP 186
+Q+ YG GGHYD H D + P + L R+A+ +FYL DV+ GG+T+FP L L V
Sbjct: 409 VQLIKYGYGGHYDTHFDYLNDSLPITQALGDRMATVLFYLNDVKHGGSTVFPVLKLKVPS 468
Query: 187 EKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
E+G + WYN H T LD R H CPV G K
Sbjct: 469 ERGKVLVWYNMHGETHDLDSRTLHGSCPVIDGAK 502
>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
Length = 477
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/208 (36%), Positives = 117/208 (56%), Gaps = 12/208 (5%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ ++ +L + P++ E ++L+P VV HD + D E +I L++ ++R V +
Sbjct: 253 SLYCSYETNSSPYLLLQPIRKEVIHLEPYVVLYHDFVSDMEAQKIRGLAEPWLQRSVVAS 312
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ V+ R+SK +L + P L + RI +T L + + Y LQ+ NYG+
Sbjct: 313 GEKQLPVEYRISKSAWLKDTV---DPLLVNLDHRIGALTGLDV--QPPYAEYLQVVNYGI 367
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F N +V K +A+
Sbjct: 368 GGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAAL 427
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FW+N H + D H+GCPV +G+KW
Sbjct: 428 FWWNLHRSGEGDGDTLHAGCPVLVGDKW 455
>gi|321461762|gb|EFX72791.1| hypothetical protein DAPPUDRAFT_308081 [Daphnia pulex]
Length = 561
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 80/236 (33%), Positives = 119/236 (50%), Gaps = 23/236 (9%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y C+G +I++ L+C + + L + P+KVEE LDP +V +HD I + +
Sbjct: 303 EHYERLCRGEKLRSANIEAGLRCRLVTRGHPALLLQPIKVEEQSLDPMIVVLHDLITERQ 362
Query: 61 INRIIELSKGKV----ERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
+ +L + K+ RG G + R SK +L ++ L I+ R++
Sbjct: 363 TEILRQLGEPKLATSLHRG---GEGKFVRSMIRTSKNAWLQEH---ENASLPAIRHRMEL 416
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYD------LHCDATPRDEGLW------RLASFMF 164
T L+ G E + QI NYG+GG Y +H D P D+ W R+A+ M
Sbjct: 417 ATGLIYGPETASEY-FQIANYGIGGLYKTHTDNVIHPDVRPEDQDPWNLYVGDRIATLMV 475
Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
YL+DVE GGAT+FP +T +P KGSA FW+N + + D H CPV G+KW
Sbjct: 476 YLSDVEAGGATVFPRAGVTCWPRKGSAAFWWNLYKSGEPDLTTRHGACPVLHGSKW 531
>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
Length = 535
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 114/230 (49%), Gaps = 22/230 (9%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G L+ + NL+C + L P K+EEL+LDP +V++H I +
Sbjct: 284 MYEQVCRGELAPLPAKQRNLRC---RLRKSRLGYAPFKLEELHLDPLLVQLHQVIGAKDS 340
Query: 62 NRIIELSKGKVERGKVV----NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
+ ++ +++R V N G T F Y + + D
Sbjct: 341 ESLQRTARPRIKRSTVYSLAGNGGSTAAAFRTSQGASFNYSRSAATKLLSH----HVGDF 396
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDVE 170
+ L + Y LQ+ NYG+GGHY+ H D+ P + EG R+A+ ++YL+DVE
Sbjct: 397 SGLNM----EYAEDLQVANYGIGGHYEPHWDSFPENHVYQEGDLHGNRIATGIYYLSDVE 452
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L L V PEKGS +FWYN H + D+R H+ CPV G+KW
Sbjct: 453 AGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
>gi|386766694|ref|NP_651648.5| CG11828 [Drosophila melanogaster]
gi|383293009|gb|AAF56834.5| CG11828 [Drosophila melanogaster]
Length = 458
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/228 (34%), Positives = 117/228 (51%), Gaps = 26/228 (11%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G +P KS L+C Y + FL++ P+K+E+L ++P V HDAI +E ++
Sbjct: 239 CRGKNLLPS--KSYLRCRYLRDGSPFLRMAPVKLEQLNIEPFVGLFHDAISPAEQKDLLH 296
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
L+ ++E K + VDT S DH + +I RI+D+T + E
Sbjct: 297 LTDSRLEHRKKDSSSVEAKVDTNAS-----------DH--VRRIHQRIEDITGFDLEESE 343
Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGL------WRLASFMFYLTDVELGGATIFPSL 180
PL ++NYG+GG +H D E + +R AS MFYL+DV++GG FP L
Sbjct: 344 ----PLTVSNYGIGGQDFIHLDCEQPKEFIGYYPKEYRSASAMFYLSDVQMGGYASFPDL 399
Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW-GKLLLSG 227
P +GSA+ W+N + D R + CPV LGN+W K +SG
Sbjct: 400 GFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWVAKKWISG 447
>gi|73988166|ref|XP_851718.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Canis lupus
familiaris]
Length = 544
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P++ E ++L+P VV HD + D E +I
Sbjct: 305 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVNDVEAQKIR 364
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L++ ++R V + + V+ R+SK +L + P L + RI +T L + +
Sbjct: 365 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPLLVTLDHRIGALTGLDV--Q 419
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 420 PPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 479
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 480 YANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 521
>gi|443697961|gb|ELT98195.1| hypothetical protein CAPTEDRAFT_181380 [Capitella teleta]
Length = 530
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 111/227 (48%), Gaps = 17/227 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + K L C Y+ Y+ F I PLK E L DP + HD + DS+
Sbjct: 289 YEKLCRGEETHKRPFKHRLVCRYQRYHPIFY-ISPLKEEMLNFDPAIYVYHDVLTDSQNA 347
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMTNL 120
I E+S+ K+ R V + D DT LS D HP + ++ + ++NL
Sbjct: 348 IIKEVSRPKLHRSGVFSKTD---ADTGLSNFRTSQTAWHDDSTHPLIARLSQKASAISNL 404
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-------GLWRLASFMFYLTDVELGG 173
+ E LQ+ NYG+GG Y+ H D +E R+A+F+ YL+++E GG
Sbjct: 405 TLETVEH----LQVLNYGIGGLYEPHWDFVQGEERNEFSESDRNRVATFICYLSELEAGG 460
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T++P++ V P K S WYN N DYR YH+ CP+ G KW
Sbjct: 461 YTVYPTVGAAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILYGYKW 507
>gi|443697959|gb|ELT98193.1| hypothetical protein CAPTEDRAFT_162820 [Capitella teleta]
Length = 347
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 111/227 (48%), Gaps = 17/227 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + K L C Y+ Y+ F I PLK E L DP + HD + DS+
Sbjct: 106 YEKLCRGEETHKRPFKHRLVCRYQRYHPIFY-ISPLKEEMLNFDPAIYVYHDVLTDSQNA 164
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMTNL 120
I E+S+ K+ R V + D DT LS D HP + ++ + ++NL
Sbjct: 165 IIKEVSRPKLHRSGVFSKTD---ADTGLSNFRTSQTAWHDDSTHPLIARLSQKASAISNL 221
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-------GLWRLASFMFYLTDVELGG 173
+ E LQ+ NYG+GG Y+ H D +E R+A+F+ YL+++E GG
Sbjct: 222 TLETVEH----LQVLNYGIGGLYEPHWDFVQGEERNEFSESDRNRVATFICYLSELEAGG 277
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T++P++ V P K S WYN N DYR YH+ CP+ G KW
Sbjct: 278 YTVYPTVGAAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILYGYKW 324
>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
Length = 572
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 74/207 (35%), Positives = 115/207 (55%), Gaps = 12/207 (5%)
Query: 21 LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
L C YE+ ++ +L + P++ E ++L+P V HD + D E +I +L++ ++R V +
Sbjct: 348 LYCSYETNSSPYLLLQPVRKEVIHLEPYVALYHDFVSDPEAQKIRKLAEPWLQRSVVASG 407
Query: 81 GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG 140
+ V+ R+SK +L P L + RI +T L + + Y LQ+ NYG+G
Sbjct: 408 EKQLQVEYRISKSAWLKDTA---DPVLVTLDHRIAALTGLDV--QHPYAEYLQVVNYGIG 462
Query: 141 GHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
GHY+ H D AT L+R+ A+FM YL+ VE GGAT F N +V K +A+F
Sbjct: 463 GHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALF 522
Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKW 220
W+N H + D H+GCPV +G+KW
Sbjct: 523 WWNLHRSGEGDGDTLHAGCPVLVGDKW 549
>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
Length = 187
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 99/171 (57%), Gaps = 15/171 (8%)
Query: 58 DSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQ 115
D EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 1 DEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQ 57
Query: 116 DMTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDV 169
+T L + E LQ+ NYG+GG Y+ H D + P D GL RLA+F+ Y++DV
Sbjct: 58 HITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDV 113
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KW
Sbjct: 114 EAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 164
>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 73/223 (32%), Positives = 117/223 (52%), Gaps = 19/223 (8%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G ++NL C Y FL++ PLK+EE+ DP +V H+ +YD EI
Sbjct: 296 YEIGCRGLFPK----RTNLVCRYNFTTTPFLRLAPLKMEEVNHDPYIVLYHEVLYDREIE 351
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
+ + SK ++N + ++ ++ + + +I RI D+T +
Sbjct: 352 ELKKQSKN------MINGFSEPQQENKIREIIARHAWWWEQTTTRARIYQRITDITGFQL 405
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGL-W----RLASFMFYLTDVELGGATIF 177
+E L + NYGLG + H D TP + + W L + +FY++D++ GGATIF
Sbjct: 406 FVQEE----LNVANYGLGTIFGPHYDYTPENYDIGWFMGGPLGTILFYVSDLQQGGATIF 461
Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
PS+N+TV P KGSA+ W+N + + D R HS CPV G++W
Sbjct: 462 PSINITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGDRW 504
>gi|268572523|ref|XP_002641343.1| C. briggsae CBR-DPY-18 protein [Caenorhabditis briggsae]
gi|94442971|emb|CAJ98658.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 559
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 77/229 (33%), Positives = 113/229 (49%), Gaps = 18/229 (7%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+ + V + S L C+Y+ + FL P+KVE +P V D I D E+
Sbjct: 284 MYEALCRNEVPVSQKDISKLYCYYKR-DRPFLIYAPIKVEIKRFNPLAVLFKDVISDEEV 342
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
I EL+K K+ R V + G + R+SK +L +H + ++ RI MTN
Sbjct: 343 ATIQELAKPKLARATVHDSVTGKLVTATYRISKSAWLKA---WEHEVVERVNKRIDLMTN 399
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
L + E LQI NYG+GGHYD H D ++E R+A+ +FY++
Sbjct: 400 LEMETAEE----LQIANYGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSH 455
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F + TV P K A+FWYN + + H+ CPV +G KW
Sbjct: 456 GGGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 504
>gi|410972729|ref|XP_003992809.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Felis catus]
Length = 533
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P++ E ++L+P VV HD + D E +I
Sbjct: 294 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPIRKEVIHLEPYVVLYHDFVNDLEAQKIR 353
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L++ ++R V + + V+ R+SK +L + P L + RI +T L + +
Sbjct: 354 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPLLVTLDHRIGALTGLDV--Q 408
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 409 PPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 468
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 469 YANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 510
>gi|308497208|ref|XP_003110791.1| CRE-DPY-18 protein [Caenorhabditis remanei]
gi|308242671|gb|EFO86623.1| CRE-DPY-18 protein [Caenorhabditis remanei]
Length = 559
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 77/229 (33%), Positives = 113/229 (49%), Gaps = 18/229 (7%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+ + V + S L C+Y+ + FL P+KVE +P V D I D E+
Sbjct: 284 MYEALCRNEVPVSQKDISRLYCYYKR-DRPFLVYAPIKVEIKRFNPLAVLFKDVISDDEV 342
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
I EL+K K+ R V + G + R+SK +L +H + ++ RI+ MTN
Sbjct: 343 ATIQELAKPKLARATVHDSATGKLVTATYRISKSAWLKE---WEHEVVERVNKRIELMTN 399
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
L + E LQI NYG+GGHYD H D ++E R+A+ +FY++
Sbjct: 400 LEMETAEE----LQIANYGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSH 455
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F + TV P K A+FWYN + H+ CPV +G KW
Sbjct: 456 GGGTVFTEVKSTVLPTKNDALFWYNLFKQGDGNPDTRHAACPVLVGIKW 504
>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
Length = 535
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 118/230 (51%), Gaps = 22/230 (9%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+G L+ + +L+C + L P K+EEL+LDP VV++H I ++
Sbjct: 284 MYEQVCRGELAPLPSKQRSLRC---RLRKSRLGYAPFKLEELHLDPLVVQLHQVIGSNDS 340
Query: 62 NRIIELSKGKVERGKVV----NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
+ + ++ ++R V N G T F Y + + + + D
Sbjct: 341 ESLQKSARPMIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSK----NAATKLLSHHVGDF 396
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDVE 170
++L + Y LQ+ NYG+GGHY+ H D+ P + EG R+A+ ++YL+DVE
Sbjct: 397 SDLNMD----YAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHGNRIATGIYYLSDVE 452
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T FP L L V PEKGS +FWYN H + D+R H+ CPV G+KW
Sbjct: 453 AGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
>gi|195352176|ref|XP_002042590.1| GM14977 [Drosophila sechellia]
gi|194124474|gb|EDW46517.1| GM14977 [Drosophila sechellia]
Length = 485
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 80/225 (35%), Positives = 118/225 (52%), Gaps = 32/225 (14%)
Query: 9 GNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELS 68
G LSV + +L C YE + FL+I PLKVE L L P +V HD IY+SEI++I +S
Sbjct: 279 GCLSVWQ-TSQHLSCHYEQNTSEFLRIAPLKVETLSLKPHIVLYHDVIYESEISKIKNIS 337
Query: 69 KGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERY 128
++ + D + + +L+++ + P + RI+DMT G + +
Sbjct: 338 LPSLKSP--LRIIDAVDYNLKLAQIR--------EDP-QSPLSLRIKDMT----GEDVKE 382
Query: 129 KGPLQINNYGLGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTV 184
QI+NYG+ G + H D + RL S +F++ DV GGA FP+LNLT+
Sbjct: 383 DTDFQIDNYGICGFRNFHTDNIEIQDQTAELGDRLTSILFFMNDVVQGGAFAFPNLNLTI 442
Query: 185 FPEKGSAVFWYNAHANTLLDYRM------YHSGCPVALGNKWGKL 223
+P KGSA+ W N LD+RM H CPV +G+KW +L
Sbjct: 443 WPHKGSALVWRN------LDHRMQPNKDLLHVSCPVVVGSKWSEL 481
>gi|417402564|gb|JAA48127.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
Length = 544
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 78/222 (35%), Positives = 120/222 (54%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P ++ +L C YE+ + +L + P++ E ++L+P VV HD + D E +I
Sbjct: 305 CQTLGSQPTHYQNPSLHCSYETGASPYLLLQPIRKEVVHLEPYVVLYHDFVNDLEAQKIR 364
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
++ ++R V + + V+ R+SK +L + P L + RI +T L +
Sbjct: 365 GFAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPMLVTLDRRIAALTGL--DTQ 419
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 420 PPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 479
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 480 YANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 521
>gi|308451420|ref|XP_003088665.1| CRE-PHY-2 protein [Caenorhabditis remanei]
gi|308246199|gb|EFO90151.1| CRE-PHY-2 protein [Caenorhabditis remanei]
Length = 609
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 89/285 (31%), Positives = 124/285 (43%), Gaps = 73/285 (25%)
Query: 1 EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y C+G + V + K+ L+C+ + + FLKI P+KVE L DP V + I DS
Sbjct: 295 DAYEALCRGEIPPVEKKWKNKLRCYLKR-DKPFLKIAPIKVEILRFDPLAVLFKNVISDS 353
Query: 60 EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
EI I EL+ K++R V N G+ + R+SK +L ++ HP + ++ RI+D
Sbjct: 354 EIKVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLKGDL---HPVIERVNRRIEDF 410
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD----------------------------- 148
T L G E LQ+ NYGLGGHYD H D
Sbjct: 411 TGLYQGTSEE----LQVANYGLGGHYDPHFDFARIANYGLGGHYEPHYDMSLVGYHPIQL 466
Query: 149 -----------ATPRDEGLWRLASFMFY----------------------LTDVELGGAT 175
P + R+A+ +FY ++ E GGAT
Sbjct: 467 TVSLEYFQRGVPEPYGKNGNRIATVLFYKEEKNAFKTLNTGNRIATVLFYMSQPERGGAT 526
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+F L VFP K A+FWYN + D R H+ CPV LG KW
Sbjct: 527 VFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKW 571
>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
aries]
Length = 514
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 78/222 (35%), Positives = 122/222 (54%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P++ E ++L+P VV HD + D+E +I
Sbjct: 275 CQTLGSQPTHYQIPSLYCSYETSSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQKIR 334
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L++ ++R V + + V+ R+SK +L + P L + RI +T L + +
Sbjct: 335 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPVLVTLDHRIAALTGLDV--Q 389
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 390 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSGNRVATFMIYLSSVEAGGATAFI 449
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+ CPV +G+KW
Sbjct: 450 YGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKW 491
>gi|281353153|gb|EFB28737.1| hypothetical protein PANDA_003344 [Ailuropoda melanoleuca]
Length = 456
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P++ E ++L+P VV HD + D E +I
Sbjct: 240 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDGEAQKIR 299
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L++ ++R V + + V+ R+SK +L + P L + RI +T L + +
Sbjct: 300 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPLLVTLDHRIGALTGLDV--Q 354
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 355 PPYAEYLQVVNYGIGGHYEPHFDHATVTMGPLYRMKSGNRVATFMIYLSSVEAGGATAFI 414
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 415 YANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 456
>gi|426255748|ref|XP_004021510.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Ovis
aries]
Length = 516
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 73/224 (32%), Positives = 115/224 (51%), Gaps = 25/224 (11%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKG--PLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
L + E + P G G R+A+++FY++DV GGAT+
Sbjct: 406 GLDVSTAEELQKDEPDAFKELGTGN----------------RIATWLFYMSDVLAGGATV 449
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 450 FPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 493
>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
Length = 483
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 78/222 (35%), Positives = 120/222 (54%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P++ E ++L+P VV HD + D E +I
Sbjct: 244 CQTLGSQPTHYQIPSLHCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDLEAQKIR 303
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L++ ++R V + + V+ R+SK +L P L + RI +T L + +
Sbjct: 304 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTA---DPMLVTLDHRIAALTGLDV--Q 358
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 359 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 418
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+ CPV +G+KW
Sbjct: 419 YANFSVPVVKNAALFWWNLHRSGEGDSDTLHAACPVLVGDKW 460
>gi|195338688|ref|XP_002035956.1| GM16188 [Drosophila sechellia]
gi|194129836|gb|EDW51879.1| GM16188 [Drosophila sechellia]
Length = 392
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/222 (33%), Positives = 111/222 (50%), Gaps = 20/222 (9%)
Query: 6 ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQG L C Y S F++I PLK EE+ DP + HD IYDSEI ++
Sbjct: 165 GCQGKFPP----GPQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHDVIYDSEITQLT 220
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L++ ++ G NY R+++++ + + R+ D++ L +G
Sbjct: 221 NLTREEMILGTTTNYT----TPDRVNRLFHIKVTNDDGGKLDKTLVNRMADISGLDMGNT 276
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-------ATPRDEGLWRLASFMFYLTDVELGGATIFP 178
L NYGLGG++ H D +EG RL +F+FY+TDV +GG TIFP
Sbjct: 277 TT----LARINYGLGGYFQEHSDYMDIKLHPELTEEGD-RLMTFLFYMTDVLVGGGTIFP 331
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L + P+KGSA+FWYN H N + H+ CP +G++W
Sbjct: 332 GAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 373
>gi|51490656|emb|CAF31507.1| prolyl 4-hydroxylase 2 precursor [Brugia malayi]
Length = 551
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 76/229 (33%), Positives = 119/229 (51%), Gaps = 18/229 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y C+ + + +S L C+Y+ + +L++ P KVE ++ +P VV D + D E
Sbjct: 283 DTYEALCRQEVPINTKAQSRLYCYYK-MDRPYLRLAPFKVEIVHQNPLVVLFRDIVSDEE 341
Query: 61 INRIIE-LSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
+ RIIE L+ K+ R V N G+ R S+ +L +H + +I R+
Sbjct: 342 M-RIIEMLAVPKLARATVHNVVTGNIETAFYRTSQSSWLGS---TEHEVVKRINKRLDLA 397
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVEL 171
TNL E LQ+ NYG+GGHY+ H D + R+ R+A+ + Y+T+ E+
Sbjct: 398 TNL----ETETAEELQVQNYGIGGHYEPHYDCSRRENVFEKTKNGNRIATILIYMTEPEI 453
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F L +V K +A+FWYN + +D R YH+ CPV G KW
Sbjct: 454 GGGTVFIDLKTSVSCTKNAALFWYNLMRSGAVDMRSYHAACPVLTGTKW 502
>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
Length = 544
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 78/222 (35%), Positives = 121/222 (54%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P++ E ++L+P VV HD + D+E I
Sbjct: 305 CQTLGSQPTHYRIPSLYCSYETSSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQTIR 364
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L++ ++R V + + V+ R+SK +L + P L + RI +T L + +
Sbjct: 365 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPVLVTLDHRIAALTGLDV--Q 419
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 420 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSGNRVATFMIYLSSVEAGGATAFI 479
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+ CPV +G+KW
Sbjct: 480 YGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKW 521
>gi|301613006|ref|XP_002936013.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 504
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 73/228 (32%), Positives = 114/228 (50%), Gaps = 41/228 (17%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y C+G + + + L C +++ + L + P K E+ + PR+V+ HD I D E
Sbjct: 285 YEKLCRGEGVKMTSRRQKRLFCRYFDGNKDPLLILSPTKQEDEWDKPRIVRYHDIISDEE 344
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+++ EL+K ++ R + N I G L Q RI +
Sbjct: 345 ISKVKELAKPRLRRATISN-------------------PITG---VLETAQYRISKRWAI 382
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELG 172
+ L++ NYG+GG Y+ H D +DE R+A+++FY++DVE G
Sbjct: 383 M---------ELEVANYGMGGQYEPHFDFARKDEPDAFKELGTGNRVATWLFYMSDVEAG 433
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT+FP + V+P+KG+AVFWYN + DY H+ CPV +GNKW
Sbjct: 434 GATVFPEVGAAVYPKKGTAVFWYNLFESGEGDYSTRHAACPVLVGNKW 481
>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
Length = 478
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 78/222 (35%), Positives = 121/222 (54%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P++ E ++L+P VV HD + D+E I
Sbjct: 239 CQTLGSQPTHYRIPSLYCSYETSSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQTIR 298
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L++ ++R V + + V+ R+SK +L + P L + RI +T L + +
Sbjct: 299 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPVLVTLDHRIAALTGLDV--Q 353
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 354 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSGNRVATFMIYLSSVEAGGATAFI 413
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+ CPV +G+KW
Sbjct: 414 YGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKW 455
>gi|92109908|gb|ABE73278.1| IP10618p [Drosophila melanogaster]
Length = 501
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/221 (33%), Positives = 111/221 (50%), Gaps = 20/221 (9%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
CQG L C Y S F++I PLK EE+ DP + HD IYDSEI ++
Sbjct: 275 CQGKFPP----GPQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHDVIYDSEIAQLTN 330
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
+++ ++ G NY R+++++ + + R+ D++ L +G
Sbjct: 331 VTREEMILGTTTNYT----TPDRVNRLFHIKVTDDDGGKLDKTLVNRMADISGLDVGN-- 384
Query: 127 RYKGPLQINNYGLGGHYDLHCDATP-------RDEGLWRLASFMFYLTDVELGGATIFPS 179
L NYGLGG++ H D +EG RL +F+FY+TDV +GG TIFP
Sbjct: 385 --TTTLARINYGLGGYFQEHSDYMDIKLYPELTEEGD-RLMTFLFYMTDVPVGGTTIFPG 441
Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L + P+KGSA+FWYN H N + H+ CP +G++W
Sbjct: 442 AQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGSRW 482
>gi|161076739|ref|NP_001097101.1| CG34345 [Drosophila melanogaster]
gi|157400090|gb|ABV53635.1| CG34345 [Drosophila melanogaster]
Length = 504
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/221 (33%), Positives = 111/221 (50%), Gaps = 20/221 (9%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
CQG L C Y S F++I PLK EE+ DP + HD IYDSEI ++
Sbjct: 278 CQGKFPP----GPQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHDVIYDSEIAQLTN 333
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
+++ ++ G NY R+++++ + + R+ D++ L +G
Sbjct: 334 VTREEMILGTTTNYT----TPDRVNRLFHIKVTDDDGGKLDKTLVNRMADISGLDVGN-- 387
Query: 127 RYKGPLQINNYGLGGHYDLHCDATP-------RDEGLWRLASFMFYLTDVELGGATIFPS 179
L NYGLGG++ H D +EG RL +F+FY+TDV +GG TIFP
Sbjct: 388 --TTTLARINYGLGGYFQEHSDYMDIKLYPELTEEGD-RLMTFLFYMTDVPVGGTTIFPG 444
Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L + P+KGSA+FWYN H N + H+ CP +G++W
Sbjct: 445 AQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGSRW 485
>gi|195159148|ref|XP_002020444.1| GL13996 [Drosophila persimilis]
gi|194117213|gb|EDW39256.1| GL13996 [Drosophila persimilis]
Length = 559
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 77/212 (36%), Positives = 111/212 (52%), Gaps = 16/212 (7%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y + FL++ PL++EEL LDP +V H+ + D E+ R+ +S + R ++
Sbjct: 322 ARLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARIF 381
Query: 79 NYGDT---IYVDTRLSKVYFLYPE-IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
+ I +V P+ + GD + IQ RI D+T L++ R +Q
Sbjct: 382 DKETKKPKISPVRSADEVGIPNPKLVTGDIQLVECIQKRITDLTGLMLTSMRR----IQF 437
Query: 135 NNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
YG GG Y H D T R G R+A+ +FYL DVE GGAT FP+L+L V E+
Sbjct: 438 LKYGFGGIYVPHHDFFSVHTPTSRLHGD-RIATVIFYLNDVEHGGATAFPNLDLVVPTER 496
Query: 189 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
G+ +FW+N T LDYR H CPV +G K
Sbjct: 497 GAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 528
>gi|194765182|ref|XP_001964706.1| GF22908 [Drosophila ananassae]
gi|190614978|gb|EDV30502.1| GF22908 [Drosophila ananassae]
Length = 509
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 73/224 (32%), Positives = 116/224 (51%), Gaps = 16/224 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y CQG +PE + LKC+ + + F + PLKVE+++LDP + H + +I+
Sbjct: 263 YSRLCQGK-RLPEKQDNILKCYLDGKRHAFFTLAPLKVEQVHLDPDITVYHGVLSSKQIS 321
Query: 63 RIIELS--KGKVERGKVVNYG-DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
I S K ++ G G D D R+S+ +L + + +
Sbjct: 322 SIFTESNKKERIRSGVAGENGEDRTVKDIRVSQQTWL--------NYSTPTMQYVNRINE 373
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGAT 175
+ G R +Q+ NYG+GG Y+ H D P D R+++ MFYL++V+ GG T
Sbjct: 374 YICGLTMRGAEEMQVANYGVGGQYEPHPDYFEFDLPPDFDGDRISTSMFYLSNVQQGGYT 433
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
+FP+LN+ + P KGS V W+N H + +D R +H+GCPV +G+K
Sbjct: 434 VFPNLNVFLPPVKGSMVLWHNLHYSLDVDARTWHAGCPVIVGSK 477
>gi|119595340|gb|EAW74934.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III, isoform CRA_a
[Homo sapiens]
Length = 657
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 113/202 (55%), Gaps = 12/202 (5%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ +N +L + P++ E ++L+P + HD + DSE +I EL++ ++R V +
Sbjct: 351 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 410
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ V+ R+SK +L + P L + RI +T L + Y LQ+ NYG+
Sbjct: 411 GEKQLQVEYRISKSAWLKDTV---DPKLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 465
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F NL+V + +A+
Sbjct: 466 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAAL 525
Query: 193 FWYNAHANTLLDYRMYHSGCPV 214
FW+N H + D H+GCPV
Sbjct: 526 FWWNLHRSGEGDSDTLHAGCPV 547
>gi|195494570|ref|XP_002094894.1| GE19958 [Drosophila yakuba]
gi|194180995|gb|EDW94606.1| GE19958 [Drosophila yakuba]
Length = 498
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 75/214 (35%), Positives = 113/214 (52%), Gaps = 33/214 (15%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERG-K 76
+ NL C YE + + L+I PLKVE L L P +V HD IYDSEI+++ +S ++ +
Sbjct: 288 RKNLSCHYEKHTSDLLRIAPLKVETLSLKPHIVLYHDVIYDSEISKVKNISLPSLKSPLR 347
Query: 77 VVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
+++ D + +L+K+ + + RI+DMT G + + QI+N
Sbjct: 348 ILHAEDH---NLKLAKI---------SEDYHSPLNLRIKDMT----GEDVKEDTDFQIDN 391
Query: 137 YGLGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
YG+ G H D + RL S MF++ DV GGA +F LNLT++P+KGSA+
Sbjct: 392 YGICGFRYYHTDNLESQDQTAELGDRLTSIMFFMNDVAQGGAFVFLHLNLTIWPQKGSAL 451
Query: 193 FWYNAHANTLLDYRM------YHSGCPVALGNKW 220
W N LD+RM H+ CPV +G+KW
Sbjct: 452 VWRN------LDHRMQPNEDLLHASCPVIVGSKW 479
>gi|324510827|gb|ADY44523.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 551
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 75/230 (32%), Positives = 116/230 (50%), Gaps = 18/230 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+I+ C+ + V S L C+Y+ + +L++ P+KVE + L+P V H + D E
Sbjct: 284 DIFEALCRHEVPVSTKALSRLYCYYK-MDRPYLRLAPIKVEIMRLNPLAVLFHQIMSDEE 342
Query: 61 INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+ I L+ K+ R V N G R+SK +L P +H + + R+ T
Sbjct: 343 AHIIEMLAIPKLNRATVQNAMTGGLETASYRISKSAWLKPH---EHEVVDRFNKRLDMAT 399
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
NL + E LQI NYG+GGHYD H D ++E R+A+ + Y+T+ E
Sbjct: 400 NLEMETAEE----LQIQNYGVGGHYDPHFDCARKEEKNAFKELGTGNRVATILVYMTEPE 455
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+GG T+F + +V K +A+FWYN + +D R H+ CPV G KW
Sbjct: 456 IGGGTVFTEVKTSVACTKNAALFWYNLLRSGEVDMRSRHAACPVLTGVKW 505
>gi|335294484|ref|XP_003357239.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Sus scrofa]
Length = 545
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 80/223 (35%), Positives = 123/223 (55%), Gaps = 14/223 (6%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P++ E ++L+P VV HD + D+E +I
Sbjct: 305 CQTLGSQPTHYQIPSLYCSYETSSSPYLLLQPIRKEVIHLEPYVVLYHDFVTDAEAQKIR 364
Query: 66 ELSKGKVERGKVVNYGD-TIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
L++ V +V G+ + V+ R+SK +L + P L + RI +T L +
Sbjct: 365 GLAEPWVTAEILVASGEKQLPVEYRISKSAWLKDTV---DPMLVTLDHRIAALTGLDV-- 419
Query: 125 EERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIF 177
+ Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 420 QPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAF 479
Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 480 IYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 522
>gi|195577074|ref|XP_002078398.1| GD23422 [Drosophila simulans]
gi|194190407|gb|EDX03983.1| GD23422 [Drosophila simulans]
Length = 513
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 74/222 (33%), Positives = 111/222 (50%), Gaps = 20/222 (9%)
Query: 6 ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQG L C Y S F++I PLK EE+ DP + H+ IYDSEI ++
Sbjct: 286 GCQGKFPP----GPQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHNVIYDSEIAQLT 341
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L++ ++ G NY R+ +++ + + R+ D++ L +G
Sbjct: 342 NLTREEMILGTTTNYT----TPDRVDRLFHIKVTDDDGGKLDKTLVNRMADISGLDVGN- 396
Query: 126 ERYKGPLQINNYGLGGHYDLHCDATP-------RDEGLWRLASFMFYLTDVELGGATIFP 178
L NYGLGG++ H D +EG RL +F+FY+TD+ +GGATIFP
Sbjct: 397 ---TTTLARINYGLGGYFQEHSDYMDIKLHPELTEEGD-RLMTFLFYMTDIPVGGATIFP 452
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L + P+KGSA+FWYN H N + H+ CP +G++W
Sbjct: 453 GAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 494
>gi|38454288|ref|NP_942070.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Rattus norvegicus]
gi|81870816|sp|Q6W3E9.1|P4HA3_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962768|gb|AAQ87605.1| collagen prolyl 4-hydroxylase alpha III subunit [Rattus norvegicus]
Length = 544
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 118/222 (53%), Gaps = 13/222 (5%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P + E ++L P V HD + D E +I
Sbjct: 305 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVIHLRPLVALYHDFVSDEEAQKIR 364
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
EL++ ++R V + + V+ R+SK +L + P L + RI +T L I +
Sbjct: 365 ELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPVLVTLDRRIAALTGLDI--Q 419
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+ R A+ M YL+ VE GGAT F
Sbjct: 420 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYKMKSGNRAATLMIYLSSVEAGGATAFI 479
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 480 YGNFSVPVVKNAALFWWNLHRSGEGDDDTLHAGCPVLVGDKW 521
>gi|195505244|ref|XP_002099420.1| GE10895 [Drosophila yakuba]
gi|194185521|gb|EDW99132.1| GE10895 [Drosophila yakuba]
Length = 533
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 79/215 (36%), Positives = 111/215 (51%), Gaps = 14/215 (6%)
Query: 15 EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
E S L C Y + FL++ PL++EEL LDP VV H+ + D EI ++ +S+ +ER
Sbjct: 288 ESKPSRLHCRYNATTTPFLRLAPLRMEELSLDPYVVLYHNVLSDPEIEKLQLMSEPFLER 347
Query: 75 GKVVNY---GDTIYVDTRLSKVYFLYPEIF-GDHPFLYKIQTRIQDMTNLVIGREERYKG 130
KV D I + + E D L +I RI D+T G R
Sbjct: 348 AKVFRVEKGSDEIGASRAADGAWLPHQETEPEDLEVLNRIGRRIGDIT----GLSTRSGR 403
Query: 131 PLQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVF 185
+Q+ YG GGH+ H D T E + R+A+ +FYL +VE GGAT+FPS+NL V
Sbjct: 404 QMQLLKYGFGGHFTPHFDYFDSKTLYLEKVGDRIATVLFYLNNVEHGGATVFPSINLAVP 463
Query: 186 PEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
+KGSA+FW+N + D R +H CP+ G K
Sbjct: 464 TQKGSALFWHNLDGQSYDYDTRTFHGACPLISGTK 498
>gi|339236271|ref|XP_003379690.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
gi|316977627|gb|EFV60702.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
Length = 558
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 80/252 (31%), Positives = 121/252 (48%), Gaps = 41/252 (16%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+ + + ++ L C+Y+ N +LK+ P+KVE ++ P++V I D E
Sbjct: 291 DVYEGLCRSEYPISDKDRAKLYCYYKR-NRPYLKLAPIKVEVMHWKPKIVYFRGVISDEE 349
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDT---RLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
I I +L+ ++R V N DT ++T R+SK +L +H + +I RI M
Sbjct: 350 IAVIKQLASPLLKRATVHN-ADTGQLETASYRISKSAWLKD---TEHEVVKRISDRIDMM 405
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR------DEGLW-------------- 157
T+L + E LQI NYG+GGHYD H D + R +EG
Sbjct: 406 TDLTMETAEL----LQIANYGIGGHYDPHFDMSTRGESDPYEEGTGNRIATVLFYTNDPY 461
Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
R+A+ +FY++ E GG T+F S +TV P K A FW+N D
Sbjct: 462 SFESLNAGNRIATVLFYISQPEAGGGTVFTSHKITVEPSKYDAAFWFNVLQGGEPDMSTR 521
Query: 209 HSGCPVALGNKW 220
H+ CPV G KW
Sbjct: 522 HAACPVLAGTKW 533
>gi|241598362|ref|XP_002404733.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
gi|215500464|gb|EEC09958.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
Length = 340
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 109/210 (51%), Gaps = 9/210 (4%)
Query: 17 IKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
+ S L+C Y + F + P+K+EE+ L P V+ +HD + D +I ++ ++ ++ER
Sbjct: 1 MDSQLRCRYYKGQDGFFSLQPIKLEEINLKPYVIVMHDVVQDKDIEDLMAFAEPRLERST 60
Query: 77 VVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
+ + R S +L + + P ++ + ++ + + + Q+ N
Sbjct: 61 TYTGNEMMPSPERTSSTAWLNED---EAPIAVRMNSYLRALLGMGTSDTDEEAEAYQLAN 117
Query: 137 YGLGGHY----DLHCDATPRDEGLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
YG GGH+ D D+ D + RLA+ M Y+TDVE GG T+FP+L + + P+KG
Sbjct: 118 YGTGGHFLPHHDFLQDSLQADNSVTGDRLATLMIYMTDVEEGGTTVFPNLGIRLTPKKGD 177
Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
A FW+N A+ + H+GCPV G+KW
Sbjct: 178 AAFWWNLKASGDGERLTTHAGCPVLYGSKW 207
>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
Length = 535
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 72/227 (31%), Positives = 115/227 (50%), Gaps = 14/227 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G+L+ L+C + + L P K+EEL +P V ++H +
Sbjct: 283 KMYEQVCRGDLNPSPAKLRELRC---RFRRSRLGYAPFKLEELSHEPLVFQVHQVVSSKS 339
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I ++++ K++R V + G + + + + + D+++L
Sbjct: 340 AEFIKKMARPKIKRSTVYSIGGGGGSQAAAFRTSQGASFNYSRNAATKILSRHVGDLSSL 399
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPR----DEGL---WRLASFMFYLTDVELGG 173
+ + LQ+ NYG+GGHY+ H D+ P DEG R+A+ ++YL+DVE GG
Sbjct: 400 DMN----FAEELQVANYGIGGHYEPHWDSFPENHIYDEGDDRGNRIATGIYYLSDVEAGG 455
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP L L V PEKGS +FWYN H + DYR H+ CPV G+KW
Sbjct: 456 GTAFPFLPLLVTPEKGSLLFWYNLHESGDQDYRTKHAACPVLQGSKW 502
>gi|37496185|emb|CAE47803.1| Prolyl 4-hydroxylase alpha subunit [Sus scrofa]
Length = 263
Score = 119 bits (298), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 115/209 (55%), Gaps = 19/209 (9%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 58 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 117
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I+ + +L+K ++ R V + G R+SK +L ++P + ++ RIQD+T
Sbjct: 118 IDIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRLNMRIQDLT 174
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 175 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 230
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHA 199
GGAT+FP + +V+P+KG+AVFWYN A
Sbjct: 231 AGGATVFPEVGASVWPKKGTAVFWYNLFA 259
>gi|195159321|ref|XP_002020530.1| GL13464 [Drosophila persimilis]
gi|194117299|gb|EDW39342.1| GL13464 [Drosophila persimilis]
Length = 533
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 74/223 (33%), Positives = 119/223 (53%), Gaps = 14/223 (6%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y CQG + ++L+CF + + + + PL+VE+++LDP + H + +I+
Sbjct: 287 YSRLCQGRRLPEKGSGTSLRCFLDGKRHAYFTLAPLQVEQVHLDPDIDVYHGILTLDQID 346
Query: 63 RIIELS-KGKVERGKVVNYGDT-IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I E + K ++ R V G T VD R+S+ +L E P + I + ++
Sbjct: 347 SIFEAADKQEMTRSGVAGDGGTRTVVDLRVSQQTWLDYE----SPIMKSIARLVVFISGF 402
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
I E +Q+ NYG+GG Y+ H D P D R+++ MFYL+DVE GG T+
Sbjct: 403 DIAGAE----AMQVANYGVGGQYEPHPDYFEVNLPSDFKGDRISTSMFYLSDVEQGGYTV 458
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
F LN+ + P KG+ V W+N H + +D R +H+GCPV +G+K
Sbjct: 459 FTKLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIVGSK 501
>gi|281362877|ref|NP_733393.3| CG31016, isoform B [Drosophila melanogaster]
gi|442621939|ref|NP_001263119.1| CG31016, isoform C [Drosophila melanogaster]
gi|272477249|gb|AAF57071.5| CG31016, isoform B [Drosophila melanogaster]
gi|440218076|gb|AGB96498.1| CG31016, isoform C [Drosophila melanogaster]
Length = 536
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 78/215 (36%), Positives = 109/215 (50%), Gaps = 14/215 (6%)
Query: 15 EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
E S L C Y + FLK+ P ++EEL LDP V+ H+ + D+EI ++ + K +ER
Sbjct: 291 ESKPSRLHCRYNTITTPFLKLAPFRMEELSLDPYVIFYHNVLSDAEIEKLKPMGKPFLER 350
Query: 75 GKVVNY---GDTIYVDTRLSKVYFLYPEIFGDH-PFLYKIQTRIQDMTNLVIGREERYKG 130
KV D I + + I D L +I RI+DMT G R
Sbjct: 351 AKVFRVEKGSDEIDPSRSADGAWLPHQNIDPDDLEVLNRIGRRIEDMT----GLNTRSGS 406
Query: 131 PLQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVF 185
+Q YG GGH+ H D T E + R+A+ +FYL +V+ GGAT+FP LNL V
Sbjct: 407 KMQFLKYGFGGHFVPHYDYFNSKTFSLETVGDRIATVLFYLNNVDHGGATVFPKLNLAVP 466
Query: 186 PEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
+KGSA+FW+N + D R +H CP+ G K
Sbjct: 467 TQKGSALFWHNIDRKSYDYDTRTFHGACPLISGTK 501
>gi|159884097|gb|ABX00727.1| IP12176p [Drosophila melanogaster]
Length = 538
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 78/215 (36%), Positives = 109/215 (50%), Gaps = 14/215 (6%)
Query: 15 EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
E S L C Y + FLK+ P ++EEL LDP V+ H+ + D+EI ++ + K +ER
Sbjct: 293 ESKPSRLHCRYNTITTPFLKLAPFRMEELSLDPYVIFYHNVLSDAEIEKLKPMGKPFLER 352
Query: 75 GKVVNY---GDTIYVDTRLSKVYFLYPEIFGDH-PFLYKIQTRIQDMTNLVIGREERYKG 130
KV D I + + I D L +I RI+DMT G R
Sbjct: 353 AKVFRVEKGSDEIDPSRSADGAWLPHQNIDPDDLEVLNRIGRRIEDMT----GLNTRSGS 408
Query: 131 PLQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVF 185
+Q YG GGH+ H D T E + R+A+ +FYL +V+ GGAT+FP LNL V
Sbjct: 409 KMQFLKYGFGGHFVPHYDYFNSKTFSLETVGDRIATVLFYLNNVDHGGATVFPKLNLAVP 468
Query: 186 PEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
+KGSA+FW+N + D R +H CP+ G K
Sbjct: 469 TQKGSALFWHNIDRKSYDYDTRTFHGACPLISGTK 503
>gi|443707037|gb|ELU02831.1| hypothetical protein CAPTEDRAFT_181697 [Capitella teleta]
Length = 538
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 73/208 (35%), Positives = 111/208 (53%), Gaps = 18/208 (8%)
Query: 23 CFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY-- 80
C Y + F+ + P K E ++LDP + H+ + D E + I +SK K+ R V Y
Sbjct: 316 CNYVRPHPMFILV-PAKEEVMFLDPFIAIYHNLMTDKEADMIKRISKPKLHRSGVFTYSG 374
Query: 81 GDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
G+ V D R SK ++ E +HP + ++ R +T+L + E + Q+ NYG+
Sbjct: 375 GNQKPVQDYRTSKSAWIEDE---EHPMIRRVSERTSALTDLSLDTVELF----QVVNYGI 427
Query: 140 GGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D A P + + R+ + +FY+ E GGAT+FP L + ++PEKGS
Sbjct: 428 GGHYEPHFDFARPNEIATFDPEVGNRIITVIFYVAAPEAGGATVFPDLGVKLWPEKGSCA 487
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
W+N N DYR H+GCP G+KW
Sbjct: 488 VWWNLMRNGEGDYRTKHAGCPTITGSKW 515
>gi|198449650|ref|XP_001357661.2| GA13747 [Drosophila pseudoobscura pseudoobscura]
gi|198130701|gb|EAL26795.2| GA13747 [Drosophila pseudoobscura pseudoobscura]
Length = 533
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 74/223 (33%), Positives = 119/223 (53%), Gaps = 14/223 (6%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y CQG + ++L+CF + + + + PL+VE+++LDP + H + +I+
Sbjct: 287 YSRLCQGRRLPEKGSGTSLRCFLDGKRHAYFTLAPLQVEQVHLDPDIDVYHGILTLDQID 346
Query: 63 RIIELS-KGKVERGKVVNYGDT-IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I E + K ++ R V G T VD R+S+ +L E P + I + ++
Sbjct: 347 SIFEAADKQEMTRSGVAGDGGTRTVVDLRVSQQTWLDYE----SPIMKSIARLVVFISGF 402
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
I E +Q+ NYG+GG Y+ H D P D R+++ MFYL+DVE GG T+
Sbjct: 403 DIAGAE----AMQVANYGVGGQYEPHPDYFEVNLPSDFKGDRISTSMFYLSDVEQGGYTV 458
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
F LN+ + P KG+ V W+N H + +D R +H+GCPV +G+K
Sbjct: 459 FTKLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIVGSK 501
>gi|195128345|ref|XP_002008624.1| GI13596 [Drosophila mojavensis]
gi|193920233|gb|EDW19100.1| GI13596 [Drosophila mojavensis]
Length = 527
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/231 (34%), Positives = 121/231 (52%), Gaps = 27/231 (11%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y L C+G P+ ++NL C Y + FL++ P K+EE+ LDP +V H+ I DSE
Sbjct: 287 EPYYLGCRG--GYPK--RTNLHCRYNTTTTPFLRLAPFKMEEVSLDPYIVLYHNVISDSE 342
Query: 61 INRIIELSKG---KVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
I I + + + R ++N D + R+ V + P F +I RI D+
Sbjct: 343 IEDIKQHATNFTNGLSRNPLLNVTDKPQIVARMQWVEKMTP-------FTDRINLRITDI 395
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-------GLW-RLASFMFYLTDV 169
T + + +QI NYG+GGH+ H D T G+ R A+ +FY +D+
Sbjct: 396 TGFGVDECK----TVQIANYGIGGHFIPHFDYTTEGRVSINDTFGIGDRTATIVFYASDM 451
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ GGAT+FP++ +TV P+KGSA+ WYN + + HS CPV G++W
Sbjct: 452 Q-GGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLTLHSVCPVISGSRW 501
>gi|195452728|ref|XP_002073474.1| GK14137 [Drosophila willistoni]
gi|194169559|gb|EDW84460.1| GK14137 [Drosophila willistoni]
Length = 536
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/224 (32%), Positives = 124/224 (55%), Gaps = 16/224 (7%)
Query: 3 YPLACQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y CQG+ +PE +L C+ ++ + + PLKVE++++DP + H + D++I
Sbjct: 290 YTRLCQGH-RLPEPFTGKSLHCYLDAKRHVSFILAPLKVEQVHVDPDINVYHGVLNDAQI 348
Query: 62 NRIIELS-KGKVERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+I++ S + ++ R V + G D R+S+ +L P + + I D++
Sbjct: 349 EKILQESDQNEMMRSAVSGDKGSATIADLRVSQQTWLNY----SSPIMRSLSNLISDISG 404
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGAT 175
+ E+ +Q+ NYG+GG Y+ H D P++ R+++ MFYL+DVELGG T
Sbjct: 405 FDMAGAEQ----MQVANYGVGGQYEPHPDYFEVNLPQEFKGDRISTSMFYLSDVELGGNT 460
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
+F LN+ + P KG+ V W+N H + +D R H+GCPV +G+K
Sbjct: 461 VFIKLNVFLPPIKGAMVMWHNLHYSLDVDRRTIHAGCPVLIGSK 504
>gi|194871359|ref|XP_001972833.1| GG13662 [Drosophila erecta]
gi|190654616|gb|EDV51859.1| GG13662 [Drosophila erecta]
Length = 515
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 121/226 (53%), Gaps = 29/226 (12%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
+ CQG P+ K+NL C Y N FLK+ PLK+EE+ DP +V H+ I D EI +
Sbjct: 285 IGCQGLF--PK--KTNLVCRYNFSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKEIEEM 340
Query: 65 IELSKGKVERGKVVNYGDTIYVDTR--LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
KG++ K + G T + + +S +Y++ E F +I RI DMT +
Sbjct: 341 ----KGEI---KQMENGWTSLEEPKEIVSHIYWITKE----SSFSKRINDRISDMTGFKV 389
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDA-TPRDEGLW-------RLASFMFYLTDVELGGA 174
E + +Q+ N+G+GG++ H D T R + L RLAS + Y +V GG
Sbjct: 390 ---EEFPA-IQLANFGVGGYFKPHYDYYTERLKELDANNTLGDRLASIIIYAGEVSQGGQ 445
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T+FP + + V P+KG A+FW+N ++ D R HS CPV +G++W
Sbjct: 446 TVFPDIKVAVEPKKGKALFWFNDFDDSSPDPRSLHSVCPVIVGSRW 491
>gi|198449506|ref|XP_002136910.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
gi|198130637|gb|EDY67468.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
Length = 543
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 76/212 (35%), Positives = 111/212 (52%), Gaps = 16/212 (7%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y + FL++ PL++EEL LDP +V H+ + D E+ R+ +S + R ++
Sbjct: 306 ARLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARIF 365
Query: 79 NYGDT---IYVDTRLSKVYFLYPEIFGDHPFLYK-IQTRIQDMTNLVIGREERYKGPLQI 134
+ I +V P++ + L + IQ RI D+T L++ R +Q
Sbjct: 366 DKETKKPKISPVRSADEVGIPNPKLVTEDIQLVECIQKRITDLTGLMLTSMRR----IQF 421
Query: 135 NNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
YG GG Y H D T R G R+A+ +FYL DVE GGAT FP+L+L V E+
Sbjct: 422 LKYGFGGIYVPHHDFFSVHTPTSRLHGD-RIATVIFYLNDVEHGGATAFPNLDLVVPTER 480
Query: 189 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
G+ +FW+N T LDYR H CPV +G K
Sbjct: 481 GAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 512
>gi|198466397|ref|XP_002135180.1| GA23908 [Drosophila pseudoobscura pseudoobscura]
gi|198150581|gb|EDY73807.1| GA23908 [Drosophila pseudoobscura pseudoobscura]
Length = 403
Score = 117 bits (293), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 74/207 (35%), Positives = 102/207 (49%), Gaps = 28/207 (13%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
N C YES FL++ PLKVE L LDP + HD IY+ EI R++ L+ ++
Sbjct: 212 NRSCHYESTRTAFLRLAPLKVEMLSLDPYIAIYHDVIYEREIARVMTLALSSLK------ 265
Query: 80 YGDTIYVDTRLS--KVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
G Y R K +Y E ++ R +DMT G + + +I N
Sbjct: 266 -GPGRYSKRREHNIKSVTVYEEENS------QLNQRTRDMT----GEQVKEDKDFRIYNS 314
Query: 138 GLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 197
G+GG+ H D ++E L +V GGA FP L TV+P KGSA+ W+N
Sbjct: 315 GIGGYIRYHMDNLAKEEQ---------QLNEVPHGGAISFPQLEFTVWPRKGSALVWHNL 365
Query: 198 HANTLLDYRMYHSGCPVALGNKWGKLL 224
+ N LDYR+ H CPV +G+KW K L
Sbjct: 366 NNNLELDYRVAHISCPVIVGSKWSKFL 392
>gi|291224083|ref|XP_002732036.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit-like [Saccoglossus
kowalevskii]
Length = 491
Score = 117 bits (293), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 78/225 (34%), Positives = 116/225 (51%), Gaps = 17/225 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y C+G P D S +KC Y + N L + P K E ++ +PRVV HD I D E
Sbjct: 257 DAYEALCRGERRKPLD-SSKVKCQYVTNGNYRLLLQPAKQEIMHHNPRVVLYHDVISDEE 315
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIF---GDHPFLYKIQTRIQDM 117
IN +I+L+K K+ R VV G + Y + + D + K+ RI D+
Sbjct: 316 INEVIKLAKPKLRRSLVVTKGSSPSGTGSSDAEYRVSSGGWLEDWDGTVIAKLTRRISDI 375
Query: 118 TNL--VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGAT 175
+ L + E R+ LQI N D+H + R+A++MFY+++V+ GG T
Sbjct: 376 SGLSTLTAPEYRHAEALQIENS------DVHLPGSRN-----RIATWMFYMSEVKAGGYT 424
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+FP ++ V P K +AVFWYN A+ D H+GCPV +G+KW
Sbjct: 425 VFPEVDAFVPPVKNAAVFWYNLKASGESDDLTRHAGCPVLIGSKW 469
>gi|449668268|ref|XP_002154169.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 531
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 74/232 (31%), Positives = 121/232 (52%), Gaps = 16/232 (6%)
Query: 3 YPLAC---QGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
Y AC Q ++ +NL CFY++ N L + PLKV ++ +P V+ H+ I +
Sbjct: 296 YARACRRDQRTKTIAVKDVNNLVCFYKN-NKPRLILKPLKVTRMHDNPDVLVFHEMITEE 354
Query: 60 EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
+I +++ ++ +V++ + R+SK F + K++ ++D
Sbjct: 355 VAEKIRDVANPRLRPSEVIDPIIQKHVTASYRVSKNVFFDDAFEEELEISRKLRPLVEDA 414
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRD--EGLWRLASFMFYLTDVEL 171
T+L + + LQ+NNYGLGG Y+ H D +P D E R+A+ + YL+DVE
Sbjct: 415 TDL----NDDFSEQLQVNNYGLGGQYEFHVDFGDPGSPLDKHEHGNRIATLLIYLSDVER 470
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKL 223
GG T+F L L++ P+ G A FW+N + N Y H+ CPV G+KWGK+
Sbjct: 471 GGDTVFTRLGLSLKPKLGDAAFWHNLYKNGSGIYATEHASCPVVSGSKWGKI 522
>gi|198449504|ref|XP_002136909.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
gi|198130636|gb|EDY67467.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
Length = 527
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 71/211 (33%), Positives = 112/211 (53%), Gaps = 14/211 (6%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y + FL++ PL++EEL LDP +V H+ + D+EI + +++ ++R V
Sbjct: 295 SRLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVF 354
Query: 79 N-YGDTIYVDTRLSKVYFLYPEIFGD---HPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
+ G+ + R + + P+ D + +I RI ++T L+I + +Q+
Sbjct: 355 DGKGNKMSTSKRRTALGAWLPDDNMDVSGRAVIQRIFRRIHELTGLIINDRQ----DMQL 410
Query: 135 NNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
YG GGHYD+H D +TP + R+A+ +FYL D++ GG+T F L L V E+G
Sbjct: 411 IKYGYGGHYDIHFDYFNTSTPITKARGDRMATVLFYLNDMKHGGSTAFTDLQLKVPSERG 470
Query: 190 SAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
+FWYN T +D R H CPV G K
Sbjct: 471 KVLFWYNMRGETHDVDSRTLHGACPVINGTK 501
>gi|17552840|ref|NP_499464.1| Protein DPY-18 [Caenorhabditis elegans]
gi|20455505|sp|Q10576.2|P4HA1_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; AltName: Full=Protein dumpy-18; Flags:
Precursor
gi|3881011|emb|CAA21045.1| Protein DPY-18 [Caenorhabditis elegans]
gi|6900013|emb|CAB71298.1| prolyl 4-hydroxylase alpha subunit 1 [Caenorhabditis elegans]
Length = 559
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 78/229 (34%), Positives = 112/229 (48%), Gaps = 18/229 (7%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+ + V + S L C+Y+ + FL P+KVE +P V D I D E+
Sbjct: 284 MYEALCRNEVPVSQKDISRLYCYYKR-DRPFLVYAPIKVEIKRFNPLAVLFKDVISDDEV 342
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
I EL+K K+ R V + G + R+SK +L E GD + + RI MTN
Sbjct: 343 AAIQELAKPKLARATVHDSVTGKLVTATYRISKSAWL-KEWEGD--VVETVNKRIGYMTN 399
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
L + E LQI NYG+GGHYD H D ++E R+A+ +FY++
Sbjct: 400 LEMETAEE----LQIANYGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSH 455
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F T+ P K A+FWYN + + H+ CPV +G KW
Sbjct: 456 GGGTVFTEAKSTILPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 504
>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
Length = 300
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 73/225 (32%), Positives = 118/225 (52%), Gaps = 19/225 (8%)
Query: 7 CQGNLSVPEDIKSN-LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
C GN ++P + LKC+Y Y ++ + P +EE+ DP ++ H+ ++E+ +
Sbjct: 64 CIGNENLPAKSSGHHLKCYY-FYPSSKTRFMPYAIEEMSRDPLIILYHNLTSNAEMESLK 122
Query: 66 ELSKGKVERGKV--VNYGDTIYVD--TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
L+ +++ V D ++ TR++K+ F+ E + I R+QD+T L
Sbjct: 123 ALAAKQLQPAGVYHTTSADNRNLEGYTRIAKMAFILDE---ESAVASAITQRLQDVTGLN 179
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGAT 175
+ E PLQ+ NYG+ G Y H D P G RLA+ + YL+DVE GGAT
Sbjct: 180 MNFSE----PLQVINYGIAGQYTPHYDTFPAKSGDRSHPSHDRLATAILYLSDVERGGAT 235
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+F ++N+ V P KG+ + WYN + L H+GCPV +G+KW
Sbjct: 236 VFTNINVRVLPRKGNVIIWYNYLPDGNLHPGTLHAGCPVLVGSKW 280
>gi|341884171|gb|EGT40106.1| CBN-PHY-2 protein [Caenorhabditis brenneri]
Length = 607
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 130/292 (44%), Gaps = 73/292 (25%)
Query: 1 EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y C+G + V E ++ L+C+ + + FLKI P+KVE L DP V + I DS
Sbjct: 279 DAYEALCRGEIPPVEEKWRNKLRCYLKR-DKPFLKIAPIKVEILRFDPLAVLFKNVISDS 337
Query: 60 EINRIIELSKGKVERGKVVNY-GDTIYVDTRLSK-------VYFLYPE------------ 99
EI I EL+ K+ER V G I VD R++K ++ + P+
Sbjct: 338 EIEVIKELASPKLERATVKGPDGTLITVDYRIAKRLVNWNTLHIVSPKGGFPKSKKMKNK 397
Query: 100 --------IFGD-HPFLYKIQTRIQDMTNLVIGREERYKGP--------------LQINN 136
+ GD P + ++ RI+D T L E + +I N
Sbjct: 398 CLVGFSAWLKGDLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFARIAN 457
Query: 137 YGLGGHYDLHCDATPR---------------------DEGLW-------RLASFMFYLTD 168
YGLGGHY+ H D + R ++ + R+A+ +FY++
Sbjct: 458 YGLGGHYEPHYDMSLRGVPEPYGKNGNRIATVLFYKEEKNAFKTLNTGNRIATVLFYMSQ 517
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
ELGGAT+F L VFP K A+FWYN + D R H+ CPV LG KW
Sbjct: 518 PELGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKW 569
>gi|427783867|gb|JAA57385.1| Putative prolyl 4-hydroxylase subunit alpha-1 [Rhipicephalus
pulchellus]
Length = 548
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 79/236 (33%), Positives = 118/236 (50%), Gaps = 26/236 (11%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + S L+C Y N FL++ P+K+EE L P ++ HD I D +IN
Sbjct: 292 YKRLCRGEQLRTPKMDSQLRCRYYYGRNGFLRLQPVKIEEANLKPYIITFHDIIGDRDIN 351
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
++ + ++ R +YG+ T S + GD + TR+ ++
Sbjct: 352 DLLAYATPRLFRS--THYGEH---GTETSLIRTSSTAWLGDQD--APVATRLNRFVESLL 404
Query: 123 GREERY-KGPL---QINNYGLGGHYDLHCD------ATP--------RDEGLWRLASFMF 164
G +Y KG Q+ NYG+GG Y H D A P R G R+A+ MF
Sbjct: 405 GLGSQYLKGEAEYYQLANYGVGGQYIAHHDFLADIYADPNRKLDDFERSAGD-RIATLMF 463
Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
YL+DVE GGAT+FP L + + P+KG+A FW+N +++ + H GCPV G+KW
Sbjct: 464 YLSDVEEGGATVFPHLGVRLTPKKGNAAFWWNLNSDGEGEQLTKHGGCPVLYGSKW 519
>gi|326914688|ref|XP_003203656.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Meleagris
gallopavo]
Length = 539
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 79/228 (34%), Positives = 115/228 (50%), Gaps = 13/228 (5%)
Query: 1 EIYPLACQG-NLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y CQG + + S L C YE+ + +L + P K E L L P +V HD + D+
Sbjct: 294 DAYEELCQGLGAQMAPEQPSQLGCSYETNGSPYLLLQPAKKETLRLQPYIVLYHDFVSDA 353
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
E I L+ ++R V + V+ R+SK +L P + ++ R+ +T
Sbjct: 354 EAETIKGLAGPWLQRSVVASGEKQQKVEYRISKSAWLKDTA---DPVVRALELRMAAITG 410
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
L + Y LQ+ NYGLGGHY+ H D AT R L+R+ A+ M YL+ VE G
Sbjct: 411 LDL--RPPYAEYLQVVNYGLGGHYEPHFDHATSRKSPLYRMKSGNRIATVMIYLSAVEAG 468
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G+T F N +V K +A+FW+N N D H+GCPV G+KW
Sbjct: 469 GSTAFIYANFSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKW 516
>gi|301626782|ref|XP_002942567.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Xenopus
(Silurana) tropicalis]
Length = 716
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 112/221 (50%), Gaps = 20/221 (9%)
Query: 1 EIYPLACQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
++Y CQ S P + ++ C Y++ ++ +L + P+K E + L P+VV HD + D
Sbjct: 492 DLYEGLCQTLGSQPTSYEDPHMSCMYDTNSHPYLLLQPMKKEIVSLRPQVVLYHDFVSDL 551
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
E +I EL+ + R V + + R+SK +L I HPF+ + TRI +T
Sbjct: 552 EAEKIKELASPWLHRSVVASGEKQAEAEYRISKSAWLKDTI---HPFVQNLDTRISGVTG 608
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
L Y LQ+ NYG+GGHY+ H D L+ V+LGG+T F
Sbjct: 609 L--NAHPPYAEYLQVVNYGIGGHYEPHFDHAT--------------LSHVDLGGSTAFVF 652
Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N + K +AVFW+N H N L D H+GCPV +G+KW
Sbjct: 653 ANFSSPVVKNAAVFWWNLHRNGLGDEDTLHAGCPVIIGSKW 693
>gi|363729586|ref|XP_417248.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Gallus gallus]
Length = 542
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 79/228 (34%), Positives = 116/228 (50%), Gaps = 13/228 (5%)
Query: 1 EIYPLACQG-NLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y CQG + + S+L C YE+ + +L + P K E L L P +V HD + D+
Sbjct: 297 DAYEELCQGLGAQMAPERPSHLGCSYETNGSPYLLLQPAKKETLRLQPYIVLYHDFVSDA 356
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
E I L+ ++R V + V+ R+SK +L P + ++ R+ +T
Sbjct: 357 EAETIKGLAGPWLQRSVVASGEKQQKVEYRISKSAWLKDTA---DPVVQALELRMAAITG 413
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
L + Y LQ+ NYGLGGHY+ H D AT R L+R+ A+ M YL+ VE G
Sbjct: 414 LDL--RPPYAEYLQVVNYGLGGHYEPHFDHATSRKSPLYRMKSGNRIATVMIYLSAVEAG 471
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G+T F N +V K +A+FW+N N D H+GCPV G+KW
Sbjct: 472 GSTAFIYANFSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKW 519
>gi|194751829|ref|XP_001958226.1| GF23628 [Drosophila ananassae]
gi|190625508|gb|EDV41032.1| GF23628 [Drosophila ananassae]
Length = 484
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 76/212 (35%), Positives = 118/212 (55%), Gaps = 24/212 (11%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
++NL C Y + FLK+ PLK+EE+ LDP +V H+ I D EI E KG ++
Sbjct: 265 QNNLVCRYNATTTPFLKLAPLKLEEVSLDPYIVLYHNVISDREI----EEMKGLIDE--- 317
Query: 78 VNYGDTIYVDTR--LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
++ G T ++R +S++ +L E F ++ RI+D+T + + +G LQI
Sbjct: 318 MDNGWTDLNESREIVSRLVWLTKE----SRFRKRLNLRIRDITGFNV---DEIRG-LQIA 369
Query: 136 NYGLGG----HYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
N+G+GG HYD + R R+AS +FY+ DV GG T+FP + + V P+K
Sbjct: 370 NFGVGGQFKPHYDYFTERILRLNNTILGDRIASIIFYVGDVVHGGQTVFPDIQIAVKPQK 429
Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GS++FW+N + D R HS CPV +G++W
Sbjct: 430 GSSLFWFNTFDDATPDPRSLHSVCPVLIGDRW 461
>gi|195440206|ref|XP_002067933.1| GK11220 [Drosophila willistoni]
gi|194164018|gb|EDW78919.1| GK11220 [Drosophila willistoni]
Length = 459
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 75/235 (31%), Positives = 115/235 (48%), Gaps = 33/235 (14%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y L C+G P L C Y + FL++ PLK EE+ LDP +V HD ++D EI
Sbjct: 224 YHLGCRGLFLPP----GKLVCRYNFTTSPFLRLAPLKQEEINLDPYIVVYHDVLHDREIA 279
Query: 63 RIIE------LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
++ E +S +E K +++ +V + F+ + RI D
Sbjct: 280 QMKEEMANAHISNAWIEERKANQ--------SQMRQVIGRVSWLTDSSNFMDSVNQRIMD 331
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---------RLASFMFYLT 167
MT + E LQ+ NYG G ++ H D EG RLAS +FY +
Sbjct: 332 MTGFSMKGIE----SLQVCNYGPGCNFKPHYDYMA--EGYEPPNILTLGDRLASVIFYAS 385
Query: 168 DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
+V LGGAT+FP L++ + P+KG+ + WYN + ++ D R H+ CP +G++W K
Sbjct: 386 EVHLGGATVFPRLDVAITPKKGAGLVWYNTYDDSTHDQRSQHAVCPTLMGSRWSK 440
>gi|198459366|ref|XP_002138685.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
gi|198136669|gb|EDY69243.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
Length = 448
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 80/229 (34%), Positives = 110/229 (48%), Gaps = 18/229 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y C+G P+ L C Y+ + + L++ P KVE L DP + HD IYDSE
Sbjct: 210 EHYVRGCRGLFDPPK----GLSCHYDFHTHPVLRLAPFKVEPLSQDPYIAMYHDVIYDSE 265
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPF--LYKIQTRIQDM 117
I + + + +ER KV Y D DT R S F DH + + K+ R+ M
Sbjct: 266 IEELKDNAFPDMERSKVYTYSDKDGKDTGRTSMSAFQ-----TDHQYTAVTKVNRRVMHM 320
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLWR---LASFMFYLTDVELG 172
T + + L + NY Y H D E + R +A+ +FYL DVE G
Sbjct: 321 TGFEV-LADGSSDELLVLNYATAAQYLTHSDYFGPAYSEYIQRGDRIATVLFYLNDVEQG 379
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
G T+FP L + P KGSAV +YN +++ D R H GCPV +G KW
Sbjct: 380 GKTVFPRLGIFRSPMKGSAVVFYNLNSSLQGDPRTEHGGCPVLVGTKWA 428
>gi|195128343|ref|XP_002008623.1| GI13594 [Drosophila mojavensis]
gi|193920232|gb|EDW19099.1| GI13594 [Drosophila mojavensis]
Length = 511
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 76/231 (32%), Positives = 120/231 (51%), Gaps = 27/231 (11%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y L C+G ++NL C Y + FL++ P K+EE+ LDP +V H+ I D E
Sbjct: 275 EPYYLGCRGGYPK----RTNLHCRYNTTTTPFLRLAPFKMEEVSLDPYIVLYHNVISDRE 330
Query: 61 INRIIELSKGKVERGKV---VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
I + + + + +N D + R+ V + P F +I RI D+
Sbjct: 331 IEDMKQHATNFANGLSISPDLNVTDKPQIVARMQWVRKMTP-------FTDRINLRITDI 383
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW----RLASFMFYLTDV 169
T + + +K +QI NYG+GGH+ H D T D E ++ R A+ +FY ++V
Sbjct: 384 TGFEV---DEFKA-VQIGNYGIGGHFMPHFDYTTPDRLRIEDIYGLGDRTATIVFYASEV 439
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ GGAT+FP++ +TV P+KGSA+ WYN + + H+ CPV G++W
Sbjct: 440 Q-GGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLSLHTACPVISGSRW 489
>gi|195452736|ref|XP_002073477.1| GK14138 [Drosophila willistoni]
gi|194169562|gb|EDW84463.1| GK14138 [Drosophila willistoni]
Length = 518
Score = 116 bits (290), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 74/227 (32%), Positives = 117/227 (51%), Gaps = 34/227 (14%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+ P C G V ++++ L C Y + ++ FL+I P+K+E L L+P +V HD I E
Sbjct: 298 VLPFCCNGKCQVSKELQ--LYCLYNTKDSYFLRIAPVKMEVLSLNPYIVLYHDFILPREQ 355
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDT--------RLSKVYFLYPEIFGDHPFLYKIQTR 113
+ K + K ++ +TIY DT R +K + + +I R
Sbjct: 356 GSL------KAQSIKYLSVAETIYPDTGEWQADSSRTAKAMWFED---SSAEVISRISQR 406
Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGG 173
I+D+TNL + E Y QI NYG+GG Y+ H D +E L DV GG
Sbjct: 407 IEDITNLNPEKGELY----QIINYGIGGLYETHYDYLYENE-----------LQDVPQGG 451
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
AT+ +++L+VFP+ G+A+FWYN + ++ + H+ CPV +G+KW
Sbjct: 452 ATLLNNISLSVFPKAGAALFWYNLNNAGDTEWNVAHTACPVIVGSKW 498
>gi|444731524|gb|ELW71877.1| Prolyl 4-hydroxylase subunit alpha-3 [Tupaia chinensis]
Length = 562
Score = 116 bits (290), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 79/239 (33%), Positives = 122/239 (51%), Gaps = 27/239 (11%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P++ E ++L+P + HD + DSE +I
Sbjct: 303 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPVRKELIHLEPYIALYHDFVSDSEAQKIR 362
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSK-----------------VYFLYPEIFGDHPFLY 108
L++ ++R V + + V+ R+SK VYF P L
Sbjct: 363 ALAEPWLQRSVVASGEKQLQVEYRISKRRRLVVSGIASLMPQSVVYFSAWLKDTVDPMLV 422
Query: 109 KIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------AS 161
+ RI +T L + + Y LQ+ NYG+GGHY+ H D AT L+R+ A+
Sbjct: 423 TLDHRIAALTGLDV--QPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVAT 480
Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FM YL+ VE GGAT F N +V K +A+FW+N H + + H+GCPV +G+KW
Sbjct: 481 FMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGNSDTLHAGCPVLVGDKW 539
>gi|443721482|gb|ELU10773.1| hypothetical protein CAPTEDRAFT_174752 [Capitella teleta]
Length = 525
Score = 115 bits (289), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 83/241 (34%), Positives = 116/241 (48%), Gaps = 28/241 (11%)
Query: 1 EIYPLACQG-NLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y C+G L +P D+ S + Y L K E L P +V HD + D+
Sbjct: 270 KAYEALCRGEQLKLP-DVDSEQQALKCRYKPGILPFVRYKEEMLNRKPHIVLFHDVMSDA 328
Query: 60 EINRIIELSKGKVERGKVVN----YGDTIYVDTRLSKVYFLYPEIFGDHP--FLYKIQTR 113
E + + K+ER V + +G + R+S+V +L+ DH ++++ R
Sbjct: 329 EAKTMKMEAMHKLERAHVADNENKHGHSASA-KRISQVSWLW----DDHANKTIHQLSRR 383
Query: 114 IQDMTNLVIGREE--RYKGPLQINNYGLGGHYDLHCD---------ATP---RDEGLWRL 159
+ D+T L G P QI NYG+GG Y+ H D + P R G RL
Sbjct: 384 VADITGLQTGVVSGLHSAEPFQILNYGIGGQYEPHVDYFAGNHSHSSLPEHVRASGN-RL 442
Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
A+FMFYL DV GGAT+FP L + + P K A FWYN N +D H+GCPV LG K
Sbjct: 443 ATFMFYLNDVHAGGATVFPKLKVGIPPTKNGAAFWYNIGLNGDVDPLTEHAGCPVLLGQK 502
Query: 220 W 220
W
Sbjct: 503 W 503
>gi|198466399|ref|XP_002135181.1| GA23909 [Drosophila pseudoobscura pseudoobscura]
gi|198150582|gb|EDY73808.1| GA23909 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/213 (31%), Positives = 113/213 (53%), Gaps = 21/213 (9%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSK---GKVER 74
++NL C Y S FL++ PLK+EE+ DP +V H + D E+ + +L++ +
Sbjct: 303 RTNLVCRYNSTTTPFLRLAPLKMEEVNHDPYIVMYHQVLSDREMEEMKQLARPMTNGMSG 362
Query: 75 GKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
++ N + + + R++ + PF ++ RI DMT + +K LQ+
Sbjct: 363 SEMANLTEPLEIVARVAW-------LIEASPFRERLNLRIGDMTGFDVSD---FKA-LQL 411
Query: 135 NNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
N+G+G ++ H D T R L R S +FY ++V GGATIFP + +TV P+
Sbjct: 412 ANFGVGSYFKAHYDYRTERVNDLGVTELGDRTGSIIFYASEVPQGGATIFPDIQVTVTPQ 471
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
KG+++FW+N ++ D R H+ CPV G++W
Sbjct: 472 KGNSLFWFNTFDDSTPDPRSLHAICPVIAGSRW 504
>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
Length = 455
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 79/229 (34%), Positives = 110/229 (48%), Gaps = 18/229 (7%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y C+G P+ L C Y+ + + L++ P KVE L DP + HD IYDSE
Sbjct: 217 EHYVRGCRGLFDPPK----GLSCHYDFHTHPVLRLAPFKVEPLSQDPYIAMYHDVIYDSE 272
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPF--LYKIQTRIQDM 117
I + + + +ER KV Y D +T R S F DH + + K+ R+ M
Sbjct: 273 IEELKDNAFPDMERSKVYTYSDEDSKNTGRTSMSAFQ-----TDHQYKAVTKVNRRVMHM 327
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLWR---LASFMFYLTDVELG 172
T + + L + NY Y H D E + R +A+ +FYL DVE G
Sbjct: 328 TGFEV-LADGSSDELLVLNYATAAQYLTHSDYFGPAYSEYIQRGDRIATVLFYLNDVEQG 386
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
G T+FP L + P KGSAV +YN +++ D R H GCPV +G KW
Sbjct: 387 GKTVFPRLGIFRSPMKGSAVVFYNMNSSLQGDPRTEHGGCPVLVGTKWA 435
>gi|426365135|ref|XP_004049642.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Gorilla gorilla
gorilla]
Length = 500
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 77/228 (33%), Positives = 115/228 (50%), Gaps = 27/228 (11%)
Query: 8 QGNLSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
Q S + L C Y N N + P K E+ + PR+++ HD I D+EI + +
Sbjct: 262 QSTASFTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 321
Query: 67 LSKGKVERGKVVN--YGDTIYVDTRLSK--VYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
L+K ++ R V + G R+SK + LY L + TR+ + L
Sbjct: 322 LAKPRLSRATVHDPETGKLTTAQYRVSKRTICLLYIN-------LKRYYTRLGFLFLLY- 373
Query: 123 GREERYKGPL--QINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELG 172
P Q+ NYG+GG Y+ H D +DE R+A+++FY++DV G
Sbjct: 374 ----NTTCPFVPQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAG 429
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV +GNKW
Sbjct: 430 GATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 477
>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
Length = 517
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 115/227 (50%), Gaps = 19/227 (8%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
L C+G P L C Y + FL++ P K E L L P +V HD I E +
Sbjct: 282 LCCRG--GCPYRDMHRLTCSYNTTAAPFLRLAPFKTEILSLSPYMVLYHDVITPLESLTL 339
Query: 65 IELSKGKVERGKVV---NYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
LSK ++R +V N ++D+ R S +L ++ + +++ R+ MTN
Sbjct: 340 KNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSNSVWLASH---ENAVMERLERRVGVMTNF 396
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDE----GLWRLASFMFYLTDVELGGA 174
+ E Y Q+ NYG+GGHY H D TP+ G R+A+ +FYL+DV GGA
Sbjct: 397 EMENSEVY----QLINYGIGGHYKPHTDHFETPQAPEHRGGGDRIATVLFYLSDVPQGGA 452
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
T+FP LN++V P +G A+ WYN + + H+ CP+ G+KW
Sbjct: 453 TLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWA 499
>gi|195166675|ref|XP_002024160.1| GL22879 [Drosophila persimilis]
gi|194107515|gb|EDW29558.1| GL22879 [Drosophila persimilis]
Length = 484
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/207 (34%), Positives = 101/207 (48%), Gaps = 28/207 (13%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
N C YES F+++ PLKVE L LDP + HD IY+ EI R++ L+ ++
Sbjct: 293 NRSCHYESTRTAFVRLAPLKVEMLSLDPYIAIYHDVIYEREIARVMTLALSSLK------ 346
Query: 80 YGDTIYVDTRLS--KVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
G Y R K +Y E ++ R +DMT G + + +I N
Sbjct: 347 -GPGRYSKRREHNIKSVTVYEEENS------QLNQRTRDMT----GEQVKEDKDFRIYNS 395
Query: 138 GLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 197
G+GG+ H D ++E L +V GGA FP L TV+P KGSA+ W+N
Sbjct: 396 GIGGYIRYHMDNLAKEEQ---------QLNEVPHGGAISFPQLEFTVWPRKGSALVWHNL 446
Query: 198 HANTLLDYRMYHSGCPVALGNKWGKLL 224
+ N LDYR+ H CPV +G+KW K
Sbjct: 447 NNNLELDYRVAHISCPVIVGSKWSKFF 473
>gi|194905424|ref|XP_001981193.1| GG11755 [Drosophila erecta]
gi|190655831|gb|EDV53063.1| GG11755 [Drosophila erecta]
Length = 527
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/224 (32%), Positives = 121/224 (54%), Gaps = 16/224 (7%)
Query: 3 YPLACQGNLSVPEDIKSN-LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y + CQG +PE+ ++ LKC+ + + + + PL+VE ++LDP + H + ++I
Sbjct: 281 YTMLCQGR-RLPEERSADPLKCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSANQI 339
Query: 62 NRII-ELSKGKVERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
I+ E K ++ R V N G++ D R+S+ +L + + + +
Sbjct: 340 LSILDEAEKMQMFRSAVSGNGGNSTVKDLRVSQQTWL--------DYKSAVMKSVGRINE 391
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGAT 175
LV G + +Q+ NYG+GG Y+ H D P + R+++ MFYL+DVE GG T
Sbjct: 392 LVSGFDMAGAEYMQVANYGVGGQYEPHPDYFGVNLPVEFKGDRISTSMFYLSDVEQGGYT 451
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
+FP LN+ + P G+ V W+N H + +D R H+GCPV +G+K
Sbjct: 452 VFPKLNVFLPPVSGALVMWHNLHRSLDVDARTLHAGCPVIVGSK 495
>gi|313242424|emb|CBY34571.1| unnamed protein product [Oikopleura dioica]
Length = 503
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 75/230 (32%), Positives = 119/230 (51%), Gaps = 15/230 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
E Y C+ +P + LKCFY + N+ FL +GP+K EEL+ +P +++ ++ I D
Sbjct: 245 EYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYEIITDE 304
Query: 60 EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEI-FGDHPFLYKIQTRIQD 116
E++ I E ++ K V + G + D R+S+ +L L + + RI
Sbjct: 305 ELDIINEQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFRKRISI 364
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDV 169
+T L + R E +Q +NYG+GG Y+ H D +T D G + R+A+++ YL +
Sbjct: 365 ITGLTMERAE----DIQYSNYGIGGQYEPHYDMSTENDAGKFDEEDGNRIATWLTYLNEP 420
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
+ GG T+F + P SAVFWYN + DYR H+ CPV +G K
Sbjct: 421 KHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 470
>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
Length = 513
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 75/223 (33%), Positives = 113/223 (50%), Gaps = 15/223 (6%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
L C+G P L C Y + FL++ P K E L L P +V HD I E +
Sbjct: 282 LCCRG--GCPYRDMHRLTCSYNTTAAPFLRLAPFKTEILSLSPYMVLYHDVITPLESLTL 339
Query: 65 IELSKGKVERGKVVNYGDTI--YVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
LSK ++R + + +D+ R S +L ++ + +++ R+ MTN
Sbjct: 340 KNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWLTSH---ENAVMERLERRVGVMTNFE 396
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLW-RLASFMFYLTDVELGGATIFP 178
+ E Y Q+ NYG+GGHY H D TP+ G R+A+ +FYL+DV GGAT+FP
Sbjct: 397 MENSEVY----QLINYGIGGHYKPHTDHFETPQHRGGGDRIATVLFYLSDVPQGGATLFP 452
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
LN++V P +G A+ WYN + + H+ CP+ G+KW
Sbjct: 453 RLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWA 495
>gi|21358233|ref|NP_651814.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
gi|20269810|gb|AAM18060.1|AF495538_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE3
[Drosophila melanogaster]
gi|15291443|gb|AAK92990.1| GH21465p [Drosophila melanogaster]
gi|23172714|gb|AAN14251.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
gi|220945610|gb|ACL85348.1| PH4alphaNE3-PA [synthetic construct]
gi|220955396|gb|ACL90241.1| PH4alphaNE3-PA [synthetic construct]
Length = 481
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/208 (34%), Positives = 107/208 (51%), Gaps = 19/208 (9%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y S + FL + PLK+EE+ L+P +V HD + D +I ++I L++ ++
Sbjct: 277 SKLHCRYNSTTSAFLILAPLKMEEISLEPHIVVYHDILPDKDIQQLITLAEPLLK----- 331
Query: 79 NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
T D ++ Y G P L + R++D+T L I R P+ I YG
Sbjct: 332 ---PTEMFDDNKNEARSSYRTPLGG-PLLDSLTQRMRDITGLQI----RQGNPINIIKYG 383
Query: 139 LGG----HYDLHCDATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
G +YD +G R+A+FMFYL D GGAT+FP LN+ V E+G +F
Sbjct: 384 FGAPYTNYYDFFKKRNSESKGFGDRMATFMFYLNDAPYGGATVFPRLNVKVPAERGKVLF 443
Query: 194 WYNAHANTL-LDYRMYHSGCPVALGNKW 220
WYN + +T ++ H+ CPV G+KW
Sbjct: 444 WYNLNGDTHDMEPTTMHAACPVFHGSKW 471
>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
Length = 498
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 110/209 (52%), Gaps = 16/209 (7%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y++ + FL + P K+E L DP +V HD IYDSEI + ++ + R V
Sbjct: 263 TRLVCSYKTKPSKFLYLAPFKMELLSEDPYIVVFHDVIYDSEIKHLRNTAEPLLHRSYVK 322
Query: 79 -NYGDTIYVDTRLSKVYFLYPEIFGDHP--FLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
+ +++ R +K F++ + + +++ R+ D+++L I RE +Q
Sbjct: 323 KSNNESVVSKVRTAKGAFMHADRLSPESAQVVQRLKQRMGDLSDLNIKREGY--NEMQYL 380
Query: 136 NYGLGGHYDLHCD---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
NY G HY LH D + D R+A+F+ YL DV GG TIFP + V PEKG +
Sbjct: 381 NYDFGDHYLLHMDYFNISMND----RIATFLIYLNDVTRGGGTIFPQVKQAVHPEKGKLI 436
Query: 193 FWYNAHANTLLDYRM--YHSGCPVALGNK 219
WYN ++N LDY + H CPV +G K
Sbjct: 437 LWYNMNSN--LDYELASLHGACPVLIGRK 463
>gi|195341584|ref|XP_002037386.1| GM12898 [Drosophila sechellia]
gi|194131502|gb|EDW53545.1| GM12898 [Drosophila sechellia]
Length = 536
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 79/215 (36%), Positives = 110/215 (51%), Gaps = 14/215 (6%)
Query: 15 EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
E S L C Y + FLK+ P ++EEL LDP VV H+ + D EI ++ +SK +ER
Sbjct: 291 ESKPSRLHCRYNTTTTPFLKLAPFRMEELSLDPYVVLYHNVLSDPEIEKLKPMSKPFLER 350
Query: 75 GKV--VNYGDTIYVDTRLSKVYFLYPEIF--GDHPFLYKIQTRIQDMTNLVIGREERYKG 130
KV V G +R + +L + D L +I RI+D+T G R
Sbjct: 351 AKVFRVEKGSDEIAPSRSADGAWLPHQDTDPDDLEVLRRIGRRIKDLT----GLNTRSGS 406
Query: 131 PLQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVF 185
+Q YG GGH+ H D T E + R+A+ +FYL +V+ GGAT FP LNL V
Sbjct: 407 QMQFLKYGFGGHFVPHYDYFNSKTSYLERVGDRIATVLFYLNNVDHGGATAFPKLNLVVP 466
Query: 186 PEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
+KGSA+FW+N + D +H CP+ G K
Sbjct: 467 TQKGSALFWHNLDRKSYDYDTCTFHGACPLISGTK 501
>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
Length = 485
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 105/221 (47%), Gaps = 25/221 (11%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G ++NL C Y+ + FL++ PLK+E L + P +V HD + EI
Sbjct: 264 YARGCRGQFVQ----QTNLICKYKFRPSPFLRLAPLKMEVLVVKPFIVAFHDVLSPHEIG 319
Query: 63 RIIELSKGKVERGKVVNYGDTIYVD---TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+ +L+ ++R V + ++ TR SK +L + +I RI DMT
Sbjct: 320 ELQQLAMPLLKRTTVYDSNAGLHGSVKGTRTSKGIWLSR---SHNNLTKRIGRRISDMT- 375
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
G LQ+ NYGL GHY LH D E L+DVE GG T+FP
Sbjct: 376 ---GFHLEGSTSLQVMNYGLSGHYALHTDYFNTAE-----------LSDVEQGGDTVFPR 421
Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ PE+G A+ WYN H N D R H CPV +G+KW
Sbjct: 422 IEQAFKPERGKALLWYNLHRNGTGDKRTEHGACPVLVGSKW 462
>gi|195159146|ref|XP_002020443.1| GL13510 [Drosophila persimilis]
gi|194117212|gb|EDW39255.1| GL13510 [Drosophila persimilis]
Length = 527
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 111/211 (52%), Gaps = 14/211 (6%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y + FL++ PL++EEL LDP +V H+ + D+EI + +++ ++R V
Sbjct: 295 SRLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVF 354
Query: 79 N-YGDTIYVDTRLSKVYFLYPEIFGD---HPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
+ + + + + + P+ D + +I RI ++T L+I + +Q+
Sbjct: 355 DGKENKMSTSKKRTALGAWLPDDNMDVSGRAVIQRIFRRIHELTGLIINDRQ----DMQL 410
Query: 135 NNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
YG GGHYD+H D +TP + R+A+ +FYL D++ GG+T F L L V E+G
Sbjct: 411 IKYGYGGHYDIHFDYFNTSTPITKARGDRMATVLFYLNDMKHGGSTAFTDLQLKVPSERG 470
Query: 190 SAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
+FWYN T LD R H CPV G K
Sbjct: 471 KVLFWYNMRGETHDLDSRTLHGACPVINGTK 501
>gi|195166677|ref|XP_002024161.1| GL22880 [Drosophila persimilis]
gi|194107516|gb|EDW29559.1| GL22880 [Drosophila persimilis]
Length = 507
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 112/213 (52%), Gaps = 21/213 (9%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSK---GKVER 74
++NL C Y S FL++ PLK+EE+ DP +V H + D E+ + +L++ +
Sbjct: 246 RTNLVCRYNSTTTPFLRLAPLKMEEVNHDPYIVMYHQVLSDREMEEMKQLARPMTNGMSG 305
Query: 75 GKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
++ N + + + R++ + PF ++ RI DMT + +K LQ+
Sbjct: 306 SEMANLTEPLEIVARVAW-------LIEASPFRERLNLRIGDMTGFDVSD---FKA-LQL 354
Query: 135 NNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
N+G+G ++ H D T R L R S +FY ++V GG TIFP + +TV P+
Sbjct: 355 ANFGVGSYFKAHYDYRTERVNDLGVTELGDRTGSIIFYASEVPQGGTTIFPDIQVTVTPQ 414
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
KG+++FW+N ++ D R H+ CPV G++W
Sbjct: 415 KGNSLFWFNTFDDSTPDPRSLHAICPVIAGSRW 447
>gi|194765144|ref|XP_001964687.1| GF22917 [Drosophila ananassae]
gi|190614959|gb|EDV30483.1| GF22917 [Drosophila ananassae]
Length = 529
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 77/213 (36%), Positives = 111/213 (52%), Gaps = 16/213 (7%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y S FLKI PLK+EE+ LDP +V HD + D +I+ ++ LS+ K+E +VV
Sbjct: 291 TRLHCRYNSTTTPFLKIAPLKMEEISLDPYIVVYHDVLPDGDISEVLRLSETKLEPAQVV 350
Query: 79 NYGDT---IYVDTRLSKVYFLYPEIFGDHPF--LY-KIQTRIQDMTNLVIGREERYKGPL 132
+ T + T L Y E+ P LY +++ ++D+T LVI + +
Sbjct: 351 STPRTSNNVKFRTALGSWLPDYEEVVKGPPKGPLYGRLRNILRDVTGLVIWDYQFF---- 406
Query: 133 QINNYGLGGHYDLHCD---ATPRDEGLW--RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
Q+ Y G HY H D + + L R+A+ +FYL D GGAT+FP LN+ V E
Sbjct: 407 QVLKYQFGAHYAQHHDYFNMSLKSTVLQGDRIATVLFYLNDAPHGGATVFPMLNVKVPAE 466
Query: 188 KGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
KG +FWYN T D + H CP+ G K
Sbjct: 467 KGKILFWYNLKGETHDFDEKTLHGACPIFHGTK 499
>gi|195379218|ref|XP_002048377.1| GJ13934 [Drosophila virilis]
gi|194155535|gb|EDW70719.1| GJ13934 [Drosophila virilis]
Length = 469
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 100/201 (49%), Gaps = 24/201 (11%)
Query: 21 LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
L C Y ++ +L++ PLK+E L L P + HD ++DSEI + ++ R N
Sbjct: 275 LTCRYVQQHSAYLRLAPLKMEILSLQPLIQLYHDVLHDSEIEAVKNVTN---HRAMAENL 331
Query: 81 GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG 140
T+ + T D P + RI DM+ L + + L + N+GLG
Sbjct: 332 ASTVKLIT------------LRDAPHTQNMHRRITDMSGLDMAQN----NTLHLLNFGLG 375
Query: 141 GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 200
G+ R+A+ +FY +DV+LGGATIFP L L V P++GSA+ WYN +A
Sbjct: 376 GYLGKQLKLQGN-----RIATVIFYASDVQLGGATIFPRLQLVVKPKRGSALLWYNLNAA 430
Query: 201 TLLDYRMYHSGCPVALGNKWG 221
D H+ CPV +G++W
Sbjct: 431 GKPDPLTRHAVCPVVVGSRWA 451
>gi|198449508|ref|XP_002136911.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
gi|198130638|gb|EDY67469.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
Length = 516
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 111/211 (52%), Gaps = 14/211 (6%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y + FL++ PL++EEL LDP +V H+ + D+EI + +++ ++R V
Sbjct: 284 SRLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLCDAEIAEVERVTEPLLKRSVVF 343
Query: 79 N-YGDTIYVDTRLSKVYFLYPEIFGD---HPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
+ + + + + + P+ D + +I RI ++T L+I + +Q+
Sbjct: 344 DGKENKMSTSKKRTALGAWLPDDNMDVSGRAVIQRIFRRIHELTGLIIND----RQDMQL 399
Query: 135 NNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
YG GGHYD+H D ++P + R+A+ +FYL DV+ GG+T F L L V E+G
Sbjct: 400 IKYGYGGHYDIHFDYFNTSSPITKARGDRMATVLFYLNDVKHGGSTAFTDLQLKVPSERG 459
Query: 190 SAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
+FWYN T LD R H CPV G K
Sbjct: 460 KVLFWYNMRGETHDLDSRTLHGACPVIDGTK 490
>gi|194760358|ref|XP_001962408.1| GF14452 [Drosophila ananassae]
gi|190616105|gb|EDV31629.1| GF14452 [Drosophila ananassae]
Length = 498
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 121/221 (54%), Gaps = 23/221 (10%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE-LSKGKVERGK 76
+S L C Y + F++I PLK EE+ DP + HD ++DSE+ + + L++ ++ +G
Sbjct: 278 QSRLVCRYNTTTTPFMRIAPLKEEEISKDPLIWLYHDVLFDSEMALLTKNLTREEMIQGY 337
Query: 77 VVNYGDTIYVDTRLSKVYFLYP-EIF-GDHPFLYK-IQTRIQDMTNLVIGREERYKGPLQ 133
N T K Y ++ +++ GD L + + R+ D++ L +G L
Sbjct: 338 TNN-------QTTPDKGYRIFQVKVYEGDGGKLDRTLVNRMTDISGLDVGNHTY----LA 386
Query: 134 INNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
NYGLG H+ H D E RL +F+FY +DVE+GGATIFP+ N+++ P+
Sbjct: 387 RANYGLGTHFQEHSDYVDLRENPDLGSEGDRLFTFLFYASDVEMGGATIFPAANISIKPK 446
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW--GKLLLS 226
KGSA+FWYN H + + H+ CP+ LGN+W K +LS
Sbjct: 447 KGSALFWYNLHNDWEPNPLSRHAVCPMVLGNRWILNKSMLS 487
>gi|328713119|ref|XP_003244997.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
pisum]
Length = 487
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 79/207 (38%), Positives = 104/207 (50%), Gaps = 12/207 (5%)
Query: 22 KCFYESYNNTFLKI-GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
KC Y++ NN F +I P K E++ +P + HD +YD EI +I +S + KV
Sbjct: 268 KCRYQT-NNLFYRILMPFKEEDINSEPFIKIYHDVLYDDEILKIKTMSLANMSDAKVKTS 326
Query: 81 GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG 140
D+I + S + E+ F + TRI+ T ERY QI NYGLG
Sbjct: 327 NDSILRERSRSGQVYRMNEVDAIEYFD-ALNTRIESFTGFSTKTAERY----QIVNYGLG 381
Query: 141 GHYDLHCDA----TPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYN 196
GHY H D T E RL + +FYLTDV+ G T FP LN+ EKGSA+ W N
Sbjct: 382 GHYFPHFDTFKKGTENMEFGNRLVTVLFYLTDVQNDGYTSFPMLNIIAPAEKGSALVWNN 441
Query: 197 AH-ANTLLDYRMYHSGCPVALGNKWGK 222
H ++ L Y H CP+ GNKW K
Sbjct: 442 LHMSDGQLCYESLHGACPLLKGNKWSK 468
>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
Length = 541
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 75/248 (30%), Positives = 125/248 (50%), Gaps = 36/248 (14%)
Query: 3 YPLACQGNL-SVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y C+G + S+ + + + C ++ ++ + P ++E +++ P V+ + I DSEI
Sbjct: 266 YEKLCRGEVRSLTKWEQGQMSC-WQIRDDPLTVLKPGRIERVFVKPEVLIFRNFITDSEI 324
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSK--VYFLYPEIFGDHPF----------- 106
RI EL+ +++R V + G+ I+ + R+SK +P + G F
Sbjct: 325 KRIKELATPRLKRATVKDPVTGELIFANYRISKRRATIQHP-VTGKLEFANYRISKSGWL 383
Query: 107 -------LYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-- 157
+ +I R+Q + L + E LQ+ NYG+GGHY+ H D E +
Sbjct: 384 RDEEDELVKRISYRVQAYSGLNMTTSE----DLQVVNYGIGGHYEPHYDFARDGEDKFTS 439
Query: 158 -----RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 212
R+A+F+ YL+DVE GG T+F + TV+P+KG A FWYN + D H+ C
Sbjct: 440 LGTGNRIATFLSYLSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAAC 499
Query: 213 PVALGNKW 220
PV +G+KW
Sbjct: 500 PVLVGSKW 507
>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
latipes]
Length = 517
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 79/228 (34%), Positives = 116/228 (50%), Gaps = 13/228 (5%)
Query: 1 EIYPLACQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y C+ S P ++ L C Y + NN L + P+K E L L P VV H+ I D
Sbjct: 272 DTYERLCRTQGSQPIHFENPRLYCDYFTNNNPALLLLPVKREVLSLQPYVVIYHNFITDR 331
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
E I ++ + R V + + V+ R+SK +L + + K+ RI +T
Sbjct: 332 EAEEIKGFAQPALRRSVVASGENQATVEYRISKSAWLKG---SESCIVGKLDQRISMLTG 388
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVELG 172
L + Y LQ+ NYG+GGHY+ H D AT ++ R+A+FM YL+ VE G
Sbjct: 389 LNV--RPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTGNRVATFMIYLSSVEAG 446
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G+T F N +V K +A+FW+N H N D H+GCPV +G+KW
Sbjct: 447 GSTAFIYANFSVPVLKKAAIFWWNLHRNGRGDAETLHAGCPVLIGDKW 494
>gi|195159150|ref|XP_002020445.1| GL13509 [Drosophila persimilis]
gi|194117214|gb|EDW39257.1| GL13509 [Drosophila persimilis]
Length = 554
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 69/211 (32%), Positives = 111/211 (52%), Gaps = 14/211 (6%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y + FL++ PL++EEL LDP +V H+ + D+EI + +++ ++R V
Sbjct: 322 SRLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVF 381
Query: 79 N-YGDTIYVDTRLSKVYFLYPEIFGD---HPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
+ + + + + + P+ D + +I RI ++T L++ + +Q+
Sbjct: 382 DGKENKMSTSKKRTALGAWLPDDNMDVSGRAVIQRILRRIHELTGLIMND----RQDMQL 437
Query: 135 NNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
YG GGHYD+H D ++P + R+A+ +FYL DV+ GG+T F L L V E+G
Sbjct: 438 IKYGYGGHYDIHFDYFNTSSPITKARGDRMATVLFYLNDVKHGGSTAFTDLQLKVPSERG 497
Query: 190 SAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
+FWYN T LD R H CPV G K
Sbjct: 498 KVLFWYNMRGETHDLDSRTLHGACPVIDGTK 528
>gi|194765184|ref|XP_001964707.1| GF22906 [Drosophila ananassae]
gi|190614979|gb|EDV30503.1| GF22906 [Drosophila ananassae]
Length = 708
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 73/223 (32%), Positives = 113/223 (50%), Gaps = 14/223 (6%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y CQG E L C+ + N + + PL+VE ++LDP + H + +IN
Sbjct: 462 YTRLCQGKKLPEESTGRPLSCYLDGRTNPYFVLAPLQVEPVHLDPDINVYHRMLSQQQIN 521
Query: 63 RIIELS-KGKVERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I E + K + R V N G + D R+S+ +L P + I IQ ++
Sbjct: 522 SIFEEADKLTMYRSAVAGNAGKSTVADLRVSQQTWLN----YTSPIMKSISRIIQFVSGF 577
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
I E +Q+ NYG+GG Y+ H D P+ R+++ MFYL++VE GG T+
Sbjct: 578 DIAGAEF----MQVANYGVGGQYEPHPDYFEFNLPQQFQGDRISTSMFYLSNVEQGGYTV 633
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
F LN+ + P +G+ V W+N H + +D R H+GCPV +G+K
Sbjct: 634 FTKLNVFLPPIQGAMVMWHNLHRSLDVDARTLHAGCPVLVGSK 676
>gi|241598357|ref|XP_002404731.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215500462|gb|EEC09956.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 218
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 103/218 (47%), Gaps = 43/218 (19%)
Query: 17 IKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
+ S L+C Y + FL + +K+EE+ L P ++ +HD + D +I +++E ++ ++ER
Sbjct: 1 MDSQLRCRYYKGQDGFLALQQIKLEEMNLKPYIIVMHDVVQDKDIEKLMEFAEPRLERST 60
Query: 77 VVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
N + + R S +L + + P+ + N
Sbjct: 61 TYNGSEVMPTPQRTSSTAWLNED-----------------------------EAPIALAN 91
Query: 137 YGLGGHYDLHCDATPRDEGLW--------------RLASFMFYLTDVELGGATIFPSLNL 182
YG GGH+ H D + R+A+ M Y+TDVE GGAT+FPSL +
Sbjct: 92 YGTGGHFLPHHDFFQDSLNAYNSSADYYLQHGRGDRIATLMIYMTDVEAGGATVFPSLGI 151
Query: 183 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ P+KG A FW+N A+ + H+GCPV G+KW
Sbjct: 152 RLTPKKGDAAFWWNLKASGEGERLTMHAGCPVLYGSKW 189
>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
Length = 578
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 76/225 (33%), Positives = 114/225 (50%), Gaps = 17/225 (7%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
L C+G P L C Y + FL++ P K E L L P +V HD I E +
Sbjct: 345 LCCRG--GCPYRDMHRLTCSYNTTAAPFLRLAPFKTELLSLAPYMVLYHDVITPLESLTL 402
Query: 65 IELSKGKVERGKVVNYGDTI--YVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
LSK ++R + + +D+ R S +L ++ + +++ R+ MTN
Sbjct: 403 KNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWLTSH---ENAVMERLERRVGVMTNFE 459
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCD--ATPRDE---GLWRLASFMFYLTDVELGGATI 176
+ E Y Q+ NYG+GGHY H D TP+ E G R+A+ +FYL+DV GGAT+
Sbjct: 460 MENSEVY----QLINYGIGGHYKPHTDHFETPQLEHRGGGDRIATVLFYLSDVPQGGATL 515
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
FP LN++V P +G A+ WYN + + H+ CP+ G+KW
Sbjct: 516 FPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWA 560
>gi|313229343|emb|CBY23930.1| unnamed protein product [Oikopleura dioica]
Length = 542
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 74/230 (32%), Positives = 119/230 (51%), Gaps = 15/230 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
E Y C+ +P + LKCFY + N+ FL +GP+K EEL+ +P +++ ++ I D
Sbjct: 284 EYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYEIITDE 343
Query: 60 EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEI-FGDHPFLYKIQTRIQD 116
E++ I + ++ K V + G + D R+S+ +L L + + RI
Sbjct: 344 ELDIINKQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFRKRISI 403
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDV 169
+T L + R E +Q +NYG+GG Y+ H D +T D G + R+A+++ YL +
Sbjct: 404 ITGLTMERAE----DIQYSNYGIGGQYEPHYDMSTENDAGKFDEEDGNRIATWLTYLNEP 459
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
+ GG T+F + P SAVFWYN + DYR H+ CPV +G K
Sbjct: 460 KHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 509
>gi|241044303|ref|XP_002407179.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215492129|gb|EEC01770.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 456
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 72/218 (33%), Positives = 110/218 (50%), Gaps = 16/218 (7%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G + +L C Y+ + + KIGP+KVE++ +P V++ +D ++ EI
Sbjct: 225 CRGEKIRNASEEKDLFCLYD-VPHPYFKIGPVKVEQMNKNPYVLQFYDVLWPQEIKAFRR 283
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
+ ++ER V + R+S+V ++ P+ L ++ R+ +T L R
Sbjct: 284 MGDPQLERATVRDTARNTVSHARVSQVAWISPD---SDVLLDRVNARVAMLTGLS-HRLR 339
Query: 127 RYKGPLQINNYGLGGHYDLHCDATPR-DE----GLWRLASFMFYLTDVELGGATIFPSLN 181
+Y N+YG GGHY+ H D DE G R+A+FMFYL+DV LGG+T+FP
Sbjct: 340 KY------NSYGPGGHYEPHHDYLEELDEVDKLGGDRIATFMFYLSDVNLGGSTVFPYAK 393
Query: 182 LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
V P+ GSA FWYN + D H C V G K
Sbjct: 394 AGVMPKMGSAAFWYNMREDGSYDRATLHGACSVLHGTK 431
>gi|313241587|emb|CBY33829.1| unnamed protein product [Oikopleura dioica]
Length = 541
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 74/230 (32%), Positives = 119/230 (51%), Gaps = 15/230 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
E Y C+ +P + LKCFY + N+ FL +GP+K EEL+ +P +++ ++ I D
Sbjct: 283 EYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYEIITDE 342
Query: 60 EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEI-FGDHPFLYKIQTRIQD 116
E++ I + ++ K V + G + D R+S+ +L L + + RI
Sbjct: 343 ELDIINKQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFRKRISI 402
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDV 169
+T L + R E +Q +NYG+GG Y+ H D +T D G + R+A+++ YL +
Sbjct: 403 ITGLTMERAE----DIQYSNYGIGGQYEPHYDMSTENDAGKFDEEDGNRIATWLTYLNEP 458
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
+ GG T+F + P SAVFWYN + DYR H+ CPV +G K
Sbjct: 459 KHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 508
>gi|313213106|emb|CBY36968.1| unnamed protein product [Oikopleura dioica]
Length = 541
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 74/230 (32%), Positives = 119/230 (51%), Gaps = 15/230 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
E Y C+ +P + LKCFY + N+ FL +GP+K EEL+ +P +++ ++ I D
Sbjct: 283 EYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYEIITDE 342
Query: 60 EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEI-FGDHPFLYKIQTRIQD 116
E++ I + ++ K V + G + D R+S+ +L L + + RI
Sbjct: 343 ELDIINKQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFRKRISI 402
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDV 169
+T L + R E +Q +NYG+GG Y+ H D +T D G + R+A+++ YL +
Sbjct: 403 ITGLTMERAE----DIQYSNYGIGGQYEPHYDMSTENDAGKFDEEDGNRIATWLTYLNEP 458
Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
+ GG T+F + P SAVFWYN + DYR H+ CPV +G K
Sbjct: 459 KHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 508
>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
Length = 526
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 70/198 (35%), Positives = 113/198 (57%), Gaps = 17/198 (8%)
Query: 33 LKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRL 90
LK+ P+ +E + ++P++ H+ + + EI +++EL++ ++ R +V N G+ VD R+
Sbjct: 312 LKLKPVAMEIVSVNPQITLFHNVLSEMEIEQMLELARPRLRRARVNNLETGEIEDVDYRI 371
Query: 91 SKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT 150
S++ +L D + +I R+ +T L E LQ+NNYG+GGHY+ H D +
Sbjct: 372 SQIAWLSD---SDGDIVRRINRRVGFITGLNTNTGE----CLQVNNYGVGGHYEPHFDHS 424
Query: 151 PRDEGL--------WRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 202
E R+A+FMFYL++VE GG+T+F + P KG AVFWYN +
Sbjct: 425 LDMENSPIASLGQGNRIATFMFYLSEVEAGGSTVFIKTGVKTNPFKGGAVFWYNLKKSGE 484
Query: 203 LDYRMYHSGCPVALGNKW 220
D+ H+GCPV +GNKW
Sbjct: 485 GDWDSLHAGCPVLIGNKW 502
>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 533
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 76/225 (33%), Positives = 117/225 (52%), Gaps = 19/225 (8%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
CQG + + + L C Y + ++ + PLK+E L+ DP + ++ I D E II+
Sbjct: 294 CQGREKMAQKDINRLFCKYVAPKAHYI-LKPLKMEVLHHDPYIELYYELITDDEAKHIIK 352
Query: 67 LSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
+K + R V + GD IY D R+SK ++ ++ KI R+ D+T L +
Sbjct: 353 FAKPLLRRAFVHDMVTGDLIYADYRVSKNTWIAEDM---DVIAAKIIRRVGDVTGLNM-- 407
Query: 125 EERYKGPLQINNYGLGGHYDLHCDAT----PRDEGLW---RLASFMFYLTDVELGGATIF 177
RY LQ+ NYG+ G Y+ H D + P+ W R+A+ + YL+DV+ GG T+F
Sbjct: 408 --RYAEHLQVANYGIAGQYEPHFDHSTGTRPKHFDRWGGNRIATMLLYLSDVDWGGRTVF 465
Query: 178 PSL--NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + P KG+ VFWYN N + + H+GCPV LG KW
Sbjct: 466 TNTAPGVGTDPIKGAGVFWYNLLRNGKSNPKTQHAGCPVVLGQKW 510
>gi|194905305|ref|XP_001981170.1| GG11767 [Drosophila erecta]
gi|190655808|gb|EDV53040.1| GG11767 [Drosophila erecta]
Length = 536
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 78/211 (36%), Positives = 113/211 (53%), Gaps = 14/211 (6%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV- 77
S L C Y + FL++ PL++EEL LDP VV H+ + D EI ++ +S+ +ER KV
Sbjct: 295 SRLHCRYNTTTRPFLRLVPLRMEELSLDPYVVLYHNVLSDPEIEKLKLMSEPFLERAKVY 354
Query: 78 -VNYGDTIYVDTRLSKVYFLY-PEIF-GDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
V G +R + +L PE D L +I RI D+T L + +Q+
Sbjct: 355 RVEKGSDEVAPSRSADGAWLPDPETEPEDLETLNRIGRRIGDITGLSTCSGSQ----MQL 410
Query: 135 NNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
YG GGH+ H D T E + R+A+ +FYL +V+ GGAT FP++NL V +KG
Sbjct: 411 LKYGFGGHFVPHYDYFDSKTSYLEAVGDRIATVLFYLNNVDHGGATAFPNINLAVPTQKG 470
Query: 190 SAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
SA+FW+N + D R +H CP+ G K
Sbjct: 471 SALFWHNLDGKSYDYDTRTFHGACPLISGTK 501
>gi|442751927|gb|JAA68123.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 522
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/232 (28%), Positives = 113/232 (48%), Gaps = 17/232 (7%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + + S L+C Y + F + P+K+EE+ L P ++ + D + + +I
Sbjct: 265 YKRLCRGEVLRTPKMDSKLRCRYYKGQDGFFTLRPIKLEEINLKPYIIVMRDVVQERDIE 324
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
++ ++ +++R R S +L+ + + P ++ ++ + L
Sbjct: 325 DLMAFAEPRLQRSTTYTGDGNAPSTRRTSSNAWLWDD---EAPIANRMNWYLRALVGLGT 381
Query: 123 GREERYKGPLQINNYGLGG----HYD-----LHCDATPRDEGLW-----RLASFMFYLTD 168
+ Q+ NYG GG H+D LH + D L RLA+ M Y+TD
Sbjct: 382 SGSDYEAEAYQLANYGSGGYFLPHHDYLQDTLHAHNSTADYYLQNKEGDRLATLMIYMTD 441
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
VE+GGAT+FP L + + P+KG A FW+N A+ D H+GCPV G+KW
Sbjct: 442 VEVGGATVFPRLGVRLVPKKGDAAFWWNLKASGEGDTLTMHAGCPVLYGSKW 493
>gi|194751823|ref|XP_001958223.1| GF23631 [Drosophila ananassae]
gi|190625505|gb|EDV41029.1| GF23631 [Drosophila ananassae]
Length = 502
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 75/227 (33%), Positives = 106/227 (46%), Gaps = 17/227 (7%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
L C+G P+ L C Y + FLK+ PLK+E L + P +V HD +Y+ E +
Sbjct: 263 LGCRGKW--PKKPSPTLTCRYVRETHDFLKLAPLKMEFLNMQPLIVLYHDVLYEGEFKSM 320
Query: 65 IELSKGKVERGKVVNYGD--TIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
+++ G Y D R +V + F +I RI DMT
Sbjct: 321 RDIAIFNATMGDGWTYVDFDKKGKPKRQDRVVKMITFQGTTAEFTLRINRRIADMT---- 376
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPR---------DEGLWRLASFMFYLTDVELGG 173
G E L + NYGLGGH+ H D D G R+A+ + Y +DV LGG
Sbjct: 377 GLEMNENMALHLTNYGLGGHFGKHVDYVELAKRPPNFFGDLGGDRIATALLYASDVPLGG 436
Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T+F L L++ P+KGSA+ W+N + D HS CPV LG++W
Sbjct: 437 TTVFTKLKLSIEPKKGSALIWFNLNNAGDPDPMSEHSACPVVLGSRW 483
>gi|195156517|ref|XP_002019146.1| GL25581 [Drosophila persimilis]
gi|194115299|gb|EDW37342.1| GL25581 [Drosophila persimilis]
Length = 206
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 73/210 (34%), Positives = 101/210 (48%), Gaps = 33/210 (15%)
Query: 21 LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI-IELSKGKVERGKVVN 79
L C Y FL++ PLK EE+ DP + HD +YDSE ++ + L++ ++ +G N
Sbjct: 2 LVCRYNHTTTPFLRLAPLKEEEVSRDPLIWLYHDVLYDSEFEQLTVNLTRAEMVQGYTDN 61
Query: 80 YGDTIYVDTRLSKVYFLYPEIF--GDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
Y T K Y IF + R+ D++ L+ G + L NY
Sbjct: 62 Y-------TTTEKERIFYVNIFEGSGEKLDRDLVNRMADISGLLTGEHTQ----LGTVNY 110
Query: 138 GLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GLG H+ H D A P +TDV LGGATIFP +NLT+ P+KGSA+
Sbjct: 111 GLGSHFPEHGDYSDIKANP--------------MTDVPLGGATIFPKINLTIQPKKGSAL 156
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
FWYN H + H+ CP GN+W K
Sbjct: 157 FWYNIHNDWEPHVLTRHAVCPTIEGNRWSK 186
>gi|194871364|ref|XP_001972834.1| GG13661 [Drosophila erecta]
gi|190654617|gb|EDV51860.1| GG13661 [Drosophila erecta]
Length = 506
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 109/209 (52%), Gaps = 22/209 (10%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER-GK 76
+ + C YE + FL+I PLKVE L + P +V HD IYDSEI+++ +S + +
Sbjct: 288 RQHQSCHYEKNTSDFLRIAPLKVETLSVKPHIVLYHDVIYDSEISKVKNISLPSLRSPSR 347
Query: 77 VVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
++ D + +L+K+ + P + RI+DMT G + LQI N
Sbjct: 348 ILRAEDH---NLKLAKIR--------EDP-RSPLSLRIKDMT----GEDVEEDTDLQIEN 391
Query: 137 YGLGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
YG+ G H D + RL S +F++ DV LGGA +F + NLT+FP+KGSA+
Sbjct: 392 YGICGFRFYHNDNLESQDQTAKLGDRLTSILFFMNDVALGGAFVFLNANLTIFPQKGSAL 451
Query: 193 FWYNA-HANTLLDYRMYHSGCPVALGNKW 220
W N H+ + + H CPV +G+KW
Sbjct: 452 VWRNLDHSLQPKEDLLQHLSCPVIVGSKW 480
>gi|195379216|ref|XP_002048376.1| GJ13933 [Drosophila virilis]
gi|194155534|gb|EDW70718.1| GJ13933 [Drosophila virilis]
Length = 521
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 72/224 (32%), Positives = 110/224 (49%), Gaps = 20/224 (8%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
L C+G P+ +L C Y S FL++ PLK+EE+ DP +V H+ + DSEI +
Sbjct: 287 LGCRGLFPKPK----SLSCRYNSTTTPFLRLAPLKLEEISHDPYIVMYHNVLSDSEIEEM 342
Query: 65 IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
+LS N + +++ +L PFL +I RI DMT G
Sbjct: 343 KQLSVLMENGLSATNKPNNTEPLDIVARAGWLVEAT----PFLERINRRITDMT----GF 394
Query: 125 EERYKGPLQINNYGLGGHYDLHCD--------ATPRDEGLWRLASFMFYLTDVELGGATI 176
+ + + NYG+G ++ H D E R+A+ +FY +DV GGAT
Sbjct: 395 DVLDMWAVLLANYGIGNYFKPHYDYMYGGRVSGEAVAELGERIATLIFYASDVAQGGATN 454
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP + + V P+KG+++FWYN + D R HS CP +G++W
Sbjct: 455 FPDIQVAVQPQKGNSLFWYNMFDDGTPDPRSLHSVCPTIVGSRW 498
>gi|195113263|ref|XP_002001187.1| GI10646 [Drosophila mojavensis]
gi|193917781|gb|EDW16648.1| GI10646 [Drosophila mojavensis]
Length = 471
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/226 (34%), Positives = 113/226 (50%), Gaps = 41/226 (18%)
Query: 1 EIYPLAC-QGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
E+ LA +G+ S ++L C Y + FL+I PLK+EEL LDP +V H AIY+S
Sbjct: 257 EMISLAIIKGHCSASFQRPTHLHCRYNYWMTPFLRIAPLKLEELSLDPLIVLYHKAIYNS 316
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
EI +++ + + GK N TI+ R+ DM+
Sbjct: 317 EIETLLKRQEFNLISGKD-NMDRTIH--------------------------ERVADMSG 349
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDA-----TPRDEGLWRLASFMFYLTDVELGGA 174
L + R E L + N GH+ L DA P+D R+A+ +FYL DVEL GA
Sbjct: 350 LNLDRSE----VLSVINNDNNGHFQLQEDAPETTERPQD----RIATVLFYLEDVELVGA 401
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
TIFP LNLT+ PEKG+A+ W+N + + ++ CPV +K+
Sbjct: 402 TIFPRLNLTIKPEKGTALLWHNLESCGSSHPKALYAACPVISSSKY 447
>gi|410975458|ref|XP_003994148.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Felis catus]
Length = 567
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 123/258 (47%), Gaps = 42/258 (16%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY-------------------PE 99
I + +L+K ++ R V + G R+SK + PE
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSLVSWGKVQRALLIRSMQVCCERGPE 408
Query: 100 IFGDHPFLYKIQTRIQDMTNLV---------IGREERYKGPLQINNYGLGGHYDLHCDAT 150
D + + + +++ L IG E G + NYG+GG Y+ H D
Sbjct: 409 AAWDGGSM-SAEECLAELSLLAGECSAALVPIGVCESRLGK-GVANYGVGGQYEPHFDFA 466
Query: 151 PRDEGLW--------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 202
+DE R+A+++FY++DV GGAT+FP + +V+P+KG+AVFWYN A+
Sbjct: 467 RKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGE 526
Query: 203 LDYRMYHSGCPVALGNKW 220
DY H+ CPV +GNKW
Sbjct: 527 GDYSTRHAACPVLVGNKW 544
>gi|195503448|ref|XP_002098656.1| GE23815 [Drosophila yakuba]
gi|194184757|gb|EDW98368.1| GE23815 [Drosophila yakuba]
Length = 472
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 73/219 (33%), Positives = 108/219 (49%), Gaps = 33/219 (15%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G P KS L+C Y + FL+ LK+E+L ++P V HDAI +E ++
Sbjct: 249 CRGKNLPPS--KSFLRCRYFREGSPFLRWAALKLEQLNIEPFVGLFHDAISPAEQEDLLR 306
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
L++ ++E K +Y VDT S DH + +I RI+D+T + E
Sbjct: 307 LTETRLEHRKKDSYSVEANVDTNGS-----------DH--VRRIHQRIEDITGFDLEDSE 353
Query: 127 RYKGPLQINNYGLGGHYDLHCDA-TPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVF 185
PL ++NYG+GG +H D P+ L+DV++GG FP L
Sbjct: 354 ----PLTVSNYGIGGQESIHLDCEQPK-------------LSDVQMGGYASFPDLGFGFK 396
Query: 186 PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
P +GSA+ W+N + D R + CPV LGN+WGK L
Sbjct: 397 PSRGSALVWHNTDSAGNCDTRSLQATCPVLLGNQWGKWL 435
>gi|195341061|ref|XP_002037130.1| GM12749 [Drosophila sechellia]
gi|194131246|gb|EDW53289.1| GM12749 [Drosophila sechellia]
Length = 467
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 69/216 (31%), Positives = 108/216 (50%), Gaps = 33/216 (15%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G +P KS+L+C Y + FL++ P+K+E+L +P V +HDAI +E ++
Sbjct: 255 CRGKNLLPN--KSSLRCRYFRGGSPFLRLAPVKLEQLNFEPFVGLVHDAISQAEQEDLLH 312
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
L+ ++E + + VDT S DH + +I RI+D+T + E
Sbjct: 313 LTDSRLEHTRKESSSVEAKVDTNAS-----------DH--VRRIHQRIEDITGFDMEESE 359
Query: 127 RYKGPLQINNYGLGGHYDLHCDA-TPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVF 185
PL ++NYG+GG +H D P+ L+DV++GG FP L
Sbjct: 360 ----PLIVSNYGIGGQELIHLDCEQPK-------------LSDVQMGGYASFPDLGFGFK 402
Query: 186 PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
P +GSA+ W+N + D R + CPV LGN+WG
Sbjct: 403 PRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWG 438
>gi|195128347|ref|XP_002008625.1| GI13597 [Drosophila mojavensis]
gi|193920234|gb|EDW19101.1| GI13597 [Drosophila mojavensis]
Length = 457
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 73/216 (33%), Positives = 102/216 (47%), Gaps = 39/216 (18%)
Query: 6 ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
AC+G PE +L C Y N+ +LK+ P+K+E+L L+P V HD +YDSEI I
Sbjct: 262 ACRGLW--PERKTDHLSCRYVYENSAYLKLAPMKLEQLSLEPVVQLYHDVLYDSEIKAIK 319
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
+S + + +V I R+ DMT G
Sbjct: 320 NMSVPEAKAKRVE-----------------------------LNINQRVADMTG--YGMM 348
Query: 126 ERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVF 185
E K L + N+ LG D D R+A+ +FY DV +GGATIFP L L V
Sbjct: 349 EHNK--LHVLNFALGQGADTKSCKARAD----RIATIVFYANDVAIGGATIFPKLRLLVQ 402
Query: 186 PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
P +G+A+ WYN +A+ D H+ CPV LG++W
Sbjct: 403 PRRGTALLWYNLNADGAADPLAKHAVCPVVLGSRWA 438
>gi|194905313|ref|XP_001981171.1| GG11766 [Drosophila erecta]
gi|190655809|gb|EDV53041.1| GG11766 [Drosophila erecta]
Length = 496
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 69/213 (32%), Positives = 110/213 (51%), Gaps = 23/213 (10%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
S L C Y S + FL + PLK+E++ L+P +V HD + + +I+++I L++ ++
Sbjct: 271 SKLHCRYNSTTSPFLILAPLKMEQISLEPYIVVYHDILPEGDIHQLIALAEPRLR----- 325
Query: 79 NYGDTIYVDTRLSKVYFLYPEIFGD-----HPFLYKIQTRIQDMTNLVIGREERYKGPLQ 133
+ + + V+ + F D P L ++ R++D+T L I + R +
Sbjct: 326 --ATLAFTEDKSDSVFGAFLP-FKDMNSSGEPVLDRLTQRMRDITGLQIHQRNR----IN 378
Query: 134 INNYGLGGHY----DLHCDATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
I YG G HY D + EG R+A+ MFYL D GGAT+FP +N+ V E+
Sbjct: 379 IIKYGFGAHYAARHDFFNETNSETEGYGDRMATVMFYLNDAPNGGATVFPRINVKVPAER 438
Query: 189 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNKW 220
G +FWYN T +D + H+ CPV G+KW
Sbjct: 439 GKVLFWYNLDGETHDVDPKTVHAACPVFHGSKW 471
>gi|198449520|ref|XP_002136916.1| GA26928 [Drosophila pseudoobscura pseudoobscura]
gi|198130644|gb|EDY67474.1| GA26928 [Drosophila pseudoobscura pseudoobscura]
Length = 532
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 72/213 (33%), Positives = 106/213 (49%), Gaps = 20/213 (9%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER-GKV 77
S L C Y + FL++ PL++EEL LDP +V H+ + D+EI + + + ++R G+
Sbjct: 295 SRLHCRYNATTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEIAEVERVIEPLLQRIGRY 354
Query: 78 VNYGDTIYVDTRLSKVYFLYPEI-----FGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
+++ R + F P I P + ++ I+DMT L + L
Sbjct: 355 DETPNSMSPSKR--RTGFTGPHIDDYMHVSGAPVIERVHRHIRDMTGLFMNEH------L 406
Query: 133 QINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
+ YGLGGH D H D + P + R+A+ +FYL DV+ GG+T F L L V E
Sbjct: 407 MMVKYGLGGHCDQHYDFLNASYPSTHAMGDRMATVLFYLNDVKHGGSTAFTDLQLKVPSE 466
Query: 188 KGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
+G +FWYN T LD R H CPV G K
Sbjct: 467 RGKVLFWYNMRGETHNLDRRTVHGSCPVIDGTK 499
>gi|195471732|ref|XP_002088156.1| GE14021 [Drosophila yakuba]
gi|194174257|gb|EDW87868.1| GE14021 [Drosophila yakuba]
Length = 265
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 70/216 (32%), Positives = 109/216 (50%), Gaps = 15/216 (6%)
Query: 6 ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
C+GN L C Y S F++I PLK EE+ +P + HD IYDSEI ++
Sbjct: 45 GCRGNFPP----HPQLVCRYNSTTTPFMRIAPLKEEEISKEPLIWLYHDVIYDSEIAQLT 100
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L++ ++ G NY R+++++ + + R+ D++ L +G
Sbjct: 101 NLTREEMILGTTNNY----TTPDRVNRLFHVKVTNDDGGQLDRTLVNRMADISGLDMGNT 156
Query: 126 ERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFM-FYLTDVELGGATIFPSLNLTV 184
L NYGLGG++ H D D L +S + ++DV +GGATIFP+ L +
Sbjct: 157 TS----LARINYGLGGYFQEHSDYV--DIKLHPASSLLPTSISDVPVGGATIFPAAKLAI 210
Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
P+KGSA+FWYN H N + H+ CP +G++W
Sbjct: 211 QPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 246
>gi|20177113|gb|AAM12259.1| RE23792p [Drosophila melanogaster]
gi|220948174|gb|ACL86630.1| PH4alphaSG2-PB [synthetic construct]
gi|220960438|gb|ACL92755.1| PH4alphaSG2-PB [synthetic construct]
Length = 301
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 71/223 (31%), Positives = 115/223 (51%), Gaps = 14/223 (6%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y CQG E L+C+ + + + + PL+VE ++LDP + H + +I
Sbjct: 55 YTRLCQGRRLPEERSGDPLRCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSSKQIL 114
Query: 63 RIIELS-KGKVERGKVVNYGDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I E + K ++ R V G V D R+S+ +L + P + + IQ ++
Sbjct: 115 SIFEEADKEEMVRSAVAGSGGEGTVRDLRVSQQTWLDYK----SPVMNSVGRIIQFVSGF 170
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
+ E +Q+ NYG+GG Y+ H D P++ R+++ MFYL+DVE GG T+
Sbjct: 171 DMAGAEH----MQVANYGVGGQYEPHPDYFEVNLPKNFEGDRISTSMFYLSDVEQGGYTV 226
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
F LN+ + P KG+ V W+N H + +D R H+GCPV +G+K
Sbjct: 227 FTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSK 269
>gi|198471971|ref|XP_002133305.1| GA28042 [Drosophila pseudoobscura pseudoobscura]
gi|198139547|gb|EDY70707.1| GA28042 [Drosophila pseudoobscura pseudoobscura]
Length = 203
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 72/208 (34%), Positives = 100/208 (48%), Gaps = 33/208 (15%)
Query: 21 LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI-IELSKGKVERGKVVN 79
L C Y FL++ PLK EE+ DP + HD +YDSE ++ + L++ ++ +G N
Sbjct: 2 LVCRYNHTTTPFLRLAPLKEEEVSRDPLIWLYHDVLYDSEFEQLTVNLTRAEMVQGYTDN 61
Query: 80 YGDTIYVDTRLSKVYFLYPEIF--GDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
Y T K Y IF + R+ D++ L+ G + L NY
Sbjct: 62 Y-------TTTEKERIFYVNIFEGSGEKLDRDLVNRMADISGLLTGEHTQ----LGTVNY 110
Query: 138 GLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GLG H+ H D A P +TDV LGGATIFP +NLT+ P+KGSA+
Sbjct: 111 GLGSHFPEHGDYSDIKANP--------------MTDVPLGGATIFPKINLTIQPKKGSAL 156
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FWYN H + H+ CP GN+W
Sbjct: 157 FWYNIHNDWEPHVLTRHAVCPTIEGNRW 184
>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
Length = 548
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 74/204 (36%), Positives = 103/204 (50%), Gaps = 12/204 (5%)
Query: 21 LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELS-KGKVERGKVVN 79
L C + YN L + P+KVE L+ + +++ E R ++ + K ++ER
Sbjct: 311 LTCELKHYNQPHLFLKPIKVEHLHEGRQRLQVFRQFASPEECRHLQHAGKRRLERAVAWT 370
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYK-IQTRIQDMTNLVIGREERYKGPLQINNYG 138
G V+ R+S +L P DH + K I RI+D T + I Y LQI+NYG
Sbjct: 371 DGRFQPVEFRISTAAWLQP----DHDAIVKRIHGRIEDATQVDI----EYAEALQISNYG 422
Query: 139 LGGHYDLHCDATPR--DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYN 196
+GG Y+ H D + R + RLA+FM YL V+ GG T FP L V P G AVFWYN
Sbjct: 423 MGGFYEPHFDHSSRGTNPDGERLATFMIYLNPVKQGGFTAFPRLGAAVQPGYGDAVFWYN 482
Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
+ + D H CPV G+KW
Sbjct: 483 LQPSGVGDPLTLHGACPVLRGSKW 506
>gi|328707957|ref|XP_001947811.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
pisum]
Length = 507
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 74/205 (36%), Positives = 105/205 (51%), Gaps = 14/205 (6%)
Query: 22 KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIEL-SKGKVERGKVVNY 80
KC Y++ N+ + I P K E++ +P + HD IYD EI I ++ SK + N
Sbjct: 285 KCRYQTKNSPYRMIMPFKEEDISSNPNIKLYHDIIYDEEIKTITDMASKDLSDAAYYFNG 344
Query: 81 GDTIYVDTRLSKVYFLYPEIFGDHPFLY-KIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
T+ D RL ++ + +P L+ K+ RI+ +T E Y Q NYGL
Sbjct: 345 KITLLDDQRLGQLKWFSENA---NPILFGKLNDRIECITEYTTKTAEGY----QTINYGL 397
Query: 140 GGHYDLHCDA---TPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYN 196
GGH+ +H DA P+ G RL + +FY+TDV G T+FP+LN KGSA+ W N
Sbjct: 398 GGHFSVHMDAFTDGPKLNGN-RLVTILFYMTDVPDDGYTVFPNLNYVAHCRKGSALVWLN 456
Query: 197 AHANT-LLDYRMYHSGCPVALGNKW 220
N + +H GCPV GNKW
Sbjct: 457 LRLNNGSVHSGTFHGGCPVIKGNKW 481
>gi|195452738|ref|XP_002073478.1| GK14139 [Drosophila willistoni]
gi|194169563|gb|EDW84464.1| GK14139 [Drosophila willistoni]
Length = 215
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 115/218 (52%), Gaps = 30/218 (13%)
Query: 9 GNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELS 68
G V +++K L C Y + ++ FL+I P+K+E L LDP +V HD I SE L
Sbjct: 2 GKCQVSKELK--LYCLYNTKDSYFLRIAPVKMEVLSLDPYIVLYHDFILSSEQEF---LK 56
Query: 69 KGKVERGKVVNYGD----TIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
+ER V D Y D +R +K + Y + +I RI+++TNL
Sbjct: 57 AESIERLSVAETVDPDTGKWYADASRTAKAMWFYDT---SSVVIRRINQRIEEITNL--- 110
Query: 124 REERYKGPL-QINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL 182
+ KG L QI +YG+GG + H D +E L DV GGAT+F +++L
Sbjct: 111 --DPEKGDLYQIISYGIGGLFQTHYDYLHENE-----------LQDVPQGGATLFNNISL 157
Query: 183 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+VFP+ G+A+FWYN + ++ + H+GCPV +G+KW
Sbjct: 158 SVFPKAGAALFWYNLNNAGDTEWNVAHTGCPVIVGSKW 195
>gi|344252711|gb|EGW08815.1| Prolyl 4-hydroxylase subunit alpha-2 [Cricetulus griseus]
Length = 584
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 78/239 (32%), Positives = 113/239 (47%), Gaps = 29/239 (12%)
Query: 7 CQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
C+G + + + L C Y N L I P K E+ + P +V+ +D + D EI RI
Sbjct: 294 CRGEGVKLTPQRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERI 353
Query: 65 IELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
E++K K+ R V + G R+SK +L + D P + ++ R+Q +T L +
Sbjct: 354 KEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQHITGLTV 410
Query: 123 GREERYKGPLQ--INNYGLGGHY-------DLHCDATP----------RDEGLWRLASFM 163
E + Q G G DL + P R L+ L S M
Sbjct: 411 KTAELLQSDEQDAFKRLGTGNRVATFLNYGDLRTLSCPQGFVALLSLGRGAKLFALCSQM 470
Query: 164 FYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
+DVE GGAT+FP L ++P+KG+AVFWYN + DYR H+ CPV +G KWGK
Sbjct: 471 ---SDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWGK 526
>gi|195145080|ref|XP_002013524.1| GL24183 [Drosophila persimilis]
gi|194102467|gb|EDW24510.1| GL24183 [Drosophila persimilis]
Length = 296
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 73/212 (34%), Positives = 102/212 (48%), Gaps = 20/212 (9%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
NL C Y + F ++ PLK+E DP VV HD +YD+E+ +I+ ++ ++ R V
Sbjct: 55 NLHCRYHKKGSAFSRLAPLKLEIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRSMVQY 114
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
I + + + + E D L +I R++DMT + R E L I Y
Sbjct: 115 EIRQIEISEQRTSKEAPFTEK-NDPQLLKRIYDRLKDMTGCDMLRSEH----LSILLYDQ 169
Query: 140 GGHYDLHCDATPRDEGLW------------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
GGH+D H D + W R AS +FYL DVE GG T+FP L L + P
Sbjct: 170 GGHHDPHVDY---HDLYWHPQEYEYHPFGDRQASVVFYLNDVEDGGETVFPKLQLVIPPT 226
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
KGSA+ W+N D R H+ CPV G K
Sbjct: 227 KGSALMWHNLRPWGEGDPRTQHASCPVLSGYK 258
>gi|195574593|ref|XP_002105269.1| GD21390 [Drosophila simulans]
gi|194201196|gb|EDX14772.1| GD21390 [Drosophila simulans]
Length = 478
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 69/218 (31%), Positives = 106/218 (48%), Gaps = 31/218 (14%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G +P KS L+C Y + FL++ P+K+E+L +P V HDAI +E ++
Sbjct: 255 CRGKNLLPS--KSYLRCRYLRDGSPFLRLAPVKLEQLNFEPFVGLFHDAISPAEQEDLLH 312
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
L+ ++E + + VDT S DH + ++ RI+D+T + E
Sbjct: 313 LTDSRLEHTRKESSSVEAKVDTNAS-----------DH--VRRMHQRIEDITGFEMEESE 359
Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFP 186
PL + NYG+GG +H D + L+DV++GG FP L P
Sbjct: 360 ----PLTVFNYGIGGQELIHLDCEQPE------------LSDVQMGGYASFPDLGFGFKP 403
Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
+GSA+ W+N + D R + CPV LGN+WGK L
Sbjct: 404 RRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWGKSL 441
>gi|194906709|ref|XP_001981416.1| GG11627 [Drosophila erecta]
gi|190656054|gb|EDV53286.1| GG11627 [Drosophila erecta]
Length = 462
Score = 109 bits (272), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 71/219 (32%), Positives = 108/219 (49%), Gaps = 33/219 (15%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G P KS+L+C Y + FL++ LK+E+L ++P V HDAI +E ++
Sbjct: 239 CRGKNLPPS--KSSLRCRYFREGSPFLRLAALKLEQLNIEPFVGLFHDAILQAEQEDLLR 296
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
L++ ++E K+ + VDT S DH + +I RI+D+T + E
Sbjct: 297 LTESRLEHKKIESSRVEAKVDTNAS-----------DH--VRRIHQRIEDITGFDLEGSE 343
Query: 127 RYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVF 185
PL ++N+G+GG +H D P+ L DV++GG FP L
Sbjct: 344 ----PLTVSNHGIGGQEAIHLDCGQPK-------------LNDVQMGGYASFPDLGFGFK 386
Query: 186 PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
P +GSA+ W+N D R + CPV LGN+WGK L
Sbjct: 387 PVRGSALVWHNTDNCGNCDIRGLQATCPVLLGNQWGKWL 425
>gi|607947|gb|AAA62207.1| prolyl 4-hydroxylase alpha subunit [Caenorhabditis elegans]
Length = 558
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 74/210 (35%), Positives = 104/210 (49%), Gaps = 18/210 (8%)
Query: 21 LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN- 79
L C+Y + +FL P+KVE +P V D I D E+ I EL+K K+ R V +
Sbjct: 302 LYCYYLA-GPSFLVYAPIKVEIKRFNPLAVLFKDVISDDEVAAIQELAKPKLARATVHDS 360
Query: 80 -YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
G + R+SK +L E GD + + RI MTNL + E LQI NYG
Sbjct: 361 VTGKLVTATYRISKSAWL-KEWEGD--VVETVNKRIGYMTNLEMETAEE----LQIANYG 413
Query: 139 LGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
+GGHYD H D ++E R+A+ +FY++ GG T+F T+ P K
Sbjct: 414 IGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEAKSTILPTKND 473
Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
A+FWYN + + H+ CPV +G KW
Sbjct: 474 ALFWYNLYKQGDGNPDTRHAACPVLVGIKW 503
>gi|328718391|ref|XP_003246474.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Acyrthosiphon pisum]
Length = 514
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 77/208 (37%), Positives = 109/208 (52%), Gaps = 16/208 (7%)
Query: 22 KCFYESYNNTFLKI-GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
KC Y++ NN F +I P K E++ +P + HD +YD EI +I L+ K++ KV +
Sbjct: 294 KCRYQT-NNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALEKMKDAKVKSV 352
Query: 81 GDTIYV---DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
Y+ TR +VY+++ + + TRI+ T ERY QI NY
Sbjct: 353 DGKNYLLEEKTRSGQVYWIFE--VDAVEYFDALNTRIESFTGFSTKTAERY----QIVNY 406
Query: 138 GLGGHYDLHCDATPR-DEGLW---RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
GLGGHY H D+ + E + RL + +FYLTDV+ G T FP LN+ EKG+A+
Sbjct: 407 GLGGHYIPHHDSFAKGAENVKFGNRLVTVLFYLTDVQNDGYTSFPMLNIIAPAEKGAALV 466
Query: 194 WYNAH-ANTLLDYRMYHSGCPVALGNKW 220
W N H +N Y H CP+ GNKW
Sbjct: 467 WNNLHMSNGQKFYETLHGSCPLLKGNKW 494
>gi|21358309|ref|NP_651801.1| prolyl-4-hydroxylase-alpha SG2 [Drosophila melanogaster]
gi|20269808|gb|AAM18059.1|AF495537_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG2
[Drosophila melanogaster]
gi|10726875|gb|AAG22175.1| prolyl-4-hydroxylase-alpha SG2 [Drosophila melanogaster]
Length = 527
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/223 (31%), Positives = 115/223 (51%), Gaps = 14/223 (6%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y CQG E L+C+ + + + + PL+VE ++LDP + H + +I
Sbjct: 281 YTRLCQGRRLPEERSGDPLRCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSSKQIL 340
Query: 63 RIIELS-KGKVERGKVVNYGDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I E + K ++ R V G V D R+S+ +L + P + + IQ ++
Sbjct: 341 SIFEEADKEEMVRSAVAGSGGEGTVRDLRVSQQTWLDYK----SPVMNSVGRIIQFVSGF 396
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
+ E +Q+ NYG+GG Y+ H D P++ R+++ MFYL+DVE GG T+
Sbjct: 397 DMAGAEH----MQVANYGVGGQYEPHPDYFEVNLPKNFEGDRISTSMFYLSDVEQGGYTV 452
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
F LN+ + P KG+ V W+N H + +D R H+GCPV +G+K
Sbjct: 453 FTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSK 495
>gi|321458081|gb|EFX69155.1| hypothetical protein DAPPUDRAFT_228756 [Daphnia pulex]
Length = 570
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 76/226 (33%), Positives = 106/226 (46%), Gaps = 22/226 (9%)
Query: 12 SVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGK 71
S P +K LKC S+ + + + PLK+EE L P + HD + D+E I S
Sbjct: 312 SRPTGLKGRLKCRQISHTHPYFILRPLKLEEHSLVPYIAVFHDFMSDAETE--IFKSLAM 369
Query: 72 VERGKVVNYGDT------IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
ER + +G + D R SK ++ G H + +I RI D L
Sbjct: 370 AERLERSAHGSKRPGQGGVTSDKRTSKQSWVED---GSHHVVDQISKRISDSVGLNSQPS 426
Query: 126 ERYKGPLQINNYGLGGHYDLHCD--------ATPRDEGLWR---LASFMFYLTDVELGGA 174
Q+ NYG+GG Y H D P + L+R + +FM YL DVE GGA
Sbjct: 427 NVGSEHYQVANYGIGGRYTPHTDHGVLSKSMGGPSEFDLFRGDRILTFMTYLDDVEAGGA 486
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T+F + V P+KG AVFW+N +++ D H GCPV G+KW
Sbjct: 487 TVFTHAGVVVRPKKGMAVFWWNLKSDSNGDTLTRHGGCPVLHGSKW 532
>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
rubripes]
Length = 540
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 76/229 (33%), Positives = 116/229 (50%), Gaps = 15/229 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTF--LKIGPLKVEELYLDPRVVKIHDAIYD 58
+ Y CQ S P + N + F +++ N L + P++ E L L P VV HD I D
Sbjct: 295 DTYERLCQTRGSQPVHFE-NPQLFCDNFANGHPGLLLRPVRREVLSLRPYVVLYHDFISD 353
Query: 59 SEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
SE I + ++ + R V + R+SK +L H + ++ +I +T
Sbjct: 354 SESEEIKQHAQLGLRRSVVATGDKQATAEYRISKSAWLKGSA---HSTVSRLDQKISMLT 410
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVEL 171
L + + + LQ+ NYG+GGHY+ H D AT ++ R+A+FM YL+ VE
Sbjct: 411 GLNV--QHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTGNRVATFMIYLSSVEA 468
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG+T F N +V K +A+FW+N H N D H+GCPV +G+KW
Sbjct: 469 GGSTAFIYANFSVPVMKNAAIFWWNLHRNGEGDADTLHAGCPVLIGDKW 517
>gi|25012370|gb|AAN71294.1| RE09701p [Drosophila melanogaster]
Length = 301
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/223 (31%), Positives = 115/223 (51%), Gaps = 14/223 (6%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y CQG E L+C+ + + + + PL+VE ++LDP + H + +I
Sbjct: 55 YTRLCQGRRLPEERSGDPLRCYLDGKRHAYFTLAPLQVELVHLDPDINVYHGMLSSKQIL 114
Query: 63 RIIELS-KGKVERGKVVNYGDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I E + K ++ R V G V D R+S+ +L + P + + IQ ++
Sbjct: 115 SIFEEADKEEMVRSAVAGSGGEGTVRDLRVSQQTWLDYK----SPVMNSVGRIIQFVSGF 170
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
+ E +Q+ NYG+GG Y+ H D P++ R+++ MFYL+DVE GG T+
Sbjct: 171 DMAGAEH----MQVANYGVGGQYEPHPDYFEVNLPKNFEGDRISTSMFYLSDVEQGGYTV 226
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
F LN+ + P KG+ V W+N H + +D R H+GCPV +G+K
Sbjct: 227 FTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSK 269
>gi|67084101|gb|AAY66985.1| truncated prolyl 4-hydroxylase alpha subunit [Ixodes scapularis]
Length = 452
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/234 (30%), Positives = 115/234 (49%), Gaps = 21/234 (8%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + S L+C Y + F + P+K+E++ L P ++ + D + + +I
Sbjct: 195 YRRLCRGEALRTPQMDSKLRCRYYKGQDGFFTLHPIKLEKINLKPYIIVMRDVVQERDIE 254
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTR-LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
++ ++ +++R GD TR S +L+ + + P ++ ++ + L
Sbjct: 255 NLMAFAEPRLQRSTTYT-GDGNAPSTRQTSSNAWLWDD---EAPIANRMNWYLRALVGLG 310
Query: 122 IGREERYKGPLQINNYGLGG----HYD-----LHCDATPRD------EGLWRLASFMFYL 166
E Q+ NYG GG HYD LH + D EG RLA+ M Y+
Sbjct: 311 TSGSEYEAEAYQLANYGSGGYFLPHYDYLQDTLHAHNSTADYYLQNNEGD-RLATLMIYM 369
Query: 167 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
TDV+ GGAT+FP L + + P+KG A FW+N A+ D H+GCPV G+KW
Sbjct: 370 TDVKEGGATVFPRLGVRLVPKKGDAAFWWNLKASGEGDTLTMHAGCPVLYGSKW 423
>gi|328718393|ref|XP_001945742.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Acyrthosiphon pisum]
Length = 511
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 77/208 (37%), Positives = 109/208 (52%), Gaps = 16/208 (7%)
Query: 22 KCFYESYNNTFLKI-GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
KC Y++ NN F +I P K E++ +P + HD +YD EI +I L+ K++ KV +
Sbjct: 291 KCRYQT-NNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALEKMKDAKVKSV 349
Query: 81 GDTIYV---DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
Y+ TR +VY+++ + + TRI+ T ERY QI NY
Sbjct: 350 DGKNYLLEEKTRSGQVYWIFE--VDAVEYFDALNTRIESFTGFSTKTAERY----QIVNY 403
Query: 138 GLGGHYDLHCDATPR-DEGLW---RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
GLGGHY H D+ + E + RL + +FYLTDV+ G T FP LN+ EKG+A+
Sbjct: 404 GLGGHYIPHHDSFAKGAENVKFGNRLVTVLFYLTDVQNDGYTSFPMLNIIAPAEKGAALV 463
Query: 194 WYNAH-ANTLLDYRMYHSGCPVALGNKW 220
W N H +N Y H CP+ GNKW
Sbjct: 464 WNNLHMSNGQKFYETLHGSCPLLKGNKW 491
>gi|198452400|ref|XP_002137470.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
gi|198131917|gb|EDY68028.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
Length = 348
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 75/222 (33%), Positives = 106/222 (47%), Gaps = 18/222 (8%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G S L C Y + + F ++ PLK+E DP VV HD +YD+E+ +I+
Sbjct: 111 CRGAFPTKSHHHS-LHCRYHNKGSAFSRLAPLKLEIFSHDPYVVIYHDVLYDAEMQGLID 169
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
++ ++ R V I + + + + E D L +I R++DMT + R E
Sbjct: 170 STRRRMSRSMVQYEIRQIEISEQRTSKEAPFTEK-NDPQLLKRIYDRLKDMTGCDMLRSE 228
Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLW---------RLASFMFYLTDVELGGATIF 177
L I Y GGH+D H D + W R AS +FYL DVE GG T+F
Sbjct: 229 H----LSILLYDQGGHHDPHVDY---HDLYWEYEYHPFGDRQASVVFYLNDVEDGGETVF 281
Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
P L L + P KGSA+ W+N D R H+ CPV G K
Sbjct: 282 PKLQLVIPPTKGSALMWHNLRPWGEGDPRTQHASCPVLSGYK 323
>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
niloticus]
Length = 517
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 73/208 (35%), Positives = 105/208 (50%), Gaps = 12/208 (5%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
L C Y + NN L + P + E + L P VV HD + D+E I L+ + R V
Sbjct: 292 QLFCDYFTNNNPALMLMPARRELVSLQPYVVLYHDFVTDTEAEDIKSLAHPGLRRSVVAA 351
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
D R+SK +L + K+ RI +T L + + Y LQ+ NYG+
Sbjct: 352 GEKQATADYRISKSAWLKGSA---QSIVGKLDQRISLLTGLNV--KHPYGEYLQVVNYGI 406
Query: 140 GGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D AT ++ R+A+FM YL+ VE GG+T F N +V + +A+
Sbjct: 407 GGHYEPHFDHATSPSSPVFKLKTGNRVATFMIYLSPVEAGGSTAFIYANFSVPVVEKAAI 466
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FW+N H N D H+GCPV +G+KW
Sbjct: 467 FWWNLHRNGEGDDDTLHAGCPVLIGDKW 494
>gi|449485593|ref|XP_004175686.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3 [Taeniopygia guttata]
Length = 567
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 72/208 (34%), Positives = 106/208 (50%), Gaps = 12/208 (5%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ N+ FL + P K E +++ P V HD I D+E I L+ ++R V +
Sbjct: 342 HLSCSYETNNSPFLLLQPAKKEMVWIQPHVALYHDFITDAEAETIKGLAGPWLQRSVVAS 401
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ +SK +L + P ++ + RI +T L + Y LQ+ NYGL
Sbjct: 402 GEKQQKAEYWISKSTWLKDTV---DPVVHALDQRIIAVTGLDLWPP--YAEYLQVVNYGL 456
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
GGHY+ H D AT L+R+ A+ M YL+ VE GG+T N +V K +A+
Sbjct: 457 GGHYEPHFDHATSTKSPLYRMKSGNRNATVMIYLSAVEAGGSTALIYTNFSVPVVKNAAL 516
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
FW+N N D H+GCPV G+KW
Sbjct: 517 FWWNLRRNGNGDGDTLHAGCPVLAGDKW 544
>gi|195575095|ref|XP_002105515.1| GD21523 [Drosophila simulans]
gi|194201442|gb|EDX15018.1| GD21523 [Drosophila simulans]
Length = 527
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 71/223 (31%), Positives = 114/223 (51%), Gaps = 14/223 (6%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y CQG E L C+ + + + + PL+VE ++LDP + H + +I
Sbjct: 281 YTRLCQGRRLPEERSGDPLSCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSSKQIL 340
Query: 63 RIIELS-KGKVERGKVVNYGDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I E + K ++ R V G V D R+S+ +L + P + + IQ ++
Sbjct: 341 SIFEEADKEEMVRSAVAGDGGKRTVRDLRVSQQTWLDYK----SPVMNSVSRIIQFVSGF 396
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
+ E +Q+ NYG+GG Y+ H D P++ R+++ MFYL+DVE GG T+
Sbjct: 397 DMAGAEY----MQVANYGVGGQYEPHPDYFEVNLPKNFEGDRISTSMFYLSDVEQGGYTV 452
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
F LN+ + P KG+ V W+N H + +D R H+GCPV +G+K
Sbjct: 453 FTKLNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIVGSK 495
>gi|195441323|ref|XP_002068462.1| GK20483 [Drosophila willistoni]
gi|194164547|gb|EDW79448.1| GK20483 [Drosophila willistoni]
Length = 550
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 73/218 (33%), Positives = 112/218 (51%), Gaps = 28/218 (12%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+NL C Y + FL++ P+K+EE+ LDP +V+ HD + D+EI + K + +G ++
Sbjct: 320 TNLVCRYNFTTSPFLQLAPMKLEEISLDPYIVQYHDVLSDNEIEDL----KREGIKGTMI 375
Query: 79 NYGDTIYVD--------TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKG 130
N ++ T +++V + P + + +I RI DMT I +
Sbjct: 376 NGWTSLKSSNATENESRTIVARVAIMSPSL----EIVQRINRRIIDMTGFNIEESK---- 427
Query: 131 PLQINNYGLGG----HYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNL 182
+Q+ + +GG HYD D + L R+AS +FY DV GGAT FP L
Sbjct: 428 TIQLAAFSVGGFFMPHYDYLYDRLLDTDVLKKLGDRVASVIFYAGDVTEGGATNFPRNQL 487
Query: 183 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
V P+KGSA+FWYN + D R HS CPV +G++W
Sbjct: 488 VVQPKKGSALFWYNKFDDGSPDPRSLHSICPVVVGSRW 525
>gi|26352077|dbj|BAC39675.1| unnamed protein product [Mus musculus]
Length = 383
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 72/221 (32%), Positives = 111/221 (50%), Gaps = 20/221 (9%)
Query: 1 EIYPLACQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y CQ S P + +L C YE+ ++ +L + P + E ++L P + HD + D
Sbjct: 159 DTYEGLCQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDE 218
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
E +I EL++ ++R V + + V+ R+SK +L + P L + RI +T
Sbjct: 219 EAQKIRELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPMLVTLDHRIAALTG 275
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
L I + Y LQ+ NYG+GGHY+ H D L+ VE GGAT F
Sbjct: 276 LDI--QPPYAEYLQVVNYGIGGHYEPHFDHAT--------------LSSVEAGGATAFIY 319
Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
N +V K +A+FW+N H + D H+GCPV +G+KW
Sbjct: 320 GNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 360
>gi|195505197|ref|XP_002099400.1| GE10884 [Drosophila yakuba]
gi|194185501|gb|EDW99112.1| GE10884 [Drosophila yakuba]
Length = 527
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 110/223 (49%), Gaps = 14/223 (6%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y CQG E LKC+ + + + + PL+VE ++LDP + H + I
Sbjct: 281 YTRLCQGRRLPEERSGDPLKCYLDGKRHAYFILAPLQVEPVHLDPDINVYHGMLSSKHIQ 340
Query: 63 RIIELS-KGKVERGKVVNYGDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I E + K ++ R V G V D R+S+ +L + + + +
Sbjct: 341 SIFEEADKKEMVRSAVAGDGGARTVKDLRVSQQTWL--------DYKSPVMKSVGRIIEF 392
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
V G + +Q+ NYG+GG Y+ H D P + R+++ MFYL+DVE GG T+
Sbjct: 393 VSGFDMAGAEFMQVANYGVGGQYEPHPDYFEVNLPEEFIGDRISTSMFYLSDVEQGGYTV 452
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
F LN+ + P KG+ V W+N H + +D R H+GCPV +G+K
Sbjct: 453 FTKLNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIVGSK 495
>gi|24666354|ref|NP_730347.1| CG32199 [Drosophila melanogaster]
gi|23093193|gb|AAF49251.3| CG32199 [Drosophila melanogaster]
Length = 509
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 73/225 (32%), Positives = 106/225 (47%), Gaps = 17/225 (7%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN--RI 64
C+G P+ L C Y + FLK+ PLK+E L + P ++ HD +Y++E R
Sbjct: 272 CRGEW--PKKSSPELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEFKSMRD 329
Query: 65 IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
I + G + G D + +V + PF I R+ DM+ G
Sbjct: 330 IAMYNGSMIDGWTYVDFDKKGNPKQQDRVVKMIAFQGTTAPFTLSINRRMADMS----GL 385
Query: 125 EERYKGPLQINNYGLGGHYDLHCDATP---------RDEGLWRLASFMFYLTDVELGGAT 175
E R L + NYGLGGH+ H D D G R+A+ + Y +D+ LGG T
Sbjct: 386 EMRDNMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFGGDRIATALIYASDIPLGGTT 445
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+F L + V P+KGSA+ W+N + D HS CPV LG++W
Sbjct: 446 VFTKLKIAVQPKKGSALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490
>gi|195352184|ref|XP_002042594.1| GM14981 [Drosophila sechellia]
gi|194124478|gb|EDW46521.1| GM14981 [Drosophila sechellia]
Length = 539
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 75/235 (31%), Positives = 110/235 (46%), Gaps = 37/235 (15%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G S L C Y + FLK+ PLK+E L + P ++ HD +Y++E + +
Sbjct: 302 CRGEWSRKSS--PELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEFKSMRD 359
Query: 67 LSKGKVERGKVVNYGDTI-----YVD-------TRLSKVYFLYPEIFGDHPFLYKIQTRI 114
L+ Y D++ YVD + +V + PF I R+
Sbjct: 360 LAM----------YNDSMIDGWTYVDFDKKGNPKQQDRVVKIISFQGTTAPFTLSINRRL 409
Query: 115 QDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP---------RDEGLWRLASFMFY 165
DM+ G E R L + NYGLGGH+ H D D G R+A+ +FY
Sbjct: 410 ADMS----GLEMRENMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFGGDRIATALFY 465
Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+DV LGG T+F L + V P+KG+A+ W+N + D HS CPV LG++W
Sbjct: 466 ASDVPLGGTTVFTKLKIAVKPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 520
>gi|195575139|ref|XP_002105537.1| GD21537 [Drosophila simulans]
gi|194201464|gb|EDX15040.1| GD21537 [Drosophila simulans]
Length = 536
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 77/213 (36%), Positives = 107/213 (50%), Gaps = 10/213 (4%)
Query: 15 EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
E S L C Y + FL++ P ++EEL LDP VV H+ + D EI ++ +S+ +ER
Sbjct: 291 ESKPSRLHCRYNTTTTPFLRLAPFRMEELSLDPYVVFYHNVLSDPEIEKLKPMSEPFLER 350
Query: 75 GKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
KV V G TR + +L P D P ++ RI + G R +
Sbjct: 351 AKVFRVEKGSDEIAPTRSADGAWL-PHQDTD-PDDLEVLRRIGRRIRDITGLNTRSGSQM 408
Query: 133 QINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
Q YG GGH+ H D T E + R+A+ +FYL +V+ GGAT FP LNL V +
Sbjct: 409 QFLKYGFGGHFVPHYDYFNSKTSYLERVGDRMATVLFYLNNVDHGGATAFPKLNLVVPTQ 468
Query: 188 KGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
KGSA+FW+N + D R H CP+ G K
Sbjct: 469 KGSALFWHNLDRKSYDYDTRTSHGACPLISGTK 501
>gi|328718395|ref|XP_003246475.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
pisum]
Length = 518
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 75/208 (36%), Positives = 107/208 (51%), Gaps = 16/208 (7%)
Query: 22 KCFYESYNNTFLKI-GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
KC Y++ NN F +I P K E++ +P + HD +YD EI +I L+ ++ V +
Sbjct: 294 KCRYQT-NNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALENMKDATVKSV 352
Query: 81 ---GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
GD++ TR +VY++ +L + TRI+ T E+Y QI NY
Sbjct: 353 DGKGDSLIEKTRSGQVYWISK--VDAVEYLDALDTRIESFTGFSTKTAEQY----QIVNY 406
Query: 138 GLGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
GLGGHY H D+ + RL + +FYLTDV+ G T FP LN+ EKG+A+
Sbjct: 407 GLGGHYLPHHDSFAKAINCLQFGNRLVTVLFYLTDVQNDGYTSFPLLNIIAPAEKGAALV 466
Query: 194 WYNAH-ANTLLDYRMYHSGCPVALGNKW 220
W N H +N Y H CP+ GNKW
Sbjct: 467 WNNLHMSNGQKFYESLHGSCPLLKGNKW 494
>gi|195591304|ref|XP_002085382.1| GD14758 [Drosophila simulans]
gi|194197391|gb|EDX10967.1| GD14758 [Drosophila simulans]
Length = 509
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 109/235 (46%), Gaps = 37/235 (15%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G P L C Y + FLK+ PLK+E L + P ++ HD +Y++E + +
Sbjct: 272 CRGEW--PRKSSPELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEFKSMRD 329
Query: 67 LSKGKVERGKVVNYGDTI-----YVD-------TRLSKVYFLYPEIFGDHPFLYKIQTRI 114
+ Y D++ YVD + +V + PF I R+
Sbjct: 330 ----------IAMYNDSMIDGWTYVDFDKKGNPKQQDRVVKIISFQGTTAPFTLSINRRL 379
Query: 115 QDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP---------RDEGLWRLASFMFY 165
DM+ G E R L + NYGLGGH+ H D D G R+A+ +FY
Sbjct: 380 ADMS----GLEMRENMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFGGDRIATAVFY 435
Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+DV LGG T+F L + V P+KG+A+ W+N + D HS CPV LG++W
Sbjct: 436 ASDVPLGGTTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490
>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
4-dioxygenase (proline 4-hydroxylase), alpha 1
polypeptide [Ciona intestinalis]
Length = 195
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 66/174 (37%), Positives = 94/174 (54%), Gaps = 16/174 (9%)
Query: 56 IYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
+ D E+ I L+K ++ R V N G + R+SK +L E DHP + ++ R
Sbjct: 1 MSDKEMAMIKSLAKPRLRRATVQNPVTGVLEFAHYRVSKSAWLKDE---DHPVIKRVCQR 57
Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR-------DEGLWRLASFMFYL 166
I D+T L + E LQI NYG+GG Y+ H D + + DE R+A+F+ Y+
Sbjct: 58 ISDVTGLSMETAEE----LQIANYGVGGQYEPHFDYSRKSDFGKFDDEVGNRIATFLTYM 113
Query: 167 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
++VE GG+T+F + V P KGSAVFWYN + D R H+ CPV G KW
Sbjct: 114 SNVEQGGSTVFLHPGIAVRPIKGSAVFWYNLLPSGAGDERTRHAACPVLTGVKW 167
>gi|195159299|ref|XP_002020519.1| GL13471 [Drosophila persimilis]
gi|194117288|gb|EDW39331.1| GL13471 [Drosophila persimilis]
Length = 238
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 68/228 (29%), Positives = 113/228 (49%), Gaps = 19/228 (8%)
Query: 6 ACQGNLSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
C G P + +L CFY + + FL + ++ E L DP + +D + S++ +
Sbjct: 19 CCNGLCKGPRN--RHLHCFYLTKRGSPFLLLARVRTEILSDDPFIALYYDVLTHSDMVSL 76
Query: 65 IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR----IQDMTNL 120
S+ + + Y + + +F++ E P + + R + D+T L
Sbjct: 77 RNTSEPLLHPATTIQYFNAPQELSNSRTAHFVWLE-----PTITEATRRADRVLWDVTGL 131
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIF 177
+ E + Q+NNYG+GG + H D + R+A+ +FYL+DV GGAT+F
Sbjct: 132 NLSNSEMF----QVNNYGIGGSFMRHSDLLHSERNYLVRERIATAIFYLSDVPQGGATLF 187
Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLL 225
LN+TVFP+ G+ +FWYN + D R H+GCPV +G+KW + L
Sbjct: 188 TELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKWSRFSL 235
>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 267
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 106/227 (46%), Gaps = 31/227 (13%)
Query: 18 KSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
+S L C + + FL + P K+E L DPR+V D + E +S+ K+ R K
Sbjct: 28 QSKLLCKISTIGGHPFLVLQPFKIEVLSEDPRIVVFPDFLNPRECEIFRSISQEKLSRAK 87
Query: 77 VVNYG--DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
V G + + R +KV ++ ++ HP L K+ RI T L + E Y Q+
Sbjct: 88 VYLGGPPEGGFSLRRTNKVAWMSDDL---HPLLGKVSRRIALATGLTLTSAEMY----QV 140
Query: 135 NNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELGGATIFPSLNLTVFP 186
NYGLGGHY H D E RLA+ + YL DV GGAT F ++ L V P
Sbjct: 141 ANYGLGGHYIPHPDYAGFGEAQGDIYKSSGNRLATMLIYLADVAGGGATAFINMRLAVKP 200
Query: 187 EKGSAVFWYNA-------------HANTLLDYRMYHSGCPVALGNKW 220
G+A+FWYN + D R +H GCPV G+KW
Sbjct: 201 TLGTALFWYNLKPYDGPIVNESFWNQRRFGDPRTFHMGCPVLTGSKW 247
>gi|20177086|gb|AAM12247.1| AT28279p [Drosophila melanogaster]
Length = 509
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 72/225 (32%), Positives = 106/225 (47%), Gaps = 17/225 (7%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN--RI 64
C+G P+ L C Y + FLK+ PLK+E L + P ++ HD +Y++E R
Sbjct: 272 CRGEW--PKKSSPELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEFKSMRD 329
Query: 65 IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
I + G + G D + +V + PF I R+ DM+ G
Sbjct: 330 IAMYNGSMIDGWTYVDFDKKGNPKQQDRVVKMIAFQGTTAPFTLSINRRMADMS----GL 385
Query: 125 EERYKGPLQINNYGLGGHYDLHCDATP---------RDEGLWRLASFMFYLTDVELGGAT 175
E R L + NYGLGGH+ H D D G R+A+ + Y +D+ LGG T
Sbjct: 386 EMRDNMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFGGDRIATALIYASDIPLGGTT 445
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+F L + V P+KG+A+ W+N + D HS CPV LG++W
Sbjct: 446 VFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490
>gi|390176894|ref|XP_002136933.2| GA26862 [Drosophila pseudoobscura pseudoobscura]
gi|388858830|gb|EDY67491.2| GA26862 [Drosophila pseudoobscura pseudoobscura]
Length = 520
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 112/222 (50%), Gaps = 19/222 (8%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
C G P + +L CFY + + FL + ++ E L DP + +D + S++ +
Sbjct: 282 CNGLCKGPRN--RHLHCFYLTKRGSPFLLLARVRTEILSDDPFIALYYDVLTHSDMVSLR 339
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR----IQDMTNLV 121
S+ + + Y + + +F++ E P + + R + D+T L
Sbjct: 340 NTSEPLLHPATTIQYLNAPQELSNSRTAHFVWLE-----PTITEATRRADRVLWDVTGLN 394
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFP 178
+ E++ Q+NNYG+GG + H D + R+A+ +FYL+DV GGAT+F
Sbjct: 395 LSNSEKF----QVNNYGIGGSFMRHSDPLHSERNYLVRERIATAIFYLSDVPQGGATLFT 450
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
LN+TVFP+ G+ +FWYN + D R H+GCPV +G+KW
Sbjct: 451 ELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKW 492
>gi|405967005|gb|EKC32220.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 303
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 69/225 (30%), Positives = 106/225 (47%), Gaps = 29/225 (12%)
Query: 17 IKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
++S L+C+ T + I K E + PR+ HD I + +I ++ + K+ +
Sbjct: 52 VESKLRCYLR---KTAIPIYMAKEEVVNYTPRISLFHDVISNDDIRQLKKAGTKKLTHSR 108
Query: 77 VVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKG--PLQI 134
G R+S+ ++Y + ++ RI ++ NL + P Q+
Sbjct: 109 T---GGGYVTRLRVSQTGWVYDQAIPQ--VSRRLARRIANIVNLDTTFRSKASPVEPWQV 163
Query: 135 NNYGLGGHYDLHCDATPRDEGLW-------------------RLASFMFYLTDVELGGAT 175
+Y GG+Y H D DE LW R+A++MFYL+DVE GGAT
Sbjct: 164 LSYTTGGYYGEHIDPDIGDEFLWNMTEAVQGPRALWRKHTGQRIATWMFYLSDVEAGGAT 223
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+FP L V KG+A FWYN + +D R H+GCPV LG+KW
Sbjct: 224 VFPKLEARVPVVKGAAAFWYNLTPSGKIDRRTQHAGCPVILGSKW 268
>gi|442762205|gb|JAA73261.1| Putative prolyl 4-hydroxylase alpha subunit, partial [Ixodes
ricinus]
Length = 482
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/236 (28%), Positives = 112/236 (47%), Gaps = 25/236 (10%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G L + S L+C Y + + P+K+EE+ L P +V +HD + D +I
Sbjct: 225 YKRLCRGELLRTPKMDSKLRCRYYKGHGGSFTLHPIKLEEVNLKPYIVVMHDVVQDRDIE 284
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM----T 118
+ ++ +++ + R S ++ + + P K+ ++ + T
Sbjct: 285 DLRAFAEPRLQTSLTYDVPGVESPAVRTSSNAWMDEK---NAPVATKLNKFLRSLLGMGT 341
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLH---------CDATPRDEGLW-----RLASFMF 164
+ G E+Y Q+ NYG GGH+ H D P + R+A+ M
Sbjct: 342 SYSDGEAEKY----QLANYGTGGHFLTHPDYLGDLFENDTDPSEFEFHKKVGDRVATLMI 397
Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
Y++DVE GGAT+FP L + + P+KG A FW+N AN + H+GCPV G+KW
Sbjct: 398 YMSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCPVLYGSKW 453
>gi|195452772|ref|XP_002073493.1| GK14149 [Drosophila willistoni]
gi|194169578|gb|EDW84479.1| GK14149 [Drosophila willistoni]
Length = 496
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/213 (31%), Positives = 103/213 (48%), Gaps = 14/213 (6%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ L C Y + FL++ P ++EEL LDP +V ++ + D EI ++ L+ +++ +
Sbjct: 264 TKLHCRYNTTTTPFLRLAPFRMEELSLDPYIVAYYNVLSDQEITQLDRLTATLLKKTFAI 323
Query: 79 NYGDTIYVDTRLSKVYFL----YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
D + R + + P + + +I + D+T L + + + Q
Sbjct: 324 GPDDDYDDNARTADGAWFPNNETPRTEENIQLIERIINLVSDLTGLQGDKADSF----QA 379
Query: 135 NNYGLGGHYDLHCD--ATPRDEGLW---RLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
YG GGHY H D D+ + RLA+ FYL V+ GGAT+FPSLNL V EKG
Sbjct: 380 VRYGFGGHYTPHFDYLNMSIDQTAFYGDRLATVFFYLNTVKHGGATVFPSLNLKVPAEKG 439
Query: 190 SAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 221
+FWYN + D H GCPV G K G
Sbjct: 440 KVLFWYNLDGESFDFDENTEHGGCPVVDGIKLG 472
>gi|292621357|ref|XP_691737.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Danio rerio]
Length = 538
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 73/226 (32%), Positives = 114/226 (50%), Gaps = 13/226 (5%)
Query: 3 YPLACQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y CQ S P+ ++ +L C Y + + L + P++ E + L P VV H + +E
Sbjct: 295 YEQLCQTKGSQPKHFENPSLFCDYFTNGSPALFLQPIRREIISLQPYVVLFHGFVTQAEA 354
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
I + + + R V + + + R+SK +L H + K+ RI +T L
Sbjct: 355 KNIRKYAMPGLRRSVVASGMNQATAEYRISKSAWLKE---SAHEVVGKLDQRITLVTGLN 411
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGA 174
+ + Y LQ+ NYG+GGHY+ H D AT L+RL A+ M YL+ V+ GG+
Sbjct: 412 V--QPPYAEYLQVVNYGIGGHYEPHFDHATSDSSPLYRLKTGNRVATIMIYLSPVQAGGS 469
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T F N +V + +A+FW+N H N + H+GCPV +GNKW
Sbjct: 470 TAFIYANFSVPVVQNAALFWWNLHKNGQGNVDTLHAGCPVIVGNKW 515
>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
Length = 285
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 73/229 (31%), Positives = 113/229 (49%), Gaps = 15/229 (6%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTF--LKIGPLKVEELYLDPRVVKIHDAIYD 58
+ Y C+ S P + N + F +++ N L + P + E L L P VV HD I D
Sbjct: 40 DTYERLCRTRGSQPTHFE-NPQLFCDNFANGHPGLLLRPARRETLSLQPYVVLYHDFISD 98
Query: 59 SEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+E I ++ + R V + + R+SK +L + ++ RI +T
Sbjct: 99 TEAEEIKHHAQLGLRRSVVATRDKQVTAEYRISKSAWLKGSA---QSAVSRLDQRISMLT 155
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVEL 171
L + + + LQ+ NYG+GGHY+ H D AT ++ R+A+ M YL+ VE
Sbjct: 156 GLNV--QHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTGNRVATVMIYLSSVEA 213
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG+T F N +V K +A+FW+N H N D H+GCPV +G+KW
Sbjct: 214 GGSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLIGDKW 262
>gi|195391756|ref|XP_002054526.1| GJ24503 [Drosophila virilis]
gi|194152612|gb|EDW68046.1| GJ24503 [Drosophila virilis]
Length = 519
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 72/229 (31%), Positives = 112/229 (48%), Gaps = 21/229 (9%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y CQG LS P+ S L C+ + + ++ PLKVE++ L+P + +D I D +I
Sbjct: 283 YTQLCQGKRLSEPKPNGSALNCYLDFTRHARFRLAPLKVEQVRLNPDIHIYYDLINDDQI 342
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
+ I E+ + +I D R+S+ +L + I + ++ +
Sbjct: 343 DDIYEVVDQF--DSFRSSVSSSIVTDWRVSQQVWL--------NYSSPILRSVSNLVGAI 392
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGATI 176
G + +Q+ NYG+GG Y H D + + R+A+ MFYL+DV GG T+
Sbjct: 393 SGFDMENAEQMQVANYGIGGQYAPHTDYLSKIPDSYIPRGNRIATNMFYLSDVLNGGYTV 452
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV-----ALGNKW 220
FP LN+ + P KG+ V WYN H + D R H+GCPV +GN W
Sbjct: 453 FPKLNVFLKPVKGAMVSWYNLHRSLNKDSRTLHAGCPVIEGVKRIGNIW 501
>gi|195110921|ref|XP_002000028.1| GI24861 [Drosophila mojavensis]
gi|193916622|gb|EDW15489.1| GI24861 [Drosophila mojavensis]
Length = 508
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 71/219 (32%), Positives = 119/219 (54%), Gaps = 20/219 (9%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
CQG +PE +L C+ + + ++ PLKVE+ +L+P + +D + D +I +++
Sbjct: 280 CQGK-RLPE--PGSLSCYLDFERHPRFRLSPLKVEQAHLNPDIHIYYDVLTDPQIESVLD 336
Query: 67 L-SKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L S+ + R KV+ GD + +TR+S+ +L + I + ++ + G +
Sbjct: 337 LASQLESFRSKVL--GDVV-TETRVSQQVWLN--------YTSPIMRTVGNLLGAISGLD 385
Query: 126 ERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLWR---LASFMFYLTDVELGGATIFPSL 180
+Q+ NYG+GG Y H D + R++ + R + + MFYL+DV GG T+FP L
Sbjct: 386 MTNVEEMQVANYGIGGQYFPHFDYISELREDYIERGNRITTNMFYLSDVLQGGYTVFPFL 445
Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
N+ + P KGS V W N H + D R+ H+GCPV G+K
Sbjct: 446 NVFLRPVKGSLVIWPNVHRSLAPDSRVLHAGCPVLEGSK 484
>gi|443705944|gb|ELU02240.1| hypothetical protein CAPTEDRAFT_227850 [Capitella teleta]
Length = 475
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 110/219 (50%), Gaps = 19/219 (8%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C + N++ G K E L+ +P + HD I DSEI R+ ++++ + + V++
Sbjct: 160 DLFCLNKQMRNSY---GLWKTELLHANPEIYLFHDFISDSEIQRLKDMAEPQFQSSAVLD 216
Query: 80 Y--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP--LQIN 135
G++ + +RLS F + + + + R+ +T L + + LQ+
Sbjct: 217 DTGGESFFDVSRLSSTAF----VNDSNDLVASLNRRVSKLTGLQTEVLDSFSESESLQVL 272
Query: 136 NYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
YG GG Y H D + L R+A+F+ YL GGAT+FP L +++ +
Sbjct: 273 RYGPGGLYTPHYDTLGSEADLPPYIQHTGDRIATFILYLDIATAGGATVFPLLPMSIPIQ 332
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLLS 226
KG+A FW+N H + LD R H+ CPV G KW +++S
Sbjct: 333 KGAAAFWFNLHPDGSLDRRTLHAACPVIRGTKWECVIVS 371
>gi|195159309|ref|XP_002020524.1| GL13466 [Drosophila persimilis]
gi|194117293|gb|EDW39336.1| GL13466 [Drosophila persimilis]
Length = 643
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 111/222 (50%), Gaps = 19/222 (8%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
C G P + +L CFY + + FL + ++ E L DP + +D + S++ +
Sbjct: 405 CNGLCKGPRN--RHLHCFYLTKRGSPFLLLARVRTEILSDDPFIALYYDVLTHSDMVSLR 462
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR----IQDMTNLV 121
S+ + + Y + + +F++ E P + + R + D+T L
Sbjct: 463 NTSEPLLHPATTIQYFNAPQELSNSRTAHFVWLE-----PTITEATRRADRVLWDVTGLN 517
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFP 178
+ E + Q+NNYG+GG + H D + R+A+ +FYL+DV GGAT+F
Sbjct: 518 LSNSEMF----QVNNYGIGGSFMRHSDLLHSERNYLVRERIATAIFYLSDVPHGGATLFT 573
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
LN+TVFP+ G+ +FWYN + D R H+GCPV +G+KW
Sbjct: 574 ELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKW 615
>gi|312385117|gb|EFR29691.1| hypothetical protein AND_01144 [Anopheles darlingi]
Length = 295
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 71/225 (31%), Positives = 112/225 (49%), Gaps = 21/225 (9%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G P + S L+C+Y++ N+ + IGP KVE L +P V +D I+DSEI R+ E
Sbjct: 45 CKGTYQRPVGLTSWLRCWYDARNDHSV-IGPRKVEMLNYEPFVALFYDVIHDSEITRLQE 103
Query: 67 LSKGKVE-RGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L G ++ G + +Y + + Y L D P + ++ R + M+ L
Sbjct: 104 LGDGVIKVSGATTDGWLPVYYENH--QTYTLQNR---DDPVVKRLSQRTERMSGLSCDTA 158
Query: 126 ERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDV---ELGGAT 175
E L++ +G + D + RLA+ +F+++DV E GG
Sbjct: 159 ED----LKVIYNEVGAYKSFIVDGKKKSSVAQQFAFAGKRLATVLFFMSDVDGAEGGGRI 214
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP L L+V P+KG+A+FWYN H + D RM +S CP+ N+W
Sbjct: 215 AFPYLGLSVLPQKGAALFWYNLHDSGRPDERMTYSICPLLADNRW 259
>gi|313243209|emb|CBY39868.1| unnamed protein product [Oikopleura dioica]
Length = 430
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 68/216 (31%), Positives = 108/216 (50%), Gaps = 17/216 (7%)
Query: 18 KSNLKCFYESYNN--TFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERG 75
KSNLKCFY + + + L+ P+K EEL+ DP VV+ ++ I D E I L+ + R
Sbjct: 191 KSNLKCFYWTGPSPVSPLQWAPVKTEELHDDPLVVQFYEVISDEEERAIQFLAGEHLNRA 250
Query: 76 KVVN--YGDTIYVDTRLSKVYFLYP-EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
+ + G + D R+ K +L + F + + K ++ +T L E +
Sbjct: 251 TIQDPATGKLVNADYRIQKTAWLTEFDKFDVNGTIAKYNAKLTKITGLDADHAEL----V 306
Query: 133 QINNYGLGGHYDLHCD--ATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTV 184
Q+ NYG+ G Y+ H D + P E W R+A+++ Y+++ +GG T+F +
Sbjct: 307 QVGNYGVAGQYEPHWDHQSYPGAENRWDPIEGSRIATWLAYMSEPNMGGGTVFIQAGIQA 366
Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
P + SAVFWYN + D H+ CPV G KW
Sbjct: 367 RPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGTKW 402
>gi|195505241|ref|XP_002099419.1| GE10893 [Drosophila yakuba]
gi|194185520|gb|EDW99131.1| GE10893 [Drosophila yakuba]
Length = 508
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 70/231 (30%), Positives = 115/231 (49%), Gaps = 20/231 (8%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y C+ + S P S L C Y S + FL + P K+EE+ L+P +V HD + D +
Sbjct: 262 EDYKRLCRSSFS-PR--PSKLLCRYNSDTSPFLILAPFKMEEISLEPYIVVYHDILPDKD 318
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDH-----PFLYKIQTRIQ 115
+ ++I L++ ++ +V + + S + P F D P L ++ R++
Sbjct: 319 MQQLIALAEPRLRPTEVFEEDKSEARTSDRSALGTFLP--FKDMNPSGGPLLDRLTQRMR 376
Query: 116 DMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVE 170
D+T + I R++ I YG G Y + D EG R+A+ +FYL D
Sbjct: 377 DITGIQI----RHENTFNIIKYGFGSQYATNFDFFNGTNSEMEGYGDRMATVLFYLNDAP 432
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKW 220
GGAT+FP +++ V E+G +FW+N + T ++ H+ CPV G+KW
Sbjct: 433 NGGATVFPRIDVKVTAERGKVLFWHNLNGETHDVEPNTLHAACPVFQGSKW 483
>gi|198477148|ref|XP_002136736.1| GA29214 [Drosophila pseudoobscura pseudoobscura]
gi|198145041|gb|EDY71753.1| GA29214 [Drosophila pseudoobscura pseudoobscura]
Length = 520
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 68/222 (30%), Positives = 111/222 (50%), Gaps = 19/222 (8%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
C G P + +L CFY + + FL + ++ E L DP +V +D + S++ +
Sbjct: 282 CNGLCKGPRN--RHLHCFYLTKRGSPFLLLARVRTEILSDDPFIVLYYDVLTHSDMVSLR 339
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR----IQDMTNLV 121
S+ + + Y + + +F++ E P + + R + D+T L
Sbjct: 340 NTSEPLLHPATTIQYLNAPQELSNSRTAHFVWLE-----PTITEATRRADRVLWDVTGLN 394
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFP 178
+ E + Q+NNYG+GG + H D + R+A+ +FYL+DV GGAT+F
Sbjct: 395 LSNSEMF----QVNNYGIGGSFMRHSDLLHSERNYLVRERIATAIFYLSDVPQGGATLFT 450
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
LN+TVFP+ G+ +FWYN + D R H+GCPV G+KW
Sbjct: 451 ELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIGGSKW 492
>gi|195341582|ref|XP_002037385.1| GM12897 [Drosophila sechellia]
gi|194131501|gb|EDW53544.1| GM12897 [Drosophila sechellia]
Length = 467
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 69/225 (30%), Positives = 113/225 (50%), Gaps = 26/225 (11%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y C+ + S S L C Y S + FL + LK+EE+ L+P +V HD + D +
Sbjct: 261 EDYKRLCRSSFS---PTPSKLHCRYNSTTSRFLILASLKMEEISLEPYIVAYHDILPDKD 317
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I ++I L++ ++ +V + + + + P L ++ R++D+T L
Sbjct: 318 IQQLITLAEPLLKPIEVFDENKNEAKSSDRTSL---------GGPLLDRLTERMRDITGL 368
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-RLASFMFYLTDVELGGATIFPS 179
I + P+ I YG G H + EG R+A+ MFYL D GGAT+FP
Sbjct: 369 QIPQ----GNPINIIKYGFGAHSET--------EGYGDRMATVMFYLNDAPYGGATVFPR 416
Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
LN+ V E+G + WYN + ++ D H+ CPV G+K+G+++
Sbjct: 417 LNVKVPAERGKVLLWYNLNGDS-QDVTTVHAVCPVFHGSKYGEIV 460
>gi|195159303|ref|XP_002020521.1| GL13468 [Drosophila persimilis]
gi|194117290|gb|EDW39333.1| GL13468 [Drosophila persimilis]
Length = 415
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 72/223 (32%), Positives = 107/223 (47%), Gaps = 28/223 (12%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
L C+G P L C Y + FL++ P K E L L P +V HD I E +
Sbjct: 197 LCCRGG--CPYRDMHRLTCSYNTTAAPFLRLAPFKTELLSLSPYMVLYHDVITPLESLTL 254
Query: 65 IELSKGKVERGKVV---NYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
LSK ++R +V N ++D+ R S +L ++ + +++ R+ MTN
Sbjct: 255 KNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSNSVWLTSH---ENAVMERLERRVGVMTNF 311
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLWRLASFMFYLTDVELGGATIFP 178
+ E Y Q+ NYG+GGHY H D TP+ L+DV GGAT+FP
Sbjct: 312 EMENSEVY----QLINYGIGGHYKPHTDHFETPQ-------------LSDVPQGGATLFP 354
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
LN++V P +G A+ WYN + + H+ CP+ G+KW
Sbjct: 355 RLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWA 397
>gi|328718387|ref|XP_001952104.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
pisum]
Length = 293
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 77/208 (37%), Positives = 107/208 (51%), Gaps = 16/208 (7%)
Query: 22 KCFYESYNNTFLKI-GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
KC Y++ NN F +I P K E++ +P + HD +YD EI +I L+ + V +
Sbjct: 73 KCRYQT-NNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALENMNDAHVKSV 131
Query: 81 G---DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
D + TR +VY++ E+ F + TRI+ T E+Y QI NY
Sbjct: 132 DGKDDVLEEKTRSGQVYWI-SEVDAVEYFD-ALNTRIESFTGFSTKTAEQY----QIVNY 185
Query: 138 GLGGHYDLHCDA----TPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
GLGGHY H D+ T E RL + +FYLTDV+ G T FP LN+ +KG+A+
Sbjct: 186 GLGGHYLPHHDSFAKGTENVEFGNRLVTVLFYLTDVQNDGYTSFPLLNINAPVDKGAALV 245
Query: 194 WYNAH-ANTLLDYRMYHSGCPVALGNKW 220
W N H +N L Y H CP+ GNKW
Sbjct: 246 WNNLHMSNGQLFYESLHGSCPLLKGNKW 273
>gi|313217217|emb|CBY38368.1| unnamed protein product [Oikopleura dioica]
gi|313239835|emb|CBY17758.1| unnamed protein product [Oikopleura dioica]
Length = 521
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 68/216 (31%), Positives = 107/216 (49%), Gaps = 17/216 (7%)
Query: 18 KSNLKCFYESYNNTF--LKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERG 75
KSNLKCFY + + L+ P+K EEL+ DP VV+ ++ I D E I L+ + R
Sbjct: 282 KSNLKCFYWTGPSPLSPLQWAPVKTEELHGDPLVVQFYEVISDEEERAIQFLAGEHLNRA 341
Query: 76 KVVN--YGDTIYVDTRLSKVYFLYP-EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
+ + G + D R+ K +L E + + K ++ +T G + Y +
Sbjct: 342 TIQDPATGKLVNADYRIQKTAWLTEFEKLDVNGTIAKYNEKLTKIT----GLDADYAELV 397
Query: 133 QINNYGLGGHYDLHCD--ATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTV 184
Q+ NYG+ G Y+ H D + P E W R+A+++ Y+++ +GG T+F +
Sbjct: 398 QVGNYGVAGQYEPHWDHQSYPGAENRWDPIEGSRIATWLAYMSEPNMGGGTVFIQAGIQA 457
Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
P + SAVFWYN + D H+ CPV G KW
Sbjct: 458 RPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGTKW 493
>gi|393903732|gb|EFO16802.2| hypothetical protein LOAG_11701 [Loa loa]
Length = 531
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 73/230 (31%), Positives = 111/230 (48%), Gaps = 43/230 (18%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y C+ + V +S L C+Y+ + +L++ P+KVE +Y +P V HD + D E
Sbjct: 283 DTYQALCRQEMPVNIKAQSRLYCYYK-MDRPYLRLAPIKVEIVYQNPLAVLFHDIMSDEE 341
Query: 61 INRIIE-LSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
+RIIE L+ K++R V V G+ R+SK +L +H + +I R+
Sbjct: 342 -SRIIEMLAVPKLDRATVHNVETGNLETASYRISKSAWLRS---TEHEVVNRINRRLDLA 397
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVE 170
TNL I E LQ+ NYG+GGHY+ H D + RDE + R+A+ + Y
Sbjct: 398 TNLEIATAEE----LQVQNYGIGGHYEPHLDCS-RDEDAFERTGTGNRIATILIY----- 447
Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+A+FWYN + +D R YH+ CPV G KW
Sbjct: 448 ------------------NAALFWYNLMRSGAVDMRSYHAACPVLTGTKW 479
>gi|195069797|ref|XP_001997029.1| GH12978 [Drosophila grimshawi]
gi|193891498|gb|EDV90364.1| GH12978 [Drosophila grimshawi]
Length = 518
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 73/227 (32%), Positives = 116/227 (51%), Gaps = 23/227 (10%)
Query: 3 YPLACQGNLSVPEDIKSNL---KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
Y CQG +PE IK+N +C+ +S + + K+ PLKVE++ L P + +D + D+
Sbjct: 281 YVRLCQGK-RLPE-IKTNQSSPRCYLDSNQHAYFKLSPLKVEQVNLAPDINIYYDVLNDN 338
Query: 60 EINRIIELS-KGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+I I+ELS + + R V Y T D R+S+ +L + I + +
Sbjct: 339 QIKSILELSTEFESFRSSVNKYNVT---DKRVSQQVWL--------NYSSPIMRTYRQLV 387
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELG 172
+ G +Q+ NYG+GG Y+ H D + + R+++ M YL+DV+ G
Sbjct: 388 GAISGFNMTNAEIMQVANYGIGGQYEPHHDFSGANLAARYANFGDRISTNMIYLSDVQQG 447
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
G T+FP+ N+ V P KG+ V W+N + D R H+GCPV G K
Sbjct: 448 GYTVFPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIEGTK 494
>gi|194751827|ref|XP_001958225.1| GF23630 [Drosophila ananassae]
gi|190625507|gb|EDV41031.1| GF23630 [Drosophila ananassae]
Length = 431
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/220 (32%), Positives = 109/220 (49%), Gaps = 36/220 (16%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y L C+G +K+ L C Y + FLKI PLK E L LDP + H+ +Y+ E++
Sbjct: 244 YELGCRGLFP----LKNKLFCQYNFHTTPFLKIAPLKQEILSLDPFISMFHEVLYEYELH 299
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
+ E K ++ K Y I R S+ R+ D+T L
Sbjct: 300 GLKEDLKNPIKSKK---YKKNI--TNRFSQ--------------------RLTDITGLHF 334
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL 182
+ ++ + I+NYGL ++H + +D G + + +F+++D GGAT+FP L +
Sbjct: 335 SKRDQ----INIDNYGLENQAEVHYNY--KDIG-GPVGAILFFISDDVQGGATVFPKLKV 387
Query: 183 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
+VFP+KGS + WYN + LD R HS CPV GN GK
Sbjct: 388 SVFPKKGSCLVWYNIKDDGRLDPRTTHSICPVLEGNSLGK 427
>gi|195069795|ref|XP_001997028.1| GH12977 [Drosophila grimshawi]
gi|193891497|gb|EDV90363.1| GH12977 [Drosophila grimshawi]
Length = 517
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/228 (32%), Positives = 114/228 (50%), Gaps = 25/228 (10%)
Query: 3 YPLACQGNLSVPEDIKSNL---KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
Y CQG +PE IK+N +C+ +S + + K+ PLKVE++ L P + +D + D+
Sbjct: 280 YVRLCQGK-RLPE-IKTNQSSPRCYLDSNQHAYFKLSPLKVEQVNLAPDINIYYDVLNDN 337
Query: 60 EINRIIELSKG-KVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
+I I+ELS R V Y T D R+S+ +L + I + +
Sbjct: 338 QIKSILELSTEFDSFRSSVNKYNVT---DKRVSQQVWL--------NYSSPIMRTYRQLV 386
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPRDEGLWRLASFMFYLTDVEL 171
+ G +Q+ NYG+GG Y+ H D A G R+++ M YL+DV+
Sbjct: 387 GAISGFNMTNAETMQVANYGIGGQYEPHHDFFGINLPANSVKRGD-RISTNMIYLSDVQQ 445
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
GG T+FP+ N+ V P KG+ V W+N + D R H+GCPV G K
Sbjct: 446 GGYTVFPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIEGTK 493
>gi|195452770|ref|XP_002073492.1| GK14148 [Drosophila willistoni]
gi|194169577|gb|EDW84478.1| GK14148 [Drosophila willistoni]
Length = 444
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/226 (32%), Positives = 109/226 (48%), Gaps = 19/226 (8%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
CQ + P+ K +L C Y + FL++ P ++EEL L+P +V H+ + D EI ++
Sbjct: 226 CQSS-HKPKPTK-HLYCRYNTTTTPFLRLAPFRMEELSLNPYMVAYHNVLSDEEIRQLNR 283
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYF---LYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
+S +++ V+ D Y + +F P + + +I + D+T L
Sbjct: 284 MSAPLLKKAFPVSAVDIDYDVRTVDTAWFPNSETPHTKENDRLIKRIVNIVSDLTGLNAD 343
Query: 124 REERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVELGGATI 176
+ + Q YG GGHY H D +E + RLA+ +FYL V+ GGAT+
Sbjct: 344 VADSF----QAVRYGFGGHYSPHHDYF--NESIHQTAVNGDRLATVLFYLNTVKHGGATV 397
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 221
FP LNL V EKG +FWYN +L D H CPV G K G
Sbjct: 398 FPLLNLKVPAEKGKVLFWYNLDGESLDFDENTEHGVCPVVDGIKLG 443
>gi|15808763|gb|AAL08488.1| prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
volvulus]
Length = 571
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 71/229 (31%), Positives = 116/229 (50%), Gaps = 18/229 (7%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
IY C+ + V ++S L C+Y++ + +L++ P KVE + +P V + I D +
Sbjct: 294 IYEALCRREVPVNTKVQSQLYCYYKT-DRPYLRLAPFKVEIVRQNPLNVLFYGIISDEQA 352
Query: 62 NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
I L+ K+ ++ N G R+ K L ++ + +I R++ TN
Sbjct: 353 RIIQMLAVPKLNGSRIYNDLTGSFELPSFRILKSARLRS---TEYETVKRIDKRLELATN 409
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVELG 172
L I E L + NYG+GG ++ H D + + + R+A+F+ YLT+ E+G
Sbjct: 410 LEIETAE----DLAVLNYGIGGQFEPHFDCALKGDQCFEKLGTGNRIATFLIYLTEPEIG 465
Query: 173 GATIFPS-LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G T+F S L ++V K +A+FWYN N +D R H+ CPVA G KW
Sbjct: 466 GRTVFTSNLKISVPCVKNAALFWYNLMRNGEVDTRSLHAACPVATGIKW 514
>gi|15808767|gb|AAL08490.1|AF369789_1 prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
volvulus]
Length = 571
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/230 (32%), Positives = 119/230 (51%), Gaps = 20/230 (8%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
IY C+ + V ++S L C+Y++ + +L++ P KVE + +P V + I D +
Sbjct: 294 IYEALCRREVPVNTKVQSQLYCYYKT-DRPYLRLAPFKVEIVRQNPLNVLFYGIISDEQA 352
Query: 62 NRIIE-LSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
RIIE L+ K+ ++ N G R+ K L ++ + +I R++ T
Sbjct: 353 -RIIEMLAVPKLNGSRIYNDLTGSFELPSFRILKSARLRS---TEYETVKRIDKRLELAT 408
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVEL 171
NL I E L + NYG+GG ++ H D + + + R+A+F+ YLT+ E+
Sbjct: 409 NLEIETAE----DLAVLNYGIGGQFEPHFDCALKGDQCFEKLGTGNRIATFLIYLTEPEI 464
Query: 172 GGATIFPS-LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F S L ++V K +A+FWYN N +D R H+ CPVA G KW
Sbjct: 465 GGRTVFTSNLKISVPCVKNAALFWYNLMRNGEVDTRSLHAACPVATGIKW 514
>gi|194871344|ref|XP_001972830.1| GG13666 [Drosophila erecta]
gi|190654613|gb|EDV51856.1| GG13666 [Drosophila erecta]
Length = 539
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 72/228 (31%), Positives = 109/228 (47%), Gaps = 23/228 (10%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN--RI 64
C+G P+ L C Y + FLK+ PLK+E L + P + HD +Y+ E R
Sbjct: 302 CRGEW--PKKSSPELICRYSRDTSAFLKLAPLKLEFLSVQPMIHLYHDVLYEKEFKSMRD 359
Query: 65 IELSKGKVERGKV-VNYGDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMTNLV 121
+ + + G+ ++ I T+ V + F D P+ I RI DM+
Sbjct: 360 VAVFNATMIDGRTYFDFHKKIKPKTQDRVVKMI---DFKDTTAPYTLSINRRIADMS--- 413
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW------RLASFMFYLTDVELG 172
G E R L ++NYGLGG + H D R + R+A+ + Y +DV LG
Sbjct: 414 -GLEMRENMVLYLSNYGLGGDFGKHVDYVELAKRPSDFFADFKGDRIATAVLYASDVPLG 472
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G T+FP L + V P+KG+A+ W+N + D HS CP+ LG++W
Sbjct: 473 GTTVFPKLKIAVQPKKGNALVWFNLNHAGEPDPLTEHSVCPIVLGSRW 520
>gi|194765140|ref|XP_001964685.1| GF23318 [Drosophila ananassae]
gi|190614957|gb|EDV30481.1| GF23318 [Drosophila ananassae]
Length = 412
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/209 (31%), Positives = 94/209 (44%), Gaps = 46/209 (22%)
Query: 18 KSN--LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERG 75
KSN L C+Y S FL+I P K E++ LDP VV HD + EI+++I L+ K+ +
Sbjct: 217 KSNNRLMCYYNSSTTPFLRIAPFKTEQIGLDPYVVVFHDVLSPREISKLISLTDRKLVQA 276
Query: 76 KVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
VN + R +K +++Y G +I RI DM+ + E
Sbjct: 277 VTVN-KKSFKEMVRTAKAHWVY---RGYQELTKRIYRRIHDMSGFELADAEN-------- 324
Query: 136 NYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL----TVFPEKGSA 191
F L+DVE GGAT+FP ++ TV+P G+A
Sbjct: 325 ----------------------------FQLSDVEQGGATVFPGISADSAYTVYPRAGTA 356
Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
WYN H + L D H CPV +G+KW
Sbjct: 357 AMWYNLHTDGLGDPTTLHVACPVIVGSKW 385
>gi|431904119|gb|ELK09541.1| Prolyl 4-hydroxylase subunit alpha-1 [Pteropus alecto]
Length = 507
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 105/198 (53%), Gaps = 19/198 (9%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R V + G R+SK +L ++P + +I RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY++DV
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461
Query: 171 LGGATIFPSLNLTVFPEK 188
GGAT+FP + +V+P+K
Sbjct: 462 AGGATVFPEVGASVWPKK 479
>gi|443719426|gb|ELU09607.1| hypothetical protein CAPTEDRAFT_229373 [Capitella teleta]
Length = 576
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/194 (30%), Positives = 103/194 (53%), Gaps = 18/194 (9%)
Query: 41 EELY-LDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
EE++ +PR+ I++ I + +IN + + + + +V + + + R+SK +L+
Sbjct: 359 EEIFNFNPRIALIYNVIKNRDINMLKDKATAGLSSSRVGDPAKSKLSNERISKTSWLWD- 417
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGRE--ERYKGPLQINNYGLGGHYDLHCD--------- 148
+ ++K+ ++ D+T L + P Q+ NYG+GG Y H D
Sbjct: 418 --TEDERIFKLSKQVADITGLSTQYSTLHSHAEPFQLVNYGIGGQYQPHFDYYENDMLRN 475
Query: 149 --ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR 206
A +D G R+A+FMFYL+ V+ GGAT+FP L++ + KG+A FW+N + +
Sbjct: 476 VPAFIQDTGD-RVATFMFYLSSVKAGGATVFPKLHVRIPAVKGAAAFWFNIRRSGDREPL 534
Query: 207 MYHSGCPVALGNKW 220
H+GCPV LG KW
Sbjct: 535 TQHAGCPVLLGEKW 548
>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
Length = 311
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 61/184 (33%), Positives = 97/184 (52%), Gaps = 14/184 (7%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
+P + I + D E + +I LS+GK++ +VV+ ++ + K + E G++
Sbjct: 120 NPNIAVIRGLLSDEECDEVIRLSRGKMKTSQVVDRESGGSYESSVRKSEGSHFE-RGENE 178
Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDA-TPRDEGL-------- 156
+ +I+ R+ + +L + R E PLQI +YG GG Y H D P+D G
Sbjct: 179 LVRRIEARLSALVDLPVNRGE----PLQILHYGPGGEYKAHQDFFEPKDPGSAVLTRVGG 234
Query: 157 WRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 216
R+ + + YL DV GG T FP + + P KGSAV++ +A+ LDYR H+G PV
Sbjct: 235 QRIGTVVMYLNDVPEGGETAFPDIGFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPVIR 294
Query: 217 GNKW 220
G+KW
Sbjct: 295 GDKW 298
>gi|339236275|ref|XP_003379692.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
gi|316977629|gb|EFV60704.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
Length = 441
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 78/254 (30%), Positives = 109/254 (42%), Gaps = 53/254 (20%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G + E +S L C+Y+ + FL + P+KVE ++ P++V I +EI +
Sbjct: 176 CRGEYLLTEKQRSRLYCYYKR-DTPFLSLAPIKVEVMHWKPKIVIFRQVISANEIAVLKT 234
Query: 67 LSKGKVERGKVVNY-----------------------------GDTIYVDTRLSKVYFLY 97
L+ ++ R V N G + R+SK +L
Sbjct: 235 LAYPRLSRATVQNSETGELETAKYRISKRCRTLRRATVHNKETGQLEHASYRISKSAWLK 294
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE--- 154
+HP + +I RI DMTNL + E LQI NYGLGGHYD H D RDE
Sbjct: 295 EH---EHPVVDRIVKRIHDMTNLNMETAE----DLQIANYGLGGHYDPHFDHARRDEVDP 347
Query: 155 ----GLWRLASFMFYLTDVELGGATIFPSLN----LTVFPEKGSAVFWYNAHANTLLDYR 206
R+A+ +FY +V F SLN + G A FW+N N D
Sbjct: 348 YEHGHGNRIATTLFYKEEV-----NAFKSLNTGNRIATVLFYGDAAFWFNLKPNGEGDMS 402
Query: 207 MYHSGCPVALGNKW 220
H+ CPV G KW
Sbjct: 403 TRHAACPVLAGVKW 416
>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
Length = 492
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 69/206 (33%), Positives = 98/206 (47%), Gaps = 14/206 (6%)
Query: 21 LKCFYESYNNTFLKIGPLKVEELYL-DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
L C + +N L + P++VE ++ + R+ + E + E + K+ R
Sbjct: 277 LSCRLQHFNKPHLFLKPIRVEYVHEGNNRLQIFRNFASAQECAHLREEGRKKLSRAVAWT 336
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHP-FLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
G V+ R+S +L P DH + + TRI D T L + + LQ++NYG
Sbjct: 337 DGAFRPVEFRISTAAWLQP----DHDDVVTNLHTRIADATQLDL----EFAEALQVSNYG 388
Query: 139 LGGHYDLHCDA-TPRDEGLW---RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
+GG Y+ H D R+ L R+A+FM YL VE GG T FP L V P G AVFW
Sbjct: 389 IGGFYETHYDHHASRERELPEGDRIATFMIYLNQVEQGGYTAFPRLGAAVEPGHGDAVFW 448
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
YN + D H CPV G+KW
Sbjct: 449 YNLLPDGESDNNTLHGACPVLQGSKW 474
>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
Length = 187
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/138 (40%), Positives = 82/138 (59%), Gaps = 14/138 (10%)
Query: 89 RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD 148
R+SK +L + HP + + ++D T L + + LQ+ NYG+GGHY+ H D
Sbjct: 25 RVSKNAWL---AYESHPTMVGMLRDLKDATGL----DTTFCEQLQVANYGVGGHYEPHWD 77
Query: 149 ------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 202
P +EG R+A+ +FYL++VE GGAT FP L++ V P+ G+ +FWYN H +
Sbjct: 78 FFRDPNHYPAEEGN-RIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLD 136
Query: 203 LDYRMYHSGCPVALGNKW 220
DYR H+GCPV G+KW
Sbjct: 137 KDYRTKHAGCPVLKGSKW 154
>gi|195069799|ref|XP_001997030.1| GH12979 [Drosophila grimshawi]
gi|193891499|gb|EDV90365.1| GH12979 [Drosophila grimshawi]
Length = 517
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 78/246 (31%), Positives = 123/246 (50%), Gaps = 54/246 (21%)
Query: 3 YPLACQGNLSVPEDIKSNL---KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
Y CQG +PE IK+N +C+ +S + + K+ PLKVE++ LDP + + + D+
Sbjct: 280 YVRLCQGK-RLPE-IKTNQSSPRCYLDSNRHAYFKLSPLKVEQVNLDPDINIYYGVLNDN 337
Query: 60 EINRIIELSKG-----KVERGKVVNYGDTIYVDTRLSK-VYFLYPEIFGDHPFLYKIQTR 113
+I I+ LS R V++ D R+S+ V+ Y P + +
Sbjct: 338 QIKSILRLSDELDSFRSTHRKYVIS-------DMRISQQVWLNYSS-----PIMRTYRQL 385
Query: 114 IQ-----DMTNLVIGREERYKGPLQINNYGLGGHYDLHCD--ATP-------RDEGLWRL 159
+ +MTN+ I +Q+ NYG+GGHY+ H D +P R + R+
Sbjct: 386 VGAISGFNMTNVEI---------MQLANYGIGGHYEPHIDYMGSPLPPYYAKRGD---RI 433
Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV----- 214
++ M YL+DV+ GG T+FP+ N+ V P KGS + WYN + D+R H+GC V
Sbjct: 434 STSMIYLSDVQQGGYTVFPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIEGIK 493
Query: 215 ALGNKW 220
+GN W
Sbjct: 494 RIGNIW 499
>gi|195575103|ref|XP_002105519.1| GD17002 [Drosophila simulans]
gi|194201446|gb|EDX15022.1| GD17002 [Drosophila simulans]
Length = 793
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/173 (33%), Positives = 96/173 (55%), Gaps = 15/173 (8%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
++Y C+G L + NL+C+ + + P K+E+L +DP V +H+ ++DSE
Sbjct: 255 KLYTQVCRGELHQSPRDQRNLRCWLSHQGVPYYHLSPFKIEQLNIDPYVAYVHEVLWDSE 314
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I+ I+E KG +ER KV ++ + R+S+ +L+ + +P+L KI+ R++D+T L
Sbjct: 315 IDTIMEHGKGNMERSKVGQIENSTTTEVRISRNTWLW---YDANPWLSKIKQRLEDVTGL 371
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W---RLASFMFYL 166
E PLQ+ NYG+GG Y+ H D D+G W RL + +FYL
Sbjct: 372 STESAE----PLQLVNYGIGGQYEPHFDFV-EDDGQNVFSWKGNRLLTALFYL 419
>gi|195055777|ref|XP_001994789.1| GH14121 [Drosophila grimshawi]
gi|193892552|gb|EDV91418.1| GH14121 [Drosophila grimshawi]
Length = 517
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 78/247 (31%), Positives = 121/247 (48%), Gaps = 56/247 (22%)
Query: 3 YPLACQGNLSVPEDIKSNL---KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
Y CQG +PE IK+N +C+ +S + + K+ PLKVE++ LDP + + + D+
Sbjct: 280 YVRLCQGK-RLPE-IKTNQSSPRCYLDSNRHAYFKLSPLKVEQVNLDPDINIYYGVLNDN 337
Query: 60 EINRIIELSKG-----KVERGKVVNYGDTIYVDTRLSK-VYFLYPEIFGDHPFLYKIQTR 113
+I I+ LS R V++ D R+S+ V+ Y P + +
Sbjct: 338 QIKSILRLSDELDSFRSTHRKYVIS-------DMRISQQVWLNYSS-----PIMRTYRQL 385
Query: 114 IQ-----DMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----------ATPRDEGLWR 158
+ +MTN+ I +Q+ NYG+GGHY+ H D A D R
Sbjct: 386 VGAISGFNMTNVEI---------MQLANYGIGGHYEPHIDYMGSPLPPYYAKRGD----R 432
Query: 159 LASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV---- 214
+++ M YL+DV+ GG T+FP+ N+ V P KGS + WYN + D+R H+GC V
Sbjct: 433 ISTSMIYLSDVQQGGYTVFPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIEGI 492
Query: 215 -ALGNKW 220
+GN W
Sbjct: 493 KRIGNIW 499
>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
Length = 216
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 98/180 (54%), Gaps = 14/180 (7%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDH 104
+P +V + + + D E +++I+ SK +++R KV N ++ VD R S F + G++
Sbjct: 37 EPLIVILGNVLSDEECDQLIQQSKDRMQRSKVAN---SLEVDELRTSSSTFFHE---GEN 90
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLAS 161
+ +I+ RI + N+ + E LQI NY +G Y H D +T R R+++
Sbjct: 91 EIVARIEKRISQIMNIPVEHGE----GLQILNYKIGQEYKAHFDFFSSTSRAASNPRIST 146
Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+ YL DVE GG T FP LN +V P+KG AV++ + + L+ H G PV +G+KW
Sbjct: 147 LVMYLNDVEQGGETYFPKLNFSVSPQKGMAVYFEYFYNDQNLNDLTLHGGAPVVMGDKWA 206
>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
Length = 216
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 64/180 (35%), Positives = 91/180 (50%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ER KV + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECDELIELSKSKMERSKVGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI +TN+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204
>gi|403274090|ref|XP_003928822.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saimiri
boliviensis boliviensis]
Length = 149
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 52/126 (41%), Positives = 75/126 (59%), Gaps = 15/126 (11%)
Query: 110 IQTRIQDMTNLVIG-REERYKGP------LQINNYGLGGHYDLHCDATPRDE-------G 155
+ +R T +G R + GP LQ+ NYG+GG Y+ H D +DE G
Sbjct: 1 MHSRNNGGTPRAVGLRRAQGSGPECAVLGLQVANYGVGGQYEPHFDFARKDEPDAFKELG 60
Query: 156 LW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
R+A+++FY++DV GGAT+FP + +V+P+KG+AVFWYN A+ DY H+ CPV
Sbjct: 61 TGNRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 120
Query: 215 ALGNKW 220
+GNKW
Sbjct: 121 LVGNKW 126
>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
Length = 319
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 65/186 (34%), Positives = 89/186 (47%), Gaps = 18/186 (9%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGD 103
PR+ + E +I LS+G++ R VVN GD +D R S G+
Sbjct: 126 SPRIALFQRLLMPDECEALIALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQ---VGE 182
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDE 154
HP + +++ RI +T + + E +G LQI NY G Y H D A
Sbjct: 183 HPLIERLEARIAAVTGVPV---EHGEG-LQILNYKPGAEYQPHYDFFNPQRPGEARQLRV 238
Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
G R+A+ + YL DV GGAT FP L L V P +G+AVF+ + LD R H+G PV
Sbjct: 239 GGQRMATLVIYLNDVPAGGATAFPKLGLRVNPVQGNAVFFAYLGEDGSLDERTLHAGLPV 298
Query: 215 ALGNKW 220
G KW
Sbjct: 299 EQGEKW 304
>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 591
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 73/248 (29%), Positives = 113/248 (45%), Gaps = 41/248 (16%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
+Y C+ +++ + L+CF T + K E + +PR+ HD I + I
Sbjct: 326 MYEALCREEQKSLQEL-AKLRCFLR---ETVIPYYKAKEEVVNYEPRIAIFHDVISPTSI 381
Query: 62 NRIIELSKGKVERGKVV--NYGDTIYV------DTRLSKVYFLYPEIFGDHPFLYKIQTR 113
+ ++ R V N G +V + R+S+ +L + ++P L +++ R
Sbjct: 382 EHLKSVASKGFTRSTVFLENTGPDGHVTYGKLDNVRVSQTSWLGTD---EYPELSRLENR 438
Query: 114 IQDMTNLVIGREERYKG------PLQINNYGLGGHYDLHCDATP---------------R 152
I+ L G YK Q+ NYG+GG Y +H D T R
Sbjct: 439 IK----LTTGLSAEYKSVRSHSEKFQVLNYGVGGMYTVHYDYTGYMLGIPSNPLDSDDIR 494
Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 212
G R+A++MFYL DV+ GGAT+FP + + KG A FWYN + D R H GC
Sbjct: 495 TSGE-RMATWMFYLNDVKAGGATVFPEVKTRIPVAKGGAAFWYNVRPSGATDPRTLHGGC 553
Query: 213 PVALGNKW 220
PV +G+KW
Sbjct: 554 PVLVGSKW 561
>gi|390363005|ref|XP_797519.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like
[Strongylocentrotus purpuratus]
Length = 579
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 68/235 (28%), Positives = 116/235 (49%), Gaps = 24/235 (10%)
Query: 3 YPLACQGNLSVPEDIKSNL-KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y C+G+ + + L KC Y+ YN+ FL + P K E ++ DPR+V + + D EI
Sbjct: 329 YEALCRGDPGALKVVDHRLLKCQYQHYNHPFLYLQPAKEEVIFDDPRLVFYRNILNDKEI 388
Query: 62 NRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+ L+ +++R + N G+ + D R+SK ++ E + + I+ R+Q T
Sbjct: 389 AFVKRLASPRLQRATIQNAITGNLEFADYRISKSAWVKQE---EDQLIRSIRFRVQAYTG 445
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
L + E LQ+ NYG+GGHY+ H D +E R+A+ +FY++
Sbjct: 446 LELDTAE----DLQVVNYGIGGHYEPHFDFARAEETNAFQSLGTGNRIATALFYVSITCP 501
Query: 172 GGATIFPSLN------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
++ + + L++ G+AVFWYN + +Y H+ CPV G+KW
Sbjct: 502 DMSSTYEPRDEIRNGFLSLVYPSGTAVFWYNLRKSGQGNYDTRHAACPVLSGSKW 556
>gi|198466405|ref|XP_001353987.2| GA16752 [Drosophila pseudoobscura pseudoobscura]
gi|198150585|gb|EAL29723.2| GA16752 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 100/211 (47%), Gaps = 15/211 (7%)
Query: 21 LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN--RIIELSKGKVERGKVV 78
L C Y + FL + PLK+E L P +V H+ +Y+ E+ R I ++ G
Sbjct: 285 LACRYNREYSAFLLLAPLKMEVLNQQPLIVLYHEVLYEKELRAMRDIANKNATMQDGWTR 344
Query: 79 NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
+ D +V L+ F I RI DMT G E + L ++NYG
Sbjct: 345 MHSDQRVKPEPEDRVLKLHIFQGNSESFSPSINRRIADMT----GLEVQGNNALHLSNYG 400
Query: 139 LGGHYDLHCD---ATPRDEGLWR------LASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
LGG+++ H D T R + LA+ + Y +DV LGGA +FP L ++V P+KG
Sbjct: 401 LGGYFNAHYDYVELTKRPANYFTEWGGDVLATVLLYASDVRLGGAVVFPKLKISVEPKKG 460
Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+A+ W N + D H+ CPV +G+ W
Sbjct: 461 NALIWDNLNNAGNPDKLSKHAVCPVVMGSHW 491
>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
Length = 297
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 90/185 (48%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P+V + D E + ++ LS+G++ R VVN GD +D R S +H
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQ---VAEH 161
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
P + +I+ RI +T + E +G LQI NY GG Y H D A G
Sbjct: 162 PLITRIEARIAAVTGVPA---EHGEG-LQILNYKPGGEYQPHFDYFNPQRPGEARQLSVG 217
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL E GGAT FP + L V P KG+AV++ + LD R H+G PVA
Sbjct: 218 GQRIATLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGALDERTLHAGLPVA 277
Query: 216 LGNKW 220
G KW
Sbjct: 278 FGEKW 282
>gi|194373965|dbj|BAG62295.1| unnamed protein product [Homo sapiens]
Length = 604
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 96/172 (55%), Gaps = 12/172 (6%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ +N +L + P++ E ++L+P + HD + DSE +I EL++ ++R V +
Sbjct: 319 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 378
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ V+ R+SK +L + P L + RI +T L + Y LQ+ NYG+
Sbjct: 379 GEKQLQVEYRISKSAWLKDTV---DPKLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 433
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTV 184
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F NL+V
Sbjct: 434 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSV 485
>gi|355752458|gb|EHH56578.1| hypothetical protein EGM_06023, partial [Macaca fascicularis]
Length = 586
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 61/172 (35%), Positives = 95/172 (55%), Gaps = 12/172 (6%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ +N +L + P++ E ++L+P + HD + DSE +I E ++ ++R V +
Sbjct: 306 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVAS 365
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ V+ R+SK +L + P L + RI +T L + Y LQ+ NYG+
Sbjct: 366 GEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 420
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTV 184
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F NL+V
Sbjct: 421 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSV 472
>gi|355566863|gb|EHH23242.1| hypothetical protein EGK_06672, partial [Macaca mulatta]
Length = 583
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 61/172 (35%), Positives = 95/172 (55%), Gaps = 12/172 (6%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+L C YE+ +N +L + P++ E ++L+P + HD + DSE +I E ++ ++R V +
Sbjct: 303 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVAS 362
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ V+ R+SK +L + P L + RI +T L + Y LQ+ NYG+
Sbjct: 363 GEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 417
Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTV 184
GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F NL+V
Sbjct: 418 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSV 469
>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
Length = 254
Score = 97.1 bits (240), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 92/181 (50%), Gaps = 16/181 (8%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYV-DTRLSKVYFLYPEIFG 102
+ +P +V + + + D E + +IELSK K+ER K+ G + V D R S FL
Sbjct: 74 FEEPLIVVLANVLSDEECDELIELSKNKMERSKI---GSSRNVNDIRTSSGAFL-----E 125
Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRL 159
++ F KI+ RI +TN+ + E L I NY + Y H D R R+
Sbjct: 126 ENEFTSKIEKRISSITNVPVAHGE----GLHILNYAVDQEYKAHYDYFAEHSRSAANNRI 181
Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
++ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G K
Sbjct: 182 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 241
Query: 220 W 220
W
Sbjct: 242 W 242
>gi|405964866|gb|EKC30308.1| KRR1 small subunit processome component-like protein [Crassostrea
gigas]
Length = 885
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 66/234 (28%), Positives = 106/234 (45%), Gaps = 44/234 (18%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV- 77
+ L+CF +T + K E + +PR+ HD I + I + ++ + R V
Sbjct: 634 AKLRCFL---RDTVIPYYKAKEEVVNYEPRIAIFHDVISSTSIEHLKSIASKGLTRSTVF 690
Query: 78 -----------VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
+ YG + R+S+ ++ + ++P L +++ RIQ L+ G
Sbjct: 691 LENTGPNGQVTITYGKQDNI--RVSQTCWIRTD---EYPELLRLENRIQ----LITGLSA 741
Query: 127 RYK------GPLQINNYGLGGHYDLHCDATPRDEGLW--------------RLASFMFYL 166
YK Q+ NYG+GG Y H D T G+ R+A++MFY+
Sbjct: 742 EYKPVRSHSEKFQVVNYGVGGMYTAHHDYTGYKLGIISNPMDSEDISTSGDRMATWMFYM 801
Query: 167 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
D + GGAT+FP + + KG A FW+N + D R H GCPV +G+KW
Sbjct: 802 NDAKAGGATVFPEVRTRIPVAKGGAAFWFNLRPSGATDPRTLHGGCPVLVGSKW 855
>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 318
Score = 96.3 bits (238), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 64/169 (37%), Positives = 90/169 (53%), Gaps = 20/169 (11%)
Query: 38 LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV-NYGDTIYVD-TRLS-KVY 94
+KV + PR+ D + D+E + +I S+ +++R KVV N G +VD TR S Y
Sbjct: 117 IKVVMVCTAPRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSGEFVDDTRTSYGAY 176
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD------ 148
F G++ + IQ RI ++T + E PLQI NYGLGG Y H D
Sbjct: 177 FNK----GENSLVATIQRRIAELTRWPLTHAE----PLQILNYGLGGEYLPHFDYFEPQQ 228
Query: 149 ---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
+P + G R+A+ + YL DVE GG TIFP LNL P KG A+++
Sbjct: 229 PGLPSPLESGGQRIATVVMYLNDVEAGGGTIFPHLNLETRPRKGGAIYF 277
>gi|260806889|ref|XP_002598316.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
gi|229283588|gb|EEN54328.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
Length = 531
Score = 96.3 bits (238), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 70/214 (32%), Positives = 113/214 (52%), Gaps = 22/214 (10%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELY-LDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
+S+ C Y + + +GP+K+E L+ +P + HD + +SE R+ E++ K R
Sbjct: 310 RSSASCRY-FRPSPYFYLGPIKMEVLHETNPVIHLFHDIVSESEAARMREMAIPKFHRSV 368
Query: 77 VV--NYGDTIYVDTRLSKV--YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
VV + GD I ++ R+S+ +F Y D P + K+ R+ T L E
Sbjct: 369 VVGDDGGDAIILN-RVSETAWHFDY-----DDPVVAKLSRRVDYATGLSTA--EGTAEAF 420
Query: 133 QINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFP 186
Q+ NYGLGG Y H D T + R+ +F+ YL+DV+ GGAT+FP +++ V P
Sbjct: 421 QVVNYGLGGQYIPHTDYFEGDHVTRHIQNGNRVVTFLLYLSDVDAGGATVFPIVDVAV-P 479
Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+AVFW + ++ + H+GCPV +G+KW
Sbjct: 480 INSAAVFWSMERSGAVVPNSL-HAGCPVLIGSKW 512
>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
Length = 216
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 90/180 (50%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ER KV + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECDELIELSKSKMERSKVGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI +TN+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
Length = 264
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 96/193 (49%), Gaps = 27/193 (13%)
Query: 43 LYLDPRVVKIHDAIYDSEINRIIELSKGKVERG------KVVNYGDTIYVDTRLSKVYFL 96
L DP V++ ++ I I+ I+ +K K R +V NY R S ++
Sbjct: 65 LSEDPPVIQFNNFISQERIDAILHFAKPKFARSTSGIEREVSNY--------RTSSTAWM 116
Query: 97 YPEIFGDHPF---LYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD 153
P++ G+ P L ++ I + L + +E + Q+ Y +Y +H D
Sbjct: 117 LPDVLGNDPMQAHLKDMEEEIARIVRLPVENQEHF----QVLQYQKNQYYKVHSDYIEEQ 172
Query: 154 E----GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMY 208
G+ R+A+F YL DVE GG T FP+LNLTV P KG+AV WY+A+ NT +D R
Sbjct: 173 RQQPCGI-RVATFFLYLNDVEEGGGTRFPNLNLTVQPAKGNAVLWYSAYPNTTRMDSRTD 231
Query: 209 HSGCPVALGNKWG 221
H PVA G K+G
Sbjct: 232 HEAMPVAKGMKYG 244
>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
Length = 216
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 90/180 (50%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ER KV + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECDELIELSKSKMERSKVGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI +TN+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSITNVPVSHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
Length = 216
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ H + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFHQDQSLNELTLHGGAPVTKGEKW 204
>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
Length = 216
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 90/180 (50%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +IELSK ++R KV + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECAELIELSKSNMKRSKVGSSRDV--NDIRTSSGAFL-----EE 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ +KI+ RI +TN+ + E L I NY + Y H D R R++
Sbjct: 89 NELTWKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204
>gi|195575137|ref|XP_002105536.1| GD21536 [Drosophila simulans]
gi|194201463|gb|EDX15039.1| GD21536 [Drosophila simulans]
Length = 465
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 69/221 (31%), Positives = 111/221 (50%), Gaps = 26/221 (11%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y C+ + S P +K L C Y S + FL + PLK+EE+ L+P +V HD + D +
Sbjct: 261 EDYKRLCRSSFS-PTPLK--LHCRYNSTTSPFLILAPLKMEEISLEPYIVMYHDILPDKD 317
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I ++I L++ ++ ++ + + + S P + G L ++ R+ D+T L
Sbjct: 318 IQQLITLAEPLLKPTEMFDENKN---EAKSSD----RPALGG--LLLDRLNERMGDITGL 368
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-RLASFMFYLTDVELGGATIFPS 179
I + P+ I Y G H + EG R+ + MFYL D GGAT+FP
Sbjct: 369 QIPQ----GNPINIIKYAFGAHSET--------EGYGDRMDTVMFYLNDAPYGGATVFPH 416
Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
LN+ V E+G + WYN + +T D H+ CPV G+++
Sbjct: 417 LNVKVPAERGKVLLWYNLNGDT-QDVTTVHAACPVFHGSEY 456
>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
Length = 216
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + I D E + +IE+SK K+ER K+ + D D R S FL D
Sbjct: 36 FEEPLIVVLANVISDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
Length = 248
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K+ER K+ + D D R S FL D
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 120
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ F KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 121 NEFTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
Length = 216
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 90/180 (50%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K++R KV + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECDELIELSKSKMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI +TN+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
Length = 288
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 89/185 (48%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
PR+V + D+E + +I + + +++R VVN G+ + R S+ G+H
Sbjct: 96 PRIVLFQHFLSDAECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQ---VGEH 152
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
P + KI+ RI + + E + Q+ NY GG Y H D A + G
Sbjct: 153 PLIAKIEVRIAQAVGVPVEHGEGF----QVLNYQPGGEYQPHFDFFNPGRSGEARQLEVG 208
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL V+ GGAT FP L L V P KG+AVF+ + LD H+G PV
Sbjct: 209 GQRVATMVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVE 268
Query: 216 LGNKW 220
G KW
Sbjct: 269 RGEKW 273
>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
Length = 216
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 90/180 (50%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K++R KV + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECDELIELSKNKMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI +TN+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CMR15]
Length = 289
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 90/185 (48%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
PR+V + D E +++I L + +++R VVN G+ + R S+ G+H
Sbjct: 97 PRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGEENLISARTSQGAMFQ---VGEH 153
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
P + +I+ RI T + + E + Q+ +Y GG Y H D A + G
Sbjct: 154 PLIARIEARIAQATGVPVEHGEGF----QVLHYQPGGEYQPHFDYFNPGRSGEARQLEVG 209
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL V GGAT FP L L V P KG+AVF+ + LD + H+G PV
Sbjct: 210 GQRVATLVIYLNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDKTLHAGLPVE 269
Query: 216 LGNKW 220
G KW
Sbjct: 270 RGEKW 274
>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
Length = 288
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 88/185 (47%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
PR+V + D E + +I + + +++R VVN G+ + R S+ G+H
Sbjct: 96 PRIVLFQHFLSDQECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQ---VGEH 152
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
P + KI+ RI + + E + Q+ NY GG Y H D A + G
Sbjct: 153 PLIAKIEARIAQAVGVPVEHGEGF----QVLNYQPGGEYQPHFDFFNPGRSGEARQLEVG 208
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL V+ GGAT FP L L V P KG+AVF+ + LD H+G PV
Sbjct: 209 GQRVATMVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVE 268
Query: 216 LGNKW 220
G KW
Sbjct: 269 RGEKW 273
>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
Length = 150
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 50/121 (41%), Positives = 66/121 (54%), Gaps = 12/121 (9%)
Query: 109 KIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP---------RDEGLWRL 159
K+ R+ T L E+Y Q++ YG+GGHY+ H D + ++ R+
Sbjct: 13 KLSRRVSSATKL---DAEKYAELFQVSTYGIGGHYEPHFDFSKVKYFTNPVLNEQMGDRI 69
Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
A+FM YL DVE GG T+FP LNL + P K SAVFW+N + D R H CPV LG K
Sbjct: 70 ATFMIYLNDVEAGGRTVFPRLNLVIEPIKNSAVFWHNLLDDGQQDDRTIHGACPVVLGRK 129
Query: 220 W 220
W
Sbjct: 130 W 130
>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
Length = 248
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K+ER K+ + D D R S FL D
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 120
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 121 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|195494561|ref|XP_002094890.1| GE19962 [Drosophila yakuba]
gi|194180991|gb|EDW94602.1| GE19962 [Drosophila yakuba]
Length = 539
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 69/229 (30%), Positives = 104/229 (45%), Gaps = 25/229 (10%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G P L C Y + FLK+ PLK+E L + P ++ HD +Y++E + +
Sbjct: 302 CRGEW--PPKSSPELICRYNRDTSAFLKLAPLKLEILSVQPVILLYHDVLYENEFKSMRD 359
Query: 67 LSKGKVERGKVVNY------GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ Y G+ + D + + F PF I R+ M+
Sbjct: 360 AAIFNASMIDGWTYYDFDQKGNPKWQDRVVKTIGFQ----GTTAPFTLSINRRLGYMS-- 413
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATP---------RDEGLWRLASFMFYLTDVEL 171
G E R L + NYGLGG++ H D D G +A+ + Y +DV L
Sbjct: 414 --GLEMRENMMLYLTNYGLGGNFRKHFDYVELAKRPPNFFADSGGDHIATAVLYASDVPL 471
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F L L V P+KG+A+ W+N + + D HS CPV LG++W
Sbjct: 472 GGTTVFSKLKLAVQPKKGNALVWFNLNHDGKPDPLTEHSVCPVVLGSRW 520
>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum IPO1609]
Length = 280
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 102/217 (47%), Gaps = 20/217 (9%)
Query: 17 IKSNLKCFYESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVER 74
+ S + E+ N+ ++ ++ L+ PR+V + D E + +I L + +++R
Sbjct: 56 VPSPAQAEPEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKR 115
Query: 75 GKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
VVN G+ + R S+ G+HP + +I+ RI T + + E +
Sbjct: 116 SPVVNPETGEENLISARTSEGAMFQ---VGEHPLVARIEARIAQATGVPVEHGEGF---- 168
Query: 133 QINNYGLGGHYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLT 183
Q+ +Y GG Y H D A + G R+A+ + YL V+ GGAT FP L L
Sbjct: 169 QVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLE 228
Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
V P KG+AVF+ + LD H+G PV G KW
Sbjct: 229 VAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 265
>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
bacterium R229]
Length = 289
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 99/208 (47%), Gaps = 20/208 (9%)
Query: 26 ESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
E+ N+ ++ ++ L+ PR+V + D E + +I L + +++R VVN G
Sbjct: 74 EAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETG 133
Query: 82 DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
+ + R S+ G+HP + +I+ RI T + + E + Q+ +Y GG
Sbjct: 134 EENLISARTSQGAMFQ---VGEHPLIARIEARIAQATGVPVEHGEGF----QVLHYQPGG 186
Query: 142 HYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
Y H D A + G R+A+ + YL V+ GGAT FP L L V P KG+AV
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAV 246
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
F+ + LD H+G PV G KW
Sbjct: 247 FFVYKRPDGTLDDNTLHAGLPVERGEKW 274
>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
PSI07]
Length = 289
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 99/208 (47%), Gaps = 20/208 (9%)
Query: 26 ESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
E+ N+ ++ ++ L+ PR+V + D E + +I L + +++R VVN G
Sbjct: 74 EAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETG 133
Query: 82 DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
+ + R S+ G+HP + +I+ RI T + + E + Q+ +Y GG
Sbjct: 134 EENLISARTSQGAMFQ---VGEHPLIARIEARIAQATGVPVEHGEGF----QVLHYQPGG 186
Query: 142 HYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
Y H D A + G R+A+ + YL V+ GGAT FP L L V P KG+AV
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAV 246
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
F+ + LD H+G PV G KW
Sbjct: 247 FFVYKRPDGTLDDNTLHAGLPVERGEKW 274
>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
19424]
gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
taiwanensis LMG 19424]
Length = 296
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/185 (34%), Positives = 87/185 (47%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P+V + D E + ++ LS+G++ R VVN GD +D R S +H
Sbjct: 104 PQVQLFQQLLSDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQ---VAEH 160
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
+ +I+ RI +T + E LQI NY GG Y H D A G
Sbjct: 161 ALIARIEARIAAVTGVPADHGEG----LQILNYKPGGEYQPHFDYFNPQRPGEARQLSVG 216
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL E GGAT FP + L V P KG+AV++ + LD R H+G PVA
Sbjct: 217 GQRIATLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDDRTLHAGLPVA 276
Query: 216 LGNKW 220
G KW
Sbjct: 277 AGEKW 281
>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
Length = 297
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 64/185 (34%), Positives = 89/185 (48%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P+V + D E + ++ LS+G++ R VVN GD +D R S +H
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQ---VAEH 161
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
+ +I+ RI +T + E +G LQI NY GG Y H D A G
Sbjct: 162 ALIARIEARIAAVTGVPA---EHGEG-LQILNYKPGGEYQPHFDYFNPQRPGEARQLSVG 217
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL E GGAT FP + L V P KG+AV++ + LD R H+G PVA
Sbjct: 218 GQRIATLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDERTLHAGLPVA 277
Query: 216 LGNKW 220
G KW
Sbjct: 278 SGEKW 282
>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 289
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 99/208 (47%), Gaps = 20/208 (9%)
Query: 26 ESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
E+ N+ ++ ++ L+ PR+V + D E + +I L + +++R VVN G
Sbjct: 74 EAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETG 133
Query: 82 DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
+ + R S+ G+HP + +I+ RI T + + E + Q+ +Y GG
Sbjct: 134 EENLISARTSQGAMFQ---VGEHPLIARIEARIAQATGVPVEHGEGF----QVLHYQPGG 186
Query: 142 HYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
Y H D A + G R+A+ + YL V+ GGAT FP L L V P KG+AV
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAV 246
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
F+ + LD H+G PV G KW
Sbjct: 247 FFVYKRPDGTLDDNTLHAGLPVERGEKW 274
>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
Length = 289
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 102/217 (47%), Gaps = 20/217 (9%)
Query: 17 IKSNLKCFYESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVER 74
+ S + E+ N+ ++ ++ L+ PR+V + D E + +I L + +++R
Sbjct: 65 VPSPAQAEPEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKR 124
Query: 75 GKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
VVN G+ + R S+ G+HP + +I+ RI T + + E +
Sbjct: 125 SPVVNPETGEENLISARTSEGAMFQ---VGEHPLVARIEARIAQATGVPVEHGEGF---- 177
Query: 133 QINNYGLGGHYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLT 183
Q+ +Y GG Y H D A + G R+A+ + YL V+ GGAT FP L L
Sbjct: 178 QVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLE 237
Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
V P KG+AVF+ + LD H+G PV G KW
Sbjct: 238 VAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274
>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
Length = 283
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 90/185 (48%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
PR+V + D E + +I L + +++R VVN G+ + R S+ G+H
Sbjct: 91 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQ---VGEH 147
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
P + +I+ RI T + + E + Q+ +Y GG Y H D A + G
Sbjct: 148 PLVARIEARIAQATGVPVEHGEGF----QVLHYHPGGEYQPHFDYFNPGRGGEARQLEVG 203
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL V+ GGAT FP L L V P KG+AVF+ + +LD H+G PV
Sbjct: 204 GQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGMLDDNTLHAGLPVE 263
Query: 216 LGNKW 220
G KW
Sbjct: 264 RGEKW 268
>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
Length = 216
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +IELSK K++R KV + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECEELIELSKNKMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI +TN+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
Length = 232
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K+ER K+ + D D R S FL D
Sbjct: 52 FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 104
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 105 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 161 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|198466393|ref|XP_001353986.2| GA18007 [Drosophila pseudoobscura pseudoobscura]
gi|198150579|gb|EAL29722.2| GA18007 [Drosophila pseudoobscura pseudoobscura]
Length = 455
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 109/215 (50%), Gaps = 32/215 (14%)
Query: 11 LSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKG 70
+S P + +++ C Y + FL++ P++ E L + V HD EI + L++
Sbjct: 256 MSYPRKV-NDVHCRYLR-STPFLQLAPIRQENLDNEAHVYLYHDLFNHEEIEALKSLARP 313
Query: 71 KVERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYK 129
K++R K+ N+ I +LS + + RIQD++ + + +E
Sbjct: 314 KLKRQKISSNFTCKI---AQLSN---------SAQDIIRTVNRRIQDVSGMDMNEKEM-- 359
Query: 130 GPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
LQ+ NYG+ G YDL D+ A+ + ++++V+ GG T+FP L+L V P+KG
Sbjct: 360 --LQVVNYGIAGRYDL-------DDSAGSAATALIFMSNVQQGGETVFPFLSLRVKPQKG 410
Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
S + W N D+ + H+ CP+ +GN WG+L+
Sbjct: 411 SLLLWRNT------DWSVLHNSCPLIIGNMWGELI 439
>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
Length = 216
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +IELSK ++R KV + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECAELIELSKNNMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI +TN+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPQLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204
>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
Length = 248
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K+ER K+ + D D R S FL D
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEISKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 120
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 121 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
Length = 288
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 64/208 (30%), Positives = 99/208 (47%), Gaps = 20/208 (9%)
Query: 26 ESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
E+ N+ ++ ++ L+ PR+V + D E + +I L + +++R VVN G
Sbjct: 73 EAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETG 132
Query: 82 DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
+ + R S+ G+HP + +I+ RI T + + E + Q+ +Y GG
Sbjct: 133 EENLISARTSEGAMFQ---VGEHPLVARIEARIAQATGVPVEHGEGF----QVLHYHPGG 185
Query: 142 HYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
Y H D A D G R+A+ + YL V+ GGAT FP L L V P KG+AV
Sbjct: 186 EYQPHFDYFNPGRSGEARQLDVGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAV 245
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
F+ + LD H+G PV G KW
Sbjct: 246 FFVYKRPDGTLDDNTLHAGLPVERGEKW 273
>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
Length = 216
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +IELSK ++R KV + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECAELIELSKSNMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI +TN+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204
>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
Length = 216
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K++R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKSKMKRSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
Length = 216
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K+ER K+ + D D R S FL D
Sbjct: 36 FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
Length = 216
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +IELSK ++R KV + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECAELIELSKSNMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI +TN+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204
>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
Length = 232
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K+ER K+ + D D R S FL D
Sbjct: 52 FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 104
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 105 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 161 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
Length = 216
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K++R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKSKMKRSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus AH187]
gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
Q1]
gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus NC7401]
gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH187]
gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
Q1]
gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NC7401]
gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
Length = 216
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +++IELSK K+ R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
Length = 454
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 90/185 (48%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
PRV + D+E + ++ L++G++ R V+N GD ++ R S G+H
Sbjct: 132 PRVTLFQQLLTDAECDALVALARGRLARSPVINPDTGDENLIEARTSLGAMFQ---VGEH 188
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
P + +I+ I +T + ER +G LQI NY GG Y H D A G
Sbjct: 189 PLIERIEDCIAAVTGIAA---ERGEG-LQILNYKPGGEYQPHYDFFNPQRPGEARQLKVG 244
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+ + + YL GGAT FP L L V P KG+AV++ ++ LD R H+G PV
Sbjct: 245 GQRVGTLVIYLNSPLAGGATAFPKLGLEVAPVKGNAVYFSYRKSDGALDERTLHAGLPVE 304
Query: 216 LGNKW 220
G KW
Sbjct: 305 AGEKW 309
>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
Length = 232
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +++IELSK K+ R KV + D D R S FL D
Sbjct: 52 FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 104
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 105 NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
Length = 216
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +IELSK ++R KV + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECAELIELSKNNMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI +TN+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204
>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Ralstonia solanacearum GMI1000]
Length = 289
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 89/185 (48%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
PR+V + D E +++I L + +++R VVN G+ + R S+ G+H
Sbjct: 97 PRIVLFQHFLSDEECDQLIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQ---VGEH 153
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
P + +I+ RI T + + E + Q+ +Y GG Y H D A + G
Sbjct: 154 PLVARIEARIAQATGVPVEHGEGF----QVLHYQPGGEYQPHFDYFNPGRSGEARQLEVG 209
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL V GGAT FP L L V P KG+AVF+ + LD H+G PV
Sbjct: 210 GQRVATLVIYLNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVE 269
Query: 216 LGNKW 220
G KW
Sbjct: 270 RGEKW 274
>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
Length = 216
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +++IELSK K+ R KV + D D R SK FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSRDV--NDIRTSKGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
Length = 216
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R SK FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSRDV--NDIRTSKGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTVKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
Length = 232
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R SK FL D
Sbjct: 52 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSKGAFL-----DD 104
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 105 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
Length = 232
Score = 93.2 bits (230), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R SK FL D
Sbjct: 52 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSKGAFL-----DD 104
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 105 NELTEKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
Length = 232
Score = 93.2 bits (230), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R SK FL D
Sbjct: 52 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSKGAFL-----DD 104
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 105 NELTEKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|347966278|ref|XP_003435891.1| AGAP013377-PA [Anopheles gambiae str. PEST]
gi|333470133|gb|EGK97522.1| AGAP013377-PA [Anopheles gambiae str. PEST]
Length = 290
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 67/221 (30%), Positives = 103/221 (46%), Gaps = 16/221 (7%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G P + S+L C+Y+ N I P KVE L DP V H+ ++D EI ++
Sbjct: 52 CRGVYVPPPSLTSSLYCWYD-VRNAHSVISPSKVEALSNDPFVALFHEFVHDGEIAQLQA 110
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
L +++ N + V + Y L+ DHP + ++ RI+ T L E
Sbjct: 111 LGSMHIKQSGPSN-DSWLPVFYENHQTYTLHDR---DHPVVERLTKRIERRTGLSCDTAE 166
Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLWR-------LASFMFYLTDVELGGATIFPS 179
L++ +G DA + E R LA+ +F+L+DV GG TIFP
Sbjct: 167 ----DLKVIYNEVGAFKTAALDAIHKKEDAQRFAYAGDRLATMLFFLSDVTNGGYTIFPK 222
Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L + + P+KG+A FWYN + +M +S CP+ KW
Sbjct: 223 LRVAIRPQKGTAAFWYNLKDTGEGNVQMKYSICPLQDDQKW 263
>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
Length = 216
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R SK FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSKGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTEKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
Length = 216
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +++IELSK K+ R KV + D D R SK FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSRDV--NDIRTSKGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
Length = 216
Score = 93.2 bits (230), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +++IELSK K+ R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 DELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
Length = 216
Score = 92.8 bits (229), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ER K+ + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECDGLIELSKNKIERSKIGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
Length = 216
Score = 92.8 bits (229), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 90/180 (50%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +++IELSK ++R KV + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECDKLIELSKNNMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI +TN+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
Length = 216
Score = 92.8 bits (229), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ER K+ + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECDGLIELSKNKIERSKIGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
ATCC 10987]
Length = 216
Score = 92.8 bits (229), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
Length = 229
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R SK FL D
Sbjct: 49 FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSRDV--NDIRTSKGAFL-----DD 101
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 102 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 157
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 158 TLVMYLNDVEEGGETFFPKLNLSVNPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 217
>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
Length = 292
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 99/208 (47%), Gaps = 20/208 (9%)
Query: 26 ESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
E+ N+ ++ ++ L+ PR+V + D E + +I L + +++R VVN G
Sbjct: 77 EAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETG 136
Query: 82 DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
+ + R S+ G+HP + +I+ RI T + + E + Q+ +Y GG
Sbjct: 137 EENLISARTSEGAMFQ---VGEHPLVARIEARIAQATGVPVEHGEGF----QVLHYHPGG 189
Query: 142 HYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
Y H D A + G R+A+ + YL V+ GGAT FP L L V P KG+AV
Sbjct: 190 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAV 249
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
F+ + LD H+G PV G KW
Sbjct: 250 FFVYKRPDGTLDDNTLHAGLPVERGEKW 277
>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
Length = 216
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +IELSK K+ R KV + D D R SK FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECGELIELSKSKLARSKVGSSRDV--NDIRTSKGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTTKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|195352178|ref|XP_002042591.1| GM14978 [Drosophila sechellia]
gi|194124475|gb|EDW46518.1| GM14978 [Drosophila sechellia]
Length = 467
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 71/221 (32%), Positives = 102/221 (46%), Gaps = 54/221 (24%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
+ L CQG P+ KSNL C Y S N FL++ PLK+EE+ DP +V H+ + D EI
Sbjct: 284 HNLGCQG--LFPK--KSNLVCRYNSSTNAFLQLAPLKMEEVSRDPYIVMFHEVVSDKEIE 339
Query: 63 RIIELSKGKV---ERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+ KG++ E GK + F +I RI DMT
Sbjct: 340 EM----KGEITEMENGK--------------------------ESSFSKRINQRISDMTG 369
Query: 120 LVIGREERYKGPLQINNYGLGG----HYDLHCDATPR---DEGLW-RLASFMFYLTDVEL 171
+ E + +Q N+G+GG HYD + D + L R+ S +FY +V
Sbjct: 370 FKL---EEFPA-IQSANFGVGGYFKPHYDYYTDRLKEVDVNNTLGDRIGSIIFYAGEVSQ 425
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 212
GG T+FP + V P+KG+A+ W+NA +R H C
Sbjct: 426 GGQTVFPDSKVMVEPKKGNALLWFNAFI-----HRQIHEPC 461
>gi|195166671|ref|XP_002024158.1| GL22696 [Drosophila persimilis]
gi|194107513|gb|EDW29556.1| GL22696 [Drosophila persimilis]
Length = 491
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/215 (28%), Positives = 109/215 (50%), Gaps = 32/215 (14%)
Query: 11 LSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKG 70
+S P + +++ C Y + FL++ P++ E L + V HD EI + L++
Sbjct: 292 MSYPRKV-NDVHCRYLR-STPFLQLAPIRQENLDNEAHVYLYHDLFNHEEIEALKSLARP 349
Query: 71 KVERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYK 129
+++R K+ N+ I +LS + + RIQD++ + + +E
Sbjct: 350 RLKRQKISSNFTCKI---AQLSN---------SAQDIIRTVNRRIQDVSGMDMNEKE--- 394
Query: 130 GPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
LQ+ NYG+ G YDL D+ A+ + ++++V+ GG T+FP L+L V P+KG
Sbjct: 395 -VLQVVNYGIAGRYDL-------DDSAGSAATALIFMSNVQQGGETVFPFLSLRVKPQKG 446
Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
S + W N D+ + H+ CP+ +GN WG+L+
Sbjct: 447 SLLLWRNT------DWSVLHNSCPLIIGNMWGELI 475
>gi|198449528|ref|XP_002136919.1| GA26870 [Drosophila pseudoobscura pseudoobscura]
gi|198130648|gb|EDY67477.1| GA26870 [Drosophila pseudoobscura pseudoobscura]
Length = 491
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 73/211 (34%), Positives = 103/211 (48%), Gaps = 31/211 (14%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
KS L C + S+ +F LKVEE+ LDP +V HD + E+ EL K
Sbjct: 286 KSTLHCRF-SWRPSF--YARLKVEEVLLDPYIVLYHDVVSGKEM----ELLK-------- 330
Query: 78 VNYGDTIY----VDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQ 133
+YG T + + LS + PE P + + R+ DMT L + E +
Sbjct: 331 -DYGRTNLTHDPLRSGLSAKHCALPESL---PLVQSLHQRLWDMTGLSLNGSESWL---- 382
Query: 134 INNYGLGGHYDLHCDATPRDE----GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
I NYG+GG LH D E G RL + +L++V GG T+FP+L + V P+ G
Sbjct: 383 ITNYGIGGFLGLHKDYFDEIEEELQGDNRLFTIQIFLSNVSQGGYTVFPNLEVAVKPQAG 442
Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+A+ +YN + + D R H GCPV GNKW
Sbjct: 443 TALVFYNLLDSLVGDTRTRHFGCPVIDGNKW 473
>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
Length = 248
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K+ER K+ + D D R S FL D
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 120
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
Length = 216
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +IELSK ++R KV + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECAELIELSKSNMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI +TN+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSITNVPVVHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + LL+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204
>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 215
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 97/182 (53%), Gaps = 15/182 (8%)
Query: 43 LYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFG 102
L+ +P +++ + D E ++IE + ++ K+VN + + R S+ F E
Sbjct: 26 LHKEPLIMRFERLLTDDECRQLIEAAAPRLRESKLVN---KVVSEIRTSRGMFFEEE--- 79
Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDATPRDEGLWR 158
++PF+++I+ RI + N+ I E +G LQ+ +YG G HYD +P R
Sbjct: 80 ENPFIHRIEKRISALMNVPI---EHAEG-LQVLHYGPGQEYQAHYDFFGPNSPSASN-NR 134
Query: 159 LASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGN 218
+++ + YL DVE GG T+FP L+L V PE+GSA+++ + L+ HS PV G
Sbjct: 135 ISTLIIYLNDVEAGGETVFPLLDLEVKPERGSALYFEYFYRQQELNNLTLHSSVPVVRGE 194
Query: 219 KW 220
KW
Sbjct: 195 KW 196
>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
Length = 248
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +++IE+SK K++R KV + D D R S FL D
Sbjct: 68 FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
CFBP2957]
gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CFBP2957]
Length = 289
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 99/208 (47%), Gaps = 20/208 (9%)
Query: 26 ESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
E+ N+ ++ ++ L+ PR+V + D E + +I L + +++R VVN G
Sbjct: 74 EAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETG 133
Query: 82 DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
+ + R S+ G+HP + +I+ RI T + + E + Q+ +Y GG
Sbjct: 134 EENLISARTSEGAMFQ---VGEHPLVARIEARIAQATGVPVEHGEGF----QVLHYHPGG 186
Query: 142 HYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
Y H D A + G R+A+ + YL V+ GGAT FP L L V P KG+AV
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAV 246
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
F+ + LD H+G PV G KW
Sbjct: 247 FFVYKRPDGTLDDNTLHAGLPVERGEKW 274
>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
Length = 216
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDRSLNELTLHGGAPVTKGEKW 204
>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
Length = 248
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +++IE+SK K++R KV + D D R S FL D
Sbjct: 68 FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
Length = 248
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +++IE+SK K++R KV + D D R S FL D
Sbjct: 68 FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|417402369|gb|JAA48034.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
Length = 529
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 66/196 (33%), Positives = 102/196 (52%), Gaps = 13/196 (6%)
Query: 7 CQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P ++ +L C YE+ + +L + P++ E ++L+P VV HD + D E +I
Sbjct: 305 CQTLGSQPTHYQNPSLHCSYETGASPYLLLQPIRKEVVHLEPYVVLYHDFVNDLEAQKIR 364
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
++ ++R V + + V+ R+SK +L + P L + RI +T L +
Sbjct: 365 GFAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPMLVTLDRRIAALTGL--DTQ 419
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
Y LQ+ NYG+GGHY+ H D AT L+R+ A+FM YL+ VE GGAT F
Sbjct: 420 PPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 479
Query: 179 SLNLTVFPEKGSAVFW 194
N +V K S+ W
Sbjct: 480 YANFSVPVVKCSSPRW 495
>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
Length = 232
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K+ER K+ + D D R S FL D
Sbjct: 52 FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 104
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 105 NKLTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 161 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
Length = 216
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +IELSK K+ R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECGELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
KWC4]
Length = 215
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 100/189 (52%), Gaps = 15/189 (7%)
Query: 36 GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYF 95
G ++ L+ +P +V+ + D E ++IE + +++ K+VN + D R S+ F
Sbjct: 19 GVVEATVLHQEPLIVRFERLLSDDECRQLIETAAPRLKESKLVN---KVVSDIRTSRGMF 75
Query: 96 LYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATP 151
E + PF+++I+ RI + N+ I E +G LQ+ +YG G Y H D +P
Sbjct: 76 FEEE---ESPFIHRIERRIAQLMNVPI---EHAEG-LQVLHYGPGQEYKAHHDFFAPGSP 128
Query: 152 RDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 211
R+++ + YL DVE GG T+FP L + + P++G+A+++ + N L+ HS
Sbjct: 129 AARN-NRISTLIVYLNDVEEGGETVFPLLGIAMKPKRGAALYFEYFYRNQALNDLTLHSS 187
Query: 212 CPVALGNKW 220
PV G KW
Sbjct: 188 VPVVRGEKW 196
>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
Length = 293
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/185 (31%), Positives = 89/185 (48%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
PR++ + + + D+E + ++ L++ +++R VVN GD +D R S G+H
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQ---VGEH 157
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
L +I+ RI +T + E + Q+ NY GG Y H D A G
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGF----QVLNYKPGGEYQPHFDFFNPKRPGEARQLRVG 213
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL GGAT FP + L V P KG+AV + + LD R H+G PV
Sbjct: 214 GQRVATMVIYLNSPASGGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVE 273
Query: 216 LGNKW 220
G KW
Sbjct: 274 AGEKW 278
>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
Length = 293
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/185 (31%), Positives = 89/185 (48%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
PR++ + + + D+E + ++ L++ +++R VVN GD +D R S G+H
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQ---VGEH 157
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
L +I+ RI +T + E + Q+ NY GG Y H D A G
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGF----QVLNYKPGGEYQPHFDFFNPKRPGEARQLRVG 213
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL GGAT FP + L V P KG+AV + + LD R H+G PV
Sbjct: 214 GQRVATMVIYLNSPASGGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVE 273
Query: 216 LGNKW 220
G KW
Sbjct: 274 AGEKW 278
>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus thuringiensis HD-771]
gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-771]
gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
Length = 216
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +++IE+SK K++R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
Length = 193
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 76/145 (52%), Gaps = 20/145 (13%)
Query: 89 RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD 148
R +K ++L E + +I RI DMT + E + Q+ NYG+GGHY LH D
Sbjct: 28 RTAKGFWLKKE---SNELTKRITRRIMDMTGFDLADSEGF----QVINYGIGGHYFLHMD 80
Query: 149 ----ATPRDEGLW---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWY 195
A+ R+A+ +FYLTDVE GGAT+F + V P+ G+A+FWY
Sbjct: 81 YFDFASSNHTDTRSRYSIDLGDRIATVLFYLTDVEQGGATVFGDVGYYVSPQAGTAIFWY 140
Query: 196 NAHANTLLDYRMYHSGCPVALGNKW 220
N + D R H+ CPV +G+KW
Sbjct: 141 NLDTDGNGDPRTRHAACPVIVGSKW 165
>gi|312385412|gb|EFR29925.1| hypothetical protein AND_00803 [Anopheles darlingi]
Length = 468
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/169 (34%), Positives = 84/169 (49%), Gaps = 16/169 (9%)
Query: 7 CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
C+G + L+C Y S FLKI PLK+EE+ LDP +V H I D+EI IIE
Sbjct: 284 CRGESPRTASEMAKLRCRYVSNRVPFLKIAPLKLEEVSLDPFIVVYHQVISDNEIKTIIE 343
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
+S+ + R V + R S +L + HP + + R +DMT L + E
Sbjct: 344 ISRDSLRRAMVGDVAKQEVSKARTSSNAWLDDPM---HPHVRSLSRRTEDMTGLTMWAAE 400
Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLW---------RLASFMFYL 166
+ LQ+ NYG+GGHY H D +EG+ R+A+ M+Y+
Sbjct: 401 Q----LQVGNYGIGGHYLPHFDYGTPEEGVELYPNIEKGNRIATVMYYV 445
>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Sterne]
gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
Length = 232
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 52 FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSRDV--NDIRTSSGAFL-----DD 104
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 105 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Collimonas fungivorans Ter331]
gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
[Collimonas fungivorans Ter331]
Length = 289
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 90/185 (48%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDH 104
PR + + + E +++I LSK K+ R VV++ G+T + R S F + G
Sbjct: 100 PRAILFGNVLSHDECDQLIALSKTKLLRSGVVDHQTGNTKLHEHRTSSGTFFH---RGTT 156
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
PF+ I R+ + + E + LQI NY +GG Y H D A G
Sbjct: 157 PFIAMIDKRLAALMQV----PESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLARG 212
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R A+ + YL DV+ GG TIFP L++ P KGSA+++ +A LD +H G PV
Sbjct: 213 GQRTATLIIYLNDVDGGGETIFPRNGLSIVPAKGSAIYFSYTNAENQLDSLSFHGGSPVI 272
Query: 216 LGNKW 220
G KW
Sbjct: 273 EGEKW 277
>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
Length = 216
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + I D E N +IE+SK K++R + + D D R S FL +
Sbjct: 36 FEEPLIVVLGNVISDEECNELIEMSKNKIKRSTIGSARDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPVTHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
Ancestor']
gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
anthracis str. CDC 684]
gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CNEVA-9066]
gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A1055]
gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Western North America USA6153]
gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Kruger B]
gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Vollum]
gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Australia 94]
gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Ames]
gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. 'Ames Ancestor']
gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CDC 684]
gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
Length = 216
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
Length = 216
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
AH820]
gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH820]
gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
Length = 216
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
anthracis str. CI]
gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
anthracis str. CI]
Length = 216
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
Length = 216
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVIYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
Length = 216
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSRDV--NDIRTSSGAFLE-----D 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTVKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
Length = 232
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 52 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 104
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 105 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
Length = 216
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTAKIEKRISSIXNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVXYLNDVEEGGETFFPKLNLSVHPRKGXAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
Hakam]
gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
Hakam]
Length = 232
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 52 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 104
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 105 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 161 TLVIYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
konkukian str. 97-27]
gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
konkukian str. 97-27]
Length = 232
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 52 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 104
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 105 NELTEKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
Length = 232
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K+ R KV + D D R S FL D
Sbjct: 52 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 104
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 105 NELTEKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
Length = 216
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K+ER K+ + D D R S FL D
Sbjct: 36 FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFLE-----D 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQGQSLNELTLHGGAPVTKGEKW 204
>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
Length = 248
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K++R KV + D D R S FL D
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
Length = 216
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IELSK K++R K+ + D D R S FL +
Sbjct: 36 FEEPLIVVLANVLSDEECDGLIELSKNKIKRSKIGSSRDV--NDIRTSSGAFL-----EE 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 286
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 65/194 (33%), Positives = 97/194 (50%), Gaps = 22/194 (11%)
Query: 38 LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYF 95
L+VE+ P + + + E + +I + K++R +V+ G + R S+ F
Sbjct: 90 LRVEQ----PVLAVLDGVLSHEECDELIRRAAAKLQRSTIVDPTTGKHETIADRSSEGTF 145
Query: 96 LYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE- 154
EI D F+ ++ RI + NL + E LQI +YG GG Y H D P +
Sbjct: 146 F--EINADD-FIARLDRRISALMNLPVDHGEG----LQILHYGPGGEYKPHFDFFPPGDP 198
Query: 155 --------GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR 206
G R+++ + YL +VE GGATIFP L L+V P+KGSAV++ ++ LD R
Sbjct: 199 GSAVQMATGGQRVSTLVMYLNEVEDGGATIFPELGLSVLPKKGSAVYFEYTNSRGQLDPR 258
Query: 207 MYHSGCPVALGNKW 220
H G PV G KW
Sbjct: 259 TLHGGAPVLRGEKW 272
>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
Length = 248
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K++R KV + D D R S FL D
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSARDV--NDIRTSSGAFL-----ED 120
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
Length = 216
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K++R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSARDV--NDIRTSSGAFL-----ED 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|195159168|ref|XP_002020454.1| GL13504 [Drosophila persimilis]
gi|194117223|gb|EDW39266.1| GL13504 [Drosophila persimilis]
Length = 491
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 72/211 (34%), Positives = 103/211 (48%), Gaps = 31/211 (14%)
Query: 18 KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
KS L C + S+ +F LKVEE+ LDP +V HD + E+ EL K
Sbjct: 286 KSTLHCRF-SWRPSF--YARLKVEEVLLDPYIVLYHDVVSGKEM----ELLK-------- 330
Query: 78 VNYGDTIY----VDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQ 133
+YG T + + LS + PE P + + R+ DMT L + E +
Sbjct: 331 -DYGRTNLTHDPLRSGLSAKHCALPESL---PLVQSLHQRLWDMTGLSLNGSESWL---- 382
Query: 134 INNYGLGGHYDLHCDATPRDE----GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
I NYG+GG LH D E G RL + +L++V GG T+FP+L + V P+ G
Sbjct: 383 ITNYGIGGFLGLHKDYFDEIEEELQGDNRLFTIQIFLSNVSQGGYTVFPNLEVAVKPQAG 442
Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+A+ +YN + + D R H GCPV G+KW
Sbjct: 443 TALVFYNLLDSLVGDTRTRHFGCPVIDGDKW 473
>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
Length = 212
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 92/176 (52%), Gaps = 10/176 (5%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
+P +V + + + D E + +I LSK K++R K+ N + D R S F+ G+
Sbjct: 36 EPLIVVLGNVLSDEECDALIGLSKDKLKRSKIGNTRNE--NDMRTSSSTFMEE---GESE 90
Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFY 165
+ +++ RI + N+ E LQI NY +G Y H D ++ R+++ + Y
Sbjct: 91 VVTRVEKRISQIMNIPYENGE----GLQILNYKIGQEYKAHFDFF-KNASNPRISTLVMY 145
Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
L DVE GG T FP LN +V P+KG AV++ + N L+ H G PV +G+KW
Sbjct: 146 LNDVEEGGETYFPKLNFSVSPQKGMAVYFEYFYDNQELNDLTLHGGAPVIIGDKWA 201
>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
Length = 248
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K++R KV + D D R S FL D
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 121 SELTLKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
Length = 248
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K++R KV + D D R S FL D
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 121 SELTLKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
Length = 248
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +IE+SK K+ER K+ + D D R S FL D
Sbjct: 68 FEEPLIVVLANVLSDEECGELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 120
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + ++ H G PV G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSINELTLHGGAPVTKGEKW 236
>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
Length = 283
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 89/186 (47%), Gaps = 18/186 (9%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGD 103
+P V + D + E +R+IE+ + +V R VV+ G + +D R S+ F+
Sbjct: 90 EPVVALLADVLSPRECDRLIEIGRERVRRSSVVDPDSGGEVLIDARKSEGAFVNGST--- 146
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE--------- 154
P + I RI ++ + E L I YG GG Y H D P ++
Sbjct: 147 DPLVATIDRRIAELVQQPVENGE----DLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQR 202
Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
G R+A+ + YL VE GG T FP + LT+ P +G+A+++ +A D R H+G PV
Sbjct: 203 GGQRIATLILYLNQVEEGGDTTFPDIGLTIHPRRGAALYFEYVNALGQTDPRTLHAGMPV 262
Query: 215 ALGNKW 220
G KW
Sbjct: 263 ERGEKW 268
>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
Length = 215
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 94/181 (51%), Gaps = 16/181 (8%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFG 102
Y +P +V + + + + E + +IE SK +++R K+ G+ V+ R S F
Sbjct: 33 YEEPLIVILGNVLSNEECDELIEHSKERLQRSKI---GEERSVNQIRTSSGVFCE----- 84
Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRL 159
++ + KI+ RI + N+ I + LQ+ Y G Y H D T R R+
Sbjct: 85 ENETVAKIEKRISQIMNIPI----EHGDGLQVLLYAPGQEYKPHFDFFADTSRASANNRI 140
Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
++ + YL DVE GG T FP LNL+VFP KG AV++ ++N L+ R H+G PV G K
Sbjct: 141 STLVMYLNDVEEGGETTFPMLNLSVFPSKGMAVYFEYFYSNHELNERTLHAGAPVRKGEK 200
Query: 220 W 220
W
Sbjct: 201 W 201
>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
Length = 216
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + I D E + +IE+SK K++R + + D D R S FL +
Sbjct: 36 FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
Length = 216
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K++R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 SELTLKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
Length = 216
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + I D E + +IE+SK K++R + + D D R S FL +
Sbjct: 36 FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
B4264]
gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
B4264]
Length = 216
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +IE+SK K+ER K+ + D D R S FL D
Sbjct: 36 FEEPLIVVLANVLSDEECGELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFLE-----D 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + ++ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSINELTLHGGAPVTKGEKW 204
>gi|390352104|ref|XP_003727818.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Strongylocentrotus purpuratus]
Length = 121
Score = 90.1 bits (222), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/93 (51%), Positives = 57/93 (61%), Gaps = 5/93 (5%)
Query: 132 LQINNYGLGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
LQI NYGLGGHY H D T RD R+AS +FYL+DV GG T+F + PE
Sbjct: 7 LQIANYGLGGHYLPHFDFT-RDVATHKNGNRIASMLFYLSDVAKGGDTVFIDAGAKIKPE 65
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
KGSA+FWYN N +D R H+ CPV G+KW
Sbjct: 66 KGSAIFWYNLFKNGKVDERTKHASCPVISGSKW 98
>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
Length = 216
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E +IE+SK K++R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLANVLSDEECGELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFLE-----D 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
Length = 282
Score = 89.7 bits (221), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 85/176 (48%), Gaps = 18/176 (10%)
Query: 56 IYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
+ D+E + ++EL++G++ R V+N GD +D R S G+H + +I+ R
Sbjct: 99 LSDAECDALVELARGRLARSPVINPDTGDENLIDARTSMGAMFQ---VGEHTLIQRIEDR 155
Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEGLWRLASFMF 164
I + + + E LQI NY GG Y H D A G R A+ +
Sbjct: 156 IAAVLGVPVDHGEG----LQILNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRTATLVI 211
Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
YL + GGAT FP + L V P KG+AV++ + LD R H+G PV G KW
Sbjct: 212 YLNTPQAGGATAFPRIGLEVAPVKGNAVYFSYLQPDGKLDERTLHAGLPVQSGEKW 267
>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
Length = 216
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + I D E + +IE+SK K++R + + D D R S FL +
Sbjct: 36 FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPVTHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
Length = 248
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K++R KV + D D R S FL D
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAVNNRIS 176
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236
>gi|241598365|ref|XP_002404734.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215500465|gb|EEC09959.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 524
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 104/240 (43%), Gaps = 33/240 (13%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
E Y C+G + S L+C Y S + F K+ P+K+EE L P VV + D + D +
Sbjct: 274 ENYKRLCRGEQLRTPKMDSQLRCRYYSGESGFFKLQPIKLEEYNLKPYVVVLRDLLQDRD 333
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIF---------GDHPFLYKIQ 111
+ +I +K +V + +LS+ +Y + + D P ++
Sbjct: 334 LADMIAFAKPRVRK-------------LQLSRRILVYSKHYCDTSTWLNDDDAPVAARVN 380
Query: 112 TRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLW------RLA 160
+Q + L + Q+ NYG+GGHY H D T R + R+A
Sbjct: 381 QYLQSLLGLGTLYSKDEAEKYQLANYGIGGHYVPHHDYLEETLTSRHVSIVTRLFGDRVA 440
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ M Y++DVE GGAT+FPSL + V P+K S F R A+ NKW
Sbjct: 441 TLMIYMSDVEEGGATVFPSLGVRVSPKKVSMQFIRAVMRWVAFTLREVCVSFCCAVANKW 500
>gi|402584932|gb|EJW78873.1| hypothetical protein WUBG_10221 [Wuchereria bancrofti]
Length = 187
Score = 89.7 bits (221), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 67/124 (54%), Gaps = 10/124 (8%)
Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW----- 157
+H + +I R+ TNL E LQ+ NYG+GGHY+ H D + R+
Sbjct: 14 EHEVVNRINKRLDLATNL----ETETAEELQVQNYGIGGHYEPHYDCSRRESVFEKTKNG 69
Query: 158 -RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 216
R+A+ + Y+T E+GG T+F L ++ K +A+FWYN + +D R YH+ CPV
Sbjct: 70 NRIATILIYMTKPEIGGGTVFIDLKTSISCTKNAALFWYNLMRSGAVDIRSYHAACPVLT 129
Query: 217 GNKW 220
G KW
Sbjct: 130 GTKW 133
>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 296
Score = 89.7 bits (221), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 91/172 (52%), Gaps = 14/172 (8%)
Query: 56 IYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
+ + E +++IE+S+ +++ V++ G+ R SK Y ++ F+ K++ R
Sbjct: 118 LSEEECDQLIEMSRERLKPSTVIDPKTGEEKAATGRTSKGMSFY---LQENEFIKKVEKR 174
Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR-----DEGLWRLASFMFYLTD 168
I ++ + E LQ+ NYG+G Y H D P+ ++G R+ +F+ YL D
Sbjct: 175 IAELIEFPVENGEG----LQVLNYGIGEEYKSHFDYFPQSKVVPEKGGQRVGTFLIYLND 230
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
V GG T+FP +++ P+KGSAV++ ++ +D HS PV+ G KW
Sbjct: 231 VPAGGETVFPKAGVSIVPKKGSAVYFQYGNSKGEVDRMSLHSSIPVSEGEKW 282
>gi|194745802|ref|XP_001955376.1| GF16267 [Drosophila ananassae]
gi|190628413|gb|EDV43937.1| GF16267 [Drosophila ananassae]
Length = 385
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/218 (30%), Positives = 100/218 (45%), Gaps = 31/218 (14%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
I+ C+G +VP+ K L+C Y + + FL++ PLK+E+L LDP + HD I E
Sbjct: 46 IHLETCRGRNTVPK--KFYLRCRYFTEGDPFLQLAPLKLEQLNLDPFIGIFHDVISIGEQ 103
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
+I L++ ++ V+ SK + +I RI+DMT L
Sbjct: 104 KNLINLTRNRLRLQNPQRAVMEAEVELNASKE-------------VERIHRRIEDMTGLN 150
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN 181
+ EE PL I NYG+GG + +H D F L+DV++GG FP L
Sbjct: 151 L--EE--SPPLTILNYGIGGQHPIHLDCE------------QFMLSDVQMGGYASFPELG 194
Query: 182 LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
P +GSA+ +N D R + CP A+ K
Sbjct: 195 FGFKPSRGSALVVHNMDNAANCDIRSLQATCPGAVTFK 232
>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
G9842]
gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
G9842]
Length = 216
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + + D E + +IE+SK K++R KV + D D R S FL D
Sbjct: 36 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAVNNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|196011906|ref|XP_002115816.1| hypothetical protein TRIADDRAFT_59903 [Trichoplax adhaerens]
gi|190581592|gb|EDV21668.1| hypothetical protein TRIADDRAFT_59903 [Trichoplax adhaerens]
Length = 444
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 60/197 (30%), Positives = 96/197 (48%), Gaps = 16/197 (8%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
Y C+ + + + + LKC+Y +N + L P+ VEE+ P + HD I E
Sbjct: 238 YTKLCRSHKNYQTSLNNGLKCYY--FNQSPLLHFNPVAVEEISYSPVIRLYHDIISHQEA 295
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQT----RIQDM 117
+ +S K+ + + + S+ Y F H +L I R+ +
Sbjct: 296 EILKNISSKKLTVARTF-----VQIMPNNSEAEGEYR--FAKHAWLGDIDNQVVRRLSVL 348
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE--GLWRLASFMFYLTDVELGGAT 175
+ + G + Y LQ+ NYG+GGHY H D+ D+ G RLA+ MFYL+DV++GGAT
Sbjct: 349 SEELTGLDLSYAEKLQVANYGVGGHYSPHYDSASIDDDTGKPRLATIMFYLSDVDIGGAT 408
Query: 176 IFPSLNLTVFPEKGSAV 192
+FP + +FP K S +
Sbjct: 409 VFPDIGKAIFPRKTSEI 425
>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
Length = 144
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 8/116 (6%)
Query: 109 KIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW---RLASFMF 164
+I R+Q + L + E LQ+ NYG+GGHY+ H D A + L R+A+F+
Sbjct: 16 RISYRVQAYSGLNMTTSE----DLQVVNYGIGGHYEPHYDFARDKFTSLGTGNRIATFLS 71
Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
YL+DVE GG T+F + TV+P+KG A FWYN + D H+ CPV +G+KW
Sbjct: 72 YLSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKW 127
>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
Length = 216
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + I D E +IE+SK K++R + + D D R S FL +
Sbjct: 36 FEEPLIVVLGNVISDEECGELIEMSKNKIKRSTIGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPVTHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G PV G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204
>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
Length = 217
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 92/179 (51%), Gaps = 12/179 (6%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
+P +V + + + D E +I +S+ K++R K+ G+T VD + + E G++
Sbjct: 38 EPLIVILGNVLSDEECEGLIRMSEDKLKRSKI---GNTRTVDDIRTSSSMFFEE--GENE 92
Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASF 162
+ +I+ R+ + N+ + E LQ+ NY +G Y H D R+++
Sbjct: 93 LVARIERRLSQIMNIPVEHGE----GLQMLNYHIGQEYKAHFDFFSSSSRAASNPRISTL 148
Query: 163 MFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+ YL DVE GG T FP LN +V P+KGSAV++ + N L+ H G PV G+KW
Sbjct: 149 VMYLNDVEEGGETYFPKLNFSVNPQKGSAVYFEYFYDNQDLNDLTLHGGAPVIKGSKWA 207
>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Alteromonas sp. S89]
Length = 294
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 104/210 (49%), Gaps = 22/210 (10%)
Query: 25 YESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--Y 80
+ ++N + +G +VE + P +V + + + E + ++E+S+ + +VVN +
Sbjct: 79 FPTFNTGVIPLGDQQVEARFAIRQPNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQH 138
Query: 81 GDTIYVDTRLSK-VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
G +R S +F G+ P + I+ RI + + E + PLQI +Y +
Sbjct: 139 GAFELKPSRTSGGTHFAR----GETPLIADIEARIASLLKV----PEAHGEPLQILHYPV 190
Query: 140 GG----HYDLHCDATPRDE-----GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
G HYD P ++ G R+ + + YL+DVE GGAT+FP + L V P+KG+
Sbjct: 191 SGEYRPHYDFFDPEKPGNQEVLAAGGQRVGTLIMYLSDVESGGATVFPRVGLEVQPQKGA 250
Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
A+F+ + LD + H G PV G KW
Sbjct: 251 ALFFSYVGEHGKLDLQSLHGGSPVLAGEKW 280
>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
Length = 216
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + I D E N +IE+SK K++R + + D D R S FL +
Sbjct: 36 FEEPLIVVLGNVISDEECNELIEMSKNKIKRSTIGSARDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPVTHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G V G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGASVTKGEKW 204
>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 217
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 94/180 (52%), Gaps = 15/180 (8%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDH 104
+P +V + + + D E + +I LSK ++ R K+ N VD R S F+ ++
Sbjct: 38 EPLIVVLGNVLSDEECDELIRLSKDRINRSKIANAN----VDNMRTSSSTFIEE---NEN 90
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDE-GLWRLAS 161
+ +I+ RI + N+ Y LQI NY +G Y H D ++P + R+++
Sbjct: 91 IIVSRIEKRISQIMNI----PTEYGEGLQILNYQVGQEYKSHFDFFSSPHNAINNPRIST 146
Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+ YL+DVE GG T FP L+ +V P+KG AV++ + + L+ H G PV +G+KW
Sbjct: 147 LVMYLSDVEQGGETYFPKLHFSVSPQKGMAVYFEYFYNDQTLNELTLHGGAPVIVGDKWA 206
>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
Length = 211
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 57/182 (31%), Positives = 88/182 (48%), Gaps = 14/182 (7%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
+P++ + + + + E +I LSK KV R K+ + D D R S FL D
Sbjct: 33 EPKIAILGNVVSEEECEALIRLSKDKVNRSKIGSDHDV--SDIRTSSSAFL-----PDDE 85
Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLASF 162
+I+ R+ + N+ + E + I +Y G Y H D +T R R+++
Sbjct: 86 LTGRIEKRLAQIMNVPVEHGE----GIHILHYKPGQEYKAHHDYFRSTSRAAKNPRISTL 141
Query: 163 MFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
+ YL DVE GG T FP +NLTV P KG AV++ + + ++ R H G PV G KW
Sbjct: 142 VLYLNDVEEGGETYFPEMNLTVSPHKGMAVYFEYFYNDPAINERTLHGGSPVTAGEKWAA 201
Query: 223 LL 224
+
Sbjct: 202 TM 203
>gi|444517246|gb|ELV11441.1| Prolyl 4-hydroxylase subunit alpha-2 [Tupaia chinensis]
Length = 466
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 94/192 (48%), Gaps = 25/192 (13%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
+T L + E LQ+ NYG+GG Y+ H D + ++DVE GGAT+
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFS--------------RMSDVEAGGATV 446
Query: 177 FPSLNLTVFPEK 188
FP L ++P+K
Sbjct: 447 FPDLGAAIWPKK 458
>gi|242003035|ref|XP_002436120.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215499456|gb|EEC08950.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 173
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 71/137 (51%), Gaps = 24/137 (17%)
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL--W---- 157
HP + K+ RI T L E LQ+ NYG+GGHY H D + +D+ L W
Sbjct: 11 HPVVKKLSRRIAAATGLSTSSAEH----LQVVNYGVGGHYSPHFDFSTKDKPLRGWETFA 66
Query: 158 --RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYN---AHANTLL--------- 203
R A+++ YL+ VE GGAT+F L + V PE G A+FW+N N+L
Sbjct: 67 GQRQATWLVYLSSVERGGATLFKRLRVRVQPEAGMALFWHNLPPGSTNSLPSCCVHRSVG 126
Query: 204 DYRMYHSGCPVALGNKW 220
D R H CPV +G+KW
Sbjct: 127 DERTEHGACPVLVGSKW 143
>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
Length = 205
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 55/186 (29%), Positives = 91/186 (48%), Gaps = 17/186 (9%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
DP V +++ + D E +E+ KGK+ER KV++ ++ + +R + +L
Sbjct: 16 DPIVYVVNNFLSDDECEAFVEMGKGKMERAKVISDDESEFHASRTNDFCWLE---HSASD 72
Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDA----TPRDEGLW---- 157
++++ R + + I E++ Q+ YG G Y H DA T + W
Sbjct: 73 VIHEVSKRFSVLVKMPINNAEQF----QLVYYGPGNEYKPHFDAFDKTTKEGQNNWFPGG 128
Query: 158 -RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA-HANTLLDYRMYHSGCPVA 215
R+ + + YL DVE GGAT FP +N++V P KG V ++N T ++ + H G PV
Sbjct: 129 QRMVTALAYLNDVEEGGATDFPKINVSVKPNKGDVVVFHNCIEGTTEINPQALHGGSPVV 188
Query: 216 LGNKWG 221
G KW
Sbjct: 189 AGEKWA 194
>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
Length = 216
Score = 86.7 bits (213), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 87/180 (48%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P +V + + I D E + +IE+SK K++R + + D D R S FL +
Sbjct: 36 FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSRDV--NDIRTSSGAFLE-----E 88
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ KI+ RI + N+ + E L I NY + Y H D R R++
Sbjct: 89 NELTSKIEKRISSIMNVPVTHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP LNL+V P KG AV++ + + L+ H G V G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGASVTKGEKW 204
>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
Length = 210
Score = 86.7 bits (213), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 94/183 (51%), Gaps = 12/183 (6%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P V + + + D E + +I LSK ++ R K+ + D R S FL PE +
Sbjct: 30 FHEPFVAVLGNVLSDEECDELISLSKDRMNRSKIAGNQEN---DIRTSTSVFL-PEDASE 85
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--RLAS 161
+ +++ RI + N+ + E LQ+ NY +G Y H D + + R+++
Sbjct: 86 --VVQRVEKRISQIMNIPVEHGE----GLQLLNYQIGQEYKAHFDFFSPKKLIENPRIST 139
Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+ YL DVE GG T FP+L L+V P KG AV++ + + +L+ H G PV +G+KW
Sbjct: 140 LVLYLNDVEEGGDTYFPNLKLSVSPHKGMAVYFEYFYDDPMLNELTLHGGAPVTIGDKWA 199
Query: 222 KLL 224
+
Sbjct: 200 ATM 202
>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
Length = 495
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 76/141 (53%), Gaps = 16/141 (11%)
Query: 89 RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD 148
R+SK +L G+ + +++ RI MT L + E + Q+ NYGL G YD H D
Sbjct: 339 RISKNCWLSGREHGE--VIDRVERRIAAMTRLNLETAEGF----QVQNYGLAGQYDPHFD 392
Query: 149 ATPRDEGLW---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 199
+ RD R+A+ + +++ VE GGAT+FP + + P+KG AVFW+N
Sbjct: 393 FS-RDLANSSLGSLGTGNRIATVLVWMSQVESGGATVFPYVGARILPQKGDAVFWHNLLR 451
Query: 200 NTLLDYRMYHSGCPVALGNKW 220
+ D+R H+GCPV G KW
Sbjct: 452 SGDGDFRTRHAGCPVLSGIKW 472
>gi|195352180|ref|XP_002042592.1| GM14979 [Drosophila sechellia]
gi|194124476|gb|EDW46519.1| GM14979 [Drosophila sechellia]
Length = 461
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 96/219 (43%), Gaps = 53/219 (24%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G + N C Y FL++ PLK E L LDP +V H+ + D EI+
Sbjct: 295 YEIGCRGQFLR----RRNHVCTYNFTITEFLRLAPLKQEVLNLDPYIVIYHNILNDDEID 350
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
++ + S +VVN I+ RI ++T L
Sbjct: 351 KLKQHSNDNT--AEVVN-----------------------------PIEKRINELTRLSF 379
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL 182
++ L ++ G G T + + + +F+L +VELGGAT+FP L +
Sbjct: 380 LNSDQ----LIVSKNGPG---------TQKHIKEYSKGTLLFFLNNVELGGATVFPKLKI 426
Query: 183 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+VFP+KGS + WYN D R CPV GNKWG
Sbjct: 427 SVFPQKGSCLIWYNTP-----DPRSDPLECPVLQGNKWG 460
>gi|449284064|gb|EMC90646.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Columba livia]
Length = 174
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/156 (35%), Positives = 81/156 (51%), Gaps = 12/156 (7%)
Query: 72 VERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP 131
++R V + + R+SK +L HP + ++ R+ +T L + Y
Sbjct: 1 LQRSVVASGEKQQKAEYRISKSAWLKDTA---HPVVQTLEKRMAAVTGLDL--RPPYAEY 55
Query: 132 LQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTV 184
LQ+ NYGLGGHY+ H D AT R L+R+ A+ M YL+ V GG+T F NL+V
Sbjct: 56 LQVVNYGLGGHYEPHFDHATSRKSPLYRMKSGNRIATLMIYLSAVGAGGSTAFVHANLSV 115
Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
K +A+FW+N N D H+GCPV G+KW
Sbjct: 116 PVVKNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKW 151
>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
gi|255647110|gb|ACU24023.1| unknown [Glycine max]
Length = 289
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 110/243 (45%), Gaps = 34/243 (13%)
Query: 4 PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
P + +GNL P D+ S + E+ ++ + G VE + +PR H+ + E
Sbjct: 44 PSSSRGNLPKPNDLASIARNTIETSDSD--ERGEQWVEVVSWEPRAFVYHNFLTKEECEY 101
Query: 64 IIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
+I+++K + + VV+ D+R+ S FL G + I+ +I D T +
Sbjct: 102 LIDIAKPSMHKSTVVDSETGKSKDSRVRTSSGTFL---ARGRDKIVRNIEKKISDFTFIP 158
Query: 122 IGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIF 177
+ E LQ+ +Y +G HYD D G R+A+ + YLTDVE GG T+F
Sbjct: 159 VEHGEG----LQVLHYEVGQKYEPHYDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVF 214
Query: 178 PSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGN 218
P+ L++ P++G A+ +++ + LD H GCPV GN
Sbjct: 215 PAAKGNFSFVPWWNELFECGKKGLSIKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGN 274
Query: 219 KWG 221
KW
Sbjct: 275 KWS 277
>gi|47204411|emb|CAF95476.1| unnamed protein product [Tetraodon nigroviridis]
Length = 284
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 88/176 (50%), Gaps = 18/176 (10%)
Query: 52 IHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQ 111
+HDA+ + S K+ R V + + R+SK +L + ++
Sbjct: 97 LHDALDH------LAFSHFKLRRSVVATRDKQVTAEYRISKSAWLKGSA---QSAVSRLD 147
Query: 112 TRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMF 164
RI +T L + + + LQ+ NYG+GGHY+ H D AT ++ R+A+ M
Sbjct: 148 QRISMLTGLNV--QHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTGNRVATVMI 205
Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
YL+ VE GG+T F N +V K +A+FW+N H N D H+GCPV +G+KW
Sbjct: 206 YLSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLIGDKW 261
>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
Length = 274
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/208 (29%), Positives = 98/208 (47%), Gaps = 33/208 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV-NYGDTIYVDTRLSKVYFLYP 98
V+++ L PR H+ + +E +++L+ K++R VV N G+ + + R S Y ++
Sbjct: 1 VQQVGLHPRAYYFHNFLTKAERGHLVKLAAPKLKRSTVVGNDGEGVVDNIRTS--YGMFI 58
Query: 99 EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---G 155
D P + +I+ RI T+L + +E +Q+ Y G Y H D+ +
Sbjct: 59 RRLQD-PVVARIEKRISLWTHLPVEHQED----IQVLRYAHGQTYGAHYDSGDKSNEPGP 113
Query: 156 LWRLASFMFYLTDVELGGATIFP----------------------SLNLTVFPEKGSAVF 193
WRLA+F+ YL+DVE GG T FP N+ P+ G AV
Sbjct: 114 KWRLATFLMYLSDVEEGGETAFPHNSVWADPSIPEKVGDKFSDCAKGNVAAKPKAGDAVL 173
Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKWG 221
+Y+ + N +D H+GCPV G KW
Sbjct: 174 FYSFYPNMTMDPAAMHTGCPVIKGVKWA 201
>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
Length = 229
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 88/179 (49%), Gaps = 13/179 (7%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
+P +V + + + + E +++I LSK ++ER K+ N D R S F ++
Sbjct: 43 EPLIVLLGNVLSEEECDQLISLSKDRIERSKISNKS---VHDLRTSSSMFFDD---AEND 96
Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASF 162
+ ++ R+ + + + E +QI NY +G Y H D R+++
Sbjct: 97 VVSTVEKRVSQIMKIPVDHGE----GIQILNYAIGQEYKAHYDYFSSGNSKVNNPRISTL 152
Query: 163 MFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+ YL DVE GG T FP LN V P+KG AV++ + +T L+ H G PV +G+KW
Sbjct: 153 VMYLNDVEAGGETYFPKLNFYVAPKKGMAVYFEYFYNDTTLNELTLHGGAPVVIGDKWA 211
>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
Length = 220
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 92/180 (51%), Gaps = 14/180 (7%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
Y +P VV + + + DSE + +IE S+ +++R K+ G + T S V+ E
Sbjct: 38 YEEPLVVVLGNVLSDSECDELIEHSRERLQRSKIGEDGSVNSIRTS-SGVFCEQTET--- 93
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ +I+ RI + N+ I + LQ+ Y G Y H D T R R++
Sbjct: 94 ---ITRIEKRISQIMNIPI----EHGDGLQVLRYTPGQEYKPHYDFFAETSRASTNNRIS 146
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T+FP L+L+VFP KG AV++ ++N L+ H+G V G KW
Sbjct: 147 TLVMYLNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYSNQELNDFTLHAGTQVIHGEKW 206
>gi|194871352|ref|XP_001972832.1| GG13663 [Drosophila erecta]
gi|190654615|gb|EDV51858.1| GG13663 [Drosophila erecta]
Length = 420
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 90/202 (44%), Gaps = 45/202 (22%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
NL C Y FL++ PLK E L DP +V H+ + D EI ++ + S N
Sbjct: 263 NLVCTYNFTATEFLRLSPLKQEVLNWDPYIVLYHEVLNDDEIEKLKQHSND--------N 314
Query: 80 YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
+ I +PF +I RI MT L I ++ L ++
Sbjct: 315 SAEEI-------------------NPFKKRIFQRISHMTRLRIPHSDQ----LIVSENVS 351
Query: 140 GGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 199
H H TP+ + +F+L +V+ GGAT+FP+L + VFP++GS +FW+
Sbjct: 352 ETHR--HKGKTPK-------GTLLFFLDNVKQGGATVFPNLKIAVFPQRGSCLFWHKT-- 400
Query: 200 NTLLDYRMYHSGCPVALGNKWG 221
LD R CPV GNKW
Sbjct: 401 ---LDTRNEPLECPVLQGNKWS 419
>gi|156370183|ref|XP_001628351.1| predicted protein [Nematostella vectensis]
gi|156215325|gb|EDO36288.1| predicted protein [Nematostella vectensis]
Length = 478
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 60/176 (34%), Positives = 87/176 (49%), Gaps = 19/176 (10%)
Query: 23 CFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGD 82
C+Y++ + L + P KVE++ DPRVV + D E RI +++ + R V N
Sbjct: 312 CWYDNRGDARLLLKPNKVEQVNDDPRVVIFRGLVTDRETARIKQIASPMLNRATVYNIDT 371
Query: 83 TI--YVDTRLSKVYFLYPEIFGDH--PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
+ Y D R+SK +L DH + + RI +T L + E+ LQI NYG
Sbjct: 372 GVLEYADYRVSKSAWL-----EDHLDETIATVNKRIAMVTGLDVQTAEK----LQIANYG 422
Query: 139 LGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
+GG Y+ H D D L R+A+ + YL DV LGGAT+F + V P K
Sbjct: 423 MGGQYEQHTDHGEPDSPLANDPLGNRIATLLIYLNDVALGGATVFLKAGVHVPPTK 478
>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
Length = 286
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 89/185 (48%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P + + D + D+E +R+IE+ + V+R VV+ G I ++ R S+ F+
Sbjct: 94 PVIALVADVLDDTECDRLIEIGREHVQRSSVVDPDSGKEITIEERRSEGAFVNASTDA-- 151
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
+ I RI ++ + E L I YG+GG Y H D P ++ G
Sbjct: 152 -LVETIDRRIAELFRQPVENGE----DLHILRYGMGGEYRPHYDYFPEEQAGSKHHMQRG 206
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL +VE GG T FP + L + P +GSA+++ + D + H+G PV
Sbjct: 207 GQRIATVILYLNEVEQGGDTTFPDIGLAIHPRRGSALYFEYVNELGQSDPKTLHAGTPVE 266
Query: 216 LGNKW 220
G KW
Sbjct: 267 KGEKW 271
>gi|241778760|ref|XP_002399787.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215508519|gb|EEC17973.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 427
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/244 (26%), Positives = 112/244 (45%), Gaps = 34/244 (13%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + S L+C Y + F K+ P+K+EE L P +V +HD I D ++
Sbjct: 168 YKRLCRGEQLRTLKMDSQLRCRYYKGQDGFFKLQPIKLEEFNLKPYIVVLHDVIQDRDLE 227
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFL--YPEIFGDHPFLYK----IQTRIQD 116
+I +K + +TI + + FL + + +L++ I +R+
Sbjct: 228 DLIAFAKPRAR--------NTIPLFRNVKWCTFLKRFCSLLAASTWLFEQNATIASRLNR 279
Query: 117 MTNLVIG---REERYKG-PLQINNYGLGGH--------YDLHCDATPRDEGLW------R 158
++G + ++ P Q+ NYG GGH YD++ D+ D+ R
Sbjct: 280 YLTALLGMGTSDSNFEAEPYQLANYGTGGHYLPHHDYLYDVYEDSDETDDFSQFPSYGDR 339
Query: 159 LASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV--AL 216
LA+ M Y++DVE GGAT+FP L + + P+K V + ++ R C + ++
Sbjct: 340 LATLMIYMSDVEEGGATVFPKLGVRLTPKKVKMVIYKVQPDSSAQKLRALGDCCHLRSSV 399
Query: 217 GNKW 220
NKW
Sbjct: 400 ANKW 403
>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
Length = 283
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/182 (31%), Positives = 88/182 (48%), Gaps = 14/182 (7%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYG--DTIYVDTRLSKVYFLYPEIFGDH 104
P V+ + + E + +I LS+ +++ VV+ G + R SK ++
Sbjct: 96 PFVLHLDQVLSSEECDELISLSRSRLQPSLVVDRGSGEERAGSGRTSKSMAFR---LKEN 152
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP-----RDEGLWRL 159
+ +I+TRI ++T E LQI NYGLG Y H D P +G R+
Sbjct: 153 ELVERIETRIAELTGYPAENGE----GLQILNYGLGEEYKPHFDFFPPHMADASKGGQRV 208
Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
+F+ YL DVE GG T+F L+ P+KG+A++++ +A LD HS PV G K
Sbjct: 209 GTFLIYLNDVEDGGETVFSKAGLSFVPKKGAAIYFHYGNAQGQLDRLSVHSSVPVRKGEK 268
Query: 220 WG 221
W
Sbjct: 269 WA 270
>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
Length = 209
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 96/176 (54%), Gaps = 10/176 (5%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
+P ++ + + + +E + +I+L+ +++R K+ + D V T S ++F E +
Sbjct: 31 EPLILILDNVLSWAECDLLIDLASARMQRAKIGSSHDVSEVRTS-SSMFFEESE----NE 85
Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-RLASFMF 164
+ +++ R+ ++ N+ + E PLQ+ Y G Y H D + + R+++ +
Sbjct: 86 CIGQVEARVAELMNIPVSHAE----PLQVLRYQPGEQYHPHFDYFTQGSSMNNRISTLVM 141
Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
YL DVE GG T FPSL+ +V P+KGSAV++ + +T L+ H+G PV G KW
Sbjct: 142 YLNDVEEGGETYFPSLHFSVTPKKGSAVYFEYFYNDTRLNELTLHAGHPVEAGEKW 197
>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
Length = 280
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 89/190 (46%), Gaps = 23/190 (12%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTI--YVDTRLSKVYFLYPEIFGDH 104
PR+V + + + D E + I +S+ + R ++ I + D+R S+ + G+
Sbjct: 92 PRIVVLGNVLSDDECDAIAAMSRTRFARSTTIDNASGINRFDDSRTSESAHIQ---RGET 148
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
+ +I R+ ++ + E PLQ+ Y G Y H D A ++
Sbjct: 149 ELIARIDARLAALSGWPVDHGE----PLQLQKYQAGNEYRPHFDWFDPALAGTAKHLEKS 204
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
RLA+ + YLTDVE GG T FP + L V P+KG A+F+ N + D + H+G PV
Sbjct: 205 GQRLATIILYLTDVEEGGGTSFPGIGLDVHPQKGGALFFRNTTPYGVPDRKTQHAGLPVE 264
Query: 216 LG-----NKW 220
G NKW
Sbjct: 265 KGTKIIANKW 274
>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
Length = 364
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 63/234 (26%), Positives = 106/234 (45%), Gaps = 32/234 (13%)
Query: 13 VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKV 72
+PE + + + + F + VE++ L PR H+ + +E ++ L+ K+
Sbjct: 21 LPERLLESALVMHTEADKQFDEEATPWVEQVGLHPRAYLFHNFLTKAERAHMVRLAAPKL 80
Query: 73 ERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP 131
+R VV + G+ + + R S + ++ D P + +I+ RI T+L I +E
Sbjct: 81 KRSTVVGSKGEGVVDNIRTS--FGMFIRRLSD-PIIARIEKRISLWTHLPIEHQED---- 133
Query: 132 LQINNYGLGGHYDLHCDATPRDEGL---WRLASFMFYLTDVELGGATIFPSLN------- 181
+Q+ Y G Y H D+ + + WRLA+F+ YL+DVE GG T FP +
Sbjct: 134 IQVLRYAHGQTYGAHYDSGASSDHVGPKWRLATFLMYLSDVEEGGETAFPQNSVWYDPTI 193
Query: 182 --------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+ P+ G AV +Y+ N +D H+GCPV G KW
Sbjct: 194 PERIGPVSECAKGHVAAKPKAGDAVLFYSFLPNNTMDPAAMHTGCPVIKGIKWA 247
>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
Length = 313
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 90/208 (43%), Gaps = 23/208 (11%)
Query: 33 LKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRL 90
+K P +V +L PR + + D E + +IELSK K+E+ V + G +I + R
Sbjct: 44 VKFDPTRVTQLSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMVADNESGKSIQSEVRT 103
Query: 91 SKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT 150
S FL + + I+ RI T L + E + +N H+D D
Sbjct: 104 SSGMFLNKQ---QDEIVSGIEARIAAWTFLPVENGESMQVLHYMNGEKYEPHFDFFHDKA 160
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
+ G R+A+ + YL++VE GG TIFP V P KG A+
Sbjct: 161 NQRLGGHRVATVLMYLSNVEKGGETIFPHAEGKLSQPKDESWSECAHKGYAVKPRKGDAL 220
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
+++ H + D + H CPV G KW
Sbjct: 221 LFFSLHLDATTDSKSLHGSCPVIEGEKW 248
>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
Length = 281
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 81/156 (51%), Gaps = 11/156 (7%)
Query: 71 KVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKG 130
K+ R ++ N + R+S+ +L+ + D + ++ RI +T L
Sbjct: 108 KMFRSRIGNSFSEVESHIRISQQAWLHDK---DDEIVARVSKRIGLLTGL--NTTPTSTE 162
Query: 131 PLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTV 184
LQ+ NYGLGG Y+ H D +E +W R+A+F+ YL+DV GGAT+FP N+TV
Sbjct: 163 LLQVLNYGLGGQYEPHHDYMTAEEKMWGTILGNRMATFLMYLSDVTAGGATVFPVANVTV 222
Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
K + + + + + D H+GCPV +G+KW
Sbjct: 223 PVVKNAGLLFMDLLRSGRGDVNSLHAGCPVVIGSKW 258
>gi|195591300|ref|XP_002085380.1| GD14756 [Drosophila simulans]
gi|194197389|gb|EDX10965.1| GD14756 [Drosophila simulans]
Length = 477
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 67/226 (29%), Positives = 97/226 (42%), Gaps = 51/226 (22%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G + N C Y FL++ PLK E L DP +V H+ + D EI+
Sbjct: 295 YEIGCRGQFLG----RRNHVCSYNFTITEFLRLAPLKQEVLNWDPYIVIYHNVLKDDEID 350
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
++ + S +VVN P +I RI ++T L
Sbjct: 351 KLKQHSNDNA--AEVVN-------------------------PIEKRIFQRINELTRL-- 381
Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMF-------YLTDVELGGAT 175
+ L ++ G G H + L+ +++F F L +VELGGAT
Sbjct: 382 ----SFLNQLIVSKNGPGTQK--HIKEYSKGTLLFFVSTFSFAIYIYISLLNNVELGGAT 435
Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+FP L ++VFP+KGS + WYN D R CPV GNKWG
Sbjct: 436 VFPKLKISVFPQKGSCLIWYNTP-----DPRSEPLECPVLQGNKWG 476
>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 303
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 91/213 (42%), Gaps = 34/213 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
+ P KV+++ PR + D E + +I L+K +++R V + G + + R S
Sbjct: 36 VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSS 95
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
F++ P + I+ +I T L E +Q+ Y G YD H D
Sbjct: 96 GAFIHK---AKDPIVSGIEDKIAAWTFLPKDNGE----DIQVLRYEYGQKYDAHFDYFAD 148
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
G R+A+ + YL+DVE GG T+FPS + V P
Sbjct: 149 KVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR 208
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
KG A+ +++ H N + D H GCPV G KW
Sbjct: 209 KGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKW 241
>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
Length = 211
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 55/182 (30%), Positives = 89/182 (48%), Gaps = 14/182 (7%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
+P + + + + + E +I LSK K+ R K+ + + D R S FL PE
Sbjct: 33 NPLIAILGNVVSEEECEELIFLSKNKMNRSKIGSQHEV--SDIRTSSSTFL-PE----DD 85
Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLASF 162
+I+ R+ + N+ + E L I NY G Y H D + + R+++
Sbjct: 86 LTNRIEKRVAQIMNVPVEHGE----GLHILNYKQGQEYKAHYDYFRSKAKAANNPRISTL 141
Query: 163 MFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
+ YL DVE GG T FP +NL++ P KG AV++ +++ L++ R H G PV G KW
Sbjct: 142 VLYLNDVEEGGETYFPHMNLSISPHKGMAVYFEYFYSDPLINERTLHGGSPVTSGEKWAA 201
Query: 223 LL 224
+
Sbjct: 202 TM 203
>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 290
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 94/207 (45%), Gaps = 32/207 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
E L +PR H+ + E +IEL+K ++ + VV+ G + R S FL
Sbjct: 79 TEILSWEPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTGKSTESRVRTSSGMFLK 138
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
G + I+ RI D T + EE +G LQI +Y +G HYD D
Sbjct: 139 R---GKDKIVQNIEKRIADFTFIP---EENGEG-LQILHYEVGQKYEPHYDYFLDEFNTK 191
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
G R+A+ + YL+DVE GG T+FP+ N L+V P+ G A+ +
Sbjct: 192 NGGQRIATVLMYLSDVEEGGETVFPAANANFSSVPWWNDLSQCARKGLSVKPKMGDALLF 251
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKWG 221
++ + LD H GCPV GNKW
Sbjct: 252 WSMRPDATLDPSSLHGGCPVIKGNKWS 278
>gi|260787668|ref|XP_002588874.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
gi|229274045|gb|EEN44885.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
Length = 151
Score = 83.2 bits (204), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 49/124 (39%), Positives = 71/124 (57%), Gaps = 9/124 (7%)
Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE------GL 156
+H + K+ R++ +T L + Y Q+ NYGLGG Y+ H D RDE
Sbjct: 8 EHTVIAKLSRRVEYITGLDVNWP--YGEAFQVLNYGLGGFYEPHVDYF-RDEQPALLTNG 64
Query: 157 WRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 216
R+ +F+FYL+DVE GGAT+F LNLTV K SAV +++ + + H+GCPV +
Sbjct: 65 QRIVTFLFYLSDVEAGGATVFTRLNLTVPAVKNSAVLFHDLKRSLEFEKDSEHAGCPVLM 124
Query: 217 GNKW 220
G+KW
Sbjct: 125 GSKW 128
>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
Length = 297
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 87/209 (41%), Gaps = 31/209 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P V +L PR + D+E + I+ L+KG +E+ V + G ++ R S
Sbjct: 32 PASVTQLSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQARTSSGT 91
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
FL + + I+ R+ T L E LQ+ Y G YD H D
Sbjct: 92 FLAKR---EDEIVSAIEKRVAAWTFL----PEENAESLQVLRYETGQKYDAHFDYFHDRN 144
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
G R+A+ + YLTDV+ GG T+FP+ L V P+KG A+
Sbjct: 145 NLKLGGQRVATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDAL 204
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWG 221
++N H N D H CPV G KW
Sbjct: 205 LFFNLHVNATADTGSLHGSCPVIEGEKWS 233
>gi|427795421|gb|JAA63162.1| Putative prolyl-4-hydroxylase-alpha efb, partial [Rhipicephalus
pulchellus]
Length = 568
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/155 (33%), Positives = 79/155 (50%), Gaps = 9/155 (5%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
IY C+G P L C Y + N +L + P K E ++ PR+V HD + + E+
Sbjct: 356 IYERLCRGEKFPPLFHDRELTCQYRTNNRPYLLLQPAKEEVMFPKPRIVIYHDVLSEHEM 415
Query: 62 NRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
N I L++ ++ R V NY G+ R+SK +L E +H + ++ RI+D+T
Sbjct: 416 NVIKTLAQPRLRRATVQNYKSGELETASYRISKSAWLKNE---EHGVIARVTRRIEDITG 472
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE 154
L E LQ+ NYG+GGHY+ H D R+E
Sbjct: 473 LTADTAEE----LQVVNYGIGGHYEPHFDFARREE 503
>gi|241044301|ref|XP_002407178.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215492128|gb|EEC01769.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 554
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 65/224 (29%), Positives = 100/224 (44%), Gaps = 35/224 (15%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + S L+C Y + F K+ P+KVEE L P +V +H+ I D +I
Sbjct: 291 YKRLCRGEQLRTPKMDSKLRCRYYKGQHGFFKLQPIKVEEANLKPYIVVMHNVIQDRDIE 350
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM----T 118
++ +K +++R R S +L D P ++ ++ + T
Sbjct: 351 DLMAFAKPRLQRSTHYGVRGMEASQVRTSSNAWLND---LDAPVATRLNRFLRSLLGLGT 407
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-----------------ATPRDEGLWRLAS 161
+ G E+Y Q+ NYG+GG Y H D T D R+A+
Sbjct: 408 TYLGGEAEQY----QLANYGIGGQYMSHHDYLQDTYHIPNRVTDDFEKTSGD----RIAT 459
Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 205
M Y++DVE GGAT+FPSL + + P+K V N +LL Y
Sbjct: 460 LMVYMSDVEEGGATVFPSLGVRLTPKK---VISPNQSRTSLLSY 500
>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
gi|255641119|gb|ACU20838.1| unknown [Glycine max]
Length = 297
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 91/213 (42%), Gaps = 34/213 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
I P KV+++ PR + D E + +I L+K +++R V + G++ D R S
Sbjct: 31 INPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSS 90
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
F+ P + I+ +I T L E +Q++ Y G YD H D
Sbjct: 91 GMFISKN---KDPIVAGIEDKISSWTFLPKENGE----DIQVSRYEHGQKYDPHYDYFTD 143
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
G R+A+ + YLTDV GG T+FPS + V P
Sbjct: 144 KVNIARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAVKPR 203
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+G A+ +++ H N D H+GCPV G KW
Sbjct: 204 RGDALLFFSLHTNATPDTSSLHAGCPVIEGEKW 236
>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
Length = 272
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 93/207 (44%), Gaps = 26/207 (12%)
Query: 33 LKIGPLKVEELYL---DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGD--TIYVD 87
L P +V E+ P+++ + + + D E + II + R V D ++ +
Sbjct: 66 LVAAPDRVAEVLFVLKQPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHE 125
Query: 88 TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHC 147
R S++ F+ G+ +I+ R+ + + E P Q+ Y Y H
Sbjct: 126 GRTSEMAFIQ---RGEAEVAERIERRLAALAHWPAECSE----PFQLQKYDATQEYRPHY 178
Query: 148 DATPRD---------EGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH 198
D D G RLA+F+ YL+DVE GG T+FP L L V+P+KGSA+++ N
Sbjct: 179 DWLDPDSSGHRSHLARGGQRLATFILYLSDVEQGGGTVFPGLGLEVYPKKGSALWFLNTD 238
Query: 199 ANTLLDYRMYHSGCPVALG-----NKW 220
N D R H G PV G NKW
Sbjct: 239 INHQPDKRTLHGGAPVVRGTKIIANKW 265
>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 263
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 96/209 (45%), Gaps = 30/209 (14%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
I P +V++L PR + + D+E + +I L+K K+E+ V + G ++ + R S
Sbjct: 1 IDPTRVKQLSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSEIRTSS 60
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCD 148
FL + G + +I+ RI T L E +Q+ Y G H+D D
Sbjct: 61 GMFL---MKGQDDIISRIEDRIAAWTFLPKENGE----AIQVLRYQDGEKYEPHFDYFHD 113
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPS-----------------LNLTVFPEKGSA 191
+ G R+A+ + YL+DV GG T+FPS + V P KG A
Sbjct: 114 KNNQALGGHRIATVLMYLSDVVKGGETVFPSSEDRGGPKDDSWSACGKTGVAVKPRKGDA 173
Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ +++ H + + D H+GCPV G KW
Sbjct: 174 LLFFSLHPSAVPDESSLHTGCPVIEGEKW 202
>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 292
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 88/185 (47%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P+V+ D + E +IE S+ +++R VN G + R S+ + G+
Sbjct: 103 PQVIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQ---RGED 159
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
PF+ ++ RI + N + E +G LQI +YG G Y H D P D+ G
Sbjct: 160 PFIERMDRRISSLMNWPV---ENGEG-LQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQG 215
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL DV GG TIFP ++V +G AV++ + LD H G PV
Sbjct: 216 GQRVATLVIYLNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNDRRQLDPLTLHGGAPVL 275
Query: 216 LGNKW 220
G+KW
Sbjct: 276 AGDKW 280
>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
Length = 290
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 109/243 (44%), Gaps = 33/243 (13%)
Query: 4 PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
P + +GNL P D+ S + + ++ ++ G VE + +PR H+ + E
Sbjct: 44 PSSSRGNLPKPNDLASIARNTIHTSDDDDVR-GEQWVEVVSWEPRAFVYHNFLTKEECEY 102
Query: 64 IIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
+I+++K + + VV+ D+R+ S FL G + I+ RI + +
Sbjct: 103 LIDIAKPNMHKSSVVDSETGKSKDSRVRTSSGTFL---ARGRDKIVRDIEKRIAHYSFIP 159
Query: 122 IGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIF 177
+ E LQ+ +Y +G HYD D G R+A+ + YLTDVE GG T+F
Sbjct: 160 VEHGEG----LQVLHYEVGQKYEPHYDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVF 215
Query: 178 PSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGN 218
P+ L++ P++G A+ +++ + LD H GCPV GN
Sbjct: 216 PAAKGNFSSVPWWNELSECGKKGLSIKPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGN 275
Query: 219 KWG 221
KW
Sbjct: 276 KWS 278
>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 95/212 (44%), Gaps = 32/212 (15%)
Query: 34 KIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLS 91
K G VE + +PR H+ + E +I L+K +E+ VV+ G+++ R S
Sbjct: 68 KRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTS 127
Query: 92 KVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHC 147
FL G + I+ RI D T + I E LQI +Y +G HYD
Sbjct: 128 SGMFLNR---GQDKIIRNIEKRIADFTFIPIEHGE----GLQILHYEVGQKYDAHYDYFV 180
Query: 148 DATPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEK 188
D +G R+A+ + YL+DVE GG T+FP+ L+V P+
Sbjct: 181 DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKM 240
Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G A+ +++ + LD H CPV GNKW
Sbjct: 241 GDALLFWSMKPDATLDPTSLHGACPVIRGNKW 272
>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 286
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/186 (30%), Positives = 92/186 (49%), Gaps = 20/186 (10%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK-VYFLYPEIFGD 103
P++V D + +E +IE S+ +++R VN G + R S+ V++ G+
Sbjct: 97 PQLVVFADVLSAAECAELIERSRHRLKRSTTVNPLTGREDVIRNRTSEGVWYRR----GE 152
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDE 154
+ +++ RI +TN + E +G LQ+ +YG G Y H D A +
Sbjct: 153 DQLIARVERRIASLTNWPL---ENGEG-LQVLHYGTSGEYSPHFDFFAPDQPGSAVHTTQ 208
Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
G R+A+ + YL DV GG T+FP+ L+V + G AV++ +A LD H G PV
Sbjct: 209 GGQRVATLIIYLNDVADGGETVFPTAGLSVAAQAGGAVYFRYMNAERQLDPSTLHGGAPV 268
Query: 215 ALGNKW 220
G+KW
Sbjct: 269 LAGDKW 274
>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
Length = 304
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 90/203 (44%), Gaps = 28/203 (13%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
+E + PRV H+ I D E +IEL+ +++R VV G D+ +Y
Sbjct: 1 IEHVAWKPRVFIYHNFITDMEAKHMIELAAPQMKRSTVVGAGGQSVEDS-YRTLYTAGVR 59
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRL 159
+ D + +I+ R+ T + + +E +QI YG+G Y +H D DE R+
Sbjct: 60 RYQDD-VVERIENRVAAWTQISVLHQED----MQILRYGIGQQYKVHADTLRDDEAGVRV 114
Query: 160 ASFMFYLTDVELGGATIFP--------------------SLNLTVF-PEKGSA-VFWYNA 197
A+ + YL + E GG T FP + N F P++G A +FW
Sbjct: 115 ATVLIYLNEPEAGGETAFPDSQWVNPKLAETIGANFSACAKNHVAFAPKRGDALLFWSIG 174
Query: 198 HANTLLDYRMYHSGCPVALGNKW 220
T DY H+GCPV G KW
Sbjct: 175 PDGTTEDYHASHTGCPVLSGVKW 197
>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
Length = 293
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 59/185 (31%), Positives = 93/185 (50%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIY--VDTRLSKVYFLYPEIFGDH 104
P + + + D E + +I S K++R V+ + Y + R S+ F +P D
Sbjct: 97 PTIAVLDQVLDDEECDELIRRSADKLQRSTTVDPVNGGYEVIAARSSEGTF-FPVNADD- 154
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGL------- 156
F+ ++ RI ++ N + E +G LQ+ +YG GG Y H D +P D G
Sbjct: 155 -FIARLDRRIAELMNCPV---ENGEG-LQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVG 209
Query: 157 -WRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+++ + YL DV GGAT+FP+L L V P KG AV++ ++ + +D H G PV
Sbjct: 210 GQRVSTLLIYLNDVAQGGATVFPTLGLRVLPRKGMAVYFEYSNRDGQVDPLTLHGGEPVE 269
Query: 216 LGNKW 220
G KW
Sbjct: 270 KGEKW 274
>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
Length = 328
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 91/185 (49%), Gaps = 11/185 (5%)
Query: 39 KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYP 98
+VE + PR H+ + + E + I+ L+K ++R VV G V+ ++ Y +
Sbjct: 31 RVEPVSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAGGA-SVEDQIRTSYGTFL 89
Query: 99 EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWR 158
+ D P + ++ R+ T L + +E +QI YG+G Y H D+ D R
Sbjct: 90 KRLQD-PIVTAVEQRLATWTKLNVSHQED----MQILRYGIGQKYGAHYDSLDNDSP--R 142
Query: 159 LASFMFYLTDVEL--GGATIFPSLN-LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
+ + + YL+DV GG T FP + ++P+KG A+ +Y+ + D H+GCP+
Sbjct: 143 VCTVLLYLSDVPADGGGETAFPGVRRQALYPKKGDALLFYSLKPDGTSDAYSLHTGCPII 202
Query: 216 LGNKW 220
G KW
Sbjct: 203 SGVKW 207
>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
Length = 220
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/179 (32%), Positives = 88/179 (49%), Gaps = 16/179 (8%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDH 104
+P +V + + + D E +IELSK ++R K+ G + VD R S FL ++
Sbjct: 42 EPLIVVLENVLSDEECESLIELSKDSMKRSKI---GASREVDNIRTSSGTFLE-----EN 93
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLAS 161
+ I+ R+ + N+ + E L I Y G Y H D R R+++
Sbjct: 94 ETVAIIEKRVSSIMNIPVEHGE----GLHILKYTPGQEYKAHYDYFAEHSRAAENNRIST 149
Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ YL DVE GG T FP LNL++ P+KGSAV++ + + L+ H G PV G KW
Sbjct: 150 LVMYLNDVEEGGETFFPKLNLSIAPKKGSAVYFEYFYNDKSLNELTLHGGAPVIKGEKW 208
>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
Length = 215
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 92/181 (50%), Gaps = 16/181 (8%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFG 102
Y +P VV + + + DSE + +IE S+ +++R K+ G+ V++ R S F
Sbjct: 33 YEEPLVVVLGNVLSDSECDELIEHSRERLQRSKI---GEDRSVNSIRTSSGVFC-----E 84
Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRL 159
+ +I+ RI + N+ I + LQ+ Y G Y H D T R R+
Sbjct: 85 QTETITRIEKRISQIMNIPI----EHGDGLQVLRYTPGQEYKPHYDFFAETSRASTNNRI 140
Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
++ + YL DVE GG T+FP L+L+VFP KG AV++ + N ++ H+G V G K
Sbjct: 141 STLVMYLNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYRNQEVNEFTLHAGAQVIHGEK 200
Query: 220 W 220
W
Sbjct: 201 W 201
>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
Length = 239
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 61/228 (26%), Positives = 98/228 (42%), Gaps = 38/228 (16%)
Query: 17 IKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
I+S F ++++ P + +L PR + D E + +I L+KGK+ +
Sbjct: 2 IRSKTGAFTKAFD-------PTRAAQLSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSM 54
Query: 77 VVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
V N G+++ R S F++ + + I+ RI T L E P+QI
Sbjct: 55 VANDETGESMESQERTSSGMFIFKT---EDEIVNGIEARIAAWTFL----PEENGEPIQI 107
Query: 135 NNYGLGGHYDLH----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL-------- 182
Y G Y+ H D ++EG R A+ + YL+DV+ GG T+FP+
Sbjct: 108 LRYEHGQKYEAHIDYFVDKANQEEGGHRAATVLMYLSDVKKGGETVFPTSEAEGSQAKDD 167
Query: 183 ----------TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
V P KG A+ +++ H + D H+ CPV G KW
Sbjct: 168 SWSDCAKKGYAVKPNKGDALLFFSLHPDATPDPGSLHASCPVIEGEKW 215
>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
Length = 297
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 61/208 (29%), Positives = 88/208 (42%), Gaps = 31/208 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P V +L PR + D+E + ++ L+KG +E+ V + G ++ R S
Sbjct: 32 PASVTQLSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQARTSSGT 91
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRD 153
FL + + I+ R+ T L E LQ+ Y G YD H D R+
Sbjct: 92 FLAKR---EDEIVSAIEKRVAAWTFL----PEENAESLQVLRYETGQKYDAHFDYFHDRN 144
Query: 154 E---GLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
G R+A+ + YLTDV GG T+FP+ L V P+KG A+
Sbjct: 145 NLKLGGQRVATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDAL 204
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
++N H N D H CPV G KW
Sbjct: 205 LFFNLHVNATADTGSLHGSCPVIEGEKW 232
>gi|241029040|ref|XP_002406378.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215491954|gb|EEC01595.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 539
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 115/241 (47%), Gaps = 28/241 (11%)
Query: 1 EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
+ Y C+G L ++S L+C Y + F + P+K+EE+ L P ++ +HD + D +
Sbjct: 282 QSYKRLCRGKLLRSPKMESQLRCRYYKGQDGFFALQPIKLEEMNLKPYIIVMHDVLQDKD 341
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
I ++ ++ +V K + Y ++ T S +L + + P ++ + ++ + +
Sbjct: 342 IKELMAFAEPRVR--KTLPYLFICHIHTFYSA--WLNED---EAPIAVRMNSYLRALLGM 394
Query: 121 VIGREERYKGPLQINNYGLGG----HYDLHCDA-----TPRDEGLW-----RLASFMFYL 166
+ Q+ NYG GG H+D D+ + D L R+A+ M YL
Sbjct: 395 GTSDTDEEAEAYQLANYGTGGQFLPHHDFLQDSFHSYNSSADYYLQYGTGDRVATLMIYL 454
Query: 167 TDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLDYRMYHSGCPV-----ALGNK 219
TDVE GGAT+FP+L L + P+K + F N+ +L + ++ V A+ NK
Sbjct: 455 TDVEEGGATVFPTLGLRLTPKKVNLFFISLRNSDGARILHWVVFTVCIKVTFFCLAVANK 514
Query: 220 W 220
W
Sbjct: 515 W 515
>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
Length = 297
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 87/208 (41%), Gaps = 31/208 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + D+E + +I L+KG +E+ V + G ++ R S
Sbjct: 32 PARVTQLSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVADNDSGKSLMSQVRTSSGA 91
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
FL + + I+ R+ T L E +Q+ Y +G YD H D
Sbjct: 92 FLAKH---EDEIVSAIEKRVAAWTFL----PEENAESMQVLRYEIGQKYDAHFDYFHDKN 144
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
G R A+ + YLTDV+ GG T+FP+ L V P+KG A+
Sbjct: 145 NVKHGGQRFATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDAL 204
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
++ H N D H CPV G KW
Sbjct: 205 LFFGLHLNATTDTSSLHGSCPVIEGEKW 232
>gi|221512814|ref|NP_649045.2| CG18231 [Drosophila melanogaster]
gi|220902637|gb|AAF49253.3| CG18231 [Drosophila melanogaster]
Length = 470
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 63/221 (28%), Positives = 93/221 (42%), Gaps = 57/221 (25%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G + N C Y FLK+ PLK E L DP +V HD + D EI+
Sbjct: 278 YEIGCRGQFLR----RRNHVCTYNFTITEFLKLAPLKQEVLNWDPYIVIYHDVLNDDEID 333
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
++ +N D + V+ P +I RI ++T L
Sbjct: 334 KL----------KNHLNDTDAVEVN-----------------PIEKRIFQRINELTRLSF 366
Query: 123 GREERY----KGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFP 178
++ GP T + + + + +F+L +VELGGA +FP
Sbjct: 367 EHSDQQIVSKNGP-----------------RTHKHKKEYLKGTLLFFLNNVELGGAMVFP 409
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
L ++VFP+KGS +FW+N LD R CPV GNK
Sbjct: 410 KLKISVFPQKGSCLFWHNT-----LDPRSEPLECPVLQGNK 445
>gi|195505253|ref|XP_002099424.1| GE23369 [Drosophila yakuba]
gi|194185525|gb|EDW99136.1| GE23369 [Drosophila yakuba]
Length = 164
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 78/181 (43%), Gaps = 39/181 (21%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
+E++ L+P VV HD I E ++IEL+ ++ V + + R K ++ E
Sbjct: 1 MEQVGLNPYVVLYHDVISPQESAQLIELAASDLKASGVFQAKGSTFKRLRTVKARWIKKE 60
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRL 159
+ +I RI+DMT + E+
Sbjct: 61 F---NELTKRITRRIRDMTGFDLKEGEK-------------------------------- 85
Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
F L+DVE GGAT+FP T++P G+A+ WYN H + D H+ CPV +G+K
Sbjct: 86 ----FQLSDVEQGGATVFPMSGYTIYPRAGTALLWYNLHTDGHCDPSTLHAACPVMVGSK 141
Query: 220 W 220
W
Sbjct: 142 W 142
>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
Length = 219
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 98/217 (45%), Gaps = 16/217 (7%)
Query: 9 GNLSVPEDIKSN--LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
G +S + N L F + N + +++ +P +V + + + D E +IE
Sbjct: 2 GQMSTKNETVKNTELTIFNHTGNTIVTEDREIQIISRLEEPLIVVLANVLSDEECETLIE 61
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
+SK K++R K+ T D R S FL + +I+ RI + N+ E
Sbjct: 62 MSKNKMKRSKIGVSRKT--NDIRTSSGAFLE-----ESEITTRIERRIASIMNVPAPHGE 114
Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFPSLNLT 183
LQI Y +G Y H D + R+++ + YL VE GG T FP LNL+
Sbjct: 115 ----GLQILKYTVGQEYQAHYDFFVENSAAASNNRMSTLVMYLNHVEEGGETFFPKLNLS 170
Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
V P+KG AV++ + + ++ H G PV G KW
Sbjct: 171 VSPKKGMAVYFEYFYQDESINKLTLHGGAPVIKGEKW 207
>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 280
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 91/207 (43%), Gaps = 32/207 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
E L +PR H+ + E +I L+K + + VV+ G + R S FL
Sbjct: 69 TEILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKSTESRVRTSSGMFLK 128
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
G + I+ RI D T + + E LQ+ +YG+G HYD D
Sbjct: 129 R---GKDKIIQNIERRIADFTFIPVENGE----GLQVLHYGVGEKYEPHYDYFLDEFNTK 181
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
G R+A+ + YL+DVE GG T+FP+ L++ P+ G A+ +
Sbjct: 182 NGGQRVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLF 241
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKWG 221
++ + LD H GCPV +GNKW
Sbjct: 242 WSMRPDATLDASSLHGGCPVIVGNKWS 268
>gi|66771505|gb|AAY55064.1| IP12044p [Drosophila melanogaster]
Length = 484
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 63/221 (28%), Positives = 93/221 (42%), Gaps = 57/221 (25%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y + C+G + N C Y FLK+ PLK E L DP +V HD + D EI+
Sbjct: 292 YEIGCRGQFLR----RRNHVCTYNFTITEFLKLAPLKQEVLNWDPYIVIYHDVLNDDEID 347
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
++ +N D + V+ P +I RI ++T L
Sbjct: 348 KL----------KNHLNDTDAVEVN-----------------PIEKRIFQRINELTRLSF 380
Query: 123 GREERY----KGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFP 178
++ GP T + + + + +F+L +VELGGA +FP
Sbjct: 381 EHSDQQIVSKNGP-----------------RTHKHKKEYLKGTLLFFLNNVELGGAMVFP 423
Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
L ++VFP+KGS +FW+N LD R CPV GNK
Sbjct: 424 KLKISVFPQKGSCLFWHNT-----LDPRSEPLECPVLQGNK 459
>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
Length = 219
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 98/217 (45%), Gaps = 16/217 (7%)
Query: 9 GNLSVPEDIKSN--LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
G +S + N L F + N + +++ +P +V + + + D E +IE
Sbjct: 2 GQMSTKNETVENTELTIFNHTGNTIVTEDREIQIISRLEEPLIVVLANVLSDEECETLIE 61
Query: 67 LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
+SK K++R K+ T D R S FL + +I+ RI + N+ E
Sbjct: 62 MSKNKMKRSKIGISRKT--NDIRTSSGAFLE-----ESEITTRIERRIASIMNVPAPHGE 114
Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFPSLNLT 183
LQI Y +G Y H D + R+++ + YL VE GG T FP LNL+
Sbjct: 115 ----GLQILKYTVGQEYQAHYDFFVENSAAASNNRMSTLVMYLNHVEEGGETFFPKLNLS 170
Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
V P+KG AV++ + + ++ H G PV G KW
Sbjct: 171 VSPKKGMAVYFEYFYQDESINKLTLHGGAPVIKGEKW 207
>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
Length = 278
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 88/188 (46%), Gaps = 13/188 (6%)
Query: 39 KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFL 96
KV+++ PR + D E + +I L+K ++R V + G++ D R S F+
Sbjct: 37 KVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFI 96
Query: 97 YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPR 152
G P + I+ ++ T L E LQ+ Y G YD H D
Sbjct: 97 SK---GKDPIVSGIEDKLSTWTFLPKENGE----DLQVLRYEHGQKYDAHFDYFHDKVNI 149
Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 212
G R+A+ + YL++V GG T+FP + + P+KG+A+ ++N + + D H GC
Sbjct: 150 ARGGHRIATVLLYLSNVTKGGETVFPDAQVCLKPKKGNALLFFNLQQDAIPDPFSLHGGC 209
Query: 213 PVALGNKW 220
PV G KW
Sbjct: 210 PVIEGEKW 217
>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 249
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 94/212 (44%), Gaps = 32/212 (15%)
Query: 34 KIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLS 91
K G VE + +PR H+ + E +I L+K +E+ VV+ G + R S
Sbjct: 33 KRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVEDSVRTS 92
Query: 92 KVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP 151
FL G + I+ RI D T + I E LQI +Y +G YD H D
Sbjct: 93 SGMFLNR---GQDKIVSNIEKRIADFTFIPIEHGE----GLQILHYEVGQKYDAHYDFFD 145
Query: 152 RDEGL----WRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEK 188
+ L R+A+ + YL+DVE GG T+FP+ L+V P+
Sbjct: 146 DEFNLKEIGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVKPKM 205
Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G A+ +++ +T LD H CPV GNKW
Sbjct: 206 GDALLFWSMKPDTTLDPTSLHGACPVIRGNKW 237
>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
Length = 1062
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 89/209 (42%), Gaps = 31/209 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + E + ++ L+KG++E+ V + G +I R S
Sbjct: 34 PARVTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGT 93
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
FL + + I+ R+ T L E +QI +Y LG YD H D
Sbjct: 94 FLSKH---EDDIVSGIEKRVAAWTFL----PEENAESIQILHYELGQKYDAHFDYFHDKN 146
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSL------------------NLTVFPEKGSAV 192
G R+A+ + YLTDV+ GG T+FP+ L V P+KG A+
Sbjct: 147 NLKRGGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDAL 206
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+++ H N D H CPV G KW
Sbjct: 207 LFFSLHVNATTDPASLHGSCPVIEGEKWS 235
>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 318
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 62/214 (28%), Positives = 92/214 (42%), Gaps = 31/214 (14%)
Query: 31 TFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDT 88
+ +K P +V +L PR + D E + +I L+K K+E+ V + G +I +
Sbjct: 47 SSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSEV 106
Query: 89 RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYD 144
R S FL + I+ RI T L I E +QI +Y G H+D
Sbjct: 107 RTSSGMFLNK---AQDEIVAGIEARIAAWTFLPIENGES----MQILHYENGQKYEPHFD 159
Query: 145 LHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------TVFP 186
D + G R+A+ + YL+DVE GG TIFP+ V P
Sbjct: 160 YFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAKAKLLQPKDESWSECAHKGYAVKP 219
Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
KG A+ +++ H + D + H CPV G KW
Sbjct: 220 RKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKW 253
>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 214
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 92/207 (44%), Gaps = 32/207 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
VE L +PR H + + E N +IE+++ + + VV+ D+RL S FL
Sbjct: 3 VEVLSWEPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSRLRTSSGTFL- 61
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
+ G P + +I+ RI D T + + E LQ+ Y HYD DA
Sbjct: 62 --MRGQDPVIKRIEKRIADFTFIPAEQGE----GLQVLQYKESEKYEPHYDYFHDAYNTK 115
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
G R+A+ + YL++VE GG T+FP+ L+V P G A+ +
Sbjct: 116 NGGQRIATVLMYLSNVEEGGETVFPAAQVNKTEVPDWDKLSECAQKGLSVRPRMGDALLF 175
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKWG 221
++ + LD H GCPV G KW
Sbjct: 176 WSMKPDATLDSTSLHGGCPVIKGTKWS 202
>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 245
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 102/226 (45%), Gaps = 32/226 (14%)
Query: 13 VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKV 72
+PE + + + + F + VE++ L PR H+ + +E ++ L+ K+
Sbjct: 27 LPERLLPSALVMHHEADKQFDEEATPWVEQVGLHPRAYLFHNFLTKAERAHMVRLAAPKL 86
Query: 73 ERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP 131
+R VV N G+ + + R S Y ++ D P + +I+ RI T+L I +E
Sbjct: 87 KRSTVVGNDGEGVVDEIRTS--YGMFIRRLAD-PVITRIEKRISLWTHLPIEHQED---- 139
Query: 132 LQINNYGLGGHYDLHCDATPRDE---GLWRLASFMFYLTDVELGGATIFPSLN------- 181
+Q+ Y G Y H D+ + WRLA+F+ YL+DVE GG T FP +
Sbjct: 140 IQVLRYAHGQTYGAHYDSGDKSNEPGPKWRLATFLMYLSDVEEGGETAFPQNSVWYDPTI 199
Query: 182 --------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 213
+ P+ G AV +Y+ + N +D H+GCP
Sbjct: 200 PERIGPVSECAKGHVAAKPKAGDAVLFYSFYPNLTMDPAAMHTGCP 245
>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
Length = 256
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 95/209 (45%), Gaps = 32/209 (15%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVY 94
P+ E + PR H+ + E + +I L++ ++R VV+ D+R+ S
Sbjct: 43 PVWTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSGT 102
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
FL G + +I+ RI T + +E +G LQ+ +Y +G YD H D
Sbjct: 103 FLR---RGQDEIISRIEERIAKFTFIP---KEHGEG-LQVLHYEVGQKYDAHHDYFHDKV 155
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLNL-------------------TVFPEKGSA 191
G R+A+ + YL+DVE GG T+FPS + +V P KG A
Sbjct: 156 NTKNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGVSVKPRKGDA 215
Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ +++ + LD H GCPV GNKW
Sbjct: 216 LLFWSMSPDAELDPFSLHGGCPVIKGNKW 244
>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
Length = 256
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 95/209 (45%), Gaps = 32/209 (15%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVY 94
P+ E + PR H+ + E + +I L++ ++R VV+ D+R+ S
Sbjct: 43 PVWTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSGT 102
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
FL G + +I+ RI T + +E +G LQ+ +Y +G YD H D
Sbjct: 103 FLR---RGQDEIISRIEERIAKFTFIP---KEHGEG-LQVLHYEVGQKYDAHHDYFHDKV 155
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLNL-------------------TVFPEKGSA 191
G R+A+ + YL+DVE GG T+FPS + +V P KG A
Sbjct: 156 NTKNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGVSVKPRKGDA 215
Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ +++ + LD H GCPV GNKW
Sbjct: 216 LLFWSMSPDAELDPFSLHGGCPVIKGNKW 244
>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
Length = 253
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 61/208 (29%), Positives = 94/208 (45%), Gaps = 42/208 (20%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDH 104
PR +H+ + E +RI+E+++ +V R V++ G + R S+ FL G
Sbjct: 5 PRAFHLHNFMSHEECDRILEIARPRVRRSTVIDSVTGQSKVDPIRTSEQTFLN---RGTW 61
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGP-LQINNYGLGGHYDLHCD------ATPRD---E 154
+ K++ R+ +T L Y G +QI YGLG YD H D A+ + E
Sbjct: 62 DIVTKVEERLAVVTQLPA-----YHGEDMQILKYGLGQKYDAHHDVGELTSASGKQLAAE 116
Query: 155 GLWRLASFMFYLTDVELGGATIFPSL----------------------NLTVFPEKGSAV 192
G R+A+ + YL+DVE GG T FP N+ V P KG +
Sbjct: 117 GGHRVATVLLYLSDVEEGGETAFPDSEWMTPELRKWAEGQKWSDCAEGNVAVKPRKGDGL 176
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
+++ + +D H+GCPV G KW
Sbjct: 177 LFWSVNNENAIDPHSMHAGCPVIRGEKW 204
>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 294
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 62/214 (28%), Positives = 90/214 (42%), Gaps = 34/214 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
I P K +++ PR + D E N +I L+K +++R V + G++ + R S
Sbjct: 28 INPSKAKQISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVADNESGNSKTSEVRTSS 87
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCD 148
F+ P+ P + I+ +I T L E +Q+ Y G HYD D
Sbjct: 88 GMFI-PK--AKDPIVSGIEEKIATWTFLPKENGEE----IQVLRYEEGQKYEPHYDYFVD 140
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
G RLA+ + YLT+VE GG T+FP + V P
Sbjct: 141 KVNIARGGHRLATVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKKGIPVKPR 200
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
KG A+ +Y+ H N D H GCPV G KW
Sbjct: 201 KGDALLFYSLHPNATPDPLSLHGGCPVIQGEKWS 234
>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 319
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 92/212 (43%), Gaps = 31/212 (14%)
Query: 33 LKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRL 90
+K P +V +L PR + + E + +I L+K K+E+ V + G +I D R
Sbjct: 50 VKFDPTRVTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSGKSIMSDIRT 109
Query: 91 SKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLH 146
S FL + I+ RI T L + E +QI +Y G H+D
Sbjct: 110 SSGMFLNK---AQDEIVAGIEARIAAWTFLPVENGES----MQILHYENGQKYEPHFDYF 162
Query: 147 CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------TVFPEK 188
D + G R+A+ + YL+DVE GG TIFP+ V P+K
Sbjct: 163 HDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGYAVKPQK 222
Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G A+ +++ H + D + H CPV G KW
Sbjct: 223 GDALLFFSLHLDASTDTKSLHGSCPVIEGEKW 254
>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 93/212 (43%), Gaps = 32/212 (15%)
Query: 34 KIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLS 91
K G VE + +PR H+ + E +I L+K +E+ VV+ G + R S
Sbjct: 68 KRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVEDSVRTS 127
Query: 92 KVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHC 147
FL G + I+ RI D T + I E LQI +Y +G HYD
Sbjct: 128 SGMFLNR---GQDKIVSNIEKRIADFTFIPIEHGE----GLQILHYEVGQKYDAHYDYFV 180
Query: 148 DATPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEK 188
D +G R+A+ + YL+DVE GG T+FP+ L+V P+
Sbjct: 181 DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVKPKM 240
Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G A+ +++ + LD H CPV GNKW
Sbjct: 241 GDALLFWSMKPDATLDPTSLHGACPVIRGNKW 272
>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
expressed [Oryza sativa Japonica Group]
gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
Length = 299
Score = 80.1 bits (196), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 89/208 (42%), Gaps = 31/208 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + E + ++ L+KG++E+ V + G +I R S
Sbjct: 34 PARVTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGT 93
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
FL + + I+ R+ T L E +QI +Y LG YD H D
Sbjct: 94 FLSKH---EDDIVSGIEKRVAAWTFL----PEENAESIQILHYELGQKYDAHFDYFHDKN 146
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSL------------------NLTVFPEKGSAV 192
G R+A+ + YLTDV+ GG T+FP+ L V P+KG A+
Sbjct: 147 NLKRGGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDAL 206
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
+++ H N D H CPV G KW
Sbjct: 207 LFFSLHVNATTDPASLHGSCPVIEGEKW 234
>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 292
Score = 80.1 bits (196), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 87/185 (47%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P+++ D + E +IE S+ +++R VN G + R S+ + G+
Sbjct: 103 PQMIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQ---RGED 159
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
PF+ ++ RI + N + E +G LQ+ YG G Y H D P D+ G
Sbjct: 160 PFIERMDRRISSLMNWPV---ENGEG-LQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQG 215
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL DV GG TIFP ++V +G AV++ + LD H G PV
Sbjct: 216 GQRVATLVIYLNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNGRRQLDPLTLHGGAPVL 275
Query: 216 LGNKW 220
G+KW
Sbjct: 276 SGDKW 280
>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 80.1 bits (196), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 67/236 (28%), Positives = 107/236 (45%), Gaps = 34/236 (14%)
Query: 12 SVPEDIKSNLKCF--YESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSK 69
S P D+ + ++ ESY + G +E + +PR H+ + + E +I L+K
Sbjct: 50 SRPMDLTTIVQTIEERESYGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAK 109
Query: 70 GKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREER 127
+ + KVV+ G +I R S FL G + +I+ RI D T + I E
Sbjct: 110 PSMVKSKVVDVKTGKSIDSRVRTSSGTFLKR---GHDEIVEEIENRISDFTFIPIENGEG 166
Query: 128 YKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN-- 181
LQ+ +Y +G H+D D +G R+A+ + YL+DV+ GG T+FP+
Sbjct: 167 ----LQVLHYEVGQKYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGN 222
Query: 182 -----------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L+V P+K A+ +++ + LD H GCPV GNKW
Sbjct: 223 ISDVPWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 278
>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
Length = 299
Score = 79.7 bits (195), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 62/206 (30%), Positives = 92/206 (44%), Gaps = 36/206 (17%)
Query: 42 ELYLDPRVVKIHDAIYDSEINRIIELSK-GKVERGKVVN--YGDTIYVDTRLSKVYFLYP 98
++ PRV + D+E +I L+K G++ER VVN G+++ TR S FL
Sbjct: 39 DVSWSPRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFL-- 96
Query: 99 EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD------ATPR 152
I + +I+ RI T E +Q+ YG G Y+ H D A+ R
Sbjct: 97 -IRKQDEVVARIEERIAAWTMFPAENGE----SMQMLRYGQGEKYEPHFDYIRGRQASAR 151
Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
G R+A+ + YL++V++GG T+FP V P KGSAV +
Sbjct: 152 --GGHRIATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLF 209
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ + N D H CPV G KW
Sbjct: 210 FSLYPNATFDPGSLHGSCPVIQGEKW 235
>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
Length = 300
Score = 79.7 bits (195), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 91/185 (49%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P+V+ + + E + +IE S+ +++R +V+ G + R S+ + G+
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQ---RGED 167
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
F+ ++ RI + N + E +G LQI +YG G Y H D P D+ G
Sbjct: 168 AFIERLDRRIASLMNWPV---ENGEG-LQILHYGPTGEYRPHFDYFPPDQPGSAVHTARG 223
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL DV GG TIFP+ L+V ++G AV++ + LD H G PV
Sbjct: 224 GQRVATLVVYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVR 283
Query: 216 LGNKW 220
G+KW
Sbjct: 284 AGDKW 288
>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
Length = 300
Score = 79.7 bits (195), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 91/185 (49%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P+V+ + + E + +IE S+ +++R +V+ G + R S+ + G+
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQ---RGED 167
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
F+ ++ RI + N + E +G LQI +YG G Y H D P D+ G
Sbjct: 168 AFIERLDQRIASLMNWPV---ENGEG-LQILHYGPTGEYRPHFDYFPPDQPGSAVHTARG 223
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL DV GG TIFP+ L+V ++G AV++ + LD H G PV
Sbjct: 224 GQRVATLVVYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVH 283
Query: 216 LGNKW 220
G+KW
Sbjct: 284 AGDKW 288
>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
Length = 219
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 97/201 (48%), Gaps = 12/201 (5%)
Query: 23 CFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGD 82
F S N L+ + + + +P V+ + + + + E + +I LSK K++R K+ G
Sbjct: 15 IFNHSGNKIKLEDREINIVARFEEPLVLVLGNVLSNEECDELIRLSKDKMQRSKI---GA 71
Query: 83 TIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGH 142
V++ + + E ++ +++I+ R+ ++G Y LQI Y
Sbjct: 72 AREVNSIRTSSGMFFDE--SENELVHQIERRLSK----IMGPSIEYAEGLQILKYLPDQE 125
Query: 143 YDLHCD---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 199
Y H D + + R+++ + YL DVE GG T FP L L+V P KG AV++ ++
Sbjct: 126 YKAHHDYFTSASKASKNNRISTLVMYLNDVEEGGETYFPKLGLSVSPTKGMAVYFEYFYS 185
Query: 200 NTLLDYRMYHSGCPVALGNKW 220
+ L+ R H G PV G KW
Sbjct: 186 DAELNDRTLHGGAPVIKGEKW 206
>gi|341878860|gb|EGT34795.1| hypothetical protein CAEBREN_10065 [Caenorhabditis brenneri]
Length = 163
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 59/112 (52%), Gaps = 12/112 (10%)
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
MTNL + E LQI NYG+GGHYD H D ++E R+A+ +FY++
Sbjct: 1 MTNLEMETAEE----LQIANYGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQ 56
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+F + TV P K A+FWYN + + H+ CPV +G KW
Sbjct: 57 PSHGGGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 108
>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
Length = 307
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 90/185 (48%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P+V+ + + E + +IE S+ +++R +V+ G + R S+ + G+
Sbjct: 118 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEDVIRNRTSEGIWYQ---RGED 174
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
F+ ++ RI + N + E +G LQI +YG G Y H D P D+ G
Sbjct: 175 AFIERLDQRIASLMNWPV---ENGEG-LQILHYGPTGEYRPHFDYFPPDQPGSMVHTARG 230
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL DV GG TIFP L+V ++G AV++ + LD H G PV
Sbjct: 231 GQRVATLVIYLNDVPDGGETIFPEAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVR 290
Query: 216 LGNKW 220
G+KW
Sbjct: 291 AGDKW 295
>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 289
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 65/241 (26%), Positives = 106/241 (43%), Gaps = 33/241 (13%)
Query: 6 ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
+ NL P D+ S + + ++ K G VE + +PR H+ + E +I
Sbjct: 45 SSNQNLPKPNDLTSIVHNTVDRNDDEEGK-GEQWVEVVSWEPRAFVYHNFLTKEECEYLI 103
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
+++K + + VV+ D+R+ S FL G + I+ +I D T + +
Sbjct: 104 DIAKPSMHKSTVVDSETGKSKDSRVRTSSGTFL---ARGRDKIVRNIEKKIADFTFIPVE 160
Query: 124 REERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
E LQ+ +Y +G HYD D G R+A+ + YLTDVE GG T+FP+
Sbjct: 161 HGEG----LQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLTDVEEGGETVFPA 216
Query: 180 LN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L++ P++G A+ +++ + LD H GCPV GNKW
Sbjct: 217 AKGNFSNVPWYNELSDCGKKGLSIKPKRGDALLFWSMKPDATLDASSLHGGCPVIKGNKW 276
Query: 221 G 221
Sbjct: 277 S 277
>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 291
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 94/210 (44%), Gaps = 32/210 (15%)
Query: 36 GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKV 93
G VE + +PR V H+ + + E +I L+K + + VV+ D+R+ S
Sbjct: 76 GERWVEVISWEPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGSKDSRVRTSSG 135
Query: 94 YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDA 149
FL G + I+ RI D T + + E LQ+ +Y +G HYD D
Sbjct: 136 TFLRR---GHDEVVEVIEKRISDFTFIPVENGE----GLQVLHYQVGQKYEPHYDYFLDE 188
Query: 150 TPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGS 190
G R+A+ + YL+DV+ GG T+FP+ L+V P+K
Sbjct: 189 FNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRD 248
Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
A+ ++N + LD H GCPV GNKW
Sbjct: 249 ALLFWNMRPDASLDPSSLHGGCPVVKGNKW 278
>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
Arabidopsis thaliana
gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
thaliana]
gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 291
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 94/210 (44%), Gaps = 32/210 (15%)
Query: 36 GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKV 93
G VE + +PR V H+ + + E +I L+K + + VV+ D+R+ S
Sbjct: 76 GERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSG 135
Query: 94 YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDA 149
FL G + I+ RI D T + + E LQ+ +Y +G HYD D
Sbjct: 136 TFLR---RGHDEVVEVIEKRISDFTFIPVENGE----GLQVLHYQVGQKYEPHYDYFLDE 188
Query: 150 TPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGS 190
G R+A+ + YL+DV+ GG T+FP+ L+V P+K
Sbjct: 189 FNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRD 248
Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
A+ ++N + LD H GCPV GNKW
Sbjct: 249 ALLFWNMRPDASLDPSSLHGGCPVVKGNKW 278
>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
Length = 219
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 98/201 (48%), Gaps = 12/201 (5%)
Query: 23 CFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGD 82
F S N L+ + + + +P V+ + + + + E + +I+LSK K++R K+ G
Sbjct: 15 IFNHSGNKIKLEDREIDIVARFEEPLVLVLGNVLSNEECDELIQLSKDKMQRSKI---GA 71
Query: 83 TIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGH 142
V++ + + E ++ +++I+ R+ ++G Y LQI Y
Sbjct: 72 EREVNSIRTSSGMFFEE--SENELVHQIERRLSK----IMGPSIEYAEGLQILKYLPDQE 125
Query: 143 YDLHCD---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 199
Y H D + + R+++ + YL DVE GG T FP L L++ P KG AV++ ++
Sbjct: 126 YKAHHDYFTSASKASKNNRISTLVMYLNDVEEGGETYFPKLGLSISPTKGMAVYFEYFYS 185
Query: 200 NTLLDYRMYHSGCPVALGNKW 220
+ L+ R H G PV G KW
Sbjct: 186 DAELNDRTLHGGAPVIKGEKW 206
>gi|444512226|gb|ELV10078.1| Prolyl 4-hydroxylase subunit alpha-1 [Tupaia chinensis]
Length = 474
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 58/179 (32%), Positives = 92/179 (51%), Gaps = 20/179 (11%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 262 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 321
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N GD V R+SK +L ++P + +I RIQD+T
Sbjct: 322 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSG---YENPVVSRINMRIQDLT 378
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFY-LTD 168
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY LTD
Sbjct: 379 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYDLTD 433
>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
Length = 433
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 99/208 (47%), Gaps = 29/208 (13%)
Query: 37 PLKVEELYLD-PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKV 93
P ++ + LD PR + + E + ++E ++ + + VV+ G + + + R S
Sbjct: 155 PRNIQVVSLDNPRAFMHIGFLSERECDLLVEYARPNMYKSGVVDASNGGSSFSNIRTSTG 214
Query: 94 YFLYPEIF--GDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP 151
F+ P +F G + + +I+ RI T + E P+Q+ Y +G Y H D
Sbjct: 215 SFV-PTVFPLGMNDVVRRIERRIAAWTQIPAAHGE----PIQVLRYQIGQEYQSHFDYFF 269
Query: 152 RDEGLW--RLASFMFYLTDVELGGATIFPSLN-----------------LTVFPEKGSAV 192
+ G+ R+A+ + YL+DV+ GG T+FPS +TV P+KG A+
Sbjct: 270 HEGGMKNNRIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHACAKNGITVIPKKGDAI 329
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
++N LD H+GCPV LG KW
Sbjct: 330 LFWNMKVGGDLDGGSTHAGCPVVLGEKW 357
>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 95/209 (45%), Gaps = 32/209 (15%)
Query: 38 LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYF 95
L++E + PR H+ + E +I ++ +++ V + G ++ D R S F
Sbjct: 72 LRMEVISWQPRAFLYHNFLTKEECEYLINIATPHMQKSTVADNQSGQSVVHDVRKSTGAF 131
Query: 96 LYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD-- 153
L G + I+ RI D+T + I E P+ + +Y +G +YD H D D
Sbjct: 132 LD---RGQDEIVRNIEKRIADVTFIPIENGE----PIYVIHYEVGQYYDPHYDYFIDDFN 184
Query: 154 --EGLWRLASFMFYLTDVELGGATIFP-------------------SLNLTVFPEKGSAV 192
G R+A+ + YL++VE GG T+FP + L++ P+ G A+
Sbjct: 185 IENGGQRIATMLMYLSNVEEGGETMFPRAKANFSSVPWWNELSNCGKMGLSIKPKMGDAL 244
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+++ N LD HS CPV GNKW
Sbjct: 245 LFWSMKPNATLDALTLHSACPVIKGNKWS 273
>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
Length = 286
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 61/238 (25%), Positives = 104/238 (43%), Gaps = 32/238 (13%)
Query: 5 LACQGNLSVPEDIKSNLKCFYESYNNTFLKIG--------PLKVEELYLDPRVVKIHDAI 56
+AC N P+ SN Y + + G +KV P +V + + +
Sbjct: 46 IACVANDVSPQPEPSNKAKLPYQYETSLVAAGNNIDLFDRSVKVSLRVSRPDIVVVDEFM 105
Query: 57 YDSEINRIIELSKGKVERGKVVNYGD---TIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
E ++IE S+ K+ +V+ + D YF G+ P + ++ R
Sbjct: 106 SGEECEQLIEQSRRKLTPSAIVDPQTGKFQVIADRSSEGTYFQR----GESPLISRLDRR 161
Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEGLWRLASFMF 164
I ++ N E + +QI +YG+G Y H D A + R+A+ +
Sbjct: 162 ISELMNW----PEDHGEGIQILHYGVGAQYKPHFDYFLENESGGALQMTQSGQRVATLVM 217
Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL--LDYRMYHSGCPVALGNKW 220
YL +V GG T+FP + +++ P++GSA ++ A+ N+L +D H G PV G KW
Sbjct: 218 YLNEVTEGGETVFPDVGISITPKRGSAAYF--AYCNSLGQVDPATLHGGAPVLTGEKW 273
>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
Length = 277
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/186 (30%), Positives = 87/186 (46%), Gaps = 22/186 (11%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY---GDTIYVDTRLSKVYFLYPEIFGD 103
P +V + + DSE ++E+++ ++ R VN G+ D ++F G+
Sbjct: 90 PDLVVFGNLLSDSECEALMEVAQPRLARSLTVNIKTGGEERNRDRTSQGMFFAR----GE 145
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPR--DE 154
+P + +++ RI + + R E LQ+ Y G Y H D TP
Sbjct: 146 NPLVQRVEARIARLVGWPVDRGEG----LQVLRYRQGAQYKPHYDYFDPAEPGTPAILQR 201
Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
G R+A+ + YL + E GGAT+FP + L V P +G+AVF+ AN R H G PV
Sbjct: 202 GGQRVATLIMYLNEPEQGGATVFPDIGLQVTPRRGTAVFFSYPAANPASLTR--HGGEPV 259
Query: 215 ALGNKW 220
G KW
Sbjct: 260 KAGEKW 265
>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 683
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/194 (30%), Positives = 87/194 (44%), Gaps = 26/194 (13%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
PR H+ + E +I L+K + R VV+ G+ +R S FL G
Sbjct: 119 PRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESSSRTSSGMFLDR---GKD 175
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLA 160
+ I+ RI D+T++ I E L + +YG+G HYD D G R+A
Sbjct: 176 KIVQNIERRIADITSVPIENGE----GLHVIHYGVGQKCEPHYDYTSDGVVTKNGGPRVA 231
Query: 161 SFMFYLTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRM 207
+ + YL+DVE GG T+FP L+V P+ G A+ +++ + LD
Sbjct: 232 TVLMYLSDVEEGGETVFPDAQPNFTSVSKCSGDGLSVKPKMGDALLFWSMKPDGTLDTSS 291
Query: 208 YHSGCPVALGNKWG 221
H G PV GNKW
Sbjct: 292 LHGGSPVIRGNKWA 305
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 71/177 (40%), Gaps = 33/177 (18%)
Query: 60 EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
E +I L+K + R VV+ G R S FL G + I+ RI D+
Sbjct: 377 ECEHLINLAKPFMTRSLVVDGLTGKGRESSARTSSGRFLER---GKDKIVQNIEQRIADI 433
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIF 177
T++ P ++ L G R+A+ + YL+DVE GG T+F
Sbjct: 434 TSI----------PRMARDFML-----FTAGGVVTKNGGPRVATVLMYLSDVEEGGETVF 478
Query: 178 PSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
P+ L+V P+ G A+ + + + LD H G PV GNKW
Sbjct: 479 PNAKPNINSVSKYPEKGLSVKPKMGDALLFRSMKPDGTLDTSSLHGGSPVIRGNKWA 535
>gi|405970696|gb|EKC35577.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 171
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/130 (36%), Positives = 73/130 (56%), Gaps = 17/130 (13%)
Query: 107 LYKIQTRIQDMTNLVIGREERYKGP--LQINNYGLGG----HYDL----------HCDAT 150
L+ + RI+ +T L + + +IN++G+GG H+D + + +
Sbjct: 16 LFPLTKRIEIITGLSTSVSKLFSDSENYEINHFGIGGMMKPHFDFLNISLGEYQKNVERS 75
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 210
R G R+A++MFYLTDVE GGAT+FP + V KG+A+FWYN N+ D R ++
Sbjct: 76 VRMSGD-RVATWMFYLTDVEKGGATVFPEAKVRVPVTKGAALFWYNIKRNSEKDQRSLNA 134
Query: 211 GCPVALGNKW 220
CPV LG+K+
Sbjct: 135 DCPVILGSKF 144
>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 316
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 89/209 (42%), Gaps = 31/209 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PRV + D E + I+L+KGK+E+ V + G+++ + R S
Sbjct: 53 PTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 112
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + ++ ++ T L E +QI +Y G H+D D
Sbjct: 113 FLSKR---QDDIVSNVEAKLAAWTFL----PEENGESMQILHYENGQKYEPHFDYFHDQA 165
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFP------------------SLNLTVFPEKGSAV 192
+ G R+A+ + YL++VE GG T+FP V P KG A+
Sbjct: 166 NLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDAL 225
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWG 221
++N H N D H CPV G KW
Sbjct: 226 LFFNLHPNATTDSNSLHGSCPVVEGEKWS 254
>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
Length = 219
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/180 (28%), Positives = 91/180 (50%), Gaps = 12/180 (6%)
Query: 44 YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
+ +P V+ + + + + E + +I+LSK K++R K+ G V++ + + E +
Sbjct: 36 FEEPLVLVLGNVLSNEECDELIQLSKDKMQRSKI---GAAREVNSIRTSSGMFFEE--SE 90
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
+ +++I+ R+ ++G Y LQ+ Y Y H D + + R++
Sbjct: 91 NELVHQIERRLSK----IMGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSASKASKNNRIS 146
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ + YL DVE GG T FP L L+V P KG AV++ +++ L+ R H G PV G KW
Sbjct: 147 TLVMYLNDVEEGGETYFPKLGLSVSPTKGMAVYFEYFYSDAELNDRTLHGGAPVIKGEKW 206
>gi|242001766|ref|XP_002435526.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215498862|gb|EEC08356.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 559
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/200 (27%), Positives = 93/200 (46%), Gaps = 17/200 (8%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
Y C+G + + S L+C Y + F + P+K+EE+ L P ++ + D + + +I
Sbjct: 295 YRRLCRGEVLRTPQMDSKLRCRYYKGQDGFFTLHPIKLEEINLKPYIIVMRDVVQERDIE 354
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
++ ++ +++R R S +L+ + + P ++ ++ + L
Sbjct: 355 DLMAFAEPRLQRSTTYTGDGNAPSTRRTSSNAWLWDD---EAPIANRMNWYLRALVGLGT 411
Query: 123 GREERYKGPLQINNYGLGG----HYD-----LHCDATPRDEGLW-----RLASFMFYLTD 168
E Q+ NYG GG HYD LH + D L RLA+ M Y+TD
Sbjct: 412 LGSEYEAEAYQLANYGSGGYFLPHYDYLQDTLHAHNSTADYYLQNNEGDRLATLMIYMTD 471
Query: 169 VELGGATIFPSLNLTVFPEK 188
VE GGAT+FP L + + P+K
Sbjct: 472 VEEGGATVFPRLGVRLVPKK 491
>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
Length = 299
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/206 (30%), Positives = 91/206 (44%), Gaps = 36/206 (17%)
Query: 42 ELYLDPRVVKIHDAIYDSEINRIIELSK-GKVERGKVVN--YGDTIYVDTRLSKVYFLYP 98
++ PRV + D E +I L+K G++ER VVN G+++ TR S FL
Sbjct: 39 DVSWSPRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFL-- 96
Query: 99 EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD------ATPR 152
I + +I+ RI T E +Q+ YG G Y+ H D A+ R
Sbjct: 97 -IRKQDEVVARIEERIAAWTMFPAENGE----SMQMLRYGQGEKYEPHFDYIRGRQASAR 151
Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
G R+A+ + YL++V++GG T+FP V P KGSAV +
Sbjct: 152 --GGHRIATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLF 209
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ + N D H CPV G KW
Sbjct: 210 FSLYPNATFDPGSLHGSCPVIQGEKW 235
>gi|148684485|gb|EDL16432.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III [Mus musculus]
Length = 396
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/197 (30%), Positives = 93/197 (47%), Gaps = 21/197 (10%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P + E ++L P + HD + D E +I
Sbjct: 146 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIR 205
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
EL++ ++R V + + V+ R+SK +L + P L + RI +T L I +
Sbjct: 206 ELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPMLVTLDHRIAALTGLDI--Q 260
Query: 126 ERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIF-------P 178
Y LQ+ NYG+GGHY+ H D G L+ VE GGAT F P
Sbjct: 261 PPYAEYLQVVNYGIGGHYEPHFDHATVTMG--------SMLSSVEAGGATAFIYGNFSVP 312
Query: 179 SLNLTVFPEKGSAVFWY 195
+ L+ G+ F Y
Sbjct: 313 VVKLSSVEAGGATAFIY 329
>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
Length = 316
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 89/209 (42%), Gaps = 31/209 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PRV + D E + I+L+KGK+E+ V + G+++ + R S
Sbjct: 53 PTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 112
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + ++ ++ T L E +QI +Y G H+D D
Sbjct: 113 FLSKR---QDDIVNNVEAKLAAWTFL----PEENGESMQILHYENGQKYEPHFDYFHDQA 165
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFP------------------SLNLTVFPEKGSAV 192
+ G R+A+ + YL++VE GG T+FP V P KG A+
Sbjct: 166 NLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDAL 225
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWG 221
++N H N D H CPV G KW
Sbjct: 226 LFFNLHPNATTDSNSLHGSCPVVEGEKWS 254
>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 332
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/208 (27%), Positives = 89/208 (42%), Gaps = 31/208 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PRV + D E + I+L+KGK+E+ V + G+++ + R S
Sbjct: 69 PTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 128
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + ++ ++ T L E +QI +Y G H+D D
Sbjct: 129 FLSKR---QDDIVSNVEAKLAAWTFL----PEENGESMQILHYENGQKYEPHFDYFHDQA 181
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFP------------------SLNLTVFPEKGSAV 192
+ G R+A+ + YL++VE GG T+FP V P KG A+
Sbjct: 182 NLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDAL 241
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
++N H N D H CPV G KW
Sbjct: 242 LFFNLHPNATTDSNSLHGSCPVVEGEKW 269
>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 289
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 93/206 (45%), Gaps = 32/206 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
VE + +PR H+ + E +IEL+K +E+ VV+ D+R+ S FL
Sbjct: 78 VEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFL- 136
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
G + +I+ RI D T + + E LQ+ +Y +G HYD D
Sbjct: 137 --ARGRDKTIREIEKRISDFTFIPVEHGE----GLQVLHYEIGQKYEPHYDYFMDEYNTR 190
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
G R+A+ + YL+DVE GG T+FP+ L+V P+ G A+ +
Sbjct: 191 NGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLF 250
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ + LD H GC V GNKW
Sbjct: 251 WSMTPDATLDPSSLHGGCAVIKGNKW 276
>gi|194871369|ref|XP_001972835.1| GG15736 [Drosophila erecta]
gi|190654618|gb|EDV51861.1| GG15736 [Drosophila erecta]
Length = 476
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/200 (27%), Positives = 95/200 (47%), Gaps = 24/200 (12%)
Query: 21 LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
L C Y + FLK+ PLK+EEL ++ + + + +I+ + +S+ K++R + ++
Sbjct: 294 LVCRYVDWT-PFLKLAPLKMEELSMETHISIFYGVLRQKDIDELKNVSRPKLQRIEHLSG 352
Query: 81 GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG 140
+ + S H + K+ I D+T G + L++ NYG+
Sbjct: 353 NCSCKIGNLSS----------SSHDVVRKVNELILDIT----GFPSKGNQMLEVINYGIA 398
Query: 141 GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 200
G+Y+ A PR + A+ + +L + E GG +FPS +L V P KGS + W N +
Sbjct: 399 GNYNPDDTARPRKQNK---ANALIFLDNAERGGEIVFPSRHLKVRPRKGSMLVWMNLERS 455
Query: 201 TLLDYRMYHSGCPVALGNKW 220
+ YH CP+ GN W
Sbjct: 456 VI-----YHQ-CPILKGNMW 469
>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 289
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/209 (29%), Positives = 89/209 (42%), Gaps = 32/209 (15%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV---NYGDTIYVDTRLSKV 93
P +V +L PR + + D E + +I L+KGK+E+ VV N G++I + R S
Sbjct: 29 PTRVTQLSWTPRAFLYNGFLSDEECDHLINLAKGKLEKSMVVADDNSGESIDSEERTSSG 88
Query: 94 YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD 153
FL + ++ ++ T L E LQI +Y G YD H D
Sbjct: 89 VFLTKR---QDDIVANVEAKLATWTFL----PEENGEALQILHYENGQKYDPHFDYYYDK 141
Query: 154 EGL----WRLASFMFYLTDVELGGATIFP------------------SLNLTVFPEKGSA 191
E L R+A+ + YL++V GG T+FP V P KG A
Sbjct: 142 ETLKLGGHRIATVLMYLSNVTKGGETVFPMWKGKTPQLKDDTWSECAKQGYAVKPRKGDA 201
Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ ++N H N D H CPV G KW
Sbjct: 202 LLFFNLHPNATTDPTSLHGSCPVIEGEKW 230
>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
Length = 288
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 92/205 (44%), Gaps = 32/205 (15%)
Query: 41 EELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYP 98
E + +PR H+ + E +I+L+K +++ VV+ D+R+ S FL
Sbjct: 78 EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTGKSKDSRVRTSSGTFL-- 135
Query: 99 EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDE 154
G + I+ R+ D T L + E LQI +Y +G HYD D
Sbjct: 136 -TRGQDKIIRGIEKRLSDFTFLPVEHGEG----LQILHYEVGQKYEPHYDYFLDDYNTKN 190
Query: 155 GLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWY 195
G R+A+ + YL+DVE GG T+FP+ L+V P+ G A+ ++
Sbjct: 191 GGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSXCGKEGLSVKPKMGDALLFW 250
Query: 196 NAHANTLLDYRMYHSGCPVALGNKW 220
+ + LD H GCPV GNKW
Sbjct: 251 SMKPDASLDPSSLHGGCPVIKGNKW 275
>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
Length = 285
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 69/228 (30%), Positives = 102/228 (44%), Gaps = 26/228 (11%)
Query: 6 ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINR 63
A +L VP + + L + + + L +G +V L L PRVV + D + D+E +
Sbjct: 57 AVTHSLPVPVRVPTVL----QDNDASLLDLGDRQVRVLVSLLLPRVVVLGDFLSDAECDA 112
Query: 64 IIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
+I L++ ++ R + V+ G I R S L G +I+ RI + +
Sbjct: 113 LIALAQPRLARSRTVDNDNGAQIVHAARTSDSMCLQ---LGQDALCQRIEARIARLLDWP 169
Query: 122 IGREERYKGPLQINNYGLGGHYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELG 172
+ E LQ+ Y G Y H D TP G RLAS + YL E G
Sbjct: 170 VDHGEG----LQVLRYATGAEYQPHYDYFDPTAAGTPVLLQAGGQRLASLVMYLNTPERG 225
Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GAT FP ++L V KG+AVF+ + + R H+G PV G KW
Sbjct: 226 GATRFPDVHLDVAAVKGNAVFFSYDRPHPM--TRSLHAGAPVLAGEKW 271
>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
Length = 295
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/185 (30%), Positives = 87/185 (47%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P+V+ D + E +IE S+ +++R VN G + R S+ + G+
Sbjct: 106 PQVIVFGDVLSPDECAEMIERSRHRLKRSTTVNPETGKEDVIRNRTSEGIWYQ---RGED 162
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
F+ ++ RI + N + E +G LQI +YG G Y H D P D+ G
Sbjct: 163 AFIERMDRRISSLMNWPV---ENGEG-LQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQG 218
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL DV GG TIFP ++V +G AV++ + LD H G PV
Sbjct: 219 GQRVATLVIYLNDVPDGGETIFPEAGISVAARQGGAVYFRYMNGQRQLDPLTLHGGAPVL 278
Query: 216 LGNKW 220
G+KW
Sbjct: 279 GGDKW 283
>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
gi|255645457|gb|ACU23224.1| unknown [Glycine max]
Length = 298
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 90/213 (42%), Gaps = 34/213 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
+ P KV+++ PR + D E + +I L+K +++R V + G++ D R S
Sbjct: 32 VNPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSS 91
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
F+ P + I+ +I T L E +Q+ Y G YD H D
Sbjct: 92 GMFISKN---KDPIISGIEDKISSWTFLPKENGED----IQVLRYEHGQKYDPHYDYFTD 144
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
G R+A+ + YLT+V GG T+FPS + V P
Sbjct: 145 KVNIARGGHRIATVLMYLTNVTKGGETVFPSAEEPPRRRGTETSSDLSECAKKGIAVKPH 204
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+G A+ +++ H N D H+GCPV G KW
Sbjct: 205 RGDALLFFSLHTNATPDTSSLHAGCPVIEGEKW 237
>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
Length = 307
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 92/204 (45%), Gaps = 28/204 (13%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
E L +PR H+ + E + +I L+K +++ VV+ D+R+ ++
Sbjct: 96 TEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGASKDSRVRTSSGMFLR 155
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EG 155
G + I+ RI D T + + E LQ+ +Y +G Y+ H D D G
Sbjct: 156 R-GQDKIIQTIEKRIADFTFIPVEHGE----GLQVLHYEVGQKYEPHFDYFHDDYNTKNG 210
Query: 156 LWRLASFMFYLTDVELGGATIFPSL-------------------NLTVFPEKGSAVFWYN 196
R+A+ + YL+DVE GG T+FPS L+V P+ G A+ +++
Sbjct: 211 GQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWS 270
Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
+ +D H GCPV GNKW
Sbjct: 271 MKPDGSMDSTSLHGGCPVIKGNKW 294
>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 92/205 (44%), Gaps = 32/205 (15%)
Query: 41 EELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYP 98
E + +PR H+ + E +I+L+K +++ VV+ D+R+ S FL
Sbjct: 78 EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTGKSKDSRVRTSSGTFL-- 135
Query: 99 EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDE 154
G + I+ R+ D T L + E LQI +Y +G HYD D
Sbjct: 136 -TRGQDKIIRGIEKRLSDFTFLPVEHGEG----LQILHYEVGQKYEPHYDYFLDDYNTKN 190
Query: 155 GLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWY 195
G R+A+ + YL+DVE GG T+FP+ L+V P+ G A+ ++
Sbjct: 191 GGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKEGLSVKPKMGDALLFW 250
Query: 196 NAHANTLLDYRMYHSGCPVALGNKW 220
+ + LD H GCPV GNKW
Sbjct: 251 SMKPDASLDPSSLHGGCPVIKGNKW 275
>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
Length = 288
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 62/242 (25%), Positives = 102/242 (42%), Gaps = 28/242 (11%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
+ L S P D+ S + ES + K E L +PR H+ + E
Sbjct: 40 FSLPVSSEDSSPNDLNSYRRIASESDGDGMGKREEQWTEILSWEPRAFLYHNFLSKEECE 99
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
+I L+K + + VV+ D+R+ ++ G + +I+ RI D + + +
Sbjct: 100 YLINLAKPHMMKSTVVDSKTGRSKDSRVRTSSGMFLRR-GRDRVIREIEKRIADFSFIPV 158
Query: 123 GREERYKGPLQINNYGLG----GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFP 178
E LQ+ +Y +G H+D D G R A+ + YL+DVE GG T+FP
Sbjct: 159 EHGE----GLQVLHYEVGQKYEAHFDYFLDEFNTKNGGQRTATLLMYLSDVEEGGETVFP 214
Query: 179 SLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
+ N L++ P+ G+A+ +++ + LD H CPV GNK
Sbjct: 215 AANMNISAVPWWNELSECAKQGLSLKPKMGNALLFWSTRPDATLDPSSLHGSCPVIRGNK 274
Query: 220 WG 221
W
Sbjct: 275 WS 276
>gi|195379214|ref|XP_002048375.1| GJ13932 [Drosophila virilis]
gi|194155533|gb|EDW70717.1| GJ13932 [Drosophila virilis]
Length = 444
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 48/202 (23%), Positives = 92/202 (45%), Gaps = 32/202 (15%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
+ + C+Y++ FL + PLKVE L +P V HD IY++EI +++ + + +
Sbjct: 264 TQMYCYYQNSKEPFLILAPLKVELLNTEPYVALYHDVIYENEIKKLLSIDLASMRHDRTA 323
Query: 79 NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
++ +++ T ++ + + R+ DMT + + E+ + + NYG
Sbjct: 324 DHKNSVKYTTVTRELNDV-------------LNHRVMDMTAMNVASEKDF----LLINYG 366
Query: 139 LGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH 198
+GGH + L++V GG TI P L + + +KG+A+ ++
Sbjct: 367 IGGHIRALSEQQ---------------LSEVPQGGDTILPELEIAIKSKKGAALVTHHLD 411
Query: 199 ANTLLDYRMYHSGCPVALGNKW 220
+D H CPV +G+ W
Sbjct: 412 KQLKIDLSSDHLSCPVLVGSMW 433
>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 290
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 66/236 (27%), Positives = 109/236 (46%), Gaps = 34/236 (14%)
Query: 12 SVPEDIKSNLKCFYE--SYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSK 69
S+P D+ + ++ E S+ + G +E + +PR H+ + + E +I L+K
Sbjct: 50 SMPMDLTTIVQTIQERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAK 109
Query: 70 GKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREER 127
+ + KVV+ G +I R S FL G + +I+ RI D T + E
Sbjct: 110 PSMMKSKVVDVKTGKSIDSRVRTSSGTFLN---RGHDEIVEEIENRISDFTFIP---PEN 163
Query: 128 YKGPLQINNYGLGGHYDLH----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN-- 181
+G LQ+ +Y +G Y+ H D +G R+A+ + YL+DV+ GG T+FP+
Sbjct: 164 GEG-LQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGN 222
Query: 182 -----------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L+V P+K A+ +++ + LD H GCPV GNKW
Sbjct: 223 VSDVPWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 278
>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
Length = 211
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/186 (29%), Positives = 91/186 (48%), Gaps = 13/186 (6%)
Query: 38 LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLY 97
+ E L+ +P +VK + + D E +I+ + ++ER K+ + R S F
Sbjct: 21 ITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEIS---SIRTSSGMFFE 77
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDE 154
++P + +I+ RI + +L I E +G LQ+ +Y G + H D
Sbjct: 78 E---NENPLISEIEKRISSLMHLPI---EHAEG-LQVLHYEPGQEFKAHFDFFGPNHPSS 130
Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
R+++ + YL DVE GG T FP+L + P+KG+AV++ + + L+ HSG PV
Sbjct: 131 SNNRISTLVVYLNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHSGEPV 190
Query: 215 ALGNKW 220
G KW
Sbjct: 191 IQGEKW 196
>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
Length = 575
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 63/226 (27%), Positives = 96/226 (42%), Gaps = 23/226 (10%)
Query: 12 SVPEDIKSNLKCFYESYNNTFLKIGPLKVEE------LYLDPRVVKIHDAIYDSEINRII 65
SV E + S+L E ++ + E L DP VV + + + E +I
Sbjct: 16 SVREAVTSSLPVVAEEAEPERVERNRMPAERYDGMETLSQDPLVVYLDEFLEPGECEALI 75
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L++G+++R V G + R +L + + P +I R+ +
Sbjct: 76 HLAQGRMKRALVSLDGSSGVSQGRTGSNCWLR---YQEEPLARRIGERVAKRVGFPL--- 129
Query: 126 ERYKGPLQINNYGLGGHYDLHCDA----TPRD-----EGLWRLASFMFYLTDVELGGATI 176
Y PLQ+ +YG Y H DA TPR +G R+ + + YL +VE GGAT
Sbjct: 130 -EYAEPLQVIHYGHEQEYRPHYDAYDLDTPRGLRCTRQGGQRMVTALLYLNEVEEGGATA 188
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDY-RMYHSGCPVALGNKWG 221
FP+ + V P KG + N A+ + R H G PV G KW
Sbjct: 189 FPNAGVEVAPRKGRIAIFNNVGADPGRPHPRSLHGGMPVKSGEKWA 234
>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 299
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 90/213 (42%), Gaps = 34/213 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
I P KV+++ PR + D E + +I L+K +++R V + GD+ D R S
Sbjct: 32 INPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSS 91
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
F+ P + I+ RI T L E +Q+ Y G YD H D
Sbjct: 92 GMFISKN---KDPIVSGIEDRISAWTFLPKENGE----DIQVLRYEHGQKYDPHYDYFAD 144
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
+G RLA+ + YLT+V GG T+FP + V P
Sbjct: 145 KVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPR 204
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+G A+ +++ N + D H+GCPV G KW
Sbjct: 205 RGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKW 237
>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
pallidum PN500]
Length = 251
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 83/191 (43%), Gaps = 20/191 (10%)
Query: 39 KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYP 98
K+ E+ PR+ +I + D E +IE SK K++ ++ G S
Sbjct: 56 KLIEVSQKPRIYRIPKFLTDEECEHLIETSKNKLKPCNEISSG------VHRSGWGLFMK 109
Query: 99 EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDATPRDE 154
E DHP I R++ NL E +Q+ Y G H+D T
Sbjct: 110 EGEEDHPVTQNIFNRMKTFVNLTESSEV-----MQVIRYNPGEETSAHFDYFNPLTTNGA 164
Query: 155 ---GLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYH 209
GL+ R+ + + YL DVE GG T FP +N+ V P KG AV +YN N +D H
Sbjct: 165 MKIGLYGQRICTILMYLADVEEGGETSFPEVNVKVKPIKGDAVLFYNCKPNGEVDPLSLH 224
Query: 210 SGCPVALGNKW 220
G PV G KW
Sbjct: 225 QGDPVIKGTKW 235
>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
Length = 299
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 90/213 (42%), Gaps = 34/213 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
I P KV+++ PR + D E + +I L+K +++R V + GD+ D R S
Sbjct: 32 INPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSS 91
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
F+ P + I+ RI T L E +Q+ Y G YD H D
Sbjct: 92 GMFISKN---KDPIVSGIEDRISAWTFLPKENGE----DIQVLRYEHGQKYDPHYDYFAD 144
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
+G RLA+ + YLT+V GG T+FP + V P
Sbjct: 145 KVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPR 204
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+G A+ +++ N + D H+GCPV G KW
Sbjct: 205 RGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKW 237
>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
Length = 318
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 91/214 (42%), Gaps = 31/214 (14%)
Query: 31 TFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDT 88
+ +K P +V +L PR + D E + +I L+K K+E+ V + G +I +
Sbjct: 47 SSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSEV 106
Query: 89 RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYD 144
R S FL + I+ RI T L I E +QI +Y G H+D
Sbjct: 107 RTSSGMFLNK---AQDEIVAGIEARIAAWTFLPIENGE----SMQILHYENGQKYEPHFD 159
Query: 145 LHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------TVFP 186
D + G R+A+ + YL+DVE GG TIF + V P
Sbjct: 160 YFHDKANQVMGGHRIATVLMYLSDVEKGGETIFSNAKAKLLQPKDESWSECAHKGYAVKP 219
Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
KG A+ +++ H + D + H CPV G KW
Sbjct: 220 RKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKW 253
>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
Length = 279
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 86/186 (46%), Gaps = 20/186 (10%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYV---DTRLSKVYFLYPEIFGD 103
P VV + + I E ++I L++GKVE VV+ +V D F E
Sbjct: 91 PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQDRTSMNAAFARAE---- 146
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDE 154
HP + +++ RI + E +G +Q+ Y GG Y H D
Sbjct: 147 HPLIARLEARIAAAIHWPA---ENGEG-MQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQT 202
Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
G R+ +F+ YL DV+ GGAT FP+LN + P+KG A+F+ N N + H+G PV
Sbjct: 203 GGQRVGTFLVYLCDVDAGGATRFPALNFEIRPKKGMALFFANTLPNGEGNPLTLHAGVPV 262
Query: 215 ALGNKW 220
G K+
Sbjct: 263 VSGVKY 268
>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 304
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 89/215 (41%), Gaps = 35/215 (16%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
+ P KV+++ PR + D E + +I L+K +++R V + G + + R S
Sbjct: 36 VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSS 95
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
F++ P + I+ +I T L E +Q+ Y G YD H D
Sbjct: 96 GAFIHK---AKDPIVSGIEDKIAAWTFLPKDNGE----DIQVLRYEYGQKYDAHFDYFAD 148
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIF----------------------PSLNLTVFP 186
G R+A+ + YL+DVE GG T+F + V P
Sbjct: 149 KVNIARGGHRMATVLMYLSDVEKGGETVFLLRRSESQRRQASETNEDLSDCAKKGIAVKP 208
Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
KG A+ +++ H N + D H GCPV G KW
Sbjct: 209 RKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWS 243
>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 291
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 93/210 (44%), Gaps = 32/210 (15%)
Query: 36 GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKV 93
G VE + +PR V H+ + + E +I L+K + + VV+ D+R+ S
Sbjct: 76 GERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSG 135
Query: 94 YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDA 149
FL G + I+ RI D T + + E LQ+ +Y +G HYD D
Sbjct: 136 TFLR---RGHDEVVEVIEKRISDFTFIPVENGE----GLQVLHYQVGQKYEPHYDYFLDE 188
Query: 150 TPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGS 190
G R+A+ + YL+DV+ GG T+FP+ L+V P+
Sbjct: 189 FNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKXRD 248
Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
A+ ++N + LD H GCPV GNKW
Sbjct: 249 ALLFWNMRPDASLDPSSLHGGCPVVKGNKW 278
>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
Length = 220
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 88/198 (44%), Gaps = 20/198 (10%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFL 96
P+K+ EL PRV +I + + + E N +I+ SK K+ ++ G S
Sbjct: 22 PIKLIELSQKPRVYRIPEFLTEEECNHLIDTSKNKLRPCNEISSG------VHRSGWGLF 75
Query: 97 YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDATPR 152
E +HP I ++++ N+ E +QI Y G HYD T
Sbjct: 76 MKEGEEEHPVTKNIFNKMKNFVNISDSCE-----VMQIIRYNPGEETSAHYDYFNPLTTN 130
Query: 153 DE---GLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 207
GL+ R+ + + YL DVE GG T FP + + V P +G AV +YN N +D
Sbjct: 131 GSMKIGLYGQRICTILMYLCDVEEGGETSFPEVGIKVKPIRGDAVLFYNCKPNGDVDPLS 190
Query: 208 YHSGCPVALGNKWGKLLL 225
H G PV G KW + L
Sbjct: 191 LHQGDPVTKGTKWVAIKL 208
>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 69/244 (28%), Positives = 101/244 (41%), Gaps = 33/244 (13%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
++ L + S P D+ + E + K G E L +PR H+ + E
Sbjct: 39 VFSLPINNDESSPIDLSYFRRAATER-SEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEEC 97
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+I L+K + + VV+ D+R+ S FL G + I+ RI D T
Sbjct: 98 EYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRR---GRDKIIKTIEKRIADYTF 154
Query: 120 LVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGAT 175
+ E LQI +Y G HYD D G R+A+ + YL+DVE GG T
Sbjct: 155 IPADHGE----GLQILHYEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGET 210
Query: 176 IFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 216
+FP+ N L+V P G A+ +++ + LD H GCPV
Sbjct: 211 VFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIR 270
Query: 217 GNKW 220
GNKW
Sbjct: 271 GNKW 274
>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 364
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 59/204 (28%), Positives = 95/204 (46%), Gaps = 28/204 (13%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
E L +PR H+ + E + +I L+K +++ VV+ D+R+ ++
Sbjct: 153 TEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGMFLR 212
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EG 155
G + I+ RI D T + + E+ +G LQ+ +Y +G Y+ H D D G
Sbjct: 213 -RGQDKIIRTIEKRIADYTFIPV---EQGEG-LQVLHYEVGQKYEPHFDYFHDDYNTKNG 267
Query: 156 LWRLASFMFYLTDVELGGATIFPSL-------------------NLTVFPEKGSAVFWYN 196
R+A+ + YL+DVE GG T+FPS L+V P+ G A+ +++
Sbjct: 268 GQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWS 327
Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
+ LD H GCPV GNKW
Sbjct: 328 MKPDGSLDPTSLHGGCPVIKGNKW 351
>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 318
Score = 77.0 bits (188), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 59/204 (28%), Positives = 92/204 (45%), Gaps = 28/204 (13%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
E + +PR H+ + E +I L+K ++E+ VV+ D+R+ ++
Sbjct: 107 TEVISWEPRAFVYHNFLSKEECEYLIGLAKPRMEKSTVVDSTTGKSKDSRVRTSSGMFLR 166
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
G + I+ RI D T + E +G LQ+ +Y +G H+D D G
Sbjct: 167 -RGRDKVIRAIERRIADYTFIPA---EHGEG-LQVLHYEVGQKYEPHFDYFLDEFNTKNG 221
Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
R+A+ + YL+DVE GG TIFP N L V P+ G A+ +++
Sbjct: 222 GQRMATILMYLSDVEEGGETIFPDANVNSSSLPWHNELSECARKGLAVKPKMGDALLFWS 281
Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
+ + LD H GCPV GNKW
Sbjct: 282 MNPDATLDPLSLHGGCPVIRGNKW 305
>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
gi|194693016|gb|ACF80592.1| unknown [Zea mays]
gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 307
Score = 77.0 bits (188), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 93/204 (45%), Gaps = 28/204 (13%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
E L +PR H+ + E + +I L+K +++ VV+ D+R+ ++
Sbjct: 96 TEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGMFLR 155
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EG 155
G + I+ RI D T + + + E LQ+ +Y +G Y+ H D D G
Sbjct: 156 -RGQDKIIRTIEKRIADYTFIPVEQGEG----LQVLHYEVGQKYEPHFDYFHDDYNTKNG 210
Query: 156 LWRLASFMFYLTDVELGGATIFPSL-------------------NLTVFPEKGSAVFWYN 196
R+A+ + YL+DVE GG T+FPS L+V P+ G A+ +++
Sbjct: 211 GQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWS 270
Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
+ LD H GCPV GNKW
Sbjct: 271 MKPDGSLDPTSLHGGCPVIKGNKW 294
>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
Length = 285
Score = 77.0 bits (188), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 61/229 (26%), Positives = 99/229 (43%), Gaps = 21/229 (9%)
Query: 6 ACQGNLSVPEDIKSNL---KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
A ++ PED C + N + V + P+V+ D + E
Sbjct: 52 AVAAVIASPEDEARAYHYDACPVAAGNTVHAHDRDVTVRIRFERPQVIAFDDVLSGEECA 111
Query: 63 RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+IE ++ +++R VN G + R S+ ++ + F+ ++ RI + N
Sbjct: 112 ELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQR---CEDAFIERLDHRISALMNW 168
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------GLWRLASFMFYLTDVEL 171
+ E +G LQI +Y GG Y H D P + G R+A+ + YL+DVE
Sbjct: 169 PL---EHGEG-LQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRVATLIVYLSDVEG 224
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+FP L V +G A+++ + LD H G PV G+KW
Sbjct: 225 GGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKW 273
>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
Length = 282
Score = 77.0 bits (188), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 61/229 (26%), Positives = 99/229 (43%), Gaps = 21/229 (9%)
Query: 6 ACQGNLSVPEDIKSNL---KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
A ++ PED C + N + V + P+V+ D + E
Sbjct: 49 AVAAVIASPEDEARAYHYDACPVAAGNTVHAHDRDVTVRIRFERPQVIAFDDVLSGEECA 108
Query: 63 RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+IE ++ +++R VN G + R S+ ++ + F+ ++ RI + N
Sbjct: 109 ELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQR---CEDAFIERLDHRISALMNW 165
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------GLWRLASFMFYLTDVEL 171
+ E +G LQI +Y GG Y H D P + G R+A+ + YL+DVE
Sbjct: 166 PL---EHGEG-LQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRVATLIVYLSDVEG 221
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GG T+FP L V +G A+++ + LD H G PV G+KW
Sbjct: 222 GGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKW 270
>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 279
Score = 77.0 bits (188), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 93/210 (44%), Gaps = 38/210 (18%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL- 96
VE + +PRV H+ + E +I ++K V++ VV+ G ++ R S F+
Sbjct: 70 VEIVSWEPRVFLYHNFLAKEECEHLINIAKPDVQKSTVVDDTTGKSVNSSARTSSGTFID 129
Query: 97 --YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
Y +I D I+ RI D T + + E + I +Y +G YD H D
Sbjct: 130 RGYDKILSD------IEKRIADFTFIPVEHGED----VNILHYEVGQKYDFHTDYFEDEV 179
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSA 191
G R+A+ + YL+DVE GG T+FPS L++ P+ G+A
Sbjct: 180 NTKHGGERIATMLMYLSDVEEGGETVFPSAKGNFSSVPWWNELSDCGKKGLSIKPKMGNA 239
Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+ ++ + +D H CPV G+KW
Sbjct: 240 ILFWGMKPDATVDPLSVHGACPVIKGDKWS 269
>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
Length = 211
Score = 77.0 bits (188), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 54/186 (29%), Positives = 90/186 (48%), Gaps = 13/186 (6%)
Query: 38 LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLY 97
+ E L+ +P +VK + + D E +I+ + ++ER K+ + R S F
Sbjct: 21 ITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEIS---SIRTSSGMFFE 77
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDE 154
++P + +I+ RI + +L I E +G LQ+ +Y G + H D
Sbjct: 78 E---NENPLISEIEKRISSLMHLPI---EHAEG-LQVLHYEPGQEFKPHFDFFGPNHPSS 130
Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
R+ + + YL DVE GG T FP+L + P+KG+AV++ + + L+ HSG PV
Sbjct: 131 SNNRICTLVVYLNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHSGEPV 190
Query: 215 ALGNKW 220
G KW
Sbjct: 191 IQGEKW 196
>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
Length = 321
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 90/212 (42%), Gaps = 41/212 (19%)
Query: 47 PRVVKIHDAIYDSEINRIIELSK-GKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGD 103
PR + D+E + +I L+K GK+E+ VV+ G+++ R S FL +
Sbjct: 49 PRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESVTSKVRTSSGMFLDKK---Q 105
Query: 104 HPFLYKIQTRIQDMT-------------NLVIGREERYKGPLQINNYGLGGHYDLHCDAT 150
+ +I+ RI T N I + +QI YG G Y+ H D
Sbjct: 106 DEVVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPHFDYI 165
Query: 151 PRDEGLWR----LASFMFYLTDVELGGATIFPSLN------------------LTVFPEK 188
+G R +A+ + YL++V++GG TIFP V P K
Sbjct: 166 SGRQGSTREGDRVATVLMYLSNVKMGGETIFPDCEARLSQPKDETWSDCAEQGFAVKPAK 225
Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GSAV +++ H N LD H CPV G KW
Sbjct: 226 GSAVLFFSLHPNATLDTDSLHGSCPVIEGEKW 257
>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 92/210 (43%), Gaps = 28/210 (13%)
Query: 34 KIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKV 93
K G E + +PR H+ + E +I L+K +++ VV+ D+R+
Sbjct: 71 KRGEQWTEIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETGRSKDSRVRTS 130
Query: 94 YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDA 149
++ G + I+ RI D T + + E LQ+ +Y +G HYD D
Sbjct: 131 SGMFLRR-GRDKIIRDIEKRIADFTFIPVEHGE----GLQVLHYEVGQKYDAHYDYFLDE 185
Query: 150 TPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGS 190
G R+A+ + YL+DVE GG T+FP+ L+V P+ G
Sbjct: 186 FNTKNGGQRIATLLMYLSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSVKPKMGD 245
Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
A+ +++ + LD H GCPV GNKW
Sbjct: 246 ALLFWSMRPDATLDPSSLHGGCPVIKGNKW 275
>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 287
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 68/244 (27%), Positives = 101/244 (41%), Gaps = 33/244 (13%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
++ L + S P D+ + E + K G E L +PR H+ + E
Sbjct: 39 VFSLPINNDESSPIDLSYFRRAATER-SEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEEC 97
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+I L+K + + VV+ D+R+ S FL G + I+ RI D T
Sbjct: 98 EYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRR---GRDKIIKTIEKRIADYTF 154
Query: 120 LVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGAT 175
+ E LQ+ +Y G HYD D G R+A+ + YL+DVE GG T
Sbjct: 155 IPADHGE----GLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGET 210
Query: 176 IFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 216
+FP+ N L+V P G A+ +++ + LD H GCPV
Sbjct: 211 VFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIR 270
Query: 217 GNKW 220
GNKW
Sbjct: 271 GNKW 274
>gi|449513594|ref|XP_002191636.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
[Taeniopygia guttata]
Length = 346
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 89/176 (50%), Gaps = 19/176 (10%)
Query: 3 YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G L + + L C +Y+ N +GP+K E+ + PR+V+ D I D E
Sbjct: 178 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 237
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + EL+K ++ R V + G R+SK +L + P + +I TRIQD+T
Sbjct: 238 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSG---YESPVVSRINTRIQDLT 294
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYL 166
L + E LQ+ NYG+GG Y+ H D +DE R+A+++FY+
Sbjct: 295 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYV 346
>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
from Gallus gallus gi|212530 [Arabidopsis thaliana]
gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 287
Score = 76.6 bits (187), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 68/244 (27%), Positives = 101/244 (41%), Gaps = 33/244 (13%)
Query: 2 IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
++ L + S P D+ + E + K G E L +PR H+ + E
Sbjct: 39 VFSLPINNDESSPIDLSYFRRAATER-SEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEEC 97
Query: 62 NRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
+I L+K + + VV+ D+R+ S FL G + I+ RI D T
Sbjct: 98 EYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRR---GRDKIIKTIEKRIADYTF 154
Query: 120 LVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGAT 175
+ E LQ+ +Y G HYD D G R+A+ + YL+DVE GG T
Sbjct: 155 IPADHGE----GLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGET 210
Query: 176 IFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 216
+FP+ N L+V P G A+ +++ + LD H GCPV
Sbjct: 211 VFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIR 270
Query: 217 GNKW 220
GNKW
Sbjct: 271 GNKW 274
>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 297
Score = 76.6 bits (187), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 91/213 (42%), Gaps = 34/213 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
I P KV+++ PR + D E + +I L+K +++R V + G + + R S
Sbjct: 31 IDPSKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSS 90
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
F+ G P + I+ +I T L E LQ+ Y G YD H D
Sbjct: 91 GMFIAK---GKDPIIAGIEEKISTWTFLPKENGED----LQVLRYEHGQKYDPHYDYFAD 143
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
G R+A+ + YL+DV GG T+FP+ ++V P
Sbjct: 144 KINIARGGHRMATVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDLSECAKKGISVKPR 203
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+G A+ +++ H + D H+GCPV G KW
Sbjct: 204 RGDALLFFSLHPTAIPDPNSLHAGCPVIEGEKW 236
>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
Length = 336
Score = 76.6 bits (187), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 94/207 (45%), Gaps = 31/207 (14%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
V+++ L PR H+ + +E ++ ++ K++R VV VD + Y ++
Sbjct: 19 VQQVGLHPRAYYFHNFLTKAERAHLVRVAAPKLKRSTVVGGKGEGVVDD-IRTSYGMFIR 77
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL--- 156
D P + +I+ RI T+L + +E +QI Y G Y H D+ + +
Sbjct: 78 RLSD-PVVTRIEKRISLWTHLPVEHQED----IQILRYAHGQTYGAHYDSGASSDHVGPK 132
Query: 157 WRLASFMFYLTDVELGGATIFP----------------------SLNLTVFPEKGSAVFW 194
WRLA+F+ YL+DVE GG T FP ++ P+ G AV +
Sbjct: 133 WRLATFLMYLSDVEEGGETAFPHNSVWADPSIPEQVGDKFSDCAKGHVAAKPKAGDAVLF 192
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKWG 221
Y+ + N +D H+GCPV G KW
Sbjct: 193 YSFYPNNTMDPASMHTGCPVIKGVKWA 219
>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
Length = 286
Score = 76.6 bits (187), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 100/224 (44%), Gaps = 26/224 (11%)
Query: 10 NLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIEL 67
L VP + + L+ S L +G +V+ L + PRVV + + D E + +I L
Sbjct: 61 GLPVPVRVPAPLQADASS----LLDLGDRQVQVLVSLMLPRVVVLGGLLSDDECDALIAL 116
Query: 68 SKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
++ ++ R + V+ G I R S L P G +I+ RI + +
Sbjct: 117 ARPQLARSRTVDNRDGSEIVHAARTSHSMALQP---GQDALCQRIEARIARLLEWPV--- 170
Query: 126 ERYKGPLQINNYGLGGHYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELGGATI 176
E +G LQ+ Y G Y H D TP G R+AS + YL E GGAT
Sbjct: 171 EHGEG-LQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATR 229
Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
FP ++L V KG+AVF+ + + R H+G PV G KW
Sbjct: 230 FPDVHLDVAAVKGNAVFFSYDRPHPM--TRTLHAGAPVLAGEKW 271
>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
Length = 298
Score = 76.6 bits (187), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 63/203 (31%), Positives = 95/203 (46%), Gaps = 26/203 (12%)
Query: 33 LKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVV--NYGDTIYVDT 88
+ +G +V+ L PRVV + + E + II+ ++ ++ R V G D
Sbjct: 95 IDVGDRRVDVLMAMAQPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDD 154
Query: 89 RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD 148
R S F E ++P + K++ RI + N + E +G LQ+ +Y G Y H D
Sbjct: 155 RTSNGMFFQRE---ENPMVAKLEARIARLVNWPL---ENGEG-LQVLHYRPGAEYKPHYD 207
Query: 149 -------ATPR--DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNA 197
TP G R+A+ + YL D E GG T FP ++L V P +G+AVF +
Sbjct: 208 YFDPTEPGTPTILRRGGQRVATIVIYLNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERP 267
Query: 198 HANTLLDYRMYHSGCPVALGNKW 220
H +T R H G PV G+KW
Sbjct: 268 HPST----RTLHGGAPVVAGDKW 286
>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 315
Score = 76.3 bits (186), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 92/204 (45%), Gaps = 28/204 (13%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
E + +PR H+ + E +IEL+K ++ + VV+ D+R+ ++ +
Sbjct: 104 TEVISWEPRAFVYHNFLSKEECEYLIELAKPRMVKSTVVDSETGKSKDSRVRTSSGMFLQ 163
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
G + I+ RI D T + E +G LQ+ +Y +G H+D D G
Sbjct: 164 -RGRDKVIRAIERRIADYTFIPA---EHGEG-LQVLHYEVGQKYEPHFDYFLDEFNTKNG 218
Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
R+A+ + YL+D+E GG TIFP N L V P+ G A+ +++
Sbjct: 219 GQRMATILMYLSDIEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWS 278
Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
+ LD H GCPV GNKW
Sbjct: 279 MKPDATLDPLSLHGGCPVIKGNKW 302
>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
Length = 296
Score = 76.3 bits (186), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 58/208 (27%), Positives = 91/208 (43%), Gaps = 29/208 (13%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
+ P KV +L PR + +E + +++++K K+++ V + G ++ + R S
Sbjct: 37 VDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSS 96
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCD 148
FL G + +I+ RI T L E +Q+ Y G HYD D
Sbjct: 97 GMFLSK---GQDEVINRIEERIAAWTFLPKENGE----AIQVLRYEFGEKYEPHYDYFHD 149
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN----------------LTVFPEKGSAV 192
+ G R+A+ + YL+DV GG T+FPS + V P KG A+
Sbjct: 150 KYNQALGGHRIATVLMYLSDVVKGGETVFPSSEDTTVKDDSWSDCAKKGIAVKPRKGDAL 209
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
+Y+ H + D H GCPV G KW
Sbjct: 210 LFYSLHPDATPDESSLHGGCPVIEGEKW 237
>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 316
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/209 (26%), Positives = 88/209 (42%), Gaps = 31/209 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + D E + I+L+KGK+E+ V + G+++ + R S
Sbjct: 53 PTRVTQLSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 112
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + ++ ++ T + E +QI +Y G H+D D
Sbjct: 113 FLSKR---QDDIVANVEAKLAAWTFI----PEENGESMQILHYENGQKYEPHFDYFHDQA 165
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFP------------------SLNLTVFPEKGSAV 192
+ G R+A+ + YL++VE GG T+FP V P KG A+
Sbjct: 166 NLELGGHRIATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTECAKQGYAVKPRKGDAL 225
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWG 221
++N H N D H CPV G KW
Sbjct: 226 LFFNLHPNATTDSNSLHGSCPVVEGEKWS 254
>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
str. 8004]
Length = 288
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 96/208 (46%), Gaps = 22/208 (10%)
Query: 26 ESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--G 81
++ ++ L +G +V+ L + PRVV + + D E + +I L++ ++ R + V+ G
Sbjct: 75 QADASSLLDLGDRQVQVLVSLMLPRVVVLGGLLADDECDALIALARPQLARSRTVDNRDG 134
Query: 82 DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
I R S L P G +I+ RI + + E +G LQ+ Y G
Sbjct: 135 SEIVHAARTSHSMALQP---GQDALCQRIEARIAQLLEWPV---EHGEG-LQVLRYATGA 187
Query: 142 HYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
Y H D TP G R+AS + YL E GGAT FP ++L V KG+AV
Sbjct: 188 QYAPHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFPDVHLDVAAVKGNAV 247
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
F+ + + R H+G PV G KW
Sbjct: 248 FFSYDRPHPM--TRTLHAGAPVLAGEKW 273
>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 308
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 96/208 (46%), Gaps = 22/208 (10%)
Query: 26 ESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--G 81
++ ++ L +G +V+ L + PRVV + + D E + +I L++ ++ R + V+ G
Sbjct: 95 QADASSLLDLGDRQVQVLVSLMLPRVVVLGGLLADDECDALIALARPQLARSRTVDNRDG 154
Query: 82 DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
I R S L P G +I+ RI + + E +G LQ+ Y G
Sbjct: 155 SEIVHAARTSHSMALQP---GQDALCQRIEARIAQLLEWPV---EHGEG-LQVLRYATGA 207
Query: 142 HYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
Y H D TP G R+AS + YL E GGAT FP ++L V KG+AV
Sbjct: 208 QYAPHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFPDVHLDVAAVKGNAV 267
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
F+ + + R H+G PV G KW
Sbjct: 268 FFSYDRPHPM--TRTLHAGAPVLAGEKW 293
>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 91/214 (42%), Gaps = 36/214 (16%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
I P KV+++ PR + D E + +I L+K +++R V + G + + R S
Sbjct: 34 INPAKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSS 93
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP-LQINNYGLGGHYDLH----C 147
F+ P + I+ +I T L R G +Q+ Y G YD H
Sbjct: 94 GMFITK---AKDPIVAGIEDKIATWTFL-----PRENGEDIQVLRYEHGQKYDPHYDYFS 145
Query: 148 DATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFP 186
D G R+A+ + YLTDVE GG T+FPS + V P
Sbjct: 146 DKVNIARGGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKP 205
Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+G A+ +++ + + D H+GCPV G KW
Sbjct: 206 RRGDALLFFSLYPTAVPDTSSIHAGCPVIEGEKW 239
>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
Length = 307
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 91/204 (44%), Gaps = 28/204 (13%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
E + +PR H+ + E +I L+K + + VV+ D+R+ ++ +
Sbjct: 96 TEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQ 155
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
G + + I+ RI D T + + E LQ+ +Y +G H+D D G
Sbjct: 156 -RGRNKVIRAIEKRIADYTFIPVDHGEG----LQVLHYEVGQKYEPHFDYFLDEFNTKNG 210
Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
R+A+ + YL+DVE GG TIFP N L+V P+ G A+ +++
Sbjct: 211 GQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWS 270
Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
+ LD H GCPV GNKW
Sbjct: 271 MKPDATLDPLSLHGGCPVIKGNKW 294
>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
Length = 211
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 87/177 (49%), Gaps = 11/177 (6%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
+P +V + + + D E + +I+L+ KV+R K+ + + R S F+ + ++
Sbjct: 32 EPLIVVLGNVLSDEECDELIQLAGDKVKRSKIGTTREE--NELRTSSSMFIEDD---ENL 86
Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--RLASFM 163
+ +++ RI + + + E LQI Y G Y H D D + R+++ +
Sbjct: 87 IVTRVKKRISAIMKIPMEHGE----GLQILRYTPGQQYKAHHDFFSSDSKITNNRISTLV 142
Query: 164 FYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
YL DVE GG T FP L +V P KG AV++ +++ L+ H G PV G KW
Sbjct: 143 MYLNDVEQGGETFFPHLKFSVSPRKGMAVYFEYFYSDQTLNDFTLHGGAPVVEGEKW 199
>gi|148701599|gb|EDL33546.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_d [Mus
musculus]
Length = 545
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 92/183 (50%), Gaps = 18/183 (9%)
Query: 1 EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
++Y C+G + + + L C Y N L I P K E+ + P +V+ +D + D
Sbjct: 363 DVYESLCRGEGVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 422
Query: 59 SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
EI RI E++K K+ R V + G R+SK +L + D P + ++ R+Q
Sbjct: 423 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 479
Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR--DEGLW----RLASFMFYLTDVE 170
+T L + E LQ+ NYG+GG Y+ H D + R D GL RLA+F+ Y++
Sbjct: 480 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYVS-TG 534
Query: 171 LGG 173
LGG
Sbjct: 535 LGG 537
>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 211
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 90/207 (43%), Gaps = 32/207 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
VE L +PR H + E N +IE++K + + V++ D+R+ S FL
Sbjct: 2 VEVLSWEPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSRVRTSSGTFL- 60
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
+ G + +I+ RI D T + + + E LQ+ Y HYD DA
Sbjct: 61 --VRGQDHIIKRIEKRIADFTFIPVEQGE----GLQVLQYRESEKYEPHYDYFHDAFNTK 114
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
G R+A+ + YL+DVE GG T+FP+ L+V P G A+ +
Sbjct: 115 NGGQRIATVLMYLSDVEKGGETVFPASKVNASEVPDWDQRSECAKRGLSVRPRMGDALLF 174
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKWG 221
++ + LD H CPV G KW
Sbjct: 175 WSMKPDAKLDPTSLHGACPVIQGTKWS 201
>gi|149068803|gb|EDM18355.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III [Rattus
norvegicus]
Length = 266
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 87/179 (48%), Gaps = 20/179 (11%)
Query: 7 CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
CQ S P + +L C YE+ ++ +L + P + E ++L P V HD + D E +I
Sbjct: 43 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVIHLRPLVALYHDFVSDEEAQKIR 102
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
EL++ ++R V + + V+ R+SK +L + P L + RI +T L I +
Sbjct: 103 ELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPVLVTLDRRIAALTGLDI--Q 157
Query: 126 ERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTV 184
Y LQ+ NYG+GGHY+ H D L+ VE GGAT F N +V
Sbjct: 158 PPYAEYLQVVNYGIGGHYEPHFDHA--------------TLSSVEAGGATAFIYGNFSV 202
>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
Length = 310
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 99/238 (41%), Gaps = 28/238 (11%)
Query: 6 ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
A G + P D + + + G E + +PR H+ + E + +I
Sbjct: 65 AAAGGDAEPADPRPPRTRARRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEECDYLI 124
Query: 66 ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
L+K + + VV+ D+R+ ++ + G + I+ RI D T + +
Sbjct: 125 GLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQ-RGRDKVIRAIEKRIADYTFIPMEHG 183
Query: 126 ERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN 181
E LQ+ +Y +G H+D D G R+A+ + YL+DVE GG TIFP N
Sbjct: 184 EG----LQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRMATLLMYLSDVEEGGETIFPDAN 239
Query: 182 -------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L V P+ G A+ +++ + LD H GCPV GNKW
Sbjct: 240 VNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKW 297
>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
Length = 299
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 89/213 (41%), Gaps = 34/213 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
I P KV+++ PR + D E + +I L+K +++R V + GD+ D R S
Sbjct: 32 INPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSS 91
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
+ P + I+ RI T L E +Q+ Y G YD H D
Sbjct: 92 GMLISKN---KDPIVSGIEDRISAWTFLPKENGE----DIQVLRYEHGQKYDPHYDYFAD 144
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
+G RLA+ + YLT+V GG T+FP + V P
Sbjct: 145 KVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPR 204
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+G A+ +++ N + D H+GCPV G KW
Sbjct: 205 RGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKW 237
>gi|328710203|ref|XP_001949232.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
pisum]
Length = 500
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/230 (31%), Positives = 107/230 (46%), Gaps = 31/230 (13%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
+ ACQ S + I KC Y +L IGPL+ E + L P + H+ +YD EI
Sbjct: 283 FKTACQ---STTDFIYPKFKCRYYHGGRKYLMIGPLREEIVSLIPSMKLYHNVLYDDEIK 339
Query: 63 RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYP---EIFGD-HPFLYKIQTRIQDMT 118
+I EL+ K+E+ + DT D L KV ++F H L +I ++ T
Sbjct: 340 KIKELANPKLEKLSI----DT-NEDISLRKVASFRKHNDQVFETIHHRLAQISSK--PTT 392
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHC---DATPRDEGLWRLASFMFYLTDVELGGAT 175
N+V ++Y + NYG+GGHY H D R A + ++ DV GGAT
Sbjct: 393 NIV----DKY----VVTNYGIGGHYLPHTKYIDDNHLINSKRRDAIVIIHMDDVPEGGAT 444
Query: 176 IFPSLNLTVFPEKGSAVFWYNAH-----ANTLLDYRMYHSGCPVALGNKW 220
+ P++ V KGSA+ Y+ L ++ Y S CP+ G+KW
Sbjct: 445 VLPNVEFCVPSVKGSALVIYSTRNTLPPIKELFEFAQYGS-CPIVYGDKW 493
>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 291
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 92/202 (45%), Gaps = 27/202 (13%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
E L +PR H+ + E +I L+K ++R VV+ G I R S FL
Sbjct: 85 TEVLSSEPRASMYHNFLSKEECEHLINLAKPFMQRSLVVDGVTGQGILNSVRTSSGTFLE 144
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
G + ++ RI D+T++ I E LQI +Y +G HYD + + +
Sbjct: 145 R---GKDKIVQNVERRIADITSIPIENGE----GLQIIHYEVGQKFEPHYDYNFNWRITN 197
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN--------------LTVFPEKGSAVFWYNAHA 199
G R+A+ + YL+DVE GG T+FP+ L V P+ G A+ +++
Sbjct: 198 NGGPRVATVLMYLSDVEEGGETVFPNAKPNFNSVSKYHPGKGLVVKPKMGDALLFWSVKP 257
Query: 200 NTLLDYRMYHSGCPVALGNKWG 221
+ LD H G PV G+KW
Sbjct: 258 DGSLDTASLHGGSPVIRGSKWA 279
>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 300
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 92/206 (44%), Gaps = 32/206 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
E L +PR H+ + E +I L+K +++ VV+ D+R+ S FL
Sbjct: 89 TEVLSWEPRAFIYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGTFLR 148
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD---- 153
G + I+ RI D T + + E LQ+ +Y +G Y+ H D D
Sbjct: 149 ---RGQDKIVRTIEKRISDFTFIPVENGE----GLQVLHYEVGQKYEPHFDYFHDDFNTK 201
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
G R+A+ + YL+DVE GG T+FPS ++V P+ G A+ +
Sbjct: 202 NGGQRIATVLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALLF 261
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ + LD H GCPV G+KW
Sbjct: 262 WSMRPDGTLDPTSLHGGCPVIKGDKW 287
>gi|344253558|gb|EGW09662.1| Glucose 1,6-bisphosphate synthase [Cricetulus griseus]
Length = 904
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 90/185 (48%), Gaps = 20/185 (10%)
Query: 1 EIYPLACQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
+ Y CQ S P ++ L C YE+ ++ +L + P + E ++L P V HD + D+
Sbjct: 691 DTYEGLCQTLGSQPTHYQNPRLYCSYETNSSPYLLLQPARKEVIHLRPFVALYHDFVSDA 750
Query: 60 EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
E +I EL++ ++R V + + V+ R+SK +L + P L + RI +T
Sbjct: 751 EAQKIRELAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPMLGTLDHRIAALTG 807
Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
L I + Y LQ+ NYG+GGHY+ H D L+ VE GGAT F
Sbjct: 808 LDI--QPPYAEYLQVVNYGIGGHYEPHFDHAT--------------LSAVEAGGATAFIY 851
Query: 180 LNLTV 184
N +V
Sbjct: 852 ANFSV 856
>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
Length = 309
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 95/203 (46%), Gaps = 26/203 (12%)
Query: 33 LKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVV--NYGDTIYVDT 88
+ +G +V+ L PRVV + + E + II+ ++ ++ R V G D
Sbjct: 106 IDVGDRRVDVLMAMAQPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDD 165
Query: 89 RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD 148
R S F E ++P + +++ RI + N + E +G LQ+ +Y G Y H D
Sbjct: 166 RTSNGMFFQRE---ENPVVARLEARIARLVNWPL---ENGEG-LQVLHYRPGAEYKPHYD 218
Query: 149 -------ATPR--DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNA 197
TP G R+A+ + YL D E GG T FP ++L V P +G+AVF +
Sbjct: 219 YFDPAEPGTPTILRRGGQRVATIVIYLNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERP 278
Query: 198 HANTLLDYRMYHSGCPVALGNKW 220
H +T R H G PV G+KW
Sbjct: 279 HPST----RTLHGGAPVVAGDKW 297
>gi|195352174|ref|XP_002042589.1| GM14934 [Drosophila sechellia]
gi|194124473|gb|EDW46516.1| GM14934 [Drosophila sechellia]
Length = 438
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 93/200 (46%), Gaps = 24/200 (12%)
Query: 21 LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
L C Y + FLK+ PLK+EEL + P + + + +I + S+ K++R K ++
Sbjct: 256 LVCRYVDWTQ-FLKLAPLKMEELSMKPHISIFYGFLGQKDIEVLKNASRPKLQRVKHLSG 314
Query: 81 GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG 140
+ + S H + K+ I D+T G + L++ NYG+
Sbjct: 315 NCSCKIGNLSS----------SSHDVVRKVNELILDIT----GFPSKGNQMLEVINYGIA 360
Query: 141 GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 200
G+Y+ A P+ + A+ +L + GG +FPS +L V P KGS +FW N
Sbjct: 361 GNYNPEDTAKPK---IHNKANAFIFLENAGKGGEIVFPSRHLKVRPRKGSMLFWEN---- 413
Query: 201 TLLDYRMYHSGCPVALGNKW 220
L + +YH CP+ GN W
Sbjct: 414 -LKNSVIYHQ-CPILKGNMW 431
>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 293
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 90/206 (43%), Gaps = 31/206 (15%)
Query: 39 KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL 96
+V +L PR + +E + +++L+KG++++ V + G ++ R S FL
Sbjct: 30 RVTQLSWRPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVADNDSGKSVMSQVRTSSGTFL 89
Query: 97 YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPR 152
+ + I+ R+ T L E +Q+ +Y +G YD H D +
Sbjct: 90 NKH---EDEIISGIEKRVAAWTFL----PEENAESIQVLHYEVGQKYDAHFDYFHDKNNQ 142
Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
G R+A+ + YLTDV+ GG T+FP+ L V P KG A+ +
Sbjct: 143 KLGGHRVATVLMYLTDVKKGGETVFPNAEGRHLQHKDETWSECARSGLAVKPRKGDALLF 202
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ H N D H CPV G KW
Sbjct: 203 FSLHINATTDPSSLHGSCPVIEGEKW 228
>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
Length = 307
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 90/204 (44%), Gaps = 28/204 (13%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
E + +PR H+ + E +I L+K + + VV+ D+R+ ++ +
Sbjct: 96 TEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQ 155
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
G + I+ RI D T + + E LQ+ +Y +G H+D D G
Sbjct: 156 -RGRDKVIRAIEKRIADYTFIPVDHGEG----LQVLHYEVGQKYEPHFDYFLDEFNTKNG 210
Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
R+A+ + YL+DVE GG TIFP N L+V P+ G A+ +++
Sbjct: 211 GQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWS 270
Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
+ LD H GCPV GNKW
Sbjct: 271 MKPDATLDPLSLHGGCPVIKGNKW 294
>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
sativus]
Length = 313
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 91/208 (43%), Gaps = 31/208 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + D+E + +I+L+K K+E+ V + G ++ + R S
Sbjct: 50 PTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGM 109
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + ++ RI T L E +QI +Y G H+D D
Sbjct: 110 FLRK---AQDEVVAGVEARIAAWTLLPAENGE----SIQILHYENGQKYEPHFDFFHDKV 162
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------TVFPEKGSAV 192
++ G R+A+ + YL++VE GG TIFP+ V +KG A+
Sbjct: 163 NQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAKDESWSDCSRKGYAVKAQKGDAL 222
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
+++ + + D R H CPV G KW
Sbjct: 223 LFFSLNLDATTDERSLHGSCPVIAGEKW 250
>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
Length = 259
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/205 (29%), Positives = 92/205 (44%), Gaps = 32/205 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYP 98
+E + PRV H+ I + E +IEL+ +++R VV G D R S FL
Sbjct: 1 IEHVAWKPRVFIYHNFITEVEAKHLIELAAPQMKRSTVVGAGGKSVEDNYRTSYGTFL-- 58
Query: 99 EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWR 158
+ + D + +I+ R+ T + + +E QI YGLG Y +H D +E R
Sbjct: 59 KRYQDE-IVERIENRVAAWTQIPVAHQED----TQILRYGLGQQYKVHADTLRDEEAGVR 113
Query: 159 LASFMFYLTDVELGGATIFPSL---------------------NLTVFPEKGSA-VFWY- 195
+A+ + YL + + GG T FPS ++ P++G A +FW
Sbjct: 114 VATVLIYLNEPDGGGETAFPSSEWVNPQLAKTLGANFSDCAKNHVAFAPKRGDALLFWSI 173
Query: 196 NAHANTLLDYRMYHSGCPVALGNKW 220
N NT D H+GCPV G KW
Sbjct: 174 NPDGNT-EDTHASHTGCPVLSGVKW 197
>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
Length = 313
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 88/185 (47%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P+V+ + + E +IE S+ +++R +V+ G + R S+ + G+
Sbjct: 124 PQVIVFGNVLSPDECAEMIERSRHRLKRSTIVDPATGREDVIRNRTSEGIWYQ---RGED 180
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
+ ++ RI + N + E +G LQI +YG G Y H D P D+ G
Sbjct: 181 ALIERLDQRIASLMNWPL---ENGEG-LQILHYGPSGEYRPHFDYFPPDQPGSAVHTARG 236
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL DV GG TIFP L+V ++G AV++ + LD H G PV
Sbjct: 237 GQRVATLVVYLNDVPDGGETIFPEAGLSVAAQQGGAVYFRYMNGRRQLDPLTLHGGAPVL 296
Query: 216 LGNKW 220
G+KW
Sbjct: 297 SGDKW 301
>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
Length = 308
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 90/204 (44%), Gaps = 28/204 (13%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
E + +PR H+ + E +I L+K + + VV+ D+R+ ++ +
Sbjct: 97 TEVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQ 156
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
G + I+ RI D T + + E LQ+ +Y +G H+D D G
Sbjct: 157 -RGRDKVIRVIEKRIADYTFIPVDHGEG----LQVLHYEVGQKYEPHFDYFLDEFNTKNG 211
Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
R+A+ + YL+DVE GG TIFP N L+V P+ G A+ +++
Sbjct: 212 GQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALLFWS 271
Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
+ LD H GCPV GNKW
Sbjct: 272 MKPDATLDPLSLHGGCPVIRGNKW 295
>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 94/206 (45%), Gaps = 32/206 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
VE + +PR H+ + E +I L+K +++ VV+ D+R+ S FL
Sbjct: 76 VEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSRVRTSSGTFL- 134
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
P G + I+ R+ D + + + E LQ+ +Y +G H+D D
Sbjct: 135 PR--GRDKTVRTIEKRLSDFSFIPVEHGE----GLQVLHYEVGQKYEPHFDYFLDEYNTK 188
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
G R+A+ + YL+DVE GG T+FP+ L+V P++G A+ +
Sbjct: 189 NGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLF 248
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ + LD H GCPV GNKW
Sbjct: 249 WSMKPDASLDPSSLHGGCPVIKGNKW 274
>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 94/206 (45%), Gaps = 32/206 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
VE + +PR H+ + E +I L+K +++ VV+ D+R+ S FL
Sbjct: 76 VEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSRVRTSSGTFL- 134
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
P G + I+ R+ D + + + E LQ+ +Y +G H+D D
Sbjct: 135 PR--GRDKTVRTIEKRLSDFSFIPVEHGE----GLQVLHYEVGQKYEPHFDYFLDEYNTK 188
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
G R+A+ + YL+DVE GG T+FP+ L+V P++G A+ +
Sbjct: 189 NGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLF 248
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ + LD H GCPV GNKW
Sbjct: 249 WSMKPDASLDPSSLHGGCPVIKGNKW 274
>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
Length = 344
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 65/227 (28%), Positives = 96/227 (42%), Gaps = 37/227 (16%)
Query: 24 FYESYNNTFLKIGPL-------KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
F S+ N+ P +V+ L+ D R+ H+ + D E + II+L++ + R
Sbjct: 40 FAASFGNSSCASEPACDPSRSPRVQVLHEDARIFLYHNFLTDEECDHIIKLAEPTMARSG 99
Query: 77 VV--NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
VV + G + + R SK FL G + I+ RI T + G E LQ+
Sbjct: 100 VVETDSGKSKIDNVRTSKGTFLN---RGHDSVIADIEARIAKWTLMPAGNGEG----LQV 152
Query: 135 NNYGLG----GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN--------- 181
Y G GHYD G R + + YL DVE GG T FP++
Sbjct: 153 LKYEHGQEYEGHYDYFFHKAGTANGGNRYLTVLMYLNDVEEGGETCFPNIPSPNGDNGPE 212
Query: 182 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L P+KG+AV +++ L+ R H+ CPV G KW
Sbjct: 213 FSECARKVLAAKPKKGNAVLFHSIKPTGELERRSLHTACPVIKGVKW 259
>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
Length = 283
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 57/209 (27%), Positives = 90/209 (43%), Gaps = 30/209 (14%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
+ P KV +L PR + +E + +++++K K+++ V + G ++ + R S
Sbjct: 23 VDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSS 82
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCD 148
FL G + +I+ RI T L E +Q+ Y G HYD D
Sbjct: 83 GMFLSK---GQDEVINRIEERIAAWTFLPKENGE----AIQVLRYEFGEKYEPHYDYFHD 135
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN-----------------LTVFPEKGSA 191
+ G R+A+ + YL+D GG T+FPS + V P KG A
Sbjct: 136 KYNQALGGHRIATVLMYLSDAVKGGETVFPSSEEDTTVKDDSWSDCAKKGIAVKPRKGDA 195
Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ +Y+ H + D H GCPV G KW
Sbjct: 196 LLFYSLHPDATPDESSLHGGCPVIEGEKW 224
>gi|241710335|ref|XP_002412046.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
gi|215505101|gb|EEC14595.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
Length = 65
Score = 74.7 bits (182), Expect = 3e-11, Method: Composition-based stats.
Identities = 31/63 (49%), Positives = 43/63 (68%)
Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALG 217
R+A+ M Y++DVE GGAT+FP L + + P+KG A FW+N AN + H+GCPV G
Sbjct: 3 RVATLMIYMSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCPVLYG 62
Query: 218 NKW 220
+KW
Sbjct: 63 SKW 65
>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
sativa Japonica Group]
gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
Length = 321
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 90/206 (43%), Gaps = 32/206 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
E L +PR H+ + E +I L+K +++ VV+ D+R+ S FL
Sbjct: 110 TEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTSSGMFLG 169
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
G + I+ RI D T + + E LQ+ +Y +G H+D D
Sbjct: 170 R---GQDKIIRTIEKRISDYTFIPVENGE----GLQVLHYEVGQKYEPHFDYFHDEFNTK 222
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSL-------------------NLTVFPEKGSAVFW 194
G R+A+ + YL+DVE GG TIFPS L V P+ G A+ +
Sbjct: 223 NGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLF 282
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ + LD H GCPV GNKW
Sbjct: 283 WSMRPDGSLDATSLHGGCPVIKGNKW 308
>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
Length = 307
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 89/204 (43%), Gaps = 28/204 (13%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
E + +PR H+ + E +I L+K + + VV+ D+R+ ++ +
Sbjct: 96 TEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQ 155
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
G + I+ RI D T + + E LQ+ +Y +G H+D D G
Sbjct: 156 -RGRDKVIRAIEKRIADYTFIPVDHGEG----LQVLHYEVGQKYEPHFDYFLDEFNTKNG 210
Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
R+A+ + YL+DVE GG TIFP N L+V P+ G A+ +++
Sbjct: 211 GQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWS 270
Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
LD H GCPV GNKW
Sbjct: 271 MKPGATLDPLSLHGGCPVIKGNKW 294
>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
Length = 305
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 88/185 (47%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P+V+ D + E + +IE ++ +++R VN G + R S+ ++ +
Sbjct: 116 PQVIVFDDVLSRDECDELIERARHRLKRSTTVNPESGREDVIQLRTSEGFWFQ---RCED 172
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
F+ ++ RI + N + E +G LQI +Y GG Y H D P + G
Sbjct: 173 AFIERLDRRISALMNWPL---EHGEG-LQILHYTKGGEYRPHFDYFPPSQSGSVLHTSRG 228
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL+DV GG T+FP+ L V +G A+++ + + LD H G PV
Sbjct: 229 GQRVATLIVYLSDVAGGGETVFPNAGLAVMARQGGAIYFRYLNGHRQLDPLTLHGGAPVT 288
Query: 216 LGNKW 220
G KW
Sbjct: 289 NGEKW 293
>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 303
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 59/223 (26%), Positives = 93/223 (41%), Gaps = 36/223 (16%)
Query: 27 SYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTI 84
SY T I P KV+++ PR + D E + +I ++K +++R V + G++
Sbjct: 27 SYAGTSAIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESK 86
Query: 85 YVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYD 144
+ R S F+ + I+ +I T L E +Q+ Y G YD
Sbjct: 87 LSEVRTSSGMFISKN---KDAIVSGIEDKISSWTFLPKENGED----IQVLRYEHGQKYD 139
Query: 145 LH----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------ 182
H D G R+A+ + YLT+V GG T+FP+ L
Sbjct: 140 PHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDLSEC 199
Query: 183 -----TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
V P +G A+ +++ H N + D H+GCPV G KW
Sbjct: 200 GKKGVAVKPRRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKW 242
>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
Length = 268
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/217 (27%), Positives = 105/217 (48%), Gaps = 24/217 (11%)
Query: 19 SNLKCFYESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
SN K + + N ++++ +V ++ P V I+D + E + +I + K++ +
Sbjct: 49 SNQKIPHINMTNNYVELSDKRVSLSFVCYKPFVTVINDFLSPEECDALISDADQKLKASR 108
Query: 77 VVNYGDTIYVD----TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
VV+ D +V+ T S Y G+ + I+ RI D+ N + E L
Sbjct: 109 VVDPEDGSFVEHSARTSTSTGYHR-----GEIDIIKTIEARIADLINWPVDHGE----GL 159
Query: 133 QINNYGLGGHYDLHCD------ATPR---DEGLWRLASFMFYLTDVELGGATIFPSLNLT 183
Q+ Y GG Y H D + R +G R+ +F+ YL++V+ GG+T FP+LN
Sbjct: 160 QVLRYEDGGEYRPHFDFFDPAKKSSRLVTKQGGQRVGTFLMYLSEVDSGGSTRFPNLNFE 219
Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ P KGSA+++ N + ++ H+G PV G K+
Sbjct: 220 IRPNKGSALYFANTNLKAEIEPLTLHAGMPVTEGVKY 256
>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
Length = 289
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 66/229 (28%), Positives = 99/229 (43%), Gaps = 32/229 (13%)
Query: 4 PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
PL C+ VP I N ++ + + L + PRVV + + D E +
Sbjct: 69 PLPCR----VPAPIGLNGPALLDAGDRQVQLLASLML------PRVVVLGGLLSDEECDA 118
Query: 64 IIELSKGKVERGKVVNY---GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
++ELS+ ++ R V+ G ++ D R S+ F G HP I+ RI +
Sbjct: 119 LVELSRPRLRRSTTVDAQTGGSQVHAD-RTSRGTFFE---RGAHPVCATIEARIARLLEW 174
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------GLWRLASFMFYLTDVEL 171
+ E +G LQ+ +Y G + H D DE G R+A+ + YL
Sbjct: 175 PV---ENGEG-LQVLHYPPGAEFRPHYDYFDPDEPGAEVLLRQGGQRVATVVMYLNTPAR 230
Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
GGAT FP +L V KG+AVF+ + + R H G PV G KW
Sbjct: 231 GGATTFPDAHLEVAAVKGNAVFFSYDRPHPM--TRTLHGGAPVTEGEKW 277
>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
C-169]
Length = 222
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 95/204 (46%), Gaps = 30/204 (14%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
+E L +PR H+ + ++E + +++ K +E+ +VV+ G + R S FL
Sbjct: 1 MEVLSWEPRAYLYHNFLTEAEADYLVQKGKPHMEKSEVVDNETGKSAPSKVRTSSGMFLN 60
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
G+ + +I+ RI T + +E +G LQI +Y H+D D
Sbjct: 61 ---RGEDDVIERIEARIAKYTAIP---KENGEG-LQILHYQASEEYRPHFDYFHDNFNTQ 113
Query: 154 EGLWRLASFMFYLTDVELGGATIFP-----------------SLNLTVFPEKGSAVFWYN 196
G R+A+ + YL+DVE GG T+FP P+KG A+F+Y+
Sbjct: 114 NGGQRIATMLMYLSDVEDGGETVFPESSDKPNVGNTKFSQCAQAGAAAKPKKGDALFFYS 173
Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
+ +D + H+GCPV G+KW
Sbjct: 174 LTPDGRMDEKSLHAGCPVMKGDKW 197
>gi|156352046|ref|XP_001622583.1| predicted protein [Nematostella vectensis]
gi|156209154|gb|EDO30483.1| predicted protein [Nematostella vectensis]
Length = 497
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 107/220 (48%), Gaps = 29/220 (13%)
Query: 3 YPLACQGNLSVPEDIK--SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y C+G P ++ L+C+Y+S ++ L++ P K+E L D +++ + D I +S+
Sbjct: 284 YERLCRGQ---PNKVRIPKQLRCYYKS-SHPLLRLKPAKIEVLDPDRQILLLRDVINESQ 339
Query: 61 INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+ I EL+ KV + + + R S +L D + + RI+ +T+
Sbjct: 340 MQFIKELAAPKVSSLHLSPTNRSP-SERRFSSSAWLGD---ADGAPIAALSRRIEAITDF 395
Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSL 180
+ + LQ+ ++G+GGH++ PR + + F V+ GG+ +F
Sbjct: 396 HVTGDS--AESLQVVHFGIGGHFE------PR----YGYNALNF----VDAGGSNVFLDS 439
Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
L+V P+KGSAVFW N + H+ CPV +G+KW
Sbjct: 440 ELSVSPQKGSAVFWLNMRRSG---KETLHAACPVIVGHKW 476
>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 306
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 91/206 (44%), Gaps = 32/206 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
E L +PR H+ + E +I L+K +++ VV+ D+R+ S FL
Sbjct: 95 TEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGTFLR 154
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD---- 153
G + I+ RI D T + E LQ+ +Y +G Y+ H D D
Sbjct: 155 ---RGQDKVIRTIEKRISDFTFIPAENGE----GLQVLHYEVGQKYEPHFDYFHDDFNTK 207
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
G R+A+ + YL+DVE GG T+FPS ++V P+ G A+ +
Sbjct: 208 NGGQRIATLLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALLF 267
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ + LD H GCPV G+KW
Sbjct: 268 WSMRPDGTLDPTSLHGGCPVIKGDKW 293
>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 288
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 90/211 (42%), Gaps = 32/211 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV---NYGDTIYVDTRLS 91
+ P ++ +L PR + D E + +I+L+KGK+E+ VV + G++ + R S
Sbjct: 27 VDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTS 86
Query: 92 KVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD--- 148
FL + ++ ++ T L E LQI +Y G YD H D
Sbjct: 87 SGMFLTKR---QDDIVANVEAKLAAWTFL----PEENGEALQILHYENGQKYDPHFDYFY 139
Query: 149 -ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKG 189
+ G R+A+ + YL++V GG T+FP+ V P KG
Sbjct: 140 DKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKG 199
Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
A+ ++N H N D H CPV G KW
Sbjct: 200 DALLFFNLHLNGTTDPNSLHGSCPVIEGEKW 230
>gi|224013908|ref|XP_002296618.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968970|gb|EED87314.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 601
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 81/187 (43%), Gaps = 16/187 (8%)
Query: 47 PRVVKIHDAIYDSEINRIIELS-------KGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
P VV + + D E +R+++L KV+ K N D + R S +
Sbjct: 400 PWVVSLEGFLSDEEADRLVQLGNQQGYKRSTKVQTHKGGNSIDAGITEDRTSHNTWCQEP 459
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-- 157
D P + I RI +T E LQ+ Y G Y H D P+ +
Sbjct: 460 SCYDDPLVAPIIERIAMLTKSSANHSEH----LQLLQYTEGQFYKQHNDYIPQQRDMACG 515
Query: 158 -RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN--TLLDYRMYHSGCPV 214
R+ + YL DVE GG T FP L+LTV P++G+A+ W + + D R H PV
Sbjct: 516 PRIMTLFLYLNDVEEGGGTRFPLLDLTVQPKRGNAILWASVRDDDPEEKDIRTDHEALPV 575
Query: 215 ALGNKWG 221
A G K+G
Sbjct: 576 AKGMKYG 582
>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 97/206 (47%), Gaps = 32/206 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
VE + +PR H+ + +E + +I L+K +++ VV+ D+R+ S FL
Sbjct: 55 VEAISWEPRAFIYHNFLTKAECDYLINLAKPHMQKSMVVDSSSGKSKDSRVRTSSGTFL- 113
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD---- 153
P G + I+ RI D + + E +G LQI +Y +G Y+ H D D
Sbjct: 114 PR--GRDKIIRDIEKRIADFSFIP---SEHGEG-LQILHYEVGQKYEPHFDYFMDDYNTE 167
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
G R+A+ + YL+DVE GG T+FPS L+V P+ G A+ +
Sbjct: 168 NGGQRIATVLMYLSDVEEGGETVFPSAKGNISSVPWWNELSECGKGGLSVKPKMGDALLF 227
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ + LD H GCPV GNKW
Sbjct: 228 WSMKPDASLDPSSLHGGCPVIRGNKW 253
>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
Length = 350
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 92/213 (43%), Gaps = 40/213 (18%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLY 97
E + PR +H + + E I+ ++K ++R VV+ G+ R SK FL
Sbjct: 80 TEPISWQPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEIKTDPIRTSKQTFL- 138
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP-LQINNYGLGGHYDLHCDATPRD--- 153
G +P + +++ R+ T L Y G +QI +YG+G Y H D ++
Sbjct: 139 --ARGKYPVVTRVEERLSRFTMLPW-----YNGEDMQILSYGVGEKYSAHHDVGEKNTKS 191
Query: 154 ------EGLWRLASFMFYLTDVELGGATIFP-------------------SLNLTVF-PE 187
+G R+A+ + YL D E GG T FP + N F P+
Sbjct: 192 GQQLSADGGQRVATVLLYLQDTEEGGETAFPDSEWIEPESEYAQQKFSECAKNGVAFKPK 251
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+G + +++ +D + H+GCPV G KW
Sbjct: 252 RGDGLLFFSITPEGDIDQKSMHAGCPVVKGTKW 284
>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
Length = 302
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 91/215 (42%), Gaps = 36/215 (16%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
I P KV+++ PR + + E + +I L+K +++R V + GD+ D R S
Sbjct: 36 IDPSKVKQVSWKPRAFVYKGFLTELECDHLISLAKSELKRSAVADNLSGDSKLSDVRTSS 95
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCD 148
F+ P + I+ +I T L E +Q+ Y G HYD D
Sbjct: 96 GMFISKN---KDPIVAGIEDKISSWTFLPKENGED----IQVLRYEHGQKYDPHYDFFAD 148
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVF----------------------- 185
G R+A+ + YLT+V GG T+FP+ + F
Sbjct: 149 KVNIARGGHRVATVLMYLTNVTRGGETVFPNAEVEEFPRHRGSETIDDLSECAKKGIAVK 208
Query: 186 PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
P +G A+ +++ + N + D H+GCPV G KW
Sbjct: 209 PRRGDALLFFSLYPNAVPDTMSLHAGCPVIEGEKW 243
>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
Length = 307
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 89/204 (43%), Gaps = 28/204 (13%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
E + +PR H+ + E +I L+K + + VV+ D+R+ ++ +
Sbjct: 96 TEVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQ 155
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
G + I+ RI D T + E LQ+ +Y +G H+D D G
Sbjct: 156 -RGRDKVIRAIEKRIADYTFIPADHGEG----LQVLHYEVGQKYEPHFDYFLDEFNTKNG 210
Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
R+A+ + YL+DVE GG TIFP N L+V P+ G A+ +++
Sbjct: 211 GQRMATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSECAKRGLSVKPKMGDALLFWS 270
Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
+ LD H GCPV GNKW
Sbjct: 271 MKPDATLDPLSLHGGCPVIRGNKW 294
>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
Length = 222
Score = 73.9 bits (180), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 90/206 (43%), Gaps = 32/206 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
E L +PR H+ + E +I L+K +++ VV+ D+R+ S FL
Sbjct: 11 TEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTSSGMFLG 70
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
G + I+ RI D T + + E LQ+ +Y +G H+D D
Sbjct: 71 R---GQDKIIRTIEKRISDYTFIPVENGE----GLQVLHYEVGQKYEPHFDYFHDEFNTK 123
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSL-------------------NLTVFPEKGSAVFW 194
G R+A+ + YL+DVE GG TIFPS L V P+ G A+ +
Sbjct: 124 NGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLF 183
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ + LD H GCPV GNKW
Sbjct: 184 WSMRPDGSLDATSLHGGCPVIKGNKW 209
>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 301
Score = 73.9 bits (180), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 58/221 (26%), Positives = 93/221 (42%), Gaps = 34/221 (15%)
Query: 27 SYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTI 84
SY T I P KV+++ PR + D E + +I ++K +++R V + G++
Sbjct: 27 SYAGTSAIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESK 86
Query: 85 YVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYD 144
+ R S F+ + I+ +I T L E +Q+ Y G YD
Sbjct: 87 LSEVRTSSGMFISKN---KDAIVSGIEDKISSWTFLPKENGED----IQVLRYEHGQKYD 139
Query: 145 LH----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------- 181
H D G R+A+ + YLT+V GG T+FP+
Sbjct: 140 PHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDLSECGK 199
Query: 182 --LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ V P +G A+ +++ H N + D H+GCPV G KW
Sbjct: 200 KGVAVKPRRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKW 240
>gi|344199983|ref|YP_004784309.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
gi|343775427|gb|AEM47983.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
Length = 212
Score = 73.9 bits (180), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 43/122 (35%), Positives = 64/122 (52%), Gaps = 9/122 (7%)
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDA----TPRDE-GLWR 158
+P + ++ RI +L IG E + PLQ+ +Y GG YD+H D+ +P+ E G R
Sbjct: 68 YPIIKAVRRRI----SLFIGVAEENQEPLQVLHYTRGGRYDIHYDSFLEGSPQLENGGNR 123
Query: 159 LASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGN 218
+ + + YL DVE GG T FP + + P G+ + + N A L H+G PV G
Sbjct: 124 MLTVLLYLNDVEQGGWTQFPHIMANIVPNVGTGILFRNTDAQNLQLRESLHAGLPVIDGE 183
Query: 219 KW 220
KW
Sbjct: 184 KW 185
>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 92/211 (43%), Gaps = 34/211 (16%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + D+E + +I+L+K K+E+ V + G ++ + R S
Sbjct: 29 PTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGM 88
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + ++ RI T L E +QI +Y G H+D D
Sbjct: 89 FLRK---AQDEVVAGVEARIAAWTLLPAENGE----SIQILHYENGQKYEPHFDFFHDKV 141
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLNL---------------------TVFPEKG 189
++ G R+A+ + YL++VE GG TIFP+ + V +KG
Sbjct: 142 NQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSESQAKDESWSDCSRKGYAVKAQKG 201
Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
A+ +++ + + D R H CPV G KW
Sbjct: 202 DALLFFSLNLDATTDERSLHGSCPVIAGEKW 232
>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 296
Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 88/185 (47%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P + + D + +E ++I L++ ++ R VV+ G + R S F G+
Sbjct: 102 PAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNVVAGHRSSDGMFFR---LGET 158
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDE-----G 155
P + +++ RI ++T L + E +G LQ+ +Y +G H D P ++
Sbjct: 159 PLIARLEARIAELTGLPV---ENGEG-LQLLHYEVGAESTPHVDYLIAGNPANQESIARS 214
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+ + + YL DVE GG T+FP +V P +G A+++ + L D H+ P+
Sbjct: 215 GQRVGTLLMYLNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLR 274
Query: 216 LGNKW 220
+G KW
Sbjct: 275 VGEKW 279
>gi|428671901|gb|EKX72816.1| conserved hypothetical protein [Babesia equi]
Length = 234
Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 98/211 (46%), Gaps = 27/211 (12%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGK-----V 72
N+ C+ S T + +K+ +LY L+P + I + + I +++ S+GK
Sbjct: 26 NIHCYKRS---TLIINDSIKLMKLYIHLNPEISMIFNVLEPEWIQHMMDASEGKWVKSQT 82
Query: 73 ERGKVVNYGDT---IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYK 129
RG + DT +TR S+ E + + KI+ R+ LV G +
Sbjct: 83 SRGLSSGHPDTYQTTVSETRKSQSAIFEHE---ETDVIAKIERRVA----LVAGIGVEFL 135
Query: 130 GPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
L + Y G ++ H D G +R A+ + YL DVE GG T+FP+L L + P
Sbjct: 136 EKLVMVKYNPGDYFKEHHD------GSFRTATILLYLNDVE-GGETVFPNLGLAIKPVGN 188
Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
SAVFW N + +D RM H+G +G K+
Sbjct: 189 SAVFWRNLNGENEMDERMIHAGTTPKVGTKY 219
>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 297
Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 56/223 (25%), Positives = 92/223 (41%), Gaps = 35/223 (15%)
Query: 27 SYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTI 84
S N++ K+ P KV ++ PR + D E + +I ++K +++R V + G +
Sbjct: 24 SSNDSIFKLNPSKVRQISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVADNESGKSQ 83
Query: 85 YVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG---- 140
+ R S F+ + +I+ ++ T L I E +Q+ Y G
Sbjct: 84 VSEVRTSSGAFISK---AKDAIVQRIEEKLATWTFLPIENGE----DIQVLRYEEGQKYE 136
Query: 141 GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------ 182
H+D D G R A+ + YL++VE GG T+FP+ L
Sbjct: 137 NHFDFFSDKVNIARGGHRYATVLMYLSNVEKGGDTVFPNAELSERQKAAIAANDDLSECA 196
Query: 183 ----TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
+V P KG A+ +++ D H GCPV G KW
Sbjct: 197 KRGISVKPRKGDALLFFSLTPTATPDQLSLHGGCPVIEGEKWS 239
>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 298
Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 55/213 (25%), Positives = 90/213 (42%), Gaps = 34/213 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
+ P KV+++ PR + + E + ++ L+K ++R V + G++ + + R S
Sbjct: 32 VNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSS 91
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---- 148
F+ G P + I+ +I T L E +Q+ Y G YD H D
Sbjct: 92 GTFISK---GKDPIVSGIEDKISTWTFLPKENGE----DIQVLRYEHGQKYDAHFDYFHD 144
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL---------------------TVFPE 187
G R+A+ + YL++V GG T+FP + V P
Sbjct: 145 KVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPR 204
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
KG A+ ++N H + + D H GCPV G KW
Sbjct: 205 KGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKW 237
>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
Length = 298
Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 55/213 (25%), Positives = 90/213 (42%), Gaps = 34/213 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
+ P KV+++ PR + + E + ++ L+K ++R V + G++ + + R S
Sbjct: 32 VNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSS 91
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---- 148
F+ G P + I+ +I T L E +Q+ Y G YD H D
Sbjct: 92 GTFISK---GKDPIVSGIEDKISTWTFLPKENGE----DIQVLRYEHGQKYDAHFDYFHD 144
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL---------------------TVFPE 187
G R+A+ + YL++V GG T+FP + V P
Sbjct: 145 KVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENEEDLSDCAKRGIAVKPR 204
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
KG A+ ++N H + + D H GCPV G KW
Sbjct: 205 KGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKW 237
>gi|415977972|ref|ZP_11559036.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339834153|gb|EGQ61937.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 215
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 87/191 (45%), Gaps = 20/191 (10%)
Query: 45 LDPRVVKIHDAIYDSEINRIIELSKGK--VERGKVVNYGDTIYVDTRLSKVY-------- 94
+ P+++ ++D I ++ L + + G V + ++ VD Y
Sbjct: 3 MGPKILSVNDTIGLVHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPGRCST 62
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDA----T 150
+ P + +P + +I+ RI+ L G + + PLQI +Y GG YD+H DA +
Sbjct: 63 VVAPSVDA-YPIILEIRRRIE----LFSGISQENQEPLQILHYTRGGKYDIHYDAFSDGS 117
Query: 151 PR-DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYH 209
P+ G RL + + YL DVE GG T FP + + P GS + + N A H
Sbjct: 118 PQLRNGGNRLLTVLLYLNDVEYGGWTQFPHIMANIVPNAGSGILFRNTDAQNRQLRESLH 177
Query: 210 SGCPVALGNKW 220
+G PV G KW
Sbjct: 178 AGLPVTHGEKW 188
>gi|198284815|ref|YP_002221136.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218668131|ref|YP_002427500.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|198249336|gb|ACH84929.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218520344|gb|ACK80930.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 213
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 87/191 (45%), Gaps = 20/191 (10%)
Query: 45 LDPRVVKIHDAIYDSEINRIIELSKGK--VERGKVVNYGDTIYVDTRLSKVY-------- 94
+ P+++ ++D I ++ L + + G V + ++ VD Y
Sbjct: 1 MGPKILSVNDTIGLVHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPGRCST 60
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDA----T 150
+ P + +P + +I+ RI+ L G + + PLQI +Y GG YD+H DA +
Sbjct: 61 VVAPSVDA-YPIILEIRRRIE----LFSGISQENQEPLQILHYTRGGKYDIHYDAFSDGS 115
Query: 151 PR-DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYH 209
P+ G RL + + YL DVE GG T FP + + P GS + + N A H
Sbjct: 116 PQLRNGGNRLLTVLLYLNDVEYGGWTQFPHIMANIVPNAGSGILFRNTDAQNRQLRESLH 175
Query: 210 SGCPVALGNKW 220
+G PV G KW
Sbjct: 176 AGLPVTHGEKW 186
>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Glycine max]
Length = 297
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 56/209 (26%), Positives = 93/209 (44%), Gaps = 30/209 (14%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
I P KV+++ PR + + E + +I ++K +++R V + G++ + R S
Sbjct: 35 IDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSS 94
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
F+ P+ P + ++ +I T L E +Q+ Y G YD H D
Sbjct: 95 GMFI-PK--NKDPIVAGVEDKISSWTLLPKENGED----IQVLRYEHGQKYDPHYDYFAD 147
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL-----------------TVFPEKGSA 191
G R+A+ + YLTDV GG T+FP+ L V P +G A
Sbjct: 148 KVNIARGGHRVATVLMYLTDVTKGGETVFPNAELKSSETKEDLSECAQKGIAVKPRRGDA 207
Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ +++ + N + D H+GCPV G KW
Sbjct: 208 LLFFSLYPNAIPDTMSLHAGCPVIEGEKW 236
>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 288
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 22/208 (10%)
Query: 26 ESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--G 81
++ ++ L +G +V+ L + PRVV + + D E + +I L++ ++ R + V+ G
Sbjct: 75 QADASSLLDLGDRQVQVLVSLMLPRVVVLGGLLADDECDALIALARPQLARSRTVDNRDG 134
Query: 82 DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
I R S L P G +I+ RI + + E +G LQ+ Y G
Sbjct: 135 SEIVHAARTSHSMALQP---GQDALCQRIEARIAQLLEWPV---EHGEG-LQVLRYATGA 187
Query: 142 HYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
Y H D TP G R+AS + YL E GGAT P ++L V KG+AV
Sbjct: 188 QYAPHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVPDVHLDVAAVKGNAV 247
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
F+ + + R H+G PV G KW
Sbjct: 248 FFSYDRPHPM--TRTLHAGAPVLAGEKW 273
>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
Length = 286
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 57/189 (30%), Positives = 88/189 (46%), Gaps = 26/189 (13%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY---GDTIYVDTRLSKVYFLYPEIFG 102
+PRVV + D E ++I L+K ++ R V G+ + D S ++F G
Sbjct: 98 NPRVVVFGSLLSDQECEQLIGLAKPRLARSLTVATKTGGEEVNEDRTSSGMFFQR----G 153
Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPR--D 153
++ + +I+ RI + N + E +G LQ+ +Y G Y H D TP
Sbjct: 154 ENELVARIEARIARLVNWPV---ENGEG-LQVLHYRPGAEYKPHYDYFDPAEPGTPTILK 209
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLDYRMYHSG 211
G R+ + + YL + E GG T FP ++L V P++G VF + H +T R H G
Sbjct: 210 RGGQRVGTLVMYLGEPEKGGGTTFPDVHLEVAPKRGHGVFFSYERPHPST----RTLHGG 265
Query: 212 CPVALGNKW 220
PV G KW
Sbjct: 266 APVLAGEKW 274
>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
Length = 308
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 22/208 (10%)
Query: 26 ESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--G 81
++ ++ L +G +V+ L + PRVV + + D E + +I L++ ++ R + V+ G
Sbjct: 95 QADASSLLDLGDRQVQVLVSLMLPRVVVLGGLLADDECDALIALARPQLARSRTVDNRDG 154
Query: 82 DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
I R S L P G +I+ RI + + E +G LQ+ Y G
Sbjct: 155 SEIVHAARTSHSMALQP---GQDALCQRIEARIAQLLEWPV---EHGEG-LQVLRYATGA 207
Query: 142 HYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
Y H D TP G R+AS + YL E GGAT P ++L V KG+AV
Sbjct: 208 QYAPHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVPDVHLDVAAVKGNAV 267
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
F+ + + R H+G PV G KW
Sbjct: 268 FFSYDRPHPM--TRTLHAGAPVLAGEKW 293
>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
Length = 302
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 91/208 (43%), Gaps = 32/208 (15%)
Query: 39 KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFL 96
+VE L +PR H+ + E +I ++K + + VV+ G ++ + R S +FL
Sbjct: 90 RVEVLSWEPRAFLYHNFLAKDECEYLINIAKPHMVKSMVVDSKTGGSMDSNVRTSSGWFL 149
Query: 97 YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL----GGHYDLHCDATPR 152
G + +I+ RI D +++ + E L + +Y + HYD D
Sbjct: 150 NR---GQDKIIRRIEKRIADFSHIPVEHGE----GLHVLHYEVEQKYDAHYDYFSDTINV 202
Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVF 193
G R A+ + YL+DVE GG T+FP L+V P+ G A+
Sbjct: 203 KNGGQRGATMLMYLSDVEKGGETVFPQSKVNSSSVPWWDELSECGRSGLSVRPKMGDALL 262
Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKWG 221
+++ + LD H CPV GNKW
Sbjct: 263 FWSVKPDASLDPSSLHGSCPVIQGNKWS 290
>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 316
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 107/243 (44%), Gaps = 33/243 (13%)
Query: 3 YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
+P +C G L+ + KS L+ E+ ++ + + P V +L PR + E +
Sbjct: 20 HPSSC-GWLNNVKKGKSVLRLKSENVPSS-VGVDPSHVTQLSWKPRAFLYEGFLTHEECD 77
Query: 63 RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
+I+++K K+E+ V + G +I + R S FL + I+ RI T L
Sbjct: 78 HLIDMAKDKLEKSMVADNESGKSIPSEVRTSSGMFLQK---AQDDVVAAIEARIAAWTFL 134
Query: 121 VIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
I E +QI +Y G H+D D + G R+A+ + YL++VE GG T+
Sbjct: 135 PIENGE----AMQILHYERGQKYEPHFDYFHDKVNQQLGGHRIATVLMYLSNVEEGGETV 190
Query: 177 FPSLNL------------------TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGN 218
FP+ +V P+KG A+ +++ H + D H CPV G
Sbjct: 191 FPNAEAKLQLANNESLSDCAKGGYSVKPKKGDALLFFSLHPDASTDSLSLHGSCPVIEGE 250
Query: 219 KWG 221
KW
Sbjct: 251 KWS 253
>gi|195494572|ref|XP_002094895.1| GE22068 [Drosophila yakuba]
gi|194180996|gb|EDW94607.1| GE22068 [Drosophila yakuba]
Length = 438
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 54/200 (27%), Positives = 93/200 (46%), Gaps = 24/200 (12%)
Query: 21 LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
L C Y + FLK+ PLK+EEL + P + + + +I + +S+ K++R + ++
Sbjct: 256 LVCHYVDWT-PFLKLAPLKMEELSMKPHISIFYGFLGPKDIEVLKNVSRPKLQRNEHLSA 314
Query: 81 GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG 140
+ + S H + K+ I D+T G + +++ NYG+
Sbjct: 315 NCSCKIGNLFS----------SSHDVVRKVNELILDIT----GFPSKGNEMVEVINYGIA 360
Query: 141 GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 200
G+Y+ A PR + +F+F L + GG +FPS +L + P KGS + W N +
Sbjct: 361 GNYNPDDTAQPRKHN--KANAFIF-LGNAGKGGEIVFPSRDLKIRPRKGSMIVWENLKKS 417
Query: 201 TLLDYRMYHSGCPVALGNKW 220
+ YH CP+ GN W
Sbjct: 418 VI-----YHQ-CPILKGNLW 431
>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 298
Score = 73.6 bits (179), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 92/213 (43%), Gaps = 34/213 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
I P KV+++ PR + + E + ++ L+K ++R V + G++ + + R S
Sbjct: 32 INPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSS 91
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---- 148
F+ P+ G P + I+ +I T L E +Q+ Y G YD H D
Sbjct: 92 GTFI-PK--GKDPIVSGIEDKISTWTFLPKENGED----IQVLRYEHGQKYDAHFDYFHD 144
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
G R+A+ + YL++V GG T+FP + V P
Sbjct: 145 KVNIVRGGHRIATVLMYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPR 204
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
KG A+ ++N H + + D H GCPV G KW
Sbjct: 205 KGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKW 237
>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 289
Score = 73.2 bits (178), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 60/196 (30%), Positives = 87/196 (44%), Gaps = 24/196 (12%)
Query: 38 LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYF 95
++V DPRV+ + D+E + I+ L+ ++ R V+ G + R S F
Sbjct: 93 VRVVMAMRDPRVIVFSGLLSDAECDEIVALAGARLARSHTVDTATGASEVNAARTSDGMF 152
Query: 96 LYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD------- 148
G+HP + + RI + N + E +G LQ+ +Y G Y H D
Sbjct: 153 F---TRGEHPVCARFEARIAALLNWPV---ENGEG-LQVLHYRPGAEYKPHYDYFDPDQP 205
Query: 149 ATPR--DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWY--NAHANTLLD 204
TP G R+A+ + YL GG T FP + L V P KG AVF+ H +T
Sbjct: 206 GTPAVLRRGGQRVATLVTYLNTPTRGGGTTFPDIGLEVTPLKGHAVFFSYDRPHPST--- 262
Query: 205 YRMYHSGCPVALGNKW 220
R H G PV G+KW
Sbjct: 263 -RSLHGGAPVLEGDKW 277
>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
IL144]
gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Rubrivivax gelatinosus IL144]
Length = 279
Score = 73.2 bits (178), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 84/185 (45%), Gaps = 20/185 (10%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
PRVV + D E + ++ L++ ++ R + V+ G + R S F G+
Sbjct: 92 PRVVVFGGLLSDEECDELVALARPRLARSETVDNSTGGSEVNAARTSDGMFFE---RGEK 148
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
P + +I+ RI ++ + ER +G LQ+ Y G Y H D A G
Sbjct: 149 PLIERIERRIAELVRWPV---ERGEG-LQVLRYRPGAQYKPHHDFFDPAHPGTANILRRG 204
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+ + + YL GGAT FP + L V P KG+AVF+ ++ L R H G PV
Sbjct: 205 GQRVGTVVMYLNTPAGGGATTFPEVGLEVQPVKGNAVFF--SYERPLASTRTLHGGAPVL 262
Query: 216 LGNKW 220
G KW
Sbjct: 263 DGEKW 267
>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 259
Score = 73.2 bits (178), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 97/214 (45%), Gaps = 37/214 (17%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
VE + PR +H+ + D+E + ++EL++ +V R VV+ G++ R S+ FL
Sbjct: 1 VEPISWHPRAFHLHNIMTDAECDEVLELARTRVRRSTVVDSTTGESKVDPIRTSEQCFLN 60
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVI-GREERYKGPLQINNYGLGGHYDLHCDATPRD--- 153
G P + I+ R++ T L E+ P ++ Y G YD H D D
Sbjct: 61 ---RGHFPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLKYSNGQKYDAHHDVGELDTAS 117
Query: 154 ------EGLWRLASFMFYLTDVEL--GGATIFPSL-------------------NLTVFP 186
EG R+A+ + YL+DV+ GG T FP ++ V P
Sbjct: 118 GKQLAAEGGHRVATVLLYLSDVDDDGGGETAFPDSEWIDPTADRGSGWSECAEDHVAVKP 177
Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+KG + +++ ++D + H+GCPV LG W
Sbjct: 178 KKGDGLLFWSITPEGVIDQQSMHAGCPV-LGKSW 210
>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 318
Score = 73.2 bits (178), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 88/211 (41%), Gaps = 31/211 (14%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
I P +V ++ PR + + D E + I L+K K+E+ V + G ++ + R S
Sbjct: 57 IDPTRVTQISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESGKSVESEVRTSS 116
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCD 148
F + ++ RI T L E +QI +Y G H+D D
Sbjct: 117 GMFFRK---AQDQVVANVEARIAAWTFL----PEENGESIQILHYEHGQKYEPHFDYFHD 169
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------TVFPEKGS 190
++ G R+A+ + YL+DVE GG T+FP+ V P KG
Sbjct: 170 KVNQELGGHRVATVLMYLSDVEKGGETVFPNSEAKKTQAKGDDWSDCAKKGYAVKPRKGD 229
Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
A+ +++ H + D H CPV G KW
Sbjct: 230 ALLFFSLHPDATTDPLSLHGSCPVIEGEKWS 260
>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
Length = 291
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 92/206 (44%), Gaps = 32/206 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLY 97
E + PR H+ + +E +I L+K ++++ VV+ G + R S FL
Sbjct: 80 AEVISWKPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDSSTGKSKDSKVRTSSGTFL- 138
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
P G + I+ RI D + + + E LQI +Y +G H+D D
Sbjct: 139 PR--GRDKIVRDIEKRIADFSFIPVEHGE----GLQILHYEVGQRYEPHFDYFMDEYNTK 192
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
G R+A+ + YL+DVE GG T+FPS L+V P+ G A+ +
Sbjct: 193 NGGQRIATVLMYLSDVEEGGETVFPSAEGNISAVPWWNELSECGKGGLSVKPKMGDALLF 252
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ + + D H GCPV GNKW
Sbjct: 253 WSMNPDGSPDPSSLHGGCPVIRGNKW 278
>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
Length = 311
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 95/216 (43%), Gaps = 43/216 (19%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
+E++ PR + + D+E +R+IE + +E +V + G+ D R S ++
Sbjct: 68 IEKISDSPRAYVFREFLTDAECDRVIERAYPTMEASEVTDDDSGEARPDDARSSIGGWVS 127
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE--- 154
+ D + I+ R L + R E +Q+ Y G YD H D DE
Sbjct: 128 GD---DDEVIRNIELRASTWAMLPMNRGE----TMQVLRYEKGQKYDAHDDFF-HDEHNV 179
Query: 155 --GLWRLASFMFYLTDVELGGATIFP------------------------SLN----LTV 184
G R+A+ + YL+DVE GG T+FP S N L V
Sbjct: 180 KNGGQRVATILMYLSDVEEGGETVFPLGTPLGGRDPEKSGVTGDNACELASQNDPRVLAV 239
Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
P +G A+ ++NAH + +D + H+GCPV G KW
Sbjct: 240 KPRRGDALLFFNAHLSGEMDEKANHAGCPVNRGTKW 275
>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 289
Score = 72.8 bits (177), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 88/205 (42%), Gaps = 28/205 (13%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
E + +PR H+ + E +I L+K + + VV+ D+R+ ++
Sbjct: 78 TEIISWEPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSKTGRSKDSRVRTSSGMFLR 137
Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDATPRDEG 155
G + I+ RI D + + I E LQ+ +Y +G HYD D G
Sbjct: 138 -RGRDKIIRNIEKRIADFSFIPIEHGE----GLQVLHYEVGQKYEAHYDYFLDEFNTKNG 192
Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
R A+ + YL+DVE GG T+FP+ L+V P+ G+A+ +++
Sbjct: 193 GQRTATLLMYLSDVEEGGETVFPAAKANISNVPSWNELSECARQGLSVKPKMGNALLFWS 252
Query: 197 AHANTLLDYRMYHSGCPVALGNKWG 221
+ LD H CPV GNKW
Sbjct: 253 TRPDATLDPASLHGSCPVIRGNKWS 277
>gi|224011205|ref|XP_002295377.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209583408|gb|ACI64094.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 207
Score = 72.8 bits (177), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 56/188 (29%), Positives = 82/188 (43%), Gaps = 19/188 (10%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--------SKVYFLYP 98
P VV I + D E NR IEL + ER Y T+ +D +
Sbjct: 9 PWVVAIEGFLSDEECNRFIELGGDRYERS--TEYASTMNLDGTFDSKESSGRTSTNTWCG 66
Query: 99 EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW- 157
E D P + K+ R++ +T + E LQ+ Y +G Y+ H D + EG
Sbjct: 67 EGCRDDPIIKKVIERMESLTGIPYANFE----DLQLVRYEIGQRYEEHHDYSSSHEGTQY 122
Query: 158 --RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA--HANTLLDYRMYHSGCP 213
R+ + FYL DVE GG T F L+ P++G A+ W + A ++D +H P
Sbjct: 123 GPRILTVFFYLNDVEEGGGTQFDELDFVTEPKRGMALIWPSTTNEAPDVMDDWTWHEALP 182
Query: 214 VALGNKWG 221
V G K+G
Sbjct: 183 VTKGIKYG 190
>gi|260806885|ref|XP_002598314.1| hypothetical protein BRAFLDRAFT_204780 [Branchiostoma floridae]
gi|229283586|gb|EEN54326.1| hypothetical protein BRAFLDRAFT_204780 [Branchiostoma floridae]
Length = 282
Score = 72.8 bits (177), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 53/93 (56%), Gaps = 5/93 (5%)
Query: 133 QINNYGLGGHYDLHCDATPRD-----EGLWRLASFMFYLTDVELGGATIFPSLNLTVFPE 187
Q+ NYGLGG Y+ H D + R+ +F+FYL++VE GGAT+F N+ V
Sbjct: 167 QVLNYGLGGQYEPHYDHLKEEVSRTLMAANRILTFLFYLSEVEAGGATVFTEANIAVPVV 226
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
K SAV + N + + H+GCPV +G+KW
Sbjct: 227 KNSAVLFENTNKALVRSRASVHAGCPVLIGSKW 259
>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
vinifera]
Length = 296
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/230 (26%), Positives = 94/230 (40%), Gaps = 33/230 (14%)
Query: 17 IKSNLKCFYESYNNTF-LKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERG 75
I S + F SY + + KV ++ PR + + E + +I L+K +++R
Sbjct: 13 ISSTILEFSSSYADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRS 72
Query: 76 KVVNYGDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMTNLVIGREERYKGPLQ 133
V D + +RLS+V G P + I+ +I T L E +Q
Sbjct: 73 AVA---DNVSGKSRLSEVRTSSGMFIGKGKDPIVAGIEDKIAAWTFLPKDNGED----MQ 125
Query: 134 INNYGLGGHYDLH----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLT------ 183
+ Y G YD H D G R+A+ + YL+DV GG T+FP ++
Sbjct: 126 VLRYEPGQKYDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEVSSSTLPT 185
Query: 184 -------------VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
V P KG A+ +++ H + D H GCPV G KW
Sbjct: 186 NDDLSECARKGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKW 235
>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 91/208 (43%), Gaps = 31/208 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + D+E + +I L+K K+E+ V + G ++ + R S
Sbjct: 32 PSRVVQLSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGM 91
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + + +I+ RI T L E +QI +Y G HYD D
Sbjct: 92 FLEKK---QDEVVTRIEERISAWTFLPPENGE----AIQILHYQNGEKYEPHYDYFHDKN 144
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
+ G R+A+ + YL++VE GG TIFP+ V P KG A+
Sbjct: 145 NQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDAL 204
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
+++ H ++ D H CPV G KW
Sbjct: 205 LFFSLHPDSTTDSDSLHGSCPVIEGQKW 232
>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
Length = 296
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 60/188 (31%), Positives = 84/188 (44%), Gaps = 26/188 (13%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY---GDTIYVDTRLSKVYFLYPEIFGD 103
PRVV + + + E + IIE +K K+ R V G+ + D S ++F G
Sbjct: 109 PRVVVLGNLLSAEECDAIIESAKPKLARSLTVQTATGGEELNADRTSSGMFFTR----GQ 164
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPR--DE 154
P + ++ RI + + E +G LQ+ +Y G Y H D TP
Sbjct: 165 TPEVTAVERRIARLVGWPV---ENGEG-LQVLHYRPGAEYKPHYDYFDPKEAGTPTILKR 220
Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWY--NAHANTLLDYRMYHSGC 212
G R+A+ + YL + GG T FP + L V P KGSAVF+ H T R H G
Sbjct: 221 GGQRVATLVMYLNEPARGGGTTFPDVGLEVAPVKGSAVFFSYDRPHPTT----RSLHGGA 276
Query: 213 PVALGNKW 220
PV G KW
Sbjct: 277 PVLEGEKW 284
>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
Length = 244
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 89/194 (45%), Gaps = 28/194 (14%)
Query: 48 RVVKIHDAIYDSEINRIIELSKGKVER-GKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPF 106
R+ I + D E + I+++S+ ++ER G V G + R S FL G+ P
Sbjct: 1 RIFLIEHFLTDEEADHIVQVSERRLERSGVVATNGGSEESQIRTSFGVFLE---RGEDPV 57
Query: 107 LYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW----RLASF 162
+ ++ RI +T + +G E LQ+ Y YD H D +G+ R A+
Sbjct: 58 VKGVEERISALTLMPVGNGE----GLQVLRYQKEQKYDAHWDYFFHKDGIANGGNRYATV 113
Query: 163 MFYLTDVELGGATIFPSL----------------NLTVFPEKGSAVFWYNAHANTLLDYR 206
+ YL D E GG T+FP++ +L P+KG+A+ +++ L+ +
Sbjct: 114 LMYLVDTEEGGETVFPNIAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERK 173
Query: 207 MYHSGCPVALGNKW 220
H+ CPV G KW
Sbjct: 174 SLHTACPVIKGIKW 187
>gi|221482398|gb|EEE20746.1| prolyl 4-hydroxylase alpha subunit, putative [Toxoplasma gondii
GT1]
gi|221504447|gb|EEE30120.1| prolyl 4-hydroxylase alpha subunit, putative [Toxoplasma gondii
VEG]
Length = 401
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 90/189 (47%), Gaps = 16/189 (8%)
Query: 38 LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTI----YVDTRL-SK 92
+++ ++ +P V I + + DS+ R+++L +G+ ER K T Y ++ S+
Sbjct: 203 IQILAIHENPEVFLIPELLTDSDCERLLQLCEGRWERSKTSTGYATAEPRDYTSSKSPSR 262
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR 152
+ P + +I I+ + + G + PL + Y G ++ LH D
Sbjct: 263 TSWSVPLAIAE----TEIVENIERIVSAFAGMPVEHLEPLVVVRYEEGQYFKLHSD---- 314
Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT-LLDYRMYHSG 211
G +R + + YL DVE GG T F +L V P KG+ V W N++ T +D R+ H+G
Sbjct: 315 --GGFRPKTILLYLNDVEAGGETSFENLGFRVAPMKGAGVLWNNSYPGTNEIDPRLIHAG 372
Query: 212 CPVALGNKW 220
P G K+
Sbjct: 373 LPPEKGVKF 381
>gi|15077349|gb|AAK83137.1| prolyl 4-hydroxylase alpha subunit [Cavia porcellus]
Length = 141
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 77/148 (52%), Gaps = 11/148 (7%)
Query: 3 YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
Y + C+G + + + L C Y N N + P K E+ + PR+++ HD I D+E
Sbjct: 1 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 60
Query: 61 INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
I + +L+K ++ R + N GD V R+SK +L ++P + +I RIQD+T
Sbjct: 61 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLS---GYENPVVSRINMRIQDLT 117
Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLH 146
L + E LQ+ NYG+GG Y+ H
Sbjct: 118 GLDVSTAEE----LQVANYGVGGQYEPH 141
>gi|237841319|ref|XP_002369957.1| 2OG-Fe(II) oxygenase family protein, putative [Toxoplasma gondii
ME49]
gi|211967621|gb|EEB02817.1| 2OG-Fe(II) oxygenase family protein, putative [Toxoplasma gondii
ME49]
Length = 401
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 90/189 (47%), Gaps = 16/189 (8%)
Query: 38 LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTI----YVDTRL-SK 92
+++ ++ +P V I + + DS+ R+++L +G+ ER K T Y ++ S+
Sbjct: 203 IQILAIHENPEVFLIPELLTDSDCERLLQLCEGRWERSKTSTGYATAEPRDYTSSKSPSR 262
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR 152
+ P + +I I+ + + G + PL + Y G ++ LH D
Sbjct: 263 TSWSVPLAIAE----TEIVENIERIVSAFAGMPVEHLEPLVVVRYEEGQYFKLHSD---- 314
Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT-LLDYRMYHSG 211
G +R + + YL DVE GG T F +L V P KG+ V W N++ T +D R+ H+G
Sbjct: 315 --GGFRPKTILLYLNDVEAGGETSFENLGFRVAPMKGAGVLWNNSYPGTNEIDPRLIHAG 372
Query: 212 CPVALGNKW 220
P G K+
Sbjct: 373 LPPEKGVKF 381
>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
Length = 283
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 94/208 (45%), Gaps = 22/208 (10%)
Query: 26 ESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
E L+ G +V+ L L PRV+ + + E + +I L++ +++R V + G
Sbjct: 73 ERNGPALLQAGDRQVQVLASLLHPRVIVFGNLLAAEECDALIALARRQIKRSPVFDPDTG 132
Query: 82 DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
R S+ F G +P +++ RI + N + E +G LQ+ YG G
Sbjct: 133 QDQQHQARTSEGMFFG---RGANPLCARVEARIAALLNWPL---ENGEG-LQVLRYGPGA 185
Query: 142 HYDLHCD----ATPRDE-----GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
Y+ H D A P E G R+AS + YL GGAT FP +L V P KG+AV
Sbjct: 186 QYEPHYDYFDPARPGAEVALRRGGQRVASLVIYLNTPTQGGATTFPDAHLEVAPIKGNAV 245
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
++ + + H G PV G KW
Sbjct: 246 YFSYDRPHPMTG--TLHGGAPVVEGEKW 271
>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
Length = 307
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 89/206 (43%), Gaps = 31/206 (15%)
Query: 39 KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL 96
+V+ + PR+ + D+E + ++ L+K K++R V + G ++ + R S FL
Sbjct: 43 RVKAVSWQPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNQSGKSVMSEVRTSSGMFL 102
Query: 97 YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPR 152
P + +I+ RI T L E +QI Y G H+D D +
Sbjct: 103 NKR---QDPVVSRIEERIAAWTFLPQENAEN----MQILRYEHGQKYEPHFDYFHDKINQ 155
Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
G R A+ + YL+ V+ GG T+FP+ L V P KG AV +
Sbjct: 156 VRGGHRYATVLMYLSTVDKGGETVFPNAKGWESQPKDDTFSECAHQGLAVKPVKGDAVLF 215
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ H + + D H CPV G KW
Sbjct: 216 FSLHVDGVPDPLSLHGSCPVIQGEKW 241
>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
gi|194697650|gb|ACF82909.1| unknown [Zea mays]
gi|194708468|gb|ACF88318.1| unknown [Zea mays]
gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
Length = 308
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 84/207 (40%), Gaps = 30/207 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P ++ PRV + D E N +I L++ +++R V + G + + R S
Sbjct: 48 PHHSRQISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEVRTSSGT 107
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDAT 150
FL G P + I+ +I T L E +Q+ Y G HYD D
Sbjct: 108 FLRK---GQDPIVEGIEDKIAAWTFLPKENGED----IQVLRYKHGEKYEPHYDYFTDNV 160
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFP-----------------SLNLTVFPEKGSAVF 193
G R A+ + YLTDV GG T+FP + V P KG A+
Sbjct: 161 NTVRGGHRYATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALL 220
Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKW 220
++N + + D H GCPV G KW
Sbjct: 221 FFNLNPDGTTDSVSLHGGCPVIKGEKW 247
>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 311
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 91/211 (43%), Gaps = 37/211 (17%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + E + +I+L++ K+E+ V + G +I + R S
Sbjct: 46 PTRVTQLSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESEVRTSSGM 105
Query: 95 FLYP---EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHC 147
F+ EI D I+ RI T L E +QI +Y G H+D
Sbjct: 106 FIAKAQDEIVAD------IEARIAAWTFL----PEENGESMQILHYEHGQKYEPHFDYFH 155
Query: 148 DATPRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKG 189
D ++ G R+A+ + YL++VE GG T+FP+ V PEKG
Sbjct: 156 DKANQELGGHRVATVLMYLSNVEKGGETVFPNAEGKLSQPKEDSWSDCAKGGYAVKPEKG 215
Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
A+ +++ H + D H CPV G KW
Sbjct: 216 DALLFFSLHPDATTDSDSLHGSCPVIEGEKW 246
>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
campestris pv. vesicatoria str. 85-10]
Length = 296
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 93/214 (43%), Gaps = 22/214 (10%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
+ + + + L +G V L L PRVV + + D E + +I L++ ++ R +
Sbjct: 77 RVPALQQDADASLLALGDRDVRVLVSLLLPRVVVLGGFLSDEECDALIALARPRLARSRT 136
Query: 78 VNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
V+ G+ + R S L G +I+ RI + + + E LQ+
Sbjct: 137 VDNANGEHVVHAARTSDSMCLR---LGQDALCQRIEARIARLLDWPVDHGEG----LQVL 189
Query: 136 NYGLGGHYDLHCD-------ATPR--DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFP 186
Y G Y H D TP G R+AS + YL E GGAT FP +L V
Sbjct: 190 RYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRVASLVMYLNTPERGGATRFPDAHLDVAA 249
Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
KG+AVF+ + + R H+G PV G+KW
Sbjct: 250 VKGNAVFFSYDRPHPM--TRSLHAGAPVLAGDKW 281
>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
gi|255644463|gb|ACU22735.1| unknown [Glycine max]
Length = 285
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 63/238 (26%), Positives = 102/238 (42%), Gaps = 38/238 (15%)
Query: 11 LSVPEDIKSNLKCFY--ESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELS 68
LS P D+ S + + E NN + VE + +PR H+ + E +I +
Sbjct: 56 LSKPNDLNSVPRNTHVSEGENNRVKRW----VEVMSWEPRAFLYHNFLTKEECEYLINTA 111
Query: 69 KGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
+ + V++ G+ I R S Y + G + I+ RI D+T + I E
Sbjct: 112 TPNMLKSLVIDNESGEGIETSYRTSTEYVVE---RGKDKIVRNIEKRIADVTFIPIEHGE 168
Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRD----EGLWRLASFMFYLTDVELGGATIFPSLN- 181
PL + Y +G +Y+ H D + G R+A+ + YL++VE GG T+FP N
Sbjct: 169 ----PLHVIRYAVGQYYEPHVDYFEEEFSLVNGGQRIATMLMYLSNVEGGGETVFPIANA 224
Query: 182 ------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
L++ P+ G A+ +++ + LD H CPV GNKW
Sbjct: 225 NFSSVPWWNELSECGQTGLSIKPKMGDALLFWSMKPDATLDPLTLHRACPVIKGNKWS 282
>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
Length = 221
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 83/193 (43%), Gaps = 20/193 (10%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFL 96
P+K+ EL PR+ +I + D E +I+ SK K+ ++ G S
Sbjct: 22 PVKLIELSQAPRIYRIPGFLTDEECEFLIDTSKNKLRPCNEISSG------VHRSGWGLF 75
Query: 97 YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDATPR 152
E DH I +++ N+ E +Q+ Y G H+D T
Sbjct: 76 MKEGEEDHQITKNIFNKMKSFVNISESCE-----VMQVIRYNQGEETSSHFDYFNPLTTN 130
Query: 153 DE---GLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 207
GL+ R+ + + YL DVE GG T FP + + V P KG AV +YN N +D
Sbjct: 131 GSMKIGLYGQRVCTILMYLCDVEEGGETTFPEVGIKVKPIKGDAVLFYNCKPNGDVDPLS 190
Query: 208 YHSGCPVALGNKW 220
H G PV GNKW
Sbjct: 191 LHQGDPVLKGNKW 203
>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
Length = 302
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 82/185 (44%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P V + + E ++IEL++ ++ R VV+ G I R S F G+
Sbjct: 102 PAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNIVAGHRSSDGMFFR---LGET 158
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
P + +I+ RI +T + E +G LQ+ +Y G H D A
Sbjct: 159 PLISRIEQRIAALTGFPV---ENGEG-LQMLHYEAGAESTPHVDYLVPGNPANAESIARS 214
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+ + + YL DVE GG T+FP + +V P +G A ++ + + D H+ P+
Sbjct: 215 GQRVGTLLMYLNDVESGGETLFPQVGCSVVPRRGQAFYFEYGNGSGRSDPASLHASSPIG 274
Query: 216 LGNKW 220
G+KW
Sbjct: 275 SGDKW 279
>gi|47210159|emb|CAF93191.1| unnamed protein product [Tetraodon nigroviridis]
Length = 78
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 38/55 (69%)
Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
++DVE GGAT+FP ++P KG+AVFWYN + DYR H+ CPV +GNKW
Sbjct: 1 MSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGNKW 55
>gi|194751833|ref|XP_001958228.1| GF10815 [Drosophila ananassae]
gi|190625510|gb|EDV41034.1| GF10815 [Drosophila ananassae]
Length = 273
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/64 (51%), Positives = 42/64 (65%)
Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
S + L+DVE GG T+FP LNL V +KGS + WYN +N D R+ H+ CPV +GNKW
Sbjct: 207 SLLKNLSDVEQGGDTVFPHLNLKVPAQKGSLMVWYNLLSNGTTDSRVLHASCPVLMGNKW 266
Query: 221 GKLL 224
K L
Sbjct: 267 SKYL 270
Score = 42.4 bits (98), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 8/66 (12%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
+LKC Y + FLK+ P+K+E+++LDP + HD I + EI+ + LS VE+G
Sbjct: 166 HLKCQYLK-ASPFLKLAPIKMEKVFLDPPMSIYHDLINEKEISLLKNLS--DVEQG---- 218
Query: 80 YGDTIY 85
GDT++
Sbjct: 219 -GDTVF 223
>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
fasciculatum]
Length = 244
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/196 (30%), Positives = 88/196 (44%), Gaps = 20/196 (10%)
Query: 39 KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYP 98
K+ E+ PRV ++ D + +E +I++SK K+ ++ G S
Sbjct: 25 KLIEMSQCPRVYRVPDFLSPAECEHLIDISKNKLRPCNEISSG------VHRSGWGLFMK 78
Query: 99 EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDATPRDE 154
E DH + KI R++ + NL E +Q+ Y G HYD T
Sbjct: 79 EGEEDHDVVKKIFQRMKMLVNLTENCEV-----MQVIRYHPGEETSAHYDYFNPLTTNGA 133
Query: 155 ---GLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYH 209
GL+ R+ + + YL++VE GG T FP + + V P KG AV +YN N +D H
Sbjct: 134 MKIGLYGQRVCTILMYLSEVEEGGETSFPEVGVKVKPVKGDAVLFYNCKPNGEVDPLSLH 193
Query: 210 SGCPVALGNKWGKLLL 225
G PV G KW + L
Sbjct: 194 QGDPVIKGTKWVAIKL 209
>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 274
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 88/206 (42%), Gaps = 31/206 (15%)
Query: 39 KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL 96
+V+ + PR+ + D+E + ++ L+K K++R V + G ++ + R S FL
Sbjct: 44 RVKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSEVRTSSGMFL 103
Query: 97 YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPR 152
P + +I+ RI T L E +Q+ Y G H+D D +
Sbjct: 104 DKR---QDPVVSRIEERIAAWTFLPQENAEN----MQVLRYEPGQKYEPHFDYFHDRVNQ 156
Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
G R A+ + YL+ V GG T+FP+ L V P KG AV +
Sbjct: 157 ARGGHRYATVLMYLSTVREGGETVFPNAKGWESQPKDATFSECAHKGLAVKPVKGDAVLF 216
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ HA+ D H CPV G KW
Sbjct: 217 FSLHADGTPDPLSLHGSCPVIRGEKW 242
>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
Length = 294
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/232 (27%), Positives = 105/232 (45%), Gaps = 26/232 (11%)
Query: 5 LACQGNLSVPEDI--KSNLKCFYESYNNTFLKIGPLKVEEL--YLDPRVVKIHDAIYDSE 60
L QG +S P + S+L + + + + +G +V+ L +PR+V + + E
Sbjct: 61 LPEQGEVSPPAVVVSASSLPEPDLAQDPSSIDVGDRQVQVLVSMRNPRIVVFGNLLSHEE 120
Query: 61 INRIIELSKGKVERGKVV---NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
+ II ++ ++ R V + G+ I D + ++F G+ + +++ RI +
Sbjct: 121 CDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQR----GETGIVSQLEERIARL 176
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPR--DEGLWRLASFMFYLTD 168
+ E LQ+ +YG G Y H D TP G R+ + + YL +
Sbjct: 177 LRWPLDHGEG----LQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRVGTLVIYLNE 232
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGATIFP + L V P +G+AVF+ + R H G PV G KW
Sbjct: 233 PERGGATIFPEVPLQVVPRRGNAVFFSYERPDP--STRTLHGGAPVLAGEKW 282
>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 296
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 85/185 (45%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P V + D + +E ++I L++ ++ R VV+ G + R S F G+
Sbjct: 102 PAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNVVAGHRSSDGMFFR---LGET 158
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDE-----G 155
P + +++ RI ++T L + E +G LQ+ +Y G H D P +
Sbjct: 159 PLIARLEARIAELTGLPV---ENGEG-LQLLHYEAGAESTPHVDYLIAGNPANRESIARS 214
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+ + + YL DVE GG T+FP +V P +G A+++ + L D H+ P+
Sbjct: 215 GQRVGTLLMYLNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLR 274
Query: 216 LGNKW 220
G KW
Sbjct: 275 AGEKW 279
>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
vinifera]
gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/232 (26%), Positives = 93/232 (40%), Gaps = 35/232 (15%)
Query: 17 IKSNLKCFYESYNNTF-LKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERG 75
I S + F SY + + KV ++ PR + + E + +I L+K +++R
Sbjct: 13 ISSTILEFSSSYADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRS 72
Query: 76 KVVNYGDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMTNLVIGREERYKGPLQ 133
V D + +RLS+V G P + I+ +I T L E +Q
Sbjct: 73 AVA---DNVSGKSRLSEVRTSSGMFIGKGKDPIVAGIEDKIAAWTFLPKDNGED----MQ 125
Query: 134 INNYGLGGHYDLH----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------- 181
+ Y G YD H D G R+A+ + YL+DV GG T+FP
Sbjct: 126 VLRYEPGQKYDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEEPSRRKPL 185
Query: 182 -------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ V P KG A+ +++ H + D H GCPV G KW
Sbjct: 186 PTNDDLSECARKGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKW 237
>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
gi|224031897|gb|ACN35024.1| unknown [Zea mays]
gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
Length = 299
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 90/208 (43%), Gaps = 31/208 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + D+E + +I L+K K+E+ V + G ++ + R S
Sbjct: 33 PSRVVQLSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSEVRTSSGM 92
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + + +I+ RI T L E +QI +Y G HYD D
Sbjct: 93 FLERK---QDEVVTRIEERISAWTFLPPENGES----IQILHYQNGEKYEPHYDYFHDKK 145
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
+ G R+A+ + YL++VE GG TIFP+ V P KG A+
Sbjct: 146 NQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDAL 205
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
+++ H + D H CPV G KW
Sbjct: 206 LFFSLHPDATTDSDSLHGSCPVIEGQKW 233
>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
Length = 287
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 57/188 (30%), Positives = 89/188 (47%), Gaps = 24/188 (12%)
Query: 46 DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGD 103
DPRVV + E + ++ L++ ++ R + V+ G + + R S+ F + G+
Sbjct: 99 DPRVVVFGGFLSHDECDALVALAQPRLARSETVDNDTGGSEVNEARTSQGMFF---MRGE 155
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPR--DE 154
+ +I+ RI + + + E +G +Q+ +Y G Y H D TP
Sbjct: 156 GELISRIEARIAALLDWPL---ENGEG-VQVLHYRPGAEYKPHYDYFDPAQPGTPTILKR 211
Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLDYRMYHSGC 212
G R+ + + YL E GG T FP +NL V P KG+AVF + AH +T R H G
Sbjct: 212 GGQRVGTLVMYLNTPERGGGTTFPDVNLEVAPIKGNAVFFSYERAHPST----RSLHGGA 267
Query: 213 PVALGNKW 220
PV G KW
Sbjct: 268 PVLAGEKW 275
>gi|443686890|gb|ELT90009.1| hypothetical protein CAPTEDRAFT_129682, partial [Capitella teleta]
Length = 93
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 48/86 (55%), Gaps = 7/86 (8%)
Query: 133 QINNYGLGGHYDLHCDATPRDE-------GLWRLASFMFYLTDVELGGATIFPSLNLTVF 185
Q +NYG+GGHY+ H D R E R+A+FM Y+ V GGAT+FP + L
Sbjct: 1 QTSNYGIGGHYEPHYDHDERSEVAPEVALSGDRIATFMIYMNHVNAGGATVFPKIGLYAK 60
Query: 186 PEKGSAVFWYNAHANTLLDYRMYHSG 211
PEK +A+FWYN + D H+G
Sbjct: 61 PEKNAAIFWYNYKKSGESDANTLHAG 86
>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 287
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 90/206 (43%), Gaps = 32/206 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
E + +PR H+ + E +I L+K +++ VV+ D+R+ S FL
Sbjct: 76 AEVISWEPRAFVYHNFLTKEECEYLINLAKPNMQKSTVVDSETGRSKDSRVRTSSGTFLS 135
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
G + I+ RI D + + + E LQ+ +Y +G H+D D
Sbjct: 136 R---GRDKKIRDIEKRIADFSFIPVEHGE----GLQVLHYEVGQKYEPHFDYFNDEFNTK 188
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
G R+A+ + YL+DVE GG T+FP+ L+V P G A+ +
Sbjct: 189 NGGQRVATLLMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSVKPNMGDALLF 248
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ + LD H GCPV GNKW
Sbjct: 249 WSMKPDATLDPSSLHGGCPVINGNKW 274
>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
Length = 275
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/207 (25%), Positives = 92/207 (44%), Gaps = 32/207 (15%)
Query: 40 VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
V+ + +PR H+ + E +I ++K + + +V++ G ++ R S FL
Sbjct: 66 VQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSEVIDEKTGKSLNSSIRTSSGTFLD 125
Query: 98 PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
E GD + I+ RI D T + + E + + +Y +G HYD D
Sbjct: 126 RE--GDE-IVSNIEKRIADFTFIPVEHGESF----NVLHYEVGQKYEPHYDYFLDTFSTR 178
Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
R+A+ + YL+DVE GG T+FP+ L++ P+ G+A+ +
Sbjct: 179 HAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAILF 238
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKWG 221
++ + LD H CPV G+KW
Sbjct: 239 WSMKPDATLDPSSLHGACPVIKGDKWS 265
>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 298
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 89/208 (42%), Gaps = 31/208 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + + E + +IEL+K K+E+ V + G ++ + R S
Sbjct: 32 PSRVVQLSWRPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSEVRTSSGM 91
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + +I+ RI T L E +QI +Y G HYD D
Sbjct: 92 FLEKR---QDEVVARIEERIAAWTFLPSENGES----IQILHYKNGEKYEPHYDYFHDKN 144
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
+ G R+A+ + YL++VE GG TIFP+ V P KG A+
Sbjct: 145 NQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLTQHKDETASECAKNGYAVKPMKGDAL 204
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
+++ H + D H CPV G KW
Sbjct: 205 LFFSLHPDATTDPDSLHGSCPVIEGQKW 232
>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
Length = 309
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 88/209 (42%), Gaps = 32/209 (15%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + D+E +I L+K K+E+ V + G ++ + R S
Sbjct: 42 PSRVVQLSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGM 101
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + + +I+ RI T L E +QI +Y G HYD D
Sbjct: 102 FLEKK---QDEVVARIEERIAAWTFLPPDNGES----IQILHYQNGEKYEPHYDYFHDKN 154
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLNL-------------------TVFPEKGSA 191
+ G R+A+ + YL+DV GG TIFP + V P KG A
Sbjct: 155 NQALGGHRIATVLMYLSDVGKGGETIFPEAEVGKLLQPKDDTWSDCAKNGYAVKPVKGDA 214
Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ +++ H + D H CPV G KW
Sbjct: 215 LLFFSLHPDATTDSDSLHGSCPVIEGQKW 243
>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
Length = 225
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 91/198 (45%), Gaps = 30/198 (15%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPF 106
PR +H+ + D E + +I ++ +++ VV+ D+R+ ++ G
Sbjct: 21 PRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSRVRTSSGMFLN-RGQDRV 79
Query: 107 LYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLAS 161
+ +I+ +I +T + E +Q+ +Y G YD H D R+ G R+A+
Sbjct: 80 ISEIEDKIAKLTFIPKDHGE----GIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQ-RIAT 134
Query: 162 FMFYLTDVELGGATIFPS-------------------LNLTVFPEKGSAVFWYNAHANTL 202
+ YLTDVE GG T+FP ++V P++G A+ +++ +
Sbjct: 135 LLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQ 194
Query: 203 LDYRMYHSGCPVALGNKW 220
LD+ H GCPV G+KW
Sbjct: 195 LDHSSLHGGCPVIKGDKW 212
>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
Length = 296
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 87/185 (47%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
P V + D + E ++I L++ +++R VV+ G + R S F G+
Sbjct: 102 PAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGRNVVAGHRSSHGMFFR---LGET 158
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGL--- 156
P + +I+ RI +T + E +G LQ+ +Y G H D E +
Sbjct: 159 PLIVRIEARIAALTGTPV---ENGEG-LQMLHYEEGAESTPHVDYLITGNEANRESIARS 214
Query: 157 -WRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+ + + YL DVE GG T+FP + +V P++G A+++ + L D H+ P+
Sbjct: 215 GQRMGTLLMYLKDVEGGGETVFPQIGWSVAPQRGHALYFEYGNRFGLCDPSSLHASTPLR 274
Query: 216 LGNKW 220
+G+KW
Sbjct: 275 VGDKW 279
>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
Length = 246
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 81/185 (43%), Gaps = 31/185 (16%)
Query: 60 EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
E + +I L K K+E+ V + G ++ + R S FL E D + +I+ RI
Sbjct: 8 ECDHLIALGKDKLEKSMVADNESGKSVMSEIRTSSGMFL--ERRQDET-ITRIEKRIAAW 64
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLH----CDATPRDEGLWRLASFMFYLTDVELGG 173
T L E P+QI +Y G YD H D + G R+A+ + YL+DV+ GG
Sbjct: 65 TFLP----EENGEPIQILHYEKGQKYDAHYDYFHDKNNQRVGGHRMATVLMYLSDVKKGG 120
Query: 174 ATIFPSLN------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
T+FP V P KG A+ +++ H N D H+ CPV
Sbjct: 121 ETVFPDAEGKLLQVKDDTWSDCARSGYAVKPRKGDALLFFSCHPNATTDPNSLHASCPVI 180
Query: 216 LGNKW 220
G KW
Sbjct: 181 EGEKW 185
>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
Length = 269
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 90/207 (43%), Gaps = 19/207 (9%)
Query: 32 FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDT-----IYV 86
L+IG +K E L PR++ +H + E + +I ++ ++ + VV+ I
Sbjct: 52 LLRIGLVKPEVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIES 111
Query: 87 DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH 146
R S FL +P + I+ RI + + + E + N H+D
Sbjct: 112 KVRTSTGMFL-SNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYF 170
Query: 147 CDATPRDEGLWRLASFMFYLTDVELGGATIFPSL-------------NLTVFPEKGSAVF 193
D G R+A+ + YL+DVE GG TIFPS+ L V P KG A+
Sbjct: 171 SDQFNLKRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDAIL 230
Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKW 220
+++A + +D H GC V G KW
Sbjct: 231 FWSAALDGNVDSNSLHGGCSVLRGEKW 257
>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 254
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 89/212 (41%), Gaps = 38/212 (17%)
Query: 39 KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFL 96
+VE L PR + DA+ +++ ++ ++ +V R VV+ G++ R SK FL
Sbjct: 2 RVEPLSWYPRAFALRDALTEAQCEAVLRATRARVRRSTVVDSVTGESKVDPIRTSKQTFL 61
Query: 97 YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD--- 153
D + +I + +T L E +Q+ Y +G YD H D D
Sbjct: 62 N----RDEEVVREIYDALSAVTMLPWTHNED----MQVLEYRVGEKYDAHEDVGAEDSLS 113
Query: 154 ------EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEK 188
+G R+A+ + YL + E GG T FP + + P +
Sbjct: 114 GRELSKDGGKRVATVLLYLEEPEAGGETAFPDSEWIDPKMAEGTSWSKCAEHRVAMKPRR 173
Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G + +++ N +D+R H GCPV G KW
Sbjct: 174 GDGLIFWSVDPNGKIDHRALHVGCPVVAGVKW 205
>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
Length = 213
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 91/198 (45%), Gaps = 30/198 (15%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPF 106
PR +H+ + D E + +I ++ +++ VV+ D+R+ ++ G
Sbjct: 9 PRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSRVRTSSGMFLN-RGQDRV 67
Query: 107 LYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLAS 161
+ +I+ +I +T + E +Q+ +Y G YD H D R+ G R+A+
Sbjct: 68 ISEIEDKIAKLTFIPKDHGE----GIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQ-RIAT 122
Query: 162 FMFYLTDVELGGATIFPS-------------------LNLTVFPEKGSAVFWYNAHANTL 202
+ YLTDVE GG T+FP ++V P++G A+ +++ +
Sbjct: 123 LLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQ 182
Query: 203 LDYRMYHSGCPVALGNKW 220
LD+ H GCPV G+KW
Sbjct: 183 LDHSSLHGGCPVIKGDKW 200
>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
Length = 294
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 62/232 (26%), Positives = 106/232 (45%), Gaps = 26/232 (11%)
Query: 5 LACQGNLSVPEDI--KSNLKCFYESYNNTFLKIGPLKVEEL--YLDPRVVKIHDAIYDSE 60
L QG+++ P + S+L + + + + +G +V+ L +PR+V + + E
Sbjct: 61 LPEQGDVAPPAVVISASSLPEPDLAQDPSSIDVGDRQVQVLVSMRNPRIVVFGNLLSHEE 120
Query: 61 INRIIELSKGKVERGKVV---NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
+ II ++ ++ R V + G+ I D + ++F G+ + +++ RI +
Sbjct: 121 CDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQR----GETGIVSQLEERIARL 176
Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPR--DEGLWRLASFMFYLTD 168
+ E LQ+ +YG G Y H D TP G R+ + + YL +
Sbjct: 177 LRWPLDHGEG----LQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRVGTLVIYLNE 232
Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
E GGATIFP + L V P +G+AVF+ + R H G PV G KW
Sbjct: 233 PERGGATIFPEVPLQVVPRRGNAVFFSYERPDP--STRTLHGGAPVLAGEKW 282
>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
Length = 285
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/185 (29%), Positives = 85/185 (45%), Gaps = 18/185 (9%)
Query: 47 PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDH 104
P++V + + E + +I+ S K+E+ VN G + R S + G+
Sbjct: 96 PQIVVFGNVLDQDECDEMIQRSMHKLEQSTTVNAETGTQEVIRHRTSHGTWFQ---NGED 152
Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
+ +I+TR+ + N + E +G LQ+ Y GG Y H D T G
Sbjct: 153 ALIRRIETRLAALMNCPV---ENGEG-LQVLRYTPGGEYRSHYDYFQPTAAGSLTHVRTG 208
Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
R+A+ + YL DV GG T+FP ++V P +G AV++ + LD H+G PV
Sbjct: 209 GQRVATLIVYLNDVPSGGETVFPEAGISVVPRRGDAVYFRYMNRLRQLDPATLHAGAPVR 268
Query: 216 LGNKW 220
G KW
Sbjct: 269 DGEKW 273
>gi|448930198|gb|AGE53763.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus IL-3A]
gi|448931603|gb|AGE55164.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus MA-1E]
Length = 239
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 97/189 (51%), Gaps = 26/189 (13%)
Query: 51 KIHDAIYDSEINRIIE--LSKG--KVERGKVVNYGDTIYVD--TRLSKVYFLYPEIFGDH 104
++HD + D+E + +I + KG K E G + D I +D +R S+ + P G+H
Sbjct: 54 ELHDFLSDAECDVLINAAIKKGLIKSEVGGATD-DDPIKLDPKSRNSEQTWFTP---GEH 109
Query: 105 PFLYKIQTRIQDMTNLVIGREERYK-GPLQINNYGLGGHYDLH-----CD-ATPRDEGLW 157
+ KIQ + +++ N ++Y +Q+ Y G +Y H CD A P+D+
Sbjct: 110 KIIDKIQKKTRELLNSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPKDQ--- 166
Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR-MYHSGCPV-- 214
RLA+ M YL + GG T FP+LN V P+KG AVF++ A T Y+ H+G PV
Sbjct: 167 RLATLMVYLKAPKEGGETDFPTLNTQVLPKKGKAVFFWVADPATRKLYKETLHAGLPVKN 226
Query: 215 ---ALGNKW 220
+ N+W
Sbjct: 227 GVKVIANQW 235
>gi|448927821|gb|AGE51393.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus CviKI]
Length = 239
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 97/189 (51%), Gaps = 26/189 (13%)
Query: 51 KIHDAIYDSEINRIIE--LSKG--KVERGKVVNYGDTIYVD--TRLSKVYFLYPEIFGDH 104
++HD + D+E + +I + KG K E G + D I +D +R S+ + P G+H
Sbjct: 54 ELHDFLSDAECDVLINAAIKKGLIKSEVGGATD-DDPIKLDPKSRNSEQTWFTP---GEH 109
Query: 105 PFLYKIQTRIQDMTNLVIGREERYK-GPLQINNYGLGGHYDLH-----CD-ATPRDEGLW 157
+ KIQ + +++ N ++Y +Q+ Y G +Y H CD A P+D+
Sbjct: 110 KIIDKIQKKTRELLNSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPKDQ--- 166
Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR-MYHSGCPV-- 214
RLA+ M YL + GG T FP+LN V P+KG AVF++ A T Y+ H+G PV
Sbjct: 167 RLATLMVYLKAPKEGGETDFPTLNTQVLPKKGKAVFFWVADPATRKLYKETLHAGLPVKN 226
Query: 215 ---ALGNKW 220
+ N+W
Sbjct: 227 GVKVIANQW 235
>gi|448928822|gb|AGE52391.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus CvsA1]
Length = 239
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 63/189 (33%), Positives = 97/189 (51%), Gaps = 26/189 (13%)
Query: 51 KIHDAIYDSEINRIIE--LSKG--KVERGKVVNYGDTIYVD--TRLSKVYFLYPEIFGDH 104
++HD + D+E + +I + KG K E G + D I +D +R S+ + P G+H
Sbjct: 54 ELHDFLSDAECDVLINAAIKKGLIKSEVGGATD-DDPIKLDPKSRNSEQTWFTP---GEH 109
Query: 105 PFLYKIQTRIQDMTNLVIGREERYK-GPLQINNYGLGGHYDLH-----CD-ATPRDEGLW 157
+ KIQ + +++ N ++Y +Q+ Y G +Y H CD A P+D+
Sbjct: 110 EVIDKIQNKTRELLNNKKHCIDKYIFEDVQVARYKPGQYYYHHYDGDDCDDACPKDQ--- 166
Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR-MYHSGCPV-- 214
RLA+ M YL E GG T FP+LN V P+KG AVF++ A T Y+ H+G PV
Sbjct: 167 RLATLMVYLKAPEEGGETDFPTLNTQVLPKKGKAVFFWVADPATRKLYKETLHAGLPVKN 226
Query: 215 ---ALGNKW 220
+ N+W
Sbjct: 227 GVKVIANQW 235
>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
Length = 286
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 98/217 (45%), Gaps = 22/217 (10%)
Query: 17 IKSNLKCFYESYNNTFLKIGPLKVEELYLD--PRVVKIHDAIYDSEINRIIELSKGKVER 74
+ + + + + L +G +V L PRV+ + + D+E + +I L++ ++ R
Sbjct: 64 VPVRVPALLQDSDASLLDLGDRQVHVLMRMQLPRVMVLGGFLSDAECDAMIALAQPRLAR 123
Query: 75 GKVVNYGDTIYV--DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
+ V+ + +V R S L G +I+ RI + + + E +G L
Sbjct: 124 SRTVDNANGAHVVHAARTSDSMCLQ---LGQDALCQRIEARIARLLDWPV---ENGEG-L 176
Query: 133 QINNYGLGGHYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELGGATIFPSLNLT 183
Q+ YG G Y H D TP G R+AS + YL + GGAT FP ++L
Sbjct: 177 QVLRYGTGAEYQPHYDYFDPDAAGTPVLLQAGGQRVASLVMYLNTPDRGGATRFPDVHLD 236
Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+ KG+AVF+ + + R H+G PV G KW
Sbjct: 237 IAAIKGNAVFFSYDRPHPM--TRSLHAGAPVLAGEKW 271
>gi|448924767|gb|AGE48348.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus AN69C]
gi|448933638|gb|AGE57193.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus NE-JV-4]
Length = 239
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 97/189 (51%), Gaps = 26/189 (13%)
Query: 51 KIHDAIYDSEINRIIE--LSKG--KVERGKVVNYGDTIYVD--TRLSKVYFLYPEIFGDH 104
++HD + D+E + +I + KG K E G + D I +D +R S+ + P G+H
Sbjct: 54 ELHDFLSDAECDILINAAIKKGLIKSEVGGATD-DDPIKLDPKSRNSEQTWFTP---GEH 109
Query: 105 PFLYKIQTRIQDMTNLVIGREERYK-GPLQINNYGLGGHYDLH-----CD-ATPRDEGLW 157
+ KIQ + +++ + ++Y +Q+ Y G +Y H CD A P+D+
Sbjct: 110 KIIDKIQNKTRELLDSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPKDQ--- 166
Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR-MYHSGCPV-- 214
RLA+ M YL E GG T FP+LN V P+KG AVF++ A T Y+ H+G PV
Sbjct: 167 RLATLMVYLKAPEEGGETDFPTLNTQVLPKKGKAVFFWVADPATRKLYKETLHAGLPVKN 226
Query: 215 ---ALGNKW 220
+ N+W
Sbjct: 227 GVKVIANQW 235
>gi|260802724|ref|XP_002596242.1| hypothetical protein BRAFLDRAFT_117983 [Branchiostoma floridae]
gi|229281496|gb|EEN52254.1| hypothetical protein BRAFLDRAFT_117983 [Branchiostoma floridae]
Length = 527
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 81/155 (52%), Gaps = 14/155 (9%)
Query: 1 EIYPLACQGN----LSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDA 55
EIY L CQ ++ +LKC Y + NN L + P K+E+++ P++ H+
Sbjct: 306 EIYELLCQAEQPDMFNITPSRAKHLKCRYFTNNNHPRLLLAPQKLEQVFDKPKMWIFHNI 365
Query: 56 IYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
+ D E+ I +L++ ++ R + N G+ + R+SK +L +H + ++ R
Sbjct: 366 LTDPEMKVIKDLAQPRLRRATIQNSITGELEHASYRISKSAWLQG---WEHKVIRRVNQR 422
Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD 148
++D+T L + E LQ+ NYG+GGHY+ H D
Sbjct: 423 VEDVTGLTMETAEE----LQVVNYGMGGHYEPHFD 453
>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
Length = 274
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 90/213 (42%), Gaps = 38/213 (17%)
Query: 38 LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYF 95
+ VE L PR + +A+ ++E+ I+ L++ +V R V++ G ++ R SK F
Sbjct: 7 IAVEPLSWYPRAFALRNALDETEMRAILALARTRVARSTVIDSESGKSVVNPIRTSKQTF 66
Query: 96 LYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR--- 152
L + P + K+ R+ +T+L E LQ+ Y G YD H D
Sbjct: 67 LS----RNDPVVRKVLERMSSVTHLPWYHCED----LQVLEYSAGEKYDAHEDVGEEGTK 118
Query: 153 ------DEGLWRLASFMFYLTDVELGGATIFPS-------------------LNLTVFPE 187
G R+A+ + YL + E GG T FP + + P
Sbjct: 119 SGDQLSKNGGKRVATILLYLEEPEEGGETAFPDSEWIDPERAKTETWSKCAHRRVAMKPT 178
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+G + +++ + +D+R H GCP G KW
Sbjct: 179 RGDGLMFWSVRPDGTIDHRALHVGCPPTRGTKW 211
>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
gi|194694488|gb|ACF81328.1| unknown [Zea mays]
gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 58/208 (27%), Positives = 90/208 (43%), Gaps = 31/208 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + D+E + +I L+K K+E+ V + G ++ + R S
Sbjct: 32 PSRVVQLSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGM 91
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + + +I+ RI T L E +QI +Y G HYD D
Sbjct: 92 FLEKK---QDEVVTRIEERISAWTFLPPENGE----AIQILHYQNGEKYEPHYDYFHDKN 144
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
+ G R+A+ + YL++VE GG TIFP+ V P KG A+
Sbjct: 145 NQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDAL 204
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
+++ H ++ D H CP G KW
Sbjct: 205 LFFSLHPDSTTDSDSLHGSCPAIEGQKW 232
>gi|198417608|ref|XP_002125299.1| PREDICTED: similar to prolyl-4-hydroxylase-alpha EFB CG31022-PA
[Ciona intestinalis]
Length = 471
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 31/64 (48%), Positives = 43/64 (67%)
Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALG 217
R+A+ + YL++V+ GG+T F N+ P KGSAVFWYN + + LD R H+ CPV +G
Sbjct: 381 RIATALVYLSEVQKGGSTAFFYPNIVAEPIKGSAVFWYNLYPSGALDKRTLHAACPVLIG 440
Query: 218 NKWG 221
NKW
Sbjct: 441 NKWA 444
>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
Length = 297
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 89/213 (41%), Gaps = 34/213 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSK 92
I P KV+++ PR + E + +I L+K +++R V + GD+ + R S
Sbjct: 31 INPSKVKQVSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVADNLPGDSKLSEVRTSS 90
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
F+ + P + I+ +I T L E +Q+ Y G YD H D
Sbjct: 91 GMFISKK---KDPIVAGIEDKISAWTFLPKENGED----MQVLRYEHGQKYDPHYDYFTD 143
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFP---------------------SLNLTVFPE 187
G R+A+ + YLT+V GG T+FP + V P
Sbjct: 144 KVNIVRGGHRMATVLLYLTNVTRGGETVFPVAEEPPRRRGLETNSDLSECAKKGIAVKPR 203
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+G A+ +++ H + D H+GCPV G KW
Sbjct: 204 RGDALLFFSLHTTAIPDTDSLHAGCPVIEGEKW 236
>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
Length = 313
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 82/186 (44%), Gaps = 28/186 (15%)
Query: 56 IYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
+ + E + I+ L+K +ER VV+ G + D R SK FL G + I+ R
Sbjct: 48 LTEEECDHIVALAKPHLERSGVVDTATGGSEISDIRTSKGMFLE---RGHDDTVAAIEER 104
Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYD--LHCDATPRDEGLWRLASFMFYLTDVEL 171
I T L +G E LQ+ NY G YD G R A+ + YL VE
Sbjct: 105 IARWTLLPVGNGEG----LQVLNYHPGEKYDDYFFDKVNGESNGGNRYATVLMYLNTVEE 160
Query: 172 GGATIFPSL-----------------NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
GG T+FP++ +L P KGSAV +++ + L+ R H+ CPV
Sbjct: 161 GGETVFPNIPAPGGDNGPTFTECARRHLAAKPTKGSAVLFHSIKPSGDLERRSLHTACPV 220
Query: 215 ALGNKW 220
G KW
Sbjct: 221 VKGEKW 226
>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 319
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 92/213 (43%), Gaps = 30/213 (14%)
Query: 31 TFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV-NYGDTIYVDTR 89
+ + I P +V +L PR + E +I +KGK+ + V G ++ R
Sbjct: 51 SAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVTSKER 110
Query: 90 LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD- 148
S FL+ + +I++RI T L + E P+QI Y G Y+ H D
Sbjct: 111 TSTGMFLHK---AQDEIVARIESRIAAWTFLPLDNGE----PIQILRYENGQKYEPHFDF 163
Query: 149 -ATPRDE--GLWRLASFMFYLTDVELGGATIFPS------------------LNLTVFPE 187
P + G R+A+ + YL++VE GG T+FP+ + V P+
Sbjct: 164 FQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLSECGKVGYGVRPK 223
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
G A+ +++ + N D YH CPV G KW
Sbjct: 224 LGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKW 256
>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
Length = 188
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/182 (29%), Positives = 81/182 (44%), Gaps = 32/182 (17%)
Query: 64 IIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
+I L+K + + VV+ G ++ R S FL G + I+ RI D +
Sbjct: 1 MINLAKPHMAKSSVVDSQTGKSVGSRVRTSSGMFLKR---GKDKVIQTIEKRIADFAFIP 57
Query: 122 IGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIF 177
+ E LQ+ +Y +G HYD D G R+A+ + YL+DVE GG TIF
Sbjct: 58 VENGEG----LQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIF 113
Query: 178 PSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGN 218
P+ L+V P++G A+ +++ + LD H GCPV GN
Sbjct: 114 PAAKANFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGN 173
Query: 219 KW 220
KW
Sbjct: 174 KW 175
>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Glycine max]
Length = 301
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/213 (25%), Positives = 93/213 (43%), Gaps = 34/213 (15%)
Query: 35 IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
I P KV+++ PR + + E + +I ++K +++R V + G++ + R S
Sbjct: 35 IDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSS 94
Query: 93 VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
F+ P+ P + ++ +I T L E +Q+ Y G YD H D
Sbjct: 95 GMFI-PK--NKDPIVAGVEDKISSWTLLPKENGED----IQVLRYEHGQKYDPHYDYFAD 147
Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
G R+A+ + YLTDV GG T+FP+ + V P
Sbjct: 148 KVNIARGGHRVATVLMYLTDVTKGGETVFPNAEESPRHRGSETKEDLSECAQKGIAVKPR 207
Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+G A+ +++ + N + D H+GCPV G KW
Sbjct: 208 RGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKW 240
>gi|284035817|ref|YP_003385747.1| 2OG-Fe(II) oxygenase [Spirosoma linguale DSM 74]
gi|283815110|gb|ADB36948.1| 2OG-Fe(II) oxygenase [Spirosoma linguale DSM 74]
Length = 328
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 83/176 (47%), Gaps = 14/176 (7%)
Query: 50 VKIHDAIYDS-EINRIIELSKGKV--ERGKVVNYGDTI-YVDTRLSKVYFLYPEIFGDHP 105
V+IH + + E II+ ++ K R ++ +T+ DTR S FL HP
Sbjct: 152 VQIHPHFFSADECAYIIQYAEEKTLFTRSQLEYDDNTVNESDTRTSYSAFLKDR---QHP 208
Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFY 165
I R+ + + Y PLQ YG G + H D+ + RL + + Y
Sbjct: 209 VFQAIYERVAASLKVDLN----YIEPLQCVRYGEGQQFKPHFDSMSANH---RLHTMLVY 261
Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
L D +GG T FP LN+ V P++GSA+++ N N LL H+G P+A G K+
Sbjct: 262 LNDDFVGGETYFPELNMNVHPKRGSALYFLNRDDNNLLLLNSVHAGLPIAQGMKYA 317
>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
Length = 224
Score = 70.9 bits (172), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 98/226 (43%), Gaps = 26/226 (11%)
Query: 6 ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
A G SVPE + + + + + + PRVV + + E + ++
Sbjct: 2 APAGAASVPEPALAGAPGVLRAGDREVHVLATMAL------PRVVVFGGLLSEQECDELV 55
Query: 66 ELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
L++ ++ R + V+ G + R S F G+ P + +I+ RI ++ + +
Sbjct: 56 ALAQPRLLRSETVDNSTGGSEVNAARTSDGMFFE---RGETPLIERIERRIAELVHWPV- 111
Query: 124 REERYKGPLQINNYGLGGHYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGA 174
ER +G LQ+ +Y G Y H D A G R+ + + YL GGA
Sbjct: 112 --ERGEG-LQVLHYRPGAQYKPHHDFFDPAHPGTANILRRGGQRVGTVVIYLNTPAGGGA 168
Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
T FP + L V P KG+AVF+ ++ L R H G PV G KW
Sbjct: 169 TTFPEVGLEVQPIKGNAVFF--SYERPLASTRTLHGGAPVLDGEKW 212
>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
Length = 487
Score = 70.9 bits (172), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 87/206 (42%), Gaps = 31/206 (15%)
Query: 39 KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL 96
+V + PRV + D E + +++L K K++R V + G ++ + R S FL
Sbjct: 56 RVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFL 115
Query: 97 YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPR 152
P + +I+ RI T L E +QI Y G H+D D +
Sbjct: 116 DKR---QDPVVSRIEKRIAAWTFL----PEENAENIQILRYEHGQKYEPHFDYFHDKVNQ 168
Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
G R A+ + YL+ VE GG T+FP+ L V P KG AV +
Sbjct: 169 ALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDAVLF 228
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ H + + D H CPV G KW
Sbjct: 229 FSLHIDGVPDPLSLHGSCPVIEGEKW 254
>gi|321466507|gb|EFX77502.1| hypothetical protein DAPPUDRAFT_25542 [Daphnia pulex]
Length = 92
Score = 70.9 bits (172), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/92 (43%), Positives = 54/92 (58%), Gaps = 5/92 (5%)
Query: 134 INNYGLGGHYDLHCD--ATPRDEGLWR--LASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
I +YG+GGH+ HCD R E LA+ + YL +VE GGAT+FP + V P KG
Sbjct: 1 ILSYGVGGHFSPHCDYIRNKRIEAKTGNILATLIIYLNEVENGGATVFPIVKTRVKPVKG 60
Query: 190 SAVFWYNAHA-NTLLDYRMYHSGCPVALGNKW 220
SA+FWYN + N + H+ CP+ G+KW
Sbjct: 61 SALFWYNLNPDNGEGNPTTLHASCPILSGSKW 92
>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
Length = 322
Score = 70.9 bits (172), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/216 (27%), Positives = 98/216 (45%), Gaps = 38/216 (17%)
Query: 39 KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYP 98
++E + PR + +H + SE + +I L++ ++E KVV+ + +D+ ++
Sbjct: 14 RIELVSWKPRALLLHGFLAHSECDHMISLAEARLEPSKVVSRDGSGKLDSVRTRQGLSSS 73
Query: 99 EIF---GDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCD--- 148
F + ++ RI+ T+L E+ LQ+ Y LG HYD+H
Sbjct: 74 GTFLTKRQDSVVAGVEDRIELATHLPFSHSEQ----LQVLKYELGQKYSAHYDVHGSNEQ 129
Query: 149 ---ATPRDE-GLWRLASFMFYLTDVELGGATIFP-------------------SLNLTVF 185
A R E G R A+ + YL+DVE GG T FP S + V
Sbjct: 130 AQLAIRRGEQGGSRYATMLMYLSDVEEGGETSFPHGRWIDEGAQAQPPYSECGSRGVAVK 189
Query: 186 PEKGSAVFWYNAHAN-TLLDYRMYHSGCPVALGNKW 220
P KG A+ +Y+ ++ D+ H+GCPVA G K+
Sbjct: 190 PRKGDAILFYSLKSDGQSKDFFSLHAGCPVAKGVKY 225
>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
Length = 286
Score = 70.9 bits (172), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 92/214 (42%), Gaps = 22/214 (10%)
Query: 20 NLKCFYESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
+ + + + L +G V L L PRVV + + D E + +I L++ + R +
Sbjct: 67 RVPALQQDADASLLALGDRDVRVLVSLLLPRVVVLGGFLSDEECDALIALARPHLARSRT 126
Query: 78 VNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
V+ G+ + R S L G +I+ RI + + + E LQ+
Sbjct: 127 VDNANGEHVVHAARTSDSMCLR---LGQDALCQRIEARIARLLDWPVDHGEG----LQVL 179
Query: 136 NYGLGGHYDLHCD-------ATPR--DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFP 186
Y G Y H D TP G R+AS + YL E GGAT FP +L V
Sbjct: 180 RYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRVASLVMYLNTPERGGATRFPDAHLDVAA 239
Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
KG+AVF+ + + R H+G PV G+KW
Sbjct: 240 VKGNAVFFSYDRPHPM--TRSLHAGAPVLAGDKW 271
>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
Length = 316
Score = 70.9 bits (172), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 88/208 (42%), Gaps = 31/208 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + + E + +I L+K K+E+ V + G +I + R S
Sbjct: 51 PTRVTQLSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVADNESGKSIMSEVRTSSGM 110
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + + I+ RI T L + E +QI +Y G H+D D
Sbjct: 111 FL---LKAQDEIVADIEARIAAWTFLPVENGES----IQILHYENGEKYEPHFDYFHDKV 163
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
+ G R+A+ + YL VE GG T+FP+ V P+KG A+
Sbjct: 164 NQLLGGHRIATVLMYLATVEEGGETVFPNSEGRFSQPKDDSWSDCAKKGYAVNPKKGDAL 223
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
+++ H + D H CPV G KW
Sbjct: 224 LFFSLHPDATTDPSSLHGSCPVIAGEKW 251
>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
Length = 308
Score = 70.9 bits (172), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 87/208 (41%), Gaps = 31/208 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + D+E +I L+K K+E+ V + G ++ + R S
Sbjct: 42 PSRVVQLSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGM 101
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + + +I+ RI T L E +QI +Y G HYD D
Sbjct: 102 FLEKK---QDEVVARIEERIAAWTFLPPDNGES----IQILHYQNGEKYEPHYDYFHDKN 154
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
+ G R+A+ + YL+DV GG TIFP V P KG A+
Sbjct: 155 NQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDAL 214
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
+++ H + D H CPV G KW
Sbjct: 215 LFFSLHPDATTDSDSLHGSCPVIEGQKW 242
>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
Length = 308
Score = 70.9 bits (172), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 87/208 (41%), Gaps = 31/208 (14%)
Query: 37 PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
P +V +L PR + D+E +I L+K K+E+ V + G ++ + R S
Sbjct: 42 PSRVVQLSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGM 101
Query: 95 FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
FL + + +I+ RI T L E +QI +Y G HYD D
Sbjct: 102 FLEKK---QDEVVARIEERIAAWTFLPPDNGES----IQILHYQNGEKYEPHYDYFHDKN 154
Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
+ G R+A+ + YL+DV GG TIFP V P KG A+
Sbjct: 155 NQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDAL 214
Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
+++ H + D H CPV G KW
Sbjct: 215 LFFSLHPDATTDSDSLHGSCPVIEGQKW 242
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.143 0.443
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,778,808,045
Number of Sequences: 23463169
Number of extensions: 158162085
Number of successful extensions: 358937
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1390
Number of HSP's successfully gapped in prelim test: 503
Number of HSP's that attempted gapping in prelim test: 354759
Number of HSP's gapped (non-prelim): 2047
length of query: 227
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 90
effective length of database: 9,144,741,214
effective search space: 823026709260
effective search space used: 823026709260
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 74 (33.1 bits)