BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy8177
(312 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
vitripennis]
Length = 556
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 176/282 (62%), Positives = 218/282 (77%), Gaps = 17/282 (6%)
Query: 2 IFPTHQRAQGNKLYYQEALNK---------SPELKDEPPKVNN--------VAPTLEVTE 44
+ PTHQRA GN+ YYQE + K + ++ P + + E+TE
Sbjct: 242 LVPTHQRALGNRAYYQEEIQKRTNESRRKRGEDGSEDTPAADQHFTVTEKKIKSVSEMTE 301
Query: 45 REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
RE+YEMLCRG++ +P +I +L+CRYV R +P+L++ P KEEEAYL PRI++Y DV+YD
Sbjct: 302 RERYEMLCRGEIKMPLSIQKELRCRYVDRGIPFLKIAPFKEEEAYLDPRIVIYHDVIYDD 361
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
EI+ IK+MAQPR +RATVQNYKTGELEIANYRISKSAWL+E EH + +S+RVEHMT +
Sbjct: 362 EIETIKRMAQPRFKRATVQNYKTGELEIANYRISKSAWLQEHEHKHVRAVSQRVEHMTSM 421
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
+ TAEELQVVNYGIGGHYEPH+DFAR E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 422 SIETAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRIATVLYYMSDVEQGGGTV 481
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
FT +N+SLWP+KG+AAFW+NL +G+GDY TRHAACPVLTGS
Sbjct: 482 FTKINISLWPKKGSAAFWYNLKPNGEGDYKTRHAACPVLTGS 523
>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
Length = 537
Score = 383 bits (983), Expect = e-104, Method: Compositional matrix adjust.
Identities = 175/282 (62%), Positives = 220/282 (78%), Gaps = 17/282 (6%)
Query: 2 IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
+ PTH+RA GN+ YYQ+ + ++S + + E + + P E+TE
Sbjct: 223 LVPTHERALGNRAYYQKEIQSKASQSKKKRGEDGQDDTAVPAQHFTVVEERVKTLDEMTE 282
Query: 45 REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
RE+YEMLCRG++T+PP + LKCRYV R +P+L++ P KEEEAYL PRI++Y +V+YD
Sbjct: 283 RERYEMLCRGEVTIPPEVQKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDD 342
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH + +SRRVEHMT +
Sbjct: 343 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSRRVEHMTSM 402
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
T TAEELQVVNYGIGGHYEPH+DFAR E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 403 TVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 462
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
FT++N++LWP+KG+AAFW+NL +G+GD+ TRHAACPVLTGS
Sbjct: 463 FTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGS 504
>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
impatiens]
Length = 557
Score = 383 bits (983), Expect = e-104, Method: Compositional matrix adjust.
Identities = 176/282 (62%), Positives = 219/282 (77%), Gaps = 17/282 (6%)
Query: 2 IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
+ PTH+RA GN+ YYQ+ + N+S + + E + + P E+TE
Sbjct: 243 LVPTHERALGNRAYYQKEIQSKANQSKKKRGEDGQDDTAVPAQHFTVAEEKMKTWEEMTE 302
Query: 45 REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
RE+YEMLCRG++++PP I L CRYV R +P+L++ P KEEEAYL PRI++Y +V+YD
Sbjct: 303 RERYEMLCRGEVSIPPEIQKNLVCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDE 362
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH + +SRRVEHMT +
Sbjct: 363 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHEHVAAVSRRVEHMTSM 422
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
T TAEELQVVNYGIGGHYEPH+DFAR E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 423 TVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 482
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
FT++N+SLWP+KG+AAFW+NL +G+GD+ TRHAACPVLTGS
Sbjct: 483 FTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGS 524
>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
terrestris]
Length = 557
Score = 383 bits (983), Expect = e-104, Method: Compositional matrix adjust.
Identities = 176/282 (62%), Positives = 219/282 (77%), Gaps = 17/282 (6%)
Query: 2 IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
+ PTH+RA GN+ YYQ+ + N+S + + E + + P E+TE
Sbjct: 243 LVPTHERALGNRAYYQKEIQSKANQSKKKRGEDGQDDTAVPAQHFTVAEEKMKTWEEMTE 302
Query: 45 REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
RE+YEMLCRG++++PP I L CRYV R +P+L++ P KEEEAYL PRI++Y +V+YD
Sbjct: 303 RERYEMLCRGEVSIPPEIQKNLVCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDE 362
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH + +SRRVEHMT +
Sbjct: 363 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHEHVAAVSRRVEHMTSM 422
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
T TAEELQVVNYGIGGHYEPH+DFAR E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 423 TVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 482
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
FT++N+SLWP+KG+AAFW+NL +G+GD+ TRHAACPVLTGS
Sbjct: 483 FTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGS 524
>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
rotundata]
Length = 550
Score = 382 bits (982), Expect = e-104, Method: Compositional matrix adjust.
Identities = 176/276 (63%), Positives = 216/276 (78%), Gaps = 11/276 (3%)
Query: 2 IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTLE-------VTEREKYEM 50
+ PTH+RA GN+ YYQ+ + N+S + + E + + P E +TERE+YEM
Sbjct: 242 LVPTHERALGNRAYYQKEIQSKANQSKKKRGEDGQDDTAVPAQEKVKTWEEMTERERYEM 301
Query: 51 LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
LCRG++++PP I LKCRYV R +P+L++ P KEEEAYL PRI++Y +V+YD EI+ IK
Sbjct: 302 LCRGEVSIPPEIQKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVIYHNVIYDEEIETIK 361
Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH + +S+RVEHMT L TAE
Sbjct: 362 RMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSLNVETAE 421
Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
ELQVVNYGIGGHYEPH+DFAR E NAFKSLGTGNR+ATVL+YMSDV QGG TVFT++N+
Sbjct: 422 ELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINI 481
Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
SLWP KG+AAFW NL +G+GD TRHAACPVLTGS
Sbjct: 482 SLWPRKGSAAFWFNLKPNGEGDLRTRHAACPVLTGS 517
>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
Length = 415
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 175/282 (62%), Positives = 220/282 (78%), Gaps = 17/282 (6%)
Query: 2 IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
+ PTH+RA GN+ YYQ+ + ++S + + E + + P E+TE
Sbjct: 101 LVPTHERALGNRAYYQKEIQSKASQSKKKRGEDGQDDTAVPAQHFTVAEERVKTLDEMTE 160
Query: 45 REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
RE+YEMLCRG++T+PP + LKCRYV R +P+L++ P KEEEAYL PRI++Y +V+YD
Sbjct: 161 RERYEMLCRGEVTIPPEVQKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDD 220
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH + +SRRVEHMT +
Sbjct: 221 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSRRVEHMTSM 280
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
T TAEELQVVNYGIGGHYEPH+DFAR E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 281 TVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 340
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
FT++N++LWP+KG+AAFW+NL +G+GD+ TRHAACPVLTGS
Sbjct: 341 FTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGS 382
>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
Length = 415
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 175/282 (62%), Positives = 219/282 (77%), Gaps = 17/282 (6%)
Query: 2 IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
+ PTH+RA GN+ YYQ+ + ++S + + E K + P E+TE
Sbjct: 101 LVPTHERALGNRAYYQKEIQSKASQSKKKRGEDGKDDTAIPEQNFTVAEERVKTWEEMTE 160
Query: 45 REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
RE+YEMLCRG++++PP + LKCRYV R +P+L++ P KEEEAYL PRI++Y +V+YD
Sbjct: 161 RERYEMLCRGEVSIPPEVEKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDE 220
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH + +S+RVEHMT +
Sbjct: 221 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSM 280
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
+ TAEELQVVNYGIGGHYEPH+DFAR E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 281 SVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 340
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
FT++N+SLWP KG+AAFWHNL +G+GD+ TRHAACPVLTGS
Sbjct: 341 FTAINISLWPRKGSAAFWHNLKPNGEGDFKTRHAACPVLTGS 382
>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
Length = 476
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 173/282 (61%), Positives = 219/282 (77%), Gaps = 17/282 (6%)
Query: 2 IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
+ PTH+RA GN+ YYQ+ + ++S + + E + + P E+TE
Sbjct: 162 LVPTHERALGNRAYYQKEIQSKASQSKKKRGEDGQDDTAIPEQNFTVAEERVKTWEEMTE 221
Query: 45 REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
RE+YEMLCRG++++P + LKCRYV R +P+L++ PLKEEEAYL PRI++Y +V+YD
Sbjct: 222 RERYEMLCRGEVSIPREVEKNLKCRYVDRGIPFLKIAPLKEEEAYLDPRIVVYHNVIYDE 281
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH + +S+RVEHMT +
Sbjct: 282 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSM 341
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
+ TAEELQVVNYGIGGHYEPH+DFAR E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 342 SIETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 401
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
FT++N+SLWP KG+AAFW+NL +G+GD+ TRHAACPVLTGS
Sbjct: 402 FTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLTGS 443
>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
Length = 415
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 172/282 (60%), Positives = 217/282 (76%), Gaps = 17/282 (6%)
Query: 2 IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
+ PTH+RA GN+ YYQ+ + ++S + + E + + P E+TE
Sbjct: 101 LVPTHERALGNRAYYQKEIQSKASQSKKKRGEDGQDDTAIPEQNFTVAEERVKTWEEMTE 160
Query: 45 REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
RE+YEMLCRG++++P + LKCRYV R +P+L++ P KEEEAYL PRI+ Y +V+YD
Sbjct: 161 RERYEMLCRGEVSIPLEVEKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVFYHNVIYDE 220
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH + +S+RVEHMT +
Sbjct: 221 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSM 280
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
+ TAEELQVVNYGIGGHYEPH+DFAR E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 281 SVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 340
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
FT++N+SLWP KG+AAFW+NL +G+GD+ TRHAACPVLTGS
Sbjct: 341 FTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLTGS 382
>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
Length = 536
Score = 357 bits (916), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 176/292 (60%), Positives = 211/292 (72%), Gaps = 14/292 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK----DEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
I P+H RA GNK+YY++ L KS K D P ++ RE YE LCRG+++
Sbjct: 238 ILPSHPRALGNKIYYEDELQKSVNTKKKGDDGGEPEGESKPYVDPYGREFYEQLCRGEIS 297
Query: 58 VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
+P ++LKC Y+ RN P+L++ P K EEA+ +P I ++RDV+ DSEI IK+MAQPR
Sbjct: 298 LPVEKASKLKCFYLSRNQPFLKIAPFKVEEAHHRPDIFIFRDVLADSEIATIKRMAQPRF 357
Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
+RATVQN TGELEIA YRISKSAWL+E EH I +S+RV MTGLT STAEELQVVNY
Sbjct: 358 KRATVQNTDTGELEIAQYRISKSAWLKEEEHKHIADVSQRVSDMTGLTMSTAEELQVVNY 417
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGGHYEPH+DFAR E NAFKSLGTGNR+ATVLFYMSDV QGGATVF S+ +SLWP+KG
Sbjct: 418 GIGGHYEPHFDFARRDERNAFKSLGTGNRIATVLFYMSDVEQGGATVFPSIQVSLWPQKG 477
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRR 279
+AAFW+NLH SGDGD TRHAACPVLTGS + + PC L R
Sbjct: 478 SAAFWYNLHPSGDGDKMTRHAACPVLTGSKWVSNKWIHERGQEFRRPCTLER 529
>gi|157114985|ref|XP_001658091.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108877086|gb|EAT41311.1| AAEL007038-PA [Aedes aegypti]
Length = 545
Score = 356 bits (914), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 172/281 (61%), Positives = 209/281 (74%), Gaps = 16/281 (5%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK-------------DEPPKVNNVAPTLEV---TER 45
+ P H+RA GNK+YY++ L K + K D K+ V +ER
Sbjct: 236 LVPDHERAVGNKVYYEKELEKEAKQKALRGDDGSVDVPVDTTTKIRTSTSNPHVYDSSER 295
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
+ YE LCRG+ A ++LKCRYV P+L++ PLK EEA L+P I++Y DV+ ++E
Sbjct: 296 KLYEQLCRGEAERSVAETSKLKCRYVTNKSPFLKIAPLKLEEANLKPYIVIYHDVISEAE 355
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
++L+K++A+PR RRATVQNYKTGELE+ANYRISKSAWL++ EHP I+ I RVE MTGLT
Sbjct: 356 MELVKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLKDHEHPYIKAIGERVEDMTGLT 415
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
STAEELQVVNYGIGGHYEPH+DFAR E NAFKSLGTGNR+ATVLFYMSDV QGGATVF
Sbjct: 416 MSTAEELQVVNYGIGGHYEPHFDFARREETNAFKSLGTGNRIATVLFYMSDVTQGGATVF 475
Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
SL L+LWP+KG AAFW NLH+SG GDY TRHAACPVLTG+
Sbjct: 476 PSLRLALWPKKGAAAFWFNLHASGQGDYSTRHAACPVLTGT 516
>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
Length = 545
Score = 354 bits (908), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 170/280 (60%), Positives = 215/280 (76%), Gaps = 15/280 (5%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK----DEPPKVNNVAPTLEV-----------TERE 46
+ P H+RA GNK YY++ L K K D+ + V T+++ TER
Sbjct: 237 LVPNHERAVGNKAYYEKELEKEARQKALRGDDGSEDVPVDTTIQIKKETSSLVYDSTERV 296
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
YE LCRG+ A +A+L+CRYV + P+L++ PLK EEA+L+P I++Y +VM D+EI
Sbjct: 297 LYEQLCRGEAHRAEADLAKLRCRYVTNSSPFLKIAPLKLEEAHLEPYIVIYHEVMSDAEI 356
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
++IK++A+PR RRATVQNYKTGELE+ANYRISKSAWL++ EH V+ + +RVE MTGLT
Sbjct: 357 EVIKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLKDEEHSVVRTVGQRVEDMTGLTM 416
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+TAEELQVVNYGIGGHYEPH+DFAR E NAFKSLGTGNR+ATVLFYMSDV+QGGATVF
Sbjct: 417 TTAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRIATVLFYMSDVSQGGATVFP 476
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
S+ ++L P+KGTAAFW+NLH+SG GDY TRHAACPVLTG+
Sbjct: 477 SIRVALRPKKGTAAFWYNLHASGHGDYATRHAACPVLTGT 516
>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
Length = 553
Score = 353 bits (907), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 181/299 (60%), Positives = 210/299 (70%), Gaps = 22/299 (7%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK-----DEPPKVNNVAPTLEVT-------EREKYE 49
+ P H+RA NK YY + L K + K D +V T E T ER+ YE
Sbjct: 247 LVPDHERAVSNKAYYVKELQKEAQQKILRGDDGSEEVPVDTTTKEATPHVYDTNERKLYE 306
Query: 50 MLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
LCRG+ P + +QL CRY + P+LR+ PLK EEAYL+P I++Y DVM D EI+ I
Sbjct: 307 QLCRGEQQPPIELRSQLVCRYTTNSSPFLRIGPLKLEEAYLRPYIVIYHDVMSDREIERI 366
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K A+PR RRATVQNYKTGELE ANYRISKSAWL++ E +I IS+RVE MTGLT TA
Sbjct: 367 KHYARPRFRRATVQNYKTGELEFANYRISKSAWLKDAEDEMIRTISQRVEDMTGLTMETA 426
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
EELQVVNYGIGGHYEPH+DFAR E NAFKSLGTGNR+ATVLFYMSDV QGGATVF SLN
Sbjct: 427 EELQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRIATVLFYMSDVTQGGATVFPSLN 486
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLR 278
L+LWP KGTAAFW NLH+SG GDY TRHAACPVLTG+ + + PCGL+
Sbjct: 487 LALWPRKGTAAFWFNLHASGRGDYATRHAACPVLTGTKWVSNKWIHERGQEFRRPCGLQ 545
>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Acyrthosiphon pisum]
Length = 534
Score = 353 bits (906), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 176/269 (65%), Positives = 211/269 (78%), Gaps = 6/269 (2%)
Query: 2 IFPTHQRAQGNKLYYQEAL-NKSPEL--KDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
I P H+RA GN YY+ A+ N + E+ ++PPK V TL+ ERE+Y MLCR + +
Sbjct: 237 ILPNHERALGNLAYYEAAIKNGTTEIGKSEQPPKA--VTATLDPEERERYHMLCRNENLM 294
Query: 59 PPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
I +QL+CRY + N P L + PLKEEEA+ PRIILYRDV+YD+EI++IK+MAQPRL
Sbjct: 295 SIQISSQLRCRYTNNNRNPLLLIAPLKEEEAFFSPRIILYRDVLYDNEIEVIKRMAQPRL 354
Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
+RATVQNYKTGELE A+YRISKSAWL+E E V+ +++RVE MTGLTT TAEELQVVNY
Sbjct: 355 KRATVQNYKTGELEFADYRISKSAWLKEHEDVVVANVAKRVEVMTGLTTETAEELQVVNY 414
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
G+GGHY+PHYDFAR E NAFKSLGTGNR+ATVLFYMSDVAQGGATVF L ++L P KG
Sbjct: 415 GVGGHYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGGATVFPWLGVALQPVKG 474
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
TAA W NL+ SG+GD TRHAACPVL GS
Sbjct: 475 TAAVWFNLYPSGNGDLRTRHAACPVLQGS 503
>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Acyrthosiphon pisum]
Length = 552
Score = 353 bits (906), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 176/269 (65%), Positives = 211/269 (78%), Gaps = 6/269 (2%)
Query: 2 IFPTHQRAQGNKLYYQEAL-NKSPEL--KDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
I P H+RA GN YY+ A+ N + E+ ++PPK V TL+ ERE+Y MLCR + +
Sbjct: 255 ILPNHERALGNLAYYEAAIKNGTTEIGKSEQPPKA--VTATLDPEERERYHMLCRNENLM 312
Query: 59 PPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
I +QL+CRY + N P L + PLKEEEA+ PRIILYRDV+YD+EI++IK+MAQPRL
Sbjct: 313 SIQISSQLRCRYTNNNRNPLLLIAPLKEEEAFFSPRIILYRDVLYDNEIEVIKRMAQPRL 372
Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
+RATVQNYKTGELE A+YRISKSAWL+E E V+ +++RVE MTGLTT TAEELQVVNY
Sbjct: 373 KRATVQNYKTGELEFADYRISKSAWLKEHEDVVVANVAKRVEVMTGLTTETAEELQVVNY 432
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
G+GGHY+PHYDFAR E NAFKSLGTGNR+ATVLFYMSDVAQGGATVF L ++L P KG
Sbjct: 433 GVGGHYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGGATVFPWLGVALQPVKG 492
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
TAA W NL+ SG+GD TRHAACPVL GS
Sbjct: 493 TAAVWFNLYPSGNGDLRTRHAACPVLQGS 521
>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
Length = 534
Score = 342 bits (877), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 161/291 (55%), Positives = 210/291 (72%), Gaps = 15/291 (5%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTE-----REKYEMLCRGDL 56
++P H+RAQGNK+YY++AL +S K + + + R YE LCR ++
Sbjct: 239 LYPNHERAQGNKIYYEDALQQSKGQKKKGDDGDEIVIEKNTNSKYYKGRGMYEKLCRNEV 298
Query: 57 TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
+ + A+LKCRYV P+L L +KEEEA+L PRI+LY DV+ D EI I+++A PR
Sbjct: 299 GLSEKMKAKLKCRYVDFGRPFLMLAKVKEEEAFLDPRIVLYHDVLSDREIKTIQQLAVPR 358
Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
+RATVQN +TG+LE+A+YRISKSAWL + +HP + ++S+RVE +TGL +TAE LQVVN
Sbjct: 359 FKRATVQNSETGKLEVAHYRISKSAWLEDVDHPYVAKVSQRVEDITGLNMATAESLQVVN 418
Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
YGIGGHYEPH+DFAR E NAF+SLGTGNR+AT+LFYMSDV+QGGATVF + +SLWP+K
Sbjct: 419 YGIGGHYEPHFDFARKEEKNAFQSLGTGNRIATILFYMSDVSQGGATVFPGIKVSLWPKK 478
Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
GTAAFW+NL +G+GDY TRHAACPVLTGS + + PCGL
Sbjct: 479 GTAAFWYNLRKNGEGDYLTRHAACPVLTGSKWVCNKWIHERGQEFRRPCGL 529
>gi|195055779|ref|XP_001994790.1| GH14110 [Drosophila grimshawi]
gi|193892553|gb|EDV91419.1| GH14110 [Drosophila grimshawi]
Length = 487
Score = 342 bits (877), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 166/278 (59%), Positives = 201/278 (72%), Gaps = 13/278 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPEL--------KDEPP----KVNNVAP-TLEVTEREKY 48
+ P H+RA GNK +Y++ + E+ DE P V P ++TER Y
Sbjct: 180 LLPDHERANGNKKFYEKEIAHQMEMGKMKGDDGSDEMPVSDLHVTKTDPGVFDLTERTAY 239
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
EMLCRG+L PA + L+CRYV+ NV +LRL PLK EEA++ P I++Y D MYDSEI++
Sbjct: 240 EMLCRGELKPSPAEIRPLRCRYVNNNVDFLRLAPLKLEEAFMDPYIVIYHDAMYDSEIEV 299
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
+K+MA+PR RRATVQN TG LE ANYRISKSAWL+ PEH +I + +R MTGL +
Sbjct: 300 LKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTPEHEIIGTVVQRTADMTGLDMDS 359
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQVVNYGIGGHYEPH+DFAR E AF+ L GNR+AT+LFYMSDV QGGATVFTSL
Sbjct: 360 AEELQVVNYGIGGHYEPHFDFARREEKLAFEGLNLGNRIATMLFYMSDVQQGGATVFTSL 419
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+LWP+KGTAAFW NLH SG+GD TRHAACPVLTGS
Sbjct: 420 RTALWPKKGTAAFWMNLHRSGEGDARTRHAACPVLTGS 457
>gi|195391754|ref|XP_002054525.1| GJ24502 [Drosophila virilis]
gi|194152611|gb|EDW68045.1| GJ24502 [Drosophila virilis]
Length = 487
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 168/278 (60%), Positives = 203/278 (73%), Gaps = 13/278 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK--------DEPP----KVNNVAP-TLEVTEREKY 48
+ P H+RA GNK +Y++ + EL+ DE P V P ++TER+ Y
Sbjct: 180 LLPNHERANGNKKFYEKEIAHLKELQKMKGDDGTDEMPVSDLPVAKSDPGVFDMTERKAY 239
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
EMLCRG+L P+ + L+CRYV+ NV +LRL PLK EEAY+ P I++Y D MYDSEI++
Sbjct: 240 EMLCRGELKPSPSELRPLRCRYVNNNVAFLRLAPLKLEEAYMDPYIVIYHDAMYDSEIEI 299
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK+MA+PR RRATVQN TG LE ANYRISKSAWL+ EH VI + +R MTGL +
Sbjct: 300 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTAEHRVIGTVVQRTADMTGLDMDS 359
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQVVNYGIGGHYEPH+DFAR E AF+ L GNR+AT+LFYMSDV QGGATVFTSL
Sbjct: 360 AEELQVVNYGIGGHYEPHFDFARREEKRAFEGLNLGNRIATMLFYMSDVEQGGATVFTSL 419
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +LWP+KGTAAFW NLH SG+GD TRHAACPVLTGS
Sbjct: 420 HAALWPKKGTAAFWMNLHRSGEGDVRTRHAACPVLTGS 457
>gi|195452726|ref|XP_002073473.1| GK14136 [Drosophila willistoni]
gi|194169558|gb|EDW84459.1| GK14136 [Drosophila willistoni]
Length = 550
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 168/278 (60%), Positives = 202/278 (72%), Gaps = 13/278 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK--------DEPP----KVNNVAPTL-EVTEREKY 48
+ P H+RA GNK +Y++ + E+K DE P V P + ++TER+ Y
Sbjct: 243 LLPYHERANGNKKFYEKEIAHLKEMKRMKGDDGSDEMPVSDLPVAKSDPGVYDITERKAY 302
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
EMLCRG+L PA + L+CRYV NVP+LRL PLK EEA++ P I++Y D MYDSE+DL
Sbjct: 303 EMLCRGELKPSPADLRPLRCRYVTNNVPFLRLGPLKLEEAHMDPYIVIYHDAMYDSEMDL 362
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK+MA+PR RRATVQN TG LE ANYRISKSAWL+ E VI + +R MTGL +
Sbjct: 363 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDQVIGTVVQRTADMTGLDMDS 422
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQVVNYGIGGHYEPH+DFAR E AF+ L GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 423 AEELQVVNYGIGGHYEPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 482
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +LWP+KGTAAFW NLH G+GD TRHAACPVLTG+
Sbjct: 483 HAALWPKKGTAAFWMNLHRDGEGDVRTRHAACPVLTGT 520
>gi|194905436|ref|XP_001981196.1| GG11753 [Drosophila erecta]
gi|190655834|gb|EDV53066.1| GG11753 [Drosophila erecta]
Length = 550
Score = 339 bits (870), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 167/278 (60%), Positives = 201/278 (72%), Gaps = 13/278 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPK----VNNVAPTL-EVTEREKY 48
+ P H+RA GNK +Y++ + + +L DE PK V P + ++TER Y
Sbjct: 243 LLPHHERANGNKRFYEKEIAQQLQLSKMKGDDGTDEMPKSDLPVAKSDPAIFDMTERRAY 302
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
EMLCRG+L P+ + L+CRYV VP+LRL PLK EEA+ P I+++ D MYD EIDL
Sbjct: 303 EMLCRGELKPSPSDLRSLRCRYVTNGVPFLRLGPLKLEEAHADPYIVIFHDAMYDGEIDL 362
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK+MA+PR RRATVQN TG LE ANYRISKSAWL+ PEH VIE + +R MTGL +
Sbjct: 363 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTPEHRVIETVVQRTADMTGLDMDS 422
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQVVNYGIGGHYEPH+DFAR E AF+ L GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 423 AEELQVVNYGIGGHYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 482
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +L+P+KGTAAFW NLH G GD TRHAACPVLTG+
Sbjct: 483 HTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGT 520
>gi|312383453|gb|EFR28539.1| hypothetical protein AND_03427 [Anopheles darlingi]
Length = 341
Score = 338 bits (868), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 182/325 (56%), Positives = 211/325 (64%), Gaps = 49/325 (15%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK-------------DEPPKVNNVAPTLEV---TER 45
+ P H+RA NK YY + L K K D K++ + V TER
Sbjct: 8 LVPDHERAVSNKAYYVKELEKEALQKILRGDDGSEEVPVDTSTKIHKGEASPHVYDKTER 67
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
+ YE LCRG+ P + +QL CRY P+LRL PLK EEAY QP I++Y DVM D E
Sbjct: 68 KLYEQLCRGEQEPPIELRSQLVCRYATNRSPFLRLAPLKLEEAYRQPDIVIYHDVMSDRE 127
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
I+LIK A+PR RRATVQNYKTGELE ANYRISKSAWL++ EH VI +++RVE MTGLT
Sbjct: 128 IELIKHYARPRFRRATVQNYKTGELEFANYRISKSAWLKDTEHEVIRTVNQRVEDMTGLT 187
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY------------ 213
+TAEELQVVNYGIGGHYEPH+DFAR E NAFKSLGTGNR+ATVLFY
Sbjct: 188 MATAEELQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRIATVLFYVSDLCLCHTSHT 247
Query: 214 -----------MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPV 262
MSDV QGGATVF SLNL+L P KGTAAFWHNLH+SG+GDY TRHAACPV
Sbjct: 248 NADFRFLSVGQMSDVTQGGATVFPSLNLALRPRKGTAAFWHNLHASGNGDYATRHAACPV 307
Query: 263 LTGSNSLHSTC----------PCGL 277
LTG+ + + PCGL
Sbjct: 308 LTGTKWVSNKWIHERGQEFRRPCGL 332
>gi|195110919|ref|XP_002000027.1| GI24860 [Drosophila mojavensis]
gi|193916621|gb|EDW15488.1| GI24860 [Drosophila mojavensis]
Length = 487
Score = 338 bits (866), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 165/278 (59%), Positives = 203/278 (73%), Gaps = 13/278 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPE-LK------------DEPPKVNNVAPTLEVTEREKY 48
+ P H+RA GNK +Y++ + + E LK + P V + ++TER+ Y
Sbjct: 180 LLPDHERANGNKKFYEKEIAQLKEKLKVKGDDGSDATPVSDLPVVKSDPGVFDMTERKAY 239
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
EMLCRG+L + P+++ L+CRYV NVP+LRL PLK EEA+L P I++Y D M+DSEI++
Sbjct: 240 EMLCRGELKLSPSVLRPLRCRYVSNNVPFLRLAPLKLEEAFLDPYIVIYHDAMFDSEIEV 299
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
+K+MA+PR RRATVQN TG LE ANYRISKSAWL+ EH VI + +R MTGL +
Sbjct: 300 LKRMARPRFRRATVQNAVTGALETANYRISKSAWLKTAEHRVIGTVVQRTADMTGLDMDS 359
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQVVNYGIGGHYEPH+DFAR E AF+ L GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 360 AEELQVVNYGIGGHYEPHFDFARREEIRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 419
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ L P+KGTAAFW NLH SG+GD TRHAACPVLTGS
Sbjct: 420 HAVLKPKKGTAAFWMNLHRSGEGDVRTRHAACPVLTGS 457
>gi|194765194|ref|XP_001964712.1| GF22904 [Drosophila ananassae]
gi|190614984|gb|EDV30508.1| GF22904 [Drosophila ananassae]
Length = 547
Score = 338 bits (866), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 167/278 (60%), Positives = 203/278 (73%), Gaps = 13/278 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPK----VNNVAPT-LEVTEREKY 48
+ P H+RA GNK +Y++ + +L+ DE PK V P+ ++TER+ Y
Sbjct: 240 LLPHHERANGNKRFYEKEIANQQQLRKMKGDDGSDEMPKSDLPVAKSDPSVFDMTERKAY 299
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
EMLCRG+L PA + L+CRYV NVP+LRL PLK EEA+ +P I++Y D MYDSEI+L
Sbjct: 300 EMLCRGELKPSPADLRPLRCRYVTNNVPFLRLGPLKLEEAHQEPYIVIYHDAMYDSEIEL 359
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK+MA+PR RRATVQN TG LE ANYRISKSAWL+ E VI + +R MTGL +
Sbjct: 360 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDHVIGTVVQRTADMTGLDMDS 419
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQVVNYGIGGHYEPH+DFAR E AF+ L GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 420 AEELQVVNYGIGGHYEPHFDFARKEEKRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 479
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +L+P+KGTAAFW NLH G+GD TRHAACPVLTG+
Sbjct: 480 HTALFPKKGTAAFWMNLHRDGEGDVRTRHAACPVLTGT 517
>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
Length = 550
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 166/278 (59%), Positives = 200/278 (71%), Gaps = 13/278 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPK----VNNVAPTL-EVTEREKY 48
+ P H+RA GNK +Y++ + + +L+ DE PK V P + ++TER Y
Sbjct: 243 LLPHHERANGNKRFYEKEIAQQLQLRKMKGDDGTDEMPKSDLPVAKSDPAIFDMTERRAY 302
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
EMLCRG+L P+ + L+CRYV VP+LRL PLK EE + P I++Y D MYDSEIDL
Sbjct: 303 EMLCRGELKPSPSDLRSLRCRYVTNRVPFLRLGPLKLEEVHADPYIVIYHDAMYDSEIDL 362
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK+MA+PR RRATVQN TG LE ANYRISKSAWL+ E VIE + +R MTGL +
Sbjct: 363 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDS 422
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQVVNYGIGGHYEPH+DFAR E AF+ L GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 423 AEELQVVNYGIGGHYEPHFDFARKEEERAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 482
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +L+P+KGTAAFW NLH G GD TRHAACPVLTG+
Sbjct: 483 HTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGT 520
>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
[Drosophila melanogaster]
gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
Length = 550
Score = 335 bits (859), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 166/278 (59%), Positives = 200/278 (71%), Gaps = 13/278 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPK----VNNVAPTL-EVTEREKY 48
+ P H+RA GNK +Y++ + + +L+ DE PK V P + ++TER Y
Sbjct: 243 LLPHHERANGNKRFYEKEIAQQLQLRKMKGDDGTDEMPKSDLPVAKSDPAIFDMTERRAY 302
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
EMLCRG+L P+ + L+CRYV VP+LRL PLK EE + P I++Y D MYDSEIDL
Sbjct: 303 EMLCRGELKPSPSDLRSLRCRYVTNRVPFLRLGPLKLEEVHADPYIVIYHDAMYDSEIDL 362
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK+MA+PR RRATVQN TG LE ANYRISKSAWL+ E VIE + +R MTGL +
Sbjct: 363 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDS 422
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQVVNYGIGGHYEPH+DFAR E AF+ L GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 423 AEELQVVNYGIGGHYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 482
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +L+P+KGTAAFW NLH G GD TRHAACPVLTG+
Sbjct: 483 HTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGT 520
>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
Length = 550
Score = 334 bits (857), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 165/278 (59%), Positives = 200/278 (71%), Gaps = 13/278 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPK----VNNVAPTL-EVTEREKY 48
+ P H+RA GNK +Y++ + + +L+ DE PK V P + ++TER Y
Sbjct: 243 LLPHHERANGNKRFYEKEIAQQLQLRKMKGDDGTDEMPKSDLPVAKSDPAIFDMTERRAY 302
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
EMLCRG+L P+ + L+CRYV VP+LRL PLK EE + P I++Y D MYDSEIDL
Sbjct: 303 EMLCRGELKPSPSDLRSLRCRYVTNRVPFLRLGPLKLEEVHADPYIVIYHDAMYDSEIDL 362
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK+MA+PR RRATVQN TG LE ANYRISKSAWL+ E VIE + +R MTGL +
Sbjct: 363 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDS 422
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQVVNYGIGGHYEPH+DFAR E AF+ + GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 423 AEELQVVNYGIGGHYEPHFDFARKEEERAFEGINLGNRIATVLFYMSDVEQGGATVFTSL 482
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +L+P+KGTAAFW NLH G GD TRHAACPVLTG+
Sbjct: 483 HTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGT 520
>gi|125772807|ref|XP_001357662.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
gi|54637394|gb|EAL26796.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 334 bits (857), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 162/278 (58%), Positives = 197/278 (70%), Gaps = 13/278 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPKVN-----NVAPTLEVTEREKY 48
+ P H+RA GNK +Y++ + E+K DE P + + L V ER+ Y
Sbjct: 242 LLPNHERANGNKRFYEKEIAHQKEMKKMKGDDGTDEMPVSDLPVARSDTGELGVKERKSY 301
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
EMLCRG+L P + L+CRYV NVP+LRL PLK EEA+ P I++Y D MYDSE+DL
Sbjct: 302 EMLCRGELKPSPTYMRSLRCRYVTNNVPFLRLGPLKLEEAHKDPYIVIYHDAMYDSEMDL 361
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK+MA+PR RRATVQN TG LE ANYRISKSAWL+ E VI ++ +R MTGL +
Sbjct: 362 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDSVIAKVVQRTADMTGLDMES 421
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQVVNYGIGGHY PH+DFAR E AF+ L GNR+ATVLFYMSDV QGGATVFT+L
Sbjct: 422 AEELQVVNYGIGGHYAPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQGGATVFTTL 481
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+LWP++GTAAFW NLH G+GD T+HAACPVLTG+
Sbjct: 482 RTALWPKRGTAAFWMNLHRDGEGDKRTQHAACPVLTGT 519
>gi|195159323|ref|XP_002020531.1| GL13463 [Drosophila persimilis]
gi|194117300|gb|EDW39343.1| GL13463 [Drosophila persimilis]
Length = 487
Score = 333 bits (854), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 162/278 (58%), Positives = 197/278 (70%), Gaps = 13/278 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPKVN-----NVAPTLEVTEREKY 48
+ P H+RA GNK +Y++ + E+K DE P + + L V ER+ Y
Sbjct: 180 LLPNHERANGNKRFYEKEIAHQKEMKKMKGDDGTDEMPVSDLPVARSDTGELGVKERKSY 239
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
EMLCRG+L P + L+CRYV NVP+LRL PLK EEA+ P I++Y D MYDSE+DL
Sbjct: 240 EMLCRGELKPSPTYMRSLRCRYVTNNVPFLRLGPLKLEEAHKDPYIVIYHDAMYDSEMDL 299
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK+MA+PR RRATVQN TG LE ANYRISKSAWL+ E VI ++ +R MTGL +
Sbjct: 300 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDSVIAKVVQRTADMTGLDMES 359
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQVVNYGIGGHY PH+DFAR E AF+ L GNR+ATVLFYMSDV QGGATVFT+L
Sbjct: 360 AEELQVVNYGIGGHYAPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQGGATVFTTL 419
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+LWP++GTAAFW NLH G+GD T+HAACPVLTG+
Sbjct: 420 RTALWPKRGTAAFWMNLHRDGEGDKRTQHAACPVLTGT 457
>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
Length = 545
Score = 332 bits (852), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 210/303 (69%), Gaps = 26/303 (8%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK-------------DEPPKVNNVA---PTLEVTER 45
+ P HQRA GNK YY++ L + ++ DEP N+ P+ ++ ER
Sbjct: 238 LVPFHQRALGNKKYYEDLLRQQGVIQRRGETGDEDNVVMDEPFNTANLKLTKPSDQLPER 297
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
E YE LCRG+ + P I +L+CRYV NVPYL + P+K EEA+ +P I++Y +V+ D E
Sbjct: 298 ENYEKLCRGEKLMDPKIEGRLRCRYVTNNVPYLYIQPVKMEEAFHKPLIVIYHNVINDDE 357
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
I+ +KKMAQPR +RATVQN TG LE ANYRISKSAWL+ EH + +++RRV +TGL
Sbjct: 358 IETVKKMAQPRFKRATVQNSVTGNLEPANYRISKSAWLKSEEHDHVFKVTRRVGDVTGLD 417
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
+TAE+LQVVNYGIGGHYEPH+D+AR E NAFK LG GNRVAT LFYMS+V GGATVF
Sbjct: 418 MATAEDLQVVNYGIGGHYEPHFDYARKEEVNAFKDLGWGNRVATWLFYMSEVEAGGATVF 477
Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PC 275
LNL+LWP+KG+AAFW+NLH +G+G+ TRHAACPVLTGS + + PC
Sbjct: 478 PKLNLALWPQKGSAAFWYNLHPNGEGNELTRHAACPVLTGSKWVSNKWIHERNQEFRHPC 537
Query: 276 GLR 278
GLR
Sbjct: 538 GLR 540
>gi|195505190|ref|XP_002099397.1| GE10881 [Drosophila yakuba]
gi|194185498|gb|EDW99109.1| GE10881 [Drosophila yakuba]
Length = 487
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 165/278 (59%), Positives = 198/278 (71%), Gaps = 13/278 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPK----VNNVAPTL-EVTEREKY 48
+ P H+RA GNK +Y++ + + +L DE PK V P + ++TER Y
Sbjct: 180 LLPHHERANGNKRFYEKEIAQQLQLSKMKGDDGTDEMPKSDLPVAKSDPAIFDMTERRAY 239
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
EMLCRG+L P+ + L+CRYV VP+LRL PLK EEA+ P I++Y D MYDSEID+
Sbjct: 240 EMLCRGELKPSPSELRPLRCRYVTNGVPFLRLGPLKLEEAHADPYIVIYHDAMYDSEIDV 299
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK+MA+PR RRATVQN TG LE ANYRISKSAWL+ E VI + +R MTGL +
Sbjct: 300 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTHEDRVIGTVVQRTADMTGLDMES 359
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQVVNYGIGGHYEPH+DFAR E AF+ L GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 360 AEELQVVNYGIGGHYEPHFDFARKEEERAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 419
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +L+P KGTAAFW NLH G GD TRHAACPVLTG+
Sbjct: 420 HTALFPRKGTAAFWMNLHRDGQGDVRTRHAACPVLTGT 457
>gi|240974259|ref|XP_002401836.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215491070|gb|EEC00711.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 490
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 171/297 (57%), Positives = 206/297 (69%), Gaps = 23/297 (7%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELK-----DEPPKVNNVA-----PTLEVTEREKYEMLCR 53
P H RA GNK YY++A++K+ K D P V P + +ER YE LCR
Sbjct: 192 PDHPRAPGNKRYYEDAISKTELHKRGDDGDVPMDEAAVGKKHHGPDAD-SERGIYERLCR 250
Query: 54 GD-LTVPPAIVAQ-LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ VPP + L C+Y P+L L P KEE + +PRI++Y DVM E+D++K
Sbjct: 251 GEKFPVPPLYKDKDLTCQYRTNGSPFLLLQPAKEEVMFPKPRIVIYHDVMSKHEMDVVKL 310
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+AQPRL+RATVQNYK+GELE+ANYRISKSAWLR EH VI R++RR+EH+TGL+ TAEE
Sbjct: 311 LAQPRLKRATVQNYKSGELEVANYRISKSAWLRNEEHGVIARVTRRIEHITGLSADTAEE 370
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQVVNYGIGGHYEPH+DFAR E NAF+SLGTGNR+AT L YMSDV GGATVF L L+
Sbjct: 371 LQVVNYGIGGHYEPHFDFARREEKNAFQSLGTGNRIATWLNYMSDVPAGGATVFPQLRLT 430
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS----------TCPCGLR 278
LWPEKG AAFW+NLH SG+GD TRHAACPVL GS + + T PCG R
Sbjct: 431 LWPEKGAAAFWYNLHRSGEGDMLTRHAACPVLAGSKWVSNKWFHERGQEFTRPCGTR 487
>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
Length = 249
Score = 320 bits (820), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 154/218 (70%), Positives = 180/218 (82%), Gaps = 1/218 (0%)
Query: 50 MLCRGDLTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
MLCR + + I +QL+CRY + N P L + PLKEEEA+ PRIILYRDV+YD+EI++
Sbjct: 1 MLCRNENLMSIQISSQLRCRYTNNNRNPLLLIAPLKEEEAFFSPRIILYRDVLYDNEIEV 60
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK+MAQPRL+RATVQNYKTGELE A+YRISKSAWL+E E V+ +++RVE MTGLTT T
Sbjct: 61 IKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEHEDVVVANVAKRVEVMTGLTTET 120
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQVVNYG+GGHY+PHYDFAR E NAFKSLGTGNR+ATVLFYMSDVAQGGATVF L
Sbjct: 121 AEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGGATVFPWL 180
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
++L P KGTAA W NL+ SG+GD TRHAACPVL GS
Sbjct: 181 GVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGS 218
>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
Length = 537
Score = 315 bits (806), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 157/295 (53%), Positives = 199/295 (67%), Gaps = 19/295 (6%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK------DEPPKVNNVA---PTLEVTEREKYEMLC 52
I P HQRA GNK +Y++ L K L +P N+ P + ER+KYE LC
Sbjct: 237 IVPYHQRAIGNKKHYEDVLRKEGILLPIEMILTKPFNTANLKLKKPVDNLEERDKYEKLC 296
Query: 53 RGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
RG+ + P I L+CRY+ NVP+ + P+K EEA L+P I++Y DVM D EI+ +KKM
Sbjct: 297 RGEKLMDPKIEGHLRCRYITNNVPFFFIQPIKMEEALLKPMIVVYHDVMSDDEIETVKKM 356
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PR +RAT++N KTGELE ANYRISKSAWL+ EH I +++RRV +TGL STAE+L
Sbjct: 357 AKPRFKRATIRNSKTGELEPANYRISKSAWLKSEEHDHILKVTRRVGDITGLDMSTAEDL 416
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QVVNYGIGGHYEPH+D+AR AFK LG GNR+AT LFYMSDV GGATVF ++
Sbjct: 417 QVVNYGIGGHYEPHFDYARTETTEAFKELGWGNRIATWLFYMSDVEAGGATVFPPTGAAV 476
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
WP KG+AAFW+NL+ +G G+ TRHAACPVL+GS + + PCGL
Sbjct: 477 WPRKGSAAFWYNLYPNGKGNELTRHAACPVLSGSKWVSNRWIHEHRQEFRRPCGL 531
>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
occidentalis]
Length = 525
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 150/269 (55%), Positives = 184/269 (68%), Gaps = 6/269 (2%)
Query: 4 PTHQRAQGNKLYYQEALNKSP------ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
P H RA GNK YY L K+ + K+ P + + ER YE LCRG+
Sbjct: 232 PDHPRASGNKRYYLSELGKNQSGEGRGDTKEAPVETHIKRQDSLSDERIMYERLCRGEPV 291
Query: 58 VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y H N PY+ L P K E + +P + L+ D+M D EI + +++ PRL
Sbjct: 292 EKPFLRKNLHCTYFHNNHPYMILQPSKLEVIHERPYLALFHDIMSDDEIQTVIELSAPRL 351
Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
+RATVQN K+GELE+ANYRISKSAWL+ +H V+ER+S R E++TGLT TAEELQVVNY
Sbjct: 352 KRATVQNAKSGELEVANYRISKSAWLKNHDHEVVERLSFRFEYLTGLTHLTAEELQVVNY 411
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGGHYE H+DFAR E +AFK LGTGNR+AT + YMSDV GGATVF L L++WPEKG
Sbjct: 412 GIGGHYEAHFDFARRDEKDAFKQLGTGNRIATWINYMSDVKAGGATVFPRLGLTVWPEKG 471
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+AAFW NLH SG+GD TRHAACPVL GS
Sbjct: 472 SAAFWWNLHRSGEGDILTRHAACPVLAGS 500
>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 539
Score = 299 bits (766), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 152/298 (51%), Positives = 202/298 (67%), Gaps = 16/298 (5%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
I P H RA+GN +Y++ L + D PP VN + ER+ YE LCRG+ +PP
Sbjct: 235 IAPNHPRAKGNVKWYEDMLQGKDMVGDLPPIVNKRVEFDGIVERDAYEALCRGE--IPPV 292
Query: 62 ---IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
++L+C Y+ R+ P+L++ P+K E P +L+++V+ DSEI++IK++A P+L+
Sbjct: 293 EEKWKSKLRC-YLKRDKPFLKIAPIKVEILRFDPLAVLFKNVISDSEIEVIKELASPKLK 351
Query: 119 RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYG 178
RATVQN KTGELE A YRISKSAWL+ PVI+R++RR+E TGL +T+EELQV NYG
Sbjct: 352 RATVQNSKTGELEHATYRISKSAWLKGDLDPVIDRVNRRIEDFTGLNQATSEELQVANYG 411
Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
+GGHY+PH+DFAR E NAFK+L TGNR+ATVLFYMS +GGATVF L +++P K
Sbjct: 412 LGGHYDPHFDFARKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVFNHLGTAVFPSKND 471
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHS-----TCPCGLRRGLQRSGI 286
A FW+NL G+GD TRHAACPVL G SN +H T PCGL G+Q + I
Sbjct: 472 ALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQEFTRPCGLEEGVQENFI 529
>gi|112984520|ref|NP_001037195.1| prolyl 4-hydroxylase alpha subunit precursor [Bombyx mori]
gi|37543673|gb|AAM21932.1| prolyl 4-hydroxylase alpha subunit [Bombyx mori]
Length = 550
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 163/305 (53%), Positives = 201/305 (65%), Gaps = 25/305 (8%)
Query: 4 PTHQRAQGNKLYYQEAL-NKSPELK--------DEPPKVNNVAPTLE--VTEREKYEMLC 52
P H RA+GN +YQ+ + + ELK DEP + + L ER+ YE LC
Sbjct: 238 PKHVRARGNIPHYQKTIAEQEAELKKQQRGETSDEPEEEDGQDYELSEYAKERKVYESLC 297
Query: 53 RGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
RG++ +P I +LKC YV P+L+L P+K E+ Y++P I ++ +VM D EI+ IKK
Sbjct: 298 RGEMEIPHEITKRLKCWYVTDTHPFLKLAPIKVEQMYVKPDIFMFHEVMTDDEIEFIKKR 357
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PR +RA V + KTGEL A+YRISKS+WLR+ E PVI RI++RV MTGL+ AEEL
Sbjct: 358 AKPRFKRAVVHDPKTGELTPAHYRISKSSWLRDEESPVIARITQRVTDMTGLSMLHAEEL 417
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QVVNYGIGGHYEPH+DFAR E N F G GNR+ATVLFYMSDVAQGGATVFT L LSL
Sbjct: 418 QVVNYGIGGHYEPHFDFARKRE-NPFTKFG-GNRIATVLFYMSDVAQGGATVFTELGLSL 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
+P K AAFW NLH+SG+GD TRHAACPVL GS + + PC L Q
Sbjct: 476 FPIKRAAAFWLNLHASGEGDLATRHAACPVLRGSKWVSNKWIHQGGQELLRPCDLE--YQ 533
Query: 283 RSGII 287
GII
Sbjct: 534 EEGII 538
>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
Length = 539
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 151/296 (51%), Positives = 200/296 (67%), Gaps = 12/296 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLT-VPP 60
I P H RA+GN +Y++ L + D PP VN + ER+ YE LCRG++ V P
Sbjct: 235 IAPNHPRAKGNVKWYEDMLQGKDMVGDLPPIVNKRVEYDGIVERDAYEALCRGEIPPVEP 294
Query: 61 AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
+L+C Y+ R+ P+L+L P+K E P +L+++V++DSEI++IK++A P+L+RA
Sbjct: 295 KWKNKLRC-YLKRDKPFLKLAPIKVEILRFDPLAVLFKNVIHDSEIEVIKELASPKLKRA 353
Query: 121 TVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
TVQN KTGELE A YRISKSAWL+ PVI+R++RR+E T L +T+EELQV NYG+G
Sbjct: 354 TVQNSKTGELEHATYRISKSAWLKGDLDPVIDRVNRRIEDFTNLNQATSEELQVANYGLG 413
Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
GHY+PH+DFAR E NAFK+L TGNR+ATVLFYMS +GGATVF L +++P K A
Sbjct: 414 GHYDPHFDFARKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDAL 473
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHS-----TCPCGLRRGLQRSGI 286
FW+NL G+GD TRHAACPVL G SN +H T PCGL +Q + I
Sbjct: 474 FWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHEKGQEFTRPCGLEEEVQENFI 529
>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
gallus]
Length = 536
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 149/273 (54%), Positives = 192/273 (70%), Gaps = 11/273 (4%)
Query: 4 PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
P HQRA GN Y++ EA S + +D+ K V + ER KYEMLCRG+
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEVKKKDYLPERRKYEMLCRGEG 299
Query: 56 LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
L + P +L CRY +RN Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRLRRAT+ N TG LE A+YRISKSAWL E PV+ RI+ R++ +TGL STAEELQ
Sbjct: 359 KPRLRRATISNPITGALETAHYRISKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+W
Sbjct: 419 VANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
P+KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 479 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511
>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
Length = 534
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 147/274 (53%), Positives = 192/274 (70%), Gaps = 11/274 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL-KDEPPKVNNVAPTLE--------VTEREKYEMLCRG 54
P HQRA GN Y++ + K E K +N TL+ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKEANKSASDDQSNQKTTLKKKGVAVDYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EID++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIDIVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ R++ R++ +TGL STAEEL
Sbjct: 356 AKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRLNMRIQDLTGLDVSTAEEL 415
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASV 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 296 bits (758), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 155/332 (46%), Positives = 211/332 (63%), Gaps = 28/332 (8%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PC-----G 276
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+ + + PC G
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRPCTFVRIG 534
Query: 277 LRRGLQRSGIICTLVGMVITIRGMLPVLYSLD 308
+ L CTL+ ++ T + +++D
Sbjct: 535 MTNRLPFFSYCCTLMCLIYTFPSLNFQEFTID 566
>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
gallus]
Length = 536
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 148/273 (54%), Positives = 191/273 (69%), Gaps = 11/273 (4%)
Query: 4 PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
P HQRA GN Y++ EA S + +D+ K V + ER KYEMLCRG+
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEVKKKDYLPERRKYEMLCRGEG 299
Query: 56 LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
L + P +L CRY +RN Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRLRRAT+ N TG LE A+YRISKSAWL E PV+ RI+ R++ +TGL STAEELQ
Sbjct: 359 KPRLRRATISNPITGALETAHYRISKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+W
Sbjct: 419 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
P+KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 479 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511
>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASGDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 295 bits (754), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
garnettii]
Length = 534
Score = 295 bits (754), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSSSDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
troglodytes]
gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
troglodytes]
gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
sapiens]
gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
sapiens]
gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
Length = 511
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 145/274 (52%), Positives = 193/274 (70%), Gaps = 11/274 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL-KDEPPKVNNVAPTLE--------VTEREKYEMLCRG 54
P HQRA GN Y++ + K ++ K ++ TL+ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ RI+ R++ +TGL STAEEL
Sbjct: 356 AKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASV 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Meleagris gallopavo]
Length = 536
Score = 294 bits (752), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 147/273 (53%), Positives = 190/273 (69%), Gaps = 11/273 (4%)
Query: 4 PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
P HQRA GN Y++ EA S + +D+ K + ER KYEMLCRG+
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEFKKKDYLPERRKYEMLCRGEG 299
Query: 56 LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
L + P +L CRY +RN Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRLRRAT+ N TG LE A+YRISKSAWL E PV+ RI+ R++ +TGL STAEELQ
Sbjct: 359 KPRLRRATISNPITGALETAHYRISKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+W
Sbjct: 419 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
P+KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 479 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511
>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
[Oryctolagus cuniculus]
Length = 534
Score = 293 bits (751), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 191/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K + K P+ VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSASDGQSDKKTTPRRKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Loxodonta africana]
Length = 534
Score = 293 bits (750), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 144/274 (52%), Positives = 189/274 (68%), Gaps = 11/274 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE----LKDEPPKVNNVAPTLEVT-----EREKYEMLCRG 54
P HQRA GN Y++ + K + D P + V ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMTKEKDSNKSTSDAPSDQKSTVKKKGVAADYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRI+ + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHDIISDAEIEVVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ RI+ R++ +TGL STAEEL
Sbjct: 356 AKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPDVGASV 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 293 bits (750), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 145/275 (52%), Positives = 192/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLRRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
Length = 507
Score = 293 bits (749), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 190/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELK----DEP------PKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K + D+P PK +A + ER KYEMLCR
Sbjct: 209 PEHQRANGNLRYFEYIMTKEKDTNKSASDDPSDQKTTPKKKGIAVDY-LPERRKYEMLCR 267
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 268 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 327
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG LE +YRISKSAWL E PV+ RI+ R++ +TGL STAEE
Sbjct: 328 LAKPRLRRATISNPITGNLETVHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 387
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AF+ LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 388 LQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 447
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 448 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 482
>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
gallus]
Length = 536
Score = 293 bits (749), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 146/273 (53%), Positives = 192/273 (70%), Gaps = 11/273 (4%)
Query: 4 PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
P HQRA GN Y++ EA S + +D+ K V + ER KYEMLCRG+
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEVKKKDYLPERRKYEMLCRGEG 299
Query: 56 LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
L + P +L CRY +RN Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRL RATV + +TG+L A+YR+SKSAWL E PV+ RI+ R++ +TGL STAEELQ
Sbjct: 359 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+W
Sbjct: 419 VANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
P+KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 479 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511
>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
Length = 534
Score = 293 bits (749), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 193/275 (70%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDE--PPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN +Y++ ++K S + D+ PK +A + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTAPKKKGIAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG LE +YRISKSAWL E PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AF+ LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide [Mus musculus]
gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
musculus]
Length = 534
Score = 293 bits (749), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 193/275 (70%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDE--PPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN +Y++ ++K S + D+ PK +A + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTAPKKKGIAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG LE +YRISKSAWL E PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AF+ LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
Length = 561
Score = 293 bits (749), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 193/275 (70%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDE--PPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN +Y++ ++K S + D+ PK +A + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTAPKKKGIAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG LE +YRISKSAWL E PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AF+ LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
Length = 534
Score = 293 bits (749), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 144/274 (52%), Positives = 191/274 (69%), Gaps = 11/274 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
P HQRA GN Y++ + K S + D+ + ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSSSDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEVVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ RI+ R++ +TGL STAEEL
Sbjct: 356 AKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV GGATVF + S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEVGASV 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Cricetulus griseus]
Length = 534
Score = 293 bits (749), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 190/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELK----DEP------PKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K + D+P PK +A + ER KYEMLCR
Sbjct: 236 PEHQRANGNLRYFEYIMTKEKDTNKSASDDPSDQKTTPKKKGIAVDY-LPERRKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG LE +YRISKSAWL E PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLRRATISNPITGNLETVHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AF+ LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
[Rattus norvegicus]
Length = 534
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 193/275 (70%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN +Y++ ++K S + D+ PK +A + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTTPKKKGIAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG LE +YRISKSAWL E PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AF+ LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
aries]
Length = 534
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 144/274 (52%), Positives = 191/274 (69%), Gaps = 11/274 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
P HQRA GN Y++ + K S + D+ + ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSSSDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ RI+ R++ +TGL STAEEL
Sbjct: 356 AKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV GGATVF + S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEVGASV 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 153/332 (46%), Positives = 210/332 (63%), Gaps = 28/332 (8%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PC-----G 276
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+ + + PC G
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRPCTFVRIG 534
Query: 277 LRRGLQRSGIICTLVGMVITIRGMLPVLYSLD 308
+ L CTL+ ++ T + +++D
Sbjct: 535 MTNRLPFFSYCCTLMCLIYTFPSLNFQEFTID 566
>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1
Length = 516
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 145/273 (53%), Positives = 191/273 (69%), Gaps = 11/273 (4%)
Query: 4 PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
P HQRA GN Y++ EA S + +D+ K V + ER KYEMLCRG+
Sbjct: 220 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTDKETEVKKKDYLPERRKYEMLCRGEG 279
Query: 56 LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
L + P +L CRY +RN Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 280 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 338
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRL RATV + +TG+L A+YR+SKSAWL E PV+ RI+ R++ +TGL STAEELQ
Sbjct: 339 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 398
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+W
Sbjct: 399 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 458
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
P+KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 459 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 491
>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
gallus]
Length = 536
Score = 291 bits (745), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 145/273 (53%), Positives = 191/273 (69%), Gaps = 11/273 (4%)
Query: 4 PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
P HQRA GN Y++ EA S + +D+ K V + ER KYEMLCRG+
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEVKKKDYLPERRKYEMLCRGEG 299
Query: 56 LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
L + P +L CRY +RN Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRL RATV + +TG+L A+YR+SKSAWL E PV+ RI+ R++ +TGL STAEELQ
Sbjct: 359 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+W
Sbjct: 419 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
P+KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 479 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511
>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Danio rerio]
Length = 536
Score = 291 bits (745), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 147/277 (53%), Positives = 192/277 (69%), Gaps = 15/277 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE------------VTEREKYEML 51
P H RA GN Y++ L K + ++E K + L+ + ER+KYE L
Sbjct: 236 PNHHRANGNLKYFEFQLEKQRKAENEK-KEEDQKRVLDKRDAQRKRSKDPLPERKKYERL 294
Query: 52 CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
CRG+ + + P ++L CRY + N P L L P+K+E+ + +PRI+ Y +++ DSEI+ +
Sbjct: 295 CRGEGIKLTPRRQSRLFCRYSNNNRNPRLLLAPVKQEDEWDRPRIVRYHEIISDSEIETV 354
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K+MA+PRLRRAT+ N TG LE A YRISKSAWL EH IERI++R+E +TGL TA
Sbjct: 355 KEMAKPRLRRATISNPITGVLETAPYRISKSAWLSGYEHSTIERINQRIEDVTGLEMDTA 414
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
EELQV NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVFT +
Sbjct: 415 EELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFTDVG 474
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
++WP+KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 475 AAVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511
>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
gallus]
Length = 489
Score = 291 bits (745), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 145/273 (53%), Positives = 191/273 (69%), Gaps = 11/273 (4%)
Query: 4 PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
P HQRA GN Y++ EA S + +D+ K V + ER KYEMLCRG+
Sbjct: 193 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTDKETEVKKKDYLPERRKYEMLCRGEG 252
Query: 56 LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
L + P +L CRY +RN Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 253 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 311
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRL RATV + +TG+L A+YR+SKSAWL E PV+ RI+ R++ +TGL STAEELQ
Sbjct: 312 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 371
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+W
Sbjct: 372 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 431
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
P+KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 432 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 464
>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Monodelphis domestica]
Length = 537
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 144/277 (51%), Positives = 186/277 (67%), Gaps = 14/277 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT------------EREKYEML 51
P HQRA GN Y++ + K + K + P E ER KYEML
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANTSTTKTADEQPEQETAPKRKGRAKDYLPERRKYEML 295
Query: 52 CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
CRG+ L + P +L CRY N P L P K+E+ + +PRI+ + +++ D+EI+++
Sbjct: 296 CRGEGLKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHEIISDAEIEIV 355
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K +A+PRLRRAT+ N TG LE A+YRISKSAWL E PV+ RI+ R++ +TGL STA
Sbjct: 356 KDLAKPRLRRATISNPITGVLETAHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTA 415
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
EELQV NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVF +
Sbjct: 416 EELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVG 475
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
S+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 ASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 512
>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1 [Nomascus leucogenys]
Length = 502
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 144/275 (52%), Positives = 191/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 204 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 262
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 263 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 322
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 323 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 382
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 383 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 442
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 443 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 477
>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
Length = 534
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K S + D+ PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSASDDQSDQKTTPKKKGVAADY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 144/275 (52%), Positives = 191/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASGDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 291 bits (744), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 144/275 (52%), Positives = 191/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 291 bits (744), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 144/275 (52%), Positives = 191/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
garnettii]
Length = 534
Score = 291 bits (744), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 144/275 (52%), Positives = 191/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSSSDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
jacchus]
Length = 534
Score = 291 bits (744), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 145/275 (52%), Positives = 192/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE-----LKDEP-----PKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K + L D+ PK +A + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSALDDQSDQKTTPKKKGIAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
sapiens]
gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
troglodytes]
gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I variant [Homo
sapiens]
gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
sapiens]
gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
sapiens]
gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 290 bits (743), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 144/275 (52%), Positives = 191/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 548
Score = 290 bits (743), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 151/307 (49%), Positives = 199/307 (64%), Gaps = 33/307 (10%)
Query: 4 PTHQRAQGNKLYYQEALNKS-----------------PELKDEPPKVNNVAPTLE----V 42
P HQRA+GN Y++ L K P++ +E K + T +
Sbjct: 238 PEHQRAKGNLKYFEFQLEKQRKDAEEETTKEKEEREEPDITEEKKKKKKKSQTKSTFQLI 297
Query: 43 TEREKYEMLCRGD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDV 100
ER+KYEMLCRG+ + + P ++L CRY N P L P+K+++ + +P I+ Y D+
Sbjct: 298 PERKKYEMLCRGEGIKMTPRRQSRLFCRYYDNNHNPKYVLSPVKQQDEWDRPYIVRYIDI 357
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
+ D EI+ +KK+A+PRLRRAT+ N TG LE A+YRISKSAWL EHPVIE I++R+E
Sbjct: 358 ISDKEIETVKKLAKPRLRRATISNPITGVLETASYRISKSAWLTGYEHPVIEIINQRIED 417
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
+TGL TAEELQV NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDVA G
Sbjct: 418 LTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVAAG 477
Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------- 273
GATVF + ++WP+KGTA FW+NL ++G+GDY TRHAACPVL G+ + +
Sbjct: 478 GATVFPDVGAAVWPQKGTAVFWYNLFANGEGDYSTRHAACPVLVGNKWVSNKWIHERGQE 537
Query: 274 ---PCGL 277
PCGL
Sbjct: 538 WRRPCGL 544
>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
Length = 488
Score = 290 bits (743), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 144/276 (52%), Positives = 191/276 (69%), Gaps = 13/276 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 190 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 248
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 249 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 308
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 309 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 368
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 369 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 428
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 429 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNK 464
>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
Length = 534
Score = 290 bits (742), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 145/275 (52%), Positives = 193/275 (70%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN +Y++ ++K S E D+ PK +A + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGERADQKTTPKKKGIAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AF+ LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
melanoleuca]
Length = 534
Score = 290 bits (742), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 143/274 (52%), Positives = 192/274 (70%), Gaps = 11/274 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL-KDEPPKVNNVAPTLE--------VTEREKYEMLCRG 54
P HQRA GN Y++ + K ++ K ++ TL+ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASV 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
Length = 536
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 145/273 (53%), Positives = 190/273 (69%), Gaps = 11/273 (4%)
Query: 4 PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
P HQRA GN Y++ EA S + +D+ K V + ER KYEMLCRG+
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDSEDQAEKETEVKKKDYLPERRKYEMLCRGEG 299
Query: 56 LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
L + P +L CRY +RN Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRL RATV + +TG+L A+YR+SKSAWL E PV+ RI+ R++ +TGL STAEELQ
Sbjct: 359 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+W
Sbjct: 419 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
P KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 479 PRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511
>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
[Papio anubis]
Length = 379
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 144/276 (52%), Positives = 191/276 (69%), Gaps = 13/276 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 81 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 139
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 140 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 199
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 200 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 259
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 260 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 319
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 320 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNK 355
>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
musculus]
Length = 534
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 144/275 (52%), Positives = 193/275 (70%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDE--PPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN +Y++ ++K S + D+ PK +A + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTAPKKKGIAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AF+ LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
lupus familiaris]
Length = 534
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 142/274 (51%), Positives = 191/274 (69%), Gaps = 11/274 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
P HQRA GN Y++ + K S + D+ + ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSASDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASV 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Cricetulus griseus]
Length = 534
Score = 290 bits (741), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 144/275 (52%), Positives = 190/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELK----DEP------PKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K + D+P PK +A + ER KYEMLCR
Sbjct: 236 PEHQRANGNLRYFEYIMTKEKDTNKSASDDPSDQKTTPKKKGIAVDY-LPERRKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AF+ LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Meleagris gallopavo]
Length = 536
Score = 290 bits (741), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 144/273 (52%), Positives = 190/273 (69%), Gaps = 11/273 (4%)
Query: 4 PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
P HQRA GN Y++ EA S + +D+ K + ER KYEMLCRG+
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEFKKKDYLPERRKYEMLCRGEG 299
Query: 56 LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
L + P +L CRY +RN Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRL RATV + +TG+L A+YR+SKSAWL E PV+ RI+ R++ +TGL STAEELQ
Sbjct: 359 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+W
Sbjct: 419 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
P+KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 479 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511
>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
[Rattus norvegicus]
Length = 534
Score = 290 bits (741), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 144/275 (52%), Positives = 193/275 (70%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN +Y++ ++K S + D+ PK +A + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTTPKKKGIAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AF+ LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
Length = 573
Score = 290 bits (741), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 154/316 (48%), Positives = 200/316 (63%), Gaps = 34/316 (10%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
I P H RA+GN +Y++ L + D PP VN + ER+ YE LCRG+ +PP
Sbjct: 251 IAPNHPRAKGNVKWYEDMLQGKDMVGDLPPIVNKRVEFDGIVERDAYEALCRGE--IPPV 308
Query: 62 ---IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
+L+C Y+ R+ P+L++ P+K E P +L+++V+ DSEI +IK++A P+L+
Sbjct: 309 EKKWKNKLRC-YLKRDKPFLKIAPIKVEILRFDPLAVLFKNVISDSEIKVIKELASPKLK 367
Query: 119 RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYG 178
RATVQN KTGELE A YRISKSAWL+ HPVIER++RR+E TGL T+EELQV NYG
Sbjct: 368 RATVQNSKTGELEHATYRISKSAWLKGDLHPVIERVNRRIEDFTGLYQGTSEELQVANYG 427
Query: 179 IGGHYEPHYDFARPG------------------EANAFKSLGTGNRVATVLFYMSDVAQG 220
+GGHY+PH+DFAR E NAFK+L TGNR+ATVLFYMS +G
Sbjct: 428 LGGHYDPHFDFARIANYGLGGHYEPHYDMSLKEEKNAFKTLNTGNRIATVLFYMSQPERG 487
Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHS---- 271
GATVF L +++P K A FW+NL G+GD TRHAACPVL G SN +H
Sbjct: 488 GATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQE 547
Query: 272 -TCPCGLRRGLQRSGI 286
T PCGL G+Q + I
Sbjct: 548 FTRPCGLEEGVQENFI 563
>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
[Oryctolagus cuniculus]
Length = 534
Score = 289 bits (740), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 144/275 (52%), Positives = 190/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K + K P+ VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSASDGQSDKKTTPRRKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Loxodonta africana]
Length = 534
Score = 289 bits (740), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 142/274 (51%), Positives = 188/274 (68%), Gaps = 11/274 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE----LKDEPPKVNNVAPTLEVT-----EREKYEMLCRG 54
P HQRA GN Y++ + K + D P + V ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMTKEKDSNKSTSDAPSDQKSTVKKKGVAADYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRI+ + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHDIISDAEIEVVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPDVGASV 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
Length = 545
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 143/281 (50%), Positives = 186/281 (66%), Gaps = 18/281 (6%)
Query: 2 IFPTHQRAQGNKLYYQEAL------NKSPELKDEP----------PKVNNVAPTLEVTER 45
I P HQRA GNK +Y++ L + E+ DE K+ P
Sbjct: 240 IVPYHQRALGNKRHYEKLLRQLGVTERRGEIGDEDNIDMSEPFDTTKLKLTKPPGTTEHW 299
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
+ YE LCRG+ + P + +L+CRYV NVPY + P+K EEA L+PRI++Y D++ D E
Sbjct: 300 DVYEQLCRGEKLMDPKLEGRLRCRYVTNNVPYFYIQPIKMEEALLKPRIVVYHDIISDEE 359
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
I+ IK++AQPR RATVQ ++GE E + YRI+KSAWL+ EH + I+ RV +TGL
Sbjct: 360 IETIKRLAQPRFERATVQKKESGEREFSRYRIAKSAWLKHEEHDYVSDINFRVGDITGLD 419
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
+T+E+LQV NYGIGGHYEPHYD+AR GE + G G R+AT LFYMSDV GGATVF
Sbjct: 420 MATSEDLQVCNYGIGGHYEPHYDYARKGEVQ--QDFGWGGRIATWLFYMSDVEAGGATVF 477
Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
LNLSLWP+KG+AAFW NL+ +G+G+ T+HA CPVLTGS
Sbjct: 478 PKLNLSLWPQKGSAAFWFNLYPNGEGNEMTQHAGCPVLTGS 518
>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
caballus]
Length = 302
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K S + D+ PK VA + ER+KYEMLCR
Sbjct: 4 PEHQRANGNLKYFEYIMAKEKDDNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 62
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 63 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 122
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 123 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 182
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 183 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 242
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 243 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 277
>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
Length = 526
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 144/275 (52%), Positives = 193/275 (70%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDE--PPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN +Y++ ++K S + D+ PK +A + ER+KYEMLCR
Sbjct: 228 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTAPKKKGIAVDY-LPERQKYEMLCR 286
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 287 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKY 346
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E PV+ RI+ R++ +TGL STAEE
Sbjct: 347 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 406
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AF+ LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 407 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 466
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 467 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 501
>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
Length = 534
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 142/274 (51%), Positives = 190/274 (69%), Gaps = 11/274 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
P HQRA GN Y++ + K S + D+ + ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSSSDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEVVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV GGATVF + S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEVGASV 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
porcellus]
Length = 534
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 142/274 (51%), Positives = 191/274 (69%), Gaps = 11/274 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
P HQRA GN Y++ + K S + D+ + ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDDNKSTSGDQSDQKSTLRKKGIAVDYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASV 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
kowalevskii]
Length = 533
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 146/282 (51%), Positives = 183/282 (64%), Gaps = 21/282 (7%)
Query: 1 MIFPTHQRAQGNKLYYQEALNK---------------SPELKDEPPKVNNVAPTLEVTER 45
++ P H R GNK Y+++ L K E + +N+ P ER
Sbjct: 231 LLDPEHVRGLGNKAYFEQELAKYNRQRGDDADVPGEEEKEFLESHKPLNDYLP-----ER 285
Query: 46 EKYEMLCRGD-LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
E YE LCRG+ + + P +LKCR N P+L L P KEE + +P++I++ D + +
Sbjct: 286 EAYEALCRGEQVKMSPQRQKKLKCRLRDYNRPFLILQPAKEEVVFDKPKLIIFHDAILTN 345
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
EI +K +A PRLRRAT+QN TG LE A YRISKSAWL E + V+ R++ R+E TGL
Sbjct: 346 EIRKVKALASPRLRRATIQNSVTGNLEFAEYRISKSAWLSEDDGDVVHRLNHRIEQYTGL 405
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
T TAEELQV NYG+GGHYEPH+DFAR E NAFKSL TGNR+AT LFYMSDV GGATV
Sbjct: 406 TMDTAEELQVANYGLGGHYEPHFDFARKEEINAFKSLNTGNRIATFLFYMSDVEAGGATV 465
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F + L PEKG+AAFW+NL +G+GDY TRHAACPVL GS
Sbjct: 466 FPQVGARLIPEKGSAAFWYNLLKNGEGDYSTRHAACPVLVGS 507
>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
aries]
Length = 534
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 142/274 (51%), Positives = 190/274 (69%), Gaps = 11/274 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
P HQRA GN Y++ + K S + D+ + ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSSSDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV GGATVF + S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEVGASV 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509
>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
Length = 549
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 152/306 (49%), Positives = 197/306 (64%), Gaps = 30/306 (9%)
Query: 2 IFPTHQRAQGNKLYYQEALNK---SPEL-KDEPPKVNNVAP-------------TLEVTE 44
I P HQRA GNK +Y++ L + PE K E V P T ++
Sbjct: 238 IVPYHQRAIGNKKHYEDVLRQLGVIPEHGKTEDSDVGMSEPFNTANLKLKKPPGTFGISN 297
Query: 45 R--EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMY 102
+KYE LCRG+ + P I L+CRYV N PY + PLK EEA+L+P +++Y DV++
Sbjct: 298 DHWDKYEKLCRGEKLMDPKIEGHLRCRYVTNNEPYFFIQPLKMEEAFLKPLLVIYHDVIF 357
Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
D EI+ +KK+A PR +R TV N TG+LE A YRISK+A+L+ EH + ++SRRV +T
Sbjct: 358 DEEIETVKKLAHPRFKRTTVMNSATGKLETAKYRISKAAFLKNKEHHHVLKMSRRVGAIT 417
Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAF-KSLGTGNRVATVLFYMSDVAQGG 221
GL STAE+LQV NYGIGGHYEPH+D+AR E F K G NR+AT LFYMSDV GG
Sbjct: 418 GLDMSTAEDLQVCNYGIGGHYEPHFDYARKNETIGFNKDSGWRNRIATWLFYMSDVEAGG 477
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC-------- 273
ATVF +LN++LWP+KG+AAFW+NL +G+G+ TRHAACPVLTGS + +
Sbjct: 478 ATVFPALNVALWPQKGSAAFWYNLFPNGEGNELTRHAACPVLTGSKWVANKWIHEKNQEL 537
Query: 274 --PCGL 277
PCGL
Sbjct: 538 RRPCGL 543
>gi|405965633|gb|EKC30995.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 617
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 144/279 (51%), Positives = 185/279 (66%), Gaps = 15/279 (5%)
Query: 2 IFPTHQRAQGNKLYYQEALNKS------------PELKDEPPKVNNVAPTLEV---TERE 46
+ P H RAQ N+ YY++ L + E K E P P E E +
Sbjct: 313 LLPHHTRAQNNRKYYEKLLEEQRRKQYRRGEDGGEEDKTEEPNKYTERPLDEYRKSDEFQ 372
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
YE LCRG+ T + +LKCRYVH+N P L L P KEEE YL P I++Y DV+ D EI
Sbjct: 373 TYESLCRGEDTHDYKLKHKLKCRYVHKNNPRLLLKPAKEEEVYLNPWIVIYHDVVSDKEI 432
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
D IK++A P L RATV N +TG+LE A YR+SKSAWL++ + PVI ++ R+ +TGL+
Sbjct: 433 DTIKRIATPLLSRATVHNPRTGKLETAEYRVSKSAWLKDGDDPVIHNVNNRISDITGLSM 492
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+TAEELQ+ NYG+GG YEPH+DFAR E AF+ LG+GNR+AT L YM++V GGATVFT
Sbjct: 493 ATAEELQIANYGLGGQYEPHFDFARREETEAFRDLGSGNRIATWLTYMTNVDAGGATVFT 552
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ + L+P KG AAFW+NL+ SGDG + TRHAACPVL G
Sbjct: 553 HIGVKLFPIKGAAAFWYNLYRSGDGIFDTRHAACPVLVG 591
>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
guttata]
Length = 536
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 144/274 (52%), Positives = 190/274 (69%), Gaps = 11/274 (4%)
Query: 4 PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
P HQRA GN Y++ EA S + +++ K V + ER KYEMLCRG+
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDSEEQQEKETEVKKKDYLPERRKYEMLCRGEG 299
Query: 56 LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
L + P +L CRY +RN Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRL RATV + +TG+L A+YR+SKSAWL E PV+ RI+ R++ +TGL STAEELQ
Sbjct: 359 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+W
Sbjct: 419 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
P KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 479 PRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNK 512
>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
anatinus]
Length = 493
Score = 287 bits (735), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 141/276 (51%), Positives = 186/276 (67%), Gaps = 14/276 (5%)
Query: 6 HQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT------------EREKYEMLCR 53
HQRA GN Y++ + K + P+ ++ P E T ER KYEMLCR
Sbjct: 194 HQRANGNLKYFEYIMAKEKDANKSTPQTSDDQPEQETTPKKKGRVKDYLPERRKYEMLCR 253
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRI+ Y +++ D+EI+ +K
Sbjct: 254 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRYHEIISDAEIETVKD 313
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E PV+ RI+ R++ +TGL STAEE
Sbjct: 314 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 373
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 374 LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 433
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 434 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNK 469
>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Monodelphis domestica]
Length = 537
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 141/277 (50%), Positives = 185/277 (66%), Gaps = 14/277 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT------------EREKYEML 51
P HQRA GN Y++ + K + K + P E ER KYEML
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANTSTTKTADEQPEQETAPKRKGRAKDYLPERRKYEML 295
Query: 52 CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
CRG+ L + P +L CRY N P L P K+E+ + +PRI+ + +++ D+EI+++
Sbjct: 296 CRGEGLKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHEIISDAEIEIV 355
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K +A+PRL RATV + +TG+L A YR+SKSAWL E PV+ RI+ R++ +TGL STA
Sbjct: 356 KDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTA 415
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
EELQV NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVF +
Sbjct: 416 EELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVG 475
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
S+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 ASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 512
>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
Length = 454
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 143/275 (52%), Positives = 192/275 (69%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDE--PPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN +Y++ ++K S + D+ PK +A + ER+KYEMLCR
Sbjct: 156 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTAPKKKGIAVDY-LPERQKYEMLCR 214
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+E +++K
Sbjct: 215 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAENEIVKD 274
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E PV+ RI+ R++ +TGL STAEE
Sbjct: 275 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 334
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AF+ LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 335 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 394
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 395 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 429
>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
harrisii]
Length = 385
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 141/278 (50%), Positives = 186/278 (66%), Gaps = 14/278 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE------------VTEREKYEML 51
P HQRA GN Y++ + K + K P E ++ER KYEML
Sbjct: 84 PEHQRANGNLKYFEYIMAKEKDTNKSTTKSAADQPEQESAPKRKGRAKDYLSERRKYEML 143
Query: 52 CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
CRG+ L + P +L CRY N P L P K+E+ + +PRI+ + +++ D+EI+++
Sbjct: 144 CRGEGLKMTPQRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHEIISDAEIEIV 203
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K +A+PRL RATV + +TG+L A YR+SKSAWL E PV+ RI+ R++ +TGL STA
Sbjct: 204 KDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTA 263
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
EELQV NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVF +
Sbjct: 264 EELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVG 323
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
S+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 324 ASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNK 361
>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 523
Score = 285 bits (730), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 138/270 (51%), Positives = 187/270 (69%), Gaps = 6/270 (2%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----KDEPPKVNNVAPTLEVTEREKYEMLCRGD-LTV 58
PTHQRA GN Y++ L+K + + E + A + ER KYE LCRG +
Sbjct: 230 PTHQRANGNLKYFEYQLSKQKKAVQMNESEEDQKGAQADDEYLLERRKYEQLCRGQGALM 289
Query: 59 PPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P +++L CRY + + P + P+K+E+ + P I+ Y DV + E++ +K++A+PRL
Sbjct: 290 TPRRLSRLFCRYFNNHGHPNYLIGPVKQEDEWDSPYIVRYHDVASEKEMETVKELAKPRL 349
Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
RRATV + +TG+L A YR+SKSAWL EHP+++RI++R+E +TGL STAE+LQV NY
Sbjct: 350 RRATVHDPQTGKLTTAQYRVSKSAWLGSHEHPIVDRINQRIEDITGLDVSTAEDLQVANY 409
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
G+GG YEPH+DF R EA+AF+ LGTGNR+AT L YMSDV GG TVFT + +WP+KG
Sbjct: 410 GVGGQYEPHFDFGRKDEADAFEELGTGNRIATWLLYMSDVQAGGNTVFTDIGAVVWPKKG 469
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
TA FW+NLH SG+GDY TRHAACPVL G+
Sbjct: 470 TAVFWYNLHRSGEGDYRTRHAACPVLVGNK 499
>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
Length = 543
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 145/278 (52%), Positives = 193/278 (69%), Gaps = 15/278 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELK--------DEPP----KVNNVAPTLE-VTEREKYEM 50
P HQRA GN Y++ + K E + DE P K P+ + + ER+KYE
Sbjct: 241 PGHQRANGNLKYFEYIMVKEKEKEANESVTDTDEQPGKKVKTQKRGPSKDYLPERQKYEK 300
Query: 51 LCRGD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
LCRG+ L + P +L CRY + N P L P+++E+ + +PRI+ + D++ + EI+
Sbjct: 301 LCRGEGLKMTPRREKKLFCRYYNGNGNPNYILGPVRQEDEWDRPRIVRFLDIISNEEIEK 360
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
+K++++PRLRRAT+ N TG LE A+YRISKSAWL E+PV+ RI++R++ +TGL ST
Sbjct: 361 VKELSKPRLRRATISNPITGVLETAHYRISKSAWLSGYENPVVARINQRIQDLTGLDVST 420
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQV NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDVA GGATVF +
Sbjct: 421 AEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPEV 480
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
S+WP+KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 481 GASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 518
>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
niloticus]
Length = 536
Score = 285 bits (728), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 150/299 (50%), Positives = 197/299 (65%), Gaps = 25/299 (8%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKD-----EPPKVNNVA------PTLEVTEREKYEM 50
I P+HQRA GN Y+++ L K +L++ +PP + P + ERE YE
Sbjct: 236 IDPSHQRAGGNLRYFEQLLMK--QLREMNQDYQPPSEEPIQLGTYSRPKDHLPERESYEA 293
Query: 51 LCRGD-LTVPPAIVAQLKCRYVH-RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
LCRG+ + + A ++L CRY + P+L L P+KEE+ + P I+ Y D++ D EI+
Sbjct: 294 LCRGEGIQMTEARRSRLFCRYHDGKRNPHLLLKPVKEEDEWDSPHIVRYLDLLSDEEIEK 353
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK++A+PRL RATV++ KTG L ANYR+SKSAWL E PVI+R+++R+E +TGLT T
Sbjct: 354 IKELAKPRLARATVRDPKTGVLTTANYRVSKSAWLEGEEDPVIDRVNQRIEAITGLTVET 413
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AE LQV NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF
Sbjct: 414 AELLQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF 473
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
++WP KGT+ FW+NL SG+GDY TRHAACPVL GS + + PCGL
Sbjct: 474 GAAIWPRKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIHERGQEFRRPCGL 532
>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 531
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 139/266 (52%), Positives = 189/266 (71%), Gaps = 4/266 (1%)
Query: 5 THQRAQGNKLYYQEALNKSPEL--KDEPPKVNNVAPTLEVTEREKYEMLCRGD-LTVPPA 61
THQRA GN+ Y++ L K ++ ++ + N P +ER+KYE LCRG+ L +
Sbjct: 241 THQRATGNRKYFEYQLAKQNKVAQSEQGGRDENHQPNDYRSERKKYEQLCRGEGLKMTAR 300
Query: 62 IVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
+QL CRY P + P+K+E+ + +P I+ Y D++ + E++ +K++A+PRLRRA
Sbjct: 301 RQSQLFCRYYDNGRHPKYVIGPVKQEDEWDRPHIVRYHDILSNREMETVKELAKPRLRRA 360
Query: 121 TVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
TV + +TG+L A YR+SKSAWL EHPV++RI++R+E +TGL STAE+LQV NYG+G
Sbjct: 361 TVHDPQTGQLTTAPYRVSKSAWLGAFEHPVVDRINQRIEDITGLDVSTAEDLQVANYGVG 420
Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
G YEPHYDF R E +AFK LGTGNR+AT L YMS+V GGATVFT + S+ P+KG+A
Sbjct: 421 GQYEPHYDFGRKDEPDAFKELGTGNRIATWLLYMSEVQAGGATVFTDIGASVSPKKGSAV 480
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGS 266
FW+NLH SGDGDY TRHAACPVL G+
Sbjct: 481 FWYNLHPSGDGDYRTRHAACPVLLGN 506
>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 615
Score = 283 bits (725), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 148/305 (48%), Positives = 198/305 (64%), Gaps = 31/305 (10%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL-KDEPPKVNNVAP--TLE----------------VTE 44
P H R + N Y++ L K + ++E PK T E + E
Sbjct: 307 PEHPRGKSNLKYFEFQLEKQKKAAEEEAPKQKEREKRETAEKKKKKKQKKSKKAFSLIPE 366
Query: 45 REKYEMLCRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMY 102
REKYEMLCRG+ + + P ++L CRY N P L L P+K+++ + +P I+ Y D++
Sbjct: 367 REKYEMLCRGEGIKMTPRRQSRLFCRYYDNNRNPSLLLAPVKQQDEWDRPYIVRYLDIIS 426
Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
D+EI+ +K++A+PRLRRAT+ N TG LE A+YRISKSAWL E + P+IE+I+ R+E +T
Sbjct: 427 DAEIERVKQLAKPRLRRATISNPITGVLETASYRISKSAWLTEYDDPMIEKINDRIEGVT 486
Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
GL TAEELQV NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGA
Sbjct: 487 GLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGA 546
Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC--------- 273
TVF + ++WP+KGTA FW+NL +SG+GDY TRHAACPVL G+ + +
Sbjct: 547 TVFPDVGAAVWPQKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEWR 606
Query: 274 -PCGL 277
PCGL
Sbjct: 607 RPCGL 611
>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
Length = 541
Score = 283 bits (725), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 147/295 (49%), Positives = 194/295 (65%), Gaps = 20/295 (6%)
Query: 4 PTHQRAQGNKLYY------QEALNKSPELKDEPPKVNNVAPTLE--VTEREKYEMLCRGD 55
P HQRA GN Y+ Q+ K K+E K + + + E+ KYE LCRG+
Sbjct: 244 PEHQRALGNLKYFDYQLAKQKKAEKEQSTKEESKKEQETSDGKKEYLPEKRKYEKLCRGE 303
Query: 56 -LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
L + P L CRY + N P+ + P+K+E+ + +PRII Y +++ + EI+ IK+++
Sbjct: 304 GLRMTPRRQKHLFCRYFNGNRHPFYTIGPVKQEDEWDRPRIIRYHEIITEQEIEKIKELS 363
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRLRRAT+ N TG LE A+YRISKSAWL EHPV++RI++R+E +TGL TAEELQ
Sbjct: 364 KPRLRRATISNPITGVLETAHYRISKSAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQ 423
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDVA GGATVF + ++
Sbjct: 424 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPEVGAAVK 483
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLR 278
P KGTA FW+NL SG+GDY TRHAACPVL G+ + + PCGL+
Sbjct: 484 PLKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRPCGLK 538
>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
carolinensis]
Length = 542
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 141/278 (50%), Positives = 193/278 (69%), Gaps = 16/278 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE------LKDEPPKVNNVAPTLE------VTEREKYEML 51
P HQRA GN Y++ ++K E L + K + + + + ER+KYEML
Sbjct: 241 PEHQRANGNLKYFEYIMSKEKEKEANKSLSETDEKTGKESKSKKGPSKDYLPERQKYEML 300
Query: 52 CRGD-LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
CRG+ L + P +L CRY +RN Y+ L P+K+E+ + +PRI+ + +++ D EI+
Sbjct: 301 CRGEGLKMTPRRQKKLFCRYYDGNRNPKYI-LRPVKQEDEWDRPRIVRFVEIISDEEIET 359
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
+K++A+PRL RATV + +TG+L A+YR+SKSAWL E+P++ RI+ R++ +TGL ST
Sbjct: 360 VKELAKPRLSRATVHDPQTGKLTTAHYRVSKSAWLSGYENPIVARINTRIQDLTGLDVST 419
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AEELQV NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGATVF +
Sbjct: 420 AEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV 479
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
S+WP KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 480 GASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 517
>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
musculus]
gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
musculus]
Length = 537
Score = 283 bits (724), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 144/272 (52%), Positives = 188/272 (69%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + S + N+ PT + ER+ YE LCRG+
Sbjct: 240 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLATQENLYERPTDYLPERDVYESLCRGE 299
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 300 GVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 359
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 360 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 419
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 420 VANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 479
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 480 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 511
>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
musculus]
Length = 506
Score = 283 bits (723), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 144/272 (52%), Positives = 188/272 (69%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + S + N+ PT + ER+ YE LCRG+
Sbjct: 209 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLATQENLYERPTDYLPERDVYESLCRGE 268
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 269 GVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 328
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 329 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 388
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 389 VANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 448
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 449 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 480
>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
boliviensis boliviensis]
gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
boliviensis boliviensis]
Length = 535
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 143/272 (52%), Positives = 189/272 (69%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L + E L ++ P+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRNDERDAFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
catus]
Length = 535
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 142/272 (52%), Positives = 186/272 (68%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVA--------PTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L + E +A P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAGLATQESIYERPVDYLPERDIYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
Length = 537
Score = 282 bits (721), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 143/272 (52%), Positives = 188/272 (69%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + S + N+ PT + ER+ YE LCRG+
Sbjct: 240 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLATQENLYERPTDYLPERDVYESLCRGE 299
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 300 GVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 359
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 360 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 419
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 420 VANYGMGGQYEPHFDFSRSDDEDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 479
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 480 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 511
>gi|351706369|gb|EHB09288.1| Prolyl 4-hydroxylase subunit alpha-2 [Heterocephalus glaber]
Length = 535
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 145/276 (52%), Positives = 185/276 (67%), Gaps = 18/276 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTE------------REKYEML 51
P+H+RA GN Y++ L E + P N TL E RE YE L
Sbjct: 238 PSHERAGGNLRYFERLL----EEERRKPLSNQTEATLAAQEGVYDRPMDYLPEREVYESL 293
Query: 52 CRGD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
CRG+ + + P +L CRY H N P L + P KEE+ + P I+ Y +VM D EID I
Sbjct: 294 CRGEGVKLTPQRQKRLFCRYHHGNRAPELLIAPFKEEDEWDSPHIVRYYNVMSDEEIDRI 353
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++++TGLT TA
Sbjct: 354 KELAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQYITGLTVQTA 413
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E LQV NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L
Sbjct: 414 ELLQVANYGMGGQYEPHFDFSRNHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLG 473
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+LWP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 474 AALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 535
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 143/272 (52%), Positives = 187/272 (68%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + S + N+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLASQENLYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GIKMTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
Length = 541
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 145/301 (48%), Positives = 192/301 (63%), Gaps = 17/301 (5%)
Query: 2 IFPTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPT--LEVTEREKYEMLCRGD 55
I P H RA+ N +Y++ L K + + P V N PT LE E + YE LCR +
Sbjct: 237 IDPNHPRAKNNIKWYEDLLAEEGLKPIDYRRNIPPVTNPRPTTGLETAEHDIYEALCRNE 296
Query: 56 LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
+ V + ++L C Y + P+LRL P K E P +L+RDV+ D E+ +I+ +A P
Sbjct: 297 IPVSIKVTSKLYC-YYKMDRPFLRLAPFKVEILRFNPLAVLFRDVITDEEVTMIQMLATP 355
Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
RLRRATVQN TGELE A+YR SKSAWL++ EH V+ RI++R++ MT L T+EELQV
Sbjct: 356 RLRRATVQNSITGELETASYRTSKSAWLKDEEHEVVHRINKRIDLMTNLEQETSEELQVG 415
Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
NYGIGGHY+PH+DFAR E NAF+SL TGNR+AT+LFYM+ GGATVFT + ++ P
Sbjct: 416 NYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVFTEVKTTVMPS 475
Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQRSG 285
K A FW+NL SG+GD TRHAACPVLTG+ + + PCGL R ++
Sbjct: 476 KNDALFWYNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQEFRRPCGLSRSVEEQF 535
Query: 286 I 286
+
Sbjct: 536 V 536
>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
Length = 541
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 149/306 (48%), Positives = 194/306 (63%), Gaps = 32/306 (10%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKD-------------EPPKVNNVAPT-------LE 41
I P HQR N YY+E L++ E++ EP + + T +
Sbjct: 232 IVPFHQRGLSNIQYYREILHQQGEIQFQQQHETAGANSTIEPFNTSKLKLTKPSGTAGIP 291
Query: 42 VTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVM 101
+ KYE LCRG+ + P I A+L+CRYV NVPY + P+K E A L+PR+++Y +V+
Sbjct: 292 AEQWNKYERLCRGEKLMDPKIEARLRCRYVTNNVPYFFIQPIKMELASLKPRLVIYHNVV 351
Query: 102 YDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHM 161
D EI+ KK+AQ RLRR+TVQN TG E YRI+K+A+L+ EH I +++RR+ +
Sbjct: 352 TDEEIETAKKLAQSRLRRSTVQNSLTGASEPTKYRIAKAAFLQNSEHDHIVKMTRRIGDV 411
Query: 162 TGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
TGL +TAEELQV NYGIGGHYEPHYD AR GE K G GNR+AT +FYMSDV GG
Sbjct: 412 TGLDMTTAEELQVCNYGIGGHYEPHYDHARKGEVQ--KDFGWGNRIATWMFYMSDVEAGG 469
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC-------- 273
ATVF +NL+LWP+KG+AAFW NLH +G+GD T+HAACPVLTGS + +
Sbjct: 470 ATVFPQINLALWPQKGSAAFWFNLHPNGEGDDLTQHAACPVLTGSKWVSNKWIHERNQEF 529
Query: 274 --PCGL 277
PCGL
Sbjct: 530 RRPCGL 535
>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
abelii]
Length = 535
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 141/272 (51%), Positives = 186/272 (68%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L+ E + P+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E + FK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
troglodytes]
gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
troglodytes]
gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
paniscus]
gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
paniscus]
gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 535
Score = 281 bits (718), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 141/272 (51%), Positives = 186/272 (68%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L+ E + P+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDIYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E + FK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
leucogenys]
Length = 537
Score = 281 bits (718), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 142/272 (52%), Positives = 188/272 (69%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L + E L ++ P+ P + ER+ YE LCRG+
Sbjct: 240 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 299
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 300 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 359
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 360 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 419
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E + FK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 420 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 479
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 480 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 511
>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_c [Rattus norvegicus]
Length = 506
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 143/272 (52%), Positives = 187/272 (68%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + S + N+ P + ER+ YE LCRG+
Sbjct: 209 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLASQENLYERPVDYLPERDVYESLCRGE 268
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 269 GIKMTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 328
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 329 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 388
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 389 VANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 448
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 449 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 480
>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide precursor [Salmo
salar]
gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
Length = 545
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 141/277 (50%), Positives = 186/277 (67%), Gaps = 16/277 (5%)
Query: 6 HQRAQGNKLYYQEALNKSPELKDEP--------------PKVNNVAPTLEVTEREKYEML 51
HQRA GN Y++ L K +++ E P + ER KYE L
Sbjct: 244 HQRANGNLKYFEYQLAKQKKVEAEEGLKEKEKREREKREASEKKGRPADYLPERRKYEQL 303
Query: 52 CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
CRG+ + + P +++ CRY N P L P+K+E+ + +PRII Y DV+ +SEI+ +
Sbjct: 304 CRGEGIKMTPRRQSRMFCRYSDNNRHPLYVLGPVKQEDEWDRPRIIRYHDVLSNSEIEKV 363
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K++A+PRLRRAT+ N TG LE A+YRISKSAWL E PV+++I++R+E +TGL TA
Sbjct: 364 KELAKPRLRRATISNPITGVLETAHYRISKSAWLTAYEDPVVDKINQRIEDITGLNVKTA 423
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
EELQV NYG+GG YEPH+DF R E +AFK LGTGNR+AT L YMSDV GGATVFT +
Sbjct: 424 EELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLIYMSDVPSGGATVFTDVG 483
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
++WP+KG+A FW+NL SG+GDY TRHAACPVL G+
Sbjct: 484 AAVWPKKGSAVFWYNLFPSGEGDYSTRHAACPVLVGN 520
>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 577
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 141/272 (51%), Positives = 186/272 (68%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L+ E + P+ P + ER+ YE LCRG+
Sbjct: 280 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 339
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 340 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 399
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 400 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 459
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E + FK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 460 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 519
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 520 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 551
>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
Length = 535
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 141/272 (51%), Positives = 185/272 (68%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L E + P+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E + FK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
leucogenys]
Length = 558
Score = 280 bits (717), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 142/272 (52%), Positives = 188/272 (69%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L + E L ++ P+ P + ER+ YE LCRG+
Sbjct: 261 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 320
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 321 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 380
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 381 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 440
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E + FK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 441 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 500
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 501 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 532
>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_f
[Homo sapiens]
Length = 567
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 141/272 (51%), Positives = 185/272 (68%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L E + P+ P + ER+ YE LCRG+
Sbjct: 270 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 329
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 330 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 389
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 390 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 449
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E + FK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 450 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 509
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 510 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 541
>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Loxodonta africana]
Length = 536
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 142/273 (52%), Positives = 186/273 (68%), Gaps = 11/273 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNN---VAPTLEVTEREKYEMLCRG 54
P+H+RA GN Y++ L + S + D P P + ER+ YE LCRG
Sbjct: 238 PSHERAGGNLRYFEHLLEEERKKTLSNQTMDAEPATREGIYERPVDYLPERDVYESLCRG 297
Query: 55 D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++
Sbjct: 298 EGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKQI 357
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ +++RR++H+TGLT TAE L
Sbjct: 358 AKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAQVNRRMQHITGLTVKTAELL 417
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++
Sbjct: 418 QVANYGMGGQYEPHFDFSRSHEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAI 477
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 510
>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
mulatta]
Length = 535
Score = 280 bits (716), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 142/272 (52%), Positives = 188/272 (69%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L + E L ++ P+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E + FK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRNDERHTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 555
Score = 280 bits (716), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 146/305 (47%), Positives = 197/305 (64%), Gaps = 32/305 (10%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE-------------------VTE 44
P HQRA GN+ Y++ L K E +DE + E + E
Sbjct: 243 PEHQRANGNQKYFEFQLEKQ-EKQDETAEKETQQQDREKRDTTQKKKKKQSQKSLSLIPE 301
Query: 45 REKYEMLCRGD-LTVPPAIVAQLKCRYV-HRNVPYLRLMPLKEEEAYLQPRIILYRDVMY 102
R+KYEMLCRG+ + + ++L CRY +++ P L P+K+++ + +P I+ Y D++
Sbjct: 302 RKKYEMLCRGEGVRMTSRRQSRLFCRYYDNKHNPRFVLAPVKQQDEWDRPYIVRYIDIIS 361
Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
++E+D IK++A+PRLRRAT+ N TG LE A YRISKSAWL E PV+E+I++R+E +T
Sbjct: 362 EAEMDKIKQLAKPRLRRATISNPVTGVLETAPYRISKSAWLTAYEDPVVEKINQRIEDLT 421
Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
GL TAEELQV NYG+GG YEPH+DF R E +AFK LGTGNR+AT LFYMSDV+ GGA
Sbjct: 422 GLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGA 481
Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC--------- 273
TVF + S+ P+KGTA FW+NL +SG+GDY TRHAACPVL G+ + +
Sbjct: 482 TVFPDVGASVGPQKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEWR 541
Query: 274 -PCGL 277
PCGL
Sbjct: 542 RPCGL 546
>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
Length = 541
Score = 280 bits (716), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 145/301 (48%), Positives = 191/301 (63%), Gaps = 17/301 (5%)
Query: 2 IFPTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPT--LEVTEREKYEMLCRGD 55
I P H RA+ N +Y++ L K + + P V N P LE E + YE LCR +
Sbjct: 237 IDPNHPRAKNNIKWYEDLLAEEGLKPIDYRRNIPPVTNPRPKTGLETAEHDIYEALCRNE 296
Query: 56 LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
+ V + ++L C Y + P+LRL P K E P +L+RDV+ D EI +I+ +A P
Sbjct: 297 IPVSIKVTSKLYC-YYKMDRPFLRLAPFKVEILRFNPLAVLFRDVITDEEITMIQMLATP 355
Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
RLRRATVQN TGELE A+YR SKSAWL++ EH V+ RI++R++ MT L T+EELQV
Sbjct: 356 RLRRATVQNSITGELETASYRTSKSAWLKDEEHEVVHRINKRIDLMTNLEQETSEELQVG 415
Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
NYGIGGHY+PH+DFAR E NAF+SL TGNR+AT+LFYM+ GGATVFT + ++ P
Sbjct: 416 NYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVFTEVKTTVMPS 475
Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQRSG 285
K A FW+NL SG+GD TRHAACPVLTG+ + + PCGL R ++
Sbjct: 476 KNDALFWYNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQEFRRPCGLSRSVEEQF 535
Query: 286 I 286
+
Sbjct: 536 V 536
>gi|348557542|ref|XP_003464578.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Cavia porcellus]
Length = 535
Score = 280 bits (715), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 143/272 (52%), Positives = 186/272 (68%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQEALN--KSPELKDEPPKVNNVA------PTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + L ++ V P+ + ERE YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEERGKLLSNQTEAVLAAQEGIYERPSDYLPEREVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GIKLTPQRRKRLFCRYHHGNRAPELLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++ +TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEEDDPVVARVNRRMQQITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L +LW
Sbjct: 418 VANYGMGGQYEPHFDFSRSHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAALW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
Length = 535
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 142/272 (52%), Positives = 184/272 (67%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P H+RA GN Y++ + L+ E P P + ER+ YE LCRG+
Sbjct: 238 PGHERAGGNLRYFERLLEEEREKMLSNHTEAGPSTPGGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
Length = 534
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 141/289 (48%), Positives = 196/289 (67%), Gaps = 21/289 (7%)
Query: 4 PTHQRAQGNKLYYQEAL-NKSPELKDEPP----KVNNVAPTLEVTEREKYEMLCRGDLTV 58
P H RA+GN +Y++ L + + ++ + PP ++++ P ER+ YE LCRG+ V
Sbjct: 238 PDHPRAKGNVKWYEDMLEDDNKDISELPPLKLERLDDGIP-----ERDVYEALCRGEQKV 292
Query: 59 PPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
+++ C Y+ + P+L+L P+K E P ++L++ V+ D EI++I+K+A P+L+
Sbjct: 293 NVTAQSEVYC-YLKMDRPFLKLAPIKVEILRFSPLVVLFKQVISDYEIEVIEKLAIPKLK 351
Query: 119 RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYG 178
RATVQN +TG+LE ANYRISKSAWL+ +HP I+RI++R++ MT L TAEELQ NYG
Sbjct: 352 RATVQNARTGDLEYANYRISKSAWLKGTDHPAIDRINKRIDLMTNLNQETAEELQAQNYG 411
Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
IGGHY+PH+DFAR + NAFK+L TGNR+AT+L YMSDV GGATVF L +++P K
Sbjct: 412 IGGHYDPHFDFARKEDINAFKTLNTGNRIATILIYMSDVESGGATVFNHLGNAVFPSKYD 471
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHSTC-----PCGL 277
A FW+NL G+GD TRHAACPVLTG SN +H PCGL
Sbjct: 472 ALFWYNLRRDGEGDLRTRHAACPVLTGIKWVSNKWIHDRGQEFRRPCGL 520
>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
garnettii]
Length = 540
Score = 278 bits (711), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 141/272 (51%), Positives = 184/272 (67%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVA--------PTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + E +A P + ERE YE LCRG+
Sbjct: 243 PSHERAGGNLRYFEHLLEEEREKMLSNKTEAELATQEGIYERPVDYLPEREVYESLCRGE 302
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 303 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 362
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGL+ TAE LQ
Sbjct: 363 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQ 422
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 423 VANYGVGGQYEPHFDFSRNHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 482
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 483 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 514
>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 3 [Oryctolagus
cuniculus]
Length = 535
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 144/273 (52%), Positives = 186/273 (68%), Gaps = 12/273 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE---------VTEREKYEMLCRG 54
P+H+RA GN Y++ L + K + VA T E + ER+ YE LCRG
Sbjct: 238 PSHERAGGNLRYFERLLEEQ-RGKSLLNQTEAVAVTQEGIYERPVDYLPERDVYESLCRG 296
Query: 55 D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L + P KEE+ + P I+ Y DVM D EI+ IK++
Sbjct: 297 EGVKLTPRRQKRLFCRYHDGNGAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEI 356
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ RI+RR++H+TGLT TAE L
Sbjct: 357 AKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARINRRMQHITGLTVKTAELL 416
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++
Sbjct: 417 QVANYGMGGQYEPHFDFSRNNERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAI 476
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 477 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
Length = 538
Score = 277 bits (709), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 150/298 (50%), Positives = 196/298 (65%), Gaps = 22/298 (7%)
Query: 2 IFPTHQRAQGNKLYYQEALNKS-PELK--------DEPPKVNNVA-PTLEVTEREKYEML 51
I +HQRA GN Y+++ L+K EL +EP ++ P + ERE YE L
Sbjct: 237 IDSSHQRAGGNLRYFEKLLSKQLKELNQEVQEPATEEPIQLGTYKRPKDYLPEREIYEGL 296
Query: 52 CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
CRG+ + + ++L CRY N P L L P+KEE+ + P I+ Y + + DSEI+ I
Sbjct: 297 CRGEGVKMTSERRSRLYCRYHDGNRNPRLLLQPMKEEDEWDSPHIVRYLNALSDSEIEKI 356
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K++A+PRL RATV++ KTG L ANYR+SKSAWL E PVIER+++R+E +TGLTT TA
Sbjct: 357 KELAKPRLARATVRDPKTGVLTTANYRVSKSAWLEGEEDPVIERVNQRIEDITGLTTQTA 416
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E LQ+ NYG+GG YEPH+DF+R E +AFK+LGTGNRVAT L YMSDV GGATVF
Sbjct: 417 ELLQIANYGVGGQYEPHFDFSRKDEPDAFKTLGTGNRVATFLNYMSDVEAGGATVFPDFG 476
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
+++P+KGTA FW+NL SG+GDY TRHAACPVL G + + PCGL
Sbjct: 477 AAIYPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWIHERGQEFRRPCGL 534
>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Cricetulus griseus]
Length = 535
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 141/272 (51%), Positives = 185/272 (68%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ ++L E + P + ER+ E LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKSLFNQTEAGLATQENVYERPVDFLPERDVLESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPQRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
rubripes]
Length = 538
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 147/298 (49%), Positives = 194/298 (65%), Gaps = 21/298 (7%)
Query: 1 MIFPTHQRAQGNKLYYQEALNKS-PELK-------DEPPKVNNVA-PTLEVTEREKYEML 51
+I +H+RA GN YY+ L K EL +EP ++ + P + ERE YE L
Sbjct: 237 VIDSSHERAGGNLRYYENLLRKQLSELNQDYEPASEEPIQLGTYSRPKDHLPEREAYEAL 296
Query: 52 CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
CRG+ L + A ++L CRY N P+L L P+KEE+ + P I+ Y D + + EI+ I
Sbjct: 297 CRGEGLQMNEARRSRLFCRYQDGNRNPHLLLKPIKEEDEWDSPNIVRYLDFLSNEEIEKI 356
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K++A+P+L RATV++ K+G L A+YR+SKSAWL E P+I R+++R+E +TGLT TA
Sbjct: 357 KELAKPKLARATVRDPKSGVLTTASYRVSKSAWLEGEEDPIIARVNQRIEDLTGLTVKTA 416
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E LQV NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF
Sbjct: 417 ELLQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDFG 476
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
++WP KGTA FW+NL SG+GDY TRHAACPVL G+ + + PCGL
Sbjct: 477 AAIWPRKGTAVFWYNLFKSGEGDYRTRHAACPVLVGNKWVSNKWIHERGQEFRRPCGL 534
>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
Length = 542
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 143/297 (48%), Positives = 188/297 (63%), Gaps = 17/297 (5%)
Query: 2 IFPTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPT--LEVTEREKYEMLCRGD 55
I P H RA+ N +Y++ L K + + P V N P L+ TE + YE LCR +
Sbjct: 238 IDPNHPRARNNIKWYEDLLAEDGVKPIDYRRNIPPVTNPRPKNGLKTTEHDMYEALCRNE 297
Query: 56 LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
+ V ++L C Y + P+LRL P K E P + +RDV+ D E+ +I+ +A P
Sbjct: 298 VPVSVKATSKLYC-YYKMDRPFLRLAPFKVEILRFSPLAVFFRDVITDEEVTIIQMLATP 356
Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
RLRRATVQN TGELE A+YR SKSAWL++ EH ++ RI+RR++ MT L T+EELQV
Sbjct: 357 RLRRATVQNSITGELETASYRTSKSAWLKDEEHEIVHRINRRIDLMTNLEQETSEELQVG 416
Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
NYGIGGHY+PH+DFAR E NAF+SL TGNR+AT+LFYM+ GGATVFT + ++ P
Sbjct: 417 NYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVFTEVKTTVMPS 476
Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
K A FW+NL SG+GD TRHAACPVL GS + + PCGL R ++
Sbjct: 477 KNDALFWYNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQEFRRPCGLSRSVE 533
>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
Length = 541
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 143/297 (48%), Positives = 188/297 (63%), Gaps = 17/297 (5%)
Query: 2 IFPTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPT--LEVTEREKYEMLCRGD 55
I P H RA+ N +Y++ L K + + P V N P L+ TE + YE LCR +
Sbjct: 237 IDPNHPRARNNIKWYEDLLAEDGVKPIDYRRNIPPVTNPRPKNGLKTTEHDMYEALCRNE 296
Query: 56 LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
+ V ++L C Y + P+LRL P K E P + +RDV+ D E+ +I+ +A P
Sbjct: 297 VPVSVKATSKLYC-YYKMDRPFLRLAPFKVEILRFSPLAVFFRDVITDEEVTIIQMLATP 355
Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
RLRRATVQN TGELE A+YR SKSAWL++ EH ++ RI+RR++ MT L T+EELQV
Sbjct: 356 RLRRATVQNSITGELETASYRTSKSAWLKDEEHEIVHRINRRIDLMTNLEQETSEELQVG 415
Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
NYGIGGHY+PH+DFAR E NAF+SL TGNR+AT+LFYM+ GGATVFT + ++ P
Sbjct: 416 NYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVFTEVKTTVMPS 475
Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
K A FW+NL SG+GD TRHAACPVL GS + + PCGL R ++
Sbjct: 476 KNDALFWYNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQEFRRPCGLSRSVE 532
>gi|226874889|ref|NP_001152881.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Bos
taurus]
gi|296485624|tpg|DAA27739.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Bos taurus]
Length = 535
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 141/272 (51%), Positives = 186/272 (68%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ + L+ E + + P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLHYFERLLEEEREKMLSNHTEAELASQQGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|426229219|ref|XP_004008688.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Ovis aries]
Length = 535
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 141/272 (51%), Positives = 185/272 (68%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ + L E + + P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLHYFERLLEEEREKMLTNHTEAELAAQQGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
Length = 595
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 152/330 (46%), Positives = 199/330 (60%), Gaps = 58/330 (17%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKD-------------EPPKVNNVA----PTLEVT--- 43
P HQRA+GN Y++ L K + KD EP P + T
Sbjct: 264 PEHQRAKGNLKYFEFQLEK--QRKDAEEEPPKETEKRVEPDTTEKKKRKKKPQSKATFQL 321
Query: 44 --EREKYEMLCRGD-LTVPPAIVAQLKCRYVH-RNVPYLRLMPLKEEEAYLQPRIILYRD 99
ER+KYEMLCRG+ + + P ++L CRY + P L P+K+++ + +P I+ Y D
Sbjct: 322 IPERKKYEMLCRGEGIRLTPRRQSRLFCRYYDSKRHPRYILSPVKQQDEWDRPYIVRYLD 381
Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK-------------------- 139
++ D EI+L+K++A+PRLRRAT+ N TG LE A+YRISK
Sbjct: 382 IISDKEIELVKQLAKPRLRRATISNPITGVLETASYRISKRRATVHDPQTGKLTTAQYRV 441
Query: 140 --SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANA 197
SAWL EHPVIE I++R+E +TGL TAEELQV NYG+GG YEPH+DF R E +A
Sbjct: 442 SKSAWLTGYEHPVIETINQRIEDLTGLEVDTAEELQVANYGVGGQYEPHFDFGRKDEPDA 501
Query: 198 FKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRH 257
FK LGTGNR+AT LFYMSDVA GGATVF + ++WP+KG+A FW+NL +SG+GDY TRH
Sbjct: 502 FKELGTGNRIATWLFYMSDVAAGGATVFPDVGAAVWPQKGSAVFWYNLFTSGEGDYSTRH 561
Query: 258 AACPVLTGSNSLHSTC----------PCGL 277
AACPVL G+ + + PCGL
Sbjct: 562 AACPVLVGNKWVSNKWIHERGQEWRRPCGL 591
>gi|281348666|gb|EFB24250.1| hypothetical protein PANDA_000722 [Ailuropoda melanoleuca]
Length = 505
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 140/272 (51%), Positives = 184/272 (67%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ + L+ E + P + ER+ YE LCRG+
Sbjct: 227 PSHERAGGNLRYFERLLEEEREKMLSNQTEAGLATQEGIYERPVDYLPERDIYESLCRGE 286
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 287 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 346
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT TAE LQ
Sbjct: 347 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 406
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 407 VANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 466
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 467 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 498
>gi|301754231|ref|XP_002912939.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Ailuropoda
melanoleuca]
Length = 535
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 140/272 (51%), Positives = 184/272 (67%), Gaps = 10/272 (3%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ + L+ E + P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKMLSNQTEAGLATQEGIYERPVDYLPERDIYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Sarcophilus harrisii]
Length = 536
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 142/293 (48%), Positives = 190/293 (64%), Gaps = 21/293 (7%)
Query: 5 THQRAQGNKLYYQEAL---------NKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
+H+RA GN Y+++ L NK+ E + P + ER+ YE LCRG+
Sbjct: 239 SHERAGGNLRYFEKLLEEERLGKRLNKTSETQPATQGGIYERPPDYLPERDVYEALCRGE 298
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY N P L + P KEE+ + P I+ Y DV+ D EI+ IK++A
Sbjct: 299 GIKLTPRRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVLSDEEIERIKELA 358
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +ANYR+SKS+WL E + PVI +++RR+ ++TGL+ TAE LQ
Sbjct: 359 KPKLARATVRDPKTGVLTVANYRVSKSSWLEEGDDPVIAQLNRRMHYITGLSVKTAELLQ 418
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R GE +AFK LGTGNRVAT L YMSDV GGATVF ++W
Sbjct: 419 VANYGMGGQYEPHFDFSRKGEQDAFKHLGTGNRVATFLNYMSDVEAGGATVFPDFGATIW 478
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCG 276
P+KGT+ FW+NL SG+GDY TRHAACPVL GS + + PCG
Sbjct: 479 PKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFHERGQEFLRPCG 531
>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
carolinensis]
Length = 554
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 140/275 (50%), Positives = 188/275 (68%), Gaps = 14/275 (5%)
Query: 4 PTHQRAQGNKLYYQE---------ALNKSPELKDEPPKVNNV--APTLEVTEREKYEMLC 52
P+H+RA N Y+++ AL+ +P EP N + P + ERE YE LC
Sbjct: 255 PSHERAGSNMQYFEKLLENEQNEKALDDAPNAT-EPSTYNGIYERPPDYLPEREIYEALC 313
Query: 53 RGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
RG+ + + P +L CRY + N P+L + P KEE+ + P I+ Y +V+ D EI+ IK
Sbjct: 314 RGEGVKMTPRRQKRLFCRYHNGNQNPHLLIAPFKEEDEWDSPHIVRYYNVLSDEEIEKIK 373
Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
++A+P+L RATV++ KTG L +ANYR+SKS+WL E + V+ ++++R+EH+TGLT TAE
Sbjct: 374 ELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEEDDLVVAKVNQRMEHITGLTVKTAE 433
Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
LQV NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF
Sbjct: 434 LLQVANYGMGGQYEPHFDFSRKEEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDFGA 493
Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 494 AIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 528
>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
Length = 538
Score = 273 bits (698), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 140/276 (50%), Positives = 184/276 (66%), Gaps = 19/276 (6%)
Query: 5 THQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT-------------EREKYEML 51
TH+RA N Y+++ L K + E P VA T V ER+ YE L
Sbjct: 242 THERAGSNLRYFEKLLEK----EREKPSNKTVATTEPVVQSGAYERPLDYLPERDIYEAL 297
Query: 52 CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
CRG+ + + P +L CRY N P+L + P KEE+ + P I+ Y DVM D EI+ I
Sbjct: 298 CRGEGVKMTPQRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSDEEIEKI 357
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ ++++R++ +TGLT TA
Sbjct: 358 KQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTA 417
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E LQV NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF
Sbjct: 418 ELLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDFG 477
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 AAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 513
>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 550
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 143/296 (48%), Positives = 190/296 (64%), Gaps = 19/296 (6%)
Query: 4 PTHQRAQGNKLYYQ-----EALNKSPELKDEPPKVNNVAPT--LEVTEREKYEMLCRGDL 56
P H A+GN +Y+ E + S K+ PP + N P LE +ER YE LCR ++
Sbjct: 236 PYHPHARGNVKWYEDLLVEEGVKPSDHRKNIPP-LENRRPDDGLEDSERTIYEALCRNEV 294
Query: 57 TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
V ++QL C Y + P+LRL P K E P +L+ D++ D E +I+++A PR
Sbjct: 295 PVSIKAISQLYC-YYKMDRPFLRLAPFKVEILRFNPLAVLFVDIISDEEAKMIQQIATPR 353
Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
L+RATVQN KTGELE A YRISKSAWL+ +H +I+RI+RR+E MT L T+EELQ+ N
Sbjct: 354 LKRATVQNSKTGELETAAYRISKSAWLKGGDHELIDRINRRIELMTNLIQETSEELQIAN 413
Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
YG+GGHY+PH+DFAR E AF+SLGTGNR+ATVLFY+++ GG TVFT L ++ P K
Sbjct: 414 YGVGGHYDPHFDFARKEEPKAFESLGTGNRLATVLFYLTEPEIGGGTVFTELRTAVMPSK 473
Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
A FW+NL+ SG+GD TRHAACPVL G + + PCGL+ +Q
Sbjct: 474 NGALFWYNLYRSGEGDLRTRHAACPVLVGIKWVANKWIHERGQEFLRPCGLKPSVQ 529
>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
Length = 511
Score = 272 bits (695), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 135/275 (49%), Positives = 189/275 (68%), Gaps = 11/275 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL--KDEPPKVNNVAPTLEVTEREKYEMLCRGD-LTVPP 60
PTHQRA GN+ Y++ L K ++ +++ + P +E++KYE LCRG+ L + P
Sbjct: 215 PTHQRATGNRRYFEYQLAKQTKVGKREKGRRQEEHQPDDYQSEKKKYEQLCRGEGLRMTP 274
Query: 61 AIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
+ L CRY P + P+K+E+ + PRI+ Y DV+ + E++ +K++A+PRLRR
Sbjct: 275 QRQSGLFCRYYDNGRHPKYVIGPVKQEDEWDHPRIVRYHDVLSNREMEKVKELARPRLRR 334
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
ATV + +TG+L A YR+SKSAWL EHP++++I++R+E +TGL STAE+LQV NYG+
Sbjct: 335 ATVHDPRTGQLTTAPYRVSKSAWLGAFEHPIVDQINQRIEDITGLDVSTAEDLQVANYGV 394
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY-------MSDVAQGGATVFTSLNLSL 232
GG YEPH+DF + E +AF+ LGTGNR+AT L Y MSDV GGATVFT + S+
Sbjct: 395 GGQYEPHFDFGQKDEPDAFEELGTGNRIATWLLYVSAAVLRMSDVQAGGATVFTDIGASV 454
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
P+KG+A FW+NL SGDGDY TRHAACPVL G+
Sbjct: 455 LPQKGSAVFWYNLRPSGDGDYRTRHAACPVLLGNK 489
>gi|345326417|ref|XP_001510155.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Ornithorhynchus anatinus]
Length = 888
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 137/274 (50%), Positives = 186/274 (67%), Gaps = 15/274 (5%)
Query: 5 THQRAQGNKLYYQEALNKSPELKDEPPKVNNVA-----------PTLEVTEREKYEMLCR 53
+H+RA GN Y+++ L + E ++P + + P + ER+ YE LCR
Sbjct: 591 SHERAGGNLRYFEKLLEE--ERMEKPLNRTSASKPATHGGIYERPPDYLPERDVYEGLCR 648
Query: 54 GD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L + P KEE+ + P I+ Y DV+ D EI+ IK+
Sbjct: 649 GEGVKLTPRRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVLSDEEIEKIKE 708
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+P+L RATV++ KTG L +ANYR+SKS+WL E + PV+ +++RR++++TGLT TAE
Sbjct: 709 LAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEEDDPVVAQVNRRMQYITGLTVKTAEL 768
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF +
Sbjct: 769 LQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDFGAA 828
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 829 IWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 862
>gi|170649696|gb|ACB21278.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Callicebus moloch]
Length = 555
Score = 270 bits (691), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 143/292 (48%), Positives = 189/292 (64%), Gaps = 30/292 (10%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L + E L ++ P+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L Y
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERDAFKHLGTGNRVATFLNY 477
Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
MSDV GGATVF L ++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529
>gi|308451420|ref|XP_003088665.1| CRE-PHY-2 protein [Caenorhabditis remanei]
gi|308246199|gb|EFO90151.1| CRE-PHY-2 protein [Caenorhabditis remanei]
Length = 609
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 154/352 (43%), Positives = 199/352 (56%), Gaps = 70/352 (19%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
I P H RA+GN +Y++ L + D PP VN + ER+ YE LCRG+ +PP
Sbjct: 251 IAPNHPRAKGNVKWYEDMLQGKDMVGDLPPIVNKRVEFDGIVERDAYEALCRGE--IPPV 308
Query: 62 ---IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
+L+C Y+ R+ P+L++ P+K E P +L+++V+ DSEI +IK++A P+L+
Sbjct: 309 EKKWKNKLRC-YLKRDKPFLKIAPIKVEILRFDPLAVLFKNVISDSEIKVIKELASPKLK 367
Query: 119 RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYG 178
RATVQN KTGELE A YRISKSAWL+ HPVIER++RR+E TGL T+EELQV NYG
Sbjct: 368 RATVQNSKTGELEHATYRISKSAWLKGDLHPVIERVNRRIEDFTGLYQGTSEELQVANYG 427
Query: 179 IGGH------------------YEPHYDFARPG--------------------------- 193
+GGH YEPHYD + G
Sbjct: 428 LGGHYDPHFDFARIANYGLGGHYEPHYDMSLVGYHPIQLTVSLEYFQRGVPEPYGKNGNR 487
Query: 194 ---------EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
E NAFK+L TGNR+ATVLFYMS +GGATVF L +++P K A FW+N
Sbjct: 488 IATVLFYKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDALFWYN 547
Query: 245 LHSSGDGDYYTRHAACPVLTG----SNS-LHS-----TCPCGLRRGLQRSGI 286
L G+GD TRHAACPVL G SN +H T PCGL G+Q + I
Sbjct: 548 LRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQEFTRPCGLEEGVQENFI 599
>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
Length = 528
Score = 268 bits (685), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 130/272 (47%), Positives = 180/272 (66%), Gaps = 7/272 (2%)
Query: 2 IFPTHQRAQGN-----KLYYQEALNKSPELKDEPPKVNNVAPT--LEVTEREKYEMLCRG 54
I P HQ+A N KL Q+ +N K+N T L + YE LCRG
Sbjct: 232 IVPYHQQALDNIKHYQKLLLQQGVNTEERFNTTKLKLNKSTGTFGLRRDHWDNYEKLCRG 291
Query: 55 DLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQ 114
+ + P + +L+CRYV NVP+ + P+K EEA L+P +++Y V++D+EID++KK+AQ
Sbjct: 292 EKLLDPKVEGRLRCRYVTNNVPFFFIQPVKMEEALLKPLLVIYHGVIFDAEIDVVKKLAQ 351
Query: 115 PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQV 174
PR +R V + TG YRI+K+A+L++ EH +I ++SRRV +TGL + +E+LQV
Sbjct: 352 PRFKRTGVTDRDTGRSMPVQYRIAKAAFLKDSEHNLIVKMSRRVGDITGLDMAASEDLQV 411
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
NYGIGGHY PH+D+AR GE + + L GNR+AT LFYMSDV GGATVF ++ +LWP
Sbjct: 412 CNYGIGGHYVPHFDYARQGEIHGPRDLDWGNRIATWLFYMSDVEAGGATVFPAVGAALWP 471
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+KG+AAFW+NL +G+GD T HA CPVLTGS
Sbjct: 472 QKGSAAFWYNLRPNGNGDEDTLHAGCPVLTGS 503
>gi|281183175|ref|NP_001162504.1| prolyl 4-hydroxylase subunit alpha-2 [Papio anubis]
gi|159461520|gb|ABW96795.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase, alpha
polypeptide II, isoform 1 (predicted) [Papio anubis]
Length = 578
Score = 268 bits (684), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 141/292 (48%), Positives = 186/292 (63%), Gaps = 30/292 (10%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L+ E + P+ P + ER+ YE LCRG+
Sbjct: 261 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 320
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 321 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 380
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 381 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 440
Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
V NYG+GG YEPH+DF+R E + FK LGTGNRVAT L Y
Sbjct: 441 VANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERHTFKHLGTGNRVATFLNY 500
Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
MSDV GGATVF L ++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 501 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 552
>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
Length = 538
Score = 268 bits (684), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 137/275 (49%), Positives = 184/275 (66%), Gaps = 14/275 (5%)
Query: 5 THQRAQGNKLYYQEALN---------KSPELKDEPPKVNNVA---PTLEVTEREKYEMLC 52
TH+RA N Y+++ L + + P V + A P + ER+ YE LC
Sbjct: 238 THERAGSNLRYFEKLLEKEREKEQEKSNKTMTTTEPVVQSGAYERPLDYLPERDIYEALC 297
Query: 53 RGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
RG+ + + P +L CRY N P+L + P KEE+ + P I+ Y DVM D EI+ IK
Sbjct: 298 RGEGVKMTPRRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSDEEIEKIK 357
Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ ++++R++ +TGLT TAE
Sbjct: 358 QLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTAE 417
Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
LQV NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L YMSDV GGATVF
Sbjct: 418 LLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDFGA 477
Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 AIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 512
>gi|167045848|gb|ABZ10515.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Callithrix jacchus]
Length = 555
Score = 268 bits (684), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 142/292 (48%), Positives = 188/292 (64%), Gaps = 30/292 (10%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L + E L ++ P+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRASQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L Y
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERDAFKHLGTGNRVATFLNY 477
Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
MSDV GGATVF L ++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529
>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
Length = 244
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 126/219 (57%), Positives = 165/219 (75%), Gaps = 2/219 (0%)
Query: 50 MLCRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
MLCRG+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+
Sbjct: 1 MLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIE 60
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
++K +A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL S
Sbjct: 61 IVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVS 120
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
TAEELQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF
Sbjct: 121 TAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPE 180
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ S+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 181 VGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 219
>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 526
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 165/225 (73%), Gaps = 2/225 (0%)
Query: 44 EREKYEMLCRGD-LTVPPAIVAQLKCRYVH-RNVPYLRLMPLKEEEAYLQPRIILYRDVM 101
E+EKYE LCRG+ + + +L CRY + P L L P K+E+ + +PRI+ Y D++
Sbjct: 277 EKEKYEKLCRGEGVKMTSRRQKRLFCRYFDGKKDPLLILSPTKQEDEWDKPRIVRYHDII 336
Query: 102 YDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHM 161
D EI +K++A+PRLRRAT+ N TG LE A YRI+KSAWL E PV+ R++RR+E +
Sbjct: 337 SDEEISKVKELAKPRLRRATISNPITGVLETAQYRITKSAWLSGYEDPVVARLNRRIEGV 396
Query: 162 TGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
TGL STAEELQV NYGIGG YEPH+DF R E +AFK LGTGNRVAT LFYMSDV GG
Sbjct: 397 TGLDMSTAEELQVANYGIGGQYEPHFDFLRKYEPDAFKKLGTGNRVATWLFYMSDVEAGG 456
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
ATVF + +++P+KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 457 ATVFPEVGAAVYPKKGTAVFWYNLLESGEGDYSTRHAACPVLVGN 501
>gi|431892682|gb|ELK03115.1| Prolyl 4-hydroxylase subunit alpha-2 [Pteropus alecto]
Length = 629
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 141/292 (48%), Positives = 187/292 (64%), Gaps = 30/292 (10%)
Query: 4 PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + S + + E + + P + ER+ YE LCRG+
Sbjct: 244 PSHERAGGNLRYFERLLEEERDKMVSNQTEAELATQDGIYERPVDYLPERDVYESLCRGE 303
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 304 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEINRIKEIA 363
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 364 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 423
Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
V NYG+GG YEPH+DF+R E + FK LGTGNRVAT L Y
Sbjct: 424 VANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDEQDVFKHLGTGNRVATFLNY 483
Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
MSDV GGATVF L ++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 484 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 535
>gi|390459659|ref|XP_002806656.2| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-2 [Callithrix jacchus]
Length = 579
Score = 267 bits (682), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 141/292 (48%), Positives = 185/292 (63%), Gaps = 30/292 (10%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L+ E + P+ P + ER+ YE LCRG+
Sbjct: 262 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 321
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 322 GVKLTPRRQKRLFCRYHHGNRASQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 381
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 382 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 441
Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L Y
Sbjct: 442 VANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERDAFKHLGTGNRVATFLNY 501
Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
MSDV GGATVF L ++WP+KGTA FW+NL SG GDY TRHAACPVL G
Sbjct: 502 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGXGDYRTRHAACPVLVG 553
>gi|197215651|gb|ACH53042.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Otolemur garnettii]
Length = 555
Score = 266 bits (680), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 141/292 (48%), Positives = 184/292 (63%), Gaps = 30/292 (10%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVA--------PTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + E +A P + ERE YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEHLLEEEREKMLSNKTEAELATQEGIYERPVDYLPEREVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGL+ TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
V NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L Y
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVATFLNYNHERDAFKRLGTGNRVATFLNY 477
Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
MSDV GGATVF L ++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529
>gi|308497208|ref|XP_003110791.1| CRE-DPY-18 protein [Caenorhabditis remanei]
gi|308242671|gb|EFO86623.1| CRE-DPY-18 protein [Caenorhabditis remanei]
Length = 559
Score = 266 bits (680), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 137/291 (47%), Positives = 189/291 (64%), Gaps = 17/291 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKS----PELKDEPPKVNNVAP--TLEVTEREKYEMLCRGDLT 57
P+H RA+GN +Y++ L + E++ P++ N P L TER YE LCR ++
Sbjct: 235 PSHPRAKGNVKWYEDLLEQEGVRRSEMRKNLPEIQNRRPDSVLGNTERTMYEALCRNEVP 294
Query: 58 VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
V +++L C Y R+ P+L P+K E P +L++DV+ D E+ I+++A+P+L
Sbjct: 295 VSQKDISRLYC-YYKRDRPFLVYAPIKVEIKRFNPLAVLFKDVISDDEVATIQELAKPKL 353
Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
RATV + TG+L A YRISKSAWL+E EH V+ER+++R+E MT L TAEELQ+ NY
Sbjct: 354 ARATVHDSATGKLVTATYRISKSAWLKEWEHEVVERVNKRIELMTNLEMETAEELQIANY 413
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGGHY+PH+D A+ E+ +F+SLGTGNR+ATVLFYMS + GG TVFT + ++ P K
Sbjct: 414 GIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEVKSTVLPTKN 473
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHSTC-----PCGLR 278
A FW+NL GDG+ TRHAACPVL G SN +H PCGL+
Sbjct: 474 DALFWYNLFKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFRRPCGLK 524
>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
Length = 515
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 128/265 (48%), Positives = 176/265 (66%), Gaps = 12/265 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
I P H RA NK YY++ EL++ K +E E+E Y+ LCR ++++P A
Sbjct: 243 ILPYHSRALRNKFYYEQ------ELQNPVDKTKKDQDHVEDVEKEVYKKLCRAEISLPEA 296
Query: 62 IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
++LKC Y + N P+LR+ P K E+A+L P I+++ +V+ D EI+ +K++AQ RL A
Sbjct: 297 KSSKLKCFYQNSNHPFLRIAPFKVEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAV 356
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
+N + +LE+ +RISK AWL + EH + +++RV HMTGLT STAEE QVVNYGIGG
Sbjct: 357 FENPHSKQLELFPFRISKVAWLEDQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGG 416
Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
HYEPH+DF + G+R+ TVLFY+SDV QGGATVF + +S+WP+KG+A
Sbjct: 417 HYEPHFDFQSTVDP------AIGSRIETVLFYLSDVEQGGATVFPEIQVSVWPQKGSAVV 470
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
W NLH SGDGD T+HA CPVL GS
Sbjct: 471 WFNLHPSGDGDQRTKHAGCPVLIGS 495
>gi|432109537|gb|ELK33711.1| Prolyl 4-hydroxylase subunit alpha-2 [Myotis davidii]
Length = 555
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 140/292 (47%), Positives = 182/292 (62%), Gaps = 30/292 (10%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L + E P P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEERGKMASNQTEAGQAPQDSIYERPADYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY N P L + P KEE+ + P I+ Y DVM D EI IK++A
Sbjct: 298 GVKLTPKRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIQRIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
V NYG+GG YEPH+DF+R E + FK LGTGNRVAT L Y
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDEQDVFKHLGTGNRVATFLNY 477
Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
MSDV GGATVF L ++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529
>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
Length = 509
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 128/265 (48%), Positives = 176/265 (66%), Gaps = 12/265 (4%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
I P H RA NK YY++ EL++ K +E E+E Y+ LCR ++++P A
Sbjct: 237 ILPYHSRALRNKFYYEQ------ELQNPVDKTKKDQDHVEDVEKEVYKKLCRAEISLPEA 290
Query: 62 IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
++LKC Y + N P+LR+ P K E+A+L P I+++ +V+ D EI+ +K++AQ RL A
Sbjct: 291 KSSKLKCFYQNSNHPFLRIAPFKVEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAV 350
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
+N + +LE+ +RISK AWL + EH + +++RV HMTGLT STAEE QVVNYGIGG
Sbjct: 351 FENPHSKQLELFPFRISKVAWLEDQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGG 410
Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
HYEPH+DF + G+R+ TVLFY+SDV QGGATVF + +S+WP+KG+A
Sbjct: 411 HYEPHFDFQSTVDP------AIGSRIETVLFYLSDVEQGGATVFPEIQVSVWPQKGSAVV 464
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
W NLH SGDGD T+HA CPVL GS
Sbjct: 465 WFNLHPSGDGDQRTKHAGCPVLIGS 489
>gi|291387302|ref|XP_002710242.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 2 [Oryctolagus
cuniculus]
gi|217273039|gb|ACK28132.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Oryctolagus cuniculus]
Length = 555
Score = 265 bits (678), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 144/293 (49%), Positives = 186/293 (63%), Gaps = 32/293 (10%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE---------VTEREKYEMLCRG 54
P+H+RA GN Y++ L + K + VA T E + ER+ YE LCRG
Sbjct: 238 PSHERAGGNLRYFERLLEEQ-RGKSLLNQTEAVAVTQEGIYERPVDYLPERDVYESLCRG 296
Query: 55 D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L + P KEE+ + P I+ Y DVM D EI+ IK++
Sbjct: 297 EGVKLTPRRQKRLFCRYHDGNGAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEI 356
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ RI+RR++H+TGLT TAE L
Sbjct: 357 AKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARINRRMQHITGLTVKTAELL 416
Query: 173 QVVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLF 212
QV NYG+GG YEPH+DF+R E +AFK LGTGNRVAT L
Sbjct: 417 QVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNNERDAFKRLGTGNRVATFLN 476
Query: 213 YMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
YMSDV GGATVF L ++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 477 YMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529
>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
abelii]
gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 533
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 135/272 (49%), Positives = 182/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L+ E + P+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|184185444|gb|ACC68850.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Rhinolophus ferrumequinum]
Length = 555
Score = 263 bits (671), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 141/292 (48%), Positives = 186/292 (63%), Gaps = 30/292 (10%)
Query: 4 PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + S + + EP + P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKIVSNQTEAEPASQEGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIEKIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E E PV+ R++ R++H+TGL+ TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEETEDPVVARLNLRMQHITGLSVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
V NYG+GG YEPH+DF+R E + FK LGTGNRVAT L Y
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDNGLKTEGNRLATFLNYNDEHDVFKHLGTGNRVATFLNY 477
Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
MSDV GGATVF L ++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529
>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
boliviensis boliviensis]
gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
boliviensis boliviensis]
Length = 533
Score = 263 bits (671), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 136/272 (50%), Positives = 184/272 (67%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L + E L ++ P+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|229368743|gb|ACQ63024.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Dasypus novemcinctus]
Length = 556
Score = 263 bits (671), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 142/295 (48%), Positives = 187/295 (63%), Gaps = 35/295 (11%)
Query: 4 PTHQRAQGNKLYYQEAL---------NKSPELKDEPPKVNNV--APTLEVTEREKYEMLC 52
P+H+RA GN Y++ L N++ E EP + P + ER+ YE LC
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKLLSNQTTEA--EPTTQEGIYERPADYLPERDVYESLC 295
Query: 53 RGD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
RG+ + + P +L CRY H N P L + P KEE+ + P I+ Y D+M D EI+ IK
Sbjct: 296 RGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDIMSDEEIERIK 355
Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ +++RR+EH+TGLT TAE
Sbjct: 356 EIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEENDDPVVAQVNRRMEHITGLTVKTAE 415
Query: 171 ELQVVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATV 210
LQV NYG+GG YEPH+DF+R E + FK LGTGNRVAT
Sbjct: 416 LLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNHEQDVFKHLGTGNRVATF 475
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
L YMSDV GGATVF L ++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 LNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 530
>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
troglodytes]
gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
troglodytes]
gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
troglodytes]
gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
paniscus]
gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
paniscus]
gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
paniscus]
gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 533
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 135/272 (49%), Positives = 182/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L+ E + P+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDIYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
Length = 533
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 136/272 (50%), Positives = 184/272 (67%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L + E L ++ P+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
gorilla]
Length = 565
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 135/272 (49%), Positives = 182/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L+ E + P+ P + ER+ YE LCRG+
Sbjct: 270 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 329
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 330 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 389
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 390 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 449
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 450 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 507
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 508 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 539
>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 575
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 135/272 (49%), Positives = 182/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L+ E + P+ P + ER+ YE LCRG+
Sbjct: 280 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 339
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 340 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 399
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 400 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 459
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 460 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 517
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 518 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 549
>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
Length = 533
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 135/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L E + P+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
leucogenys]
gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
leucogenys]
Length = 535
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 136/272 (50%), Positives = 184/272 (67%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L + E L ++ P+ P + ER+ YE LCRG+
Sbjct: 240 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 299
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 300 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 359
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 360 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 419
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 420 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
musculus]
Length = 593
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 137/272 (50%), Positives = 183/272 (67%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + S + N+ PT + ER+ YE LCRG+
Sbjct: 298 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLATQENLYERPTDYLPERDVYESLCRGE 357
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 358 GVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 417
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 418 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 477
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 478 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 535
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 536 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 567
>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_c
[Homo sapiens]
Length = 565
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 135/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L E + P+ P + ER+ YE LCRG+
Sbjct: 270 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 329
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 330 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 389
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 390 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 449
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 450 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 507
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 508 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 539
>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
leucogenys]
Length = 556
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 136/272 (50%), Positives = 184/272 (67%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L + E L ++ P+ P + ER+ YE LCRG+
Sbjct: 261 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 320
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 321 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 380
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 381 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 440
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 441 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 498
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 499 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 530
>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
Length = 504
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 135/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L E + P+ P + ER+ YE LCRG+
Sbjct: 209 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 268
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 269 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 328
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 329 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 388
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 389 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 446
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 447 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 478
>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 594
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 131/275 (47%), Positives = 185/275 (67%), Gaps = 12/275 (4%)
Query: 4 PTHQRAQGNKLYYQEAL----------NKSPELKDEPPKVNNVAPTLEVTEREKYEMLCR 53
PTHQRA GN Y++ L + ++ + + +TER+KYE LCR
Sbjct: 295 PTHQRANGNLKYFEYQLAKQKKVEKVEKVEEKEEETKVRQRRESKDDYLTERKKYEQLCR 354
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G + + P ++L CRY N P + P+K+E+ + P I+ Y +++ + +++ +K+
Sbjct: 355 GQGIKLTPRRQSRLFCRYYDNNRHPRYVIGPVKQEDEWDSPHIVRYHNIVSEKDMEKVKE 414
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG LE A+YRISKSAWL EHPV+++I++ +E +TGL TAE+
Sbjct: 415 LAKPRLRRATISNPVTGVLETAHYRISKSAWLGAYEHPVVDKINQLIEDVTGLNVKTAED 474
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DF R E +AF+ LGTGNR+AT L YM+DV GGATVFT + +
Sbjct: 475 LQVANYGLGGQYEPHFDFGRKDEPDAFEELGTGNRIATWLLYMTDVQAGGATVFTDIGAA 534
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ P+KGTA FW+NL+ SG+GDY TRHAACPVL G+
Sbjct: 535 VKPKKGTAVFWYNLYPSGEGDYRTRHAACPVLLGN 569
>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
musculus]
gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
Length = 535
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 137/272 (50%), Positives = 183/272 (67%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + S + N+ PT + ER+ YE LCRG+
Sbjct: 240 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLATQENLYERPTDYLPERDVYESLCRGE 299
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 300 GVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 359
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 360 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 419
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 420 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
Length = 533
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 135/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L E + P+ P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLPIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_d
[Homo sapiens]
Length = 488
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 135/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L E + P+ P + ER+ YE LCRG+
Sbjct: 193 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 252
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 253 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 312
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 313 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 372
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 373 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 430
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 431 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 462
>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
catus]
gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
catus]
Length = 533
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 135/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVA--------PTLEVTEREKYEMLCRGD 55
P+H+RA GN Y+++ L + E +A P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAGLATQESIYERPVDYLPERDIYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|268572523|ref|XP_002641343.1| C. briggsae CBR-DPY-18 protein [Caenorhabditis briggsae]
gi|94442971|emb|CAJ98658.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 559
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 135/291 (46%), Positives = 187/291 (64%), Gaps = 17/291 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKS----PELKDEPPKVNNVAP--TLEVTEREKYEMLCRGDLT 57
P H RA+GN +Y++ L + E++ P + N P L TER YE LCR ++
Sbjct: 235 PAHPRAKGNIKWYEDLLEQEGVRRSEMRKSIPPIQNRRPDSVLGNTERTMYEALCRNEVP 294
Query: 58 VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
V +++L C Y R+ P+L P+K E P +L++DV+ D E+ I+++A+P+L
Sbjct: 295 VSQKDISKLYC-YYKRDRPFLIYAPIKVEIKRFNPLAVLFKDVISDEEVATIQELAKPKL 353
Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
RATV + TG+L A YRISKSAWL+ EH V+ER+++R++ MT L TAEELQ+ NY
Sbjct: 354 ARATVHDSVTGKLVTATYRISKSAWLKAWEHEVVERVNKRIDLMTNLEMETAEELQIANY 413
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGGHY+PH+D A+ E+ +F+SLGTGNR+ATVLFYMS + GG TVFT + ++ P K
Sbjct: 414 GIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEVKSTVLPTKN 473
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHSTC-----PCGLR 278
A FW+NL+ GDG+ TRHAACPVL G SN +H PCGL+
Sbjct: 474 DALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFRRPCGLK 524
>gi|37496185|emb|CAE47803.1| Prolyl 4-hydroxylase alpha subunit [Sus scrofa]
Length = 263
Score = 261 bits (666), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 131/257 (50%), Positives = 176/257 (68%), Gaps = 11/257 (4%)
Query: 6 HQRAQGNKLYYQEALNKSPEL-KDEPPKVNNVAPTLE--------VTEREKYEMLCRGD- 55
HQRA GN Y++ + K E K P +N TL+ + E +KYEMLCRG+
Sbjct: 7 HQRANGNLKYFEYIMAKEKEANKSAPDDQSNQKTTLKKKGVAVDYLPEGQKYEMLCRGEG 66
Query: 56 LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQ 114
+ + P +L CRY N P L P K+E+ + +PRII + D++ D+EID++K +A+
Sbjct: 67 IKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIDIVKDLAK 126
Query: 115 PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQV 174
PRL RATV + +TG+L A YR+SKSAWL E+PV+ R++ R++ +TGL STAEELQV
Sbjct: 127 PRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRLNMRIQDLTGLDVSTAEELQV 186
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+WP
Sbjct: 187 ANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVWP 246
Query: 235 EKGTAAFWHNLHSSGDG 251
+KGTA FW+NL + G+G
Sbjct: 247 KKGTAVFWYNLFAGGEG 263
>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
Length = 522
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 134/269 (49%), Positives = 175/269 (65%), Gaps = 34/269 (12%)
Query: 44 EREKYEMLCRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVM 101
E+ KYE LCRG+ L + P L CRY + N P+ + P+K+E+ + +PRII Y +++
Sbjct: 251 EKRKYEKLCRGEGLKMTPRRQKHLFCRYFNGNRHPFYTIGPVKQEDEWDRPRIIRYHEII 310
Query: 102 YDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK---------------------- 139
+ EI+ IK++++PRLRRAT+ N TG LE A+YRISK
Sbjct: 311 TEQEIEKIKELSKPRLRRATISNPITGVLETAHYRISKRRATVHDPQTGKLTTAQYRVSK 370
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
SAWL EHPV++RI++R+E +TGL TAEELQV NYG+GG YEPH+DF R E +AFK
Sbjct: 371 SAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFK 430
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAA 259
LGTGNR+AT LFYMSDVA GGATVF + ++ P KGTA FW+NL SG+GDY TRHAA
Sbjct: 431 ELGTGNRIATWLFYMSDVAAGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAA 490
Query: 260 CPVLTGSNSLHSTC----------PCGLR 278
CPVL G+ + + PCGL+
Sbjct: 491 CPVLVGNKWVSNKWIHERGQEFRRPCGLK 519
>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Loxodonta africana]
Length = 534
Score = 260 bits (664), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 134/273 (49%), Positives = 182/273 (66%), Gaps = 13/273 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE-------LKDEPPKVNNV--APTLEVTEREKYEMLCRG 54
P+H+RA GN Y++ L + + + EP + P + ER+ YE LCRG
Sbjct: 238 PSHERAGGNLRYFEHLLEEERKKTLSNQTMDAEPATREGIYERPVDYLPERDVYESLCRG 297
Query: 55 D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++
Sbjct: 298 EGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKQI 357
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ +++RR++H+TGLT TAE L
Sbjct: 358 AKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAQVNRRMQHITGLTVKTAELL 417
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++
Sbjct: 418 QVANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAI 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 508
>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
garnettii]
Length = 538
Score = 260 bits (664), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 135/272 (49%), Positives = 179/272 (65%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVA--------PTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + E +A P + ERE YE LCRG+
Sbjct: 243 PSHERAGGNLRYFEHLLEEEREKMLSNKTEAELATQEGIYERPVDYLPEREVYESLCRGE 302
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 303 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 362
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGL+ TAE LQ
Sbjct: 363 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQ 422
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNRVAT L YMSDV GGATVF L ++W
Sbjct: 423 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRVATFLNYMSDVEAGGATVFPDLGAAIW 480
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 481 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 512
>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Cavia porcellus]
Length = 533
Score = 260 bits (664), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 136/272 (50%), Positives = 181/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQEALN--KSPELKDEPPKVNNVA------PTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + L ++ V P+ + ERE YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEERGKLLSNQTEAVLAAQEGIYERPSDYLPEREVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GIKLTPQRRKRLFCRYHHGNRAPELLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++ +TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEEDDPVVARVNRRMQQITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L +LW
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAALW 475
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
latipes]
Length = 532
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 144/300 (48%), Positives = 190/300 (63%), Gaps = 29/300 (9%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKD-----EPPKVNNVA------PTLEVTEREKYEM 50
I P HQRA GN Y+++ L K +LK+ +PP + P + ERE YE
Sbjct: 234 IDPNHQRAGGNLRYFEQLLMK--QLKESNQDYQPPSEEPIQLGTYTRPKDHLPERETYEA 291
Query: 51 LCRGD-LTVPPAIVAQLKCRYVH-RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
LCRG+ L + A ++L CRY + P L L P+KEE+ + P I+ Y +++ D EI+
Sbjct: 292 LCRGEGLQLTEARRSRLFCRYHDGKRSPRLLLKPIKEEDEWDNPHIVRYLNILSDQEIEK 351
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK++A+PRL RATV++ KTG L A YR+SKSAWL + PVI+R+++R++ +TGLT T
Sbjct: 352 IKELAKPRLARATVRDPKTGVLTTAPYRVSKSAWLEGEDDPVIDRVNQRIQDITGLTVET 411
Query: 169 AEELQVVNYGIGGHYEPHYDFA-RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
AE LQV NYG+GG YEPH+DF+ RP ++N GNR+AT L YMSDV GGATVF
Sbjct: 412 AELLQVANYGVGGQYEPHFDFSRRPFDSNLKVD---GNRLATFLNYMSDVEAGGATVFPD 468
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
S+WP KGTA FW+NL SG+GDY TRHAACPVL GS + + PCGL
Sbjct: 469 FGASIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIHERGQEFRRPCGL 528
>gi|17552840|ref|NP_499464.1| Protein DPY-18 [Caenorhabditis elegans]
gi|20455505|sp|Q10576.2|P4HA1_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; AltName: Full=Protein dumpy-18; Flags:
Precursor
gi|3881011|emb|CAA21045.1| Protein DPY-18 [Caenorhabditis elegans]
gi|6900013|emb|CAB71298.1| prolyl 4-hydroxylase alpha subunit 1 [Caenorhabditis elegans]
Length = 559
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 138/292 (47%), Positives = 187/292 (64%), Gaps = 19/292 (6%)
Query: 4 PTHQRAQGNKLYY-----QEALNKSPELKDEPPKVNNVAP--TLEVTEREKYEMLCRGDL 56
PTH RA+GN +Y QE + +S K+ PP + N P L TER YE LCR ++
Sbjct: 235 PTHPRAKGNVKWYEDLLEQEGVRRSDMRKNLPP-IQNRRPDSVLGNTERTMYEALCRNEV 293
Query: 57 TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
V +++L C Y R+ P+L P+K E P +L++DV+ D E+ I+++A+P+
Sbjct: 294 PVSQKDISRLYC-YYKRDRPFLVYAPIKVEIKRFNPLAVLFKDVISDDEVAAIQELAKPK 352
Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
L RATV + TG+L A YRISKSAWL+E E V+E +++R+ +MT L TAEELQ+ N
Sbjct: 353 LARATVHDSVTGKLVTATYRISKSAWLKEWEGDVVETVNKRIGYMTNLEMETAEELQIAN 412
Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
YGIGGHY+PH+D A+ E+ +F+SLGTGNR+ATVLFYMS + GG TVFT ++ P K
Sbjct: 413 YGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEAKSTILPTK 472
Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHSTC-----PCGLR 278
A FW+NL+ GDG+ TRHAACPVL G SN +H PCGL+
Sbjct: 473 NDALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFRRPCGLK 524
>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
Length = 235
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 131/217 (60%), Positives = 160/217 (73%), Gaps = 5/217 (2%)
Query: 74 NVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIA 133
N P+LRL P++ E Y P II++ DV+ D EID IK++AQPR RRATV + TGEL A
Sbjct: 4 NHPFLRLAPVRMEYLYRNPDIIVFNDVLSDYEIDYIKRIAQPRFRRATVHDPATGELVPA 63
Query: 134 NYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG 193
+YRISKSAWL++ E V+ R+SRRV +TGL+ +TAEELQVVNYGIGGHY+PH+DFAR
Sbjct: 64 HYRISKSAWLKDEESAVVARVSRRVADITGLSMTTAEELQVVNYGIGGHYDPHFDFARK- 122
Query: 194 EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
E NAF+ GNR+ATVLFYMSDVAQGGATVFT L LS++P +G+A FW NLH SG+GD
Sbjct: 123 EENAFEKF-NGNRIATVLFYMSDVAQGGATVFTELGLSVFPRRGSAVFWLNLHPSGEGDL 181
Query: 254 YTRHAACPVLTGSNSLHSTCPCGLRRGLQRSGIICTL 290
TRHAACPVL GS + C + +G Q C L
Sbjct: 182 ATRHAACPVLRGSKWV---CNKWIHQGGQELIRPCNL 215
>gi|390363005|ref|XP_797519.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like
[Strongylocentrotus purpuratus]
Length = 579
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 135/289 (46%), Positives = 180/289 (62%), Gaps = 26/289 (8%)
Query: 4 PTHQRAQGNKLYYQEAL----------NKSPELKDEPPKVNNVAPTLEVT---------E 44
P H+RA+ NK+++ L + E+ D+ ++ LE E
Sbjct: 266 PKHERAKNNKIFFMSELEEKEIKEKPRGEDAEIDDKTGEIVKTQEELEKEKAEQAYSYPE 325
Query: 45 REKYEMLCRGDLTVPPAIVAQL-KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
R++YE LCRGD + +L KC+Y H N P+L L P KEE + PR++ YR+++ D
Sbjct: 326 RKQYEALCRGDPGALKVVDHRLLKCQYQHYNHPFLYLQPAKEEVIFDDPRLVFYRNILND 385
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
EI +K++A PRL+RAT+QN TG LE A+YRISKSAW+++ E +I I RV+ TG
Sbjct: 386 KEIAFVKRLASPRLQRATIQNAITGNLEFADYRISKSAWVKQEEDQLIRSIRFRVQAYTG 445
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
L TAE+LQVVNYGIGGHYEPH+DFAR E NAF+SLGTGNR+AT LFY+S ++
Sbjct: 446 LELDTAEDLQVVNYGIGGHYEPHFDFARAEETNAFQSLGTGNRIATALFYVSITCPDMSS 505
Query: 224 VFTSLN------LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ + LSL GTA FW+NL SG G+Y TRHAACPVL+GS
Sbjct: 506 TYEPRDEIRNGFLSLVYPSGTAVFWYNLRKSGQGNYDTRHAACPVLSGS 554
>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 1 [Oryctolagus
cuniculus]
Length = 533
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 137/273 (50%), Positives = 181/273 (66%), Gaps = 14/273 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE---------VTEREKYEMLCRG 54
P+H+RA GN Y++ L + K + VA T E + ER+ YE LCRG
Sbjct: 238 PSHERAGGNLRYFERLLEEQ-RGKSLLNQTEAVAVTQEGIYERPVDYLPERDVYESLCRG 296
Query: 55 D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L + P KEE+ + P I+ Y DVM D EI+ IK++
Sbjct: 297 EGVKLTPRRQKRLFCRYHDGNGAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEI 356
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ RI+RR++H+TGLT TAE L
Sbjct: 357 AKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARINRRMQHITGLTVKTAELL 416
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++
Sbjct: 417 QVANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAI 474
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 475 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|339236271|ref|XP_003379690.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
gi|316977627|gb|EFV60702.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
Length = 558
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 134/291 (46%), Positives = 180/291 (61%), Gaps = 27/291 (9%)
Query: 2 IFPTHQRAQGNKLYYQEALN-----KSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDL 56
+ P H RA GN YYQ+ L+ + + K PP N L + ER+ YE LCR +
Sbjct: 242 VDPNHPRASGNLKYYQDLLDPEGKPRKIDPKKLPPPTNRRPDDLSIPERDVYEGLCRSEY 301
Query: 57 TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
+ A+L C Y RN PYL+L P+K E + +P+I+ +R V+ D EI +IK++A P
Sbjct: 302 PISDKDRAKLYC-YYKRNRPYLKLAPIKVEVMHWKPKIVYFRGVISDEEIAVIKQLASPL 360
Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
L+RATV N TG+LE A+YRISKSAWL++ EH V++RIS R++ MT LT TAE LQ+ N
Sbjct: 361 LKRATVHNADTGQLETASYRISKSAWLKDTEHEVVKRISDRIDMMTDLTMETAELLQIAN 420
Query: 177 YGIGGHYEPHYDFARPGEAN---------------------AFKSLGTGNRVATVLFYMS 215
YGIGGHY+PH+D + GE++ +F+SL GNR+ATVLFY+S
Sbjct: 421 YGIGGHYDPHFDMSTRGESDPYEEGTGNRIATVLFYTNDPYSFESLNAGNRIATVLFYIS 480
Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
GG TVFTS +++ P K AAFW N+ G+ D TRHAACPVL G+
Sbjct: 481 QPEAGGGTVFTSHKITVEPSKYDAAFWFNVLQGGEPDMSTRHAACPVLAGT 531
>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
gallopavo]
Length = 535
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 131/273 (47%), Positives = 179/273 (65%), Gaps = 14/273 (5%)
Query: 5 THQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE----------VTEREKYEMLCRG 54
TH+RA N Y+++ L K E V P ++ + ER+ YE LCRG
Sbjct: 239 THERAGSNLRYFEKLLEKEREKSSXNKTVATTEPVVQSGAYERPLDYLPERDIYEALCRG 298
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY + N P+L + P KEE+ + P I+ Y DVM D EI+ IK++
Sbjct: 299 EGVKMTPRRQKRLFCRYHNGNRNPHLVIAPFKEEDEWDSPHIVRYYDVMSDEEIEKIKQL 358
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ ++++R++ +TGLT TAE L
Sbjct: 359 AKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELL 418
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DF+R + KS GNR+AT L YMSDV GGATVF ++
Sbjct: 419 QVANYGMGGQYEPHFDFSRRPFDSTLKS--EGNRLATFLNYMSDVEAGGATVFPDFGAAI 476
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 477 WPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 509
>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Cricetulus griseus]
Length = 533
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 134/272 (49%), Positives = 180/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ ++L E + P + ER+ E LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKSLFNQTEAGLATQENVYERPVDFLPERDVLESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPQRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
Length = 535
Score = 255 bits (652), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 134/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ + L+ E + + P + ER+ YE LCRG+
Sbjct: 240 PSHERAGGNLHYFERLLEEEREKMLSNHTEAELASQQGIYERPVDYLPERDVYESLCRGE 299
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 300 GVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 359
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT TAE LQ
Sbjct: 360 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 419
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 420 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509
>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
taurus]
gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
Length = 533
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 134/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ + L+ E + + P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLHYFERLLEEEREKMLSNHTEAELASQQGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
Length = 541
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 139/302 (46%), Positives = 187/302 (61%), Gaps = 38/302 (12%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK-DEPPKVNNVAPT---LEVTERE---------KY 48
I PTH RA N Y+ + K + + D +P+ ++ ERE Y
Sbjct: 207 IDPTHTRATDNVAYFGSEIAKQTKKRGDTGTSRRTKSPSSTFKKLKEREYFHRTKAFQNY 266
Query: 49 EMLCRGDL-TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
E LCRG++ ++ Q+ C + R+ P L P + E +++P ++++R+ + DSEI
Sbjct: 267 EKLCRGEVRSLTKWEQGQMSCWQI-RDDPLTVLKPGRIERVFVKPEVLIFRNFITDSEIK 325
Query: 108 LIKKMAQPRLRRATVQ----------NYK------------TGELEIANYRISKSAWLRE 145
IK++A PRL+RATV+ NY+ TG+LE ANYRISKS WLR+
Sbjct: 326 RIKELATPRLKRATVKDPVTGELIFANYRISKRRATIQHPVTGKLEFANYRISKSGWLRD 385
Query: 146 PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
E +++RIS RV+ +GL +T+E+LQVVNYGIGGHYEPHYDFAR GE + F SLGTGN
Sbjct: 386 EEDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHYDFARDGE-DKFTSLGTGN 444
Query: 206 RVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
R+AT L Y+SDV GG TVFT + ++WP+KG AAFW+NL SGDGD TRHAACPVL G
Sbjct: 445 RIATFLSYLSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVG 504
Query: 266 SN 267
S
Sbjct: 505 SK 506
>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
Length = 487
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 134/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ + L+ E + + P + ER+ YE LCRG+
Sbjct: 192 PSHERAGGNLHYFERLLEEEREKMLSNHTEAELASQQGIYERPVDYLPERDVYESLCRGE 251
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 252 GVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 311
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT TAE LQ
Sbjct: 312 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 371
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 372 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 429
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 430 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 461
>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Ovis aries]
Length = 487
Score = 254 bits (650), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 134/272 (49%), Positives = 180/272 (66%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ + L E + + P + ER+ YE LCRG+
Sbjct: 192 PSHERAGGNLHYFERLLEEEREKMLTNHTEAELAAQQGIYERPVDYLPERDVYESLCRGE 251
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 252 GVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 311
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT TAE LQ
Sbjct: 312 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 371
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 372 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 429
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 430 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 461
>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
Length = 534
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 134/276 (48%), Positives = 179/276 (64%), Gaps = 21/276 (7%)
Query: 5 THQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT-------------EREKYEML 51
TH+RA N Y+++ L K + E P VA T V ER+ YE L
Sbjct: 239 THERAGSNLRYFEKLLEK----EREKPSNKTVATTEPVVQSGAYERPLDYLPERDIYEAL 294
Query: 52 CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
CRG+ + + P +L CRY N P+L + P KEE+ + P I+ Y DVM D EI+ I
Sbjct: 295 CRGEGVKMTPRRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSDEEIEKI 354
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ ++++R++ +TGLT TA
Sbjct: 355 KQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTA 414
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E LQV NYG+GG YEPH+DF+R + KS GNR+AT L YMSDV GGATVF
Sbjct: 415 ELLQVANYGMGGQYEPHFDFSRRPFDSTLKS--EGNRLATFLNYMSDVEAGGATVFPDFG 472
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 473 AAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 508
>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
lupus familiaris]
Length = 533
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 133/272 (48%), Positives = 178/272 (65%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ + L E + P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKMLLNQTEAGLATQESIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
domestica]
Length = 534
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 129/275 (46%), Positives = 180/275 (65%), Gaps = 13/275 (4%)
Query: 4 PTHQRAQGNKLYYQE---------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRG 54
P H+RA GN Y+++ LNK+ E + P+ + ERE YE LCRG
Sbjct: 238 PNHERAGGNLRYFEKLIEEERLGKTLNKTSETEPATQGAFYQRPSDYLPEREVYEALCRG 297
Query: 55 D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L + P KEE+ + P I+ Y DV+ D EI+ IK++
Sbjct: 298 EGIKLTPQRRKRLFCRYHDSNKTPQLLIAPFKEEDEWDSPHIVRYYDVLSDEEIEKIKEI 357
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
++P+L RATV++ KTG L + +YRISKS+WL+E + P+I +++RR++++TGL+ TAE L
Sbjct: 358 SKPKLSRATVRDPKTGHLIVVSYRISKSSWLKEDDDPIIAQVNRRMQYITGLSVKTAELL 417
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QV NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF ++
Sbjct: 418 QVSNYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDFGAAI 475
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
WP+KGT+ FW+NL SG+ DY TRHAACPVL GS
Sbjct: 476 WPKKGTSVFWYNLFRSGECDYRTRHAACPVLVGSK 510
>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
Length = 532
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 133/272 (48%), Positives = 178/272 (65%), Gaps = 12/272 (4%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ + L E + P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKMLLNQTEAGLATQESIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
araneus]
Length = 533
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 134/275 (48%), Positives = 179/275 (65%), Gaps = 18/275 (6%)
Query: 4 PTHQRAQGNKLYY---------QEALNKSPELKDEPPKVNNV--APTLEVTEREKYEMLC 52
P+H+RA GN Y+ + LN++ EP P+ + ER+ YE LC
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKTVLNQTGA---EPATQEGFYERPSDYLPERDVYESLC 294
Query: 53 RGD-LTVPPAIVAQLKCRYVH-RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
RG+ + + P +L CRY H P L + P KEE+ + P I+ Y DVM D EI+ IK
Sbjct: 295 RGEGVKLTPRRQKRLFCRYHHGHGAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIK 354
Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
++A+P+L RATV++ KTG L A+YR+SKS+WL E + PV+ R++ R++H+TGLT TAE
Sbjct: 355 EIAKPKLARATVRDPKTGVLTTASYRVSKSSWLEETDDPVVARVNLRMQHITGLTVKTAE 414
Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
LQV NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF L
Sbjct: 415 LLQVANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGA 472
Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 473 AIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507
>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
guttata]
Length = 539
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 135/277 (48%), Positives = 181/277 (65%), Gaps = 18/277 (6%)
Query: 5 THQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT--------------EREKYEM 50
TH+RA N Y+++ L K E K++ +N T E ER+ YE
Sbjct: 239 THERAGSNLRYFEKLLEKEREEKEKENSMNKTVTTTEAVVQSGAYERPLDYLPERDIYEA 298
Query: 51 LCRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
LCRG+ + + P +L CRY N P+L + P KEE+ + P I+ Y DVM D EI+
Sbjct: 299 LCRGEGVKMTPRRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSDEEIEK 358
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK++A+PRL RATV++ KTG L +A+YR+SKS+WL E + PV+ ++++R++H+TGLT T
Sbjct: 359 IKQLAKPRLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQHITGLTVKT 418
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AE LQV NYG+GG YEPH+DF+R + KS GNR+AT L YMSDV GGATVF
Sbjct: 419 AELLQVANYGMGGQYEPHFDFSRRPFDSTLKS--EGNRLATFLNYMSDVEAGGATVFPDF 476
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 477 GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 513
>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
laevis]
gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
Length = 533
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 142/295 (48%), Positives = 187/295 (63%), Gaps = 26/295 (8%)
Query: 5 THQRAQGNKLYYQE---------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
H+RA N Y+++ N++ E + P V N P + ER+ YE LCRG+
Sbjct: 239 NHERAGSNLKYFEKMQERQKGELKQNETIETETRQPGVYN-RPLDYLPERDVYEALCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY N P L L P+K E+ + PRI+ Y DV+ D EI+ IK++A
Sbjct: 298 GVKMNPRRQKRLFCRYHDGNRNPRLILGPIKMEDEWDSPRIVRYLDVLSDEEIEKIKELA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRL RATV++ KTG L +ANYR+SKSAWL E + PVI R++ R++ +TGLT TAE LQ
Sbjct: 358 KPRLARATVRDPKTGVLTVANYRVSKSAWLEEYDDPVIGRVNSRMQAITGLTKDTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFA-RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
V NYG+GG YEPH+DF+ RP ++N K+ GNR+AT L YMSDV GGATVF ++
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSN-LKT--EGNRLATYLNYMSDVEAGGATVFPDFGAAI 474
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
WP KGTA FW+NL SG+GDY TRHAACPVL GS + + PCGL
Sbjct: 475 WPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFHERGQEFLRPCGL 529
>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
Length = 532
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 135/280 (48%), Positives = 174/280 (62%), Gaps = 17/280 (6%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT---------------ERE 46
+ P H RAQ N +Y + + K + + K ++ A + E +
Sbjct: 232 LVPEHTRAQNNLNHYNQLIAKEEQEEGVRKKGDDGALKDAIMNDRFLNEEDQYRASPEFQ 291
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
YE LCRG+ VP +L C+Y + P + PL+EE A L+P I +Y +M D EI
Sbjct: 292 TYEALCRGEDVVPVKDPHKLTCQYRFWH-PMFYINPLREETASLEPWIAVYHQLMNDHEI 350
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
+ IK+MA PRL RATV N TG+LE A YRISKS WLR+ E P+I RIS R +T L+
Sbjct: 351 ERIKEMATPRLARATVHNSATGQLEHAKYRISKSGWLRDEEDPLIARISERCSALTNLSL 410
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+T EELQVVNYGIGG YEPH+DF+R E AF+ GNR+ TV++YM+DV GGATVF
Sbjct: 411 TTVEELQVVNYGIGGQYEPHFDFSRRSEPTAFEKW-RGNRILTVIYYMTDVEAGGATVFL 469
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ ++PEKG+AA WHNL SG+GD TRHAACPVLTGS
Sbjct: 470 DAGVKVYPEKGSAAVWHNLLPSGEGDMRTRHAACPVLTGS 509
>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Sarcophilus harrisii]
Length = 534
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 134/293 (45%), Positives = 184/293 (62%), Gaps = 23/293 (7%)
Query: 5 THQRAQGNKLYYQEAL---------NKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
+H+RA GN Y+++ L NK+ E + P + ER+ YE LCRG+
Sbjct: 239 SHERAGGNLRYFEKLLEEERLGKRLNKTSETQPATQGGIYERPPDYLPERDVYEALCRGE 298
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY N P L + P KEE+ + P I+ Y DV+ D EI+ IK++A
Sbjct: 299 GIKLTPRRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVLSDEEIERIKELA 358
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +ANYR+SKS+WL E + PVI +++RR+ ++TGL+ TAE LQ
Sbjct: 359 KPKLARATVRDPKTGVLTVANYRVSKSSWLEEGDDPVIAQLNRRMHYITGLSVKTAELLQ 418
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGATVF ++W
Sbjct: 419 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDFGATIW 476
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCG 276
P+KGT+ FW+NL SG+GDY TRHAACPVL GS + + PCG
Sbjct: 477 PKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFHERGQEFLRPCG 529
>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
Length = 533
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 132/278 (47%), Positives = 178/278 (64%), Gaps = 15/278 (5%)
Query: 1 MIFPTHQRAQGNKLYYQEALNKSPELKD---------EPPKVNNV--APTLEVTEREKYE 49
++ P H+RA N Y+++ L E +P N + P + ERE YE
Sbjct: 232 ILDPGHERAGSNMQYFEKLLESEKESNQINKLSVNPSDPKTYNGIYERPQDYLPERETYE 291
Query: 50 MLCRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
LCRG+ + + P L CRY + N P+L + P KEE+ + P I+ Y +V+ D EI+
Sbjct: 292 ALCRGEGVKLTPRRQKGLFCRYHNGNRNPHLIIAPFKEEDEWDSPHIVRYYEVLSDEEIE 351
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
IK++A+P+L RATV++ KTG L +ANYR+SKS+WL E + V+ R++ R+E +TGLTT
Sbjct: 352 KIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEEDDLVVARVNHRMEQITGLTTK 411
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
TAE LQV NYG+GG YEPH+DF+R K+ GNR+AT L YMSDV GGATVF
Sbjct: 412 TAELLQVANYGMGGQYEPHFDFSRRPFDITLKT--EGNRLATFLNYMSDVEAGGATVFPD 469
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 470 FGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 507
>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
(Silurana) tropicalis]
gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
Length = 527
Score = 249 bits (637), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 134/276 (48%), Positives = 179/276 (64%), Gaps = 16/276 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE----------VTEREKYEMLCR 53
P H RA N Y+++ K + N + T + + ER+ YE LCR
Sbjct: 238 PNHDRAVNNLKYFEKMQEKQKAELKQNESTNTESATRQPGVYSRPLDYLPERDVYEALCR 297
Query: 54 GD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY + N PYL L P+K E+ + PRI+ Y + + D EI IK+
Sbjct: 298 GEGVKMNPRRQRRLFCRYHNGNRSPYLILSPVKVEDEWDSPRIVRYLNALSDEEIAKIKE 357
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+P+L RATV++ KTG L +ANYR+SKSAWL E + PVI R++ R++ +TGLT TAE
Sbjct: 358 LAKPKLARATVRDPKTGVLSVANYRVSKSAWLEENDDPVIARVNLRMQAITGLTVDTAEL 417
Query: 172 LQVVNYGIGGHYEPHYDFA-RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
LQV NYG+GG YEPH+DF+ RP ++N K+ GNR+AT L YMSDV GGATVF
Sbjct: 418 LQVANYGMGGQYEPHFDFSRRPFDSN-LKT--DGNRLATFLNYMSDVEAGGATVFPDFGA 474
Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
++WP+KGTA FW+NL SG+GDY TRHAACPVL GS
Sbjct: 475 AIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGS 510
>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
queenslandica]
Length = 525
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 136/292 (46%), Positives = 185/292 (63%), Gaps = 22/292 (7%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPK-VNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
P+H+RA N+ Y+ ++EP K V++ + +E YE LCR +P +
Sbjct: 242 PSHERAISNREYFNRVS------REEPDKFVDHEGVLDDESEHAVYEKLCREPAPIPSHL 295
Query: 63 VAQLKCRYVH-RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+L C Y + + P L L P+K E A+++P+I ++ D++ D EI+ +K++A P+L RAT
Sbjct: 296 HKKLICYYFNNKRNPRLILSPIKTEVAFVKPKIYIFYDIVTDREIERLKELANPKLNRAT 355
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPV--IERISRRVEHMTGLTTSTAEELQVVNYGI 179
V + GEL A YRISKS WL + P+ ++RI +R+E +TGLT STAE+LQVVNYGI
Sbjct: 356 VHG-ENGELLHATYRISKSGWLSGSDDPLGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGI 414
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GG YEPHYDFAR GE + F SLG+GNR++T+L YMSDV +GGATVF + L P K A
Sbjct: 415 GGQYEPHYDFARTGE-DTFTSLGSGNRISTLLIYMSDVEKGGATVFPGVGARLVPIKRAA 473
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGL 281
A+W NL SGDGDY TRHA CPVL GS + + PCGL R +
Sbjct: 474 AYWWNLKRSGDGDYSTRHAGCPVLVGSKWVCNKWIHERGQEFRRPCGLSRDV 525
>gi|297301157|ref|XP_001103971.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Macaca
mulatta]
Length = 512
Score = 247 bits (630), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 129/275 (46%), Positives = 174/275 (63%), Gaps = 35/275 (12%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR MSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFAR----------------------MSDVSAGGATVFPEVGAS 452
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 453 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 487
>gi|324510827|gb|ADY44523.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 551
Score = 247 bits (630), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 135/295 (45%), Positives = 182/295 (61%), Gaps = 17/295 (5%)
Query: 4 PTHQRAQGNKLYYQ-----EALNKSPELKDEPPKVN-NVAPTLEVTEREKYEMLCRGDLT 57
P H RA+GN +Y+ E + + ++ PP +N LE TER+ +E LCR ++
Sbjct: 236 PNHPRAKGNLKWYEDLLEDEGVRRVDMRRNIPPLLNPRHDGGLEHTERDIFEALCRHEVP 295
Query: 58 VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
V +++L C Y + PYLRL P+K E L P +L+ +M D E +I+ +A P+L
Sbjct: 296 VSTKALSRLYC-YYKMDRPYLRLAPIKVEIMRLNPLAVLFHQIMSDEEAHIIEMLAIPKL 354
Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
RATVQN TG LE A+YRISKSAWL+ EH V++R ++R++ T L TAEELQ+ NY
Sbjct: 355 NRATVQNAMTGGLETASYRISKSAWLKPHEHEVVDRFNKRLDMATNLEMETAEELQIQNY 414
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
G+GGHY+PH+D AR E NAFK LGTGNRVAT+L YM++ GG TVFT + S+ K
Sbjct: 415 GVGGHYDPHFDCARKEEKNAFKELGTGNRVATILVYMTEPEIGGGTVFTEVKTSVACTKN 474
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
A FW+NL SG+ D +RHAACPVLTG + + PCGL + Q
Sbjct: 475 AALFWYNLLRSGEVDMRSRHAACPVLTGVKWVTNKWIHERGQEWRRPCGLNQFDQ 529
>gi|326923465|ref|XP_003207956.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 3
[Meleagris gallopavo]
Length = 518
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 131/273 (47%), Positives = 175/273 (64%), Gaps = 29/273 (10%)
Query: 4 PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
P HQRA GN Y++ EA S + +D+ K + ER KYEMLCRG+
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEFKKKDYLPERRKYEMLCRGEG 299
Query: 56 LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
L + P +L CRY +RN Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRL RATV + +TG+L A+YR+SKSAWL E PV+ RI+ R++ +TGL STAEELQ
Sbjct: 359 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+W
Sbjct: 419 ------------------KDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 460
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
P+KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 461 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 493
>gi|395820528|ref|XP_003783616.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Otolemur
garnettii]
Length = 516
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 130/275 (47%), Positives = 175/275 (63%), Gaps = 31/275 (11%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSSSDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQ E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQ------------------KDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 456
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 457 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 491
>gi|607947|gb|AAA62207.1| prolyl 4-hydroxylase alpha subunit [Caenorhabditis elegans]
Length = 558
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 136/292 (46%), Positives = 184/292 (63%), Gaps = 20/292 (6%)
Query: 4 PTHQRAQGNKLYY-----QEALNKSPELKDEPPKVNNVAP--TLEVTEREKYEMLCRGDL 56
PTH RA+GN +Y QE + +S K+ PP + N P L TER YE LCR ++
Sbjct: 235 PTHPRAKGNVKWYEDLLEQEGVRRSDMRKNLPP-IQNRRPDSVLGNTERTMYEALCRNEV 293
Query: 57 TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
V + +L C Y+ +L P+K E P +L++DV+ D E+ I+++A+P+
Sbjct: 294 PVSRRHL-RLYCYYL-AGPSFLVYAPIKVEIKRFNPLAVLFKDVISDDEVAAIQELAKPK 351
Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
L RATV + TG+L A YRISKSAWL+E E V+E +++R+ +MT L TAEELQ+ N
Sbjct: 352 LARATVHDSVTGKLVTATYRISKSAWLKEWEGDVVETVNKRIGYMTNLEMETAEELQIAN 411
Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
YGIGGHY+PH+D A+ E+ +F+SLGTGNR+ATVLFYMS + GG TVFT ++ P K
Sbjct: 412 YGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEAKSTILPTK 471
Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHSTC-----PCGLR 278
A FW+NL+ GDG+ TRHAACPVL G SN +H PCGL+
Sbjct: 472 NDALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFRRPCGLK 523
>gi|217272851|ref|NP_001136068.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Homo
sapiens]
gi|114631189|ref|XP_001140871.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 10 [Pan
troglodytes]
Length = 516
Score = 244 bits (622), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 130/275 (47%), Positives = 175/275 (63%), Gaps = 31/275 (11%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQ E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQ------------------KDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 456
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 457 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 491
>gi|350014318|dbj|GAA37183.1| prolyl 4-hydroxylase [Clonorchis sinensis]
Length = 595
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 129/287 (44%), Positives = 180/287 (62%), Gaps = 14/287 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
P H+++ N+ +Y+ L K + P ++ E E E Y+ LCRG+ PP
Sbjct: 305 PEHEQSLSNEEFYRTRLQKGEGIIGPAPPPEKLSKLDE--ETEIYQALCRGEQLFPPPPD 362
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
Q+ CRY + PY ++ P+KEE Y PRI+++ DV++ SE+ I+++A PRLRRATV+
Sbjct: 363 DQVYCRYYIPH-PYYKIGPVKEEVLYPDPRIVMWYDVIHPSEVGRIQELALPRLRRATVK 421
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
N TG+LE A YR SKSAWL++ V R+++R+ +TGL TAE+LQV NYGIGG+Y
Sbjct: 422 NPVTGKLENAYYRTSKSAWLQDGLDEVTHRLNQRIHALTGLAMETAEDLQVGNYGIGGYY 481
Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
PH+DF R E +AF+ + GNR+AT++FY++DV GGATVF S+ P +G A FW+
Sbjct: 482 APHFDFGRKREKDAFE-VENGNRIATIIFYLTDVKAGGATVFNRFGASVKPVRGAAGFWY 540
Query: 244 NLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRG 280
NLH SG+GD TRH ACPVL GS + + PC L RG
Sbjct: 541 NLHPSGEGDLRTRHVACPVLVGSKWVMNVWFHERGQEFRRPCELTRG 587
>gi|291404186|ref|XP_002718473.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 3
[Oryctolagus cuniculus]
Length = 516
Score = 243 bits (620), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 130/275 (47%), Positives = 174/275 (63%), Gaps = 31/275 (11%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K + K P+ VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSASDGQSDKKTTPRRKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQ E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQ------------------KDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 456
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 457 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 491
>gi|344274276|ref|XP_003408943.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3
[Loxodonta africana]
Length = 516
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 128/274 (46%), Positives = 172/274 (62%), Gaps = 29/274 (10%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE----LKDEPPKVNNVAPTLEVT-----EREKYEMLCRG 54
P HQRA GN Y++ + K + D P + V ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMTKEKDSNKSTSDAPSDQKSTVKKKGVAADYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRI+ + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHDIISDAEIEVVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
Q E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S+
Sbjct: 416 Q------------------KDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPDVGASV 457
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 458 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 491
>gi|426255748|ref|XP_004021510.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Ovis
aries]
Length = 516
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 128/274 (46%), Positives = 174/274 (63%), Gaps = 29/274 (10%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
P HQRA GN Y++ + K S + D+ + ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSSSDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
Q E +AFK LGTGNR+AT LFYMSDV GGATVF + S+
Sbjct: 416 Q------------------KDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEVGASV 457
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 458 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 491
>gi|341884171|gb|EGT40106.1| CBN-PHY-2 protein [Caenorhabditis brenneri]
Length = 607
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 143/367 (38%), Positives = 194/367 (52%), Gaps = 86/367 (23%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
I P H RA+GN +Y++ L + D PP VN + ER+ YE LCRG+ +PP
Sbjct: 235 IAPNHPRAKGNVKWYEDMLQGKDMVGDLPPIVNKRVEFDGIVERDAYEALCRGE--IPPV 292
Query: 62 ---IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
+L+C Y+ R+ P+L++ P+K E P +L+++V+ DSEI++IK++A P+L
Sbjct: 293 EEKWRNKLRC-YLKRDKPFLKIAPIKVEILRFDPLAVLFKNVISDSEIEVIKELASPKLE 351
Query: 119 RATVQNYKTGELEIANYRISK-------------------------------SAWLREPE 147
RATV+ G L +YRI+K SAWL+
Sbjct: 352 RATVKG-PDGTLITVDYRIAKRLVNWNTLHIVSPKGGFPKSKKMKNKCLVGFSAWLKGDL 410
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG-------------- 193
PVI+R++RR+E TGL +T+EELQV NYG+GGHY+PH+DFAR
Sbjct: 411 DPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFARIANYGLGGHYEPHYDM 470
Query: 194 ------------------------EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E NAFK+L TGNR+ATVLFYMS GGATVF L
Sbjct: 471 SLRGVPEPYGKNGNRIATVLFYKEEKNAFKTLNTGNRIATVLFYMSQPELGGATVFNHLG 530
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHS-----TCPCGLRR 279
+++P K A FW+NL G+GD TRHAACPVL G SN +H T PCGL
Sbjct: 531 TAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHEKGQEFTRPCGLEE 590
Query: 280 GLQRSGI 286
G+Q + +
Sbjct: 591 GVQENFV 597
>gi|156352054|ref|XP_001622587.1| predicted protein [Nematostella vectensis]
gi|156209158|gb|EDO30487.1| predicted protein [Nematostella vectensis]
Length = 531
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 127/248 (51%), Positives = 157/248 (63%), Gaps = 17/248 (6%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
E YE LCRG A+L+C Y P R+ PLK EE + P I + RDVMYDSE
Sbjct: 281 EAYERLCRGISYRSNEEAAKLRCYYDFTRHPMFRIRPLKVEELHSDPPIWMLRDVMYDSE 340
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREP----EHPVIERISRRVEHM 161
I+ IK+ A P+LRRATV N KTGELE A+YRISKS WL +P E ++ R++RR +
Sbjct: 341 IEYIKRTATPKLRRATVTNLKTGELEFADYRISKSGWLEDPRDDNEEKILNRVNRRTSII 400
Query: 162 TGLTTS--TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
TGL T+ +AE LQ+VNYG GHYEPH+D A ++ K LG GNR+ATVL+YMSDV
Sbjct: 401 TGLDTTPRSAEALQIVNYGAAGHYEPHFDHATEAVSSILK-LGIGNRIATVLYYMSDVEA 459
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
GGATVF + P KG AAFW+NLH +G GD TRHAACP++ GS + +
Sbjct: 460 GGATVFVDAEAIVKPSKGDAAFWYNLHKNGKGDERTRHAACPIIVGSKWVCNKWIHEHGQ 519
Query: 274 ----PCGL 277
PCGL
Sbjct: 520 EFRRPCGL 527
>gi|431904119|gb|ELK09541.1| Prolyl 4-hydroxylase subunit alpha-1 [Pteropus alecto]
Length = 507
Score = 239 bits (611), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 124/245 (50%), Positives = 166/245 (67%), Gaps = 13/245 (5%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K S + D+ PK VA + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSTSDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474
Query: 232 LWPEK 236
+WP+K
Sbjct: 475 VWPKK 479
>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
Length = 584
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/287 (44%), Positives = 177/287 (61%), Gaps = 14/287 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
PT+ RA N+ YY E +++ P+ ++ + E E YE LCR + P
Sbjct: 294 PTNTRAINNEAYYVEQIDRGEGRIGPNPRSQAISKHDQ--ETELYESLCRNENPFPTVPS 351
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
L CRY + + ++ P+KEE PRI+++ D+++ SEI+ IK++A PRLRRATV+
Sbjct: 352 HHLTCRYYTPHA-FFKIGPVKEETLNPDPRIVMWYDLIFPSEIEKIKELATPRLRRATVK 410
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
N TG LEIA YR SKSAWL + ++IS+R+ +TGL+ TAE+LQV NYG+GGHY
Sbjct: 411 NPVTGILEIAFYRTSKSAWLPHSMSEITDQISQRIRAVTGLSLETAEDLQVGNYGLGGHY 470
Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
PH+DF R E +AF+ + GNR+AT++FY+SDV GGATVF + + P+KG A FW
Sbjct: 471 APHFDFGRKREKDAFE-VKNGNRIATIIFYLSDVQAGGATVFNRIGTRVVPKKGAAGFWF 529
Query: 244 NLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRG 280
NL +G+GD TRHAACPVL GS + + PC L RG
Sbjct: 530 NLLPNGEGDLRTRHAACPVLAGSKWVMNLWFHERGQEFRRPCELERG 576
>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
Length = 535
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 130/296 (43%), Positives = 171/296 (57%), Gaps = 25/296 (8%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKV------------NNVAPTLEVTER-EKYEM 50
P H RAQ N ++++A+ + E E ++ + TE + YE
Sbjct: 237 PEHSRAQSNLAHFEQAIKEKEEALAEESRIRVEREAFRNGRFEHDPDAYHATEFFQTYEA 296
Query: 51 LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
LCRG+ +P +L C+Y + P + PL+EE P I +Y +M D +ID IK
Sbjct: 297 LCRGEDVIPIKDAHKLTCQYRVWH-PMFTINPLREETMNFDPWIAVYHQLMSDKDIDDIK 355
Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
+A PRL RATV N TGELE A YRISKS WL++ EHP + +IS R +T L+ ST E
Sbjct: 356 ALATPRLARATVVNSVTGELEFAKYRISKSGWLKDEEHPTVAKISNRCSALTNLSLSTVE 415
Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
ELQ+ NYGIGGHYEPH+D++R E +F GNR+ TV+FY+SDV GG TVF +
Sbjct: 416 ELQIANYGIGGHYEPHFDYSRLAEVTSFDHW-RGNRILTVIFYLSDVEAGGGTVFMTAGT 474
Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS----------TCPCG 276
L PEKG AA W+NLH G GD T+HAACPVLTG+ + + T PCG
Sbjct: 475 KLRPEKGAAAVWYNLHPDGTGDDETKHAACPVLTGNKWVANKWFHERGQEFTRPCG 530
>gi|55925444|ref|NP_001007286.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Danio rerio]
gi|49900294|gb|AAH76508.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide 2 [Danio rerio]
gi|182891794|gb|AAI65288.1| P4ha2 protein [Danio rerio]
Length = 514
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 134/298 (44%), Positives = 174/298 (58%), Gaps = 44/298 (14%)
Query: 2 IFPTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNV--APTLEVTEREKYEML 51
I P+HQRA GN Y++ L+K PE DE P + P + ERE YE L
Sbjct: 235 IDPSHQRAGGNLRYFERLLSKELQDSGQTQPEPADERPIQLDTYQRPKDYLPEREAYEAL 294
Query: 52 CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
CRG+ + + ++L CRY N P L L P+KEE+ + P I+ + + + D EI I
Sbjct: 295 CRGEGVKMTTKRQSRLFCRYRDGNRNPRLLLKPMKEEDEWDSPHIVRFLEALSDEEIQKI 354
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K++A P+L RATV++ KTG L +A+YR+SKSAWL + PVI R+++R+E +TGLT TA
Sbjct: 355 KEIATPKLARATVRDPKTGVLTVAHYRVSKSAWLEGEDDPVIARVNQRIEDITGLTVDTA 414
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E LQV NYG+GG YEPH+DF+R MSDV GGATVF
Sbjct: 415 ELLQVANYGVGGQYEPHFDFSR----------------------MSDVEAGGATVFPDFG 452
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
S+WP KGTA FW+NL SG+GDY TRHAACPVL GS + + PCGL
Sbjct: 453 ASVWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIHERGQEFRRPCGL 510
>gi|312092237|ref|XP_003147267.1| hypothetical protein LOAG_11701 [Loa loa]
Length = 553
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 132/295 (44%), Positives = 179/295 (60%), Gaps = 19/295 (6%)
Query: 2 IFPTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
I P H RA+GN +Y++ L + +++ + P +NN P + + Y+ LCR ++
Sbjct: 236 INPDHPRAKGNVRWYEDLLEDEGVRRADMRRKVPPINN--PRDKSDLNDTYQALCRQEMP 293
Query: 58 VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
V ++L C Y + PYLRL P+K E Y P +L+ D+M D E +I+ +A P+L
Sbjct: 294 VNIKAQSRLYC-YYKMDRPYLRLAPIKVEIVYQNPLAVLFHDIMSDEESRIIEMLAVPKL 352
Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
RATV N +TG LE A+YRISKSAWLR EH V+ RI+RR++ T L +TAEELQV NY
Sbjct: 353 DRATVHNVETGNLETASYRISKSAWLRSTEHEVVNRINRRLDLATNLEIATAEELQVQNY 412
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGGHYEPH D +R + +AF+ GTGNR+AT+L YM++ GG TVF +L S+ K
Sbjct: 413 GIGGHYEPHLDCSR--DEDAFERTGTGNRIATILIYMTEPEIGGRTVFINLKASVPCTKN 470
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
A FW+NL SG D + HAACPVLTG+ + PCGL R Q
Sbjct: 471 AALFWYNLMRSGAVDMRSYHAACPVLTGTKWTANKWFHERGQEWRRPCGLNRFDQ 525
>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
Length = 522
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 117/266 (43%), Positives = 173/266 (65%), Gaps = 8/266 (3%)
Query: 2 IFPTHQRAQGNKLYYQEAL-NKSPELKDEPPKVNNV-APTLEVTEREKYEMLCRGDLTVP 59
+ P H+ K +Y++ L + +++ + N + + ++ ++ LCRG++
Sbjct: 237 LVPDHESTLHQKTFYEDILWYQQEQVRTTLFRSNRIPSSKASMSSLTTFKKLCRGEIQRN 296
Query: 60 PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
+ + LKCRYV + ++ P K EE +L+P+I+++ DV+ D+EI+L+K++A+P L R
Sbjct: 297 VSETSHLKCRYVSNLSAFSKIGPFKLEEMHLKPKIVIFHDVLSDTEIELLKRLAKPILER 356
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
AT+ N +TG+ E + R+SKS+W + H I I++RV MTGL+ TAEELQVVNYG+
Sbjct: 357 ATIANQQTGKAERSKDRVSKSSWFPDEYHSTIRTITKRVADMTGLSMDTAEELQVVNYGL 416
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GG Y+PH+DF G+ L NR+ATVLFYMSDV+ GGATVF L ++L KGTA
Sbjct: 417 GGQYDPHFDFFHWGK------LKEVNRIATVLFYMSDVSIGGATVFPKLGVTLEARKGTA 470
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTG 265
AFW+NLHSSG+ DY T H ACPVL G
Sbjct: 471 AFWYNLHSSGELDYSTLHGACPVLIG 496
>gi|427795421|gb|JAA63162.1| Putative prolyl-4-hydroxylase-alpha efb, partial [Rhipicephalus
pulchellus]
Length = 568
Score = 237 bits (604), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 122/220 (55%), Positives = 153/220 (69%), Gaps = 10/220 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELK---------DEPPKVNNVAPTLEV-TEREKYEMLCR 53
P H RA GNK YY++ L K + K D+ + +P + +ER YE LCR
Sbjct: 303 PDHPRAPGNKRYYEDTLAKREQYKRGDDGDISEDDSITLKKRSPLPDADSERGIYERLCR 362
Query: 54 GDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
G+ P +L C+Y N PYL L P KEE + +PRI++Y DV+ + E+++IK +A
Sbjct: 363 GEKFPPLFHDRELTCQYRTNNRPYLLLQPAKEEVMFPKPRIVIYHDVLSEHEMNVIKTLA 422
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
QPRLRRATVQNYK+GELE A+YRISKSAWL+ EH VI R++RR+E +TGLT TAEELQ
Sbjct: 423 QPRLRRATVQNYKSGELETASYRISKSAWLKNEEHGVIARVTRRIEDITGLTADTAEELQ 482
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY 213
VVNYGIGGHYEPH+DFAR E NAF+SLGTGNR+AT L Y
Sbjct: 483 VVNYGIGGHYEPHFDFARREEKNAFQSLGTGNRIATWLNY 522
>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
Length = 535
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 125/294 (42%), Positives = 176/294 (59%), Gaps = 20/294 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPK-VNNVA-----PTLEVT---EREKYEMLCRG 54
P H RAQ NK+ + E + K+ + + + N PT E E + Y+ LC+G
Sbjct: 238 PDHVRAQNNKIDFMERVKKAANVTSRVKRDLTNTTHYVPKPTPEYNSTPELQSYKRLCKG 297
Query: 55 DLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQ 114
P ++Q+ CRY H N P L L P+KEEE Y ++L+ D+ D E+ +IK +A
Sbjct: 298 LDVKPREKMSQVVCRYRHNNNPRLLLSPIKEEEVYRDANMVLFHDIASDKEMKIIKSLAI 357
Query: 115 PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQV 174
P+L RATV + TG+L A YRI+K+AWL + +H V++R+ R++ +TGL +A+ LQV
Sbjct: 358 PKLFRATVHDPTTGKLIHAKYRITKTAWLDDRDHLVVDRVQNRIKAVTGLDLDSADALQV 417
Query: 175 VNYGIGGHYEPHYDFA-RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
NYGIGGHY+PHYDF+ R + + GNR+AT L YM+DV GGATVF +++ +
Sbjct: 418 ANYGIGGHYDPHYDFSTRDDDDTSETEKRDGNRIATFLLYMTDVDAGGATVFPIIDVRVL 477
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
P+KGTA FW+NL SG G TRHAACPVL G+ + + PCGL
Sbjct: 478 PKKGTAVFWYNLRRSGKGIMETRHAACPVLVGTKWVSNKWIRTRGQEFRRPCGL 531
>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
Length = 520
Score = 236 bits (602), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 120/228 (52%), Positives = 157/228 (68%), Gaps = 6/228 (2%)
Query: 44 EREKYEMLCRGD----LTVPPAIVAQLKCRY-VHRNVPYLRLMPLKEEEAYLQPRIILYR 98
E YE+LC+ D + + V LKCRY + N P L L P++ E+ + +P++ +
Sbjct: 269 ESRVYELLCQADQPEIFNITSSRVKHLKCRYFTNNNHPRLLLAPIRLEQVFDKPKLWVLH 328
Query: 99 DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
+++ D E+++IKK+AQPRLRRA V++ TGE E+A+YRISKSAWL + EH VI R+++RV
Sbjct: 329 NILTDPEMEVIKKLAQPRLRRARVESPTTGEGELASYRISKSAWLYDWEHRVIRRVNQRV 388
Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
E +TGLT TAE LQVVNYGIGGHYEPH+D A E A G+R+AT+LFYMSDV
Sbjct: 389 EDVTGLTMETAELLQVVNYGIGGHYEPHFDCATKDEEFALDP-NEGDRIATMLFYMSDVE 447
Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
GGATVF + + PEKG AFW+NL SG+GD T HA CPVL GS
Sbjct: 448 AGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGS 495
>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
Length = 534
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 123/269 (45%), Positives = 170/269 (63%), Gaps = 9/269 (3%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA-- 61
P H+RA+ N YY++ L K+ + K+ ++ E + Y+ LCRG V
Sbjct: 244 PKHERAKQNIYYYEKVLTKNSDGKEGEDSLSQDENDWS-HEFDFYKKLCRGGPKVKAGDN 302
Query: 62 --IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
+ L C Y R L P+ E LQP I++Y +++ D E++ +K +A P L+R
Sbjct: 303 KMVSNHLTC-YQLRQHARLLFSPINVEVISLQPYILIYHNLLNDLEVEALKTLAAPMLQR 361
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
ATV N TG+LE A YRISKSAWL + +HP++ RIS +E +TGLT +AE LQ+ NYGI
Sbjct: 362 ATVHNKDTGKLEYATYRISKSAWLNDDDHPLVRRISTLIEDVTGLTMESAEALQIANYGI 421
Query: 180 GGHYEPHYDFA--RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GGHYEPH+D A R G + FK+ GNR+AT+L Y+S V GGATVF+S + + P +G
Sbjct: 422 GGHYEPHFDHADVRSG-TDVFKTWKGGNRIATMLIYLSSVELGGATVFSSAGVRIEPRQG 480
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+AAFW+NLH +G+G+ TRHAACPVL GS
Sbjct: 481 SAAFWYNLHRNGNGNNLTRHAACPVLIGS 509
>gi|410975458|ref|XP_003994148.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Felis catus]
Length = 567
Score = 229 bits (585), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 130/308 (42%), Positives = 177/308 (57%), Gaps = 44/308 (14%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
P HQRA GN Y++ + K S + D + ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDGNKSASDDQSDRKTTLKKKGVAVDYLPERQKYEMLCRG 295
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355
Query: 113 AQPRLRRATVQNYKTGELEIANYRISKS--AWLREPEHPVIERISRRVEH-----MTGLT 165
A+PRL RATV + +TG+L A YR+SKS +W + +I + E G +
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSLVSWGKVQRALLIRSMQVCCERGPEAAWDGGS 415
Query: 166 TSTAEELQ--------------------------VVNYGIGGHYEPHYDFARPGEANAFK 199
S E L V NYG+GG YEPH+DFAR E +AFK
Sbjct: 416 MSAEECLAELSLLAGECSAALVPIGVCESRLGKGVANYGVGGQYEPHFDFARKDEPDAFK 475
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAA 259
LGTGNR+AT LFYMSDV+ GGATVF + S+WP+KGTA FW+NL +SG+GDY TRHAA
Sbjct: 476 ELGTGNRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAA 535
Query: 260 CPVLTGSN 267
CPVL G+
Sbjct: 536 CPVLVGNK 543
>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
Length = 519
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 115/228 (50%), Positives = 152/228 (66%), Gaps = 6/228 (2%)
Query: 44 EREKYEMLCRGD----LTVPPAIVAQLKCRY-VHRNVPYLRLMPLKEEEAYLQPRIILYR 98
E YE+LC+G+ + P+ V LKCRY + N P L L P++ E+ + +P++ +
Sbjct: 268 ESRVYELLCQGNQPEIFNITPSRVKHLKCRYFTNNNHPRLLLAPIRLEQVFDKPKLWVLH 327
Query: 99 DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
+++ D E+++IKK+AQPRLR A QN TG +++YRISK+AWL EH +I R+ +RV
Sbjct: 328 NILSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSYRISKNAWLYYWEHRLINRVKQRV 387
Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
E TGLT TAE LQV+NYGIGGHYEPH+D A E A G+R+AT+LFYMSDV
Sbjct: 388 EDATGLTMETAEPLQVINYGIGGHYEPHFDCATKDEEFALDP-NEGDRIATMLFYMSDVE 446
Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
GGATVF + + PEKG AFW+NL SG+GD T HA CPVL GS
Sbjct: 447 AGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGS 494
>gi|51490656|emb|CAF31507.1| prolyl 4-hydroxylase 2 precursor [Brugia malayi]
Length = 551
Score = 226 bits (577), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 123/288 (42%), Positives = 173/288 (60%), Gaps = 19/288 (6%)
Query: 4 PTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVP 59
P H RA+GN +Y++ L + +++ + P +NN P + ++ YE LCR ++ +
Sbjct: 239 PDHPRAKGNVRWYEDLLEDEGIRRADMRRKVPPMNN--PRDKSNLKDTYEALCRQEVPIN 296
Query: 60 PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
++L C Y + PYLRL P K E + P ++L+RD++ D E+ +I+ +A P+L R
Sbjct: 297 TKAQSRLYC-YYKMDRPYLRLAPFKVEIVHQNPLVVLFRDIVSDEEMRIIEMLAVPKLAR 355
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
ATV N TG +E A YR S+S+WL EH V++RI++R++ T L T TAEELQV NYGI
Sbjct: 356 ATVHNVVTGNIETAFYRTSQSSWLGSTEHEVVKRINKRLDLATNLETETAEELQVQNYGI 415
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GGHYEPHYD +R N F+ GNR+AT+L YM++ GG TVF L S+ K A
Sbjct: 416 GGHYEPHYDCSR--RENVFEKTKNGNRIATILIYMTEPEIGGGTVFIDLKTSVSCTKNAA 473
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS-----NSLHSTC-----PCGL 277
FW+NL SG D + HAACPVLTG+ H + PCGL
Sbjct: 474 LFWYNLMRSGAVDMRSYHAACPVLTGTKWTANKWFHESGQEWRRPCGL 521
>gi|339236275|ref|XP_003379692.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
gi|316977629|gb|EFV60704.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
Length = 441
Score = 224 bits (571), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 132/322 (40%), Positives = 180/322 (55%), Gaps = 52/322 (16%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK----DEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
I P H RA+GN +Y + L K + D PP VN + ER+ +E LCRG+
Sbjct: 122 IKPDHPRAEGNVKWYLDLLAKEGVSRVTDHDLPPIVNARPNDQALPERKDFEALCRGEYL 181
Query: 58 VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
+ ++L C Y R+ P+L L P+K E + +P+I+++R V+ +EI ++K +A PRL
Sbjct: 182 LTEKQRSRLYC-YYKRDTPFLSLAPIKVEVMHWKPKIVIFRQVISANEIAVLKTLAYPRL 240
Query: 118 RRATVQNYKTGELEIA---------------------------NYRISKSAWLREPEHPV 150
RATVQN +TGELE A +YRISKSAWL+E EHPV
Sbjct: 241 SRATVQNSETGELETAKYRISKRCRTLRRATVHNKETGQLEHASYRISKSAWLKEHEHPV 300
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
++RI +R+ MT L TAE+LQ+ NYG+GGHY+PH+D AR E + ++ G GNR+AT
Sbjct: 301 VDRIVKRIHDMTNLNMETAEDLQIANYGLGGHYDPHFDHARRDEVDPYEH-GHGNRIATT 359
Query: 211 LFYMSDVAQGGATVFTSLN----LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
LFY +V F SLN ++ G AAFW NL +G+GD TRHAACPVL G
Sbjct: 360 LFYKEEV-----NAFKSLNTGNRIATVLFYGDAAFWFNLKPNGEGDMSTRHAACPVLAGV 414
Query: 267 NSLHSTC----------PCGLR 278
+ + PCGLR
Sbjct: 415 KWVANKWIHERGQEFYRPCGLR 436
>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
Length = 514
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 118/264 (44%), Positives = 163/264 (61%), Gaps = 11/264 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
P +QRA +K E L K E K K + + P + + Y LCRGD P +
Sbjct: 236 PDNQRALNSK----EPLEKWIEYK----KQHGLPPPVPEPYVKNYPSLCRGDDQRPAKEL 287
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
A+L+CRY H P+LR+ PLK +E P I++Y DV+ + EID I +++P + R+ V
Sbjct: 288 AKLRCRYEHNRTPFLRISPLKLQEVNHDPMIVMYHDVISNKEIDAIISISKPLMHRSMVG 347
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
+ E ++ R S +AWL + HPV+ +S+R E MT L + AE LQV NYGIGGHY
Sbjct: 348 D--DHEKAVSKTRTSSNAWLDDVMHPVVRTLSQRTEDMTNLAMTAAERLQVGNYGIGGHY 405
Query: 184 EPHYDFARPGEAN-AFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
PHYD+A E + S+G GNR+ATV++Y+SDVA GGATVF L L ++P+KG+A FW
Sbjct: 406 LPHYDYAVAEEGKEVYPSIGKGNRIATVMYYLSDVAIGGATVFPQLGLGVFPQKGSAIFW 465
Query: 243 HNLHSSGDGDYYTRHAACPVLTGS 266
+NLH++G D+ T H ACPV GS
Sbjct: 466 YNLHANGTVDHRTLHGACPVFVGS 489
>gi|449673565|ref|XP_002167120.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 571
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 120/268 (44%), Positives = 168/268 (62%), Gaps = 8/268 (2%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELK---DEPPKVNNVAPTLEVTER--EKYEMLCRGDLT- 57
P QR N Y+ + L+ S D+ K ++ A T + + YE LCRG++
Sbjct: 280 PNEQRIVENLDYFNKYLHTSRSTSRYGDDGLKDDSSAFTSDNKNKVLNAYEQLCRGEVRP 339
Query: 58 VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
+ A++KC Y ++ P L+L P K E ++ P I + R+++ + +I+LIK+ A P L
Sbjct: 340 LTKKEQAKMKCWYSAKD-PVLKLKPQKVERVWVDPEIFILRNIISEKQINLIKEAASPML 398
Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
RRAT+Q+ TG+L A+YRISKSAWL ++ ++ + R + TGL S AE+LQV NY
Sbjct: 399 RRATIQDPITGKLRHADYRISKSAWLSTNKYNFLQALEARTQATTGLDLSYAEQLQVANY 458
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
G+GGHYEPH+D +R E + F LG GNR+ATVLFY+SDV GGATVFT +++P KG
Sbjct: 459 GLGGHYEPHFDHSRENE-DRFTDLGMGNRIATVLFYLSDVEAGGATVFTVGKTAVFPSKG 517
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
A FW NL +G G+ TRHAACPVL G
Sbjct: 518 DAVFWFNLKRNGKGNPNTRHAACPVLVG 545
>gi|301613006|ref|XP_002936013.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 504
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 125/271 (46%), Positives = 159/271 (58%), Gaps = 34/271 (12%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPK------VNNVAPTLEVTEREKYEMLCRGD-L 56
P HQR GN Y++ ++K P P + ER+KYE LCRG+ +
Sbjct: 235 PEHQRGNGNLRYFEYIMSKESNKSSSSPSEGAELGTRKGRPKDHLPERQKYEKLCRGEGV 294
Query: 57 TVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
+ +L CRY N P L L P K+E+ + +PRI+ Y D++ D EI +K++A+P
Sbjct: 295 KMTSRRQKRLFCRYFDGNKDPLLILSPTKQEDEWDKPRIVRYHDIISDEEISKVKELAKP 354
Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
RLRRAT+ N TG LE A YRISK W EL+V
Sbjct: 355 RLRRATISNPITGVLETAQYRISKR-W-------------------------AIMELEVA 388
Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
NYG+GG YEPH+DFAR E +AFK LGTGNRVAT LFYMSDV GGATVF + +++P+
Sbjct: 389 NYGMGGQYEPHFDFARKDEPDAFKELGTGNRVATWLFYMSDVEAGGATVFPEVGAAVYPK 448
Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
KGTA FW+NL SG+GDY TRHAACPVL G+
Sbjct: 449 KGTAVFWYNLFESGEGDYSTRHAACPVLVGN 479
>gi|443707037|gb|ELU02831.1| hypothetical protein CAPTEDRAFT_181697 [Capitella teleta]
Length = 538
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 108/229 (47%), Positives = 146/229 (63%), Gaps = 6/229 (2%)
Query: 42 VTEREKYEMLCRGDLTVPPAIVAQ---LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYR 98
++ + YE LCRG+ T + + C YV R P L+P KEE +L P I +Y
Sbjct: 287 TSQFDDYERLCRGEETKVGKLSNSHLIMLCNYV-RPHPMFILVPAKEEVMFLDPFIAIYH 345
Query: 99 DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE-IANYRISKSAWLREPEHPVIERISRR 157
++M D E D+IK++++P+L R+ V Y G + + +YR SKSAW+ + EHP+I R+S R
Sbjct: 346 NLMTDKEADMIKRISKPKLHRSGVFTYSGGNQKPVQDYRTSKSAWIEDEEHPMIRRVSER 405
Query: 158 VEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDV 217
+T L+ T E QVVNYGIGGHYEPH+DFARP E F GNR+ TV+FY++
Sbjct: 406 TSALTDLSLDTVELFQVVNYGIGGHYEPHFDFARPNEIATFDP-EVGNRIITVIFYVAAP 464
Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
GGATVF L + LWPEKG+ A W NL +G+GDY T+HA CP +TGS
Sbjct: 465 EAGGATVFPDLGVKLWPEKGSCAVWWNLMRNGEGDYRTKHAGCPTITGS 513
>gi|344252711|gb|EGW08815.1| Prolyl 4-hydroxylase subunit alpha-2 [Cricetulus griseus]
Length = 584
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 128/302 (42%), Positives = 169/302 (55%), Gaps = 58/302 (19%)
Query: 4 PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ ++L E + P + ER+ E LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKSLFNQTEAGLATQENVYERPVDFLPERDVLESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPQRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY-------------------- 213
E +AFK LGTGNRVAT L Y
Sbjct: 418 ------------------SDEQDAFKRLGTGNRVATFLNYGDLRTLSCPQGFVALLSLGR 459
Query: 214 ----------MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVL 263
MSDV GGATVF L ++WP+KGTA FW+NL SG+GDY TRHAACPVL
Sbjct: 460 GAKLFALCSQMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVL 519
Query: 264 TG 265
G
Sbjct: 520 VG 521
>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
Length = 516
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 147/220 (66%), Gaps = 5/220 (2%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
YE LCRGD PP+ + L CRY P+LRL PLK+E L P + +Y D D+EI+
Sbjct: 276 YEPLCRGDHQRPPSETSNLYCRYHMSTSPFLRLAPLKQEVVNLDPFVAVYHDAASDAEIN 335
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT-GLTT 166
+ ++ +P++ R+ V + + E++ R S+++WL + +HPV+ +SRR + M GL
Sbjct: 336 KVIELGRPQINRSMVGD--AAKKEVSKSRTSQNSWLTDYDHPVVAALSRRTKDMALGLDE 393
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+ E LQV NYGIGGHY PHYD++R E N + L TGNR+AT++FY+SDV +GGATVF
Sbjct: 394 TAYESLQVNNYGIGGHYLPHYDWSR--EENPYPELNTGNRIATLMFYLSDVEEGGATVFP 451
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
L + ++P+KGTA FW+NL +SG GD T H ACPVL GS
Sbjct: 452 HLGVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPVLIGS 491
>gi|444512226|gb|ELV10078.1| Prolyl 4-hydroxylase subunit alpha-1 [Tupaia chinensis]
Length = 474
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 112/222 (50%), Positives = 149/222 (67%), Gaps = 13/222 (5%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K S + D+ PK VA + ER+KYEMLCR
Sbjct: 209 PEHQRANGNLKYFEYIMAKEKDTNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 267
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 268 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 327
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 328 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 387
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY 213
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFY
Sbjct: 388 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFY 429
>gi|393903732|gb|EFO16802.2| hypothetical protein LOAG_11701 [Loa loa]
Length = 531
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 123/295 (41%), Positives = 166/295 (56%), Gaps = 42/295 (14%)
Query: 2 IFPTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
I P H RA+GN +Y++ L + +++ + P +NN P + + Y+ LCR ++
Sbjct: 237 INPDHPRAKGNVRWYEDLLEDEGVRRADMRRKVPPINN--PRDKSDLNDTYQALCRQEMP 294
Query: 58 VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
V ++L C Y + PYLRL P+K E Y P +L+ D+M D E +I+ +A P+L
Sbjct: 295 VNIKAQSRLYC-YYKMDRPYLRLAPIKVEIVYQNPLAVLFHDIMSDEESRIIEMLAVPKL 353
Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
RATV N +TG LE A+YRISKSAWLR EH V+ RI+RR++ T L +TAEELQV NY
Sbjct: 354 DRATVHNVETGNLETASYRISKSAWLRSTEHEVVNRINRRLDLATNLEIATAEELQVQNY 413
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGGHYEPH D +R + +AF+ GTGNR+AT+L Y
Sbjct: 414 GIGGHYEPHLDCSR--DEDAFERTGTGNRIATILIY-----------------------N 448
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
A FW+NL SG D + HAACPVLTG+ + PCGL R Q
Sbjct: 449 AALFWYNLMRSGAVDMRSYHAACPVLTGTKWTANKWFHERGQEWRRPCGLNRFDQ 503
>gi|426365135|ref|XP_004049642.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Gorilla gorilla
gorilla]
Length = 500
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 111/236 (47%), Positives = 152/236 (64%), Gaps = 7/236 (2%)
Query: 33 VNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQ 91
+N V L + KY + + P +L CRY N P L P K+E+ + +
Sbjct: 245 INTVFKILNILFEAKY---LQSTASFTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDK 301
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRII + D++ D+EI+++K +A+PRL RATV + +TG+L A YR+SK + +
Sbjct: 302 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKRTICLL--YINL 359
Query: 152 ERISRRVEHMTGLTTSTAEEL-QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+R R+ + L +T + QV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT
Sbjct: 360 KRYYTRLGFLFLLYNTTCPFVPQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 419
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
LFYMSDV+ GGATVF + S+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 420 LFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 475
>gi|449513594|ref|XP_002191636.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
[Taeniopygia guttata]
Length = 346
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 109/221 (49%), Positives = 150/221 (67%), Gaps = 11/221 (4%)
Query: 4 PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
P HQRA GN Y++ EA S + +++ K V + ER KYEMLCRG+
Sbjct: 127 PEHQRANGNMKYFEYIMAKEKEANKSSTDSEEQQEKETEVKKKDYLPERRKYEMLCRGEG 186
Query: 56 LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
L + P +L CRY +RN Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 187 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 245
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+PRL RATV + +TG+L A+YR+SKSAWL E PV+ RI+ R++ +TGL STAEELQ
Sbjct: 246 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 305
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYM 214
V NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFY+
Sbjct: 306 VANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYV 346
>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
musculus]
gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_d [Rattus norvegicus]
Length = 189
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 98/165 (59%), Positives = 127/165 (76%), Gaps = 2/165 (1%)
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
M D EI+ IK++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H
Sbjct: 1 MSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQH 60
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
+TGLT TAE LQV NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV G
Sbjct: 61 ITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAG 118
Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
GATVF L ++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 119 GATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 163
>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
Length = 537
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 106/243 (43%), Positives = 149/243 (61%), Gaps = 3/243 (1%)
Query: 24 PELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPL 83
PE DEP + E +YE +CRG++ P L+CR N P+ L PL
Sbjct: 263 PEESDEPLLPRHSDSYSLTHEFAQYEKVCRGEVNPTPRQERNLRCRLSQGNHPFRLLAPL 322
Query: 84 KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
K EE L P ++ Y D++ +I +++MA PR+RR+TV G+ + + +R+SK+AWL
Sbjct: 323 KLEEHNLDPYVVTYHDMLSAQKIRDLRQMAVPRMRRSTVNPLPGGQNKKSAFRVSKNAWL 382
Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
HP +E + R ++ TGL T+ E+LQV NYG+GGHYEPH+DF R + N + +
Sbjct: 383 AYESHPTMEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHWDFFR--DPNHYPA-EE 439
Query: 204 GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVL 263
GNR+AT +FY+SDV QGGAT F L+ ++ P+ G FW+NLH S D DY T+HA CPVL
Sbjct: 440 GNRIATAIFYLSDVEQGGATAFPFLDFAVKPQLGNVLFWYNLHRSLDMDYRTKHAGCPVL 499
Query: 264 TGS 266
GS
Sbjct: 500 KGS 502
>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
Length = 187
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 97/163 (59%), Positives = 126/163 (77%), Gaps = 2/163 (1%)
Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
D EI+ IK++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+T
Sbjct: 1 DEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHIT 60
Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
GLT TAE LQV NYG+GG YEPH+DF+R + K+ GNR+AT L YMSDV GGA
Sbjct: 61 GLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGA 118
Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
TVF L ++WP+KGTA FW+NL SG+GDY TRHAACPVL G
Sbjct: 119 TVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 161
>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
Length = 520
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 114/251 (45%), Positives = 156/251 (62%), Gaps = 14/251 (5%)
Query: 18 EALNKSPELKDEP-PKVN-NVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNV 75
EAL ++ +P PKV + +PTL YEM CRG P + ++L CRY
Sbjct: 260 EALIRTGTSNQQPQPKVGLSRSPTL-------YEMGCRG--MYPASTDSKLVCRYNSTTT 310
Query: 76 PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
P+L L PLK E L P +++Y DV+ +EID +K+MA P L+RATV G+ E+
Sbjct: 311 PFLTLAPLKMEIVGLNPYMVIYHDVLSSAEIDEMKEMATPSLKRATVYKASLGKNEVVKT 370
Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
R SK AW + + + R++ R+ MTG S +E LQ++NYG+GGHY+ HYDF E
Sbjct: 371 RTSKVAWFPDSYNSLTLRLNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHYDFFNATEK 430
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYT 255
++ SL TG+R+ATVLFYMSDV QGGATVF ++ +++P++GTA W+NL G D T
Sbjct: 431 SS--SL-TGDRIATVLFYMSDVEQGGATVFPNIYKTVYPQRGTAVMWYNLKDDGQPDEQT 487
Query: 256 RHAACPVLTGS 266
HAACPVL GS
Sbjct: 488 LHAACPVLVGS 498
>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
Length = 537
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 102/223 (45%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E KYE +CRG+ V P + +L+CRY N PY L PLK EE L P + + D++
Sbjct: 285 EFAKYEKVCRGE--VHPIVRQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDILSP 342
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I +++MA PR+ R+TV G+L+ + +R+SK+AWL HP + + R ++ TG
Sbjct: 343 GKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATG 402
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
L T+ E+LQV NYG+GGHYEPH+DF R + N + + GNR+AT +FY+S+V QGGAT
Sbjct: 403 LDTTFCEQLQVANYGVGGHYEPHWDFFR--DPNHYPA-EEGNRIATAIFYLSEVEQGGAT 459
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F L++++ P+ G FW+NLH S D DY T+HA CPVL GS
Sbjct: 460 AFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGS 502
>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
Length = 536
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 101/228 (44%), Positives = 142/228 (62%), Gaps = 13/228 (5%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E YE +CRG++ PA L+CRY N PY +L PLK EE L P ++ Y D++
Sbjct: 282 EFAHYEKVCRGEVEPSPAQQRPLRCRYSQGNHPYRQLAPLKMEEHSLDPFVVTYHDMLSP 341
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
++I +++MA P +RR+TV G+ + +++R+SK+AWL HP + ++ R + TG
Sbjct: 342 NKIAQLREMAVPHMRRSTVNPLPGGQNKKSSFRVSKNAWLAYETHPTMGKMLRDLSDTTG 401
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFAR-----PGEANAFKSLGTGNRVATVLFYMSDVA 218
L + E+LQV NYG+GGHYEPH+DF R P E GNR+AT ++Y+S+V
Sbjct: 402 LDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEE--------GNRIATAIYYLSEVE 453
Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
QGGAT F LN ++ P+ G FW+NLH S D DY T+HA CPVL GS
Sbjct: 454 QGGATAFPFLNFAVRPQLGNVLFWYNLHRSSDMDYRTKHAGCPVLKGS 501
>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
Length = 467
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 102/223 (45%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E KYE +CRG+ V P + +L+CRY N PY L PLK EE L P + + D++
Sbjct: 215 EFAKYEKVCRGE--VHPIVRQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDILSP 272
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I +++MA PR+ R+TV G+L+ + +R+SK+AWL HP + + R ++ TG
Sbjct: 273 GKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATG 332
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
L T+ E+LQV NYG+GGHYEPH+DF R + N + + GNR+AT +FY+S+V QGGAT
Sbjct: 333 LDTTFCEQLQVANYGVGGHYEPHWDFFR--DPNHYPA-EEGNRIATAIFYLSEVEQGGAT 389
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F L++++ P+ G FW+NLH S D DY T+HA CPVL GS
Sbjct: 390 AFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGS 432
>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
[Drosophila melanogaster]
Length = 286
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 102/226 (45%), Positives = 146/226 (64%), Gaps = 5/226 (2%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E KYE +CRG+ V P + +L+CRY N PY L PLK EE L P + + D++
Sbjct: 34 EFAKYEKVCRGE--VHPIVRQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDILSP 91
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I +++MA PR+ R+TV G+L+ + +R+SK+AWL HP + + R ++ TG
Sbjct: 92 GKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATG 151
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
L T+ E+LQV NYG+GGHYEPH+DF R + N + + GNR+AT +FY+S+V QGGAT
Sbjct: 152 LDTTFCEQLQVANYGVGGHYEPHWDFFR--DPNHYPA-EEGNRIATAIFYLSEVEQGGAT 208
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
F L++++ P+ G FW+NLH S D DY T+HA CPVL GS +
Sbjct: 209 AFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWI 254
>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 520
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 101/220 (45%), Positives = 144/220 (65%), Gaps = 5/220 (2%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
YE LCRGD P + +QL CRY P+LRL PLK E L+P I++Y + + D EI
Sbjct: 280 YEKLCRGDYERPGEVTSQLFCRYETSATPFLRLAPLKLEVVNLEPLIVVYHEAVSDREIA 339
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG-LTT 166
+ ++A+P ++R+ V + ++ + I+ RIS++AW P++E +++R M G L
Sbjct: 340 KLIELARPLIKRSAVGDTRSEQ--ISKIRISQNAWFENEHDPIVETLNQRARDMAGGLNE 397
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+ E LQV NYG+GG Y HYD++ AN F + G GNR+AT++FY+SDV +GG+TVF
Sbjct: 398 PSYELLQVNNYGLGGFYSIHYDWSTS--ANPFPNKGMGNRIATLMFYLSDVQEGGSTVFP 455
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
LNL++ P KGTA FW+NLH +G G+ T HAACPVL GS
Sbjct: 456 RLNLAVRPRKGTAIFWYNLHRNGKGNKKTLHAACPVLIGS 495
>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
Length = 537
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 102/223 (45%), Positives = 144/223 (64%), Gaps = 5/223 (2%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E KYE +CRG+ V P +L+CRY N PY L PLK EE L P + + D++
Sbjct: 285 EFAKYEKVCRGE--VHPIARQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDMLNP 342
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I +++MA PR+ R+TV G+L+ + +R+SK+AWL HP + + R ++ TG
Sbjct: 343 RKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATG 402
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
L T+ E+LQV NYG+GGHYEPH+DF R + N + + GNR+AT +FY+S+V QGGAT
Sbjct: 403 LDTTFCEQLQVANYGVGGHYEPHWDFFR--DPNHYPA-EEGNRIATAIFYLSEVEQGGAT 459
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F L++++ P+ G FW+NLH S D DY T+HA CPVL GS
Sbjct: 460 AFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGS 502
>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
Length = 537
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 102/223 (45%), Positives = 144/223 (64%), Gaps = 5/223 (2%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E KYE +CRG+ V P +L+CRY N PY L PLK EE L P + + D++
Sbjct: 285 EFAKYEKVCRGE--VHPIARQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDMLSP 342
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I +++MA PR+ R+TV G+L+ + +R+SK+AWL HP + + R ++ TG
Sbjct: 343 RKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATG 402
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
L T+ E+LQV NYG+GGHYEPH+DF R + N + + GNR+AT +FY+S+V QGGAT
Sbjct: 403 LDTTFCEQLQVANYGVGGHYEPHWDFFR--DPNHYPA-EEGNRIATAIFYLSEVEQGGAT 459
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F L++++ P+ G FW+NLH S D DY T+HA CPVL GS
Sbjct: 460 AFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGS 502
>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
4-dioxygenase (proline 4-hydroxylase), alpha 1
polypeptide [Ciona intestinalis]
Length = 195
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 95/165 (57%), Positives = 124/165 (75%), Gaps = 1/165 (0%)
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
M D E+ +IK +A+PRLRRATVQN TG LE A+YR+SKSAWL++ +HPVI+R+ +R+
Sbjct: 1 MSDKEMAMIKSLAKPRLRRATVQNPVTGVLEFAHYRVSKSAWLKDEDHPVIKRVCQRISD 60
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
+TGL+ TAEELQ+ NYG+GG YEPH+D++R + F GNR+AT L YMS+V QG
Sbjct: 61 VTGLSMETAEELQIANYGVGGQYEPHFDYSRKSDFGKFDD-EVGNRIATFLTYMSNVEQG 119
Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
G+TVF +++ P KG+A FW+NL SG GD TRHAACPVLTG
Sbjct: 120 GSTVFLHPGIAVRPIKGSAVFWYNLLPSGAGDERTRHAACPVLTG 164
>gi|195505207|ref|XP_002099404.1| GE23380 [Drosophila yakuba]
gi|194185505|gb|EDW99116.1| GE23380 [Drosophila yakuba]
Length = 540
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 118/296 (39%), Positives = 166/296 (56%), Gaps = 22/296 (7%)
Query: 4 PTHQRAQGNKLYYQEALNK----SPELKDEPPKVNNVAPTLEVTEREKYEM---LCRGDL 56
P H+ A NKL Y+ L K +P + + P V P +E Y++ +CRG+L
Sbjct: 247 PDHEEAHRNKLLYEGQLAKERSFTPRKQVDLPHVAGKEP------KESYKLYTQVCRGEL 300
Query: 57 TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
P L+C H+ VPY RL P K E+ L P + +V++DSEID+I + +
Sbjct: 301 HQTPREQRNLRCWLTHQGVPYYRLAPFKIEQLNLDPYVAYVHEVLWDSEIDMIMEHGKGN 360
Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
++R+ V ++G R S++ WL +P + +I +R+E +TGL+T +AE LQ+VN
Sbjct: 361 MKRSMVG--QSGNSTTTEIRTSQNTWLWYDANPWLAKIKQRLEDVTGLSTESAEPLQLVN 418
Query: 177 YGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
YGIGG YEPH+DF E + K G GNR+AT LFY++DVA GGAT F L L++ P
Sbjct: 419 YGIGGQYEPHFDFM---EDDGQKVFGWKGNRLATALFYLNDVALGGATAFPFLRLAVPPV 475
Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQRSGIICTLV 291
KG+ W+NLHSS D+ T+HA CPVL GS + C G Q C LV
Sbjct: 476 KGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWI---CNEWFHVGAQEFRRPCGLV 528
>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 509
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 114/264 (43%), Positives = 157/264 (59%), Gaps = 17/264 (6%)
Query: 16 YQEALNKSPE-LKDEPPKVNNVAPTLEVTEREKY-----------EMLCRGDLTVPPAIV 63
Y +AL + + LK +P + + E KY E+LCRGD P +
Sbjct: 221 YVDALKITNQILKQDPTHAGRLVEKKTIGELMKYLENKLRPEVPHELLCRGDYQRPASET 280
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
+ L CRY +LRL PLKEE L P I +Y DV D EI + ++A+ R+ RAT++
Sbjct: 281 SHLYCRYHTGTSSFLRLAPLKEEVLNLDPFITVYHDVASDREISKLIELAKSRISRATIR 340
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG-LTTSTAEELQVVNYGIGGH 182
+ GE +++N R S++AWL + V+ + RRV MTG L + E LQV NYG+GGH
Sbjct: 341 D--DGEPQVSNARTSQNAWLDAGDDRVVTTLDRRVGDMTGGLRQQSYEMLQVNNYGVGGH 398
Query: 183 YEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
Y H+D+A EA + L GNR+ATV+FY+SDV GGATVF L L+++P KG+A W
Sbjct: 399 YVAHHDWAM--EAVPYAGLRVGNRIATVMFYLSDVEIGGATVFPQLGLAVFPRKGSAILW 456
Query: 243 HNLHSSGDGDYYTRHAACPVLTGS 266
+NL+ +G GD T HAACPVL+GS
Sbjct: 457 YNLYRNGKGDRRTLHAACPVLSGS 480
>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
Length = 537
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 101/223 (45%), Positives = 143/223 (64%), Gaps = 5/223 (2%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E +YE +CRG+ V P +L+CRY + PY L PLK EE L P + Y D++
Sbjct: 285 EFAQYEKVCRGE--VHPIARQELRCRYSRGSHPYRYLAPLKLEEHSLDPYVATYHDMLSP 342
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I +++MA PR+RR+TV G+ + + +R+SK+AWL HP + + R ++ TG
Sbjct: 343 RKISQLREMAVPRMRRSTVNPLPGGQHKKSAFRVSKNAWLAYESHPTMVGMLRDLKEATG 402
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
L T+ E+LQV NYG+GGHYEPH+DF R + N + GNR+AT +FY+S+V QGGAT
Sbjct: 403 LDTTYCEQLQVANYGVGGHYEPHWDFFR--DPNHYPE-EEGNRIATAIFYLSEVEQGGAT 459
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F L++++ P+ G FW+NLH S D DY T+HA CPVL GS
Sbjct: 460 AFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGS 502
>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
Length = 521
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 108/251 (43%), Positives = 143/251 (56%), Gaps = 14/251 (5%)
Query: 28 DEPPKVNNVAPTLE----------VTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPY 77
DE +N P LE V E + Y + C G + P L+C YV P+
Sbjct: 228 DEKALLNESKPILEHAPIPEEGEPVDEFQAYSLTCSGHWRLTPKEQRHLRCGYVTETHPF 287
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
L + PLK EE + P ++LY DV+Y SEID+I+K+ + RL+RATV + E ++N R
Sbjct: 288 LWIAPLKAEELFQDPLLVLYHDVIYQSEIDVIRKLTENRLKRATVTGH--NESVVSNVRT 345
Query: 138 SKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEA 195
S+ ++ H V+ I +RV MT L AE+ Q NYGIGGHY H D + +A
Sbjct: 346 SQFTFIPVSAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTIDA 405
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYT 255
S GNR+ATVLFY+SDV+QGG T F L L P+K AAFWHNLH+SG GD T
Sbjct: 406 GLISSPEMGNRIATVLFYLSDVSQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRT 465
Query: 256 RHAACPVLTGS 266
+H ACP++ GS
Sbjct: 466 QHGACPIIAGS 476
>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
Length = 533
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 102/239 (42%), Positives = 137/239 (57%), Gaps = 3/239 (1%)
Query: 28 DEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEE 87
DE A E YE +CRG++ A +L+CRY Y L PLK EE
Sbjct: 263 DEGAHSRQAAGYRLTQEFAHYEKVCRGEVGPSAAQQRRLRCRYARGRHAYRLLAPLKLEE 322
Query: 88 AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
L P ++ Y D++ +I ++ MA P ++R+TV G+ + +R+SK+AWL
Sbjct: 323 HSLDPLVVSYHDMLSPQQIGELRAMAVPHMQRSTVNPLSGGQRMKSAFRVSKNAWLPYST 382
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRV 207
HP++ R+ R V TGL + E+LQV NYG+GGHYEPH+DF R GNR+
Sbjct: 383 HPMMGRMLRDVGDATGLDMTYCEQLQVANYGVGGHYEPHWDFFRDSR---HYPAAEGNRI 439
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
AT +FY+SDV QGGAT F LN ++ P+ G FW+NLH S D DY T+HA CPVL GS
Sbjct: 440 ATAIFYLSDVEQGGATAFPFLNFAVRPQLGNILFWYNLHRSSDEDYRTKHAGCPVLKGS 498
>gi|20269816|gb|AAM18063.1|AF495541_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG1
[Drosophila melanogaster]
Length = 540
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 112/290 (38%), Positives = 161/290 (55%), Gaps = 25/290 (8%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEM---LCRGDLTVPP 60
P H+ A NK+ Y+ L + P K + E ++E Y++ +CRG+L P
Sbjct: 247 PDHEEALKNKILYEGQLARERSFA--PRKQVELPHIAEKEQKESYKLYTEVCRGELHQSP 304
Query: 61 AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
L+C H+ VPY RL P K E+ + P + +V++DSEID I + + + R+
Sbjct: 305 REQRNLRCWLSHQGVPYYRLFPFKIEQLNIDPYVAYVHEVLWDSEIDTIMEHGKGNMERS 364
Query: 121 TV---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
V +N T E+ RIS++ WL +P + +I +R+E +TGL+T +AE LQ+VNY
Sbjct: 365 KVGQSENSTTSEV-----RISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNY 419
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGG YEPH+DF + F GNR+ T LFY++DVA GGAT F L L++ P KG
Sbjct: 420 GIGGQYEPHFDFVEDDGQSVFS--WKGNRLLTALFYLNDVALGGATAFPFLRLAVPPVKG 477
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
+ W+NLHSS D+ T+HA CPVL GS + + PCGL
Sbjct: 478 SLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRPCGL 527
>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
Length = 537
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 101/225 (44%), Positives = 141/225 (62%), Gaps = 7/225 (3%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E KYE +CRG+ V P +L+CRY + PY L PLK EE L P + + D++
Sbjct: 285 EFAKYEEVCRGE--VQPIARQELRCRYSRGSHPYRILAPLKLEEHSLDPYVASFHDMLSP 342
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I +++MA PR++R+TV G+ + + +R+SK+AWL HP + + R ++ TG
Sbjct: 343 RKISQLREMAVPRMQRSTVNPRPGGQHKKSAFRVSKNAWLAYEAHPTMAGMLRDLKDATG 402
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFAR-PGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
L T+ E+LQV NYG+GGHYEPH+DF R P A GNR+AT +FY+S+V QGGA
Sbjct: 403 LDTTFCEQLQVANYGVGGHYEPHWDFFRDPSHYPA----AEGNRIATAIFYLSEVEQGGA 458
Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
T F L+ ++ P+ G FW+NLH S D DY T+HA CPVL GS
Sbjct: 459 TAFPFLDFAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSK 503
>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
Length = 547
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 109/257 (42%), Positives = 147/257 (57%), Gaps = 7/257 (2%)
Query: 12 NKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYV 71
N L + LN+S + + P P V E + Y + C G + P L+C YV
Sbjct: 256 NALSEKALLNESKPILEHAPIPEEGEP---VGEFQAYSLTCSGHWRLTPKEQRHLRCGYV 312
Query: 72 HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE 131
P+L + PLK EE + P ++LY DV+Y SEID+I+K+ + RL RAT+ ++ E
Sbjct: 313 TETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSEIDVIRKLTENRLMRATITSH--NESV 370
Query: 132 IANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--F 189
++N R S+ ++ H V+ I +RV MT L AE+ Q NYGIGGHY H D +
Sbjct: 371 VSNVRTSQFTFIPVTAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430
Query: 190 ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSG 249
+A S GNR+ATVLFY+SDVAQGG T F L L P+K AAFWHNLH+SG
Sbjct: 431 QTTFDAGLVSSPEMGNRIATVLFYLSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASG 490
Query: 250 DGDYYTRHAACPVLTGS 266
GD T+H ACP++ GS
Sbjct: 491 VGDVRTQHGACPIIAGS 507
>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
Length = 550
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 107/249 (42%), Positives = 144/249 (57%), Gaps = 7/249 (2%)
Query: 20 LNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLR 79
LN+S + + P P V E + Y + C G + P + L+C YV P+L
Sbjct: 261 LNESKPILEHAPIPEEGEP---VGEFQAYSLTCSGHWRLTPKEQSHLRCGYVTETHPFLW 317
Query: 80 LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
+ PLK EE + P ++LY DV+Y SEID+I+K+ + RL RATV + E ++N R S+
Sbjct: 318 IAPLKAEELFQDPLLVLYHDVIYQSEIDVIRKLTENRLMRATVTGHN--ESLVSNVRTSQ 375
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEANA 197
++ H V+ I +RV MT L AE+ Q NYGIGGHY H D + +A
Sbjct: 376 FTFIPASAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGL 435
Query: 198 FKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRH 257
S GNR+ATVLFY+SDV+QGG T F L L P+K AAFWHNLH+SG GD T+H
Sbjct: 436 VSSPEMGNRIATVLFYLSDVSQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQH 495
Query: 258 AACPVLTGS 266
ACP++ GS
Sbjct: 496 GACPIIAGS 504
>gi|24651424|ref|NP_733376.1| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
gi|23172697|gb|AAF57059.2| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
gi|66772443|gb|AAY55533.1| IP03659p [Drosophila melanogaster]
gi|220951214|gb|ACL88150.1| PH4alphaSG1-PA [synthetic construct]
gi|220959938|gb|ACL92512.1| PH4alphaSG1-PA [synthetic construct]
Length = 540
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 112/290 (38%), Positives = 161/290 (55%), Gaps = 25/290 (8%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEM---LCRGDLTVPP 60
P H+ A NK+ Y+ L + P K + E ++E Y++ +CRG+L P
Sbjct: 247 PDHEEALKNKILYEGQLARERSFA--PRKQVELPQIAEKEQKESYKLYTQVCRGELHQSP 304
Query: 61 AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
L+C H+ VPY RL P K E+ + P + +V++DSEID I + + + R+
Sbjct: 305 REQRNLRCWLYHQGVPYYRLSPFKIEQLNVDPYVAYVHEVLWDSEIDTIMEHGKGNMERS 364
Query: 121 TV---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
V +N T E+ RIS++ WL +P + +I +R+E +TGL+T +AE LQ+VNY
Sbjct: 365 KVGQSENSTTSEV-----RISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNY 419
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGG YEPH+DF + F GNR+ T LFY++DVA GGAT F L L++ P KG
Sbjct: 420 GIGGQYEPHFDFVEDDGQSVFS--WKGNRLLTALFYLNDVALGGATAFPFLRLAVPPVKG 477
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
+ W+NLHSS D+ T+HA CPVL GS + + PCGL
Sbjct: 478 SLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRPCGL 527
>gi|66772331|gb|AAY55477.1| IP03959p [Drosophila melanogaster]
gi|66772361|gb|AAY55492.1| IP03859p [Drosophila melanogaster]
Length = 541
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 112/290 (38%), Positives = 161/290 (55%), Gaps = 25/290 (8%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEM---LCRGDLTVPP 60
P H+ A NK+ Y+ L + P K + E ++E Y++ +CRG+L P
Sbjct: 248 PDHEEALKNKILYEGQLARERSFA--PRKQVELPQIAEKEQKESYKLYTQVCRGELHQSP 305
Query: 61 AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
L+C H+ VPY RL P K E+ + P + +V++DSEID I + + + R+
Sbjct: 306 REQRNLRCWLYHQGVPYYRLSPFKIEQLNVDPYVAYVHEVLWDSEIDTIMEHGKGNMERS 365
Query: 121 TV---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
V +N T E+ RIS++ WL +P + +I +R+E +TGL+T +AE LQ+VNY
Sbjct: 366 KVGQSENSTTSEV-----RISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNY 420
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGG YEPH+DF + F GNR+ T LFY++DVA GGAT F L L++ P KG
Sbjct: 421 GIGGQYEPHFDFVEDDGQSVFS--WKGNRLLTALFYLNDVALGGATAFPFLRLAVPPVKG 478
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
+ W+NLHSS D+ T+HA CPVL GS + + PCGL
Sbjct: 479 SLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRPCGL 528
>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
Length = 525
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 100/219 (45%), Positives = 136/219 (62%), Gaps = 5/219 (2%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
YE CRG ++L C Y N P+LRL PLK E L P ++LY DV+ SEI
Sbjct: 289 YERGCRGQFPTK----SKLHCVYNSTNSPFLRLAPLKTELLALDPYMVLYHDVITPSEIR 344
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
++ +A P L+RATV N K G + R SK WL + +P+ R++RR+ MTG
Sbjct: 345 ELQYLAVPTLKRATVFNQKMGRNTVVKTRTSKVTWLTDSLNPLTVRLNRRISDMTGFDLY 404
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
+E LQV+NYG+GGHY+ H+D+ A L G+R+ATVLFY++DV QGGATVF +
Sbjct: 405 GSEMLQVMNYGLGGHYDLHFDYFNATIAKDLTKLN-GDRIATVLFYLTDVEQGGATVFPN 463
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +++P+KGTA W+NL + DGD T HAACPV+ GS
Sbjct: 464 IKQAIFPKKGTAVMWYNLRHNNDGDPQTLHAACPVIVGS 502
>gi|194905397|ref|XP_001981189.1| GG11929 [Drosophila erecta]
gi|190655827|gb|EDV53059.1| GG11929 [Drosophila erecta]
Length = 538
Score = 196 bits (499), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 111/289 (38%), Positives = 162/289 (56%), Gaps = 24/289 (8%)
Query: 4 PTHQRAQGNKLYYQEALNKS----PELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVP 59
P H+ A NK+ Y+ L K+ P + +PP+ P + Y LCRG+L
Sbjct: 246 PDHEEALKNKVLYEGQLAKARNVIPRKQVDPPQTAEEEPKESF---QLYTQLCRGELHQS 302
Query: 60 PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
P L+C H+ VPY RL P K E+ L P + L V++DSE+++I + + + R
Sbjct: 303 PREQRNLRCWLSHQGVPYYRLSPFKFEQLNLDPYVALVHHVLWDSEMEMIMQHGRGSMER 362
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
+ V + + IA+ R S++ WL +P + RI +R+E +TGL+T +AE LQ++NYGI
Sbjct: 363 SKVGQSENSK--IADRRTSQNTWLWYDVNPWLSRIKQRLEDVTGLSTESAEPLQLLNYGI 420
Query: 180 GGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
GG YEPH+DF E K G +R+ T +FY++DVA GGAT F L L++ PEKG+
Sbjct: 421 GGQYEPHFDFVEDAE----KIFGWQDDRLMTAIFYINDVALGGATAFPFLRLAVPPEKGS 476
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
W+NLHSS DY ++HA CP+L GS + + PCGL
Sbjct: 477 LLMWNNLHSSLHKDYRSKHAGCPILQGSKWICTEWFHVGAQELKRPCGL 525
>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 108/271 (39%), Positives = 160/271 (59%), Gaps = 14/271 (5%)
Query: 4 PTHQRAQGNKLYYQ------EALNKSPELKDEPPKVNNVAPTLEVTE-REKYEMLCRGDL 56
P H+ A NK+ Y+ +++ SP +K +PP+ P E+ E +E Y+ +CRG+L
Sbjct: 251 PGHEEAVKNKIVYEALLARERSISSSPRMKLDPPQEAAPEPEPELKESQELYQRVCRGEL 310
Query: 57 TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
P L+C H++VPY RL P K E+ P + + DV+ D E + I + + +
Sbjct: 311 RQSPKEQRYLRCWLSHQDVPYQRLSPFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQ 370
Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
+ R+ + +TG +++ R S++ WL +P + I +R+E +TGL+T TAE LQ+VN
Sbjct: 371 VTRSEIG--QTGNSTVSDIRTSQNTWLWYENNPWLADIKQRLEDITGLSTDTAEPLQLVN 428
Query: 177 YGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
YGIGG YEPH+DF E N G GNR+ T LFY++DV GGAT F L+L++ P
Sbjct: 429 YGIGGQYEPHFDFMDDAEKN----FGWKGNRLLTALFYLNDVPLGGATAFPFLHLAVPPV 484
Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
KG+ W+NLH S D+ T+HA CPVL GS
Sbjct: 485 KGSLLVWYNLHRSLHKDFRTKHAGCPVLKGS 515
>gi|443697961|gb|ELT98195.1| hypothetical protein CAPTEDRAFT_181380 [Capitella teleta]
Length = 530
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 105/276 (38%), Positives = 160/276 (57%), Gaps = 14/276 (5%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE------------VTEREKYE 49
I P H+RA N+ YY+ + ++ + + + K +N AP ++ YE
Sbjct: 231 IEPGHERAIANRRYYERIIAEADDAERQKLKGDNGAPVVDGKPHRFLTDYTGSKSYSDYE 290
Query: 50 MLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
LCRG+ T +L CRY R P + PLKEE P I +Y DV+ DS+ +I
Sbjct: 291 KLCRGEETHKRPFKHRLVCRY-QRYHPIFYISPLKEEMLNFDPAIYVYHDVLTDSQNAII 349
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K++++P+L R+ V + + ++N+R S++AW + HP+I R+S++ ++ LT T
Sbjct: 350 KEVSRPKLHRSGVFSKTDADTGLSNFRTSQTAWHDDSTHPLIARLSQKASAISNLTLETV 409
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E LQV+NYGIGG YEPH+DF + E N F S NRVAT + Y+S++ GG TV+ ++
Sbjct: 410 EHLQVLNYGIGGLYEPHWDFVQGEERNEF-SESDRNRVATFICYLSELEAGGYTVYPTVG 468
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++ P K + A W+NL +G GDY T HAACP+L G
Sbjct: 469 AAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILYG 504
>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
Length = 546
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 108/257 (42%), Positives = 146/257 (56%), Gaps = 7/257 (2%)
Query: 12 NKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYV 71
N L + LN+S + + P P V E + Y + C G + P L+C YV
Sbjct: 256 NALSEKALLNESKPILEHAPIPEEGEP---VGEFQAYSLTCSGHWRLTPKEQRHLRCGYV 312
Query: 72 HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE 131
P+L + PLK EE + P ++LY DV+Y SEID+I+K+ + RL RAT+ ++ E
Sbjct: 313 TETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSEIDVIRKLTENRLMRATITSH--NESV 370
Query: 132 IANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--F 189
++N R S+ ++ H V+ I +RV MT L AE+ Q NYGIGGHY H D +
Sbjct: 371 VSNVRTSQFTFIPVTAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430
Query: 190 ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSG 249
+A S GNR+A VLFY+SDVAQGG T F L L P+K AAFWHNLH+SG
Sbjct: 431 QTTFDAGLVSSPEMGNRIAAVLFYLSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASG 490
Query: 250 DGDYYTRHAACPVLTGS 266
GD T+H ACP++ GS
Sbjct: 491 VGDVRTQHGACPIIAGS 507
>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
Length = 534
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 97/223 (43%), Positives = 132/223 (59%), Gaps = 3/223 (1%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E YE +CRG++ A L+CRY Y L PLK EE L P ++ + D++
Sbjct: 280 EFAHYEKVCRGEVGASAAQQRPLRCRYTRGEHAYRLLAPLKLEEHSLDPLVVTFHDMLSQ 339
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
I +++MA P ++R+TV G+ + +R+SK+AWL HP + R+ R V TG
Sbjct: 340 HRIAELREMAVPHMQRSTVNPLPGGQRRKSAFRVSKNAWLPYSTHPTMGRMLRDVSDATG 399
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
L + E+LQV NYG+GGHYEPH+DF R GNR+AT +FY+SDV QGGAT
Sbjct: 400 LDMTFCEQLQVANYGVGGHYEPHWDFFRDSR---HYPAAEGNRIATAIFYLSDVEQGGAT 456
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F LN ++ P+ G FW+NLH S D D+ T+HA CPVL GS
Sbjct: 457 AFPFLNFAVRPQLGNILFWYNLHRSSDMDFRTKHAGCPVLKGS 499
>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
Length = 549
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 108/271 (39%), Positives = 159/271 (58%), Gaps = 14/271 (5%)
Query: 4 PTHQRAQGNKLYYQ------EALNKSPELKDEPPKVNNVAPTLEVTE-REKYEMLCRGDL 56
P H+ A NK+ Y+ +++ SP +K +PP+ P E+ E +E Y+ +CRG+L
Sbjct: 251 PGHEEAVKNKIVYEALLARERSISSSPRMKLDPPQEAAPEPEPELKESQELYQRVCRGEL 310
Query: 57 TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
P L+C H++VPY RL P K E+ P + + DV+ D E + I + + +
Sbjct: 311 RQSPKEQRYLRCWLSHQDVPYQRLSPFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQ 370
Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
+ R+ + +TG ++ R S++ WL +P + I +R+E +TGL+T TAE LQ+VN
Sbjct: 371 VTRSEIG--QTGNSTVSEIRTSQNTWLWYENNPWLADIKQRLEDITGLSTDTAEPLQLVN 428
Query: 177 YGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
YGIGG YEPH+DF E N G GNR+ T LFY++DV GGAT F L+L++ P
Sbjct: 429 YGIGGQYEPHFDFMDDAEKN----FGWKGNRLLTALFYLNDVPLGGATAFPFLHLAVPPV 484
Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
KG+ W+NLH S D+ T+HA CPVL GS
Sbjct: 485 KGSLLVWYNLHRSLHKDFRTKHAGCPVLKGS 515
>gi|443697959|gb|ELT98193.1| hypothetical protein CAPTEDRAFT_162820 [Capitella teleta]
Length = 347
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 109/298 (36%), Positives = 166/298 (55%), Gaps = 24/298 (8%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE------------VTEREKYE 49
I P H+RA N+ YY+ + ++ + + + K +N AP ++ YE
Sbjct: 48 IEPGHERAIANRRYYERIIAEADDAERQKLKGDNGAPVVDGKPHRFLTDYTGSKSYSDYE 107
Query: 50 MLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
LCRG+ T +L CRY R P + PLKEE P I +Y DV+ DS+ +I
Sbjct: 108 KLCRGEETHKRPFKHRLVCRY-QRYHPIFYISPLKEEMLNFDPAIYVYHDVLTDSQNAII 166
Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
K++++P+L R+ V + + ++N+R S++AW + HP+I R+S++ ++ LT T
Sbjct: 167 KEVSRPKLHRSGVFSKTDADTGLSNFRTSQTAWHDDSTHPLIARLSQKASAISNLTLETV 226
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E LQV+NYGIGG YEPH+DF + E N F S NRVAT + Y+S++ GG TV+ ++
Sbjct: 227 EHLQVLNYGIGGLYEPHWDFVQGEERNEF-SESDRNRVATFICYLSELEAGGYTVYPTVG 285
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
++ P K + A W+NL +G GDY T HAACP+L G + + PCGL
Sbjct: 286 AAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILYGYKWVANKWFHEGGQEFVRPCGL 343
>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
Length = 581
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 109/271 (40%), Positives = 158/271 (58%), Gaps = 9/271 (3%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTE-REKYEMLCRGDLTVPPA- 61
P H+ A+ Y++ LN S + K ++ + + Y+ LCRG++
Sbjct: 260 PEHKTAKKYLNIYEKRLNTSTKEKSTEDLDDDNDDEKDFKQIFNSYKELCRGNVNQKTGD 319
Query: 62 ---IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
+ QL C +RN P L PL E LQP I++Y +++ +SE+ L+K +A P L+
Sbjct: 320 DVKLNNQLNCYQDYRN-PRLLFSPLNVEVLSLQPYIVIYHNLLTNSEVVLLKTLASPLLK 378
Query: 119 RATVQNYKTGEL-EIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
RA V E E YRISK+AWL + +HP ++RI+ + + GLT+ TAE LQ+ NY
Sbjct: 379 RAVVVGKPDKEYGEETTYRISKTAWLDKEDHPAVKRITTLIGDIIGLTSETAEPLQIANY 438
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGT--GNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
GIGGHYEPH DF + A + GNR+ATVL Y+S+V GGATVF + + P
Sbjct: 439 GIGGHYEPHLDFIESEDKEALSEYTSRIGNRIATVLIYLSNVEAGGATVFPKAGVRVEPR 498
Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+G+AAFW+N+H +G+G+ + HAACPVL GS
Sbjct: 499 QGSAAFWYNMHRNGEGNKLSVHAACPVLIGS 529
>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
Length = 526
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 114/270 (42%), Positives = 161/270 (59%), Gaps = 8/270 (2%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELK-DEPPKVNNVAPT-----LEVTEREKYEMLCR-GDL 56
P Q N + + K P L PP N L E +Y LCR
Sbjct: 232 PNDTELQSNIRKLKHLIAKQPHLNVTSPPNTANRIEDDGDDELSREEMAEYTRLCRPNSQ 291
Query: 57 TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
T P+ QL C Y++++ P L+L P+ E + P+I L+ +V+ + EI+ + ++A+PR
Sbjct: 292 TRLPSSNKQLTCSYLNKH-PGLKLKPVAMEIVSVNPQITLFHNVLSEMEIEQMLELARPR 350
Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
LRRA V N +TGE+E +YRIS+ AWL + + ++ RI+RRV +TGL T+T E LQV N
Sbjct: 351 LRRARVNNLETGEIEDVDYRISQIAWLSDSDGDIVRRINRRVGFITGLNTNTGECLQVNN 410
Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
YG+GGHYEPH+D + E + SLG GNR+AT +FY+S+V GG+TVF + P K
Sbjct: 411 YGVGGHYEPHFDHSLDMENSPIASLGQGNRIATFMFYLSEVEAGGSTVFIKTGVKTNPFK 470
Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
G A FW+NL SG+GD+ + HA CPVL G+
Sbjct: 471 GGAVFWYNLKKSGEGDWDSLHAGCPVLIGN 500
>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
Length = 538
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 96/223 (43%), Positives = 136/223 (60%), Gaps = 3/223 (1%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E YE +CRG+++ A L+CRY Y L PLK EE L P ++ Y D++
Sbjct: 284 EFAHYEKVCRGEVSASAAQQRPLRCRYARGQHAYRVLAPLKLEEHSLDPLVVSYHDMLSP 343
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I +++MA P ++R+TV + + + +R+SK+AWL HP++ R+ R + TG
Sbjct: 344 QQIIELRQMAVPHMKRSTVNPLPGRQSKKSAFRVSKNAWLEYDTHPMMGRMLRDLSDATG 403
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
L + E+LQV NYG+GGHYEPH+DF + + GNR+AT +FY+SDV QGGAT
Sbjct: 404 LDMTYCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEE---GNRIATAIFYLSDVEQGGAT 460
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F LN ++ P+ G FW+NLH S D DY T+HA CPVL GS
Sbjct: 461 AFPFLNFAVRPQLGNILFWYNLHRSLDMDYRTKHAGCPVLKGS 503
>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
Length = 523
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 100/226 (44%), Positives = 140/226 (61%), Gaps = 8/226 (3%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
+ LCRG+ + ++A+LKC Y R+ Y LMP+K E+ +P I + DV+ D EI+
Sbjct: 275 FNALCRGERLLNDKLLAELKCWYDTRHQFYFLLMPIKIEQHSFEPAIYTFHDVLSDEEIE 334
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT- 166
IK++A+P L R+ VQ E++N R SK+AWL E HP++ R+SRR+ +TGL T
Sbjct: 335 TIKELAKPLLARSMVQGKLGVGHEVSNVRTSKTAWLPEGLHPLLNRLSRRIGLITGLKTD 394
Query: 167 ---STAEELQVVNYGIGGHYEPHYDFARPGEANA----FKSLGTGNRVATVLFYMSDVAQ 219
AE LQV NYGIGGHY PH+D+ +A+ + L G+R+AT +FY++DV +
Sbjct: 395 PIRDEAELLQVANYGIGGHYSPHHDYLMKDKADFEYMHHRELQAGDRIATFMFYLNDVER 454
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
GG+T F +++ P KG AAFW NL SG D T H ACPVL G
Sbjct: 455 GGSTAFPRAGVAVKPVKGGAAFWFNLKRSGKPDPLTLHGACPVLLG 500
>gi|442747045|gb|JAA65682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 538
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 108/269 (40%), Positives = 156/269 (57%), Gaps = 16/269 (5%)
Query: 9 AQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKC 68
+Q +++ Q+ + +S E K + V P E E + Y+ LCRG+L P + +QL+C
Sbjct: 246 SQTHEVAVQDRIEQSAESKAQ--LFQEVTP--EDQEDQSYKRLCRGELLRSPKMDSQLRC 301
Query: 69 RYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTG 128
RY + L P+K EE L+P II+ DV+ D +I + A+PRL R+T Y
Sbjct: 302 RYYKGQDGFFSLQPIKLEEINLKPYIIVMHDVVQDKDIKDLMAYAEPRLERSTT--YTGS 359
Query: 129 ELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST----AEELQVVNYGIGGHYE 184
E+ + R S +AWL E E P+ R++ + + G+ TS AE Q+ NYG GG +
Sbjct: 360 EMVPSPVRTSSTAWLNEDEAPIAVRMNSYLRALLGMGTSDTNEEAEAYQLANYGTGGQFL 419
Query: 185 PHYDFARPG------EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
PH+DF + A+ + GTG+R+AT++ YM+DV +GGATVF SL + L P+KG
Sbjct: 420 PHHDFLQDSLHSYNSSADYYLQYGTGDRLATLMIYMTDVEEGGATVFPSLGIRLTPKKGD 479
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
AAFW NL +SG+GD T HA CPVL GS
Sbjct: 480 AAFWWNLKASGEGDRLTTHAGCPVLYGSK 508
>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
Length = 534
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/223 (43%), Positives = 136/223 (60%), Gaps = 3/223 (1%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E YE +CRG++ P L+CRY + PY L PLK EE L P ++ Y D++
Sbjct: 280 EFAHYEKVCRGEVGPSPRQERPLRCRYSLGSHPYRHLAPLKLEEHSLDPFVVTYHDMLSP 339
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I ++ MA PR+ R+TV G+ + +++R+SK+AWL HP + + + TG
Sbjct: 340 RKIADLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWLAYDSHPTMGGMLSDLSDATG 399
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
L + E+LQV NYG+GGHYEPH+DF R + + GNR+AT +FY+SDV QGGAT
Sbjct: 400 LDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEE---GNRMATAIFYLSDVEQGGAT 456
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F LN ++ P+ G FW+N+H S D DY T+HA CPVL GS
Sbjct: 457 AFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKGS 499
>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/223 (43%), Positives = 136/223 (60%), Gaps = 3/223 (1%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E YE +CRG++ P L+CRY + PY L PLK EE L P ++ Y D++
Sbjct: 280 EFAHYEKVCRGEVGPSPRQERPLRCRYSLGSHPYRHLAPLKLEEHSLDPFVVTYHDMLSP 339
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I ++ MA PR+ R+TV G+ + +++R+SK+AWL HP + + + TG
Sbjct: 340 RKIADLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWLAYDSHPTMGGMLSDLSDATG 399
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
L + E+LQV NYG+GGHYEPH+DF R + + GNR+AT +FY+SDV QGGAT
Sbjct: 400 LDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEE---GNRMATAIFYLSDVEQGGAT 456
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F LN ++ P+ G FW+N+H S D DY T+HA CPVL GS
Sbjct: 457 AFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKGS 499
>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
Length = 525
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 94/219 (42%), Positives = 137/219 (62%), Gaps = 4/219 (1%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
Y+M CRG PP+ ++L C Y P+L L PLK E L P ++LY DV+ EI
Sbjct: 287 YQMGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIT 344
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
++ MA P L+RATV +G E+ R SK AW + +P+ R++ R+ MTG
Sbjct: 345 ELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLY 404
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
+E LQ++NYG+GGHY+ HYDF + N+ + +G+R+ATVLFY++DV QGGATVF +
Sbjct: 405 GSEMLQLMNYGLGGHYDQHYDFF--NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN 462
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +++P++G+ W+NL +G D T HAACPV+ GS
Sbjct: 463 IRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGS 501
>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
Length = 316
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 93/219 (42%), Positives = 137/219 (62%), Gaps = 4/219 (1%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
Y++ CRG PP+ ++L C Y P+L L PLK E L P ++LY DV+ EI
Sbjct: 78 YQIGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIK 135
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
++ MA P L+RATV +G E+ R SK AW + +P+ R++ R+ MTG
Sbjct: 136 ELQGMATPSLKRATVYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLY 195
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
+E LQ++NYG+GGHY+ HYDF + N+ + +G+R+ATVLFY++DV QGGATVF +
Sbjct: 196 GSEMLQLMNYGLGGHYDQHYDFF--NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN 253
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +++P++G+ W+NL +G D T HAACPV+ GS
Sbjct: 254 IRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGS 292
>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
Length = 528
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 102/238 (42%), Positives = 141/238 (59%), Gaps = 4/238 (1%)
Query: 31 PKVNNVAPTLEVTERE--KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEA 88
P +APTL+ T + +YE CRG L P+ +L C Y N +LRL PLK E
Sbjct: 268 PNSIGIAPTLKSTAQPLGEYERGCRG-LFPSPSKDGRLHCVYNSTNSAFLRLAPLKMELV 326
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
L P ++LY DV+ EI ++ MA P L+RATV E+ R SK AW + +
Sbjct: 327 GLDPYMVLYHDVISAPEISQLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAWFPDTFN 386
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
+ ER++RR+ MT +E LQ +NYG+GGHY+ HYDF A + G+R+A
Sbjct: 387 ELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTATNLTQM-NGDRIA 445
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
TVLFY++DV QGGATVF ++ +++P++G+A W+NL GD + T HAACPVL GS
Sbjct: 446 TVLFYLTDVEQGGATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGS 503
>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
Length = 545
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 107/268 (39%), Positives = 152/268 (56%), Gaps = 13/268 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
P +R N +Q ++ +++ PP N+ E++E + Y C G + PA +
Sbjct: 245 PGSERYINNYKDFQPPSDELNPVEEHPPLPENLT---ELSEFDLYRYTCNGHIKPTPAEL 301
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
QL+C Y+ P+L L PLK EE P ++LY DV+Y SEID + K+ + ++ RATV
Sbjct: 302 RQLRCGYMTETHPFLLLAPLKVEELSHDPLLVLYHDVIYQSEIDTLAKLTKNKIHRATVT 361
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
++N R S+ ++ + H V+ I +RV MT L AE+ Q+ NYGIGGHY
Sbjct: 362 GNNASV--VSNARTSQFTFIPKTRHKVLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHY 419
Query: 184 EPHYDFARPGEANAFKSLGT-----GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
H D+ P NAF++ GNR+ATVLFY++DV QGG T F L L P+K
Sbjct: 420 AQHMDWFSP---NAFETKQVANSEMGNRIATVLFYLTDVEQGGGTAFPVLKQLLKPKKYA 476
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGS 266
AAFW+NLH+SG GD T H ACP++ GS
Sbjct: 477 AAFWYNLHASGAGDVRTMHGACPIIVGS 504
>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 105/256 (41%), Positives = 144/256 (56%), Gaps = 13/256 (5%)
Query: 14 LYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHR 73
LY + + + + ++P K++ E E Y + C G + L+C Y+
Sbjct: 259 LYESKTIEEHAPIPEDPSKLD---------EFEAYRLTCSGHSRLTAREERHLRCGYMTE 309
Query: 74 NVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIA 133
P+L L PLK EE P ++LY DV+Y SEID+I+++ R+ RA V T + ++
Sbjct: 310 THPFLLLAPLKAEELSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMVT--LTNQSTVS 367
Query: 134 NYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARP 192
N R S+ ++ + EH V++ I RRV MT L AE+ Q NYGIGGHY H D F
Sbjct: 368 NVRTSQITFIAKTEHEVLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWFTET 427
Query: 193 GEANAF-KSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDG 251
N S GNR+ATVLFY+SDVAQGG T F L L P+K AAFWHNLH++G G
Sbjct: 428 TFDNGLVSSTEMGNRIATVLFYLSDVAQGGGTAFPYLKQHLRPKKYAAAFWHNLHAAGRG 487
Query: 252 DYYTRHAACPVLTGSN 267
D T+H ACP++ GS
Sbjct: 488 DARTQHGACPIIAGSK 503
>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
Length = 493
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 102/238 (42%), Positives = 141/238 (59%), Gaps = 4/238 (1%)
Query: 31 PKVNNVAPTLEVTERE--KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEA 88
P +APTL+ T + +YE CRG L P+ +L C Y N +LRL PLK E
Sbjct: 233 PNSIGIAPTLKSTAQPLGEYERGCRG-LFPSPSKDGRLHCVYNSTNSAFLRLAPLKMELV 291
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
L P ++LY DV+ EI ++ MA P L+RATV E+ R SK AW + +
Sbjct: 292 GLDPYMVLYHDVISALEISQLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAWFPDTFN 351
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
+ ER++RR+ MT +E LQ +NYG+GGHY+ HYDF A + G+R+A
Sbjct: 352 ELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTAANLTQM-NGDRIA 410
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
TVLFY++DV QGGATVF ++ +++P++G+A W+NL GD + T HAACPVL GS
Sbjct: 411 TVLFYLTDVEQGGATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGS 468
>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
Length = 525
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 93/219 (42%), Positives = 137/219 (62%), Gaps = 4/219 (1%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
Y++ CRG PP+ ++L C Y P+L L PLK E L+P ++LY DV+ EI
Sbjct: 287 YQVGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLEPYMVLYHDVLSPKEIT 344
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
++ MA P L+RATV +G E+ R SK AW + +P+ R++ R+ MTG
Sbjct: 345 ELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLY 404
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
+E LQ++NYG+GGHY+ HYDF N+ + +G+R+ATVLFY++DV QGGATVF +
Sbjct: 405 GSEMLQLMNYGLGGHYDQHYDFF--NNTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN 462
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +++P++G+ W+NL +G D T HAACPV+ GS
Sbjct: 463 IRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGS 501
>gi|195341548|ref|XP_002037368.1| GM12149 [Drosophila sechellia]
gi|194131484|gb|EDW53527.1| GM12149 [Drosophila sechellia]
Length = 537
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 111/288 (38%), Positives = 156/288 (54%), Gaps = 24/288 (8%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREK-YEMLCRGDLTVPPAI 62
P H+ A NK+ Y+ L + P+ P E+ E K Y +CRG+L P
Sbjct: 247 PDHEDALKNKILYEGQLARERSF---VPREQAELPQKELKESYKLYTQVCRGELHQSPRE 303
Query: 63 VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
L+C H+ V Y L P K E+ + P + +V++DSEID I + + + R+ V
Sbjct: 304 QRNLRCWLSHQGVLYYHLSPFKIEQLNIDPYVAYVHEVLWDSEIDTIIEHGKGNMERSKV 363
Query: 123 ---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
+N T E+ RIS++ WL +P + +I +R+E +TGL+T +AE LQ+VNYGI
Sbjct: 364 GQIENSTTTEV-----RISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGI 418
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GG YEPH+DF F GNR+ T LFY++DVA GGAT F L L++ P KG+
Sbjct: 419 GGQYEPHFDFVEDDGKTVFS--WKGNRLLTALFYLNDVALGGATAFPFLRLAVPPVKGSL 476
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
W+NLHSS D+ T+HA CPVL GS + + PCGL
Sbjct: 477 LIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVAAQEFRRPCGL 524
>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
melanogaster]
gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
Length = 525
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 93/219 (42%), Positives = 137/219 (62%), Gaps = 4/219 (1%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
Y++ CRG PP+ ++L C Y P+L L PLK E L P ++LY DV+ EI
Sbjct: 287 YQIGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIK 344
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
++ MA P L+RATV +G E+ R SK AW + +P+ R++ R+ MTG
Sbjct: 345 ELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLY 404
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
+E LQ++NYG+GGHY+ HYDF + N+ + +G+R+ATVLFY++DV QGGATVF +
Sbjct: 405 GSEMLQLMNYGLGGHYDQHYDFF--NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN 462
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +++P++G+ W+NL +G D T HAACPV+ GS
Sbjct: 463 IRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGS 501
>gi|444517246|gb|ELV11441.1| Prolyl 4-hydroxylase subunit alpha-2 [Tupaia chinensis]
Length = 466
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 106/243 (43%), Positives = 145/243 (59%), Gaps = 32/243 (13%)
Query: 4 PTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVA----PTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L K P + E P + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKMPSNQTEAELATQEGIYERPVDYLPERDVYESLCRGE 297
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N P L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYG+GG YEPH+DF+R MSDV GGATVF L ++W
Sbjct: 418 VANYGMGGQYEPHFDFSR----------------------MSDVEAGGATVFPDLGAAIW 455
Query: 234 PEK 236
P+K
Sbjct: 456 PKK 458
>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
Length = 513
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 97/220 (44%), Positives = 134/220 (60%), Gaps = 6/220 (2%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
KYE CRG PA ++L C Y N +LRL PLK E L P ++LY D + EI
Sbjct: 278 KYEKGCRGQ--YAPATSSRLHCVYNSTNSAFLRLAPLKMELLQLDPYMVLYHDAISPREI 335
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
+ ++ +A PRL+RA V + T + R SK WL + + R+++R+E M+G T
Sbjct: 336 EDLQFLAMPRLKRAKVVDQVTHRNMMVKERTSKVTWLGDATNAFTMRLNKRIEDMSGFTM 395
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+E LQV+NYG+GGHY HYDF K+ G+R+ATV+FY+SDV QGGATVF
Sbjct: 396 YGSEMLQVMNYGLGGHYASHYDFLNATS----KTRLNGDRIATVMFYLSDVEQGGATVFP 451
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +++P++GTA W+NL +GD D T HAACPV+ GS
Sbjct: 452 KIQKAVFPQRGTAIIWYNLKENGDFDTNTIHAACPVIVGS 491
>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
Length = 525
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 106/251 (42%), Positives = 147/251 (58%), Gaps = 7/251 (2%)
Query: 19 ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYL 78
+LN++ +++ PP + T +++ Y + C G P L+C Y+ P+L
Sbjct: 232 SLNETKAVEEHPP-IPKEGDT--ISDFHGYMLTCSGHFRPTPREQRDLRCGYMDETHPFL 288
Query: 79 RLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRIS 138
+ PLK EE P +ILY DV+Y SEID I+K+ +L+RAT+ + T E ++N R S
Sbjct: 289 WIAPLKAEELSRDPLLILYHDVIYQSEIDTIRKLTTNKLKRATITS--TNESVVSNVRTS 346
Query: 139 KSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPG-EAN 196
+ +L E V+ I RRV MT AE+ Q NYGIGGHY H D F +P +A
Sbjct: 347 QFTFLPVTEDKVLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQHMDWFYQPSFDAG 406
Query: 197 AFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTR 256
S GNR+ATVLFY+SDV QGG T F L + L P+K AAFW+NLH+SG GD T+
Sbjct: 407 LVSSPEMGNRIATVLFYLSDVTQGGGTAFPHLRVLLKPKKYAAAFWYNLHASGVGDPRTQ 466
Query: 257 HAACPVLTGSN 267
H ACP+++GS
Sbjct: 467 HGACPIISGSK 477
>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
Length = 525
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 93/219 (42%), Positives = 133/219 (60%), Gaps = 4/219 (1%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
Y+M CRG PP+ +L C Y +L L PLK E L P ++LY DV+ EI
Sbjct: 287 YQMGCRGQF--PPSADGKLYCLYNRTTSAFLMLAPLKMELVGLDPYMVLYHDVLSAKEIK 344
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
++ MA P L RATV +G E+ R SK AW + +P+ R++ R+ MTG
Sbjct: 345 ELQGMATPGLTRATVFQASSGRNEVVKTRTSKVAWFPDSYNPLTVRLNARIADMTGFNLY 404
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
+E LQ++NYG+GGHY+ HYDF +N + +G+R+ATVLFY++DV QGGATVF +
Sbjct: 405 GSEMLQLMNYGLGGHYDQHYDFFNTINSNL--TAMSGDRIATVLFYLTDVEQGGATVFPN 462
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ +++P++G+ W+NL +G D T HAACPV+ GS
Sbjct: 463 IRKAVFPQRGSVIMWYNLQDNGQTDNKTLHAACPVIVGS 501
>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
Length = 549
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 97/228 (42%), Positives = 138/228 (60%), Gaps = 4/228 (1%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
++++ E Y C G + P+ + QL+C Y+ P+L L PLK EE P ++L+ DV
Sbjct: 283 KLSDFELYRHTCNGHIRPTPSELRQLRCGYMTETHPFLLLAPLKVEELSHDPLLVLFHDV 342
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
+Y SEID + ++A+ ++ RATV + + ++N R S+ +L + H V+ I +RV
Sbjct: 343 IYQSEIDTLMRLAKNKIHRATVTGHNSSV--VSNARTSQFTFLPKTRHKVLRTIDQRVAD 400
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARP--GEANAFKSLGTGNRVATVLFYMSDVA 218
MT L AE+ Q+ NYGIGGHY H D+ P E + GNR+ TVLFY+SDV
Sbjct: 401 MTDLHLEYAEDHQLANYGIGGHYAQHMDWFYPITFETKQVSNPEMGNRIGTVLFYLSDVE 460
Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
QGGAT F +L L P+K AAFW+NLH+SG GD T H ACP++ GS
Sbjct: 461 QGGATAFPALKQLLRPKKHAAAFWYNLHASGVGDARTMHGACPIIVGS 508
>gi|326914688|ref|XP_003203656.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Meleagris
gallopavo]
Length = 539
Score = 187 bits (474), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 109/266 (40%), Positives = 153/266 (57%), Gaps = 9/266 (3%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRG-DLTVPPAI 62
P++QR N Y++ L + P + NV ++ R+ YE LC+G + P
Sbjct: 255 PSNQRVTRNVAKYEKLLATHGDRVGRPLQRPNVT---QLQNRDAYEELCQGLGAQMAPEQ 311
Query: 63 VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
+QL C Y PYL L P K+E LQP I+LY D + D+E + IK +A P L+R+ V
Sbjct: 312 PSQLGCSYETNGSPYLLLQPAKKETLRLQPYIVLYHDFVSDAEAETIKGLAGPWLQRSVV 371
Query: 123 QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYGIG 180
+ + + + YRISKSAWL++ PV+ + R+ +TGL AE LQVVNYG+G
Sbjct: 372 ASGE--KQQKVEYRISKSAWLKDTADPVVRALELRMAAITGLDLRPPYAEYLQVVNYGLG 429
Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
GHYEPH+D A ++ ++ + +GNR+ATV+ Y+S V GG+T F N S+ K A
Sbjct: 430 GHYEPHFDHATSRKSPLYR-MKSGNRIATVMIYLSAVEAGGSTAFIYANFSVPVVKNAAL 488
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGS 266
FW NL +GDGD T HA CPVL G
Sbjct: 489 FWWNLRRNGDGDGDTLHAGCPVLAGD 514
>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
Length = 528
Score = 186 bits (473), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 98/237 (41%), Positives = 139/237 (58%), Gaps = 7/237 (2%)
Query: 30 PPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAY 89
P KV A T E Y+M CRG P+ ++L C Y P+L L PLK E
Sbjct: 275 PVKVQAQAQT---AEPSAYQMGCRGQFA--PSADSKLHCLYNRTTSPFLMLAPLKMELVG 329
Query: 90 LQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHP 149
L P ++LY DV+ EI ++ MA P L+RATV +G E+ R SK AW + P
Sbjct: 330 LDPYMVLYHDVLSAKEIKELQGMATPGLKRATVFQAASGRNEVVRTRTSKVAWFPDGYSP 389
Query: 150 VIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVAT 209
+ R++ R+ MTG +E LQ++NYG+GGHY+ HYD+ +N + +G+R+AT
Sbjct: 390 LTVRLNARITDMTGFNLHGSEMLQLMNYGLGGHYDQHYDYFNTINSNL--TAMSGDRIAT 447
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
VLFY++DV QGGATVF ++ +++P++G+ W+NL G D T HAACPV+ GS
Sbjct: 448 VLFYLTDVEQGGATVFPNIRKAVFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGS 504
>gi|148701599|gb|EDL33546.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_d [Mus
musculus]
Length = 545
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 102/222 (45%), Positives = 144/222 (64%), Gaps = 12/222 (5%)
Query: 4 PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
P+H+RA GN Y++ L + S + N+ PT + ER+ YE LCRG+
Sbjct: 313 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLATQENLYERPTDYLPERDVYESLCRGE 372
Query: 56 -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
+ + P +L CRY H N VP L + P KEE+ + P I+ Y DVM D EI+ IK++A
Sbjct: 373 GVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 432
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT TAE LQ
Sbjct: 433 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 492
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
V NYG+GG YEPH+DF+R + K+ GNR+AT L Y+S
Sbjct: 493 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYVS 532
>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
Length = 478
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 113/272 (41%), Positives = 158/272 (58%), Gaps = 19/272 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y++ L +SP E + P V P L+ R+ YE LC+ +
Sbjct: 192 PDNKRVARNVLKYEKLLAESPNQAVAETVMQRPNV----PHLQT--RDTYEGLCQTLGSQ 245
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y + PYL L P+++E +L+P ++LY D + D+E I+ +A+P L
Sbjct: 246 PTHYRIPSLYCSYETSSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWL 305
Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
+R+ V +GE ++ YRISKSAWL++ PV+ + R+ +TGL AE LQV
Sbjct: 306 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQV 362
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+
Sbjct: 363 VNYGIGGHYEPHFDHATSPSSPLYR-MNSGNRVATFMIYLSSVEAGGATAFIYGNFSVPV 421
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
K A FW NLH SG+GD T HAACPVL G
Sbjct: 422 VKNAALFWWNLHRSGEGDGDTLHAACPVLVGD 453
>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
Length = 544
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 113/273 (41%), Positives = 158/273 (57%), Gaps = 19/273 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y++ L +SP E + P V P L+ R+ YE LC+ +
Sbjct: 258 PDNKRVARNVLKYEKLLAESPNQAVAETVMQRPNV----PHLQT--RDTYEGLCQTLGSQ 311
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y + PYL L P+++E +L+P ++LY D + D+E I+ +A+P L
Sbjct: 312 PTHYRIPSLYCSYETSSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWL 371
Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
+R+ V +GE ++ YRISKSAWL++ PV+ + R+ +TGL AE LQV
Sbjct: 372 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQV 428
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+
Sbjct: 429 VNYGIGGHYEPHFDHATSPSSPLYR-MNSGNRVATFMIYLSSVEAGGATAFIYGNFSVPV 487
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
K A FW NLH SG+GD T HAACPVL G
Sbjct: 488 VKNAALFWWNLHRSGEGDGDTLHAACPVLVGDK 520
>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
aries]
Length = 514
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 113/273 (41%), Positives = 158/273 (57%), Gaps = 19/273 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y++ L +SP E + P V P L+ R+ YE LC+ +
Sbjct: 228 PDNKRVARNVLKYEKLLAESPNQAVAETVMQRPNV----PHLQT--RDTYEGLCQTLGSQ 281
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y + PYL L P+++E +L+P ++LY D + D+E I+ +A+P L
Sbjct: 282 PTHYQIPSLYCSYETSSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQKIRGLAEPWL 341
Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
+R+ V +GE ++ YRISKSAWL++ PV+ + R+ +TGL AE LQV
Sbjct: 342 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQV 398
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+
Sbjct: 399 VNYGIGGHYEPHFDHATSPSSPLYR-MNSGNRVATFMIYLSSVEAGGATAFIYGNFSVPV 457
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
K A FW NLH SG+GD T HAACPVL G
Sbjct: 458 VKNAALFWWNLHRSGEGDGDTLHAACPVLVGDK 490
>gi|363729586|ref|XP_417248.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Gallus gallus]
Length = 542
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 110/266 (41%), Positives = 153/266 (57%), Gaps = 11/266 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRG-DLTVPPAI 62
P++QR N Y++ L + P + NV ++ R+ YE LC+G + P
Sbjct: 258 PSNQRVTRNVAKYEKLLATHGDRVGAPLQRPNVT---QLQNRDAYEELCQGLGAQMAPER 314
Query: 63 VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
+ L C Y PYL L P K+E LQP I+LY D + D+E + IK +A P L+R+ V
Sbjct: 315 PSHLGCSYETNGSPYLLLQPAKKETLRLQPYIVLYHDFVSDAEAETIKGLAGPWLQRSVV 374
Query: 123 QNYKTGE-LEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYGI 179
+GE + YRISKSAWL++ PV++ + R+ +TGL AE LQVVNYG+
Sbjct: 375 ---ASGEKQQKVEYRISKSAWLKDTADPVVQALELRMAAITGLDLRPPYAEYLQVVNYGL 431
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GGHYEPH+D A ++ ++ + +GNR+ATV+ Y+S V GG+T F N S+ K A
Sbjct: 432 GGHYEPHFDHATSRKSPLYR-MKSGNRIATVMIYLSAVEAGGSTAFIYANFSVPVVKNAA 490
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTG 265
FW NL +GDGD T HA CPVL G
Sbjct: 491 LFWWNLRRNGDGDGDTLHAGCPVLAG 516
>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
Length = 483
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 112/272 (41%), Positives = 157/272 (57%), Gaps = 19/272 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y++ L +SP E + P V P L+ R+ YE LC+ +
Sbjct: 197 PDNKRMARNVLKYEKLLAESPTQAVVEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 250
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y + PYL L P+++E +L+P ++LY D + D E I+ +A+P L
Sbjct: 251 PTHYQIPSLHCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDLEAQKIRGLAEPWL 310
Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
+R+ V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQV
Sbjct: 311 QRSVV---ASGEKQLPVEYRISKSAWLKDTADPMLVTLDHRIAALTGLDVQPPYAEYLQV 367
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+
Sbjct: 368 VNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 426
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
K A FW NLH SG+GD T HAACPVL G
Sbjct: 427 VKNAALFWWNLHRSGEGDSDTLHAACPVLVGD 458
>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3, partial [Saimiri boliviensis boliviensis]
Length = 534
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 114/273 (41%), Positives = 160/273 (58%), Gaps = 21/273 (7%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y+ L +SP E + P + P L+ R+ YE LC+ L
Sbjct: 248 PDNKRMARNVLKYERLLAESPNQVVAEAVIQRPNI----PHLQT--RDTYEGLCQ-TLGS 300
Query: 59 PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
P + + L C Y + PYL L P+++E +L+P I LY D + DSE I+++A+P
Sbjct: 301 QPTLYQIPSLYCSYEINSNPYLLLQPIQKEVLHLEPYIALYHDFVSDSEAQKIRELAEPW 360
Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
L+R+ V +GE ++ YRISKSAWL++ P++ ++ R+ +TGL AE LQ
Sbjct: 361 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQ 417
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
VVNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F NLS+
Sbjct: 418 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVP 476
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
K A FW NLH SG+GD T HA CPVL G+
Sbjct: 477 VVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGN 509
>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
Length = 511
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 104/252 (41%), Positives = 140/252 (55%), Gaps = 20/252 (7%)
Query: 15 YYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRN 74
Y + P LK P N+A Y + CRG VP + L C Y +
Sbjct: 255 YVHNMIRNEPNLK--PVAKENIA---------SYSLGCRGQF-VPQS---NLHCEYKMKT 299
Query: 75 VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIAN 134
P+LRL PLK E L P I+++ D + EID ++ +A+P L+R TV + G+
Sbjct: 300 SPFLRLAPLKMEIVLLNPFIVVFHDALSPQEIDYLQNLARPLLKRTTV--HVNGKYVSRR 357
Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE 194
R SK AWL + + RI RRV MT L+ +E ++NYG+GGHY HYDF +
Sbjct: 358 VRTSKGAWLERDLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAHYDFFNTTK 417
Query: 195 ANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYY 254
++ TG+R+ATVLFY+SDV QGGATVF +L L++ PE+G A FW+NL +G GD
Sbjct: 418 Q---QTSETGDRIATVLFYLSDVEQGGATVFPNLKLAVSPERGMALFWYNLLDNGTGDTR 474
Query: 255 TRHAACPVLTGS 266
T H CPVL GS
Sbjct: 475 TLHGGCPVLVGS 486
>gi|15808763|gb|AAL08488.1| prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
volvulus]
Length = 571
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 110/271 (40%), Positives = 157/271 (57%), Gaps = 9/271 (3%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSP----ELKDEPPKVNNVAPTLEVTEREK--YEMLCRGD 55
I P H RA+ N Y+ L + +L + +NN+ E E K YE LCR +
Sbjct: 243 INPDHPRAKDNVKEYEYLLKNNEVQRIDLWRKTFPINNMRNDNEFDEGIKLIYEALCRRE 302
Query: 56 LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
+ V + +QL C Y + PYLRL P K E P +L+ ++ D + +I+ +A P
Sbjct: 303 VPVNTKVQSQLYC-YYKTDRPYLRLAPFKVEIVRQNPLNVLFYGIISDEQARIIQMLAVP 361
Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
+L + + N TG E+ ++RI KSA LR E+ ++RI +R+E T L TAE+L V+
Sbjct: 362 KLNGSRIYNDLTGSFELPSFRILKSARLRSTEYETVKRIDKRLELATNLEIETAEDLAVL 421
Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS-LNLSLWP 234
NYGIGG +EPH+D A G+ F+ LGTGNR+AT L Y+++ GG TVFTS L +S+
Sbjct: 422 NYGIGGQFEPHFDCALKGD-QCFEKLGTGNRIATFLIYLTEPEIGGRTVFTSNLKISVPC 480
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
K A FW+NL +G+ D + HAACPV TG
Sbjct: 481 VKNAALFWYNLMRNGEVDTRSLHAACPVATG 511
>gi|15808767|gb|AAL08490.1|AF369789_1 prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
volvulus]
Length = 571
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 110/271 (40%), Positives = 157/271 (57%), Gaps = 9/271 (3%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSP----ELKDEPPKVNNVAPTLEVTEREK--YEMLCRGD 55
I P H RA+ N Y+ L + +L + +NN+ E E K YE LCR +
Sbjct: 243 INPDHPRAKDNVKEYEYLLKNNEVQRIDLWRKTFPINNMRNDNEFDEGIKLIYEALCRRE 302
Query: 56 LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
+ V + +QL C Y + PYLRL P K E P +L+ ++ D + +I+ +A P
Sbjct: 303 VPVNTKVQSQLYC-YYKTDRPYLRLAPFKVEIVRQNPLNVLFYGIISDEQARIIEMLAVP 361
Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
+L + + N TG E+ ++RI KSA LR E+ ++RI +R+E T L TAE+L V+
Sbjct: 362 KLNGSRIYNDLTGSFELPSFRILKSARLRSTEYETVKRIDKRLELATNLEIETAEDLAVL 421
Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS-LNLSLWP 234
NYGIGG +EPH+D A G+ F+ LGTGNR+AT L Y+++ GG TVFTS L +S+
Sbjct: 422 NYGIGGQFEPHFDCALKGD-QCFEKLGTGNRIATFLIYLTEPEIGGRTVFTSNLKISVPC 480
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
K A FW+NL +G+ D + HAACPV TG
Sbjct: 481 VKNAALFWYNLMRNGEVDTRSLHAACPVATG 511
>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
Length = 522
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 104/266 (39%), Positives = 154/266 (57%), Gaps = 12/266 (4%)
Query: 7 QRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCR----GDLTVPPAI 62
+R + N YYQ + K EL P ER++YE LC+ + T+
Sbjct: 233 ERIESNWRYYQGKV-KDSELDSFPEDYLERPSHYNPEERQRYEELCQLGYNNEHTIRDNN 291
Query: 63 VAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
L+C H + + +L P K EE QP ++ + D++ D+EI+ ++++ + +L RA
Sbjct: 292 DDSLRCFLFKGHEDDFFSQLGPWKVEEIAKQPYVVRFFDILNDNEINSLERLGEEKLARA 351
Query: 121 TVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
TV + T +L A+YR+SKSAWL++ + +E+ +RR+ +TGL AE+LQ+ NYGIG
Sbjct: 352 TVFDPATHKLVNADYRVSKSAWLKDEDSDTVEKYNRRISRLTGLDLEYAEQLQMSNYGIG 411
Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
G YEPHYD++R + R+AT L Y++ V QGG TVFT L L + KG+A
Sbjct: 412 GQYEPHYDYSRRE-----WDIYNNRRIATWLSYLTTVEQGGGTVFTELGLHIRSIKGSAV 466
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGS 266
FW+NL +G GD TRHAACPVL G+
Sbjct: 467 FWYNLLPNGSGDERTRHAACPVLRGN 492
>gi|260802724|ref|XP_002596242.1| hypothetical protein BRAFLDRAFT_117983 [Branchiostoma floridae]
gi|229281496|gb|EEN52254.1| hypothetical protein BRAFLDRAFT_117983 [Branchiostoma floridae]
Length = 527
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 100/207 (48%), Positives = 137/207 (66%), Gaps = 19/207 (9%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE------------VTEREKYE 49
I P H RA N ++++ + KS L PPK +VA ++E + ERE YE
Sbjct: 252 INPEHTRAINNMKFFEKEMEKSQNLV-APPKDEDVA-SIERGEYKRDLARDYLPEREIYE 309
Query: 50 MLCRGD----LTVPPAIVAQLKCRYV-HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
+LC+ + + P+ LKCRY + N P L L P K E+ + +P++ ++ +++ D
Sbjct: 310 LLCQAEQPDMFNITPSRAKHLKCRYFTNNNHPRLLLAPQKLEQVFDKPKMWIFHNILTDP 369
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
E+ +IK +AQPRLRRAT+QN TGELE A+YRISKSAWL+ EH VI R+++RVE +TGL
Sbjct: 370 EMKVIKDLAQPRLRRATIQNSITGELEHASYRISKSAWLQGWEHKVIRRVNQRVEDVTGL 429
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFAR 191
T TAEELQVVNYG+GGHYEPH+DFAR
Sbjct: 430 TMETAEELQVVNYGMGGHYEPHFDFAR 456
>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
garnettii]
Length = 544
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 111/273 (40%), Positives = 158/273 (57%), Gaps = 19/273 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y++ L +SP E + P V P L+ R+ YE LC+ +
Sbjct: 258 PDNKRMARNVLKYEKLLAESPNQAVAETVMQRPNV----PHLQT--RDTYEGLCQTLGSQ 311
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y + PYL L P+++E +L+P + LY D + DSE I+++A+P L
Sbjct: 312 PTHYQIPSLYCSYETNSSPYLLLQPIRKEVIHLEPFVALYHDFVSDSEAQKIRELAEPWL 371
Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
+R+ V +GE ++ +YRISKSAWL++ P++ + R+ +TGL AE LQV
Sbjct: 372 QRSVV---ASGEKQLQVDYRISKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQV 428
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+
Sbjct: 429 VNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 487
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
K A FW NLH +G+GD T HA CPVL G
Sbjct: 488 VKNAALFWWNLHRNGEGDSDTLHAGCPVLVGDK 520
>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
latipes]
Length = 517
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 104/228 (45%), Positives = 135/228 (59%), Gaps = 8/228 (3%)
Query: 42 VTEREKYEMLCRGDLTVPPAIVA-QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
++ R+ YE LCR + P +L C Y N P L L+P+K E LQP +++Y +
Sbjct: 268 LSTRDTYERLCRTQGSQPIHFENPRLYCDYFTNNNPALLLLPVKREVLSLQPYVVIYHNF 327
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELE-IANYRISKSAWLREPEHPVIERISRRVE 159
+ D E + IK AQP LRR+ V +GE + YRISKSAWL+ E ++ ++ +R+
Sbjct: 328 ITDREAEEIKGFAQPALRRSVV---ASGENQATVEYRISKSAWLKGSESCIVGKLDQRIS 384
Query: 160 HMTGLTTS--TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDV 217
+TGL AE LQVVNYGIGGHYEPH+D A + FK L TGNRVAT + Y+S V
Sbjct: 385 MLTGLNVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPVFK-LKTGNRVATFMIYLSSV 443
Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
GG+T F N S+ K A FW NLH +G GD T HA CPVL G
Sbjct: 444 EAGGSTAFIYANFSVPVLKKAAIFWWNLHRNGRGDAETLHAGCPVLIG 491
>gi|427783867|gb|JAA57385.1| Putative prolyl 4-hydroxylase subunit alpha-1 [Rhipicephalus
pulchellus]
Length = 548
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 99/232 (42%), Positives = 142/232 (61%), Gaps = 11/232 (4%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E + Y+ LCRG+ P + +QL+CRY + +LRL P+K EEA L+P II + D++ D
Sbjct: 288 ETQNYKRLCRGEQLRTPKMDSQLRCRYYYGRNGFLRLQPVKIEEANLKPYIITFHDIIGD 347
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I+ + A PRL R+T +Y E + R S +AWL + + PV R++R VE + G
Sbjct: 348 RDINDLLAYATPRLFRST--HYGEHGTETSLIRTSSTAWLGDQDAPVATRLNRFVESLLG 405
Query: 164 LTTS----TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-----GTGNRVATVLFYM 214
L + AE Q+ NYG+GG Y H+DF A+ + L G+R+AT++FY+
Sbjct: 406 LGSQYLKGEAEYYQLANYGVGGQYIAHHDFLADIYADPNRKLDDFERSAGDRIATLMFYL 465
Query: 215 SDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
SDV +GGATVF L + L P+KG AAFW NL+S G+G+ T+H CPVL GS
Sbjct: 466 SDVEEGGATVFPHLGVRLTPKKGNAAFWWNLNSDGEGEQLTKHGGCPVLYGS 517
>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
Length = 540
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 94/247 (38%), Positives = 141/247 (57%), Gaps = 10/247 (4%)
Query: 43 TEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMY 102
E Y+ +CR +L PA +L+CR N P + EE +L P +I D++
Sbjct: 282 NEYHMYQQVCREELKPEPATQRKLRCRLHRGNGLRSSYQPYRLEELHLDPYVIQVHDIIS 341
Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
E +++++A+P L+R+ V + E N+RIS+ + EHP+++R+S+ +E+++
Sbjct: 342 AEETIVLQQLARPELQRSMVYSLSNSEHISTNFRISQGTFFEYHEHPIMQRMSQHLENIS 401
Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
GL +AE+LQV NYGIGGHYEPH D + + NRVAT ++Y+S+V GG
Sbjct: 402 GLDMRSAEQLQVANYGIGGHYEPHMDSFSENHNYGINTYMSTNRVATGIYYLSNVEAGGG 461
Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC--------- 273
T F L L + PE+G+ FW+NLH SGD DY T+HA CPVL GS + +
Sbjct: 462 TAFPFLPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLMGSKWIANVWIRLSNQDHI 521
Query: 274 -PCGLRR 279
PC L+R
Sbjct: 522 RPCDLQR 528
>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
jacchus]
Length = 544
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 114/273 (41%), Positives = 159/273 (58%), Gaps = 21/273 (7%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y+ L +SP E + P + P L+ R+ YE LC+ L
Sbjct: 258 PDNKRMARNVLKYERLLAESPNQVVAEAVIQRPNI----PHLQT--RDTYEGLCQT-LGS 310
Query: 59 PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
P + + L C Y + PYL L P+++E +L+P I LY D + DSE I++ A+P
Sbjct: 311 QPTLYQIPSLYCSYETNSNPYLVLQPIQKEILHLEPYIALYHDFVSDSEAQKIREFAEPW 370
Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
L+R+ V +GE ++ YRISKSAWL++ P++ ++ R+ +TGL AE LQ
Sbjct: 371 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQ 427
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
VVNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F NLS+
Sbjct: 428 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVP 486
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
K A FW NLH SG+GD T HA CPVL G+
Sbjct: 487 VVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGN 519
>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
Length = 495
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 97/201 (48%), Positives = 123/201 (61%), Gaps = 19/201 (9%)
Query: 96 LYRDVMYDSEIDLIKKMAQ----PRLRRATVQNYKTGELEIANYRISKSAWLREPEH-PV 150
++ + + E D+ K++ + P L RATV N TG LE A+YRISK+ WL EH V
Sbjct: 295 VFENFNWHVERDIYKRLCRGEKLPTLNRATVHNPITGHLETAHYRISKNCWLSGREHGEV 354
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I+R+ RR+ MT L TAE QV NYG+ G Y+PH+DF+R ++ SLGTGNR+ATV
Sbjct: 355 IDRVERRIAAMTRLNLETAEGFQVQNYGLAGQYDPHFDFSRDLANSSLGSLGTGNRIATV 414
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG----- 265
L +MS V GGATVF + + P+KG A FWHNL SGDGD+ TRHA CPVL+G
Sbjct: 415 LVWMSQVESGGATVFPYVGARILPQKGDAVFWHNLLRSGDGDFRTRHAGCPVLSGIKWVA 474
Query: 266 -------SNSLHSTCPCGLRR 279
N H PC LRR
Sbjct: 475 NKWIHEYGNEFHR--PCSLRR 493
>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
caballus]
Length = 548
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 111/271 (40%), Positives = 157/271 (57%), Gaps = 19/271 (7%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y++ L +SP E + P V P L+ R+ YE LC+ +
Sbjct: 262 PDNKRMARNVLKYEKLLAESPNQVVAEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 315
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y + P+L L P+++E +L+P ++LY D + DSE I+ +A+P L
Sbjct: 316 PTHYQIPSLYCSYETNSSPFLLLQPVRKEVIHLEPYVVLYHDFVSDSEAQKIRGLAEPWL 375
Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
+R+ V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQV
Sbjct: 376 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQV 432
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+
Sbjct: 433 VNYGIGGHYEPHFDHATSPTSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 491
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
K A FW NLH SG+GD T HA CPVL G
Sbjct: 492 VKNAALFWWNLHRSGEGDSDTLHAGCPVLVG 522
>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
Length = 535
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 102/247 (41%), Positives = 142/247 (57%), Gaps = 15/247 (6%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E + YE +CRGDL PA + +L+CR+ + Y P K EE +P + V+
Sbjct: 281 EFKMYEQVCRGDLNPSPAKLRELRCRFRRSRLGY---APFKLEELSHEPLVFQVHQVVSS 337
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGEL-EIANYRISKSAWLREPEHPVIERISRRVEHMT 162
+ IKKMA+P+++R+TV + G + A +R S+ A + + +SR V ++
Sbjct: 338 KSAEFIKKMARPKIKRSTVYSIGGGGGSQAAAFRTSQGASFNYSRNAATKILSRHVGDLS 397
Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
L + AEELQV NYGIGGHYEPH+D + P + GNR+AT ++Y+SDV GG
Sbjct: 398 SLDMNFAEELQVANYGIGGHYEPHWD-SFPENHIYDEGDDRGNRIATGIYYLSDVEAGGG 456
Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL----------HST 272
T F L L + PEKG+ FW+NLH SGD DY T+HAACPVL GS + H+
Sbjct: 457 TAFPFLPLLVTPEKGSLLFWYNLHESGDQDYRTKHAACPVLQGSKWIANVWIRERNQHNV 516
Query: 273 CPCGLRR 279
PCGL+R
Sbjct: 517 RPCGLQR 523
>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
africana]
Length = 544
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 110/268 (41%), Positives = 153/268 (57%), Gaps = 11/268 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
P ++R N L Y+ L SP ++ E P L+ R+ YE LC+ + P
Sbjct: 258 PDNKRMARNVLKYERLLADSPKQMVAEAVIQRPNVPHLQT--RDTYEGLCQTLGSQPTHY 315
Query: 63 -VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ L C Y + PYL L P ++E +L+P ++LY D + D E IK +A+P L+R+
Sbjct: 316 QIPSLYCSYETNSNPYLLLQPFRKEVIHLEPYVVLYHDFVNDMEAQKIKGLAEPWLQRSV 375
Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYG 178
V +GE ++ +YRISKSAWL++ P++ + R+ +TGL AE LQVVNYG
Sbjct: 376 V---ASGEKQLQVDYRISKSAWLKDSVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 432
Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
IGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+ K
Sbjct: 433 IGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSAVEAGGATAFIYANFSMPVVKNA 491
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGS 266
A FW NLH SG+GD T HA CPVL G
Sbjct: 492 ALFWWNLHRSGEGDGDTLHAGCPVLVGD 519
>gi|417402564|gb|JAA48127.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
Length = 544
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 112/268 (41%), Positives = 153/268 (57%), Gaps = 13/268 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVN--NVAPTLEVTEREKYEMLCRGDLTVPPA 61
P ++R N L Y++ L +SP + NV P L+ R YE LC+ + P
Sbjct: 258 PDNKRMARNVLKYEKLLAESPSQAAAEAVIQRPNV-PHLQT--RATYEELCQTLGSQPTH 314
Query: 62 IV-AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
L C Y PYL L P+++E +L+P ++LY D + D E I+ A+P L+R+
Sbjct: 315 YQNPSLHCSYETGASPYLLLQPIRKEVVHLEPYVVLYHDFVNDLEAQKIRGFAEPWLQRS 374
Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
V +GE ++ YRISKSAWL++ P++ + RR+ +TGL T AE LQVVNY
Sbjct: 375 VV---ASGEKQLPVEYRISKSAWLKDTVDPMLVTLDRRIAALTGLDTQPPYAEHLQVVNY 431
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+ K
Sbjct: 432 GIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKN 490
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
A FW NLH SG+GD T HA CPVL G
Sbjct: 491 AALFWWNLHRSGEGDGDTLHAGCPVLVG 518
>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
niloticus]
Length = 517
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 105/231 (45%), Positives = 133/231 (57%), Gaps = 18/231 (7%)
Query: 45 REKYEMLCRGD------LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYR 98
R+ YE LCR T P QL C Y N P L LMP + E LQP ++LY
Sbjct: 271 RDTYERLCRTQGSQRRHFTNP-----QLFCDYFTNNNPALMLMPARRELVSLQPYVVLYH 325
Query: 99 DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE-IANYRISKSAWLREPEHPVIERISRR 157
D + D+E + IK +A P LRR+ V GE + A+YRISKSAWL+ ++ ++ +R
Sbjct: 326 DFVTDTEAEDIKSLAHPGLRRSVV---AAGEKQATADYRISKSAWLKGSAQSIVGKLDQR 382
Query: 158 VEHMTGLTTST--AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
+ +TGL E LQVVNYGIGGHYEPH+D A + FK L TGNRVAT + Y+S
Sbjct: 383 ISLLTGLNVKHPYGEYLQVVNYGIGGHYEPHFDHATSPSSPVFK-LKTGNRVATFMIYLS 441
Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
V GG+T F N S+ + A FW NLH +G+GD T HA CPVL G
Sbjct: 442 PVEAGGSTAFIYANFSVPVVEKAAIFWWNLHRNGEGDDDTLHAGCPVLIGD 492
>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
Length = 544
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 111/268 (41%), Positives = 157/268 (58%), Gaps = 13/268 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
P ++R N L Y+ L +SP ++ E P L+ R+ YE LC+ L P +
Sbjct: 258 PDNKRMARNVLKYERLLAESPNQVVSEAVIQRPNTPHLQT--RDTYEGLCQ-TLGSQPTL 314
Query: 63 --VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
+ L C Y + YL L P+++E +L+P I LY D + DSE I+++A+P L+R+
Sbjct: 315 YQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRS 374
Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
V +GE ++ YRISKSAWL++ P++ ++ R+ +TGL AE LQVVNY
Sbjct: 375 VV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNY 431
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F NLS+ +
Sbjct: 432 GIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRN 490
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
A FW NLH SG+GD T HA CPVL G
Sbjct: 491 AALFWWNLHRSGEGDSDTLHAGCPVLVG 518
>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
leucogenys]
Length = 544
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 112/269 (41%), Positives = 157/269 (58%), Gaps = 13/269 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
P ++R N L Y+ L +SP +L E P L+ R+ YE LC+ L P +
Sbjct: 258 PDNKRMARNVLKYERLLAESPNQLVAEAVIQRPNIPHLQT--RDIYEGLCQ-TLGCQPTL 314
Query: 63 --VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
+ L C Y + YL L P+++E +L+P I LY D + DSE I+++A+P L+R+
Sbjct: 315 YQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRS 374
Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
V +GE ++ YRISKSAWL++ P++ ++ R+ +TGL AE LQVVNY
Sbjct: 375 VV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNY 431
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F NLS+ +
Sbjct: 432 GIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRN 490
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
A FW NLH SG+GD T HA CPVL G
Sbjct: 491 AALFWWNLHRSGEGDSDTLHAGCPVLVGD 519
>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
Length = 529
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 99/233 (42%), Positives = 139/233 (59%), Gaps = 6/233 (2%)
Query: 35 NVAPTLEVT-EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPR 93
NV P T + + YE CRG P + +L C Y +LRL PLK E L P
Sbjct: 276 NVVPKKFFTPQAQAYERGCRGQY--PQNL--KLYCVYNSTTSAFLRLAPLKMELISLDPY 331
Query: 94 IILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIER 153
+++Y DV+ SEI ++ +A P L+RATV N ++ + R SK WL + + + R
Sbjct: 332 MVIYHDVISPSEISELQSLAVPGLKRATVFNQQSMRNHVVKTRTSKVTWLLDTLNQLTIR 391
Query: 154 ISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY 213
++RR+ MTG +E LQV+NYG+GGHY+ HYD+ A L G+R+ATVLFY
Sbjct: 392 LNRRITDMTGFDMYGSEMLQVMNYGLGGHYDKHYDYFNSSVAADLTRLN-GDRIATVLFY 450
Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
++DV QGGATVF ++ +++P+ GTA W+NL G+GD T HAACPV+ GS
Sbjct: 451 LTDVEQGGATVFPNIEKAVFPKSGTAVVWYNLRHDGNGDPQTLHAACPVIVGS 503
>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
Length = 542
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 108/289 (37%), Positives = 161/289 (55%), Gaps = 28/289 (9%)
Query: 6 HQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEM---LCRGDLTVPPAI 62
H+ A NK+ Y+ L K E P K + ++ + +E Y++ +CRG+L P
Sbjct: 252 HEEALRNKVAYEAILAK--ERNHRPRKPSALSEPNKKEAKESYQLYKRVCRGELRQSPRQ 309
Query: 63 VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
+L+C + H+NV + RL P K E+ L P + + + + SE++ I + + R+ V
Sbjct: 310 QRKLRCLFSHQNVAFYRLAPFKVEQLNLDPYVAYFHEAINSSEMEQIIEKGLGSMERSRV 369
Query: 123 ---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
QN T E+ R S + WL E+P + +I +R+E +TGL+T +AE LQ+VNYGI
Sbjct: 370 GQSQNATTSEI-----RTSANTWLWYNENPWLSKIKQRLEDITGLSTESAEPLQLVNYGI 424
Query: 180 GGHYEPHYDFAR-PGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
GG YEPH+DF P + +K GNR+ T LFY++DVA GGAT F L L++ P KG+
Sbjct: 425 GGQYEPHFDFVEEPQKVFGWK----GNRMLTALFYINDVALGGATAFPFLQLAVPPVKGS 480
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
W+NLH S D+ T+HA CPV+ GS + + PCGL
Sbjct: 481 LLVWYNLHRSLHKDFRTKHAGCPVIKGSKWICNEWFHEGTQVFKRPCGL 529
>gi|313241587|emb|CBY33829.1| unnamed protein product [Oikopleura dioica]
Length = 541
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 100/234 (42%), Positives = 136/234 (58%), Gaps = 6/234 (2%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRD 99
E E E YE LCR +P LKC Y N P+L L P+K EE + +P II + +
Sbjct: 278 EREETEYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYE 337
Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEHPVIERIS 155
++ D E+D+I K A+P+ ATVQ+ TG+L A+YRIS+SAWL + + +
Sbjct: 338 IITDEELDIINKQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFR 397
Query: 156 RRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
+R+ +TGLT AE++Q NYGIGG YEPHYD + +A F GNR+AT L Y++
Sbjct: 398 KRISIITGLTMERAEDIQYSNYGIGGQYEPHYDMSTENDAGKFDE-EDGNRIATWLTYLN 456
Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
+ GG TVF + P +A FW+NL G DY TRHAACPVL G ++
Sbjct: 457 EPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQKTV 510
>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
Length = 477
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 156/272 (57%), Gaps = 19/272 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y++ L +SP E + P V P L+ R+ YE LC+ +
Sbjct: 192 PDNKRMARNVLKYEKLLAESPNQVVAEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 245
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y + PYL L P+++E +L+P ++LY D + D E I+ +A+P L
Sbjct: 246 PIHYQIPSLYCSYETNSSPYLLLQPIRKEVIHLEPYVVLYHDFVSDMEAQKIRGLAEPWL 305
Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
+R+ V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQV
Sbjct: 306 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPLLVNLDHRIGALTGLDVQPPYAEYLQV 362
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+
Sbjct: 363 VNYGIGGHYEPHFDHATSPTSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 421
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
K A FW NLH SG+GD T HA CPVL G
Sbjct: 422 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGD 453
>gi|313213106|emb|CBY36968.1| unnamed protein product [Oikopleura dioica]
Length = 541
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 100/234 (42%), Positives = 136/234 (58%), Gaps = 6/234 (2%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRD 99
E E E YE LCR +P LKC Y N P+L L P+K EE + +P II + +
Sbjct: 278 EREETEYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYE 337
Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEHPVIERIS 155
++ D E+D+I K A+P+ ATVQ+ TG+L A+YRIS+SAWL + + +
Sbjct: 338 IITDEELDIINKQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFR 397
Query: 156 RRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
+R+ +TGLT AE++Q NYGIGG YEPHYD + +A F GNR+AT L Y++
Sbjct: 398 KRISIITGLTMERAEDIQYSNYGIGGQYEPHYDMSTENDAGKFDE-EDGNRIATWLTYLN 456
Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
+ GG TVF + P +A FW+NL G DY TRHAACPVL G ++
Sbjct: 457 EPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQKTV 510
>gi|195452742|ref|XP_002073480.1| GK13123 [Drosophila willistoni]
gi|194169565|gb|EDW84466.1| GK13123 [Drosophila willistoni]
Length = 540
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 105/294 (35%), Positives = 158/294 (53%), Gaps = 24/294 (8%)
Query: 4 PTHQRAQGNKLYY--QEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
P H+ A +K Y Q A + P L KV+ + +L++ Y+ +CRG+L P
Sbjct: 247 PNHETALKDKPIYETQLAWQRDPRLNVAASKVDESSKSLDL-----YQRVCRGELRQSPR 301
Query: 62 IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+L+C Y R V + RL P K E+ L P + + +V+ D E D + + +++R+
Sbjct: 302 QQRKLRCFYSDRGVAFYRLGPFKVEQLNLDPYVAYFHNVISDDETDDLIEHGMGQVKRSR 361
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
V G ++ R S++ WL + P ++ + R+E +TGL +AE LQ+VNYGIGG
Sbjct: 362 VGT--VGNSTVSEVRTSQNTWLWYEQQPWLKNLKLRLEDITGLGMESAEPLQLVNYGIGG 419
Query: 182 HYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
HYEPHYDF + + G GNR+ T L Y+++V GGAT F L L++ P KG+
Sbjct: 420 HYEPHYDFVE----DKVTTFGWKGNRLLTALLYLNEVPMGGATAFPYLKLAVPPVKGSLL 475
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQRS 284
W+NLH S D D+ T+HA CPVL GS + + PCGL ++S
Sbjct: 476 VWYNLHRSLDPDFRTKHAGCPVLMGSKWVCNEWFHEGAQEFRRPCGLMNDSKKS 529
>gi|313229343|emb|CBY23930.1| unnamed protein product [Oikopleura dioica]
Length = 542
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 100/234 (42%), Positives = 136/234 (58%), Gaps = 6/234 (2%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRD 99
E E E YE LCR +P LKC Y N P+L L P+K EE + +P II + +
Sbjct: 279 EREETEYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYE 338
Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEHPVIERIS 155
++ D E+D+I K A+P+ ATVQ+ TG+L A+YRIS+SAWL + + +
Sbjct: 339 IITDEELDIINKQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFR 398
Query: 156 RRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
+R+ +TGLT AE++Q NYGIGG YEPHYD + +A F GNR+AT L Y++
Sbjct: 399 KRISIITGLTMERAEDIQYSNYGIGGQYEPHYDMSTENDAGKFDE-EDGNRIATWLTYLN 457
Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
+ GG TVF + P +A FW+NL G DY TRHAACPVL G ++
Sbjct: 458 EPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQKTV 511
>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
melanoleuca]
Length = 539
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 111/273 (40%), Positives = 156/273 (57%), Gaps = 19/273 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y++ L +SP E + P V P L+ R+ YE LC+ +
Sbjct: 253 PDNKRMARNVLKYEKLLAESPNQVVAEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 306
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y + PYL L P+++E +L+P ++LY D + D E I+ +A+P L
Sbjct: 307 PTHYQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDGEAQKIRGLAEPWL 366
Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
+R+ V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQV
Sbjct: 367 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQV 423
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+
Sbjct: 424 VNYGIGGHYEPHFDHATSPTSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 482
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
K A FW NLH SG+GD T HA CPVL G
Sbjct: 483 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDK 515
>gi|281353153|gb|EFB28737.1| hypothetical protein PANDA_003344 [Ailuropoda melanoleuca]
Length = 456
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 111/273 (40%), Positives = 155/273 (56%), Gaps = 19/273 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y++ L +SP E + P V P L+ R+ YE LC+ +
Sbjct: 193 PDNKRMARNVLKYEKLLAESPNQVVAEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 246
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y + PYL L P+++E +L+P ++LY D + D E I+ +A+P L
Sbjct: 247 PTHYQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDGEAQKIRGLAEPWL 306
Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQV 174
+R+ V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQV
Sbjct: 307 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQV 363
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A ++ + +GNRVAT + Y+S V GGAT F N S+
Sbjct: 364 VNYGIGGHYEPHFDHATVTMGPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 422
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
K A FW NLH SG+GD T HA CPVL G
Sbjct: 423 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDK 455
>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
Length = 572
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 112/273 (41%), Positives = 155/273 (56%), Gaps = 19/273 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y+ L ++P E + P V P L+ R+ YE LC+ +
Sbjct: 286 PENKRMVRNVLKYERLLAENPHQAVAETVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 339
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y + PYL L P+++E +L+P + LY D + D E I+K+A+P L
Sbjct: 340 PIHYQIPGLYCSYETNSSPYLLLQPVRKEVIHLEPYVALYHDFVSDPEAQKIRKLAEPWL 399
Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
+R+ V +GE ++ YRISKSAWL++ PV+ + R+ +TGL AE LQV
Sbjct: 400 QRSVV---ASGEKQLQVEYRISKSAWLKDTADPVLVTLDHRIAALTGLDVQHPYAEYLQV 456
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+
Sbjct: 457 VNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 515
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
K A FW NLH SG+GD T HA CPVL G
Sbjct: 516 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDK 548
>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
Length = 541
Score = 181 bits (458), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 103/257 (40%), Positives = 140/257 (54%), Gaps = 8/257 (3%)
Query: 16 YQEALNKSPELKDEPPKVNNVAPTLE----VTEREKYEMLCRGDLTVPPAIVAQLKCRYV 71
Y ++K + K P + AP +++ + Y C G + L+C Y+
Sbjct: 250 YNNFISKHLDEKQSPATLEEHAPIPSDPSVMSDFDIYRFTCSGHIKKTAREERHLRCGYL 309
Query: 72 HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE 131
P+L L PLK EE P ++LY DV+Y SEID+I+ + + + RATV K E
Sbjct: 310 TETHPFLNLAPLKVEELNHNPLLVLYHDVIYQSEIDVIRNLTENEISRATVIGAKGSE-- 367
Query: 132 IANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FA 190
++ R S+ ++ + H V++ I +RV M+ L AE Q NYGIGGHY H D F
Sbjct: 368 VSKVRTSQFTFIPKTRHKVLQTIDQRVADMSNLNMDYAELHQFANYGIGGHYAQHNDWFG 427
Query: 191 RPGEANAF-KSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSG 249
+ N S GNR+ATVLFY+SDVAQGG T F L L P+K AAFWHNLH+SG
Sbjct: 428 QDAFDNELVSSPEMGNRIATVLFYLSDVAQGGGTAFPHLKQLLQPKKYAAAFWHNLHASG 487
Query: 250 DGDYYTRHAACPVLTGS 266
GD T H ACP++ GS
Sbjct: 488 VGDLRTLHGACPIIAGS 504
>gi|73988166|ref|XP_851718.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Canis lupus
familiaris]
Length = 544
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 111/273 (40%), Positives = 156/273 (57%), Gaps = 19/273 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y++ L +SP E + P V P L+ R+ YE LC+ +
Sbjct: 258 PDNKRMARNVLKYEKLLAESPNQVVAEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 311
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y + PYL L P+++E +L+P ++LY D + D E I+ +A+P L
Sbjct: 312 PTHYQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVNDVEAQKIRGLAEPWL 371
Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
+R+ V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQV
Sbjct: 372 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQV 428
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+
Sbjct: 429 VNYGIGGHYEPHFDHATSPTSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 487
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
K A FW NLH SG+GD T HA CPVL G
Sbjct: 488 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDK 520
>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
Length = 533
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 90/224 (40%), Positives = 136/224 (60%), Gaps = 11/224 (4%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
++ C G L P +L C Y P+LRL PLK E+ L+P ++LY +V+ EI
Sbjct: 286 FKTSCNGLLEKP----TRLHCFYNFTTTPFLRLAPLKTEQIGLKPYVVLYHEVLSAREIS 341
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
++ A ++ VQ+ K + R +K WL++ + + RI+RR+ MTG +
Sbjct: 342 MLMGKAAQNMKNTRVQSEKA--VNTNRERTAKGYWLKKESNEMTRRITRRIVDMTGFDLA 399
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEAN-----AFKSLGTGNRVATVLFYMSDVAQGGA 222
+E+ QV+NYGIGGHY H+D+ +N + S+ G+R+ATVLFY++DV QGGA
Sbjct: 400 DSEDFQVINYGIGGHYSLHFDYFGFASSNYTGERSHHSIVLGDRIATVLFYLTDVEQGGA 459
Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
TVF ++ S++P+ GTA FW+NL + G+GD TRHA+CPV+ GS
Sbjct: 460 TVFGNVGYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPVVVGS 503
>gi|410972729|ref|XP_003992809.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Felis catus]
Length = 533
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 110/270 (40%), Positives = 157/270 (58%), Gaps = 13/270 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPE--LKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
P ++R N L Y++ L +SP + + + NV P L+ R+ YE LC+ + P
Sbjct: 247 PDNKRMSRNVLKYEKLLAESPTRVVAEAVIRRPNV-PHLQT--RDTYEGLCQTLGSQPTH 303
Query: 62 I-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
+ L C Y + PYL L P+++E +L+P ++LY D + D E I+ +A+P L+R+
Sbjct: 304 YQIPSLYCSYETNSSPYLLLQPIRKEVIHLEPYVVLYHDFVNDLEAQKIRGLAEPWLQRS 363
Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQVVNY
Sbjct: 364 VV---ASGEKQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNY 420
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+ K
Sbjct: 421 GIGGHYEPHFDHATSPTSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKN 479
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
A FW NLH SG+GD T HA CPVL G
Sbjct: 480 AALFWWNLHRSGEGDGDTLHAGCPVLVGDK 509
>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3 [Papio anubis]
Length = 535
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 110/267 (41%), Positives = 151/267 (56%), Gaps = 20/267 (7%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI- 62
P ++R N L Y+ L +SP N V +R E LC+ L P +
Sbjct: 258 PDNKRMARNVLKYERXLAESP----------NQVVAEAVIQRPNXEGLCQ-TLGSQPTLY 306
Query: 63 -VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ L C Y + YL L P+++E +L+P I LY D + DSE I++ A+P L+R+
Sbjct: 307 QIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSV 366
Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYG 178
V +GE ++ YRISKSAWL++ P++ ++ R+ +TGL AE LQVVNYG
Sbjct: 367 V---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 423
Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
IGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F NLS+ K
Sbjct: 424 IGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVKNA 482
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A FW NLH SG+GD T HA CPVL G
Sbjct: 483 ALFWWNLHRSGEGDSDTLHAGCPVLVG 509
>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
harrisii]
Length = 521
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 151/272 (55%), Gaps = 19/272 (6%)
Query: 4 PTHQRAQGNKLYYQEALNK-----SPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N Y+ L + PE+ + P V P L+ R+ YE LC+ +
Sbjct: 235 PDNKRIARNIRKYERLLEEKSNVTGPEVAIKRPNV----PHLQT--RDTYEGLCQTLGSQ 288
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y PYL L P+++E +L+P I+LY D + DSE I+ A P L
Sbjct: 289 PTHYQIPSLYCAYETNGSPYLLLQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWL 348
Query: 118 RRATVQNYKTGE-LEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQV 174
+R+ V +GE + YRISKSAWL++ P++ + RR+ +TGL AE LQV
Sbjct: 349 QRSVV---ASGEKQQQVEYRISKSAWLKDTVDPILVSLDRRIAALTGLNVQPPYAEHLQV 405
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GG+T F N S+
Sbjct: 406 VNYGIGGHYEPHFDHATSPSSPLYR-MNSGNRVATFMIYLSSVEAGGSTAFIYANFSVPV 464
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
K A FW NLH SG GD T HA CPVL G
Sbjct: 465 VKNAALFWWNLHRSGQGDGDTLHAGCPVLVGD 496
>gi|116496629|gb|AAI26171.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
Length = 544
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 112/273 (41%), Positives = 158/273 (57%), Gaps = 21/273 (7%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y+ L +SP E + P + P L+ R+ YE LC+ L
Sbjct: 258 PDNKRMARNVLKYERLLAESPNHVVAEAVIQRPNI----PHLQT--RDTYEGLCQ-TLGS 310
Query: 59 PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
P + + L C Y + YL L P+++E +L+P I LY D + DSE I+++A+P
Sbjct: 311 QPTLYQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPW 370
Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
L+R+ V +GE ++ YRISKSAWL++ +P + ++ R+ +TGL AE LQ
Sbjct: 371 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVNPKLVTLNHRIAALTGLDVRPPYAEYLQ 427
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
VVNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F NLS+
Sbjct: 428 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVP 486
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ A FW NLH SG+GD T HA CPVL G
Sbjct: 487 VVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGD 519
>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
Length = 521
Score = 180 bits (456), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 95/241 (39%), Positives = 144/241 (59%), Gaps = 7/241 (2%)
Query: 26 LKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKE 85
+++EP N+ P +E CRG+ P A+L C Y + P+LRL PLK
Sbjct: 265 IRNEP----NIKPKPFNKSVGDFERGCRGEF--PALTDAKLYCIYNTTSSPFLRLAPLKM 318
Query: 86 EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE 145
E L P ++LY DV+ +EI +++MA+P L+RATV N + R +K AW +
Sbjct: 319 ELIGLDPYMVLYHDVISPNEIAELQEMAKPELKRATVYNSTKNTNQFVKTRTAKVAWFLD 378
Query: 146 PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
+ + ER+++R+ MT + +E LQV+NYG+GG+Y H+D+ N S G+
Sbjct: 379 TFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTT-TNPHISQINGD 437
Query: 206 RVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
R+ATVLFY++DV QGGATVF + +++P++G+A W+NL G+G+ T HAACPV+ G
Sbjct: 438 RIATVLFYLNDVEQGGATVFPEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAACPVIVG 497
Query: 266 S 266
S
Sbjct: 498 S 498
>gi|38454288|ref|NP_942070.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Rattus norvegicus]
gi|81870816|sp|Q6W3E9.1|P4HA3_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962768|gb|AAQ87605.1| collagen prolyl 4-hydroxylase alpha III subunit [Rattus norvegicus]
Length = 544
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 111/270 (41%), Positives = 154/270 (57%), Gaps = 13/270 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVN--NVAPTLEVTEREKYEMLCRGDLTVPPA 61
P ++R N L Y+ L ++ L + NV P L+ R+ YE LC+ + P
Sbjct: 258 PDNKRMARNVLKYERLLAENGHLMAAETAIQRPNV-PHLQT--RDTYEGLCQTLGSQPTH 314
Query: 62 I-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
+ L C Y + PYL L P ++E +L+P + LY D + D E I+++A+P L+R+
Sbjct: 315 YQIPSLYCSYETNSSPYLLLQPARKEVIHLRPLVALYHDFVSDEEAQKIRELAEPWLQRS 374
Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
V +GE ++ YRISKSAWL++ PV+ + RR+ +TGL AE LQVVNY
Sbjct: 375 VV---ASGEKQLQVEYRISKSAWLKDTVDPVLVTLDRRIAALTGLDIQPPYAEYLQVVNY 431
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGGHYEPH+D A + +K + +GNR AT++ Y+S V GGAT F N S+ K
Sbjct: 432 GIGGHYEPHFDHATSPSSPLYK-MKSGNRAATLMIYLSSVEAGGATAFIYGNFSVPVVKN 490
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
A FW NLH SG+GD T HA CPVL G
Sbjct: 491 AALFWWNLHRSGEGDDDTLHAGCPVLVGDK 520
>gi|313242424|emb|CBY34571.1| unnamed protein product [Oikopleura dioica]
Length = 503
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/234 (42%), Positives = 136/234 (58%), Gaps = 6/234 (2%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRD 99
E E E YE LCR +P LKC Y N P+L L P+K EE + +P II + +
Sbjct: 240 EREETEYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYE 299
Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEHPVIERIS 155
++ D E+D+I + A+P+ ATVQ+ TG+L A+YRIS+SAWL + + +
Sbjct: 300 IITDEELDIINEQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFR 359
Query: 156 RRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
+R+ +TGLT AE++Q NYGIGG YEPHYD + +A F GNR+AT L Y++
Sbjct: 360 KRISIITGLTMERAEDIQYSNYGIGGQYEPHYDMSTENDAGKFDE-EDGNRIATWLTYLN 418
Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
+ GG TVF + P +A FW+NL G DY TRHAACPVL G ++
Sbjct: 419 EPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQKTV 472
>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
Length = 528
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 112/273 (41%), Positives = 157/273 (57%), Gaps = 21/273 (7%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y+ L +SP E + P + P L+ R+ YE LC+ L
Sbjct: 242 PDNKRMARNVLKYERLLAESPNHVVAEAVIQRPNI----PHLQT--RDTYEGLCQT-LGS 294
Query: 59 PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
P + + L C Y + YL L P+++E +L+P I LY D + DSE I+++A+P
Sbjct: 295 QPTLYQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPW 354
Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
L+R+ V +GE ++ YRISKSAWL++ P + ++ R+ +TGL AE LQ
Sbjct: 355 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQ 411
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
VVNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F NLS+
Sbjct: 412 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVP 470
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ A FW NLH SG+GD T HA CPVL G
Sbjct: 471 VVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGD 503
>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
gorilla gorilla]
Length = 517
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 112/273 (41%), Positives = 157/273 (57%), Gaps = 21/273 (7%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y+ L +SP E + P + P L+ R+ YE LC+ L
Sbjct: 231 PDNKRMARNVLKYERLLAESPNHVVAEAVIQRPNI----PHLQT--RDTYEGLCQT-LGS 283
Query: 59 PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
P + + L C Y + YL L P+++E +L+P I LY D + DSE I+++A+P
Sbjct: 284 QPTLYQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPW 343
Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
L+R+ V +GE ++ YRISKSAWL++ P + ++ R+ +TGL AE LQ
Sbjct: 344 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPKLVALNHRIAALTGLDVRPPYAEYLQ 400
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
VVNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F NLS+
Sbjct: 401 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVP 459
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ A FW NLH SG+GD T HA CPVL G
Sbjct: 460 VVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGD 492
>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III, isoform CRA_b
[Homo sapiens]
gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
Length = 544
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 112/273 (41%), Positives = 157/273 (57%), Gaps = 21/273 (7%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y+ L +SP E + P + P L+ R+ YE LC+ L
Sbjct: 258 PDNKRMARNVLKYERLLAESPNHVVAEAVIQRPNI----PHLQT--RDTYEGLCQ-TLGS 310
Query: 59 PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
P + + L C Y + YL L P+++E +L+P I LY D + DSE I+++A+P
Sbjct: 311 QPTLYQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPW 370
Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
L+R+ V +GE ++ YRISKSAWL++ P + ++ R+ +TGL AE LQ
Sbjct: 371 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQ 427
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
VVNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F NLS+
Sbjct: 428 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVP 486
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ A FW NLH SG+GD T HA CPVL G
Sbjct: 487 VVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGD 519
>gi|170029530|ref|XP_001842645.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
gi|167863229|gb|EDS26612.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
Length = 522
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 96/220 (43%), Positives = 135/220 (61%), Gaps = 9/220 (4%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
YE LCRG++ +++L+CR + P+LRL PLK EE L+P I LY V+ D EID
Sbjct: 274 YEPLCRGEVHRFADELSKLRCRLDTKTTPFLRLAPLKVEEVSLEPPIYLYHKVISDEEID 333
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT-GLTT 166
+ ++ + RL RATV G++ ++ RIS++ WL E P++ + RR M+ GL+
Sbjct: 334 KLIELGKARLNRATV-----GQM-VSQVRISQNVWLSEEVDPLLGVLQRRTYDMSRGLSM 387
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+ +QV NYGIGGH PHYD E F GNR+AT+++Y+SDV GG TVF
Sbjct: 388 QGFDMVQVNNYGIGGHNIPHYDC--DSEYPPFPQFNMGNRLATLMYYLSDVEVGGGTVFP 445
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
L+L ++P KG+A FWHN+H +G+ D HA CP L GS
Sbjct: 446 RLSLGVFPIKGSAIFWHNVHHNGNVDERMLHAGCPTLIGS 485
>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
domestica]
Length = 559
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 151/272 (55%), Gaps = 19/272 (6%)
Query: 4 PTHQRAQGNKLYYQEALNK-----SPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y+ L + PE + P V P L+ R+ YE LC+ +
Sbjct: 273 PNNKRVARNILKYERLLAEKSSVTGPEAAIKRPNV----PHLQT--RDTYEGLCQTLGSQ 326
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y PYL L P+++E +L+P I+LY D + DSE I+ A P L
Sbjct: 327 PTHYQIPSLYCAYETNASPYLLLQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWL 386
Query: 118 RRATVQNYKTGE-LEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQV 174
+R+ V +GE + YRISKSAWL++ P++ + R+ +TGL AE LQV
Sbjct: 387 QRSVV---ASGEKQQQVEYRISKSAWLKDTVDPMLVSLDHRIAALTGLNVQPPYAEHLQV 443
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GG+T F N S+
Sbjct: 444 VNYGIGGHYEPHFDHATSPSSPLYR-MNSGNRVATFMIYLSSVEAGGSTAFIYANFSVPV 502
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
K A FW NLH SG+GD T HA CPVL G
Sbjct: 503 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGD 534
>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
Length = 537
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 93/225 (41%), Positives = 127/225 (56%), Gaps = 2/225 (0%)
Query: 42 VTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVM 101
++ E Y C G + P L+C Y+ P+L L PLK EE P ++LY DV+
Sbjct: 285 LSHDEIYRYTCNGYIKKTPPEERNLRCGYMSETHPFLLLAPLKVEELNRNPLLVLYHDVI 344
Query: 102 YDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHM 161
Y SEID++ K+ + R RA V T ++ R S+ ++ H V+ I +RV M
Sbjct: 345 YQSEIDVLNKLNRKRYERAGVVINSTST--VSKKRTSQHIFIAATRHKVLRTIDQRVADM 402
Query: 162 TGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
T L AE+ Q+ +YGIGGHY H+D+ + K GNR+ATVLFY+SDVAQGG
Sbjct: 403 TNLNMQYAEDHQLADYGIGGHYSQHFDWFGNSDLANSKCDEMGNRIATVLFYLSDVAQGG 462
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
T F L L P+K AAFW+NLH+SG GD+ H CP++ GS
Sbjct: 463 GTAFPILKQLLKPKKYAAAFWYNLHASGKGDWRNLHGGCPIIVGS 507
>gi|348555277|ref|XP_003463450.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cavia porcellus]
Length = 584
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 110/269 (40%), Positives = 155/269 (57%), Gaps = 11/269 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSP--ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
P ++R N L Y+ L +S E+ + + NV P L+ R+ YE LC+ + P
Sbjct: 298 PENKRMVRNVLKYERLLAESSHQEVAETVIQRPNV-PHLQT--RDTYEGLCQTLGSQPIH 354
Query: 62 I-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
+ L C Y + PYL L P+++E +L+P + LY D + D E I+++A+P L+R+
Sbjct: 355 YQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVALYHDFVSDPEAQKIRELAEPWLQRS 414
Query: 121 TVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYG 178
V + G+ YRISKSAWL++ P++ ++ R+ +TGL AE LQVVNYG
Sbjct: 415 VVAS--GGKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 472
Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
IGGHYEPH+D A + F+ + +GNRVAT + Y+S V GGAT F N S+ K
Sbjct: 473 IGGHYEPHFDHATSPSSPLFR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKNA 531
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
A FW NLH SG+GD T HA CPVL G
Sbjct: 532 ALFWWNLHRSGEGDGDTLHAGCPVLVGDK 560
>gi|198418585|ref|XP_002122034.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1 (4-PH
alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 525
Score = 177 bits (449), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 100/263 (38%), Positives = 150/263 (57%), Gaps = 20/263 (7%)
Query: 11 GNKLYYQEALNKSPEL----KDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQL 66
GN LYY+ L + P L KDE + E ++Y +C+G +P + L
Sbjct: 246 GNMLYYRMFL-RYPHLFIFHKDENAE----------DEIKQYNQICQGKFKLPHKVSKNL 294
Query: 67 KCR-YVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN- 124
+C Y ++N P LR+ P+K EE P I+ + DV+ + +I+ IKKM++ L RA V
Sbjct: 295 RCYLYTNKNDPRLRIKPVKVEELCNSPHIVQFYDVINNDDIETIKKMSKKHLSRALVTGP 354
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
TG +E + R SK AW ++ + ++++ R+ MTGL+ T E+LQV NYG+ G Y+
Sbjct: 355 NNTGIVE--DIRTSKVAWFKKNDFTAVKKLYTRISEMTGLSEETFEDLQVANYGLAGEYQ 412
Query: 185 PHYDFAR-PGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
PH+D+ P GNR+AT+L Y++DV +GG T F + P KG+A FW+
Sbjct: 413 PHFDYTEDPSIYKREDGAEVGNRIATMLLYLNDVKEGGRTAFIEPKIVAKPIKGSAVFWY 472
Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
NL+ SG GD TRHA+CPV+ G+
Sbjct: 473 NLYPSGLGDPRTRHASCPVVIGN 495
>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
rubripes]
Length = 540
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 102/225 (45%), Positives = 131/225 (58%), Gaps = 8/225 (3%)
Query: 45 REKYEMLCRGDLTVPPAIVA-QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
R+ YE LC+ + P QL C P L L P++ E L+P ++LY D + D
Sbjct: 294 RDTYERLCQTRGSQPVHFENPQLFCDNFANGHPGLLLRPVRREVLSLRPYVVLYHDFISD 353
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELE-IANYRISKSAWLREPEHPVIERISRRVEHMT 162
SE + IK+ AQ LRR+ V TG+ + A YRISKSAWL+ H + R+ +++ +T
Sbjct: 354 SESEEIKQHAQLGLRRSVV---ATGDKQATAEYRISKSAWLKGSAHSTVSRLDQKISMLT 410
Query: 163 GLTTST--AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
GL E LQVVNYGIGGHYEPH+D A + FK L TGNRVAT + Y+S V G
Sbjct: 411 GLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFK-LKTGNRVATFMIYLSSVEAG 469
Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
G+T F N S+ K A FW NLH +G+GD T HA CPVL G
Sbjct: 470 GSTAFIYANFSVPVMKNAAIFWWNLHRNGEGDADTLHAGCPVLIG 514
>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 532
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 100/262 (38%), Positives = 150/262 (57%), Gaps = 27/262 (10%)
Query: 17 QEALNKSPELK----DEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVH 72
+E N+S E K DE + + V E Y+ LCRG+ P + +QL+CRY
Sbjct: 255 RERANRSTEFKAQLFDEEIEDDQVT--------ENYKRLCRGEQLRTPKMDSQLRCRYYT 306
Query: 73 RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEI 132
+ +L P+K EE L+P +++ RD++ D +++ + A+PRL ++ + + +
Sbjct: 307 GETGFFKLQPIKLEEFNLKPYVVVLRDLLQDRDLNDMIAFAKPRLEQS--KTLCAADKDG 364
Query: 133 ANYRISKSAWLREPEHPVIERISRRVEHMTGLTT----STAEELQVVNYGIGGHYEPHYD 188
R S + WL + + PV R+++ ++ + GL T AE+ Q+ NYGIGGHY PH+D
Sbjct: 365 PPSRTSSNTWLNDEDAPVAARVNQYLQSLLGLGTLFSRDEAEKYQLANYGIGGHYVPHHD 424
Query: 189 ----FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
F P + N F GNRVAT++ YMSDV +GGATVF SL + + P+KG A FW N
Sbjct: 425 YFEEFQTPSKGNRF-----GNRVATLMIYMSDVEEGGATVFPSLGVRVSPKKGDAVFWWN 479
Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
+ SS +G+ T HA CPVL GS
Sbjct: 480 IMSSWEGEMLTWHAGCPVLYGS 501
>gi|119595340|gb|EAW74934.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III, isoform CRA_a
[Homo sapiens]
Length = 657
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 113/275 (41%), Positives = 159/275 (57%), Gaps = 23/275 (8%)
Query: 1 MIF--PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCR 53
+IF P ++R N L Y+ L +SP E + P + P L+ R+ YE LC+
Sbjct: 285 IIFCCPDNKRMARNVLKYERLLAESPNHVVAEAVIQRPNI----PHLQT--RDTYEGLCQ 338
Query: 54 GDLTVPPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
L P + + L C Y + YL L P+++E +L+P I LY D + DSE I++
Sbjct: 339 -TLGSQPTLYQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRE 397
Query: 112 MAQPRLRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST-- 168
+A+P L+R+ V +GE ++ YRISKSAWL++ P + ++ R+ +TGL
Sbjct: 398 LAEPWLQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPY 454
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
AE LQVVNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F
Sbjct: 455 AEYLQVVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYA 513
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVL 263
NLS+ + A FW NLH SG+GD T HA CPVL
Sbjct: 514 NLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVL 548
>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
Length = 487
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 91/241 (37%), Positives = 145/241 (60%), Gaps = 7/241 (2%)
Query: 26 LKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKE 85
+++EP N+ P +E CRG+ P A+L C Y + P+LRL PLK
Sbjct: 228 IRNEP----NIKPKPFNKSVGDFERGCRGEF--PALTDAKLYCIYNTTSSPFLRLAPLKM 281
Query: 86 EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE 145
E L P ++LY DV+ +EI +++MA+P+L+RA V N +++ R +K AW +
Sbjct: 282 ELIGLDPYMVLYHDVISPNEIAELQEMAKPQLKRARVYNSTKNTDQLSKTRTAKLAWFLD 341
Query: 146 PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
+ + ER+++R+ MT + +E LQV+NYG+GG+Y H+D+ + + G+
Sbjct: 342 TFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTKGPHITQIN-GD 400
Query: 206 RVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
R+ATVLFY++DV QGGATVF + +++P++G+A W+NL G+G+ T HA CPV+ G
Sbjct: 401 RIATVLFYLNDVEQGGATVFPEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAGCPVIVG 460
Query: 266 S 266
S
Sbjct: 461 S 461
>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
griseus]
Length = 509
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 109/272 (40%), Positives = 152/272 (55%), Gaps = 19/272 (6%)
Query: 4 PTHQRAQGNKLYY-----QEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y Q L + E + P V N+ R+ YE LC+ +
Sbjct: 223 PDNKRMARNVLKYERLLSQNTLQMATETVIQRPNVPNL------QTRDTYEGLCQTLGSQ 276
Query: 59 PPAIVA-QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P +L C Y + PYL L P ++E +L+P + LY D + D+E I+++A+P L
Sbjct: 277 PTHYQNPRLYCSYETNSSPYLLLQPARKEVIHLRPFVALYHDFVSDAEAQKIRELAEPWL 336
Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQV 174
+R+ V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQV
Sbjct: 337 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPMLGTLDHRIAALTGLDIQPPYAEYLQV 393
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+
Sbjct: 394 VNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSAVEAGGATAFIYANFSVPV 452
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
K A FW NLH SG+GD T HA CPVL G
Sbjct: 453 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGD 484
>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
Length = 496
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/215 (44%), Positives = 129/215 (60%), Gaps = 9/215 (4%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG+ ++ L C Y P+L L P+K E L P II++ DV+ EID ++K
Sbjct: 267 CRGEFVG----ISNLYCVYKFGTSPFLLLAPIKMEIRLLNPFIIVFHDVLSPREIDELQK 322
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+P L R TV +K E + + R SK W+ + + +RI RR+ M L +E
Sbjct: 323 LARPLLERTTVVKFKKYEKD--SRRTSKGTWIERDHNNLTKRIERRITDMVELDLRYSEP 380
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
QV+NYG+GGHY H DF G+ A K +R+ATVLFY++DV QGGATVFT LN +
Sbjct: 381 FQVMNYGLGGHYAAHEDFL--GDTWADKK-EEDDRIATVLFYLTDVEQGGATVFTILNQA 437
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ P++GTA FW+NLH +G GD T H CPVL GS
Sbjct: 438 VSPKRGTALFWYNLHRNGTGDTRTLHGGCPVLVGS 472
>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
Length = 404
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 108/268 (40%), Positives = 152/268 (56%), Gaps = 11/268 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
P ++R N L Y+ L ++ ++ E P L+ R+ YE LC+ + P
Sbjct: 118 PDNKRMARNVLKYERLLAENGHQMAAETAIQRPNVPHLQT--RDTYEGLCQTLGSQPTHY 175
Query: 63 -VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ L C Y + PYL L P ++E +L+P I LY D + D E I+++A+P L+R+
Sbjct: 176 QIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSV 235
Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQVVNYG 178
V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQVVNYG
Sbjct: 236 V---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 292
Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
IGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+ K
Sbjct: 293 IGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNA 351
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGS 266
A FW NLH SG+GD T HA CPVL G
Sbjct: 352 ALFWWNLHRSGEGDGDTLHAGCPVLVGD 379
>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
Length = 542
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 108/269 (40%), Positives = 152/269 (56%), Gaps = 11/269 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
P ++R N L Y+ L ++ ++ E P L+ R+ YE LC+ + P
Sbjct: 256 PDNKRMARNVLKYERLLAENGHQMAAETAIQRPNVPHLQT--RDTYEGLCQTLGSQPTHY 313
Query: 63 -VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ L C Y + PYL L P ++E +L+P I LY D + D E I+++A+P L+R+
Sbjct: 314 QIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSV 373
Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYG 178
V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQVVNYG
Sbjct: 374 V---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 430
Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
IGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+ K
Sbjct: 431 IGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNA 489
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
A FW NLH SG+GD T HA CPVL G
Sbjct: 490 ALFWWNLHRSGEGDGDTLHAGCPVLVGDK 518
>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
Length = 542
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 108/269 (40%), Positives = 152/269 (56%), Gaps = 11/269 (4%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
P ++R N L Y+ L ++ ++ E P L+ R+ YE LC+ + P
Sbjct: 256 PDNKRMARNVLKYERLLAENGHQMAAETAIQRPNVPHLQT--RDTYEGLCQTLGSQPTHY 313
Query: 63 -VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ L C Y + PYL L P ++E +L+P I LY D + D E I+++A+P L+R+
Sbjct: 314 QIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSV 373
Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYG 178
V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQVVNYG
Sbjct: 374 V---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 430
Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
IGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+ K
Sbjct: 431 IGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNA 489
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
A FW NLH SG+GD T HA CPVL G
Sbjct: 490 ALFWWNLHRSGEGDGDTLHAGCPVLVGDK 518
>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
Length = 490
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 100/230 (43%), Positives = 139/230 (60%), Gaps = 19/230 (8%)
Query: 39 TLEVTEREKYEMLCRGDLTVPPAIVAQ-LKCRY-VHRNVPYLRLMPLKEEEAYLQPRIIL 96
TL YE LCRG++ + + L CRY P L P+KEEE + +P+II
Sbjct: 253 TLNTQSNNSYEALCRGEVDERTSKRQRALSCRYSTGGGNPRLMYAPVKEEELWDEPKIIR 312
Query: 97 YRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISR 156
Y DV+ D+EI+ +K +A+P L R+ +TG I++ R S+S +L E + RIS+
Sbjct: 313 YHDVISDTEIETLKDIARPELTRS-----QTGWGVISDIRTSQSVFLEEV--GTVARISQ 365
Query: 157 RVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSD 216
R+ +TGL+ +AE+L V NYGIGG Y PH+D E N R AT L YMSD
Sbjct: 366 RIADITGLSVESAEKLHVQNYGIGGRYTPHFDTG--DEVN--------ERTATFLIYMSD 415
Query: 217 VAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
V GGATVFT++ +++ PEKG+A FW+NLH +G+ D T+HA CPVL G+
Sbjct: 416 VEVGGATVFTNVGVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGN 465
>gi|195159319|ref|XP_002020529.1| GL14044 [Drosophila persimilis]
gi|194117298|gb|EDW39341.1| GL14044 [Drosophila persimilis]
Length = 536
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 96/247 (38%), Positives = 135/247 (54%), Gaps = 14/247 (5%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E YE +CRG+LT P L+CR R Y P K EE + P I+ D++
Sbjct: 283 EFRMYEQVCRGELTPSPTAQRHLRCRLQRRRFDY---APFKLEELHADPPIVQVHDMVSQ 339
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
E ++ A+PR++R+TV N A +R S+ A ++ +R+S+ V ++G
Sbjct: 340 RESLFLQNAARPRIQRSTVYNQAGAGTTAAAFRTSQGASFNYSQYATTQRLSQHVADLSG 399
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
L AE LQ+ NYGIGGHYEPH+D + P + GNR+AT ++Y+SDV GG T
Sbjct: 400 LDMDYAENLQIANYGIGGHYEPHWD-SFPEHHEYPEDDLYGNRLATAIYYLSDVVAGGGT 458
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC---------- 273
F L L + PE+G+ FW+NLH SGD D+ T+HAACPVL GS + +
Sbjct: 459 AFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDRVR 518
Query: 274 PCGLRRG 280
PC L+R
Sbjct: 519 PCDLQRN 525
>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
Length = 508
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 100/230 (43%), Positives = 139/230 (60%), Gaps = 19/230 (8%)
Query: 39 TLEVTEREKYEMLCRGDLTVPPAIVAQ-LKCRY-VHRNVPYLRLMPLKEEEAYLQPRIIL 96
TL YE LCRG++ + + L CRY P L P+KEEE + +P+II
Sbjct: 271 TLNTQSNNSYEALCRGEVDERTSKRQRALSCRYSTGGGNPRLMYAPVKEEELWDEPKIIR 330
Query: 97 YRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISR 156
Y DV+ D+EI+ +K +A+P L R+ +TG I++ R S+S +L E + RIS+
Sbjct: 331 YHDVISDTEIETLKDIARPELTRS-----QTGWGVISDIRTSQSVFLEEV--GTVARISQ 383
Query: 157 RVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSD 216
R+ +TGL+ +AE+L V NYGIGG Y PH+D E N R AT L YMSD
Sbjct: 384 RIADITGLSVESAEKLHVQNYGIGGRYTPHFDTG--DEVN--------ERTATFLIYMSD 433
Query: 217 VAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
V GGATVFT++ +++ PEKG+A FW+NLH +G+ D T+HA CPVL G+
Sbjct: 434 VEVGGATVFTNVGVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGN 483
>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
Length = 144
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 81/128 (63%), Positives = 99/128 (77%), Gaps = 4/128 (3%)
Query: 139 KSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAF 198
+S WLR+ E +++RIS RV+ +GL +T+E+LQVVNYGIGGHYEPHYDFAR + F
Sbjct: 2 RSGWLRDEEDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHYDFAR----DKF 57
Query: 199 KSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHA 258
SLGTGNR+AT L Y+SDV GG TVFT + ++WP+KG AAFW+NL SGDGD TRHA
Sbjct: 58 TSLGTGNRIATFLSYLSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHA 117
Query: 259 ACPVLTGS 266
ACPVL GS
Sbjct: 118 ACPVLVGS 125
>gi|195505244|ref|XP_002099420.1| GE10895 [Drosophila yakuba]
gi|194185521|gb|EDW99132.1| GE10895 [Drosophila yakuba]
Length = 533
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/229 (40%), Positives = 138/229 (60%), Gaps = 11/229 (4%)
Query: 46 EKYEMLCRG--DLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E + LCR + ++L CRY P+LRL PL+ EE L P ++LY +V+ D
Sbjct: 272 ESFNQLCRSVSRRHASESKPSRLHCRYNATTTPFLRLAPLRMEELSLDPYVVLYHNVLSD 331
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEH-PVIERISRRV 158
EI+ ++ M++P L RA V + G EI R + AWL EPE V+ RI RR+
Sbjct: 332 PEIEKLQLMSEPFLERAKVFRVEKGSDEIGASRAADGAWLPHQETEPEDLEVLNRIGRRI 391
Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
+TGL+T + ++Q++ YG GGH+ PH+D+ ++ G+R+ATVLFY+++V
Sbjct: 392 GDITGLSTRSGRQMQLLKYGFGGHFTPHFDYF---DSKTLYLEKVGDRIATVLFYLNNVE 448
Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
GGATVF S+NL++ +KG+A FWHNL S D D T H ACP+++G+
Sbjct: 449 HGGATVFPSINLAVPTQKGSALFWHNLDGQSYDYDTRTFHGACPLISGT 497
>gi|198449648|ref|XP_001357666.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
gi|198130700|gb|EAL26801.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
Length = 536
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 96/247 (38%), Positives = 134/247 (54%), Gaps = 14/247 (5%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E YE +CRG+LT P L+CR R Y P K EE + P I+ D++
Sbjct: 283 EFRMYEQVCRGELTPSPTAQRHLRCRLQRRRFDY---APFKLEELHADPPIVQVHDMVSQ 339
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
E ++ A+PR++R+TV N A +R S+ A ++ +R+S+ V ++G
Sbjct: 340 RESLFLQNAARPRIQRSTVYNQAGAGTTAAAFRTSQGASFNYSQYATTQRLSQHVADLSG 399
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
L AE LQ+ NYGIGGHYEPH+D + P + GNR+AT ++Y+SDV GG T
Sbjct: 400 LDMDYAENLQIANYGIGGHYEPHWD-SFPEHHEYPEDDLYGNRLATAIYYLSDVVAGGGT 458
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC---------- 273
F L L + PE+G+ FW+NLH SGD D+ T+HAACPVL GS + +
Sbjct: 459 AFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDRVR 518
Query: 274 PCGLRRG 280
PC L R
Sbjct: 519 PCDLHRN 525
>gi|442747091|gb|JAA65705.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 533
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 89/225 (39%), Positives = 136/225 (60%), Gaps = 6/225 (2%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
E Y+ LCRG+ P + +QL+CRY + +L P+K EE L+P +++ RD++ D +
Sbjct: 280 ENYKRLCRGEQLRTPKMDSQLRCRYYTGETGFFKLQPIKLEEYNLKPYVVVLRDLLQDRD 339
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
++ + A+PRL ++ + + + R S + WL + + PV R+++ ++ + GL
Sbjct: 340 LNDMIAFAKPRLEQS--KTLCAADKDGPPPRTSSNTWLDDDDAPVAARVNQYLQSLLGLG 397
Query: 166 T----STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
T AE+ Q+ NYGIGGHY PH+D+ ++ K G+RVAT++ YMSDV +GG
Sbjct: 398 TLYGKDEAEKYQLANYGIGGHYVPHHDYLEESLTSSKKHRLFGDRVATLMIYMSDVEEGG 457
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
ATVF SL + + P KG A FW N+ SS +GD T HA CPVL GS
Sbjct: 458 ATVFPSLGVRVSPRKGDAVFWWNIKSSWEGDVLTWHAGCPVLYGS 502
>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
Length = 212
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 126/188 (67%), Gaps = 3/188 (1%)
Query: 80 LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
+ P K EEA L P I++Y + + D EI+ I ++++P L+R+ V ++ E++N R S+
Sbjct: 1 IAPFKLEEASLDPLIVIYHNAISDKEIEQIIQVSKPMLKRSMVG--ESFSKEVSNERTSQ 58
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARP-GEANAF 198
+AWL + + +++ +S R E MTGL + E LQV NYGIGG Y PH+D+ R G +
Sbjct: 59 NAWLADYDFELVKVLSLRTEDMTGLDRKSYESLQVNNYGIGGFYLPHFDWVRTNGTEEPY 118
Query: 199 KSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHA 258
K +G GNR+AT+++Y+SDV QGGATVF + + ++P+KG+A FW+NL G GD T H
Sbjct: 119 KDMGLGNRIATLMYYLSDVEQGGATVFPQIGVGVFPKKGSAIFWYNLLPDGTGDERTLHG 178
Query: 259 ACPVLTGS 266
ACPVL GS
Sbjct: 179 ACPVLLGS 186
>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
Length = 285
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 103/225 (45%), Positives = 128/225 (56%), Gaps = 8/225 (3%)
Query: 45 REKYEMLCRGDLTVPPAIVA-QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
R+ YE LCR + P QL C P L L P + E LQP ++LY D + D
Sbjct: 39 RDTYERLCRTRGSQPTHFENPQLFCDNFANGHPGLLLRPARRETLSLQPYVVLYHDFISD 98
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMT 162
+E + IK AQ LRR+ V T + ++ A YRISKSAWL+ + R+ +R+ +T
Sbjct: 99 TEAEEIKHHAQLGLRRSVV---ATRDKQVTAEYRISKSAWLKGSAQSAVSRLDQRISMLT 155
Query: 163 GLTTST--AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
GL E LQVVNYGIGGHYEPH+D A + FK L TGNRVATV+ Y+S V G
Sbjct: 156 GLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFK-LKTGNRVATVMIYLSSVEAG 214
Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
G+T F N S+ K A FW NLH +G GD T HA CPVL G
Sbjct: 215 GSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLIG 259
>gi|281362877|ref|NP_733393.3| CG31016, isoform B [Drosophila melanogaster]
gi|442621939|ref|NP_001263119.1| CG31016, isoform C [Drosophila melanogaster]
gi|272477249|gb|AAF57071.5| CG31016, isoform B [Drosophila melanogaster]
gi|440218076|gb|AGB96498.1| CG31016, isoform C [Drosophila melanogaster]
Length = 536
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 100/251 (39%), Positives = 146/251 (58%), Gaps = 18/251 (7%)
Query: 26 LKDEP-PKVNNVAPTLEVTER-EKYEMLCRGD--LTVPPAIVAQLKCRYVHRNVPYLRLM 81
L+++P P +N LE E E + LCR + + ++L CRY P+L+L
Sbjct: 258 LRNKPKPSIN-----LESWESDESFNQLCRSSSRRQMGESKPSRLHCRYNTITTPFLKLA 312
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + EE L P +I Y +V+ D+EI+ +K M +P L RA V + G EI R + A
Sbjct: 313 PFRMEELSLDPYVIFYHNVLSDAEIEKLKPMGKPFLERAKVFRVEKGSDEIDPSRSADGA 372
Query: 142 WL----REPEH-PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEAN 196
WL +P+ V+ RI RR+E MTGL T + ++Q + YG GGH+ PHYD+ +
Sbjct: 373 WLPHQNIDPDDLEVLNRIGRRIEDMTGLNTRSGSKMQFLKYGFGGHFVPHYDYFN---SK 429
Query: 197 AFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL-HSSGDGDYYT 255
F G+R+ATVLFY+++V GGATVF LNL++ +KG+A FWHN+ S D D T
Sbjct: 430 TFSLETVGDRIATVLFYLNNVDHGGATVFPKLNLAVPTQKGSALFWHNIDRKSYDYDTRT 489
Query: 256 RHAACPVLTGS 266
H ACP+++G+
Sbjct: 490 FHGACPLISGT 500
>gi|194905305|ref|XP_001981170.1| GG11767 [Drosophila erecta]
gi|190655808|gb|EDV53040.1| GG11767 [Drosophila erecta]
Length = 536
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 96/242 (39%), Positives = 145/242 (59%), Gaps = 22/242 (9%)
Query: 39 TLEVTEREK-YEMLCR-------GDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYL 90
+LE E E+ + LCR GD + ++L CRY P+LRL+PL+ EE L
Sbjct: 267 SLECCESEESFNHLCRSVSRRQAGD-----SKPSRLHCRYNTTTRPFLRLVPLRMEELSL 321
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH-- 148
P ++LY +V+ D EI+ +K M++P L RA V + G E+A R + AWL +PE
Sbjct: 322 DPYVVLYHNVLSDPEIEKLKLMSEPFLERAKVYRVEKGSDEVAPSRSADGAWLPDPETEP 381
Query: 149 ---PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
+ RI RR+ +TGL+T + ++Q++ YG GGH+ PHYD+ ++ G+
Sbjct: 382 EDLETLNRIGRRIGDITGLSTCSGSQMQLLKYGFGGHFVPHYDYF---DSKTSYLEAVGD 438
Query: 206 RVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLT 264
R+ATVLFY+++V GGAT F ++NL++ +KG+A FWHNL S D D T H ACP+++
Sbjct: 439 RIATVLFYLNNVDHGGATAFPNINLAVPTQKGSALFWHNLDGKSYDYDTRTFHGACPLIS 498
Query: 265 GS 266
G+
Sbjct: 499 GT 500
>gi|159884097|gb|ABX00727.1| IP12176p [Drosophila melanogaster]
Length = 538
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 93/229 (40%), Positives = 135/229 (58%), Gaps = 11/229 (4%)
Query: 46 EKYEMLCRGD--LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E + LCR + + ++L CRY P+L+L P + EE L P +I Y +V+ D
Sbjct: 277 ESFNQLCRSSSRRQMGESKPSRLHCRYNTITTPFLKLAPFRMEELSLDPYVIFYHNVLSD 336
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEH-PVIERISRRV 158
+EI+ +K M +P L RA V + G EI R + AWL +P+ V+ RI RR+
Sbjct: 337 AEIEKLKPMGKPFLERAKVFRVEKGSDEIDPSRSADGAWLPHQNIDPDDLEVLNRIGRRI 396
Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
E MTGL T + ++Q + YG GGH+ PHYD+ + F G+R+ATVLFY+++V
Sbjct: 397 EDMTGLNTRSGSKMQFLKYGFGGHFVPHYDYFN---SKTFSLETVGDRIATVLFYLNNVD 453
Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNL-HSSGDGDYYTRHAACPVLTGS 266
GGATVF LNL++ +KG+A FWHN+ S D D T H ACP+++G+
Sbjct: 454 HGGATVFPKLNLAVPTQKGSALFWHNIDRKSYDYDTRTFHGACPLISGT 502
>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
Length = 534
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 95/229 (41%), Positives = 134/229 (58%), Gaps = 21/229 (9%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
+++ C G P +L C Y P+LRL PLK E+ L P ++LY +V+ EI
Sbjct: 287 FKLSCNG----PHESSTRLHCFYNFTTTPFLRLAPLKTEQIGLDPYVVLYHEVLSAREIS 342
Query: 108 -LIKKMAQ----PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
LI K AQ R+ R T G R +K WL++ + + RI+RR+ MT
Sbjct: 343 MLISKAAQNMKNTRVHRETKPKTNRG-------RTAKGHWLKKESNELTRRITRRIVDMT 395
Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEAN-----AFKSLGTGNRVATVLFYMSDV 217
G + +E+ QV+NYGIGGHY H D+ +N + +S G+R+ATVLFY+SDV
Sbjct: 396 GFDLADSEDFQVINYGIGGHYFLHMDYFDYASSNYTGPRSRQSKVLGDRIATVLFYLSDV 455
Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
QGGATVF ++ S++P+ GTA FW+NL + G+GD TRHA+CPV+ GS
Sbjct: 456 EQGGATVFGNVGYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPVIVGS 504
>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
Length = 242
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 89/230 (38%), Positives = 133/230 (57%), Gaps = 10/230 (4%)
Query: 60 PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
PA +L+CR N P + EE +L P +I D++ E +++++A+P L+R
Sbjct: 1 PATQRKLRCRLHRGNGLRSSYQPYRLEELHLDPYVIQVHDIISAEETIVLQQLARPELQR 60
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
+ V + E N+RIS+ + EHP+++R+S+ +E+++GL +AE+LQV NYGI
Sbjct: 61 SMVYSLSNSEHISTNFRISQGTFFEYHEHPIMQRMSQHLENISGLDMRSAEQLQVANYGI 120
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GGHYEPH D + + NRVAT ++Y+S+V GG T F L L + PE+G+
Sbjct: 121 GGHYEPHMDSFSENHNYGINTYMSTNRVATGIYYLSNVEAGGGTAFPFLPLLVEPERGSL 180
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRR 279
FW+NLH SGD DY T+HA CPVL GS + + PC L+R
Sbjct: 181 LFWYNLHRSGDLDYRTKHAGCPVLMGSKWIANVWIRLSNQDHIRPCDLQR 230
>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
Length = 539
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 95/243 (39%), Positives = 141/243 (58%), Gaps = 13/243 (5%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
Y+ +CR +L PA + +L+CR N P K EE +L P II DV+ +
Sbjct: 287 YQQVCREELRPAPAALRELRCRLFAGNGRKSTYAPYKLEELHLDPYIIQVHDVISARDTA 346
Query: 108 LIKKMAQPRLRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
++ +A+P L+R+ V + +TG I AN+R S+ +HP+++++S V ++GL
Sbjct: 347 ELQHLARPELQRSQVYS-RTGHEHISANFRTSQGTTFEYTDHPIMQKMSHHVAEISGLDM 405
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+AE LQ+ NYGIGGHYEPH D + P + ++ NR+AT ++Y+S+V GG T F
Sbjct: 406 RSAEPLQIANYGIGGHYEPHMD-SFPDSYDYSLNMYKTNRLATGIYYLSNVEAGGGTAFP 464
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCG 276
L L + PE+G+ FW+NLH SGD DY T+HAACPVL GS + + PC
Sbjct: 465 FLPLLVTPERGSLLFWYNLHPSGDADYRTKHAACPVLQGSKWIANVWIRLSNQDHVRPCE 524
Query: 277 LRR 279
L+R
Sbjct: 525 LQR 527
>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 124/205 (60%), Gaps = 14/205 (6%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT--VQ 123
L C Y P+LRL PLK E P +++Y DV+ DSEI I +MA+ R+ R + Q
Sbjct: 293 LHCCYNFTTTPFLRLAPLKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQ 352
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
+T + R + AWL+ + + RI+RRV M+GL +E +QV+NYGIGGHY
Sbjct: 353 PNRTS----SPTRTAMGAWLKRSSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHY 408
Query: 184 EPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
PH D F + E GNR+ATVLFY++DV QGGAT+F + P +GTA FW
Sbjct: 409 VPHKDWFTQHPEV-------MGNRLATVLFYLTDVEQGGATMFNKAEHKVLPRRGTALFW 461
Query: 243 HNLHSSGDGDYYTRHAACPVLTGSN 267
+NLH+ G+GD+ T HAACP++ GS
Sbjct: 462 YNLHTDGEGDWSTTHAACPIIVGSK 486
>gi|442751927|gb|JAA68123.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 522
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 98/267 (36%), Positives = 148/267 (55%), Gaps = 22/267 (8%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
E E Y+ LCRG++ P + ++L+CRY + L P+K EE L+P II+ RDV
Sbjct: 258 EDQEEHNYKRLCRGEVLRTPKMDSKLRCRYYKGQDGFFTLRPIKLEEINLKPYIIVMRDV 317
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
+ + +I+ + A+PRL+R+T Y + R S +AWL + E P+ R++ +
Sbjct: 318 VQERDIEDLMAFAEPRLQRSTT--YTGDGNAPSTRRTSSNAWLWDDEAPIANRMNWYLRA 375
Query: 161 MTGLTTS----TAEELQVVNYGIGGHYEPHYDF------ARPGEANAFKSLGTGNRVATV 210
+ GL TS AE Q+ NYG GG++ PH+D+ A A+ + G+R+AT+
Sbjct: 376 LVGLGTSGSDYEAEAYQLANYGSGGYFLPHHDYLQDTLHAHNSTADYYLQNKEGDRLATL 435
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLH 270
+ YM+DV GGATVF L + L P+KG AAFW NL +SG+GD T HA CPVL GS +
Sbjct: 436 MIYMTDVEVGGATVFPRLGVRLVPKKGDAAFWWNLKASGEGDTLTMHAGCPVLYGSKWIA 495
Query: 271 ST----------CPCGLRRGLQRSGII 287
+ PC + R + + ++
Sbjct: 496 NKWFKSYSNVFRLPCSIDRNVSLAPLV 522
>gi|321461762|gb|EFX72791.1| hypothetical protein DAPPUDRAFT_308081 [Daphnia pulex]
Length = 561
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 95/234 (40%), Positives = 139/234 (59%), Gaps = 10/234 (4%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
E E E YE LCRG+ I A L+CR V R P L L P+K EE L P I++ D+
Sbjct: 298 EDEENEHYERLCRGEKLRSANIEAGLRCRLVTRGHPALLLQPIKVEEQSLDPMIVVLHDL 357
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
+ + + ++++++ +P+L ++ G+ + R SK+AWL+E E+ + I R+E
Sbjct: 358 ITERQTEILRQLGEPKLA-TSLHRGGEGKFVRSMIRTSKNAWLQEHENASLPAIRHRMEL 416
Query: 161 MTGLT---TSTAEELQVVNYGIGGHYEPHYDFA-----RPGEANAFKSLGTGNRVATVLF 212
TGL + +E Q+ NYGIGG Y+ H D RP + + + +L G+R+AT++
Sbjct: 417 ATGLIYGPETASEYFQIANYGIGGLYKTHTDNVIHPDVRPEDQDPW-NLYVGDRIATLMV 475
Query: 213 YMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
Y+SDV GGATVF ++ WP KG+AAFW NL+ SG+ D TRH ACPVL GS
Sbjct: 476 YLSDVEAGGATVFPRAGVTCWPRKGSAAFWWNLYKSGEPDLTTRHGACPVLHGS 529
>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
Length = 535
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 124/205 (60%), Gaps = 14/205 (6%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT--VQ 123
L C Y P+LRL PLK E P +++Y DV+ DSEI I +MA+ R+ R + Q
Sbjct: 318 LHCCYNFTTTPFLRLAPLKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQ 377
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
+T + R + AWL+ + + RI+RRV M+GL +E +QV+NYGIGGHY
Sbjct: 378 PNRTS----SPTRTALGAWLKRSSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHY 433
Query: 184 EPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
PH D F + E GNR+ATVLFY++DV QGGAT+F + P +GTA FW
Sbjct: 434 VPHKDWFTQHPEV-------MGNRLATVLFYLTDVEQGGATMFNKAEHKVLPRRGTALFW 486
Query: 243 HNLHSSGDGDYYTRHAACPVLTGSN 267
+NLH+ G+GD+ T HAACP++ GS
Sbjct: 487 YNLHTDGEGDWSTTHAACPIIVGSK 511
>gi|195452730|ref|XP_002073475.1| GK13125 [Drosophila willistoni]
gi|194169560|gb|EDW84461.1| GK13125 [Drosophila willistoni]
Length = 539
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 94/244 (38%), Positives = 139/244 (56%), Gaps = 16/244 (6%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
YE +CRG+ +L+CR + Y L+ EE + P ++ +++ +++
Sbjct: 288 YEQVCRGETRPSAKSQRELRCRLQRSRLSY---EVLELEELHQDPFVVQVHNIVSQKDMN 344
Query: 108 LIKKMAQPRLRRATV--QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
L++K+A+P ++R+ V Q++ E A YR SK A EH +E +SR V ++GL
Sbjct: 345 LLQKIARPNIQRSQVYAQDHNANETVAAAYRTSKGATFEYFEHRSMELLSRHVADLSGLD 404
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
++AE LQ+ NYGIGGHYEPH+D P GNR+AT ++Y+S+V GG T F
Sbjct: 405 MNSAELLQIANYGIGGHYEPHWD-CFPDHHVYLPDDRDGNRIATGIYYLSEVEAGGGTAF 463
Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PC 275
L L + PE+G+ FW+NLH SGD DY T+HAACPVL GS + + PC
Sbjct: 464 PFLPLLVTPERGSLVFWYNLHRSGDQDYRTKHAACPVLQGSKWIANVWIRQSNQDQIRPC 523
Query: 276 GLRR 279
GL+R
Sbjct: 524 GLQR 527
>gi|335294484|ref|XP_003357239.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Sus scrofa]
Length = 545
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 106/273 (38%), Positives = 153/273 (56%), Gaps = 18/273 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y++ L +S E + P + P L+ R+ YE LC+ +
Sbjct: 258 PDNKRMARNVLKYEKLLAESASQAVAETVIQRPNI----PHLQT--RDTYEGLCQTLGSQ 311
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y + PYL L P+++E +L+P ++LY D + D+E I+ +A+P +
Sbjct: 312 PTHYQIPSLYCSYETSSSPYLLLQPIRKEVIHLEPYVVLYHDFVTDAEAQKIRGLAEPWV 371
Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
+ +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQV
Sbjct: 372 TAEIL--VASGEKQLPVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQV 429
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
VNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+
Sbjct: 430 VNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYGNFSVPV 488
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
K A FW NLH SG+GD T HA CPVL G
Sbjct: 489 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDK 521
>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
precursor (4-PH alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 527
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 100/267 (37%), Positives = 151/267 (56%), Gaps = 13/267 (4%)
Query: 4 PTHQRAQGNKL-YYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
P + R GN++ + + A ++ E+ D+ +V T + LCRG+ T+
Sbjct: 236 PENTRILGNRIRFTRHARVQTQEVVDDKFYTFSVDET--------FFKLCRGEQTLTKKK 287
Query: 63 V-AQLKCRYVHRNV--PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
+L+C Y+ N+ P L + P+K EE P I+ + DV+ D+ I+ IKK+A+P+L R
Sbjct: 288 QHKKLRC-YLSTNMGNPKLLIRPVKVEELSKSPDIVQFHDVLSDTVINEIKKLAKPQLFR 346
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
A +L+ A YRI+K AWL + + P + +I+ R+ +TGLT +T+EE+QV NYG+
Sbjct: 347 AIHAGSDDTDLQKAPYRITKLAWLLDDDGPEVAKITERISDITGLTLNTSEEIQVANYGV 406
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GG Y PH+D E G R+AT L Y+SDV GG T F + +S P KG+A
Sbjct: 407 GGEYPPHFDIPTTDEERDDLKSQDGERIATFLIYLSDVEVGGRTAFVNAGVSAKPIKGSA 466
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
FW+N+ SG+ D T H ACPV G+
Sbjct: 467 VFWYNVFPSGEPDLRTYHGACPVAFGN 493
>gi|67084101|gb|AAY66985.1| truncated prolyl 4-hydroxylase alpha subunit [Ixodes scapularis]
Length = 452
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 99/265 (37%), Positives = 147/265 (55%), Gaps = 24/265 (9%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E Y LCRG+ P + ++L+CRY + L P+K E+ L+P II+ RDV+ +
Sbjct: 191 EELNYRRLCRGEALRTPQMDSKLRCRYYKGQDGFFTLHPIKLEKINLKPYIIVMRDVVQE 250
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYR-ISKSAWLREPEHPVIERISRRVEHMT 162
+I+ + A+PRL+R+T TG+ + R S +AWL + E P+ R++ + +
Sbjct: 251 RDIENLMAFAEPRLQRSTTY---TGDGNAPSTRQTSSNAWLWDDEAPIANRMNWYLRALV 307
Query: 163 GLTTS----TAEELQVVNYGIGGHYEPHYDF------ARPGEANAFKSLGTGNRVATVLF 212
GL TS AE Q+ NYG GG++ PHYD+ A A+ + G+R+AT++
Sbjct: 308 GLGTSGSEYEAEAYQLANYGSGGYFLPHYDYLQDTLHAHNSTADYYLQNNEGDRLATLMI 367
Query: 213 YMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHST 272
YM+DV +GGATVF L + L P+KG AAFW NL +SG+GD T HA CPVL GS + +
Sbjct: 368 YMTDVKEGGATVFPRLGVRLVPKKGDAAFWWNLKASGEGDTLTMHAGCPVLYGSKWIANK 427
Query: 273 ----------CPCGLRRGLQRSGII 287
PC R L + ++
Sbjct: 428 WFKSYSNVFRLPCSTDRNLSLAPLV 452
>gi|195575115|ref|XP_002105525.1| GD21527 [Drosophila simulans]
gi|194201452|gb|EDX15028.1| GD21527 [Drosophila simulans]
Length = 495
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 93/229 (40%), Positives = 126/229 (55%), Gaps = 27/229 (11%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
+V E + Y + C G + P L+C YV P+L + PLK EE + P ++LY DV
Sbjct: 245 QVGEFQAYSLTCSGHWQLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDV 304
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
+Y SEID+I+K+ + RL RAT+ ++ E ++N R S+ ++ H V+ I +RV
Sbjct: 305 IYQSEIDVIRKLTKNRLMRATITSH--NESVVSNVRTSQFTFIPVTAHKVLSTIDQRVAD 362
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY---MSDV 217
MT L AE+ Q NYGIGGHY H D+ FY +SDV
Sbjct: 363 MTNLNMKYAEDHQFANYGIGGHYGQHMDW----------------------FYQTTLSDV 400
Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
AQGG T F L L P+K AAFWHNLH+SG GD T+H ACP++ GS
Sbjct: 401 AQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGS 449
>gi|292621357|ref|XP_691737.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Danio rerio]
Length = 538
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 97/228 (42%), Positives = 129/228 (56%), Gaps = 12/228 (5%)
Query: 45 REKYEMLCRGDLTVPPAIVA-QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
R YE LC+ + P L C Y P L L P++ E LQP ++L+ +
Sbjct: 292 RNAYEQLCQTKGSQPKHFENPSLFCDYFTNGSPALFLQPIRREIISLQPYVVLFHGFVTQ 351
Query: 104 SEIDLIKKMAQPRLRRATV---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
+E I+K A P LRR+ V N T E YRISKSAWL+E H V+ ++ +R+
Sbjct: 352 AEAKNIRKYAMPGLRRSVVASGMNQATAE-----YRISKSAWLKESAHEVVGKLDQRITL 406
Query: 161 MTGLTTS--TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
+TGL AE LQVVNYGIGGHYEPH+D A + ++ L TGNRVAT++ Y+S V
Sbjct: 407 VTGLNVQPPYAEYLQVVNYGIGGHYEPHFDHATSDSSPLYR-LKTGNRVATIMIYLSPVQ 465
Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
GG+T F N S+ + A FW NLH +G G+ T HA CPV+ G+
Sbjct: 466 AGGSTAFIYANFSVPVVQNAALFWWNLHKNGQGNVDTLHAGCPVIVGN 513
>gi|195341560|ref|XP_002037374.1| GM12888 [Drosophila sechellia]
gi|194131490|gb|EDW53533.1| GM12888 [Drosophila sechellia]
Length = 501
Score = 170 bits (430), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 93/229 (40%), Positives = 126/229 (55%), Gaps = 27/229 (11%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
+V E + Y + C G + P L+C YV P+L + PLK EE + P ++LY DV
Sbjct: 251 QVGEFQAYSLTCSGHWRLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDV 310
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
+Y SEID+I+K+ + RL RAT+ ++ E ++N R S+ ++ H V+ I +RV
Sbjct: 311 IYQSEIDVIRKLTKNRLMRATITSH--NESVVSNVRTSQITFIPVTAHKVLSTIDQRVAD 368
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY---MSDV 217
MT L AE+ Q NYGIGGHY H D+ FY +SDV
Sbjct: 369 MTNLNMKYAEDHQFANYGIGGHYGQHMDW----------------------FYQTTLSDV 406
Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
AQGG T F L L P+K AAFWHNLH+SG GD T+H ACP++ GS
Sbjct: 407 AQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGS 455
>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
Length = 535
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 99/255 (38%), Positives = 139/255 (54%), Gaps = 15/255 (5%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
E E YE +CRG+L P+ L+CR + Y P K EE +L P ++ V
Sbjct: 278 ESREFRMYEQVCRGELAPLPSKQRNLRCRLRKSRLGY---APFKLEELHLDPLVVQLHQV 334
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
+ + D ++K A+PR++R+TV + G A +R S+ A + + +SR V
Sbjct: 335 IGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAATKLLSRHVG 394
Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
+GL AE+LQV NYGIGGHYEPH+D + P + GNR+AT ++Y+SDV
Sbjct: 395 DFSGLNMDYAEDLQVANYGIGGHYEPHWD-SFPENHIYQEGDLHGNRMATGIYYLSDVEA 453
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
GG T F L L + PE+G+ FW+NLH SGD D+ T+HAACPVL GS + +
Sbjct: 454 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
Query: 274 ----PCGLRRGLQRS 284
PC L RG + S
Sbjct: 514 DNVRPCDLERGQEIS 528
>gi|241598362|ref|XP_002404733.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
gi|215500464|gb|EEC09958.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
Length = 340
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 90/207 (43%), Positives = 128/207 (61%), Gaps = 8/207 (3%)
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
+QL+CRY + L P+K EE L+P +I+ DV+ D +I+ + A+PRL R+T
Sbjct: 3 SQLRCRYYKGQDGFFSLQPIKLEEINLKPYVIVMHDVVQDKDIEDLMAFAEPRLERSTT- 61
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST----AEELQVVNYGI 179
Y E+ + R S +AWL E E P+ R++ + + G+ TS AE Q+ NYG
Sbjct: 62 -YTGNEMMPSPERTSSTAWLNEDEAPIAVRMNSYLRALLGMGTSDTDEEAEAYQLANYGT 120
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GGH+ PH+DF + A S+ TG+R+AT++ YM+DV +GG TVF +L + L P+KG A
Sbjct: 121 GGHFLPHHDFLQ-DSLQADNSV-TGDRLATLMIYMTDVEEGGTTVFPNLGIRLTPKKGDA 178
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
AFW NL +SGDG+ T HA CPVL GS
Sbjct: 179 AFWWNLKASGDGERLTTHAGCPVLYGS 205
>gi|195341584|ref|XP_002037386.1| GM12898 [Drosophila sechellia]
gi|194131502|gb|EDW53545.1| GM12898 [Drosophila sechellia]
Length = 536
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/229 (39%), Positives = 135/229 (58%), Gaps = 11/229 (4%)
Query: 46 EKYEMLCRGD--LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E + LCR + + ++L CRY P+L+L P + EE L P ++LY +V+ D
Sbjct: 275 ESFYQLCRSSSRRQMGESKPSRLHCRYNTTTTPFLKLAPFRMEELSLDPYVVLYHNVLSD 334
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEH-PVIERISRRV 158
EI+ +K M++P L RA V + G EIA R + AWL +P+ V+ RI RR+
Sbjct: 335 PEIEKLKPMSKPFLERAKVFRVEKGSDEIAPSRSADGAWLPHQDTDPDDLEVLRRIGRRI 394
Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
+ +TGL T + ++Q + YG GGH+ PHYD+ + + G+R+ATVLFY+++V
Sbjct: 395 KDLTGLNTRSGSQMQFLKYGFGGHFVPHYDYFNSKTSYLER---VGDRIATVLFYLNNVD 451
Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNL-HSSGDGDYYTRHAACPVLTGS 266
GGAT F LNL + +KG+A FWHNL S D D T H ACP+++G+
Sbjct: 452 HGGATAFPKLNLVVPTQKGSALFWHNLDRKSYDYDTCTFHGACPLISGT 500
>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
Length = 535
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 96/255 (37%), Positives = 139/255 (54%), Gaps = 15/255 (5%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
E E YE +CRG+L P+ L+CR + Y P K EE +L P ++ V
Sbjct: 278 ESREFRMYEQVCRGELAPLPSKQRDLRCRLWRSRLGY---APFKLEELHLDPPVVQLHQV 334
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
+ + + +++ A+PR++R+TV + G+ A +R S+ A + + +S V
Sbjct: 335 IGSKDAESLQRTARPRIKRSTVYSLAGNGDSTAAAFRTSQGASFNYSRNAATKLLSHHVG 394
Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
+GL AE+LQV NYGIGGHYEPH+D + P + GNR+AT ++Y+SDV
Sbjct: 395 DFSGLNMEYAEDLQVANYGIGGHYEPHWD-SFPDNHVYQEGDLHGNRIATAIYYLSDVEA 453
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
GG T F L L + PE+G+ FW+NLH SGD D+ T+HAACPVL GS + +
Sbjct: 454 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
Query: 274 ----PCGLRRGLQRS 284
PC L RG + S
Sbjct: 514 DNVRPCDLERGQEIS 528
>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
Length = 535
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 98/255 (38%), Positives = 139/255 (54%), Gaps = 15/255 (5%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
E E YE +CRG+L P+ L+CR + Y P K EE +L P ++ V
Sbjct: 278 ESREFRMYEQVCRGELAPLPSKQRNLRCRLRKSRLGY---APFKLEELHLDPLVVQLHQV 334
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
+ + D ++K A+PR++R+TV + G A +R S+ A + + +SR V
Sbjct: 335 IGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAATKLLSRHVG 394
Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
+GL AE+LQV NYGIGGHYEPH+D + P + GNR+AT ++Y++DV
Sbjct: 395 DFSGLNMDYAEDLQVANYGIGGHYEPHWD-SFPENHIYQEGDLHGNRMATGIYYLADVEA 453
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
GG T F L L + PE+G+ FW+NLH SGD D+ T+HAACPVL GS + +
Sbjct: 454 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
Query: 274 ----PCGLRRGLQRS 284
PC L RG + S
Sbjct: 514 DNVRPCDLERGQEIS 528
>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
Length = 409
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 97/251 (38%), Positives = 135/251 (53%), Gaps = 15/251 (5%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
E E YE +CRG+L P+ L+CR + Y P K EE +L P ++ V
Sbjct: 152 ESREFRMYEQVCRGELAPLPSKQRNLRCRLRKSRLGY---APFKLEELHLDPLVVQLHQV 208
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
+ + D ++K A+PR++R+TV + G A +R S+ A + + +SR V
Sbjct: 209 IGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAATKLLSRHVG 268
Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
+GL AE+LQV NYGIGGHYEPH+D L GNR+AT ++Y++DV
Sbjct: 269 DFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDL-HGNRMATGIYYLADVEA 327
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
GG T F L L + PE+G+ FW+NLH SGD D+ T+HAACPVL GS + +
Sbjct: 328 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 387
Query: 274 ----PCGLRRG 280
PC L RG
Sbjct: 388 DNVRPCDLERG 398
>gi|442762205|gb|JAA73261.1| Putative prolyl 4-hydroxylase alpha subunit, partial [Ixodes
ricinus]
Length = 482
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 93/235 (39%), Positives = 137/235 (58%), Gaps = 16/235 (6%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E + Y+ LCRG+L P + ++L+CRY + L P+K EE L+P I++ DV+ D
Sbjct: 221 EMQNYKRLCRGELLRTPKMDSKLRCRYYKGHGGSFTLHPIKLEEVNLKPYIVVMHDVVQD 280
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I+ ++ A+PRL+ T Y +E R S +AW+ E PV ++++ + + G
Sbjct: 281 RDIEDLRAFAEPRLQ--TSLTYDVPGVESPAVRTSSNAWMDEKNAPVATKLNKFLRSLLG 338
Query: 164 LTTS----TAEELQVVNYGIGGHYEPHYDF--------ARPGEANAFKSLGTGNRVATVL 211
+ TS AE+ Q+ NYG GGH+ H D+ P E K +G +RVAT++
Sbjct: 339 MGTSYSDGEAEKYQLANYGTGGHFLTHPDYLGDLFENDTDPSEFEFHKKVG--DRVATLM 396
Query: 212 FYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
YMSDV +GGATVF L + L P+KG AAFW NL ++G+G+ T HA CPVL GS
Sbjct: 397 IYMSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCPVLYGS 451
>gi|157114983|ref|XP_001658090.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108877085|gb|EAT41310.1| AAEL007032-PA, partial [Aedes aegypti]
Length = 448
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 90/219 (41%), Positives = 127/219 (57%), Gaps = 27/219 (12%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
YE LCRG+ PA VA L+CRY +N +L++ P K EEA L P I++Y + + D EID
Sbjct: 245 YEPLCRGEYQRTPAQVANLRCRYESKNSSFLKIAPFKLEEASLDPLIVIYHNAISDKEID 304
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
I ++++P L+R+ V ++ E++N R + + +++ +S R E MTGL
Sbjct: 305 QIIQVSKPMLKRSMVG--ESFSKEVSNERTNY-------DFELVKVLSLRTEDMTGLDRK 355
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
+ E LQV NYGIGG Y PH+D+ R E +SDV QGGATVF
Sbjct: 356 SYESLQVNNYGIGGFYLPHFDWVRTNEP------------------ISDVEQGGATVFPQ 397
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ + ++P+KG+A FW+NL G GD T H ACPVL GS
Sbjct: 398 IGVGVFPKKGSAIFWYNLLPDGTGDERTLHGACPVLLGS 436
>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
Length = 578
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 91/219 (41%), Positives = 127/219 (57%), Gaps = 6/219 (2%)
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
++ CRG P + +L C Y P+LRL P K E L P ++LY DV+ E
Sbjct: 344 QLCCRG--GCPYRDMHRLTCSYNTTAAPFLRLAPFKTELLSLAPYMVLYHDVITPLESLT 401
Query: 109 IKKMAQPRL-RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
+K +++P + RRA N + I + R S S WL E+ V+ER+ RRV MT
Sbjct: 402 LKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWLTSHENAVMERLERRVGVMTNFEME 461
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
+E Q++NYGIGGHY+PH D E + G G+R+ATVLFY+SDV QGGAT+F
Sbjct: 462 NSEVYQLINYGIGGHYKPHTDHF---ETPQLEHRGGGDRIATVLFYLSDVPQGGATLFPR 518
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
LN+S+ P +G A W+NL+ G G+ T H +CP++ GS
Sbjct: 519 LNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGS 557
>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
Length = 235
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 92/202 (45%), Positives = 127/202 (62%), Gaps = 18/202 (8%)
Query: 66 LKCRY-VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
L CRY P L P+KEEE + +P+II Y DV+ D+EI+ +K +A+P L R+
Sbjct: 26 LSCRYSTGGGNPRLMYAPVKEEELWDEPKIIRYHDVISDTEIETLKDIARPELTRS---- 81
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
+TG I+ R S+S +L E + RIS+R+ +TGL+ +AE+L V NYGIGG Y
Sbjct: 82 -QTGWGVISEIRTSQSVFLDEV--GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYT 138
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
PH+D G+ N R AT L YMSDV GGATVFT++ +++ PEKG+A FW+N
Sbjct: 139 PHFDAG--GDVN--------ERTATFLIYMSDVEVGGATVFTNVGVAVKPEKGSAVFWNN 188
Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
LH +G+ D T+HA CPVL G+
Sbjct: 189 LHKNGELDLKTKHAGCPVLVGN 210
>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
Length = 534
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 89/225 (39%), Positives = 131/225 (58%), Gaps = 12/225 (5%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
+++ C G P +L C Y P+LRL PLK E+ L P ++LY +V+ EI
Sbjct: 286 FKLSCNG----PLESSTRLHCFYNFTTTPFLRLAPLKTEQIGLDPYVVLYHEVLSAREIS 341
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANY-RISKSAWLREPEHPVIERISRRVEHMTGLTT 166
++ A ++ + +K + N R +K WL++ + + +RI+RR+ MTG
Sbjct: 342 MLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNELTKRITRRIMDMTGFDL 399
Query: 167 STAEELQVVNYGIGGHYEPH---YDFARPGEANAFK--SLGTGNRVATVLFYMSDVAQGG 221
+ +E QV+NYGIGGHY H +DFA + S+ G+R+ATVLFY++DV QGG
Sbjct: 400 ADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDLGDRIATVLFYLTDVEQGG 459
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
ATVF + + P+ GTA FW+NL + G+GD TRHAACPV+ GS
Sbjct: 460 ATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGS 504
>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
Length = 534
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 89/225 (39%), Positives = 131/225 (58%), Gaps = 12/225 (5%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
+++ C G P +L C Y P+LRL PLK E+ L P ++LY +V+ EI
Sbjct: 286 FKLSCNG----PLESSTRLHCFYNFTTTPFLRLAPLKTEQIGLDPYVVLYHEVLSAREIS 341
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANY-RISKSAWLREPEHPVIERISRRVEHMTGLTT 166
++ A ++ + +K + N R +K WL++ + + +RI+RR+ MTG
Sbjct: 342 MLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNELTKRITRRIMDMTGFDL 399
Query: 167 STAEELQVVNYGIGGHYEPH---YDFARPGEANAFK--SLGTGNRVATVLFYMSDVAQGG 221
+ +E QV+NYGIGGHY H +DFA + S+ G+R+ATVLFY++DV QGG
Sbjct: 400 ADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDLGDRIATVLFYLTDVEQGG 459
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
ATVF + + P+ GTA FW+NL + G+GD TRHAACPV+ GS
Sbjct: 460 ATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGS 504
>gi|444731524|gb|ELW71877.1| Prolyl 4-hydroxylase subunit alpha-3 [Tupaia chinensis]
Length = 562
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 111/293 (37%), Positives = 154/293 (52%), Gaps = 39/293 (13%)
Query: 4 PTHQRAQGNKLYYQEALNKS-----PELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y+ L +S E + P V P L+ R+ YE LC+ +
Sbjct: 256 PDNKRMARNILKYERLLAESSNQAVAEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 309
Query: 59 PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P + L C Y + PYL L P+++E +L+P I LY D + DSE I+ +A+P L
Sbjct: 310 PTHYQIPSLYCSYETNSSPYLLLQPVRKELIHLEPYIALYHDFVSDSEAQKIRALAEPWL 369
Query: 118 RRATVQNYKTGELEI-ANYRISK--------------------SAWLREPEHPVIERISR 156
+R+ V +GE ++ YRISK SAWL++ P++ +
Sbjct: 370 QRSVV---ASGEKQLQVEYRISKRRRLVVSGIASLMPQSVVYFSAWLKDTVDPMLVTLDH 426
Query: 157 RVEHMTGLTTST--AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYM 214
R+ +TGL AE LQVVNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+
Sbjct: 427 RIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYL 485
Query: 215 SDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
S V GGAT F N S+ K A FW NLH SG+G+ T HA CPVL G
Sbjct: 486 SSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGNSDTLHAGCPVLVGDK 538
>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
Length = 513
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 128/220 (58%), Gaps = 10/220 (4%)
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
++ CRG P + +L C Y P+LRL P K E L P ++LY DV+ E
Sbjct: 281 QLCCRGG--CPYRDMHRLTCSYNTTAAPFLRLAPFKTEILSLSPYMVLYHDVITPLESLT 338
Query: 109 IKKMAQPRL-RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
+K +++P + RRA N + I + R S S WL E+ V+ER+ RRV MT
Sbjct: 339 LKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWLTSHENAVMERLERRVGVMTNFEME 398
Query: 168 TAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+E Q++NYGIGGHY+PH D F P + G G+R+ATVLFY+SDV QGGAT+F
Sbjct: 399 NSEVYQLINYGIGGHYKPHTDHFETP------QHRGGGDRIATVLFYLSDVPQGGATLFP 452
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
LN+S+ P +G A W+NL+ G G+ T H +CP++ GS
Sbjct: 453 RLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGS 492
>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
Length = 534
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 85/207 (41%), Positives = 124/207 (59%), Gaps = 6/207 (2%)
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
+L C Y P+LRL PLK E+ L P ++LY +V+ EI ++ A ++ V
Sbjct: 299 RLHCFYNFTTTPFLRLAPLKIEQIGLDPYVVLYHEVLSAREISMLIGKAAQNMKNTRVHK 358
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
+ G + R +K W ++ + + + I+RR+ MTG + +E QV+NYGIGGHY
Sbjct: 359 -EQGVPKKNRGRTAKGFWFKKESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYL 417
Query: 185 PH---YDFARPGEANAFK--SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
H +DFA + S+ G+R+ATVLFY++DV QGGATVF + S++P+ GTA
Sbjct: 418 LHMDYFDFASSNHTDTRSGYSMDLGDRIATVLFYLTDVEQGGATVFADVGYSVYPQAGTA 477
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
FW+NL ++G GD TRHAACPV+ GS
Sbjct: 478 IFWYNLDTNGKGDPRTRHAACPVIVGS 504
>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
Length = 517
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 91/221 (41%), Positives = 128/221 (57%), Gaps = 8/221 (3%)
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
++ CRG P + +L C Y P+LRL P K E L P ++LY DV+ E
Sbjct: 281 QLCCRGG--CPYRDMHRLTCSYNTTAAPFLRLAPFKTEILSLSPYMVLYHDVITPLESLT 338
Query: 109 IKKMAQPRLRR---ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
+K +++P ++R V N K I + R S S WL E+ V+ER+ RRV MT
Sbjct: 339 LKNLSKPLMKRRAMVMVNNLKVRPF-IDSGRTSNSVWLASHENAVMERLERRVGVMTNFE 397
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
+E Q++NYGIGGHY+PH D +A + G G+R+ATVLFY+SDV QGGAT+F
Sbjct: 398 MENSEVYQLINYGIGGHYKPHTDHFETPQAPEHR--GGGDRIATVLFYLSDVPQGGATLF 455
Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
LN+S+ P +G A W+NL+ G G+ T H +CP++ GS
Sbjct: 456 PRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGS 496
>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
Length = 534
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 84/207 (40%), Positives = 124/207 (59%), Gaps = 6/207 (2%)
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
+L C Y P+LRL PLK E+ L P ++LY +V+ EI ++ A ++ V
Sbjct: 299 RLHCFYNFTTTPFLRLAPLKIEQIGLDPYVVLYHEVLSAREISMLIGKATQNMKNTRVHK 358
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
+ G + R +K W ++ + + + I+RR+ MTG + +E QV+NYGIGGHY
Sbjct: 359 -EQGVPKKNRGRTAKGFWFKKESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYL 417
Query: 185 PH---YDFARPGEANAFKS--LGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
H +DFA + S + G+R+ATVLFY++DV QGGATVF + S++P+ GTA
Sbjct: 418 LHMDYFDFASSNHTDTRSSYSMDLGDRIATVLFYLTDVEQGGATVFADVGYSVYPQAGTA 477
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
FW+NL ++G GD T+HAACPV+ GS
Sbjct: 478 IFWYNLDTNGKGDPRTKHAACPVIVGS 504
>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 533
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 93/218 (42%), Positives = 125/218 (57%), Gaps = 4/218 (1%)
Query: 51 LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
LC+G + + +L C+YV Y+ L PLK E + P I LY +++ D E I
Sbjct: 293 LCQGREKMAQKDINRLFCKYVAPKAHYI-LKPLKMEVLHHDPYIELYYELITDDEAKHII 351
Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
K A+P LRRA V + TG+L A+YR+SK+ W+ E + +I RRV +TGL AE
Sbjct: 352 KFAKPLLRRAFVHDMVTGDLIYADYRVSKNTWIAEDMDVIAAKIIRRVGDVTGLNMRYAE 411
Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-- 228
LQV NYGI G YEPH+D + F G GNR+AT+L Y+SDV GG TVFT+
Sbjct: 412 HLQVANYGIAGQYEPHFDHSTGTRPKHFDRWG-GNRIATMLLYLSDVDWGGRTVFTNTAP 470
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ P KG FW+NL +G + T+HA CPV+ G
Sbjct: 471 GVGTDPIKGAGVFWYNLLRNGKSNPKTQHAGCPVVLGQ 508
>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
Length = 535
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 97/255 (38%), Positives = 138/255 (54%), Gaps = 15/255 (5%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
E E YE +CRG+L + L+CR + Y P K EE +L P ++ V
Sbjct: 278 ESREFRMYEQVCRGELAPLSSKQRSLRCRLRKSRLGY---APFKLEELHLDPLVVQLHQV 334
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
+ ++ + ++K A+PR++R+TV + G A +R S+ A + + +S V
Sbjct: 335 IGSNDSESLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAATKLLSHHVG 394
Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
+GL AE+LQV NYGIGGHYEPH+D + P + GNR+AT ++Y+SDV
Sbjct: 395 DFSGLNMDYAEDLQVANYGIGGHYEPHWD-SFPENHIYQEGDLHGNRIATGIYYLSDVEA 453
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
GG T F L L + PEKG+ FW+NLH SGD D+ T+HAACPVL GS + +
Sbjct: 454 GGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
Query: 274 ----PCGLRRGLQRS 284
PC L RG + S
Sbjct: 514 DNVRPCDLERGQEIS 528
>gi|195110923|ref|XP_002000029.1| GI22757 [Drosophila mojavensis]
gi|193916623|gb|EDW15490.1| GI22757 [Drosophila mojavensis]
Length = 535
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 93/242 (38%), Positives = 134/242 (55%), Gaps = 12/242 (4%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
Y+ +CR +L A +L+CRY + L + K EE + P II +V+ E
Sbjct: 286 YQQVCREELMPTAAAQRELRCRYFSGHGRSLNYLAYKLEELHRDPYIIQLHEVIGAHESV 345
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
++ +A+P L+R+ V + G A +R S+ EHP+IE++S+ + ++GL
Sbjct: 346 QLQHLARPVLQRSEVYSPTNGS-TAATFRTSQGTVFEYDEHPIIEKLSQHMTLISGLDMG 404
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
AE LQ+ NYGIGGHYEPH D + + T NR+AT +FY+S+V GGAT F
Sbjct: 405 FAEPLQIANYGIGGHYEPHMDSFPESFDYSLQRFKT-NRIATGIFYLSNVEAGGATAFPF 463
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
L L + PE+G+ FW+NLH SGD DY T+HA CPVL GS + + PC L
Sbjct: 464 LPLLVKPEQGSLLFWYNLHRSGDADYRTKHAGCPVLQGSKWIANVWIRLSHQDHVRPCQL 523
Query: 278 RR 279
+R
Sbjct: 524 QR 525
>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
Length = 535
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 96/250 (38%), Positives = 132/250 (52%), Gaps = 15/250 (6%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
E E YE +CRG+L PA L+CR + Y P K EE +L P ++ V
Sbjct: 278 ESREFRMYEQVCRGELAPLPAKQRNLRCRLRKSRLGY---APFKLEELHLDPLLVQLHQV 334
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
+ + + +++ A+PR++R+TV + G A +R S+ A + +S V
Sbjct: 335 IGAKDSESLQRTARPRIKRSTVYSLAGNGGSTAAAFRTSQGASFNYSRSAATKLLSHHVG 394
Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
+GL AE+LQV NYGIGGHYEPH+D L GNR+AT ++Y+SDV
Sbjct: 395 DFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPENHVYQEGDL-HGNRIATGIYYLSDVEA 453
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
GG T F L L + PEKG+ FW+NLH SGD D+ T+HAACPVL GS + +
Sbjct: 454 GGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
Query: 274 ----PCGLRR 279
PC L R
Sbjct: 514 DKVRPCDLER 523
>gi|195159150|ref|XP_002020445.1| GL13509 [Drosophila persimilis]
gi|194117214|gb|EDW39257.1| GL13509 [Drosophila persimilis]
Length = 554
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 84/232 (36%), Positives = 137/232 (59%), Gaps = 12/232 (5%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
E++ +CR P+ +L CRY P+LRL PL+ EE L P I++Y +V+ D+E
Sbjct: 307 EEFNQICRYSHQNKPS---RLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAE 363
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE-----HPVIERISRRVEH 160
I ++++ +P L+R+ V + K ++ + R + AWL + VI+RI RR+
Sbjct: 364 IAEVERVTEPLLKRSVVFDGKENKMSTSKKRTALGAWLPDDNMDVSGRAVIQRILRRIHE 423
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
+TGL + +++Q++ YG GGHY+ H+D+ ++ + G+R+ATVLFY++DV G
Sbjct: 424 LTGLIMNDRQDMQLIKYGYGGHYDIHFDYFN---TSSPITKARGDRMATVLFYLNDVKHG 480
Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSNSLHS 271
G+T FT L L + E+G FW+N+ + D D T H ACPV+ G+ S+ S
Sbjct: 481 GSTAFTDLQLKVPSERGKVLFWYNMRGETHDLDSRTLHGACPVIDGTKSILS 532
>gi|321458081|gb|EFX69155.1| hypothetical protein DAPPUDRAFT_228756 [Daphnia pulex]
Length = 570
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 98/263 (37%), Positives = 137/263 (52%), Gaps = 32/263 (12%)
Query: 19 ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRG--DLTVPPAIVAQLKCRYVHRNVP 76
A NKSPE E E E + LCR + + P + +LKCR + P
Sbjct: 285 ATNKSPEWS-------------EWEELEVFFRLCREGEEKSRPTGLKGRLKCRQISHTHP 331
Query: 77 YLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP-RLRRATVQNYKTGELEI-AN 134
Y L PLK EE L P I ++ D M D+E ++ K +A RL R+ + + G+ + ++
Sbjct: 332 YFILRPLKLEEHSLVPYIAVFHDFMSDAETEIFKSLAMAERLERSAHGSKRPGQGGVTSD 391
Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTS----TAEELQVVNYGIGGHYEPHYD-- 188
R SK +W+ + H V+++IS+R+ GL + +E QV NYGIGG Y PH D
Sbjct: 392 KRTSKQSWVEDGSHHVVDQISKRISDSVGLNSQPSNVGSEHYQVANYGIGGRYTPHTDHG 451
Query: 189 -----FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
P E + F+ G+R+ T + Y+ DV GGATVFT + + P+KG A FW
Sbjct: 452 VLSKSMGGPSEFDLFR----GDRILTFMTYLDDVEAGGATVFTHAGVVVRPKKGMAVFWW 507
Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
NL S +GD TRH CPVL GS
Sbjct: 508 NLKSDSNGDTLTRHGGCPVLHGS 530
>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
Length = 473
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 89/203 (43%), Positives = 120/203 (59%), Gaps = 7/203 (3%)
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
A+L C Y +LRL PLK E L P ++L+ DV+ D +I I+ +A+ L RA V
Sbjct: 255 AKLHCLYNTTASYFLRLAPLKMELLSLDPYMVLFHDVVSDKDITSIRNLAKGGLVRA-VT 313
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
K G E R +K WL E +I+R+S+ + MT L A+ QV+NYGIGG+Y
Sbjct: 314 VTKDGSYEEDPARTTKGTWLVE-NSKLIQRLSQLAQDMTNLDIRDADPFQVLNYGIGGYY 372
Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
H+DF E F NR+AT +FY+SDV QGGAT+F L LS++P+KG+A W+
Sbjct: 373 GTHFDFLADTEMGNF-----SNRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWY 427
Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
NL GDGD T H+ACP + GS
Sbjct: 428 NLDHKGDGDNRTAHSACPTIVGS 450
>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
Length = 535
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 96/255 (37%), Positives = 138/255 (54%), Gaps = 15/255 (5%)
Query: 41 EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
E E YE +CRG+L P+ L+CR + Y P K EE +L P ++ V
Sbjct: 278 ESREFRMYEQVCRGELAPLPSKQRSLRCRLRKSRLGY---APFKLEELHLDPLVVQLHQV 334
Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
+ ++ + ++K A+P ++R+TV + G A +R S+ A ++ + +S V
Sbjct: 335 IGSNDSESLQKSARPMIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSKNAATKLLSHHVG 394
Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
+ L AE+LQV NYGIGGHYEPH+D + P + GNR+AT ++Y+SDV
Sbjct: 395 DFSDLNMDYAEDLQVANYGIGGHYEPHWD-SFPENHIYQEGDLHGNRIATGIYYLSDVEA 453
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
GG T F L L + PEKG+ FW+NLH SGD D+ T+HAACPVL GS + +
Sbjct: 454 GGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
Query: 274 ----PCGLRRGLQRS 284
PC L RG + S
Sbjct: 514 DNVRPCDLERGQEIS 528
>gi|449485593|ref|XP_004175686.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3 [Taeniopygia guttata]
Length = 567
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 89/205 (43%), Positives = 123/205 (60%), Gaps = 7/205 (3%)
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
L C Y N P+L L P K+E ++QP + LY D + D+E + IK +A P L+R+ V
Sbjct: 342 HLSCSYETNNSPFLLLQPAKKEMVWIQPHVALYHDFITDAEAETIKGLAGPWLQRSVV-- 399
Query: 125 YKTGE-LEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT--STAEELQVVNYGIGG 181
+GE + A Y ISKS WL++ PV+ + +R+ +TGL AE LQVVNYG+GG
Sbjct: 400 -ASGEKQQKAEYWISKSTWLKDTVDPVVHALDQRIIAVTGLDLWPPYAEYLQVVNYGLGG 458
Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
HYEPH+D A ++ ++ + +GNR ATV+ Y+S V GG+T N S+ K A F
Sbjct: 459 HYEPHFDHATSTKSPLYR-MKSGNRNATVMIYLSAVEAGGSTALIYTNFSVPVVKNAALF 517
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
W NL +G+GD T HA CPVL G
Sbjct: 518 WWNLRRNGNGDGDTLHAGCPVLAGD 542
>gi|198449504|ref|XP_002136909.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
gi|198130636|gb|EDY67467.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
Length = 527
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 135/232 (58%), Gaps = 12/232 (5%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
E++ +CR P+ +L CRY P+LRL PL+ EE L P I++Y +V+ D+E
Sbjct: 280 EEFNQICRSSHQNKPS---RLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAE 336
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE-----HPVIERISRRVEH 160
I ++++ +P L+R+ V + K ++ + R + AWL + VI+RI RR+
Sbjct: 337 IAEVERVTEPLLKRSVVFDGKGNKMSTSKRRTALGAWLPDDNMDVSGRAVIQRIFRRIHE 396
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
+TGL + +++Q++ YG GGHY+ H+D+ + G+R+ATVLFY++D+ G
Sbjct: 397 LTGLIINDRQDMQLIKYGYGGHYDIHFDYFNTSTP---ITKARGDRMATVLFYLNDMKHG 453
Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSNSLHS 271
G+T FT L L + E+G FW+N+ + D D T H ACPV+ G+ ++ S
Sbjct: 454 GSTAFTDLQLKVPSERGKVLFWYNMRGETHDVDSRTLHGACPVINGTKTILS 505
>gi|198449508|ref|XP_002136911.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
gi|198130638|gb|EDY67469.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
Length = 516
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 137/232 (59%), Gaps = 12/232 (5%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
E++ +CR P+ +L CRY P+LRL PL+ EE L P I++Y +V+ D+E
Sbjct: 269 EEFNQICRSWHQNKPS---RLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLCDAE 325
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE-----HPVIERISRRVEH 160
I ++++ +P L+R+ V + K ++ + R + AWL + VI+RI RR+
Sbjct: 326 IAEVERVTEPLLKRSVVFDGKENKMSTSKKRTALGAWLPDDNMDVSGRAVIQRIFRRIHE 385
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
+TGL + +++Q++ YG GGHY+ H+D+ ++ + G+R+ATVLFY++DV G
Sbjct: 386 LTGLIINDRQDMQLIKYGYGGHYDIHFDYFN---TSSPITKARGDRMATVLFYLNDVKHG 442
Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSNSLHS 271
G+T FT L L + E+G FW+N+ + D D T H ACPV+ G+ ++ S
Sbjct: 443 GSTAFTDLQLKVPSERGKVLFWYNMRGETHDLDSRTLHGACPVIDGTKTILS 494
>gi|195159146|ref|XP_002020443.1| GL13510 [Drosophila persimilis]
gi|194117212|gb|EDW39255.1| GL13510 [Drosophila persimilis]
Length = 527
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 135/232 (58%), Gaps = 12/232 (5%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
E++ +CR P+ +L CRY P+LRL PL+ EE L P I++Y +V+ D+E
Sbjct: 280 EEFNQICRSWHQNKPS---RLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAE 336
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE-----HPVIERISRRVEH 160
I ++++ +P L+R+ V + K ++ + R + AWL + VI+RI RR+
Sbjct: 337 IAEVERVTEPLLKRSVVFDGKENKMSTSKKRTALGAWLPDDNMDVSGRAVIQRIFRRIHE 396
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
+TGL + +++Q++ YG GGHY+ H+D+ + G+R+ATVLFY++D+ G
Sbjct: 397 LTGLIINDRQDMQLIKYGYGGHYDIHFDYFNTSTP---ITKARGDRMATVLFYLNDMKHG 453
Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSNSLHS 271
G+T FT L L + E+G FW+N+ + D D T H ACPV+ G+ ++ S
Sbjct: 454 GSTAFTDLQLKVPSERGKVLFWYNMRGETHDLDSRTLHGACPVINGTKTILS 505
>gi|443712762|gb|ELU05926.1| hypothetical protein CAPTEDRAFT_153364 [Capitella teleta]
Length = 491
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 96/230 (41%), Positives = 136/230 (59%), Gaps = 17/230 (7%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVH-RNVPYLRLMPLKEEEAY-LQPRIILYRDVMYDS 104
KY+ LCRGD+ V + + L CRY R++P +P+ +EE + + P + ++ DV+ D+
Sbjct: 239 KYQELCRGDMIVEESKKSLLYCRYAKGRDIP----LPIYKEEVHNVDPHVAIFYDVISDA 294
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
E D I + A P + R V N + + ++ RISK WL + +I+++S R+ +TGL
Sbjct: 295 EADHIIRHAFPGMFRGLVGN--STLRQSSDQRISKVGWLFDNVDTLIKKLSARIGDVTGL 352
Query: 165 TT------STAEELQVVNYGIGGHYEPHYDFARPGE--ANAFKSL-GTGNRVATVLFYMS 215
T S E +QVVNYGIGG YEPH DF E N SL TG+R++T LFY+S
Sbjct: 353 NTVYTPVRSPVEAMQVVNYGIGGQYEPHLDFYEDPEMLKNVNPSLQDTGDRISTFLFYLS 412
Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
V GGATVF LN+ + P K AAFW+N +G+ D T HA CPV+ G
Sbjct: 413 RVHLGGATVFPKLNVRVPPVKNGAAFWYNARPNGEHDKRTLHAGCPVVLG 462
>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
Length = 386
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 85/203 (41%), Positives = 120/203 (59%), Gaps = 7/203 (3%)
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
A+L C Y + +LRL PLK E L P ++L+ DV+ D +I I+ + + +L R TV
Sbjct: 168 AKLYCLYKTTSSYFLRLAPLKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLAR-TVT 226
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
K G R +K WL E + +I+R+S+ + MT A+ QV+NYGIGG Y
Sbjct: 227 VSKDGNYTEDPDRTTKGTWLVE-NNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFY 285
Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
H+DF E + F +R+AT +FY+SDV QGGAT+F L LS++P+KG+A W+
Sbjct: 286 GIHFDFLEDAELDNF-----SDRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWY 340
Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
NL GDGD T H+ACP + GS
Sbjct: 341 NLDHKGDGDNRTAHSACPTVVGS 363
>gi|195159297|ref|XP_002020518.1| GL13472 [Drosophila persimilis]
gi|194117287|gb|EDW39330.1| GL13472 [Drosophila persimilis]
Length = 526
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 95/261 (36%), Positives = 137/261 (52%), Gaps = 19/261 (7%)
Query: 14 LYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHR 73
LY + + + + ++P K++ E E Y + C G + L+C Y+
Sbjct: 230 LYESKTIEEHAPIPEDPSKLD---------EFEAYRLTCSGHSRLTAREQRHLRCGYMTE 280
Query: 74 NVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIA 133
P+L L PLK EE P ++LY DV+Y SEID+I+++ R+ RA V T + ++
Sbjct: 281 THPFLLLAPLKAEELSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMVT--LTNQSTVS 338
Query: 134 NYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARP 192
N R S+ ++ + EH V++ I RRV MT L AE+ Q NYGIGGHY H D F
Sbjct: 339 NVRTSQITFIAKTEHEVLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWFTET 398
Query: 193 GEANAF-KSLGTGNRVATVLFYMSDVAQGGATVFTS------LNLSLWPEKGTAAFWHNL 245
N S GNR+ATVLFY + + ++ L L +K AAFWHNL
Sbjct: 399 TFDNGLVSSTEMGNRIATVLFYNISLNSSRMWLMSAALTCPYLKQHLRLKKYAAAFWHNL 458
Query: 246 HSSGDGDYYTRHAACPVLTGS 266
H++G GD T+H ACP++ GS
Sbjct: 459 HAAGRGDARTQHGACPIIAGS 479
>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
Length = 509
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 85/203 (41%), Positives = 120/203 (59%), Gaps = 7/203 (3%)
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
A+L C Y + +LRL PLK E L P ++L+ DV+ D +I I+ + + +L R TV
Sbjct: 291 AKLYCLYKTTSSYFLRLAPLKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLAR-TVT 349
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
K G R +K WL E + +I+R+S+ + MT A+ QV+NYGIGG Y
Sbjct: 350 VSKDGNYTEDPDRTTKGTWLVE-NNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFY 408
Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
H+DF E + F +R+AT +FY+SDV QGGAT+F L LS++P+KG+A W+
Sbjct: 409 GIHFDFLEDAELDNF-----SDRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWY 463
Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
NL GDGD T H+ACP + GS
Sbjct: 464 NLDHKGDGDNRTAHSACPTVVGS 486
>gi|194751825|ref|XP_001958224.1| GF23629 [Drosophila ananassae]
gi|190625506|gb|EDV41030.1| GF23629 [Drosophila ananassae]
Length = 523
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/226 (38%), Positives = 130/226 (57%), Gaps = 20/226 (8%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
L CRY P+LRL PLK EE L P +++Y +V+Y++EI+ +KK Q + +
Sbjct: 306 LFCRYNFTTTPFLRLAPLKLEEINLDPYVVMYHEVLYETEIEELKK--QSGHMKNGYADQ 363
Query: 126 KTGELEIANYR--ISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
K G + YR +++ +W + E P ERI+RR+ MTGL + LQV NYG G ++
Sbjct: 364 KNGTM----YRAVVARHSWWSD-ESPTRERINRRIRDMTGLDFPITDTLQVANYGCGTYF 418
Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
+PH+D+ G + G+R+ T++FY SDV QGGATVF + +S+ P KG++ FW+
Sbjct: 419 KPHFDYTSDGYETP-NADALGDRLGTIIFYASDVLQGGATVFPDIKVSITPRKGSSVFWY 477
Query: 244 NLHSSGDGDYYTRHAACPVLTG-----SNSLH-----STCPCGLRR 279
NL+ G D +RH+ CPV+ G + +H PCG R+
Sbjct: 478 NLYDDGRPDIRSRHSVCPVINGDRWTLTKWIHIFPQMFIIPCGPRK 523
>gi|443721482|gb|ELU10773.1| hypothetical protein CAPTEDRAFT_174752 [Capitella teleta]
Length = 525
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 99/244 (40%), Positives = 141/244 (57%), Gaps = 23/244 (9%)
Query: 38 PTLEVTEREKYEMLCRGDLTVPPAIVAQ---LKCRYVHRNVPYLRLMPLKEEEAYLQPRI 94
P L+ T+ YE LCRG+ P + ++ LKCRY +P++R KEE +P I
Sbjct: 264 PKLKSTK--AYEALCRGEQLKLPDVDSEQQALKCRYKPGILPFVRY---KEEMLNRKPHI 318
Query: 95 ILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY-RISKSAWLREPE-HPVIE 152
+L+ DVM D+E +K A +L RA V + + A+ RIS+ +WL + + I
Sbjct: 319 VLFHDVMSDAEAKTMKMEAMHKLERAHVADNENKHGHSASAKRISQVSWLWDDHANKTIH 378
Query: 153 RISRRVEHMTGLTTS------TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL----- 201
++SRRV +TGL T +AE Q++NYGIGG YEPH D+ +++ SL
Sbjct: 379 QLSRRVADITGLQTGVVSGLHSAEPFQILNYGIGGQYEPHVDYFAGNHSHS--SLPEHVR 436
Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACP 261
+GNR+AT +FY++DV GGATVF L + + P K AAFW+N+ +GD D T HA CP
Sbjct: 437 ASGNRLATFMFYLNDVHAGGATVFPKLKVGIPPTKNGAAFWYNIGLNGDVDPLTEHAGCP 496
Query: 262 VLTG 265
VL G
Sbjct: 497 VLLG 500
>gi|301626782|ref|XP_002942567.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Xenopus
(Silurana) tropicalis]
Length = 716
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 100/268 (37%), Positives = 140/268 (52%), Gaps = 30/268 (11%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTL-EVTEREKYEMLCRGDLTVPPAI 62
P + R N Y++ L +SP + ++ P + + R+ YE LC+ + P +
Sbjct: 449 PDNGRLARNIAKYEQILYESPTADADAEEMKLQRPNVTHLKTRDLYEGLCQTLGSQPTSY 508
Query: 63 V-AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ C Y + PYL L P+K+E L+P+++LY D + D E + IK++A P L R+
Sbjct: 509 EDPHMSCMYDTNSHPYLLLQPMKKEIVSLRPQVVLYHDFVSDLEAEKIKELASPWLHRSV 568
Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYG 178
V +GE + A YRISKSAWL++ HP ++ + R+ +TGL AE LQVVNYG
Sbjct: 569 V---ASGEKQAEAEYRISKSAWLKDTIHPFVQNLDTRISGVTGLNAHPPYAEYLQVVNYG 625
Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
IGGHYEPH+D A +S V GG+T F N S K
Sbjct: 626 IGGHYEPHFDHAT----------------------LSHVDLGGSTAFVFANFSSPVVKNA 663
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGS 266
A FW NLH +G GD T HA CPV+ GS
Sbjct: 664 AVFWWNLHRNGLGDEDTLHAGCPVIIGS 691
>gi|449668268|ref|XP_002154169.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 531
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 93/227 (40%), Positives = 131/227 (57%), Gaps = 11/227 (4%)
Query: 47 KYEMLCRGDL---TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
+Y CR D T+ V L C Y N P L L PLK + P ++++ +++ +
Sbjct: 295 RYARACRRDQRTKTIAVKDVNNLVCFY-KNNKPRLILKPLKVTRMHDNPDVLVFHEMITE 353
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRR----VE 159
+ I+ +A PRLR + V + + A+YR+SK+ + + +E ISR+ VE
Sbjct: 354 EVAEKIRDVANPRLRPSEVIDPIIQKHVTASYRVSKNVFFDDAFEEELE-ISRKLRPLVE 412
Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
T L +E+LQV NYG+GG YE H DF PG + GNR+AT+L Y+SDV +
Sbjct: 413 DATDLNDDFSEQLQVNNYGLGGQYEFHVDFGDPG--SPLDKHEHGNRIATLLIYLSDVER 470
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
GG TVFT L LSL P+ G AAFWHNL+ +G G Y T HA+CPV++GS
Sbjct: 471 GGDTVFTRLGLSLKPKLGDAAFWHNLYKNGSGIYATEHASCPVVSGS 517
>gi|195575139|ref|XP_002105537.1| GD21537 [Drosophila simulans]
gi|194201464|gb|EDX15040.1| GD21537 [Drosophila simulans]
Length = 536
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 94/244 (38%), Positives = 139/244 (56%), Gaps = 14/244 (5%)
Query: 34 NNVAPTLEVTEREK---YEMLCRGD--LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEA 88
N P++ + RE + LCR + + ++L CRY P+LRL P + EE
Sbjct: 260 NTPKPSINLESRESDESFNQLCRSSSRRQMGESKPSRLHCRYNTTTTPFLRLAPFRMEEL 319
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----R 144
L P ++ Y +V+ D EI+ +K M++P L RA V + G EIA R + AWL
Sbjct: 320 SLDPYVVFYHNVLSDPEIEKLKPMSEPFLERAKVFRVEKGSDEIAPTRSADGAWLPHQDT 379
Query: 145 EPEH-PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
+P+ V+ RI RR+ +TGL T + ++Q + YG GGH+ PHYD+ + +
Sbjct: 380 DPDDLEVLRRIGRRIRDITGLNTRSGSQMQFLKYGFGGHFVPHYDYFNSKTSYLER---V 436
Query: 204 GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL-HSSGDGDYYTRHAACPV 262
G+R+ATVLFY+++V GGAT F LNL + +KG+A FWHNL S D D T H ACP+
Sbjct: 437 GDRMATVLFYLNNVDHGGATAFPKLNLVVPTQKGSALFWHNLDRKSYDYDTRTSHGACPL 496
Query: 263 LTGS 266
++G+
Sbjct: 497 ISGT 500
>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
Length = 187
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 107/158 (67%), Gaps = 3/158 (1%)
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
MA PR+ R+TV G+L+ + +R+SK+AWL HP + + R ++ TGL T+ E+
Sbjct: 1 MAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATGLDTTFCEQ 60
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GGHYEPH+DF R + N + + GNR+AT +FY+S+V QGGAT F L+++
Sbjct: 61 LQVANYGVGGHYEPHWDFFR--DPNHYPA-EEGNRIATAIFYLSEVEQGGATAFPFLDIA 117
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
+ P+ G FW+NLH S D DY T+HA CPVL GS +
Sbjct: 118 VKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWI 155
>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
Length = 517
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 90/218 (41%), Positives = 132/218 (60%), Gaps = 15/218 (6%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG+ P L C Y + P+LR+ P K E P + Y DV+ DSEI+ +K
Sbjct: 288 CRGEYKPPKG----LSCYYEYGADPFLRIAPFKVELLNRSPYVAAYYDVLNDSEIEELKL 343
Query: 112 MAQPRLRRATVQNYKTGELEIANY-RISKSAWLREPEHPVIERISRRVEHMTGL--TTST 168
M+ P++RR+ + N+ T +++ A+ R S S ++ E ++E IS+R MT L T +
Sbjct: 344 MSSPQIRRSLLYNH-TLDIDQADVDRTSNSVFMEETGITLLETISQRAADMTDLYVTAIS 402
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
+E+LQV+NYG+GG Y PH D+ N G+R+ATVLFY++DV QGGATVF L
Sbjct: 403 SEDLQVINYGLGGQYTPHCDYFDENAEN-------GDRLATVLFYLTDVQQGGATVFPFL 455
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
LS +P+KG+A + NL ++ GD + H+ACPVL G+
Sbjct: 456 RLSYFPKKGSALIFRNLDNAMSGDKDSTHSACPVLFGN 493
>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
Length = 460
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 126/234 (53%), Gaps = 24/234 (10%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C +P L CRY P+L + PLK EE P +++Y DV+Y++EI+ +
Sbjct: 233 CSAKFRLPN----HLHCRYNSSTSPFLHIAPLKMEEISTDPYMVVYHDVIYENEINWL-- 286
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ R + V GE +I+ R S+ V+ I +R++ MTGL+ +E+
Sbjct: 287 LDNSDFRTSLV-----GESQISTLRTSQDMPFGANSGEVMRNIEKRIKDMTGLSMDLSED 341
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
++NYGIGG Y+ HYDF E F G R+ TVLFY+ DV G+TVF LN+S
Sbjct: 342 FMLINYGIGGTYKMHYDFYVYSEPLRFLR---GERIVTVLFYLGDVELSGSTVFPFLNIS 398
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS--------NSLHST--CPC 275
+ P+KG+A W+NLH+SGD T+H ACPV+ GS N LH T PC
Sbjct: 399 ITPKKGSAVMWYNLHNSGDVHQKTQHCACPVVVGSKYVLTKWINELHQTFITPC 452
>gi|443719426|gb|ELU09607.1| hypothetical protein CAPTEDRAFT_229373 [Capitella teleta]
Length = 576
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 96/252 (38%), Positives = 141/252 (55%), Gaps = 30/252 (11%)
Query: 48 YEMLCRGD-LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
Y LCRG+ P +V L C Y + +PYLR EE PRI L +V+ + +I
Sbjct: 324 YMKLCRGEHFDRDPEVVKALYCTYRYGILPYLRY---NEEIFNFNPRIALIYNVIKNRDI 380
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
+++K A L + V + + +++N RISK++WL + E I ++S++V +TGL+T
Sbjct: 381 NMLKDKATAGLSSSRVGD--PAKSKLSNERISKTSWLWDTEDERIFKLSKQVADITGLST 438
Query: 167 ------STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-----GTGNRVATVLFYMS 215
S AE Q+VNYGIGG Y+PH+D+ E + +++ TG+RVAT +FY+S
Sbjct: 439 QYSTLHSHAEPFQLVNYGIGGQYQPHFDYY---ENDMLRNVPAFIQDTGDRVATFMFYLS 495
Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC-- 273
V GGATVF L++ + KG AAFW N+ SGD + T+HA CPVL G + +
Sbjct: 496 SVKAGGATVFPKLHVRIPAVKGAAAFWFNIRRSGDREPLTQHAGCPVLLGEKWVANKWIR 555
Query: 274 --------PCGL 277
PCGL
Sbjct: 556 ELGQEYNRPCGL 567
>gi|380805043|gb|AFE74397.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor, partial
[Macaca mulatta]
Length = 128
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 71/128 (55%), Positives = 97/128 (75%)
Query: 76 PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
P L + P KEE+ + P I+ Y DVM D EI+ IK++A+P+L RATV++ KTG L +A+Y
Sbjct: 1 PQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASY 60
Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
R+SKS+WL E + PV+ R++RR++H+TGLT TAE LQV NYG+GG YEPH+DF+R E
Sbjct: 61 RVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDER 120
Query: 196 NAFKSLGT 203
+ FK LGT
Sbjct: 121 HTFKHLGT 128
>gi|195452744|ref|XP_002073481.1| GK14140 [Drosophila willistoni]
gi|194169566|gb|EDW84467.1| GK14140 [Drosophila willistoni]
Length = 454
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 81/215 (37%), Positives = 124/215 (57%), Gaps = 5/215 (2%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C G+ V QL C Y ++ +LR+ P+K E L P I+L DV+ SE + +K
Sbjct: 223 CSGNCEVDREF--QLFCLYNTKDAYFLRIAPVKMEILSLNPYIVLCHDVILPSEQEFLKT 280
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ RL A + E+ R SK+ WL++ V R+S +E ++ L ++ +
Sbjct: 281 QSSKRLEGARALDQVKNEVVFNFIRTSKATWLKKNSDNVTRRLSHWIEDVSNLDSNIGDL 340
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
Q++NYG+GG +E H D R E + +K L +R+AT +FY+ DV QGGAT+F +LNL+
Sbjct: 341 YQIINYGVGGLFEAHSDTMRKDE-DRWKVL--YDRIATFIFYLQDVPQGGATLFNNLNLT 397
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
++P+ G A FW NL ++GD D +T H CPV+ GS
Sbjct: 398 VFPKAGAALFWFNLDNAGDTDLFTVHTGCPVIVGS 432
>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 267
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 91/237 (38%), Positives = 130/237 (54%), Gaps = 15/237 (6%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRY-VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMY 102
E +Y +C D V P ++L C+ P+L L P K E PRI+++ D +
Sbjct: 10 ESAEYMSMCVADGDVRPR-QSKLLCKISTIGGHPFLVLQPFKIEVLSEDPRIVVFPDFLN 68
Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
E ++ + ++Q +L RA V E + R +K AW+ + HP++ ++SRR+ T
Sbjct: 69 PRECEIFRSISQEKLSRAKVYLGGPPEGGFSLRRTNKVAWMSDDLHPLLGKVSRRIALAT 128
Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
GLT ++AE QV NYG+GGHY PH D+A GEA +GNR+AT+L Y++DVA GGA
Sbjct: 129 GLTLTSAEMYQVANYGLGGHYIPHPDYAGFGEAQGDIYKSSGNRLATMLIYLADVAGGGA 188
Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGD-------------GDYYTRHAACPVLTGS 266
T F ++ L++ P GTA FW+NL GD T H CPVLTGS
Sbjct: 189 TAFINMRLAVKPTLGTALFWYNLKPYDGPIVNESFWNQRRFGDPRTFHMGCPVLTGS 245
>gi|198477150|ref|XP_002136737.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
gi|198145042|gb|EDY71754.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 115/203 (56%), Gaps = 7/203 (3%)
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
++L C Y +LRL PLK E L P ++LY DV+ D E+ L+K MAQ L RA
Sbjct: 291 SRLYCLYNTTATAFLRLAPLKMELLSLDPYVVLYHDVLADREMSLLKLMAQRDLVRAVTY 350
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
N + R +K+ WL +P H +I R+ E M+ L +E+ QV+NYGIGGHY
Sbjct: 351 NATEKKHSEDPNRTTKAGWL-DPSHNLIRRMGILTEDMSNLDLERSEDFQVLNYGIGGHY 409
Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
H DF +RVAT+LFY+SDV GGATVF L+LS++P+KG W+
Sbjct: 410 AVHPDFFEGSNPE------LPDRVATLLFYLSDVPLGGATVFPLLDLSVFPKKGAVLMWY 463
Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
NL G G T H+ACPV+ GS
Sbjct: 464 NLDHKGQGMEKTIHSACPVVVGS 486
>gi|195064500|ref|XP_001996577.1| GH12091 [Drosophila grimshawi]
gi|193895397|gb|EDV94263.1| GH12091 [Drosophila grimshawi]
Length = 521
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 91/287 (31%), Positives = 148/287 (51%), Gaps = 31/287 (10%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKY--EMLCRGDLTV- 58
+F TH L + L S ++ KVN++ TL T+ + Y E+L + D +
Sbjct: 220 LFKTHM------LLAMQILQASMNPEEAHEKVNDIFKTLSSTDLDSYVNELLNQDDDQLF 273
Query: 59 ----------PPAIVA---------QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRD 99
P V L CRY P+LRL PLK EE P +++Y +
Sbjct: 274 MELQSMQPIATPEFVGCRGHFPKRHNLSCRYNFTTTPFLRLAPLKLEEINHDPYVVMYHN 333
Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVE 159
V+YDSEI+ +K+++ P+++ + YK + ++ + ++ WL E P +ER+++R+
Sbjct: 334 VIYDSEIEEMKRLS-PQMQNGYIHGYKANQTKVTDI-AARVNWLVE-NTPFLERMNQRIT 390
Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
MTG +QV N+GIG ++E HYD+ G+R+A+++FY SDV
Sbjct: 391 DMTGFDLKEFPSVQVANFGIGNNFEAHYDYIFGKRVRKEDVGDLGDRLASIIFYSSDVPL 450
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
GGATVF + +++ P+KG + W+NL G D + H+ CPV+ GS
Sbjct: 451 GGATVFPDIQVAVQPQKGNSLLWYNLFDDGTPDPRSLHSVCPVVVGS 497
>gi|198466403|ref|XP_002135183.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
gi|198150584|gb|EDY73810.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 84/223 (37%), Positives = 124/223 (55%), Gaps = 16/223 (7%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
YE+ CRG P +L CRY P+LRL PLK EE P I++Y +V+ D EI+
Sbjct: 296 YEIGCRG--LFPKR--TKLVCRYNFTTTPFLRLAPLKMEEVNHDPYIVMYHEVLSDREIE 351
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRI----SKSAWLREPEHPVIERISRRVEHMTG 163
+K R + N + E + +I + W RE + + ER++RR+ MT
Sbjct: 352 EMKG------RSGQMSNGWADQKEANSTKIRDIVCRHTWWRE-QSAIKERVNRRISDMTN 404
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
E+LQV NYG+G H++PHYD+ G L G+R+ +++FY SDV QGGAT
Sbjct: 405 FDFPPQEDLQVANYGLGTHFKPHYDYTSDGYETP-DVLTLGDRLGSIIFYASDVPQGGAT 463
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
VF +S++P KG++ FW+NL+ G D ++H+ CPV+ G
Sbjct: 464 VFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGD 506
>gi|195390805|ref|XP_002054058.1| GJ23004 [Drosophila virilis]
gi|194152144|gb|EDW67578.1| GJ23004 [Drosophila virilis]
Length = 446
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 86/215 (40%), Positives = 122/215 (56%), Gaps = 16/215 (7%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C L P + L CRY + P+LR+ PLK EE + P ++LY +V+YDSEI+
Sbjct: 222 CAASLQRP----SHLHCRYNNWTTPFLRIAPLKMEELSIDPFVVLYHNVIYDSEIEWF-- 275
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ Q + +Y + +R K+ ++ + +++ I RV M+GL+ +++
Sbjct: 276 LTQSFDYTPALLDYGG----FSAHRSGKNVFIELEKGELVKTIEMRVTDMSGLSMEGSDD 331
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
L ++NYGIGGHY PH+D E T +R+AT LFY+SDV GGAT F LNL+
Sbjct: 332 LSLINYGIGGHYIPHHDSFSEEENK------TEDRIATALFYLSDVELGGATTFPLLNLT 385
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ PEKGTA WHNL SG T HAACPV+ GS
Sbjct: 386 ISPEKGTAVLWHNLKDSGTPHPKTVHAACPVIVGS 420
>gi|195440206|ref|XP_002067933.1| GK11220 [Drosophila willistoni]
gi|194164018|gb|EDW78919.1| GK11220 [Drosophila willistoni]
Length = 459
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 88/239 (36%), Positives = 135/239 (56%), Gaps = 8/239 (3%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E Y + CRG L +PP +L CRY P+LRL PLK+EE L P I++Y DV++D
Sbjct: 220 EMTSYHLGCRG-LFLPPG---KLVCRYNFTTSPFLRLAPLKQEEINLDPYIVVYHDVLHD 275
Query: 104 SEIDLIKK-MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
EI +K+ MA + A ++ K + ++ I + +WL + + ++ +++R+ MT
Sbjct: 276 REIAQMKEEMANAHISNAWIEERKANQSQMRQV-IGRVSWLTDSSN-FMDSVNQRIMDMT 333
Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
G + E LQV NYG G +++PHYD+ G L G+R+A+V+FY S+V GGA
Sbjct: 334 GFSMKGIESLQVCNYGPGCNFKPHYDYMAEGYEPP-NILTLGDRLASVIFYASEVHLGGA 392
Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGL 281
TVF L++++ P+KG W+N + D ++HA CP L GS T GL + L
Sbjct: 393 TVFPRLDVAITPKKGAGLVWYNTYDDSTHDQRSQHAVCPTLMGSRWSKKTPHQGLEKCL 451
>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
Length = 511
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 81/230 (35%), Positives = 122/230 (53%), Gaps = 13/230 (5%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
+E+ CRG ++ C Y ++ +LRL P+K E L P ++++ DV+ EID
Sbjct: 279 FEIGCRGQYVQQSGLM----CTYKSKSPAFLRLAPIKMEVLVLDPLVVIFHDVLSSREID 334
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
++++A+P L R+ V Y+ +RIS W+ + + RI RR+ M L
Sbjct: 335 GLQEIARPHLERSMVVKYRANVQ--GKHRISAGTWVERKYNNLTWRIERRIADMVDLNLE 392
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
+E V+NYGIGG Y+ H+DF NR+ATVLFYM+DV QGGATVF
Sbjct: 393 GSEPFYVINYGIGGQYKAHWDFFGADTVE-------DNRLATVLFYMNDVEQGGATVFPR 445
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGL 277
L ++ ++G A FW+N+ +G D T H CP+L GS + + L
Sbjct: 446 LGQTVRAKRGNALFWYNMQHNGTVDDRTLHGGCPILVGSKWIFTQWISDL 495
>gi|195166681|ref|XP_002024163.1| GL22882 [Drosophila persimilis]
gi|194107518|gb|EDW29561.1| GL22882 [Drosophila persimilis]
Length = 534
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 83/223 (37%), Positives = 123/223 (55%), Gaps = 16/223 (7%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
YE+ CRG +V CRY P+LRL PLK EE P I++Y +V+ D EI+
Sbjct: 296 YEIGCRGLFPKRTNLV----CRYNFTTTPFLRLAPLKMEEVNHDPYIVMYHEVLSDREIE 351
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRIS----KSAWLREPEHPVIERISRRVEHMTG 163
+K R + N + E + +I + W RE + + ER++RR+ MT
Sbjct: 352 EMKG------RSGQMSNGWADQKEANSTKIRDIVCRHTWWRE-QSAIKERVNRRISDMTN 404
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
E+LQV NYG+G H++PHYD+ G L G+R+ +++FY SDV QGGAT
Sbjct: 405 FDFPPQEDLQVANYGLGTHFKPHYDYTSDGYETP-DVLTLGDRLGSIIFYASDVPQGGAT 463
Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
VF +S++P KG++ FW+NL+ G D ++H+ CPV+ G
Sbjct: 464 VFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGD 506
>gi|313217217|emb|CBY38368.1| unnamed protein product [Oikopleura dioica]
gi|313239835|emb|CBY17758.1| unnamed protein product [Oikopleura dioica]
Length = 521
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 91/228 (39%), Positives = 132/228 (57%), Gaps = 11/228 (4%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPY--LRLMPLKEEEAYLQPRIILYRDVMYD 103
++YE LCR P + LKC Y P L+ P+K EE + P ++ + +V+ D
Sbjct: 268 QEYERLCR---EFSPPHKSNLKCFYWTGPSPLSPLQWAPVKTEELHGDPLVVQFYEVISD 324
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE----HPVIERISRRVE 159
E I+ +A L RAT+Q+ TG+L A+YRI K+AWL E E + I + + ++
Sbjct: 325 EEERAIQFLAGEHLNRATIQDPATGKLVNADYRIQKTAWLTEFEKLDVNGTIAKYNEKLT 384
Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDF-ARPGEANAFKSLGTGNRVATVLFYMSDVA 218
+TGL AE +QV NYG+ G YEPH+D + PG N + + G+R+AT L YMS+
Sbjct: 385 KITGLDADYAELVQVGNYGVAGQYEPHWDHQSYPGAENRWDPI-EGSRIATWLAYMSEPN 443
Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
GG TVF + P + +A FW+NL SG+ D T+HAACPVL+G+
Sbjct: 444 MGGGTVFIQAGIQARPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGT 491
>gi|195379216|ref|XP_002048376.1| GJ13933 [Drosophila virilis]
gi|194155534|gb|EDW70718.1| GJ13933 [Drosophila virilis]
Length = 521
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 82/217 (37%), Positives = 123/217 (56%), Gaps = 11/217 (5%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG P + L CRY P+LRL PLK EE P I++Y +V+ DSEI+ +K+
Sbjct: 289 CRGLFPKPKS----LSCRYNSTTTPFLRLAPLKLEEISHDPYIVMYHNVLSDSEIEEMKQ 344
Query: 112 MA--QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
++ AT + T L+I ++++ WL E P +ERI+RR+ MTG
Sbjct: 345 LSVLMENGLSATNKPNNTEPLDI----VARAGWLVEAT-PFLERINRRITDMTGFDVLDM 399
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
+ + NYGIG +++PHYD+ G + G R+AT++FY SDVAQGGAT F +
Sbjct: 400 WAVLLANYGIGNYFKPHYDYMYGGRVSGEAVAELGERIATLIFYASDVAQGGATNFPDIQ 459
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+++ P+KG + FW+N+ G D + H+ CP + GS
Sbjct: 460 VAVQPQKGNSLFWYNMFDDGTPDPRSLHSVCPTIVGS 496
>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
Length = 485
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 89/262 (33%), Positives = 129/262 (49%), Gaps = 26/262 (9%)
Query: 6 HQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQ 65
H +GNK + LN+ ++++ + P++ T Y CRG ++
Sbjct: 224 HALMKGNKTNNSDLLNERAQIEELVGTAPKLRPSIRYTT--DYARGCRGQFVQQTNLI-- 279
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
C+Y R P+LRL PLK E ++P I+ + DV+ EI ++++A P L+R TV +
Sbjct: 280 --CKYKFRPSPFLRLAPLKMEVLVVKPFIVAFHDVLSPHEIGELQQLAMPLLKRTTVYDS 337
Query: 126 KTG-ELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
G + R SK WL + + +RI RR+ MTG + LQV+NYG+ GHY
Sbjct: 338 NAGLHGSVKGTRTSKGIWLSRSHNNLTKRIGRRISDMTGFHLEGSTSLQVMNYGLSGHYA 397
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
H D+ E +SDV QGG TVF + + PE+G A W+N
Sbjct: 398 LHTDYFNTAE-------------------LSDVEQGGDTVFPRIEQAFKPERGKALLWYN 438
Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
LH +G GD T H ACPVL GS
Sbjct: 439 LHRNGTGDKRTEHGACPVLVGS 460
>gi|312385412|gb|EFR29925.1| hypothetical protein AND_00803 [Anopheles darlingi]
Length = 468
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 76/169 (44%), Positives = 111/169 (65%), Gaps = 3/169 (1%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
K+ LCRG+ + +A+L+CRYV VP+L++ PLK EE L P I++Y V+ D+EI
Sbjct: 279 KFYSLCRGESPRTASEMAKLRCRYVSNRVPFLKIAPLKLEEVSLDPFIVVYHQVISDNEI 338
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
I ++++ LRRA V + + E++ R S +AWL +P HP + +SRR E MTGLT
Sbjct: 339 KTIIEISRDSLRRAMVGD--VAKQEVSKARTSSNAWLDDPMHPHVRSLSRRTEDMTGLTM 396
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEA-NAFKSLGTGNRVATVLFYM 214
AE+LQV NYGIGGHY PH+D+ P E + ++ GNR+ATV++Y+
Sbjct: 397 WAAEQLQVGNYGIGGHYLPHFDYGTPEEGVELYPNIEKGNRIATVMYYV 445
>gi|195575105|ref|XP_002105520.1| GD21524 [Drosophila simulans]
gi|194201447|gb|EDX15023.1| GD21524 [Drosophila simulans]
Length = 448
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 84/200 (42%), Positives = 116/200 (58%), Gaps = 7/200 (3%)
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
+L C Y +LRL PLK E L P ++L+ DV+ D +I I+ MA+ RL RA +
Sbjct: 256 KLYCLYNTTASYFLRLAPLKMELLSLDPYMVLFHDVVSDKDIVSIRNMAKGRLARAVTVS 315
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
K G R +K WL E +I+R+S+ + MT A+ QV+NYGIGG Y
Sbjct: 316 -KDGNYTEDPDRTTKGTWLVE-NSKLIQRLSQLTQDMTNFEIHDADPFQVLNYGIGGFYG 373
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
H DF E + F +R+AT +FY+SDV QGGAT+F L LS++P+KG+A W+N
Sbjct: 374 IHLDFLGEAELDNF-----SDRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWYN 428
Query: 245 LHSSGDGDYYTRHAACPVLT 264
L GDGD T H+ACP ++
Sbjct: 429 LDHKGDGDNRTAHSACPTVS 448
>gi|195452728|ref|XP_002073474.1| GK14137 [Drosophila willistoni]
gi|194169559|gb|EDW84460.1| GK14137 [Drosophila willistoni]
Length = 536
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 98/270 (36%), Positives = 142/270 (52%), Gaps = 23/270 (8%)
Query: 4 PTHQRAQGNK-LYYQEALN-KSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
PTH Q K L ++E L+ K E+ +P T Y LC+G P
Sbjct: 256 PTHSAQQTRKYLLHREMLSTKKVEVASDP------------TWHANYTRLCQGHRLPEPF 303
Query: 62 IVAQLKCRY-VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID-LIKKMAQPRLRR 119
L C R+V ++ L PLK E+ ++ P I +Y V+ D++I+ ++++ Q + R
Sbjct: 304 TGKSLHCYLDAKRHVSFI-LAPLKVEQVHVDPDINVYHGVLNDAQIEKILQESDQNEMMR 362
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
+ V K G IA+ R+S+ WL P++ +S + ++G + AE++QV NYG+
Sbjct: 363 SAVSGDK-GSATIADLRVSQQTWLNYSS-PIMRSLSNLISDISGFDMAGAEQMQVANYGV 420
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GG YEPH D+ FK G+R++T +FY+SDV GG TVF LN+ L P KG
Sbjct: 421 GGQYEPHPDYFEVNLPQEFK----GDRISTSMFYLSDVELGGNTVFIKLNVFLPPIKGAM 476
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
WHNLH S D D T HA CPVL GS +
Sbjct: 477 VMWHNLHYSLDVDRRTIHAGCPVLIGSKRI 506
>gi|328718395|ref|XP_003246475.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
pisum]
Length = 518
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 85/203 (41%), Positives = 119/203 (58%), Gaps = 7/203 (3%)
Query: 67 KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYK 126
KCRY N+ Y LMP KEE+ +P I +Y DV+YD EI IK +A ++ ATV++
Sbjct: 294 KCRYQTNNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALENMKDATVKSVD 353
Query: 127 -TGELEIANYRISKSAWLREPEH-PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
G+ I R + W+ + + ++ + R+E TG +T TAE+ Q+VNYG+GGHY
Sbjct: 354 GKGDSLIEKTRSGQVYWISKVDAVEYLDALDTRIESFTGFSTKTAEQYQIVNYGLGGHYL 413
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
PH+D A A L GNR+ TVLFY++DV G T F LN+ EKG A W+N
Sbjct: 414 PHHD----SFAKAINCLQFGNRLVTVLFYLTDVQNDGYTSFPLLNIIAPAEKGAALVWNN 469
Query: 245 LH-SSGDGDYYTRHAACPVLTGS 266
LH S+G Y + H +CP+L G+
Sbjct: 470 LHMSNGQKFYESLHGSCPLLKGN 492
>gi|156370183|ref|XP_001628351.1| predicted protein [Nematostella vectensis]
gi|156215325|gb|EDO36288.1| predicted protein [Nematostella vectensis]
Length = 478
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 95/240 (39%), Positives = 131/240 (54%), Gaps = 7/240 (2%)
Query: 2 IFPTHQRAQGNKLYYQEALNK-SPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD-LTVP 59
I P H+ + + +Y +A+ PEL K N +E Y LCRG+ + V
Sbjct: 241 IDPRHKSVRESIEHYSKAVKHGEPELSYPQIKANEQRLNIEYFVNSDYSKLCRGEPIKVR 300
Query: 60 PAIVAQLK---CRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
V K C Y +R L L P K E+ PR++++R ++ D E IK++A P
Sbjct: 301 HFQVMSAKSYHCWYDNRGDARLLLKPNKVEQVNDDPRVVIFRGLVTDRETARIKQIASPM 360
Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
L RATV N TG LE A+YR+SKSAWL + I +++R+ +TGL TAE+LQ+ N
Sbjct: 361 LNRATVYNIDTGVLEYADYRVSKSAWLEDHLDETIATVNKRIAMVTGLDVQTAEKLQIAN 420
Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
YG+GG YE H D P A L GNR+AT+L Y++DVA GGATVF + + P K
Sbjct: 421 YGMGGQYEQHTDHGEPDSPLANDPL--GNRIATLLIYLNDVALGGATVFLKAGVHVPPTK 478
>gi|195499025|ref|XP_002096772.1| GE25857 [Drosophila yakuba]
gi|194182873|gb|EDW96484.1| GE25857 [Drosophila yakuba]
Length = 490
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/239 (36%), Positives = 137/239 (57%), Gaps = 18/239 (7%)
Query: 28 DEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEE 87
D P+ VAP+ VT E+ E+ + TV ++L CRY P+ R+ PLK EE
Sbjct: 239 DNKPE--EVAPSHGVTHIEE-ELATVQNCTVVVQKPSRLHCRYNSTTTPFTRIAPLKMEE 295
Query: 88 AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
L P ++++ DV+YD EI+L+ + L T + + R SK +++ E +
Sbjct: 296 LSLDPYMVVFHDVIYDREIELMLNSSNFILSL-------TDSGQESEVRASKDSYIVESK 348
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRV 207
++ RV MTGL+ ++ ++NYGIGGHY HYD+ + K G+R+
Sbjct: 349 -----TLNDRVTDMTGLSMELSDPFSLINYGIGGHYMLHYDYHKYTNTTRAK---YGDRI 400
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
AT+LFY+ +V GGAT+F +N+++ P+KG+A FW+NLH+SG T H+ACPV++GS
Sbjct: 401 ATLLFYLGEVDSGGATIFPRINITVTPKKGSAVFWYNLHNSGALHLETLHSACPVISGS 459
>gi|313243209|emb|CBY39868.1| unnamed protein product [Oikopleura dioica]
Length = 430
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 90/228 (39%), Positives = 132/228 (57%), Gaps = 11/228 (4%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPY--LRLMPLKEEEAYLQPRIILYRDVMYD 103
++YE LCR P + LKC Y P L+ P+K EE + P ++ + +V+ D
Sbjct: 177 QEYERLCR---EFSPPHKSNLKCFYWTGPSPVSPLQWAPVKTEELHDDPLVVQFYEVISD 233
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE----HPVIERISRRVE 159
E I+ +A L RAT+Q+ TG+L A+YRI K+AWL E + + I + + ++
Sbjct: 234 EEERAIQFLAGEHLNRATIQDPATGKLVNADYRIQKTAWLTEFDKFDVNGTIAKYNAKLT 293
Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDF-ARPGEANAFKSLGTGNRVATVLFYMSDVA 218
+TGL AE +QV NYG+ G YEPH+D + PG N + + G+R+AT L YMS+
Sbjct: 294 KITGLDADHAELVQVGNYGVAGQYEPHWDHQSYPGAENRWDPI-EGSRIATWLAYMSEPN 352
Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
GG TVF + P + +A FW+NL SG+ D T+HAACPVL+G+
Sbjct: 353 MGGGTVFIQAGIQARPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGT 400
>gi|26352077|dbj|BAC39675.1| unnamed protein product [Mus musculus]
Length = 383
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 101/268 (37%), Positives = 139/268 (51%), Gaps = 32/268 (11%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
P ++R N L Y+ L ++ ++ E P L+ R+ YE LC+ + P
Sbjct: 118 PDNKRMARNVLKYERLLAENGHQMAAETAIQRPNVPHLQT--RDTYEGLCQTLGSQPTHY 175
Query: 63 -VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ L C Y + PYL L P ++E +L+P I LY D + D E I+++A+P L+R+
Sbjct: 176 QIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSV 235
Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQVVNYG 178
V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQVVNYG
Sbjct: 236 V---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 292
Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
IGGHYEPH+D A +S V GGAT F N S+ K
Sbjct: 293 IGGHYEPHFDHAT----------------------LSSVEAGGATAFIYGNFSVPVVKNA 330
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGS 266
A FW NLH SG+GD T HA CPVL G
Sbjct: 331 ALFWWNLHRSGEGDGDTLHAGCPVLVGD 358
>gi|194904100|ref|XP_001981000.1| GG23922 [Drosophila erecta]
gi|190652703|gb|EDV49958.1| GG23922 [Drosophila erecta]
Length = 490
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 87/241 (36%), Positives = 138/241 (57%), Gaps = 19/241 (7%)
Query: 26 LKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKE 85
L+++P ++ P+ EV E+ E+ + T ++L CRY P+ R+ PLK
Sbjct: 238 LENKPEEI---FPSHEVIHFEE-ELATVQNCTAVVQKPSRLHCRYNSSTTPFTRIAPLKM 293
Query: 86 EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE 145
EE P +++Y DV+YDSEIDL+ + L T + + R SK +++ +
Sbjct: 294 EELSSDPYMVVYHDVIYDSEIDLMLNASNFSLSL-------TNSGQKSEVRASKDSYIVD 346
Query: 146 PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
+ ++ RV MTGL+ ++ ++NYGIGGHY HYD+ E + G+
Sbjct: 347 SK-----TLNDRVTDMTGLSMEMSDPFSMINYGIGGHYMLHYDY---HEYSNMTREKYGD 398
Query: 206 RVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
R+ATVLFY+ +V GGAT+F +N+++ P+KG+A FW+NLH+SG T H+ACPV++G
Sbjct: 399 RIATVLFYLGEVHSGGATIFPRINITVTPKKGSAVFWYNLHNSGAMHSETLHSACPVISG 458
Query: 266 S 266
S
Sbjct: 459 S 459
>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
Length = 536
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 89/250 (35%), Positives = 139/250 (55%), Gaps = 24/250 (9%)
Query: 31 PKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYL 90
P + P L+ E++ +CR P+ +L CRY P+LRL PL+ EE L
Sbjct: 278 PVSEEMKPILD----EEFNQICRSSHQNKPS---RLHCRYNATTTPFLRLAPLRMEELSL 330
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE---IANYRISKSAWL-REP 146
P I++Y +V+ D+EI ++++A+P L+ V GE++ + R + AW+ E
Sbjct: 331 DPYIVVYHNVLSDAEIAKVERVAEPLLKSIGV-----GEMDNSKKSKVRTALGAWIPDEN 385
Query: 147 EH----PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG 202
H PVI+RI RR+ MTGL + +Q++ YG GGHY+ H+D+ +
Sbjct: 386 MHISGWPVIQRIVRRIHDMTGLIIKRGQVVQLIKYGYGGHYDTHFDYLNDSLP---ITQA 442
Query: 203 TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACP 261
G+R+ATVLFY++DV GG+TVF L L + E+G W+N+H + D D T H +CP
Sbjct: 443 LGDRMATVLFYLNDVKHGGSTVFPVLQLKVPSERGKVLVWYNMHGETHDLDSRTLHGSCP 502
Query: 262 VLTGSNSLHS 271
V+ G+ ++ S
Sbjct: 503 VIDGAKTVLS 512
>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
adhaerens]
Length = 495
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 88/251 (35%), Positives = 139/251 (55%), Gaps = 17/251 (6%)
Query: 26 LKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQ-------LKCRYVHRNVPYL 78
+D ++ T + E ++ LCRG++ I+ LKC Y +++ P L
Sbjct: 227 FQDYVKRLGRADSTRRLAENTEFGNLCRGNVKEVNYILCSILLANKTLKCYYSNQS-PLL 285
Query: 79 RLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRIS 138
L P+ EE L P I++Y D++ D +I+ IKK++ + ++ ++ ++S
Sbjct: 286 YLAPIPVEEISLDPFIVIYYDIINDHQIETIKKISPSKSNKSPNHAMLCSGIKSEATQVS 345
Query: 139 K---SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
S WL + PV+E+ISR + +T L + AE+LQV NYGIGGHY PHYD
Sbjct: 346 IFCCSTWLEDAYDPVVEKISRLTQELTHLDVNYAEDLQVANYGIGGHYVPHYDSTIIAPE 405
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYT 255
+ + R+AT++FY+S+V GGAT+F L +++ P+KG+A FW NL +G + T
Sbjct: 406 DPLQ------RLATMMFYLSNVEIGGATIFPRLGVAVRPQKGSALFWINLKRNGLTNRQT 459
Query: 256 RHAACPVLTGS 266
HAACPV+ GS
Sbjct: 460 LHAACPVVIGS 470
>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
Length = 509
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 115/204 (56%), Gaps = 7/204 (3%)
Query: 63 VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
A+L C Y +LRL PLK E L P ++L+ DV+ D +I I+ +A+ L RA V
Sbjct: 290 TAKLHCLYNTTASHFLRLAPLKMELLSLDPYVVLFHDVVSDQDILSIRNLAKGGLARA-V 348
Query: 123 QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGH 182
+ G + R +K WL E +I+R+S+ + MT A+ QV+NYGIGG
Sbjct: 349 TVTQDGNDKEDPARTTKGTWLVE-NSKLIQRLSQLSQDMTNFDVRDADPFQVLNYGIGGF 407
Query: 183 YEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
Y H+DF E F +R+AT +FY+SDV QGGAT F L LS++PEKG A W
Sbjct: 408 YGTHFDFLEDTEMGHF-----SDRIATAVFYLSDVPQGGATTFPDLGLSVFPEKGAALLW 462
Query: 243 HNLHSSGDGDYYTRHAACPVLTGS 266
+NL G GD T H+ACP + GS
Sbjct: 463 YNLDHKGVGDNRTAHSACPTIVGS 486
>gi|195392288|ref|XP_002054791.1| GJ24631 [Drosophila virilis]
gi|194152877|gb|EDW68311.1| GJ24631 [Drosophila virilis]
Length = 499
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 82/216 (37%), Positives = 126/216 (58%), Gaps = 7/216 (3%)
Query: 51 LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
LCRG P + + L+CRY + P+LRL PLK E+ L P ++LY DV+ +E + I
Sbjct: 266 LCRGHSL--PLVSSSLRCRYNTASAPFLRLAPLKLEQLSLDPYMVLYHDVVQANEREHIM 323
Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
++A+P LRRA V G + R + +A + +R+ +R+E M+G + +
Sbjct: 324 QLAKPHLRRALV-----GAARAHSQRFAMNAGFSYNDSRQGQRLRQRLEDMSGFDLTNSG 378
Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
+L V+NYGIGG Y HYD + A + NR+AT+L Y++DV GG T F +L L
Sbjct: 379 QLAVLNYGIGGQYYMHYDCWFSQDDAAQVASIKDNRIATILLYLTDVQLGGLTSFPALGL 438
Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
++ P G+A WHN++++ + D T HAACP+L G+
Sbjct: 439 AVQPSPGSALIWHNMNNAAECDRRTLHAACPLLLGT 474
>gi|15077349|gb|AAK83137.1| prolyl 4-hydroxylase alpha subunit [Cavia porcellus]
Length = 141
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 75/141 (53%), Positives = 102/141 (72%), Gaps = 2/141 (1%)
Query: 48 YEMLCRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
YEMLCRG+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+E
Sbjct: 1 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 60
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
I+++K +A+PRLRRAT+ N TG+LE +YRISKSAWL E+PV+ RI+ R++ +TGL
Sbjct: 61 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLD 120
Query: 166 TSTAEELQVVNYGIGGHYEPH 186
STAEELQV NYG+GG YEPH
Sbjct: 121 VSTAEELQVANYGVGGQYEPH 141
>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 87/250 (34%), Positives = 138/250 (55%), Gaps = 24/250 (9%)
Query: 31 PKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYL 90
P + P L+ E++ +CR P+ +L CRY P+LRL PL+ EE L
Sbjct: 272 PVSEEMKPILD----EEFNQICRSSHQNKPS---RLHCRYNATTTPFLRLAPLRMEELSL 324
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE---IANYRISKSAWLREPE 147
P I++Y +V+ D+EI ++++A+P L+ V GE++ + R + AW+ +
Sbjct: 325 DPYIVVYHNVLSDAEIAKVERVAEPLLKSIGV-----GEMDNSKKSKVRTALGAWIPDKN 379
Query: 148 H-----PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG 202
PVI+RI RR+ MTGL + +Q++ YG GGHY+ H+D+ +
Sbjct: 380 MHISGWPVIQRIVRRIHDMTGLIIKHGQVVQLIKYGYGGHYDTHFDYLNDSLP---ITQA 436
Query: 203 TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACP 261
G+R+ATVLFY++DV GG+TVF L L + E+G W+N+H + D D T H +CP
Sbjct: 437 LGDRMATVLFYLNDVKHGGSTVFPVLKLKVPSERGKVLVWYNMHGETHDLDSRTLHGSCP 496
Query: 262 VLTGSNSLHS 271
V+ G+ ++ S
Sbjct: 497 VIDGAKTVLS 506
>gi|198449506|ref|XP_002136910.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
gi|198130637|gb|EDY67468.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
Length = 543
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 90/269 (33%), Positives = 146/269 (54%), Gaps = 17/269 (6%)
Query: 11 GNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRG------DLTVPPAIVA 64
G+K + +A + ++ PP+ + + +TE K+ LCR D + + A
Sbjct: 249 GDKTFGNKAYHIVSHFQEHPPQQSINIGSRGITE--KFNRLCRSMSRRKTDGSAAHSKPA 306
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
+L CRY +LRL PL+ EE L P I+LY +V+ D E+ ++ M+ P L RA + +
Sbjct: 307 RLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARIFD 366
Query: 125 YKTGELEIANYRISKSAWLREP-----EHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
+T + +I+ R + + P + ++E I +R+ +TGL ++ +Q + YG
Sbjct: 367 KETKKPKISPVRSADEVGIPNPKLVTEDIQLVECIQKRITDLTGLMLTSMRRIQFLKYGF 426
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GG Y PH+DF + S G+R+ATV+FY++DV GGAT F +L+L + E+G
Sbjct: 427 GGIYVPHHDFF---SVHTPTSRLHGDRIATVIFYLNDVEHGGATAFPNLDLVVPTERGAV 483
Query: 240 AFWHNLH-SSGDGDYYTRHAACPVLTGSN 267
FWHN+ + D DY T H ACPV+ G+
Sbjct: 484 LFWHNMDGETYDLDYRTLHGACPVIVGTK 512
>gi|291224083|ref|XP_002732036.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit-like [Saccoglossus
kowalevskii]
Length = 491
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 94/246 (38%), Positives = 139/246 (56%), Gaps = 23/246 (9%)
Query: 26 LKDEPPKVNNVAP-TLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLK 84
LKD+P +V P + + +R+ YE LCRG+ P +++KC+YV L L P K
Sbjct: 240 LKDKP----SVRPNSTYLDDRDAYEALCRGERR-KPLDSSKVKCQYVTNGNYRLLLQPAK 294
Query: 85 EEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV----QNYKTGELEIANYRISKS 140
+E + PR++LY DV+ D EI+ + K+A+P+LRR+ V + A YR+S
Sbjct: 295 QEIMHHNPRVVLYHDVISDEEINEVIKLAKPKLRRSLVVTKGSSPSGTGSSDAEYRVSSG 354
Query: 141 AWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKS 200
WL + + VI +++RR+ ++GL+T TA E + H E A E +
Sbjct: 355 GWLEDWDGTVIAKLTRRISDISGLSTLTAPEYR--------HAE-----ALQIENSDVHL 401
Query: 201 LGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAAC 260
G+ NR+AT +FYMS+V GG TVF ++ + P K A FW+NL +SG+ D TRHA C
Sbjct: 402 PGSRNRIATWMFYMSEVKAGGYTVFPEVDAFVPPVKNAAVFWYNLKASGESDDLTRHAGC 461
Query: 261 PVLTGS 266
PVL GS
Sbjct: 462 PVLIGS 467
>gi|195159321|ref|XP_002020530.1| GL13464 [Drosophila persimilis]
gi|194117299|gb|EDW39342.1| GL13464 [Drosophila persimilis]
Length = 533
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 85/223 (38%), Positives = 123/223 (55%), Gaps = 7/223 (3%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
Y LC+G L+C + Y L PL+ E+ +L P I +Y ++ +ID
Sbjct: 287 YSRLCQGRRLPEKGSGTSLRCFLDGKRHAYFTLAPLQVEQVHLDPDIDVYHGILTLDQID 346
Query: 108 LIKKMAQPR-LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
I + A + + R+ V G + + R+S+ WL + E P+++ I+R V ++G
Sbjct: 347 SIFEAADKQEMTRSGVAG-DGGTRTVVDLRVSQQTWL-DYESPIMKSIARLVVFISGFDI 404
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+ AE +QV NYG+GG YEPH D+ + FK G+R++T +FY+SDV QGG TVFT
Sbjct: 405 AGAEAMQVANYGVGGQYEPHPDYFEVNLPSDFK----GDRISTSMFYLSDVEQGGYTVFT 460
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
LN+ L P KG WHNLH S D D T HA CPV+ GS +
Sbjct: 461 KLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIVGSKRI 503
>gi|195128345|ref|XP_002008624.1| GI13596 [Drosophila mojavensis]
gi|193920233|gb|EDW19100.1| GI13596 [Drosophila mojavensis]
Length = 527
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 86/225 (38%), Positives = 129/225 (57%), Gaps = 14/225 (6%)
Query: 45 REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
+E Y + CRG P L CRY P+LRL P K EE L P I+LY +V+ DS
Sbjct: 286 QEPYYLGCRG--GYPKR--TNLHCRYNTTTTPFLRLAPFKMEEVSLDPYIVLYHNVISDS 341
Query: 105 EIDLIKKMAQPR---LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHM 161
EI+ IK+ A L R + N T + +I +++ W+ E P +RI+ R+ +
Sbjct: 342 EIEDIKQHATNFTNGLSRNPLLNV-TDKPQI----VARMQWV-EKMTPFTDRINLRITDI 395
Query: 162 TGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
TG + +Q+ NYGIGGH+ PH+D+ G + + G G+R AT++FY SD+ QGG
Sbjct: 396 TGFGVDECKTVQIANYGIGGHFIPHFDYTTEGRVSINDTFGIGDRTATIVFYASDM-QGG 454
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
ATVF ++ +++ P+KG+A W+NL + T H+ CPV++GS
Sbjct: 455 ATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLTLHSVCPVISGS 499
>gi|241598357|ref|XP_002404731.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215500462|gb|EEC09956.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 218
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 116/209 (55%), Gaps = 30/209 (14%)
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
+QL+CRY +L L +K EE L+P II+ DV+ D +I+ + + A+PRL R+T
Sbjct: 3 SQLRCRYYKGQDGFLALQQIKLEEMNLKPYIIVMHDVVQDKDIEKLMEFAEPRLERSTT- 61
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
Y E+ R S +AWL E E P+ + NYG GGH+
Sbjct: 62 -YNGSEVMPTPQRTSSTAWLNEDEAPI----------------------ALANYGTGGHF 98
Query: 184 EPHYDF------ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
PH+DF A A+ + G G+R+AT++ YM+DV GGATVF SL + L P+KG
Sbjct: 99 LPHHDFFQDSLNAYNSSADYYLQHGRGDRIATLMIYMTDVEAGGATVFPSLGIRLTPKKG 158
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
AAFW NL +SG+G+ T HA CPVL GS
Sbjct: 159 DAAFWWNLKASGEGERLTMHAGCPVLYGS 187
>gi|198449650|ref|XP_001357661.2| GA13747 [Drosophila pseudoobscura pseudoobscura]
gi|198130701|gb|EAL26795.2| GA13747 [Drosophila pseudoobscura pseudoobscura]
Length = 533
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 85/223 (38%), Positives = 123/223 (55%), Gaps = 7/223 (3%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
Y LC+G L+C + Y L PL+ E+ +L P I +Y ++ +ID
Sbjct: 287 YSRLCQGRRLPEKGSGTSLRCFLDGKRHAYFTLAPLQVEQVHLDPDIDVYHGILTLDQID 346
Query: 108 LIKKMAQPR-LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
I + A + + R+ V G + + R+S+ WL + E P+++ I+R V ++G
Sbjct: 347 SIFEAADKQEMTRSGVAG-DGGTRTVVDLRVSQQTWL-DYESPIMKSIARLVVFISGFDI 404
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+ AE +QV NYG+GG YEPH D+ + FK G+R++T +FY+SDV QGG TVFT
Sbjct: 405 AGAEAMQVANYGVGGQYEPHPDYFEVNLPSDFK----GDRISTSMFYLSDVEQGGYTVFT 460
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
LN+ L P KG WHNLH S D D T HA CPV+ GS +
Sbjct: 461 KLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIVGSKRI 503
>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
Length = 502
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 85/218 (38%), Positives = 132/218 (60%), Gaps = 11/218 (5%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG+ P L C Y ++ P+L L P K E P + +Y DV+YD EI+ +K+
Sbjct: 251 CRGEYEHPKG----LSCYYDSKDEPFLFLAPFKVEILNNLPFVAIYHDVLYDREIEELKR 306
Query: 112 MAQPRLRRATVQNY-KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT--ST 168
+A P + R+T+ +Y K G + + N+R S S +L +++ + +RV MT L ++
Sbjct: 307 LAVPTITRSTIYDYDKEGNVPV-NFRTSNSVFLLNNASYLVDILRQRVADMTHLNVFKNS 365
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
+++LQV+NYG+GG+Y H+DF E+ + G+R+ TVL YM+DV QGGATVF +L
Sbjct: 366 SDDLQVMNYGLGGYYRYHFDFFGKDES---PNKLLGDRIITVLIYMTDVQQGGATVFPAL 422
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
++ +P+KG+A + NL ++ D T HA CPVL GS
Sbjct: 423 RITNFPKKGSALIFRNLDNNISPDPSTLHAGCPVLFGS 460
>gi|403274090|ref|XP_003928822.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saimiri
boliviensis boliviensis]
Length = 149
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 67/95 (70%), Positives = 79/95 (83%)
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
LQV NYG+GG YEPH+DFAR E +AFK LGTGNR+AT LFYMSDV+ GGATVF + S
Sbjct: 30 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 89
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 90 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 124
>gi|195159148|ref|XP_002020444.1| GL13996 [Drosophila persimilis]
gi|194117213|gb|EDW39256.1| GL13996 [Drosophila persimilis]
Length = 559
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 90/273 (32%), Positives = 147/273 (53%), Gaps = 23/273 (8%)
Query: 10 QGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTER---EKYEMLCRG------DLTVPP 60
+G+K + +A + ++ PP+ ++ + R EK+ LCR D +
Sbjct: 264 RGDKTFGDKAYHIVSHFQEHPPQ-----QSINIGSRGFTEKFNRLCRSMSRRKTDGSAAH 318
Query: 61 AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
+ A+L CRY +LRL PL+ EE L P I+LY +V+ D E+ ++ M+ P L RA
Sbjct: 319 SKPARLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRA 378
Query: 121 TVQNYKTGELEIANYRISKSAWLREP-----EHPVIERISRRVEHMTGLTTSTAEELQVV 175
+ + +T + +I+ R + + P + ++E I +R+ +TGL ++ +Q +
Sbjct: 379 RIFDKETKKPKISPVRSADEVGIPNPKLVTGDIQLVECIQKRITDLTGLMLTSMRRIQFL 438
Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
YG GG Y PH+DF + S G+R+ATV+FY++DV GGAT F +L+L + E
Sbjct: 439 KYGFGGIYVPHHDFF---SVHTPTSRLHGDRIATVIFYLNDVEHGGATAFPNLDLVVPTE 495
Query: 236 KGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSN 267
+G FWHN+ + D DY T H ACPV+ G+
Sbjct: 496 RGAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 528
>gi|194765172|ref|XP_001964701.1| GF23326 [Drosophila ananassae]
gi|190614973|gb|EDV30497.1| GF23326 [Drosophila ananassae]
Length = 885
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 79/201 (39%), Positives = 113/201 (56%), Gaps = 19/201 (9%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
L C Y + P+LRL P+K E P I ++ DV+Y E+ I+ + L +T NY
Sbjct: 675 LYCLYNTKTSPFLRLAPIKTELLSKDPYIAIFHDVVYPKELTRIRTACKSHLIASTTINY 734
Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
+ + +YR SKS W+ + + +RI+ V TGL +T+E QV+NYGIGG +E
Sbjct: 735 TSNAYSVDSYRTSKSVWIPTDSNNLTQRITNLVGDATGLEMTTSEMFQVINYGIGGLFEA 794
Query: 186 HYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL 245
H D P +NA +SDV QGGAT+FT LNL+++P+ G+A FW+NL
Sbjct: 795 HMD---PVLSNA----------------LSDVEQGGATIFTKLNLTVFPQSGSALFWYNL 835
Query: 246 HSSGDGDYYTRHAACPVLTGS 266
+ G+ D T HA CPV+ GS
Sbjct: 836 DNWGNEDKRTEHAGCPVIVGS 856
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 105/217 (48%), Gaps = 20/217 (9%)
Query: 16 YQEALNKSP---ELKDEPP-------KVNNVAPTLEVTER----EKYEMLCRGDLTVPPA 61
YQ AL KSP +L +E K+ + P +E + E + + C G
Sbjct: 244 YQVALKKSPPDAKLYEEHQYLESMYLKLFGLDPNIEEYDNSYKSEVFSLCCNGKCQKDKK 303
Query: 62 IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
I L C Y + L + P+K+E + P I L+ DV+ E +++ +++ L +T
Sbjct: 304 I-QNLYCFYDTKTSNALIIAPVKKEILSVDPYIALFHDVISQKEQKILQSVSKIHLMAST 362
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
+ + NYRISKS W + V +R++ +E TG ++E QV+NYG+GG
Sbjct: 363 TIH--NNNKAVKNYRISKSVWYASDYNDVTKRLTTFMEQATGYDMKSSELFQVINYGLGG 420
Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
++ H D+ + + GT +R+AT LFY S +A
Sbjct: 421 RFDGHEDYLLTDKT---RFNGTSDRIATTLFYESTLA 454
>gi|198449641|ref|XP_002136935.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
gi|198130697|gb|EDY67493.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 84/203 (41%), Positives = 115/203 (56%), Gaps = 7/203 (3%)
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
++L C Y +LRL PLK E L P ++LY DV+ D E+ L+K MAQ L RA+
Sbjct: 291 SRLYCLYNTTATAFLRLAPLKMELLSLDPYVVLYHDVLADREMSLLKSMAQKDLVRASTY 350
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
+ + R +K+ WL +P H +I R+ E MT L E+ QV+NYGIGGH
Sbjct: 351 DVMDKKHSEDPNRTTKARWL-DPSHSLIRRMGILTEDMTNLDLERLEDFQVLNYGIGGHD 409
Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
+ H D+ +RVAT+LFY+SDV GGATVF L+LS++P++G W+
Sbjct: 410 DIHPDYYEGSNPE------LPDRVATLLFYLSDVPLGGATVFPLLDLSVFPKRGAVLMWY 463
Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
NL G G T H+ACPV+ GS
Sbjct: 464 NLDHKGQGIEKTVHSACPVVVGS 486
>gi|328718393|ref|XP_001945742.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Acyrthosiphon pisum]
Length = 511
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 87/209 (41%), Positives = 121/209 (57%), Gaps = 19/209 (9%)
Query: 67 KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYK 126
KCRY N+ Y LMP KEE+ +P I +Y DV+YD EI IK +A +++ A V++
Sbjct: 291 KCRYQTNNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALEKMKDAKVKS-- 348
Query: 127 TGELEIANY------RISKSAWLREPEH-PVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
++ NY R + W+ E + + ++ R+E TG +T TAE Q+VNYG+
Sbjct: 349 ---VDGKNYLLEEKTRSGQVYWIFEVDAVEYFDALNTRIESFTGFSTKTAERYQIVNYGL 405
Query: 180 GGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
GGHY PH+D FA+ E F GNR+ TVLFY++DV G T F LN+ EKG
Sbjct: 406 GGHYIPHHDSFAKGAENVKF-----GNRLVTVLFYLTDVQNDGYTSFPMLNIIAPAEKGA 460
Query: 239 AAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
A W+NLH S+G Y T H +CP+L G+
Sbjct: 461 ALVWNNLHMSNGQKFYETLHGSCPLLKGN 489
>gi|328718391|ref|XP_003246474.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Acyrthosiphon pisum]
Length = 514
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 87/209 (41%), Positives = 121/209 (57%), Gaps = 19/209 (9%)
Query: 67 KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYK 126
KCRY N+ Y LMP KEE+ +P I +Y DV+YD EI IK +A +++ A V++
Sbjct: 294 KCRYQTNNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALEKMKDAKVKS-- 351
Query: 127 TGELEIANY------RISKSAWLREPEH-PVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
++ NY R + W+ E + + ++ R+E TG +T TAE Q+VNYG+
Sbjct: 352 ---VDGKNYLLEEKTRSGQVYWIFEVDAVEYFDALNTRIESFTGFSTKTAERYQIVNYGL 408
Query: 180 GGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
GGHY PH+D FA+ E F GNR+ TVLFY++DV G T F LN+ EKG
Sbjct: 409 GGHYIPHHDSFAKGAENVKF-----GNRLVTVLFYLTDVQNDGYTSFPMLNIIAPAEKGA 463
Query: 239 AAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
A W+NLH S+G Y T H +CP+L G+
Sbjct: 464 ALVWNNLHMSNGQKFYETLHGSCPLLKGN 492
>gi|195572619|ref|XP_002104293.1| GD18524 [Drosophila simulans]
gi|194200220|gb|EDX13796.1| GD18524 [Drosophila simulans]
Length = 472
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 78/218 (35%), Positives = 125/218 (57%), Gaps = 15/218 (6%)
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
E+ + + T ++L CRY P+ R+ PLK EE L P ++++ DV+YD+EID
Sbjct: 239 ELATKQNCTAVVQKPSRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEIDG 298
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
+ L + T + + R SK +++ + E ++ RV MTG +
Sbjct: 299 M-------LNSSNFGLSLTDSGQKSEVRTSKDSYIVDSE-----SLNERVTDMTGFSMEM 346
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
++ ++NYG+GGHY HYDF K G+R+ATVLFY+ +V GGAT+F +
Sbjct: 347 SDPFSLINYGLGGHYMLHYDFHEYTNTTRPKQ---GDRIATVLFYLGEVDSGGATIFPKI 403
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
N+++ P+KG+A FW+NLH+SG + + H+ACPV++GS
Sbjct: 404 NIAVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGS 441
>gi|260806889|ref|XP_002598316.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
gi|229283588|gb|EEN54328.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
Length = 531
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 95/246 (38%), Positives = 137/246 (55%), Gaps = 19/246 (7%)
Query: 45 REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAY-LQPRIILYRDVMYD 103
R+KYE LCR + A + CRY R PY L P+K E + P I L+ D++ +
Sbjct: 292 RDKYEELCRVGVLQNRAPRSSASCRYF-RPSPYFYLGPIKMEVLHETNPVIHLFHDIVSE 350
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
SE +++MA P+ R+ V G+ I N R+S++AW + + PV+ ++SRRV++ TG
Sbjct: 351 SEAARMREMAIPKFHRSVVVGDDGGDAIILN-RVSETAWHFDYDDPVVAKLSRRVDYATG 409
Query: 164 LTTS--TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
L+T+ TAE QVVNYG+GG Y PH D+ + + GNRV T L Y+SDV GG
Sbjct: 410 LSTAEGTAEAFQVVNYGLGGQYIPHTDYFEGDHVT--RHIQNGNRVVTFLLYLSDVDAGG 467
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC-------- 273
ATVF +++++ P A FW ++ SG + HA CPVL GS + +
Sbjct: 468 ATVFPIVDVAV-PINSAAVFW-SMERSGAVVPNSLHAGCPVLIGSKWIANKWIREHGNEF 525
Query: 274 --PCGL 277
PCGL
Sbjct: 526 RRPCGL 531
>gi|328713119|ref|XP_003244997.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
pisum]
Length = 487
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 90/224 (40%), Positives = 122/224 (54%), Gaps = 12/224 (5%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
++ LC+ +++ + KCRY N+ Y LMP KEE+ +P I +Y DV+YD EI
Sbjct: 249 EFRNLCKHGVSLR-TLTKYSKCRYQTNNLFYRILMPFKEEDINSEPFIKIYHDVLYDDEI 307
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIE---RISRRVEHMTG 163
IK M+ + A V KT I R R E IE ++ R+E TG
Sbjct: 308 LKIKTMSLANMSDAKV---KTSNDSILRERSRSGQVYRMNEVDAIEYFDALNTRIESFTG 364
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
+T TAE Q+VNYG+GGHY PH+D + G N + GNR+ TVLFY++DV G T
Sbjct: 365 FSTKTAERYQIVNYGLGGHYFPHFDTFKKGTEN----MEFGNRLVTVLFYLTDVQNDGYT 420
Query: 224 VFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
F LN+ EKG+A W+NLH S G Y + H ACP+L G+
Sbjct: 421 SFPMLNIIAPAEKGSALVWNNLHMSDGQLCYESLHGACPLLKGN 464
>gi|195572621|ref|XP_002104294.1| GD18523 [Drosophila simulans]
gi|194200221|gb|EDX13797.1| GD18523 [Drosophila simulans]
Length = 490
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 80/211 (37%), Positives = 121/211 (57%), Gaps = 31/211 (14%)
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
++L CRY P+ R+ PLK EE L P ++++ DV+YD+EID
Sbjct: 272 SRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEID---------------- 315
Query: 124 NYKTGELEIANYRISKSAWLREPE-------HPVIER-ISRRVEHMTGLTTSTAEELQVV 175
G L +N+ IS+S + E H V + ++ RV MTGL+ ++ ++
Sbjct: 316 ----GMLNSSNFGISESVSGLKSEVRTSKDSHIVDSKTLNERVTDMTGLSMEMSDPFSLI 371
Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
NYG+GGH+ H+DF K G+R+ATVLFY+ +V GGAT+F LN+++ P+
Sbjct: 372 NYGLGGHFILHHDFHEYTNTTRLKQ---GDRIATVLFYLGEVDSGGATIFPMLNITVTPK 428
Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
KG+A FW+NLH+SG + T H+ACPV++GS
Sbjct: 429 KGSAVFWYNLHNSGAVNSKTLHSACPVISGS 459
>gi|390176836|ref|XP_003736216.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388858809|gb|EIM52289.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 567
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/269 (33%), Positives = 142/269 (52%), Gaps = 17/269 (6%)
Query: 11 GNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLC------RGDLTVPPAIVA 64
G+K + +A + + PP+ + TE K+ LC + D + + A
Sbjct: 273 GDKTFGNKAYHIVSHFQKHPPQQSINMENGNFTE--KFNRLCSSMSRRKTDGSAAHSKPA 330
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
+L CRY +LRL PL+ EE L P I+LY +V+ D E+ ++ M+ P L RA V +
Sbjct: 331 RLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARVFD 390
Query: 125 YKTGELEIANYRISKSAWLREP-----EHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
+ +I+ R + + P + ++ERI +R+ +TGL ++ +Q + YG
Sbjct: 391 SGIRKPKISPARTADEVQIPNPKLVAEDIQLVERIQKRMTDLTGLVLTSMRRIQFLKYGF 450
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GG Y PH+DF + S G+R+ATV+FY++DV GGAT F +L+L + E+G
Sbjct: 451 GGIYVPHHDFF---SVHTPTSRLHGDRIATVIFYLNDVEHGGATAFPNLDLVVPTERGAV 507
Query: 240 AFWHNLH-SSGDGDYYTRHAACPVLTGSN 267
FWHN+ + D DY T H ACPV+ G+
Sbjct: 508 LFWHNMDGETYDLDYRTLHGACPVIVGTK 536
>gi|195330780|ref|XP_002032081.1| GM23710 [Drosophila sechellia]
gi|194121024|gb|EDW43067.1| GM23710 [Drosophila sechellia]
Length = 490
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 81/211 (38%), Positives = 122/211 (57%), Gaps = 31/211 (14%)
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
++L CRY P+ R+ PLK EE L P ++++ DV+YD+EID
Sbjct: 272 SRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEID---------------- 315
Query: 124 NYKTGELEIANYRISKSAWLREPE-------HPVIER-ISRRVEHMTGLTTSTAEELQVV 175
G L +N+ IS+S + E H V + ++ RV MTGL+ ++ ++
Sbjct: 316 ----GMLNSSNFGISESVSGLKSEVRTSKDSHIVDSKTLNERVTDMTGLSMEMSDPFSLI 371
Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
NYG+GGH+ H+DF E L G+R+ATVLFY+ +V GGAT+F LN+++ P+
Sbjct: 372 NYGLGGHFILHHDFH---EYTNTTRLKRGDRIATVLFYLGEVDSGGATIFPMLNITVTPK 428
Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
KG+A FW+NLH+SG + T H+ACPV++GS
Sbjct: 429 KGSAVFWYNLHNSGAVNSKTLHSACPVISGS 459
>gi|195591302|ref|XP_002085381.1| GD14757 [Drosophila simulans]
gi|194197390|gb|EDX10966.1| GD14757 [Drosophila simulans]
Length = 525
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 89/252 (35%), Positives = 138/252 (54%), Gaps = 13/252 (5%)
Query: 17 QEALNKSPELKDEPPKVNNVAPTLEVTERE-KYEMLCRGDLTVPPAIVAQLKCRYVHRNV 75
+E N +L+D V +V R +E+ CRG +V CRY
Sbjct: 259 EEMDNIMSDLRDPHSDVEVEKELYQVKRRSSNFELGCRGLYRQKTNLV----CRYKSTAN 314
Query: 76 PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
+LRL PLK EE L P I +Y +V+YDSEI +K + + + T EI +
Sbjct: 315 TFLRLAPLKLEEISLDPFIAMYHEVLYDSEIHELKGQSMNMVNGYASERNGT---EIRD- 370
Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG-E 194
+++ W V ERI++R+ MT S E+LQ+ NYG+G +++PH+D++ G E
Sbjct: 371 TVARYDWWSNTS-LVRERINQRIIDMTEFNFSKDEKLQITNYGVGTYFQPHFDYSSDGFE 429
Query: 195 ANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYY 254
+LG +R+A++LFY S+V QGGATVF +N++++P+KG+ +W NLH G D
Sbjct: 430 TPNITTLG--DRLASILFYASEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIR 487
Query: 255 TRHAACPVLTGS 266
++H+ CPV+ G
Sbjct: 488 SKHSVCPVINGD 499
>gi|194765184|ref|XP_001964707.1| GF22906 [Drosophila ananassae]
gi|190614979|gb|EDV30503.1| GF22906 [Drosophila ananassae]
Length = 708
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 86/244 (35%), Positives = 125/244 (51%), Gaps = 8/244 (3%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
Y LC+G + L C R PY L PL+ E +L P I +Y ++ +I
Sbjct: 461 NYTRLCQGKKLPEESTGRPLSCYLDGRTNPYFVLAPLQVEPVHLDPDINVYHRMLSQQQI 520
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
+ I + A + G+ +A+ R+S+ WL P+++ ISR ++ ++G
Sbjct: 521 NSIFEEADKLTMYRSAVAGNAGKSTVADLRVSQQTWLNYTS-PIMKSISRIIQFVSGFDI 579
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+ AE +QV NYG+GG YEPH D+ F+ G+R++T +FY+S+V QGG TVFT
Sbjct: 580 AGAEFMQVANYGVGGQYEPHPDYFEFNLPQQFQ----GDRISTSMFYLSNVEQGGYTVFT 635
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQRSGI 286
LN+ L P +G WHNLH S D D T HA CPVL GS + + + G Q
Sbjct: 636 KLNVFLPPIQGAMVMWHNLHRSLDVDARTLHAGCPVLVGSKRIGNIW---MHSGFQEFRR 692
Query: 287 ICTL 290
C L
Sbjct: 693 PCNL 696
>gi|198449518|ref|XP_002136915.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198130643|gb|EDY67473.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 543
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/269 (33%), Positives = 142/269 (52%), Gaps = 17/269 (6%)
Query: 11 GNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLC------RGDLTVPPAIVA 64
G+K + +A + + PP+ + TE K+ LC + D + + A
Sbjct: 249 GDKTFGNKAYHIVSHFQKHPPQQSINMENGNFTE--KFNRLCSSMSRRKTDGSAAHSKPA 306
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
+L CRY +LRL PL+ EE L P I+LY +V+ D E+ ++ M+ P L RA V +
Sbjct: 307 RLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARVFD 366
Query: 125 YKTGELEIANYRISKSAWLREP-----EHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
+ +I+ R + + P + ++ERI +R+ +TGL ++ +Q + YG
Sbjct: 367 SGIRKPKISPARTADEVQIPNPKLVAEDIQLVERIQKRMTDLTGLVLTSMRRIQFLKYGF 426
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GG Y PH+DF + S G+R+ATV+FY++DV GGAT F +L+L + E+G
Sbjct: 427 GGIYVPHHDFF---SVHTPTSRLHGDRIATVIFYLNDVEHGGATAFPNLDLVVPTERGAV 483
Query: 240 AFWHNLH-SSGDGDYYTRHAACPVLTGSN 267
FWHN+ + D DY T H ACPV+ G+
Sbjct: 484 LFWHNMDGETYDLDYRTLHGACPVIVGTK 512
>gi|24651430|ref|NP_733378.1| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
gi|23172699|gb|AAF57061.2| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
Length = 542
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 6/222 (2%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C G VP + + L C Y H P+L+L P+K E + P ++L D++ E LI+
Sbjct: 293 CSGRCQVPRNL-SNLYCVYNHVTSPFLQLAPIKTEILSIDPFVVLLHDMISQKESTLIRT 351
Query: 112 MAQPRL--RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
++ + T + E ++ YR SKS W + ++I+ R+ TGL ++
Sbjct: 352 SSKEHMLPSATTDPDASDDETQVDTYRTSKSVWYSSDFNDTTKKITERLGDATGLDMNST 411
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E QV+NYG+GG +E H D E N F GT +R+AT LFY+++V QGG T F LN
Sbjct: 412 EFYQVINYGLGGFFETHLDMLL-SEKNRFN--GTSDRIATTLFYLNEVRQGGGTYFPRLN 468
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS 271
L+++P+ G+A FW+NL + G+ + H CPV+ GS + S
Sbjct: 469 LTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKWVMS 510
>gi|20269814|gb|AAM18062.1|AF495540_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE2
[Drosophila melanogaster]
gi|19528175|gb|AAL90202.1| AT27756p [Drosophila melanogaster]
Length = 542
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 6/222 (2%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C G VP + + L C Y H P+L+L P+K E + P ++L D++ E LI+
Sbjct: 293 CSGRCQVPRNL-SNLYCVYNHVTSPFLQLAPIKTEILSIDPFVVLLHDMISQKESTLIRT 351
Query: 112 MAQPRL--RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
++ + T + E ++ YR SKS W + ++I+ R+ TGL ++
Sbjct: 352 SSKEHMLPSATTDPDASDDETQVDTYRTSKSVWYSSDFNDTTKKITERLGDATGLDMNST 411
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E QV+NYG+GG +E H D E N F GT +R+AT LFY+++V QGG T F LN
Sbjct: 412 EFYQVINYGLGGFFETHLDMLL-SEKNRFN--GTSDRIATTLFYLNEVRQGGGTYFPRLN 468
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS 271
L+++P+ G+A FW+NL + G+ + H CPV+ GS + S
Sbjct: 469 LTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKWVMS 510
>gi|195330778|ref|XP_002032080.1| GM23711 [Drosophila sechellia]
gi|194121023|gb|EDW43066.1| GM23711 [Drosophila sechellia]
Length = 490
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 81/237 (34%), Positives = 133/237 (56%), Gaps = 28/237 (11%)
Query: 36 VAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRII 95
+ + EV E+ E+ + + T ++L CRY P+ R+ PLK EE L P ++
Sbjct: 245 IVASNEVIHFEE-ELATKQNCTAVVQKPSRLHCRYNTSTTPFTRIAPLKMEELSLDPYMV 303
Query: 96 LYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERIS 155
++ DV+YD+EID + L + T + + R SK +++ + + ++
Sbjct: 304 VFHDVVYDTEIDGM-------LNSSNFVLSLTDSGQKSEVRTSKDSYIVDAK-----SLN 351
Query: 156 RRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF------ARPGEANAFKSLGTGNRVAT 209
RV MTG + ++ ++NYG+GGHY HYDF RP + G+R+AT
Sbjct: 352 ERVTDMTGFSMEMSDPFSLINYGLGGHYMLHYDFHEYTNTTRPKQ---------GDRIAT 402
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
VLFY+ +V GGAT+F +N+++ P+KG+A FW+NLH+SG + + H+ACPV++GS
Sbjct: 403 VLFYLGEVDSGGATIFPKINIAVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGS 459
>gi|211938649|gb|ACJ13221.1| FI08532p [Drosophila melanogaster]
Length = 543
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 6/222 (2%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C G VP + + L C Y H P+L+L P+K E + P ++L D++ E LI+
Sbjct: 294 CSGRCQVPRNL-SNLYCVYNHVTSPFLQLAPIKTEILSIDPFVVLLHDMISQKESTLIRT 352
Query: 112 MAQPRL--RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
++ + T + E ++ YR SKS W + ++I+ R+ TGL ++
Sbjct: 353 SSKEHMLPSATTDPDASDDETQVDTYRTSKSVWYSSDFNDTTKKITERLGDATGLDMNST 412
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E QV+NYG+GG +E H D E N F GT +R+AT LFY+++V QGG T F LN
Sbjct: 413 EFYQVINYGLGGFFETHLDMLL-SEKNRFN--GTSDRIATTLFYLNEVRQGGGTYFPRLN 469
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS 271
L+++P+ G+A FW+NL + G+ + H CPV+ GS + S
Sbjct: 470 LTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKWVMS 511
>gi|417402369|gb|JAA48034.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
Length = 529
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 95/245 (38%), Positives = 137/245 (55%), Gaps = 13/245 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVN--NVAPTLEVTEREKYEMLCRGDLTVPPA 61
P ++R N L Y++ L +SP + NV P L+ R YE LC+ + P
Sbjct: 258 PDNKRMARNVLKYEKLLAESPSQAAAEAVIQRPNV-PHLQT--RATYEELCQTLGSQPTH 314
Query: 62 IV-AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
L C Y PYL L P+++E +L+P ++LY D + D E I+ A+P L+R+
Sbjct: 315 YQNPSLHCSYETGASPYLLLQPIRKEVVHLEPYVVLYHDFVNDLEAQKIRGFAEPWLQRS 374
Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
V +GE ++ YRISKSAWL++ P++ + RR+ +TGL T AE LQVVNY
Sbjct: 375 VV---ASGEKQLPVEYRISKSAWLKDTVDPMLVTLDRRIAALTGLDTQPPYAEHLQVVNY 431
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
GIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F N S+ K
Sbjct: 432 GIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKC 490
Query: 238 TAAFW 242
++ W
Sbjct: 491 SSPRW 495
>gi|198459366|ref|XP_002138685.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
gi|198136669|gb|EDY69243.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
Length = 448
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 90/231 (38%), Positives = 125/231 (54%), Gaps = 11/231 (4%)
Query: 40 LEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRD 99
L++ E Y CRG L PP L C Y P LRL P K E P I +Y D
Sbjct: 204 LKIINFEHYVRGCRG-LFDPPK---GLSCHYDFHTHPVLRLAPFKVEPLSQDPYIAMYHD 259
Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVE 159
V+YDSEI+ +K A P + R+ V Y + + R S SA+ + ++ + +++RRV
Sbjct: 260 VIYDSEIEELKDNAFPDMERSKVYTYSDKDGKDTG-RTSMSAFQTDHQYTAVTKVNRRVM 318
Query: 160 HMTG---LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSD 216
HMTG L +++EL V+NY Y H D+ P + + G+R+ATVLFY++D
Sbjct: 319 HMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGPAYSEYIQR---GDRIATVLFYLND 375
Query: 217 VAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
V QGG TVF L + P KG+A ++NL+SS GD T H CPVL G+
Sbjct: 376 VEQGGKTVFPRLGIFRSPMKGSAVVFYNLNSSLQGDPRTEHGGCPVLVGTK 426
>gi|194871348|ref|XP_001972831.1| GG13664 [Drosophila erecta]
gi|190654614|gb|EDV51857.1| GG13664 [Drosophila erecta]
Length = 520
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 87/247 (35%), Positives = 137/247 (55%), Gaps = 23/247 (9%)
Query: 32 KVNNVAPTLEVTEREK-----------YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRL 80
+++N+ L E EK +E+ CRG +V CRY +LRL
Sbjct: 264 ELDNIVSELNDAEVEKELYQVKRSASNFEIGCRGLYRQRTNLV----CRYKSTANTFLRL 319
Query: 81 MPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKS 140
PLK EE L P I +Y +V+YDSEI +K + + Q T EI + +++
Sbjct: 320 APLKFEEISLDPFIAVYHEVLYDSEIHALKGKSGNMVNGYARQRNGT---EIRD-TVARY 375
Query: 141 AWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG-EANAFK 199
W + ERI++R+ MTG + E+LQ+ NYG+G ++EPH+D++ G E
Sbjct: 376 DWWSDTS-LTRERINQRIIDMTGFNFTKDEKLQIANYGVGTYFEPHFDYSSDGFETPEVT 434
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAA 259
+LG +R+A+++FY +V QGGATVF +N++++P+KG+ +W NLH G D ++H+A
Sbjct: 435 TLG--DRLASIIFYAGEVLQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSQHSA 492
Query: 260 CPVLTGS 266
CPV+ G
Sbjct: 493 CPVVNGD 499
>gi|281361323|ref|NP_652183.2| CG15864 [Drosophila melanogaster]
gi|272476864|gb|AAF54202.3| CG15864 [Drosophila melanogaster]
Length = 490
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 78/210 (37%), Positives = 122/210 (58%), Gaps = 19/210 (9%)
Query: 61 AIVAQ----LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
A+V Q L CRY P+ R+ PLK EE L P ++++ DV+YD+EID +
Sbjct: 265 AVVVQKPSRLHCRYNTTTTPFTRIAPLKMEELGLDPYMVVFHDVIYDTEIDGM------- 317
Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
L + T + + R SK +++ + + ++ RV MTG + ++ ++N
Sbjct: 318 LNSSNFGLSLTDSGQKSEVRTSKDSYIVDAK-----TLNERVTDMTGFSMEMSDPFSLIN 372
Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
YG+GGHY HYDF K G+R+ATVLFY+ +V GGAT+F +N+++ P+K
Sbjct: 373 YGLGGHYMLHYDFHEYTNTTRPKQ---GDRIATVLFYLGEVDSGGATIFPMINITVTPKK 429
Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
G+A FW+NLH+SG + + H+ACPV++GS
Sbjct: 430 GSAVFWYNLHNSGAMNLKSLHSACPVISGS 459
>gi|195341558|ref|XP_002037373.1| GM12146 [Drosophila sechellia]
gi|194131489|gb|EDW53532.1| GM12146 [Drosophila sechellia]
Length = 485
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 90/265 (33%), Positives = 142/265 (53%), Gaps = 18/265 (6%)
Query: 16 YQEALNKSP-------ELKDEPPKVNNVAPTLEVTER-----EKYEM--LCRGDLTVPPA 61
Y++AL +SP E ++ +V ++P+ + E EK E+ C G P
Sbjct: 222 YEDALKQSPHDQEIFQEYQNLKRRVLTLSPSEPMREEPNDDIEKMELPPCCSGRCEGPRK 281
Query: 62 IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ +L C Y P+LRL P+K E + P +IL+ D++ +E LI+ ++ ++ +
Sbjct: 282 L-KRLYCVYNCVTAPFLRLAPIKTEILSIDPFVILFHDMVSPTEGALIRSSSKNQILPSE 340
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
N E E+A +R SKS W + ++++R+ TGL +E QV+NYGIGG
Sbjct: 341 TVN-AANEFEVAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 399
Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
+E H+D + E G +R+AT LFY++DV QGGAT F LN++++P+ GT
Sbjct: 400 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 457
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
W+NLH+ G T H CPV+ GS
Sbjct: 458 WYNLHTEGLLHVRTMHTGCPVIVGS 482
>gi|78706702|ref|NP_001027154.1| CG18749 [Drosophila melanogaster]
gi|21429852|gb|AAM50604.1| GH05783p [Drosophila melanogaster]
gi|23175900|gb|AAN14309.1| CG18749 [Drosophila melanogaster]
gi|220956638|gb|ACL90862.1| CG18749-PB [synthetic construct]
Length = 491
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 79/202 (39%), Positives = 120/202 (59%), Gaps = 15/202 (7%)
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
+L CRY P+ R+ PLK EE L P ++++ DV+YD+EID + + L + V
Sbjct: 274 KLHCRYNTSTTPFTRIAPLKMEELGLDPYMVVFHDVIYDTEIDGMLNSSDFGLSES-VSG 332
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
K + R SK + + + + ++ RV MTGL+ ++ ++NYG+GGH+
Sbjct: 333 LK------SEVRTSKDSHIVDAK-----TLNERVTDMTGLSMEMSDPFSLINYGLGGHFI 381
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
H+DF E L G+R+ATVLFY+ +V GGATVF LN+++ P+KG+A FW+N
Sbjct: 382 LHHDFH---EYTNTTRLKQGDRIATVLFYLREVDSGGATVFPMLNITVMPKKGSAVFWYN 438
Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
LH+SG + T H ACPV++GS
Sbjct: 439 LHNSGAVNSKTLHTACPVISGS 460
>gi|195109817|ref|XP_001999478.1| GI23043 [Drosophila mojavensis]
gi|193916072|gb|EDW14939.1| GI23043 [Drosophila mojavensis]
Length = 491
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 77/216 (35%), Positives = 122/216 (56%), Gaps = 7/216 (3%)
Query: 51 LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
+CRG +P + L+CRY P+LRL PLK E+ + P + L + ++D+E++ I
Sbjct: 258 ICRGQRQLP--VSDSLRCRYSAEGSPFLRLAPLKLEQLSIDPYVALCHNAIHDNELEYII 315
Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
+ ++P L+RA V E R++ A + +R+E M+G S +
Sbjct: 316 EQSRPYLKRALVDQGVVHE-----KRVTMDAAFDLNASTHGRTLRQRLEDMSGFDLSNSG 370
Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
+L V+NYGIGGHY H+D ++ A+++ NR+AT+L Y+++V GG T F +L L
Sbjct: 371 QLAVLNYGIGGHYSMHFDCWFSSDSAAYEAYIRSNRIATILLYLNEVQMGGITSFPALGL 430
Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ P KG+A WHN++ + DY T HAACP L G+
Sbjct: 431 GVQPIKGSALIWHNMNHEIECDYRTLHAACPTLLGN 466
>gi|195352182|ref|XP_002042593.1| GM14980 [Drosophila sechellia]
gi|194124477|gb|EDW46520.1| GM14980 [Drosophila sechellia]
Length = 520
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 88/252 (34%), Positives = 136/252 (53%), Gaps = 13/252 (5%)
Query: 17 QEALNKSPELKDEPPKVNNVAPTLEVTERE-KYEMLCRGDLTVPPAIVAQLKCRYVHRNV 75
+E N +L+D V +V R +E+ CRG +V CR+
Sbjct: 259 EEVDNIMSDLRDPHNDVEVEKELYQVKRRSSNFELGCRGLYRQKTNLV----CRFKSTAN 314
Query: 76 PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
+LRL PLK EE L P I +Y +V+YDSEI +K + + + T EI +
Sbjct: 315 TFLRLAPLKLEEISLDPFIAMYHEVLYDSEIHELKGQSMNMVNGYASERNGT---EIRDT 371
Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG-E 194
+ W V ERI++R+ MT S E+LQ+ NYG+G +++PH+D++ G E
Sbjct: 372 VVRYDWW--SNISLVRERINQRIIDMTEFNFSKDEKLQIANYGVGTYFQPHFDYSSDGFE 429
Query: 195 ANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYY 254
+LG +R+A++LFY S+V QGGATVF +N++++P+KG+ +W NLH G D
Sbjct: 430 TPNITTLG--DRLASILFYASEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIR 487
Query: 255 TRHAACPVLTGS 266
++H+ CPV+ G
Sbjct: 488 SKHSVCPVINGD 499
>gi|195438148|ref|XP_002066999.1| GK24258 [Drosophila willistoni]
gi|194163084|gb|EDW77985.1| GK24258 [Drosophila willistoni]
Length = 217
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 78/218 (35%), Positives = 119/218 (54%), Gaps = 7/218 (3%)
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
E+ CRG L P L C Y P+LRL P K EE L P I+L+ + +YD+EI
Sbjct: 2 ELGCRGHLKAPSN--RNLFCSYNSTTTPFLRLAPFKTEEISLDPFILLFHNAIYDNEISY 59
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
K+ + +R A NY T + YRI + + + + RV+ ++GL+
Sbjct: 60 FTKVKRKDMREAHTDNYTTPNEQ---YRIMQVKVYEGIGDKMDKTLLERVKDISGLSAGN 116
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
EL NYG+G ++ H D+ + TG+R+AT+LFY+SDVAQGG T+F
Sbjct: 117 KSELAAGNYGLGSYFPEHSDYRDIKVSPELNE--TGDRLATILFYLSDVAQGGHTIFPLA 174
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
N+++ P+KG+A FW NLH+ G+ + + H CP++ G+
Sbjct: 175 NVTVQPKKGSALFWFNLHNDGEPNIKSLHGVCPIIEGN 212
>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
Length = 498
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 84/220 (38%), Positives = 124/220 (56%), Gaps = 15/220 (6%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
Y+ C G P + L CRY + R+ PLK EE P ++L+ DV+Y+SEID
Sbjct: 263 YKRGCNGVFRAP----SYLHCRYNSTTTAFARIAPLKMEELSHDPYMVLFHDVVYESEID 318
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE-PEHPVIERISRRVEHMTGLTT 166
+ Q L+ + V G+ + + R SK E + V++ + RR+ MTGL
Sbjct: 319 FLLNATQ--LKASLV-----GQYQYSPVRTSKEQHFVEYNDTAVVKTLHRRLNDMTGLDM 371
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
++ L ++NYG+GGHY+ HYD EAN L G+R+ATVLFY+ +V GGAT F
Sbjct: 372 IESDALTLINYGMGGHYDVHYDSHNYSEAN---RLILGDRIATVLFYVGEVDSGGATTFP 428
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+N+S+ P+KG+A W+NL ++G + HA CPV+ GS
Sbjct: 429 YINVSVTPKKGSAVLWYNLDNAGQMNPKAIHAGCPVIVGS 468
>gi|66770649|gb|AAY54636.1| IP12415p [Drosophila melanogaster]
gi|66772017|gb|AAY55320.1| IP12615p [Drosophila melanogaster]
Length = 512
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 91/253 (35%), Positives = 134/253 (52%), Gaps = 14/253 (5%)
Query: 17 QEALNKSPELKDEPPKVNNVAPTLEVTERE--KYEMLCRGDLTVPPAIVAQLKCRYVHRN 74
QE L+ +EP V L +R E+ CRG +V CRY
Sbjct: 250 QEELDNIMSDLNEPQNDVEVEKDLYQVKRSPSNCELGCRGLYRQKTNLV----CRYKSTA 305
Query: 75 VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIAN 134
+LRL PLK EE L P + +Y +V+YDSEI +K + + Q T EI +
Sbjct: 306 NTFLRLAPLKLEEISLDPFMAMYHEVLYDSEIRELKGQSMNMVNGYASQRNGT---EIRD 362
Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG- 193
+ W V ERI++R+ MTG E+LQ+ NYG+G +++PH+D++ G
Sbjct: 363 TVVRYDWW--SNTSLVRERINQRIIDMTGFNFLKDEKLQIANYGLGTYFQPHFDYSSDGF 420
Query: 194 EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
E +LG +R+A++LFY S+V QGGATVF +N++++P+KG+ +W NLH G D
Sbjct: 421 ETPNITTLG--DRLASILFYASEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDI 478
Query: 254 YTRHAACPVLTGS 266
+ H+ CPVL G
Sbjct: 479 RSLHSVCPVLNGD 491
>gi|195575111|ref|XP_002105523.1| GD16991 [Drosophila simulans]
gi|194201450|gb|EDX15026.1| GD16991 [Drosophila simulans]
Length = 542
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 123/222 (55%), Gaps = 6/222 (2%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C G VP + + L C Y H P+L+L P+K E + P ++L D++ E LI+
Sbjct: 293 CSGRCAVPRNL-SSLYCVYNHVTSPFLQLAPIKTEILSVDPFVLLLHDMISQKESTLIRN 351
Query: 112 MAQPRL--RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
++ + T + E ++ YR SKS W + ++I+ R+ TGL T+
Sbjct: 352 SSKEHMLPSATTDPDSSDTETQVDTYRTSKSVWYSSDFNDTTKKITERLGDATGLDTNFT 411
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E QV+NYG+GG +E H D E N F GT +R+AT LFY+++V QGG T F +N
Sbjct: 412 EFYQVINYGLGGFFETHLDMLL-SEKNRFN--GTRDRIATTLFYLNEVRQGGGTYFPRIN 468
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS 271
L+++P+ G+A FW+NL ++G+ + H CPV+ GS + S
Sbjct: 469 LTVFPQPGSALFWYNLDTNGNDHMGSLHTGCPVIVGSKWVMS 510
>gi|221512818|ref|NP_730346.2| CG32201 [Drosophila melanogaster]
gi|220902638|gb|AAN11679.2| CG32201 [Drosophila melanogaster]
Length = 520
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 91/253 (35%), Positives = 134/253 (52%), Gaps = 14/253 (5%)
Query: 17 QEALNKSPELKDEPPKVNNVAPTLEVTERE--KYEMLCRGDLTVPPAIVAQLKCRYVHRN 74
QE L+ +EP V L +R E+ CRG +V CRY
Sbjct: 258 QEELDNIMSDLNEPQNDVEVEKDLYQVKRSPSNCELGCRGLYRQKTNLV----CRYKSTA 313
Query: 75 VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIAN 134
+LRL PLK EE L P + +Y +V+YDSEI +K + + Q T EI +
Sbjct: 314 NTFLRLAPLKLEEISLDPFMAMYHEVLYDSEIRELKGQSMNMVNGYASQRNGT---EIRD 370
Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG- 193
+ W V ERI++R+ MTG E+LQ+ NYG+G +++PH+D++ G
Sbjct: 371 TVVRYDWW--SNTSLVRERINQRIIDMTGFNFLKDEKLQIANYGLGTYFQPHFDYSSDGF 428
Query: 194 EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
E +LG +R+A++LFY S+V QGGATVF +N++++P+KG+ +W NLH G D
Sbjct: 429 ETPNITTLG--DRLASILFYASEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDI 486
Query: 254 YTRHAACPVLTGS 266
+ H+ CPVL G
Sbjct: 487 RSLHSVCPVLNGD 499
>gi|66771935|gb|AAY55279.1| IP12715p [Drosophila melanogaster]
Length = 451
Score = 147 bits (370), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 91/253 (35%), Positives = 134/253 (52%), Gaps = 14/253 (5%)
Query: 17 QEALNKSPELKDEPPKVNNVAPTLEVTERE--KYEMLCRGDLTVPPAIVAQLKCRYVHRN 74
QE L+ +EP V L +R E+ CRG +V CRY
Sbjct: 189 QEELDNIMSDLNEPQNDVEVEKDLYQVKRSPSNCELGCRGLYRQKTNLV----CRYKSTA 244
Query: 75 VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIAN 134
+LRL PLK EE L P + +Y +V+YDSEI +K + + Q T EI +
Sbjct: 245 NTFLRLAPLKLEEISLDPFMAMYHEVLYDSEIRELKGQSMNMVNGYASQRNGT---EIRD 301
Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG- 193
+ W V ERI++R+ MTG E+LQ+ NYG+G +++PH+D++ G
Sbjct: 302 TVVRYDWW--SNTSLVRERINQRIIDMTGFNFLKDEKLQIANYGLGTYFQPHFDYSSDGF 359
Query: 194 EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
E +LG +R+A++LFY S+V QGGATVF +N++++P+KG+ +W NLH G D
Sbjct: 360 ETPNITTLG--DRLASILFYASEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDI 417
Query: 254 YTRHAACPVLTGS 266
+ H+ CPVL G
Sbjct: 418 RSLHSVCPVLNGD 430
>gi|195145084|ref|XP_002013526.1| GL24185 [Drosophila persimilis]
gi|194102469|gb|EDW24512.1| GL24185 [Drosophila persimilis]
Length = 229
Score = 147 bits (370), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 118/202 (58%), Gaps = 11/202 (5%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
L CRY + R+ PLK EE P ++L+ DV+Y+SEID + Q L+ + V
Sbjct: 8 LHCRYNSTTTAFARIAPLKMEELSHDPYMVLFHDVVYESEIDFLLNATQ--LKASLV--- 62
Query: 126 KTGELEIANYRISKSAWLRE-PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
G+ + + R SK E + V++ + RR+ MTGL ++ L ++NYG+GGHY+
Sbjct: 63 --GQYQYSPVRTSKEQHFVEYNDTAVVKTLHRRLNDMTGLDMIESDTLTLINYGMGGHYD 120
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
HYD EAN L G+R+ATVLFY+ +V GGAT F +N+S+ P+KG+A W+N
Sbjct: 121 VHYDSHNYSEAN---RLILGDRIATVLFYVGEVDSGGATTFPYINVSVTPKKGSAVLWYN 177
Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
L +SG + HA CPV+ GS
Sbjct: 178 LDNSGQMNPKAIHAGCPVIVGS 199
>gi|195452770|ref|XP_002073492.1| GK14148 [Drosophila willistoni]
gi|194169577|gb|EDW84478.1| GK14148 [Drosophila willistoni]
Length = 444
Score = 147 bits (370), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 84/229 (36%), Positives = 120/229 (52%), Gaps = 18/229 (7%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
E + +C+ + P L CRY P+LRL P + EE L P ++ Y +V+ D E
Sbjct: 220 EGFNAICQS--SHKPKPTKHLYCRYNTTTTPFLRLAPFRMEELSLNPYMVAYHNVLSDEE 277
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIA-NYRISKSAWLREPEHP-------VIERISRR 157
I + +M+ P L++A + ++I + R +AW E P +I+RI
Sbjct: 278 IRQLNRMSAPLLKKA----FPVSAVDIDYDVRTVDTAWFPNSETPHTKENDRLIKRIVNI 333
Query: 158 VEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDV 217
V +TGL A+ Q V YG GGHY PH+D+ + ++ G+R+ATVLFY++ V
Sbjct: 334 VSDLTGLNADVADSFQAVRYGFGGHYSPHHDYFN---ESIHQTAVNGDRLATVLFYLNTV 390
Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTG 265
GGATVF LNL + EKG FW+NL S D D T H CPV+ G
Sbjct: 391 KHGGATVFPLLNLKVPAEKGKVLFWYNLDGESLDFDENTEHGVCPVVDG 439
>gi|194751823|ref|XP_001958223.1| GF23631 [Drosophila ananassae]
gi|190625505|gb|EDV41029.1| GF23631 [Drosophila ananassae]
Length = 502
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 87/224 (38%), Positives = 122/224 (54%), Gaps = 10/224 (4%)
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
E+ CRG P+ L CRYV +L+L PLK E +QP I+LY DV+Y+ E
Sbjct: 262 ELGCRGKWPKKPS--PTLTCRYVRETHDFLKLAPLKMEFLNMQPLIVLYHDVLYEGEFKS 319
Query: 109 IKKMAQPRLRRATVQNY----KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
++ +A Y K G+ + + R+ K + RI+RR+ MTGL
Sbjct: 320 MRDIAIFNATMGDGWTYVDFDKKGKPKRQD-RVVKMITFQGTTAEFTLRINRRIADMTGL 378
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEA--NAFKSLGTGNRVATVLFYMSDVAQGGA 222
+ L + NYG+GGH+ H D+ + N F LG G+R+AT L Y SDV GG
Sbjct: 379 EMNENMALHLTNYGLGGHFGKHVDYVELAKRPPNFFGDLG-GDRIATALLYASDVPLGGT 437
Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
TVFT L LS+ P+KG+A W NL+++GD D + H+ACPV+ GS
Sbjct: 438 TVFTKLKLSIEPKKGSALIWFNLNNAGDPDPMSEHSACPVVLGS 481
>gi|195341556|ref|XP_002037372.1| GM12148 [Drosophila sechellia]
gi|194131488|gb|EDW53531.1| GM12148 [Drosophila sechellia]
Length = 542
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 91/273 (33%), Positives = 140/273 (51%), Gaps = 22/273 (8%)
Query: 16 YQEALNKSPE----------LKDEPPKVNNVAPTLEVTEREKYEML-----CRGDLTVPP 60
YQ AL SP L+ ++++ P +E E +E L C G VP
Sbjct: 243 YQVALKLSPHDPEIYEEYRILEKRDLTLSDIEP-MEQDEDNSHERLVLPPCCSGRCAVPR 301
Query: 61 AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL--R 118
+ + L C Y H P+L+L P+K E + P ++L D++ E LI+ ++ +
Sbjct: 302 NLNS-LYCVYNHVTSPFLQLAPIKTEILSVDPFVVLLHDMISQKESTLIRNSSKEHMLPS 360
Query: 119 RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYG 178
T + E ++ YR SKS W + ++I+ R+ TGL + E QV+NYG
Sbjct: 361 ATTDPDASDTETQVDTYRTSKSVWYSSDFNDTTKKITERLGDATGLDMNFTEFYQVINYG 420
Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
+GG +E H D E N F GT +R+AT LFY+++V QGG T F LNL+++P+ G+
Sbjct: 421 LGGFFETHLDMLL-SEKNRFN--GTRDRIATTLFYLNEVRQGGGTYFPRLNLTVFPQPGS 477
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS 271
A FW+NL + G+ + H CPV+ GS + S
Sbjct: 478 ALFWYNLDTKGNDHMDSLHTGCPVIVGSKWVMS 510
>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
Length = 455
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 90/231 (38%), Positives = 126/231 (54%), Gaps = 11/231 (4%)
Query: 40 LEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRD 99
L++ E Y CRG L PP L C Y P LRL P K E P I +Y D
Sbjct: 211 LKMINFEHYVRGCRG-LFDPPK---GLSCHYDFHTHPVLRLAPFKVEPLSQDPYIAMYHD 266
Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVE 159
V+YDSEI+ +K A P + R+ V Y + E R S SA+ + ++ + +++RRV
Sbjct: 267 VIYDSEIEELKDNAFPDMERSKVYTY-SDEDSKNTGRTSMSAFQTDHQYKAVTKVNRRVM 325
Query: 160 HMTG---LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSD 216
HMTG L +++EL V+NY Y H D+ P + + + G+R+ATVLFY++D
Sbjct: 326 HMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGPAYS---EYIQRGDRIATVLFYLND 382
Query: 217 VAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
V QGG TVF L + P KG+A ++N++SS GD T H CPVL G+
Sbjct: 383 VEQGGKTVFPRLGIFRSPMKGSAVVFYNMNSSLQGDPRTEHGGCPVLVGTK 433
>gi|195145080|ref|XP_002013524.1| GL24183 [Drosophila persimilis]
gi|194102467|gb|EDW24510.1| GL24183 [Drosophila persimilis]
Length = 296
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 81/209 (38%), Positives = 123/209 (58%), Gaps = 9/209 (4%)
Query: 62 IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
++ L CRY + + RL PLK E P +++Y DV+YD+E+ + + R+ R+
Sbjct: 52 MIKNLHCRYHKKGSAFSRLAPLKLEIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRSM 111
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHP-VIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
VQ Y+ ++EI+ R SK A E P +++RI R++ MTG +E L ++ Y G
Sbjct: 112 VQ-YEIRQIEISEQRTSKEAPFTEKNDPQLLKRIYDRLKDMTGCDMLRSEHLSILLYDQG 170
Query: 181 GHYEPHYDFA----RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
GH++PH D+ P E ++ G+R A+V+FY++DV GG TVF L L + P K
Sbjct: 171 GHHDPHVDYHDLYWHPQE---YEYHPFGDRQASVVFYLNDVEDGGETVFPKLQLVIPPTK 227
Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
G+A WHNL G+GD T+HA+CPVL+G
Sbjct: 228 GSALMWHNLRPWGEGDPRTQHASCPVLSG 256
>gi|328718387|ref|XP_001952104.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
pisum]
Length = 293
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 89/221 (40%), Positives = 125/221 (56%), Gaps = 13/221 (5%)
Query: 51 LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
LC+ V + KCRY N+ Y LMP KEE+ +P I +Y DV+YD EI IK
Sbjct: 59 LCKH--GVSRTLTKYSKCRYQTNNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIK 116
Query: 111 KMAQPRLRRATVQNY--KTGELEIANYRISKSAWLREPEH-PVIERISRRVEHMTGLTTS 167
+A + A V++ K LE R + W+ E + + ++ R+E TG +T
Sbjct: 117 TLALENMNDAHVKSVDGKDDVLE-EKTRSGQVYWISEVDAVEYFDALNTRIESFTGFSTK 175
Query: 168 TAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
TAE+ Q+VNYG+GGHY PH+D FA+ E F GNR+ TVLFY++DV G T F
Sbjct: 176 TAEQYQIVNYGLGGHYLPHHDSFAKGTENVEF-----GNRLVTVLFYLTDVQNDGYTSFP 230
Query: 227 SLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
LN++ +KG A W+NLH S+G Y + H +CP+L G+
Sbjct: 231 LLNINAPVDKGAALVWNNLHMSNGQLFYESLHGSCPLLKGN 271
>gi|198452400|ref|XP_002137470.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
gi|198131917|gb|EDY68028.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
Length = 348
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 85/225 (37%), Positives = 127/225 (56%), Gaps = 5/225 (2%)
Query: 42 VTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVM 101
V E+ Y CRG + L CRY ++ + RL PLK E P +++Y DV+
Sbjct: 101 VLEQRPYFDGCRGAFPTK-SHHHSLHCRYHNKGSAFSRLAPLKLEIFSHDPYVVIYHDVL 159
Query: 102 YDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHP-VIERISRRVEH 160
YD+E+ + + R+ R+ VQ Y+ ++EI+ R SK A E P +++RI R++
Sbjct: 160 YDAEMQGLIDSTRRRMSRSMVQ-YEIRQIEISEQRTSKEAPFTEKNDPQLLKRIYDRLKD 218
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
MTG +E L ++ Y GGH++PH D+ + G +R A+V+FY++DV G
Sbjct: 219 MTGCDMLRSEHLSILLYDQGGHHDPHVDYHDLYWEYEYHPFG--DRQASVVFYLNDVEDG 276
Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
G TVF L L + P KG+A WHNL G+GD T+HA+CPVL+G
Sbjct: 277 GETVFPKLQLVIPPTKGSALMWHNLRPWGEGDPRTQHASCPVLSG 321
>gi|194765182|ref|XP_001964706.1| GF22908 [Drosophila ananassae]
gi|190614978|gb|EDV30502.1| GF22908 [Drosophila ananassae]
Length = 509
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 84/227 (37%), Positives = 124/227 (54%), Gaps = 13/227 (5%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
Y LC+G +P LKC + + L PLK E+ +L P I +Y V+ +I
Sbjct: 262 NYSRLCQGK-RLPEKQDNILKCYLDGKRHAFFTLAPLKVEQVHLDPDITVYHGVLSSKQI 320
Query: 107 DLIKKMAQPRLR-RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
I + + R R+ V + + + R+S+ WL P ++ ++R E++ GLT
Sbjct: 321 SSIFTESNKKERIRSGVAGENGEDRTVKDIRVSQQTWLNYST-PTMQYVNRINEYICGLT 379
Query: 166 TSTAEELQVVNYGIGGHYEPH---YDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
AEE+QV NYG+GG YEPH ++F P + + G+R++T +FY+S+V QGG
Sbjct: 380 MRGAEEMQVANYGVGGQYEPHPDYFEFDLPPDFD-------GDRISTSMFYLSNVQQGGY 432
Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
TVF +LN+ L P KG+ WHNLH S D D T HA CPV+ GS +
Sbjct: 433 TVFPNLNVFLPPVKGSMVLWHNLHYSLDVDARTWHAGCPVIVGSKKI 479
>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 120/219 (54%), Gaps = 12/219 (5%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
YE+ CRG +V CRY P+LRL PLK EE P I+LY +V+YD EI+
Sbjct: 296 YEIGCRGLFPKRTNLV----CRYNFTTTPFLRLAPLKMEEVNHDPYIVLYHEVLYDREIE 351
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
+KK ++ + + + EI I++ AW E + RI +R+ +TG
Sbjct: 352 ELKKQSKNMINGFSEPQQENKIREI----IARHAWWWE-QTTTRARIYQRITDITGFQLF 406
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
EEL V NYG+G + PHYD+ N G + T+LFY+SD+ QGGAT+F S
Sbjct: 407 VQEELNVANYGLGTIFGPHYDYT---PENYDIGWFMGGPLGTILFYVSDLQQGGATIFPS 463
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+N+++ P KG+A W NL+ G+ D T H++CPV+ G
Sbjct: 464 INITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGD 502
>gi|403298096|ref|XP_003939871.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saimiri
boliviensis boliviensis]
Length = 412
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 118/188 (62%), Gaps = 13/188 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
P HQRA GN Y++ + K ++ + PK +A + ER+KYEMLCR
Sbjct: 211 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGIAVDY-LPERQKYEMLCR 269
Query: 54 GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
G+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K
Sbjct: 270 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 329
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+A+PRL RATV + +TG+L A YR+SKSAWL E+PV+ RI+ R++ +TGL STAEE
Sbjct: 330 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 389
Query: 172 LQVVNYGI 179
LQV N+ I
Sbjct: 390 LQVGNHII 397
>gi|355752458|gb|EHH56578.1| hypothetical protein EGM_06023, partial [Macaca fascicularis]
Length = 586
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 93/235 (39%), Positives = 136/235 (57%), Gaps = 13/235 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
P ++R N L Y+ L +SP ++ E P L+ R+ YE LC+ L P +
Sbjct: 245 PDNKRMARNVLKYERLLAESPNQVVAEAVIQRPNIPHLQT--RDTYEGLCQ-TLGSQPTL 301
Query: 63 --VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
+ L C Y + YL L P+++E +L+P I LY D + DSE I++ A+P L+R+
Sbjct: 302 YQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRS 361
Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
V +GE ++ YRISKSAWL++ P++ ++ R+ +TGL AE LQVVNY
Sbjct: 362 VV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNY 418
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
GIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F NLS+
Sbjct: 419 GIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSV 472
>gi|66771513|gb|AAY55068.1| IP12095p [Drosophila melanogaster]
Length = 538
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 91/265 (34%), Positives = 138/265 (52%), Gaps = 18/265 (6%)
Query: 16 YQEALNKSP-------ELKDEPPKVNNVAPTLEVTER-----EKYEM--LCRGDLTVPPA 61
YQ AL SP E ++ +V ++P+ + E E+ E+ C G P
Sbjct: 241 YQAALKHSPHDLEIFQEYQNLKRRVLTLSPSEPIREEPNDDIEEMELPPCCSGRCEGPRK 300
Query: 62 IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ +L C Y P+LRL P+K E + P +IL D++ E LI+ ++ ++ +
Sbjct: 301 L-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE 359
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
N E EIA +R SKS W + ++++R+ TGL +E QV+NYGIGG
Sbjct: 360 TVN-AANEFEIAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 418
Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
+E H+D + E G +R+AT LFY++DV QGGAT F LN++++P+ GT
Sbjct: 419 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 476
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
W+NLH+ G T H CPV+ GS
Sbjct: 477 WYNLHTEGMLHVRTMHTGCPVIVGS 501
>gi|195159160|ref|XP_002020450.1| GL13507 [Drosophila persimilis]
gi|194117219|gb|EDW39262.1| GL13507 [Drosophila persimilis]
Length = 543
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 89/269 (33%), Positives = 140/269 (52%), Gaps = 17/269 (6%)
Query: 11 GNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLC------RGDLTVPPAIVA 64
G+K + +A + + PP+ + TE K+ LC + D + + A
Sbjct: 249 GDKTFGDKAYHIVSHFQKHPPQQSINMENGNFTE--KFNRLCSSMSRRKTDGSAAHSKPA 306
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
+L CRY +LRL PL+ EE L P I+LY V+ D E+ ++ M+ P L RA V +
Sbjct: 307 RLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHSVLSDEEMARLENMSTPLLHRARVFD 366
Query: 125 YKTGELEIANYRISKSAWLREP-----EHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
+ +I+ R + + P + ++E I +R+ +TGL ++ +Q + YG
Sbjct: 367 SGIRKPKISPARTADEVQIPNPKLVAEDIQLVECIQKRITDLTGLMLTSMRRIQFLKYGF 426
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GG Y PH+DF + S G+R+ATV+FY++DV GGAT F +L+L + E+G
Sbjct: 427 GGIYVPHHDFF---SVHTPTSRLHGDRIATVIFYLNDVEHGGATAFPNLDLVVPTERGAV 483
Query: 240 AFWHNLH-SSGDGDYYTRHAACPVLTGSN 267
FWHN+ + D DY T H ACPV+ G+
Sbjct: 484 LFWHNMDGETYDLDYRTLHGACPVIVGTK 512
>gi|355566863|gb|EHH23242.1| hypothetical protein EGK_06672, partial [Macaca mulatta]
Length = 583
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 93/235 (39%), Positives = 136/235 (57%), Gaps = 13/235 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
P ++R N L Y+ L +SP ++ E P L+ R+ YE LC+ L P +
Sbjct: 242 PDNKRMARNVLKYERLLAESPNQVVAEAVIQRPNIPHLQT--RDTYEGLCQ-TLGSQPTL 298
Query: 63 --VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
+ L C Y + YL L P+++E +L+P I LY D + DSE I++ A+P L+R+
Sbjct: 299 YQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRS 358
Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
V +GE ++ YRISKSAWL++ P++ ++ R+ +TGL AE LQVVNY
Sbjct: 359 VV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNY 415
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
GIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F NLS+
Sbjct: 416 GIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSV 469
>gi|261245137|gb|ACX54875.1| FI12021p [Drosophila melanogaster]
Length = 538
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 91/265 (34%), Positives = 138/265 (52%), Gaps = 18/265 (6%)
Query: 16 YQEALNKSP-------ELKDEPPKVNNVAPTLEVTER-----EKYEM--LCRGDLTVPPA 61
YQ AL SP E ++ +V ++P+ + E E+ E+ C G P
Sbjct: 241 YQAALKHSPHDLEIFQEYQNLKRRVLTLSPSEPIREEPNDDIEEMELPPCCSGRCEGPRK 300
Query: 62 IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ +L C Y P+LRL P+K E + P +IL D++ E LI+ ++ ++ +
Sbjct: 301 L-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE 359
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
N E EIA +R SKS W + ++++R+ TGL +E QV+NYGIGG
Sbjct: 360 TVN-AANEFEIAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 418
Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
+E H+D + E G +R+AT LFY++DV QGGAT F LN++++P+ GT
Sbjct: 419 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 476
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
W+NLH+ G T H CPV+ GS
Sbjct: 477 WYNLHTEGMLHVRTMHTGCPVIVGS 501
>gi|116008537|ref|NP_733379.2| CG31524, isoform A [Drosophila melanogaster]
gi|113194861|gb|AAN14239.2| CG31524, isoform A [Drosophila melanogaster]
Length = 536
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 91/265 (34%), Positives = 138/265 (52%), Gaps = 18/265 (6%)
Query: 16 YQEALNKSP-------ELKDEPPKVNNVAPTLEVTER-----EKYEM--LCRGDLTVPPA 61
YQ AL SP E ++ +V ++P+ + E E+ E+ C G P
Sbjct: 239 YQAALKHSPHDLEIFQEYQNLKRRVLTLSPSEPIREEPNDDIEEMELPPCCSGRCEGPRK 298
Query: 62 IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ +L C Y P+LRL P+K E + P +IL D++ E LI+ ++ ++ +
Sbjct: 299 L-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE 357
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
N E EIA +R SKS W + ++++R+ TGL +E QV+NYGIGG
Sbjct: 358 TVN-AANEFEIAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 416
Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
+E H+D + E G +R+AT LFY++DV QGGAT F LN++++P+ GT
Sbjct: 417 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 474
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
W+NLH+ G T H CPV+ GS
Sbjct: 475 WYNLHTEGMLHVRTMHTGCPVIVGS 499
>gi|66770643|gb|AAY54633.1| IP12395p [Drosophila melanogaster]
Length = 538
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 91/265 (34%), Positives = 138/265 (52%), Gaps = 18/265 (6%)
Query: 16 YQEALNKSP-------ELKDEPPKVNNVAPTLEVTER-----EKYEM--LCRGDLTVPPA 61
YQ AL SP E ++ +V ++P+ + E E+ E+ C G P
Sbjct: 241 YQAALKHSPHDLEIFQEYQNLKRRVLTLSPSEPIREEPNDDIEEMELPPCCSGRCEGPRK 300
Query: 62 IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ +L C Y P+LRL P+K E + P +IL D++ E LI+ ++ ++ +
Sbjct: 301 L-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE 359
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
N E EIA +R SKS W + ++++R+ TGL +E QV+NYGIGG
Sbjct: 360 TVN-AANEFEIAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 418
Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
+E H+D + E G +R+AT LFY++DV QGGAT F LN++++P+ GT
Sbjct: 419 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 476
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
W+NLH+ G T H CPV+ GS
Sbjct: 477 WYNLHTEGMLHVRTMHTGCPVIVGS 501
>gi|116008130|ref|NP_001036777.1| CG31524, isoform B [Drosophila melanogaster]
gi|113194860|gb|ABI31221.1| CG31524, isoform B [Drosophila melanogaster]
Length = 535
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 91/265 (34%), Positives = 138/265 (52%), Gaps = 18/265 (6%)
Query: 16 YQEALNKSP-------ELKDEPPKVNNVAPTLEVTER-----EKYEM--LCRGDLTVPPA 61
YQ AL SP E ++ +V ++P+ + E E+ E+ C G P
Sbjct: 238 YQAALKHSPHDLEIFQEYQNLKRRVLTLSPSEPIREEPNDDIEEMELPPCCSGRCEGPRK 297
Query: 62 IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ +L C Y P+LRL P+K E + P +IL D++ E LI+ ++ ++ +
Sbjct: 298 L-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE 356
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
N E EIA +R SKS W + ++++R+ TGL +E QV+NYGIGG
Sbjct: 357 TVN-AANEFEIAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 415
Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
+E H+D + E G +R+AT LFY++DV QGGAT F LN++++P+ GT
Sbjct: 416 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 473
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
W+NLH+ G T H CPV+ GS
Sbjct: 474 WYNLHTEGMLHVRTMHTGCPVIVGS 498
>gi|195128343|ref|XP_002008623.1| GI13594 [Drosophila mojavensis]
gi|193920232|gb|EDW19099.1| GI13594 [Drosophila mojavensis]
Length = 511
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 126/229 (55%), Gaps = 22/229 (9%)
Query: 45 REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
+E Y + CRG P L CRY P+LRL P K EE L P I+LY +V+ D
Sbjct: 274 QEPYYLGCRG--GYPKR--TNLHCRYNTTTTPFLRLAPFKMEEVSLDPYIVLYHNVISDR 329
Query: 105 EIDLIKKMAQPRLRRATVQNYKTG-----ELEIAN--YRISKSAWLREPEHPVIERISRR 157
EI+ +K+ A N+ G +L + + +++ W+R+ P +RI+ R
Sbjct: 330 EIEDMKQHAT---------NFANGLSISPDLNVTDKPQIVARMQWVRKMT-PFTDRINLR 379
Query: 158 VEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDV 217
+ +TG + +Q+ NYGIGGH+ PH+D+ P G G+R AT++FY S+V
Sbjct: 380 ITDITGFEVDEFKAVQIGNYGIGGHFMPHFDYTTPDRLRIEDIYGLGDRTATIVFYASEV 439
Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
QGGATVF ++ +++ P+KG+A W+NL + + H ACPV++GS
Sbjct: 440 -QGGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLSLHTACPVISGS 487
>gi|194373965|dbj|BAG62295.1| unnamed protein product [Homo sapiens]
Length = 604
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 94/239 (39%), Positives = 137/239 (57%), Gaps = 21/239 (8%)
Query: 4 PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
P ++R N L Y+ L +SP E + P + P L+ R+ YE LC+ L
Sbjct: 258 PDNKRMARNVLKYERLLAESPNHVVAEAVIQRPNI----PHLQT--RDTYEGLCQ-TLGS 310
Query: 59 PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
P + + L C Y + YL L P+++E +L+P I LY D + DSE I+++A+P
Sbjct: 311 QPTLYQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPW 370
Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
L+R+ V +GE ++ YRISKSAWL++ P + ++ R+ +TGL AE LQ
Sbjct: 371 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQ 427
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
VVNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F NLS+
Sbjct: 428 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSV 485
>gi|195575113|ref|XP_002105524.1| GD16980 [Drosophila simulans]
gi|194201451|gb|EDX15027.1| GD16980 [Drosophila simulans]
Length = 518
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 91/265 (34%), Positives = 140/265 (52%), Gaps = 19/265 (7%)
Query: 16 YQEALNKSP---ELKDEPPKVNNVAPTLEVTE---------REKYEM--LCRGDLTVPPA 61
Y++AL +SP E+ E + V TL ++E E+ E+ C G P
Sbjct: 222 YEDALKQSPHDQEIFQEYQHLKKVL-TLSLSEPIREEPNDDNEEMELPHCCSGRCERPQK 280
Query: 62 IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
+ +L C Y P+LRL P+K E + P +IL D++ +E LI+ ++ ++ +
Sbjct: 281 L-KRLYCVYNCITAPFLRLAPIKTEILSVDPFVILLHDMVSPTEGALIRSSSKNQILPSE 339
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
N E E+A +R SKS W + ++++R+ TGL +E QV+NYGIGG
Sbjct: 340 TVN-AANEFEVAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 398
Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
+E H+D + E G +R+AT LFY++DV QGGAT F LN++++P+ GT
Sbjct: 399 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 456
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
W+NLH+ G T H CPV+ GS
Sbjct: 457 WYNLHTEGLLHVRTMHTGCPVIVGS 481
>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
Length = 548
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/222 (36%), Positives = 117/222 (52%), Gaps = 11/222 (4%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYL-QPRIILYRDVMYDS 104
E++ LCRG+ P L C H N P+L L P+K E + + R+ ++R
Sbjct: 293 ERFRRLCRGETLYHPQ--RPLTCELKHYNQPHLFLKPIKVEHLHEGRQRLQVFRQFASPE 350
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
E ++ + RL RA + G + +RIS +AWL+ +++RI R+E T +
Sbjct: 351 ECRHLQHAGKRRLERAVA--WTDGRFQPVEFRISTAAWLQPDHDAIVKRIHGRIEDATQV 408
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
AE LQ+ NYG+GG YEPH+D + G + G R+AT + Y++ V QGG T
Sbjct: 409 DIEYAEALQISNYGMGGFYEPHFDHSSRG------TNPDGERLATFMIYLNPVKQGGFTA 462
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F L ++ P G A FW+NL SG GD T H ACPVL GS
Sbjct: 463 FPRLGAAVQPGYGDAVFWYNLQPSGVGDPLTLHGACPVLRGS 504
>gi|195505197|ref|XP_002099400.1| GE10884 [Drosophila yakuba]
gi|194185501|gb|EDW99112.1| GE10884 [Drosophila yakuba]
Length = 527
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 87/240 (36%), Positives = 121/240 (50%), Gaps = 15/240 (6%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
Y LC+G LKC + Y L PL+ E +L P I +Y ++ I
Sbjct: 281 YTRLCQGRRLPEERSGDPLKCYLDGKRHAYFILAPLQVEPVHLDPDINVYHGMLSSKHIQ 340
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
I + A + + G + + R+S+ WL + + PV++ + R +E ++G +
Sbjct: 341 SIFEEADKKEMVRSAVAGDGGARTVKDLRVSQQTWL-DYKSPVMKSVGRIIEFVSGFDMA 399
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
AE +QV NYG+GG YEPH D+ F G+R++T +FY+SDV QGG TVFT
Sbjct: 400 GAEFMQVANYGVGGQYEPHPDYFEVNLPEEF----IGDRISTSMFYLSDVEQGGYTVFTK 455
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNS-----LHSTC-----PCGL 277
LN+ L P KG WHNLH S D D T HA CPV+ GS +HS PCGL
Sbjct: 456 LNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIVGSKRIGNIWMHSGYQEFRRPCGL 515
>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
Length = 547
Score = 143 bits (361), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 80/222 (36%), Positives = 121/222 (54%), Gaps = 6/222 (2%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C G V + L C Y H P+L+L P+K E + P ++L+ D++ E LI+
Sbjct: 298 CSGRCEVSRNLTG-LYCVYNHVTSPFLQLAPIKTEILSIDPFVLLFHDMISQKESTLIRS 356
Query: 112 MAQPR-LRRATVQNYKTG-ELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
++ L AT +G E +A +R SKS W + +RI+ R+ TGL +
Sbjct: 357 SSKEHMLPSATTDVDASGSEDHVATFRTSKSVWYSSTSNDTTKRITERLGDATGLDMNFT 416
Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
E QV+NYG+GG +E H D + + GT +R+AT LFY+++V QGG T F LN
Sbjct: 417 EYFQVINYGLGGFFETHLDMLLSDRS---RFNGTRDRLATTLFYLNEVRQGGGTHFPRLN 473
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS 271
L+++P+ G+A FW+NL + G+ T H CPV+ GS + S
Sbjct: 474 LTVFPQPGSALFWYNLDTRGNDHTSTLHTGCPVIVGSKWVMS 515
>gi|339261892|ref|XP_003367679.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
gi|316962562|gb|EFV48687.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
Length = 319
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 84/212 (39%), Positives = 118/212 (55%), Gaps = 32/212 (15%)
Query: 2 IFPTHQRAQGNKLYYQEALNKSPELK----DEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
I P H RA+GN +Y + L K + D PP VN + ER+ +E LCRG+
Sbjct: 109 IKPDHPRAEGNVKWYLDLLAKEGVSRVTDHDLPPIVNARPNDQALPERKDFEALCRGEYL 168
Query: 58 VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
+ ++L C Y R+ P+L L P+K E + +P+I+++R V+ +EI ++K +A PRL
Sbjct: 169 LTEKQRSRLYC-YYKRDTPFLSLAPIKVEVMHWKPKIVIFRQVISANEIAVLKTLAYPRL 227
Query: 118 RRATVQNYKTGELEIA---------------------------NYRISKSAWLREPEHPV 150
RATVQN +TGELE A +YRISKSAWL+E EHPV
Sbjct: 228 SRATVQNSETGELETAKYRISKRCRTLRRATVHNKETGQLEHASYRISKSAWLKEHEHPV 287
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGH 182
++RI +R+ MT L TAE+LQ YG+GG
Sbjct: 288 VDRIVKRIHDMTNLNMETAEDLQNATYGLGGQ 319
>gi|161076739|ref|NP_001097101.1| CG34345 [Drosophila melanogaster]
gi|157400090|gb|ABV53635.1| CG34345 [Drosophila melanogaster]
Length = 504
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 17/222 (7%)
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
E C+G PP QL CRY P++R+ PLKEEE P I LY DV+YDSEI
Sbjct: 275 EQGCQGKF--PPG--PQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHDVIYDSEIAQ 330
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH---PVIERISRRVEHMTGLT 165
+ + + + T NY T + R+++ ++ + + + + R+ ++GL
Sbjct: 331 LTNVTREEMILGTTTNYTTPD------RVNRLFHIKVTDDDGGKLDKTLVNRMADISGLD 384
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATVLFYMSDVAQGGATV 224
L +NYG+GG+++ H D+ + + L G+R+ T LFYM+DV GG T+
Sbjct: 385 VGNTTTLARINYGLGGYFQEHSDYM---DIKLYPELTEEGDRLMTFLFYMTDVPVGGTTI 441
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F L++ P+KG+A FW+NLH++GD + TRHA CP + GS
Sbjct: 442 FPGAQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGS 483
>gi|92109908|gb|ABE73278.1| IP10618p [Drosophila melanogaster]
Length = 501
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 17/222 (7%)
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
E C+G PP QL CRY P++R+ PLKEEE P I LY DV+YDSEI
Sbjct: 272 EQGCQGKF--PPG--PQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHDVIYDSEIAQ 327
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH---PVIERISRRVEHMTGLT 165
+ + + + T NY T + R+++ ++ + + + + R+ ++GL
Sbjct: 328 LTNVTREEMILGTTTNYTTPD------RVNRLFHIKVTDDDGGKLDKTLVNRMADISGLD 381
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATVLFYMSDVAQGGATV 224
L +NYG+GG+++ H D+ + + L G+R+ T LFYM+DV GG T+
Sbjct: 382 VGNTTTLARINYGLGGYFQEHSDYM---DIKLYPELTEEGDRLMTFLFYMTDVPVGGTTI 438
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F L++ P+KG+A FW+NLH++GD + TRHA CP + GS
Sbjct: 439 FPGAQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGS 480
>gi|195505216|ref|XP_002099408.1| GE23378 [Drosophila yakuba]
gi|194185509|gb|EDW99120.1| GE23378 [Drosophila yakuba]
Length = 546
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 81/214 (37%), Positives = 118/214 (55%), Gaps = 8/214 (3%)
Query: 60 PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI----KKMAQP 115
P + +L C Y P+LRL P+K E + P I+L D++ E L+ K M P
Sbjct: 299 PRKLKRLYCVYNGVTAPFLRLAPIKTEILSIDPFIVLLHDMVSVEEGALLRTFSKNMISP 358
Query: 116 R--LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+ + E E+ ++R SKS WL + ++++R+ TGL S +E Q
Sbjct: 359 SETAELSDSEEKSIFEFEVGSFRTSKSVWLDNDANEATLKLTQRLGDATGLDISHSEPFQ 418
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V+NYGIGG +E H+D + E N F G +R+AT LFY++DV QGGAT F LN++++
Sbjct: 419 VINYGIGGIFESHFDTSLQDE-NRFLD-GYMDRLATTLFYLNDVPQGGATHFPGLNITVF 476
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
P+ GTA FW+NL + G T H CPV+ GS
Sbjct: 477 PKFGTALFWYNLDTKGLLRLRTMHTGCPVIVGSK 510
>gi|328707957|ref|XP_001947811.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
pisum]
Length = 507
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 84/222 (37%), Positives = 118/222 (53%), Gaps = 12/222 (5%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
+ LCR +V P + KCRY +N PY +MP KEE+ P I LY D++YD EI
Sbjct: 267 FRYLCREGKSVRP-LTYDSKCRYQTKNSPYRMIMPFKEEDISSNPNIKLYHDIIYDEEIK 325
Query: 108 LIKKMAQPRLRRATVQNYKTGELE-IANYRISKSAWLREPEHPVI-ERISRRVEHMTGLT 165
I MA L A Y G++ + + R+ + W E +P++ +++ R+E +T T
Sbjct: 326 TITDMASKDLSDAAY--YFNGKITLLDDQRLGQLKWFSENANPILFGKLNDRIECITEYT 383
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
T TAE Q +NYG+GGH+ H D G GNR+ T+LFYM+DV G TVF
Sbjct: 384 TKTAEGYQTINYGLGGHFSVHMDAFTDGPK------LNGNRLVTILFYMTDVPDDGYTVF 437
Query: 226 TSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
+LN KG+A W NL ++G T H CPV+ G+
Sbjct: 438 PNLNYVAHCRKGSALVWLNLRLNNGSVHSGTFHGGCPVIKGN 479
>gi|405964866|gb|EKC30308.1| KRR1 small subunit processome component-like protein [Crassostrea
gigas]
Length = 885
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 89/246 (36%), Positives = 133/246 (54%), Gaps = 28/246 (11%)
Query: 43 TEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMY 102
TE YE LCR + + A+L+C +PY + KEE +PRI ++ DV+
Sbjct: 614 TEDAMYEALCREEQKSLHEL-AKLRCFLRDTVIPYYKA---KEEVVNYEPRIAIFHDVIS 669
Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTG---ELEIA-----NYRISKSAWLREPEHPVIERI 154
+ I+ +K +A L R+TV TG ++ I N R+S++ W+R E+P + R+
Sbjct: 670 STSIEHLKSIASKGLTRSTVFLENTGPNGQVTITYGKQDNIRVSQTCWIRTDEYPELLRL 729
Query: 155 SRRVEHMTGLTT------STAEELQVVNYGIGGHYEPHYDF--------ARPGEANAFKS 200
R++ +TGL+ S +E+ QVVNYG+GG Y H+D+ + P ++ +
Sbjct: 730 ENRIQLITGLSAEYKPVRSHSEKFQVVNYGVGGMYTAHHDYTGYKLGIISNPMDSEDIST 789
Query: 201 LGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAAC 260
+G+R+AT +FYM+D GGATVF + + KG AAFW NL SG D T H C
Sbjct: 790 --SGDRMATWMFYMNDAKAGGATVFPEVRTRIPVAKGGAAFWFNLRPSGATDPRTLHGGC 847
Query: 261 PVLTGS 266
PVL GS
Sbjct: 848 PVLVGS 853
>gi|195452772|ref|XP_002073493.1| GK14149 [Drosophila willistoni]
gi|194169578|gb|EDW84479.1| GK14149 [Drosophila willistoni]
Length = 496
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 83/214 (38%), Positives = 109/214 (50%), Gaps = 23/214 (10%)
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV-- 122
+L CRY P+LRL P + EE L P I+ Y +V+ D EI + ++ L++
Sbjct: 265 KLHCRYNTTTTPFLRLAPFRMEELSLDPYIVAYYNVLSDQEITQLDRLTATLLKKTFAIG 324
Query: 123 --QNYKTGELEIANYRISKSAWLREPEHP-------VIERISRRVEHMTGLTTSTAEELQ 173
+Y N R + AW E P +IERI V +TGL A+ Q
Sbjct: 325 PDDDYDD------NARTADGAWFPNNETPRTEENIQLIERIINLVSDLTGLQGDKADSFQ 378
Query: 174 VVNYGIGGHYEPHYDFARPG-EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
V YG GGHY PH+D+ + AF G+R+ATV FY++ V GGATVF SLNL +
Sbjct: 379 AVRYGFGGHYTPHFDYLNMSIDQTAF----YGDRLATVFFYLNTVKHGGATVFPSLNLKV 434
Query: 233 WPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTG 265
EKG FW+NL S D D T H CPV+ G
Sbjct: 435 PAEKGKVLFWYNLDGESFDFDENTEHGGCPVVDG 468
>gi|198477148|ref|XP_002136736.1| GA29214 [Drosophila pseudoobscura pseudoobscura]
gi|198145041|gb|EDY71753.1| GA29214 [Drosophila pseudoobscura pseudoobscura]
Length = 520
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 80/203 (39%), Positives = 110/203 (54%), Gaps = 6/203 (2%)
Query: 65 QLKCRYV-HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
L C Y+ R P+L L ++ E P I+LY DV+ S++ ++ ++P L AT
Sbjct: 293 HLHCFYLTKRGSPFLLLARVRTEILSDDPFIVLYYDVLTHSDMVSLRNTSEPLLHPATTI 352
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
Y E++N R + WL R R + +TGL S +E QV NYGIGG +
Sbjct: 353 QYLNAPQELSNSRTAHFVWLEPTITEATRRADRVLWDVTGLNLSNSEMFQVNNYGIGGSF 412
Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
H D E N R+AT +FY+SDV QGGAT+FT LN++++P+ GT FW+
Sbjct: 413 MRHSDLLH-SERNYL----VRERIATAIFYLSDVPQGGATLFTELNVTVFPQAGTVLFWY 467
Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
NL SGD D TRH CPV+ GS
Sbjct: 468 NLAHSGDHDMRTRHTGCPVIGGS 490
>gi|241044301|ref|XP_002407178.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215492128|gb|EEC01769.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 554
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 76/202 (37%), Positives = 121/202 (59%), Gaps = 11/202 (5%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E + Y+ LCRG+ P + ++L+CRY + +L P+K EEA L+P I++ +V+ D
Sbjct: 287 ETQNYKRLCRGEQLRTPKMDSKLRCRYYKGQHGFFKLQPIKVEEANLKPYIVVMHNVIQD 346
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I+ + A+PRL+R+T +Y +E + R S +AWL + + PV R++R + + G
Sbjct: 347 RDIEDLMAFAKPRLQRST--HYGVRGMEASQVRTSSNAWLNDLDAPVATRLNRFLRSLLG 404
Query: 164 LTTS----TAEELQVVNYGIGGHYEPHYDFAR-----PGEANAFKSLGTGNRVATVLFYM 214
L T+ AE+ Q+ NYGIGG Y H+D+ + P +G+R+AT++ YM
Sbjct: 405 LGTTYLGGEAEQYQLANYGIGGQYMSHHDYLQDTYHIPNRVTDDFEKTSGDRIATLMVYM 464
Query: 215 SDVAQGGATVFTSLNLSLWPEK 236
SDV +GGATVF SL + L P+K
Sbjct: 465 SDVEEGGATVFPSLGVRLTPKK 486
>gi|195575095|ref|XP_002105515.1| GD21523 [Drosophila simulans]
gi|194201442|gb|EDX15018.1| GD21523 [Drosophila simulans]
Length = 527
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 91/267 (34%), Positives = 135/267 (50%), Gaps = 20/267 (7%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
PTH Q K Y E+ +K+ P + Y LC+G
Sbjct: 250 PTHSAQQTQK--YLESRVSGKNVKETNP-----------SWFSNYTRLCQGRRLPEERSG 296
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI-DLIKKMAQPRLRRATV 122
L C + Y L PL+ E +L P I +Y ++ +I + ++ + + R+ V
Sbjct: 297 DPLSCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSSKQILSIFEEADKEEMVRSAV 356
Query: 123 QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGH 182
G+ + + R+S+ WL + + PV+ +SR ++ ++G + AE +QV NYG+GG
Sbjct: 357 AG-DGGKRTVRDLRVSQQTWL-DYKSPVMNSVSRIIQFVSGFDMAGAEYMQVANYGVGGQ 414
Query: 183 YEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
YEPH D+ E N K+ G+R++T +FY+SDV QGG TVFT LN+ L P KG W
Sbjct: 415 YEPHPDYF---EVNLPKNF-EGDRISTSMFYLSDVEQGGYTVFTKLNVFLPPVKGALVMW 470
Query: 243 HNLHSSGDGDYYTRHAACPVLTGSNSL 269
HNLH S D D T HA CPV+ GS +
Sbjct: 471 HNLHRSLDVDARTLHAGCPVIVGSKRI 497
>gi|241044303|ref|XP_002407179.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215492129|gb|EEC01770.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 456
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 80/217 (36%), Positives = 125/217 (57%), Gaps = 11/217 (5%)
Query: 51 LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
LCRG+ + L C Y + PY ++ P+K E+ P ++ + DV++ EI +
Sbjct: 224 LCRGEKIRNASEEKDLFCLYDVPH-PYFKIGPVKVEQMNKNPYVLQFYDVLWPQEIKAFR 282
Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
+M P+L RATV++ T +++ R+S+ AW+ +++R++ RV +TGL+
Sbjct: 283 RMGDPQLERATVRD--TARNTVSHARVSQVAWISPDSDVLLDRVNARVAMLTGLS----H 336
Query: 171 ELQVVN-YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
L+ N YG GGHYEPH+D+ E + LG G+R+AT +FY+SDV GG+TVF
Sbjct: 337 RLRKYNSYGPGGHYEPHHDYLE--ELDEVDKLG-GDRIATFMFYLSDVNLGGSTVFPYAK 393
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ P+ G+AAFW+N+ G D T H AC VL G+
Sbjct: 394 AGVMPKMGSAAFWYNMREDGSYDRATLHGACSVLHGT 430
>gi|20177113|gb|AAM12259.1| RE23792p [Drosophila melanogaster]
gi|220948174|gb|ACL86630.1| PH4alphaSG2-PB [synthetic construct]
gi|220960438|gb|ACL92755.1| PH4alphaSG2-PB [synthetic construct]
Length = 301
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 81/223 (36%), Positives = 121/223 (54%), Gaps = 7/223 (3%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI- 106
Y LC+G L+C + Y L PL+ E +L P I +Y ++ +I
Sbjct: 55 YTRLCQGRRLPEERSGDPLRCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSSKQIL 114
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
+ ++ + + R+ V GE + + R+S+ WL + + PV+ + R ++ ++G
Sbjct: 115 SIFEEADKEEMVRSAVAG-SGGEGTVRDLRVSQQTWL-DYKSPVMNSVGRIIQFVSGFDM 172
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+ AE +QV NYG+GG YEPH D+ E N K+ G+R++T +FY+SDV QGG TVFT
Sbjct: 173 AGAEHMQVANYGVGGQYEPHPDYF---EVNLPKNF-EGDRISTSMFYLSDVEQGGYTVFT 228
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
LN+ L P KG WHNLH S D T HA CPV+ GS +
Sbjct: 229 KLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSKRI 271
>gi|195061021|ref|XP_001995909.1| GH14207 [Drosophila grimshawi]
gi|193891701|gb|EDV90567.1| GH14207 [Drosophila grimshawi]
Length = 477
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 126/227 (55%), Gaps = 22/227 (9%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
EKY LC P+ +L C Y + P+LRL PLK E + P ++++ + +YDSE
Sbjct: 246 EKYTRLCGASHKPKPS---RLICNYKMDSSPFLRLAPLKMEMLSMDPYVVVFHEAIYDSE 302
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE-----PEHPVIERISRRVEH 160
ID ++++ + RL R + K G+ + + R S W+ E + ++ERI RRV
Sbjct: 303 IDELRRLCESRLSRTEIA--KQGKNK--SIRSSSGVWIFELDLNRQQLELLERIRRRVAD 358
Query: 161 MTGLTTS-TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
M+GL ++E+Q + Y GGHY PH+DF +R+ATVLFY++DVA+
Sbjct: 359 MSGLLIDFNSQEVQYMEYVFGGHYYPHWDFKGIPHLE--------DRIATVLFYLNDVAR 410
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTG 265
GGAT+F L L + PE+G WHN+ + D + + H ACPV+ G
Sbjct: 411 GGATIFPDLELLVQPERGKVLHWHNMDLGTYDLEKRSLHGACPVIMG 457
>gi|390176894|ref|XP_002136933.2| GA26862 [Drosophila pseudoobscura pseudoobscura]
gi|388858830|gb|EDY67491.2| GA26862 [Drosophila pseudoobscura pseudoobscura]
Length = 520
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 78/203 (38%), Positives = 112/203 (55%), Gaps = 6/203 (2%)
Query: 65 QLKCRYV-HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
L C Y+ R P+L L ++ E P I LY DV+ S++ ++ ++P L AT
Sbjct: 293 HLHCFYLTKRGSPFLLLARVRTEILSDDPFIALYYDVLTHSDMVSLRNTSEPLLHPATTI 352
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
Y E++N R + WL R R + +TGL S +E+ QV NYGIGG +
Sbjct: 353 QYLNAPQELSNSRTAHFVWLEPTITEATRRADRVLWDVTGLNLSNSEKFQVNNYGIGGSF 412
Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
H D ++ ++ R+AT +FY+SDV QGGAT+FT LN++++P+ GT FW+
Sbjct: 413 MRHSD-----PLHSERNYLVRERIATAIFYLSDVPQGGATLFTELNVTVFPQAGTVLFWY 467
Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
NL SGD D TRH CPV+ GS
Sbjct: 468 NLAHSGDHDMRTRHTGCPVIVGS 490
>gi|195159299|ref|XP_002020519.1| GL13471 [Drosophila persimilis]
gi|194117288|gb|EDW39331.1| GL13471 [Drosophila persimilis]
Length = 238
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 80/203 (39%), Positives = 109/203 (53%), Gaps = 6/203 (2%)
Query: 65 QLKCRYV-HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
L C Y+ R P+L L ++ E P I LY DV+ S++ ++ ++P L AT
Sbjct: 31 HLHCFYLTKRGSPFLLLARVRTEILSDDPFIALYYDVLTHSDMVSLRNTSEPLLHPATTI 90
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
Y E++N R + WL R R + +TGL S +E QV NYGIGG +
Sbjct: 91 QYFNAPQELSNSRTAHFVWLEPTITEATRRADRVLWDVTGLNLSNSEMFQVNNYGIGGSF 150
Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
H D E N R+AT +FY+SDV QGGAT+FT LN++++P+ GT FW+
Sbjct: 151 MRHSDLLH-SERNYL----VRERIATAIFYLSDVPQGGATLFTELNVTVFPQAGTVLFWY 205
Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
NL SGD D TRH CPV+ GS
Sbjct: 206 NLAHSGDHDMRTRHTGCPVIVGS 228
>gi|195159303|ref|XP_002020521.1| GL13468 [Drosophila persimilis]
gi|194117290|gb|EDW39333.1| GL13468 [Drosophila persimilis]
Length = 415
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 83/222 (37%), Positives = 116/222 (52%), Gaps = 27/222 (12%)
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
++ CRG P + +L C Y P+LRL P K E L P ++LY DV+ E
Sbjct: 196 QLCCRGG--CPYRDMHRLTCSYNTTAAPFLRLAPFKTELLSLSPYMVLYHDVITPLESLT 253
Query: 109 IKKMAQPRLRR---ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
+K +++P ++R V N K I + R S S WL E+ V+ER+ RRV MT
Sbjct: 254 LKNLSKPLMKRRAMVMVNNLKVRPF-IDSGRTSNSVWLTSHENAVMERLERRVGVMTNFE 312
Query: 166 TSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
+E Q++NYGIGGHY+PH D F P +SDV QGGAT+
Sbjct: 313 MENSEVYQLINYGIGGHYKPHTDHFETPQ--------------------LSDVPQGGATL 352
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F LN+S+ P +G A W+NL+ G G+ T H +CP++ GS
Sbjct: 353 FPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGS 394
>gi|195069738|ref|XP_001997014.1| GH23597 [Drosophila grimshawi]
gi|193892024|gb|EDV90890.1| GH23597 [Drosophila grimshawi]
Length = 239
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 84/233 (36%), Positives = 128/233 (54%), Gaps = 22/233 (9%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
EKY LC P+ +L C Y + P+LRL PLK E + P ++++ + +YDSE
Sbjct: 6 EKYTRLCGASHKPKPS---RLICNYKMDSSPFLRLAPLKMEMLSMDPYVVVFHEAIYDSE 62
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE-----PEHPVIERISRRVEH 160
ID ++++ + RL R + K G+ + + R S W+ E + ++ERI RRV
Sbjct: 63 IDELRRLCESRLSRTEIA--KQGKNK--SIRSSSGVWIFELDLNRQQLELLERIRRRVAD 118
Query: 161 MTGLTTS-TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
M+GL ++E+Q + Y GGHY PH+DF +R+ATVLFY++DVA+
Sbjct: 119 MSGLLIDFNSQEVQYMEYVFGGHYYPHWDFKGIPHLE--------DRIATVLFYLNDVAR 170
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSNSLHS 271
GGAT+F L L + PE+G WHN+ + D + + H ACPV+ G + S
Sbjct: 171 GGATIFPDLELLVQPERGKVLHWHNMDLGTYDLEKRSLHGACPVIMGKKEVIS 223
>gi|195338688|ref|XP_002035956.1| GM16188 [Drosophila sechellia]
gi|194129836|gb|EDW51879.1| GM16188 [Drosophila sechellia]
Length = 392
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 79/219 (36%), Positives = 118/219 (53%), Gaps = 11/219 (5%)
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
E C+G PP QL CRY P++R+ PLKEEE P I LY DV+YDSEI
Sbjct: 163 EQGCQGKF--PPG--PQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHDVIYDSEITQ 218
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
+ + + + T NY T + + I + + + + + R+ ++GL
Sbjct: 219 LTNLTREEMILGTTTNYTTPDRVNRLFHIKVT---NDDGGKLDKTLVNRMADISGLDMGN 275
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATVLFYMSDVAQGGATVFTS 227
L +NYG+GG+++ H D+ + L G+R+ T LFYM+DV GG T+F
Sbjct: 276 TTTLARINYGLGGYFQEHSDYM---DIKLHPELTEEGDRLMTFLFYMTDVLVGGGTIFPG 332
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
L++ P+KG+A FW+NLH++GD + TRHA CP + GS
Sbjct: 333 AQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGS 371
>gi|341878860|gb|EGT34795.1| hypothetical protein CAEBREN_10065 [Caenorhabditis brenneri]
Length = 163
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 71/128 (55%), Positives = 88/128 (68%), Gaps = 10/128 (7%)
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
MT L TAEELQ+ NYGIGGHY+PH+D A+ E+ +F+SLGTGNR+ATVLFYMS + G
Sbjct: 1 MTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHG 60
Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHSTC-- 273
G TVFT + ++ P K A FW+NL+ GDG+ TRHAACPVL G SN +H
Sbjct: 61 GGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNE 120
Query: 274 ---PCGLR 278
PCGL+
Sbjct: 121 FRRPCGLK 128
>gi|198466399|ref|XP_002135181.1| GA23909 [Drosophila pseudoobscura pseudoobscura]
gi|198150582|gb|EDY73808.1| GA23909 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 125/234 (53%), Gaps = 24/234 (10%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
L CRY P+LRL PLK EE P I++Y V+ D E++ +K++A+P + N
Sbjct: 306 LVCRYNSTTTPFLRLAPLKMEEVNHDPYIVMYHQVLSDREMEEMKQLARP------MTNG 359
Query: 126 KTGELEIANYR-----ISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
+G E+AN +++ AWL E P ER++ R+ MTG S + LQ+ N+G+G
Sbjct: 360 MSGS-EMANLTEPLEIVARVAWLIEAS-PFRERLNLRIGDMTGFDVSDFKALQLANFGVG 417
Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
+++ HYD+ R N G+R +++FY S+V QGGAT+F + +++ P+KG +
Sbjct: 418 SYFKAHYDY-RTERVNDLGVTELGDRTGSIIFYASEVPQGGATIFPDIQVTVTPQKGNSL 476
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGS-----NSLHS-----TCPCGLRRGLQRS 284
FW N D + HA CPV+ GS LH PC R G ++S
Sbjct: 477 FWFNTFDDSTPDPRSLHAICPVIAGSRWTITKWLHQWPQMFLKPCSPRAGERKS 530
>gi|195379218|ref|XP_002048377.1| GJ13934 [Drosophila virilis]
gi|194155535|gb|EDW70719.1| GJ13934 [Drosophila virilis]
Length = 469
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 84/222 (37%), Positives = 116/222 (52%), Gaps = 29/222 (13%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG P L CRYV ++ YLRL PLK E LQP I LY DV++DSEI+ +K
Sbjct: 263 CRG--LWPKRQTLPLTCRYVQQHSAYLRLAPLKMEILSLQPLIQLYHDVLHDSEIEAVKN 320
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ RA +N + K LR+ H + + RR+ M+GL +
Sbjct: 321 VTN---HRAMAENLAS---------TVKLITLRDAPH--TQNMHRRITDMSGLDMAQNNT 366
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
L ++N+G+GG+ GNR+ATV+FY SDV GGAT+F L L
Sbjct: 367 LHLLNFGLGGYLGKQLKL-------------QGNRIATVIFYASDVQLGGATIFPRLQLV 413
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC 273
+ P++G+A W+NL+++G D TRHA CPV+ GS S C
Sbjct: 414 VKPKRGSALLWYNLNAAGKPDPLTRHAVCPVVVGSRWAISKC 455
>gi|194905313|ref|XP_001981171.1| GG11766 [Drosophila erecta]
gi|190655809|gb|EDV53041.1| GG11766 [Drosophila erecta]
Length = 496
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 77/223 (34%), Positives = 122/223 (54%), Gaps = 9/223 (4%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
E ++ LCR + P+ +L CRY P+L L PLK E+ L+P I++Y D++ + +
Sbjct: 256 EDFKRLCRSSFSPKPS---KLHCRYNSTTSPFLILAPLKMEQISLEPYIVVYHDILPEGD 312
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
I + +A+PRLR + + K + PV++R+++R+ +TGL
Sbjct: 313 IHQLIALAEPRLRATLAFTEDKSDSVFGAFLPFKD--MNSSGEPVLDRLTQRMRDITGLQ 370
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
+ ++ YG G HY +DF E N+ ++ G G+R+ATV+FY++D GGATVF
Sbjct: 371 IHQRNRINIIKYGFGAHYAARHDFF--NETNS-ETEGYGDRMATVMFYLNDAPNGGATVF 427
Query: 226 TSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSN 267
+N+ + E+G FW+NL + D D T HAACPV GS
Sbjct: 428 PRINVKVPAERGKVLFWYNLDGETHDVDPKTVHAACPVFHGSK 470
>gi|25012370|gb|AAN71294.1| RE09701p [Drosophila melanogaster]
Length = 301
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 81/223 (36%), Positives = 121/223 (54%), Gaps = 7/223 (3%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI- 106
Y LC+G L+C + Y L PL+ E +L P I +Y ++ +I
Sbjct: 55 YTRLCQGRRLPEERSGDPLRCYLDGKRHAYFTLAPLQVELVHLDPDINVYHGMLSSKQIL 114
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
+ ++ + + R+ V GE + + R+S+ WL + + PV+ + R ++ ++G
Sbjct: 115 SIFEEADKEEMVRSAVAG-SGGEGTVRDLRVSQQTWL-DYKSPVMNSVGRIIQFVSGFDM 172
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+ AE +QV NYG+GG YEPH D+ E N K+ G+R++T +FY+SDV QGG TVFT
Sbjct: 173 AGAEHMQVANYGVGGQYEPHPDYF---EVNLPKNF-EGDRISTSMFYLSDVEQGGYTVFT 228
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
LN+ L P KG WHNLH S D T HA CPV+ GS +
Sbjct: 229 KLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSKRI 271
>gi|195159305|ref|XP_002020522.1| GL13469 [Drosophila persimilis]
gi|194117291|gb|EDW39334.1| GL13469 [Drosophila persimilis]
Length = 253
Score = 140 bits (353), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 81/203 (39%), Positives = 107/203 (52%), Gaps = 23/203 (11%)
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
++L C Y +LRL PLK E L P ++LY DV+ D E+ L+K MAQ L RA
Sbjct: 32 SRLYCLYNTTATAFLRLAPLKMELLSLDPYVVLYHDVLADREMSLLKLMAQRDLVRAVTY 91
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
N + R +K+ WL +P H +I R+ E M+ L +E+ QV+NYGIGGHY
Sbjct: 92 NATEKKHSEDPNRTTKAGWL-DPSHNLIRRMGILTEDMSNLDLERSEDFQVLNYGIGGHY 150
Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
H DF F +SDV GGATVF L+LS++P+KG W+
Sbjct: 151 AVHPDF----------------------FELSDVPLGGATVFPLLDLSVFPKKGAVLMWY 188
Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
NL G G T H+ACPV+ GS
Sbjct: 189 NLDHKGQGMEKTIHSACPVVVGS 211
>gi|195452736|ref|XP_002073477.1| GK14138 [Drosophila willistoni]
gi|194169562|gb|EDW84463.1| GK14138 [Drosophila willistoni]
Length = 518
Score = 140 bits (352), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 76/215 (35%), Positives = 112/215 (52%), Gaps = 21/215 (9%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C G V + QL C Y ++ +LR+ P+K E L P I+LY D + E +K
Sbjct: 303 CNGKCQVSKEL--QLYCLYNTKDSYFLRIAPVKMEVLSLNPYIVLYHDFILPREQGSLKA 360
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ L A TGE + + R +K+ W + VI RIS+R+E +T L E
Sbjct: 361 QSIKYLSVAETIYPDTGEWQADSSRTAKAMWFEDSSAEVISRISQRIEDITNLNPEKGEL 420
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
Q++NYGIGG YE HYD+ E + DV QGGAT+ +++LS
Sbjct: 421 YQIINYGIGGLYETHYDYLYENE-------------------LQDVPQGGATLLNNISLS 461
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
++P+ G A FW+NL+++GD ++ H ACPV+ GS
Sbjct: 462 VFPKAGAALFWYNLNNAGDTEWNVAHTACPVIVGS 496
>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
Length = 300
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 129/228 (56%), Gaps = 14/228 (6%)
Query: 46 EKYEMLCRGDLTVPPAIVAQ-LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
+K LC G+ +P LKC Y + + R MP EE P IILY ++ ++
Sbjct: 58 QKIRELCIGNENLPAKSSGHHLKCYYFYPSSK-TRFMPYAIEEMSRDPLIILYHNLTSNA 116
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANY----RISKSAWLREPEHPVIERISRRVEH 160
E++ +K +A +L+ A V Y T + N RI+K A++ + E V I++R++
Sbjct: 117 EMESLKALAAKQLQPAGV--YHTTSADNRNLEGYTRIAKMAFILDEESAVASAITQRLQD 174
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVA 218
+TGL + +E LQV+NYGI G Y PHYD A+ G+ +S + +R+AT + Y+SDV
Sbjct: 175 VTGLNMNFSEPLQVINYGIAGQYTPHYDTFPAKSGD----RSHPSHDRLATAILYLSDVE 230
Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+GGATVFT++N+ + P KG W+N G+ T HA CPVL GS
Sbjct: 231 RGGATVFTNINVRVLPRKGNVIIWYNYLPDGNLHPGTLHAGCPVLVGS 278
>gi|21358309|ref|NP_651801.1| prolyl-4-hydroxylase-alpha SG2 [Drosophila melanogaster]
gi|20269808|gb|AAM18059.1|AF495537_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG2
[Drosophila melanogaster]
gi|10726875|gb|AAG22175.1| prolyl-4-hydroxylase-alpha SG2 [Drosophila melanogaster]
Length = 527
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 81/223 (36%), Positives = 121/223 (54%), Gaps = 7/223 (3%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI- 106
Y LC+G L+C + Y L PL+ E +L P I +Y ++ +I
Sbjct: 281 YTRLCQGRRLPEERSGDPLRCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSSKQIL 340
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
+ ++ + + R+ V GE + + R+S+ WL + + PV+ + R ++ ++G
Sbjct: 341 SIFEEADKEEMVRSAVAG-SGGEGTVRDLRVSQQTWL-DYKSPVMNSVGRIIQFVSGFDM 398
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+ AE +QV NYG+GG YEPH D+ E N K+ G+R++T +FY+SDV QGG TVFT
Sbjct: 399 AGAEHMQVANYGVGGQYEPHPDYF---EVNLPKNF-EGDRISTSMFYLSDVEQGGYTVFT 454
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
LN+ L P KG WHNLH S D T HA CPV+ GS +
Sbjct: 455 KLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSKRI 497
>gi|198449520|ref|XP_002136916.1| GA26928 [Drosophila pseudoobscura pseudoobscura]
gi|198130644|gb|EDY67474.1| GA26928 [Drosophila pseudoobscura pseudoobscura]
Length = 532
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/231 (34%), Positives = 127/231 (54%), Gaps = 14/231 (6%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
++ +CR P+ +L CRY P+LRL PL+ EE L P I++Y +V+ D+EI
Sbjct: 281 EFIQICRSSHQNKPS---RLHCRYNATTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEI 337
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA-----WLREPEHPVIERISRRVEHM 161
++++ +P L+R + + + R + ++ PVIER+ R + M
Sbjct: 338 AEVERVIEPLLQRIGRYDETPNSMSPSKRRTGFTGPHIDDYMHVSGAPVIERVHRHIRDM 397
Query: 162 TGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
TGL + E L +V YG+GGH + HYDF A+ + G+R+ATVLFY++DV GG
Sbjct: 398 TGLFMN--EHLMMVKYGLGGHCDQHYDFLN---ASYPSTHAMGDRMATVLFYLNDVKHGG 452
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSNSLHS 271
+T FT L L + E+G FW+N+ + + D T H +CPV+ G+ + S
Sbjct: 453 STAFTDLQLKVPSERGKVLFWYNMRGETHNLDRRTVHGSCPVIDGTKKILS 503
>gi|195577074|ref|XP_002078398.1| GD23422 [Drosophila simulans]
gi|194190407|gb|EDX03983.1| GD23422 [Drosophila simulans]
Length = 513
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 119/221 (53%), Gaps = 15/221 (6%)
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
E C+G PP QL CRY P++R+ PLKEEE P I LY +V+YDSEI
Sbjct: 284 EQGCQGKF--PPG--PQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHNVIYDSEIAQ 339
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH---PVIERISRRVEHMTGLT 165
+ + + + T NY T + R+ + ++ + + + + R+ ++GL
Sbjct: 340 LTNLTREEMILGTTTNYTTPD------RVDRLFHIKVTDDDGGKLDKTLVNRMADISGLD 393
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
L +NYG+GG+++ H D+ G+R+ T LFYM+D+ GGAT+F
Sbjct: 394 VGNTTTLARINYGLGGYFQEHSDYMDIKLHPELTE--EGDRLMTFLFYMTDIPVGGATIF 451
Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
L++ P+KG+A FW+NLH++GD + TRHA CP + GS
Sbjct: 452 PGAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGS 492
>gi|195159309|ref|XP_002020524.1| GL13466 [Drosophila persimilis]
gi|194117293|gb|EDW39336.1| GL13466 [Drosophila persimilis]
Length = 643
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 79/203 (38%), Positives = 108/203 (53%), Gaps = 6/203 (2%)
Query: 65 QLKCRYV-HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
L C Y+ R P+L L ++ E P I LY DV+ S++ ++ ++P L AT
Sbjct: 416 HLHCFYLTKRGSPFLLLARVRTEILSDDPFIALYYDVLTHSDMVSLRNTSEPLLHPATTI 475
Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
Y E++N R + WL R R + +TGL S +E QV NYGIGG +
Sbjct: 476 QYFNAPQELSNSRTAHFVWLEPTITEATRRADRVLWDVTGLNLSNSEMFQVNNYGIGGSF 535
Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
H D E N R+AT +FY+SDV GGAT+FT LN++++P+ GT FW+
Sbjct: 536 MRHSDLLH-SERNYL----VRERIATAIFYLSDVPHGGATLFTELNVTVFPQAGTVLFWY 590
Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
NL SGD D TRH CPV+ GS
Sbjct: 591 NLAHSGDHDMRTRHTGCPVIVGS 613
>gi|241029040|ref|XP_002406378.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215491954|gb|EEC01595.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 539
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/225 (37%), Positives = 121/225 (53%), Gaps = 20/225 (8%)
Query: 39 TLEVT----EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRI 94
T EVT E + Y+ LCRG L P + +QL+CRY + L P+K EE L+P I
Sbjct: 271 TQEVTPDDQEDQSYKRLCRGKLLRSPKMESQLRCRYYKGQDGFFALQPIKLEEMNLKPYI 330
Query: 95 ILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERI 154
I+ DV+ D +I + A+PR+R+ L I + SAWL E E P+ R+
Sbjct: 331 IVMHDVLQDKDIKELMAFAEPRVRKT------LPYLFICHIHTFYSAWLNEDEAPIAVRM 384
Query: 155 SRRVEHMTGLTTST----AEELQVVNYGIGGHYEPHYDFARP------GEANAFKSLGTG 204
+ + + G+ TS AE Q+ NYG GG + PH+DF + A+ + GTG
Sbjct: 385 NSYLRALLGMGTSDTDEEAEAYQLANYGTGGQFLPHHDFLQDSFHSYNSSADYYLQYGTG 444
Query: 205 NRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSG 249
+RVAT++ Y++DV +GGATVF +L L L P+K F +S G
Sbjct: 445 DRVATLMIYLTDVEEGGATVFPTLGLRLTPKKVNLFFISLRNSDG 489
>gi|195166677|ref|XP_002024161.1| GL22880 [Drosophila persimilis]
gi|194107516|gb|EDW29559.1| GL22880 [Drosophila persimilis]
Length = 507
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 121/224 (54%), Gaps = 18/224 (8%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
+E CRG +V CRY P+LRL PLK EE P I++Y V+ D E++
Sbjct: 235 HESGCRGLFPKRTNLV----CRYNSTTTPFLRLAPLKMEEVNHDPYIVMYHQVLSDREME 290
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYR-----ISKSAWLREPEHPVIERISRRVEHMT 162
+K++A+P + N +G E+AN +++ AWL E P ER++ R+ MT
Sbjct: 291 EMKQLARP------MTNGMSGS-EMANLTEPLEIVARVAWLIEAS-PFRERLNLRIGDMT 342
Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
G S + LQ+ N+G+G +++ HYD+ R N G+R +++FY S+V QGG
Sbjct: 343 GFDVSDFKALQLANFGVGSYFKAHYDY-RTERVNDLGVTELGDRTGSIIFYASEVPQGGT 401
Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
T+F + +++ P+KG + FW N D + HA CPV+ GS
Sbjct: 402 TIFPDIQVTVTPQKGNSLFWFNTFDDSTPDPRSLHAICPVIAGS 445
>gi|195069797|ref|XP_001997029.1| GH12978 [Drosophila grimshawi]
gi|193891498|gb|EDV90364.1| GH12978 [Drosophila grimshawi]
Length = 518
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 81/222 (36%), Positives = 121/222 (54%), Gaps = 12/222 (5%)
Query: 48 YEMLCRGDLTVPPAIVAQL--KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
Y LC+G +P Q +C Y +L PLK E+ L P I +Y DV+ D++
Sbjct: 281 YVRLCQGK-RLPEIKTNQSSPRCYLDSNQHAYFKLSPLKVEQVNLAPDINIYYDVLNDNQ 339
Query: 106 IDLIKKMA-QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
I I +++ + R++V Y + + R+S+ WL P++ + V ++G
Sbjct: 340 IKSILELSTEFESFRSSVNKYN-----VTDKRVSQQVWLNYSS-PIMRTYRQLVGAISGF 393
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
+ AE +QV NYGIGG YEPH+DF+ A + + G +R++T + Y+SDV QGG TV
Sbjct: 394 NMTNAEIMQVANYGIGGQYEPHHDFSGANLAARYANFG--DRISTNMIYLSDVQQGGYTV 451
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F + N+ + P KG WHNL S DGD T HA CPV+ G+
Sbjct: 452 FPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIEGT 493
>gi|195110921|ref|XP_002000028.1| GI24861 [Drosophila mojavensis]
gi|193916622|gb|EDW15489.1| GI24861 [Drosophila mojavensis]
Length = 508
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 84/247 (34%), Positives = 127/247 (51%), Gaps = 18/247 (7%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
+ LC+G P L C P RL PLK E+A+L P I +Y DV+ D +I+
Sbjct: 276 FSRLCQGKRLPEPG---SLSCYLDFERHPRFRLSPLKVEQAHLNPDIHIYYDVLTDPQIE 332
Query: 108 LIKKMAQPRLRRATVQNYKTGELE--IANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
+ +A + ++++++ L + R+S+ WL P++ + + ++GL
Sbjct: 333 SVLDLA------SQLESFRSKVLGDVVTETRVSQQVWLNYTS-PIMRTVGNLLGAISGLD 385
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
+ EE+QV NYGIGG Y PH+D+ + + GNR+ T +FY+SDV QGG TVF
Sbjct: 386 MTNVEEMQVANYGIGGQYFPHFDYISELREDYIER---GNRITTNMFYLSDVLQGGYTVF 442
Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQRSG 285
LN+ L P KG+ W N+H S D HA CPVL GS + + ++ +R
Sbjct: 443 PFLNVFLRPVKGSLVIWPNVHRSLAPDSRVLHAGCPVLEGSKRIGNIWIHSAQQEFRRP- 501
Query: 286 IICTLVG 292
CTLV
Sbjct: 502 --CTLVS 506
>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
Length = 492
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 94/273 (34%), Positives = 131/273 (47%), Gaps = 18/273 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVN-----NVAP--TLEVTERE--KYEMLCRG 54
P + R N YY L+K+ V N P L ERE K+ LC+G
Sbjct: 208 PDNGRVFKNVEYYTHQLHKASNGTSASGSVRVARKANYRPDNVLGRDERELLKFNKLCQG 267
Query: 55 DLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYL-QPRIILYRDVMYDSEIDLIKKMA 113
P+ L CR H N P+L L P++ E + R+ ++R+ E +++
Sbjct: 268 RKIYKPS--KPLSCRLQHFNKPHLFLKPIRVEYVHEGNNRLQIFRNFASAQECAHLREEG 325
Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
+ +L RA + G +RIS +AWL+ V+ + R+ T L AE LQ
Sbjct: 326 RKKLSRAVA--WTDGAFRPVEFRISTAAWLQPDHDDVVTNLHTRIADATQLDLEFAEALQ 383
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
V NYGIGG YE HYD A+ + L G+R+AT + Y++ V QGG T F L ++
Sbjct: 384 VSNYGIGGFYETHYDH----HASRERELPEGDRIATFMIYLNQVEQGGYTAFPRLGAAVE 439
Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
P G A FW+NL G+ D T H ACPVL GS
Sbjct: 440 PGHGDAVFWYNLLPDGESDNNTLHGACPVLQGS 472
>gi|194905381|ref|XP_001981186.1| GG11928 [Drosophila erecta]
gi|190655824|gb|EDV53056.1| GG11928 [Drosophila erecta]
Length = 543
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 72/207 (34%), Positives = 115/207 (55%), Gaps = 5/207 (2%)
Query: 63 VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
+ +L C Y P+L+L P+K E L P ++L D++ E LI+ ++ L ++ +
Sbjct: 304 LTRLYCVYNRVTSPFLQLAPIKTEILSLDPFVLLLHDMVRQKESTLIRASSKEHLLQSEI 363
Query: 123 QN--YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
N + E +A +R SKS W + ++I+ R+ TGL E QV+NYG+G
Sbjct: 364 TNTDASSSEDNVAIFRTSKSVWYSSDFNDTTKKITERLADATGLDMHFTEYFQVINYGLG 423
Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
G + H D ++ + GT +R+AT +FY++ V QGGAT F LNL+++P+ G+A
Sbjct: 424 GFFATHLDMLL---SDKTRFNGTSDRIATTVFYLNGVRQGGATHFPLLNLTVFPQPGSAL 480
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGSN 267
FW+NL + G+ T H CPV+ GS
Sbjct: 481 FWYNLDTKGNDQRSTMHTGCPVIVGSK 507
>gi|194905424|ref|XP_001981193.1| GG11755 [Drosophila erecta]
gi|190655831|gb|EDV53063.1| GG11755 [Drosophila erecta]
Length = 527
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 90/267 (33%), Positives = 132/267 (49%), Gaps = 20/267 (7%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
PTH Q K AL K + + P + N Y MLC+G
Sbjct: 250 PTHSAQQTRKYLESRALGKIDQ-ETNPTWLAN------------YTMLCQGRRLPEERSA 296
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI-DLIKKMAQPRLRRATV 122
LKC + Y L PL+ E +L P I +Y ++ ++I ++ + + ++ R+ V
Sbjct: 297 DPLKCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSANQILSILDEAEKMQMFRSAV 356
Query: 123 QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGH 182
G + + R+S+ WL + + V++ + R E ++G + AE +QV NYG+GG
Sbjct: 357 SG-NGGNSTVKDLRVSQQTWL-DYKSAVMKSVGRINELVSGFDMAGAEYMQVANYGVGGQ 414
Query: 183 YEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
YEPH D+ FK G+R++T +FY+SDV QGG TVF LN+ L P G W
Sbjct: 415 YEPHPDYFGVNLPVEFK----GDRISTSMFYLSDVEQGGYTVFPKLNVFLPPVSGALVMW 470
Query: 243 HNLHSSGDGDYYTRHAACPVLTGSNSL 269
HNLH S D D T HA CPV+ GS +
Sbjct: 471 HNLHRSLDVDARTLHAGCPVIVGSKRI 497
>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
Length = 539
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 73/207 (35%), Positives = 113/207 (54%), Gaps = 3/207 (1%)
Query: 60 PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
P + +L C Y +LRL P+K E + P ++L D++ E LI+ ++ +
Sbjct: 299 PRKLKRLYCVYNCATAAFLRLAPIKTEILSIDPFVVLLHDMVSPKEAALIRSSSKSTIFP 358
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
+ N + ++ +R SKS WL + ++++R+ TGL +E QV+NYGI
Sbjct: 359 SETVN-AANDFVVSKFRTSKSVWLDRDANEATVKLTQRLADATGLDVKHSEHFQVINYGI 417
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
GG +E H+D + N F G +R+AT LFY++DV QGGAT F LN++++P G A
Sbjct: 418 GGVFESHFDTTLE-DTNRFVG-GFIDRIATTLFYLNDVPQGGATHFPGLNITVFPRLGAA 475
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
FW+NL + G T H CPV+ GS
Sbjct: 476 LFWYNLDTQGMLQVRTMHTGCPVIVGS 502
>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 591
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 84/239 (35%), Positives = 131/239 (54%), Gaps = 26/239 (10%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
YE LCR + + A+L+C +PY + KEE +PRI ++ DV+ + I+
Sbjct: 327 YEALCREEQKSLQEL-AKLRCFLRETVIPYYKA---KEEVVNYEPRIAIFHDVISPTSIE 382
Query: 108 LIKKMAQPRLRRATVQNYKTGEL------EIANYRISKSAWLREPEHPVIERISRRVEHM 161
+K +A R+TV TG ++ N R+S+++WL E+P + R+ R++
Sbjct: 383 HLKSVASKGFTRSTVFLENTGPDGHVTYGKLDNVRVSQTSWLGTDEYPELSRLENRIKLT 442
Query: 162 TGLTT------STAEELQVVNYGIGGHYEPHYDF--------ARPGEANAFKSLGTGNRV 207
TGL+ S +E+ QV+NYG+GG Y HYD+ + P +++ ++ +G R+
Sbjct: 443 TGLSAEYKSVRSHSEKFQVLNYGVGGMYTVHYDYTGYMLGIPSNPLDSDDIRT--SGERM 500
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
AT +FY++DV GGATVF + + KG AAFW+N+ SG D T H CPVL GS
Sbjct: 501 ATWMFYLNDVKAGGATVFPEVKTRIPVAKGGAAFWYNVRPSGATDPRTLHGGCPVLVGS 559
>gi|195575103|ref|XP_002105519.1| GD17002 [Drosophila simulans]
gi|194201446|gb|EDX15022.1| GD17002 [Drosophila simulans]
Length = 793
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 80/215 (37%), Positives = 115/215 (53%), Gaps = 14/215 (6%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREK-YEMLCRGDLTVPPAI 62
P H+ A NK+ Y+ L + P+ P E E K Y +CRG+L P
Sbjct: 215 PDHEDALKNKILYEGQLARERSF---VPREQAELPQKEQKESYKLYTQVCRGELHQSPRD 271
Query: 63 VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
L+C H+ VPY L P K E+ + P + +V++DSEID I + + + R+ V
Sbjct: 272 QRNLRCWLSHQGVPYYHLSPFKIEQLNIDPYVAYVHEVLWDSEIDTIMEHGKGNMERSKV 331
Query: 123 ---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
+N T E+ RIS++ WL +P + +I +R+E +TGL+T +AE LQ+VNYGI
Sbjct: 332 GQIENSTTTEV-----RISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGI 386
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYM 214
GG YEPH+DF N F GNR+ T LFY+
Sbjct: 387 GGQYEPHFDFVEDDGQNVFS--WKGNRLLTALFYL 419
>gi|47204411|emb|CAF95476.1| unnamed protein product [Tetraodon nigroviridis]
Length = 284
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/153 (49%), Positives = 94/153 (61%), Gaps = 7/153 (4%)
Query: 116 RLRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEEL 172
+LRR+ V T + ++ A YRISKSAWL+ + R+ +R+ +TGL E L
Sbjct: 110 KLRRSVV---ATRDKQVTAEYRISKSAWLKGSAQSAVSRLDQRISMLTGLNVQHPHGEYL 166
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QVVNYGIGGHYEPH+D A + FK L TGNRVATV+ Y+S V GG+T F N S+
Sbjct: 167 QVVNYGIGGHYEPHFDHATSPSSPVFK-LKTGNRVATVMIYLSSVEAGGSTAFIYANFSV 225
Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
K A FW NLH +G GD T HA CPVL G
Sbjct: 226 PVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLIG 258
>gi|195069795|ref|XP_001997028.1| GH12977 [Drosophila grimshawi]
gi|193891497|gb|EDV90363.1| GH12977 [Drosophila grimshawi]
Length = 517
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 82/222 (36%), Positives = 120/222 (54%), Gaps = 12/222 (5%)
Query: 48 YEMLCRGDLTVPPAIVAQL--KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
Y LC+G +P Q +C Y +L PLK E+ L P I +Y DV+ D++
Sbjct: 280 YVRLCQGK-RLPEIKTNQSSPRCYLDSNQHAYFKLSPLKVEQVNLAPDINIYYDVLNDNQ 338
Query: 106 IDLIKKMA-QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
I I +++ + R++V Y + + R+S+ WL P++ + V ++G
Sbjct: 339 IKSILELSTEFDSFRSSVNKYN-----VTDKRVSQQVWLNYSS-PIMRTYRQLVGAISGF 392
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
+ AE +QV NYGIGG YEPH+DF G S+ G+R++T + Y+SDV QGG TV
Sbjct: 393 NMTNAETMQVANYGIGGQYEPHHDFF--GINLPANSVKRGDRISTNMIYLSDVQQGGYTV 450
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F + N+ + P KG WHNL S DGD T HA CPV+ G+
Sbjct: 451 FPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIEGT 492
>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
Length = 193
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 65/136 (47%), Positives = 90/136 (66%), Gaps = 5/136 (3%)
Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPH---YDFARP 192
R +K WL++ + + +RI+RR+ MTG + +E QV+NYGIGGHY H +DFA
Sbjct: 28 RTAKGFWLKKESNELTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASS 87
Query: 193 GEANAFK--SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGD 250
+ S+ G+R+ATVLFY++DV QGGATVF + + P+ GTA FW+NL + G+
Sbjct: 88 NHTDTRSRYSIDLGDRIATVLFYLTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGN 147
Query: 251 GDYYTRHAACPVLTGS 266
GD TRHAACPV+ GS
Sbjct: 148 GDPRTRHAACPVIVGS 163
>gi|195391756|ref|XP_002054526.1| GJ24503 [Drosophila virilis]
gi|194152612|gb|EDW68046.1| GJ24503 [Drosophila virilis]
Length = 519
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 84/246 (34%), Positives = 122/246 (49%), Gaps = 12/246 (4%)
Query: 46 EKYEMLCRGD-LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
+ Y LC+G L+ P + L C RL PLK E+ L P I +Y D++ D
Sbjct: 281 DNYTQLCQGKRLSEPKPNGSALNCYLDFTRHARFRLAPLKVEQVRLNPDIHIYYDLINDD 340
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
+ID I ++ + + ++R+S+ WL P++ +S V ++G
Sbjct: 341 QIDDIYEVVD----QFDSFRSSVSSSIVTDWRVSQQVWLNYSS-PILRSVSNLVGAISGF 395
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
AE++QV NYGIGG Y PH D+ + + GNR+AT +FY+SDV GG TV
Sbjct: 396 DMENAEQMQVANYGIGGQYAPHTDYLSKIPDS---YIPRGNRIATNMFYLSDVLNGGYTV 452
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQRS 284
F LN+ L P KG W+NLH S + D T HA CPV+ G + + R+ +R
Sbjct: 453 FPKLNVFLKPVKGAMVSWYNLHRSLNKDSRTLHAGCPVIEGVKRIGNIWIHSTRQEFRRP 512
Query: 285 GIICTL 290
CTL
Sbjct: 513 ---CTL 515
>gi|449284064|gb|EMC90646.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Columba livia]
Length = 174
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/157 (45%), Positives = 97/157 (61%), Gaps = 13/157 (8%)
Query: 133 ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYGIGGHYEPHYDFA 190
A YRISKSAWL++ HPV++ + +R+ +TGL AE LQVVNYG+GGHYEPH+D A
Sbjct: 15 AEYRISKSAWLKDTAHPVVQTLEKRMAAVTGLDLRPPYAEYLQVVNYGLGGHYEPHFDHA 74
Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGD 250
++ ++ + +GNR+AT++ Y+S V GG+T F NLS+ K A FW NL +GD
Sbjct: 75 TSRKSPLYR-MKSGNRIATLMIYLSAVGAGGSTAFVHANLSVPVVKNAALFWWNLRRNGD 133
Query: 251 GDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
GD T HA CPVL G + + PCG+
Sbjct: 134 GDGDTLHAGCPVLAGDKWVANKWIHEHGQEFRRPCGI 170
>gi|195452738|ref|XP_002073478.1| GK14139 [Drosophila willistoni]
gi|194169563|gb|EDW84464.1| GK14139 [Drosophila willistoni]
Length = 215
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 110/202 (54%), Gaps = 19/202 (9%)
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
+L C Y ++ +LR+ P+K E L P I+LY D + SE + +K + RL A +
Sbjct: 11 KLYCLYNTKDSYFLRIAPVKMEVLSLDPYIVLYHDFILSSEQEFLKAESIERLSVAETVD 70
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
TG+ R +K+ W + VI RI++R+E +T L + Q+++YGIGG ++
Sbjct: 71 PDTGKWYADASRTAKAMWFYDTSSVVIRRINQRIEEITNLDPEKGDLYQIISYGIGGLFQ 130
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
HYD+ E + DV QGGAT+F +++LS++P+ G A FW+N
Sbjct: 131 THYDYLHENE-------------------LQDVPQGGATLFNNISLSVFPKAGAALFWYN 171
Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
L+++GD ++ H CPV+ GS
Sbjct: 172 LNNAGDTEWNVAHTGCPVIVGS 193
>gi|195505241|ref|XP_002099419.1| GE10893 [Drosophila yakuba]
gi|194185520|gb|EDW99131.1| GE10893 [Drosophila yakuba]
Length = 508
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 76/227 (33%), Positives = 123/227 (54%), Gaps = 13/227 (5%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
E Y+ LCR + P+ +L CRY P+L L P K EE L+P I++Y D++ D +
Sbjct: 262 EDYKRLCRSSFSPRPS---KLLCRYNSDTSPFLILAPFKMEEISLEPYIVVYHDILPDKD 318
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH-----PVIERISRRVEH 160
+ + +A+PRLR V E ++ R + +L + P+++R+++R+
Sbjct: 319 MQQLIALAEPRLRPTEVFEEDKSEARTSD-RSALGTFLPFKDMNPSGGPLLDRLTQRMRD 377
Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
+TG+ ++ YG G Y ++DF + + G G+R+ATVLFY++D G
Sbjct: 378 ITGIQIRHENTFNIIKYGFGSQYATNFDFFNGTNS---EMEGYGDRMATVLFYLNDAPNG 434
Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
GATVF +++ + E+G FWHNL+ + D + T HAACPV GS
Sbjct: 435 GATVFPRIDVKVTAERGKVLFWHNLNGETHDVEPNTLHAACPVFQGS 481
>gi|241598365|ref|XP_002404734.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215500465|gb|EEC09959.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 524
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 73/203 (35%), Positives = 118/203 (58%), Gaps = 10/203 (4%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
E Y+ LCRG+ P + +QL+CRY + +L P+K EE L+P +++ RD++ D +
Sbjct: 274 ENYKRLCRGEQLRTPKMDSQLRCRYYSGESGFFKLQPIKLEEYNLKPYVVVLRDLLQDRD 333
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
+ + A+PR+R+ + L + + S WL + + PV R+++ ++ + GL
Sbjct: 334 LADMIAFAKPRVRKLQLSRRI---LVYSKHYCDTSTWLNDDDAPVAARVNQYLQSLLGLG 390
Query: 166 T----STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT---GNRVATVLFYMSDVA 218
T AE+ Q+ NYGIGGHY PH+D+ + S+ T G+RVAT++ YMSDV
Sbjct: 391 TLYSKDEAEKYQLANYGIGGHYVPHHDYLEETLTSRHVSIVTRLFGDRVATLMIYMSDVE 450
Query: 219 QGGATVFTSLNLSLWPEKGTAAF 241
+GGATVF SL + + P+K + F
Sbjct: 451 EGGATVFPSLGVRVSPKKVSMQF 473
>gi|47191658|emb|CAG13505.1| unnamed protein product [Tetraodon nigroviridis]
Length = 156
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 72/154 (46%), Positives = 94/154 (61%), Gaps = 26/154 (16%)
Query: 64 AQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR---- 118
+ L CRY N P L L P KEE+ + P I+ Y D + D+EID IK++A+P++R
Sbjct: 3 SHLFCRYRSGNRNPRLLLKPFKEEDEWDSPHIVRYLDFLSDTEIDKIKELAKPKVRHYSK 62
Query: 119 ---------------------RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRR 157
RATV++ KTG L ANYR+SKSAWL E PVI R+++R
Sbjct: 63 KKSVCYNVEITRSTFLFFQLARATVRDPKTGVLTTANYRVSKSAWLEGEEDPVIARVNQR 122
Query: 158 VEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFAR 191
+E +TGLT TAE LQV NYG+GG YEPH+DF+R
Sbjct: 123 IEDLTGLTVETAELLQVANYGLGGQYEPHFDFSR 156
>gi|402584932|gb|EJW78873.1| hypothetical protein WUBG_10221 [Wuchereria bancrofti]
Length = 187
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/127 (52%), Positives = 83/127 (65%), Gaps = 2/127 (1%)
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
S+WL EH V+ RI++R++ T L T TAEELQV NYGIGGHYEPHYD +R + F+
Sbjct: 7 SSWLGSTEHEVVNRINKRLDLATNLETETAEELQVQNYGIGGHYEPHYDCSR--RESVFE 64
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAA 259
GNR+AT+L YM+ GG TVF L S+ K A FW+NL SG D + HAA
Sbjct: 65 KTKNGNRIATILIYMTKPEIGGGTVFIDLKTSISCTKNAALFWYNLMRSGAVDIRSYHAA 124
Query: 260 CPVLTGS 266
CPVLTG+
Sbjct: 125 CPVLTGT 131
>gi|195352184|ref|XP_002042594.1| GM14981 [Drosophila sechellia]
gi|194124478|gb|EDW46521.1| GM14981 [Drosophila sechellia]
Length = 539
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 79/225 (35%), Positives = 118/225 (52%), Gaps = 8/225 (3%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
K E CRG+ + + +L CRY +L+L PLK E +QP I+LY DV+Y++E
Sbjct: 297 KLERGCRGEWSRKSS--PELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEF 354
Query: 107 DLIKKMA---QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
++ +A + T ++ R+ K + P I+RR+ M+G
Sbjct: 355 KSMRDLAMYNDSMIDGWTYVDFDKKGNPKQQDRVVKIISFQGTTAPFTLSINRRLADMSG 414
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEA--NAFKSLGTGNRVATVLFYMSDVAQGG 221
L L + NYG+GGH+ H D+ + + F G G+R+AT LFY SDV GG
Sbjct: 415 LEMRENMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFG-GDRIATALFYASDVPLGG 473
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
TVFT L +++ P+KG A W NL+ +G+ D T H+ CPV+ GS
Sbjct: 474 TTVFTKLKIAVKPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGS 518
>gi|195390825|ref|XP_002054068.1| GJ24233 [Drosophila virilis]
gi|194152154|gb|EDW67588.1| GJ24233 [Drosophila virilis]
Length = 533
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 81/210 (38%), Positives = 117/210 (55%), Gaps = 16/210 (7%)
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
+L C Y +LRL P K E P I ++ DV+Y SEI + ++ +P L+R VQN
Sbjct: 296 RLTCYYKTNPSEFLRLAPFKLELLSKDPYIAVFHDVIYASEIAELIRIGEPMLKRTAVQN 355
Query: 125 Y-KTGELEIANYRISKSAW-----LREPEHPVIERISRRVEHMTGL--TTSTAEELQVVN 176
+ + I+ R + +W L + E +I RI RR+E MTGL T + ++LQ++N
Sbjct: 356 ITQNVDTYISKDRTATGSWILNGNLTKLERNMIWRIQRRIEDMTGLLITGFSEQDLQLLN 415
Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
Y GGHY+ HYDF +F +R+AT L Y++DV +GGATVF L+L + PE+
Sbjct: 416 YVFGGHYQSHYDFF---NCPSFPH----DRIATTLIYLNDVVRGGATVFPKLDLVVQPER 468
Query: 237 GTAAFWHN-LHSSGDGDYYTRHAACPVLTG 265
G W+N L + D D + H CPVL G
Sbjct: 469 GKVLHWYNMLPDTFDYDRRSLHGGCPVLIG 498
>gi|21358233|ref|NP_651814.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
gi|20269810|gb|AAM18060.1|AF495538_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE3
[Drosophila melanogaster]
gi|15291443|gb|AAK92990.1| GH21465p [Drosophila melanogaster]
gi|23172714|gb|AAN14251.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
gi|220945610|gb|ACL85348.1| PH4alphaNE3-PA [synthetic construct]
gi|220955396|gb|ACL90241.1| PH4alphaNE3-PA [synthetic construct]
Length = 481
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 80/243 (32%), Positives = 130/243 (53%), Gaps = 19/243 (7%)
Query: 25 ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLK 84
+ K P + + +P L E Y+ LCR + P+ +L CRY +L L PLK
Sbjct: 245 QFKANPYEAIDSSPKL----GEGYKRLCRSSFSPNPS---KLHCRYNSTTSAFLILAPLK 297
Query: 85 EEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLR 144
EE L+P I++Y D++ D +I + +A+P L K E+ N ++S++
Sbjct: 298 MEEISLEPHIVVYHDILPDKDIQQLITLAEPLL--------KPTEMFDDNKNEARSSYRT 349
Query: 145 EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTG 204
P+++ +++R+ +TGL + ++ YG G Y +YDF + + +S G G
Sbjct: 350 PLGGPLLDSLTQRMRDITGLQIRQGNPINIIKYGFGAPYTNYYDFFKKRNS---ESKGFG 406
Query: 205 NRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVL 263
+R+AT +FY++D GGATVF LN+ + E+G FW+NL+ + D + T HAACPV
Sbjct: 407 DRMATFMFYLNDAPYGGATVFPRLNVKVPAERGKVLFWYNLNGDTHDMEPTTMHAACPVF 466
Query: 264 TGS 266
GS
Sbjct: 467 HGS 469
>gi|195591304|ref|XP_002085382.1| GD14758 [Drosophila simulans]
gi|194197391|gb|EDX10967.1| GD14758 [Drosophila simulans]
Length = 509
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 79/225 (35%), Positives = 117/225 (52%), Gaps = 8/225 (3%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
K E CRG+ P +L CRY +L+L PLK E +QP I+LY DV+Y++E
Sbjct: 267 KLERGCRGEW--PRKSSPELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEF 324
Query: 107 DLIKKMA---QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
++ +A + T ++ R+ K + P I+RR+ M+G
Sbjct: 325 KSMRDIAMYNDSMIDGWTYVDFDKKGNPKQQDRVVKIISFQGTTAPFTLSINRRLADMSG 384
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEA--NAFKSLGTGNRVATVLFYMSDVAQGG 221
L L + NYG+GGH+ H D+ + + F G G+R+AT +FY SDV GG
Sbjct: 385 LEMRENMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFG-GDRIATAVFYASDVPLGG 443
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
TVFT L +++ P+KG A W NL+ +G+ D T H+ CPV+ GS
Sbjct: 444 TTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGS 488
>gi|24666354|ref|NP_730347.1| CG32199 [Drosophila melanogaster]
gi|23093193|gb|AAF49251.3| CG32199 [Drosophila melanogaster]
Length = 509
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 78/225 (34%), Positives = 117/225 (52%), Gaps = 8/225 (3%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
K E CRG+ P +L CRY +L+L PLK E +QP I+LY DV+Y++E
Sbjct: 267 KLERGCRGEW--PKKSSPELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEF 324
Query: 107 DLIKKMAQ---PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
++ +A + T ++ R+ K + P I+RR+ M+G
Sbjct: 325 KSMRDIAMYNGSMIDGWTYVDFDKKGNPKQQDRVVKMIAFQGTTAPFTLSINRRMADMSG 384
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEA--NAFKSLGTGNRVATVLFYMSDVAQGG 221
L L + NYG+GGH+ H D+ + + F G G+R+AT L Y SD+ GG
Sbjct: 385 LEMRDNMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFG-GDRIATALIYASDIPLGG 443
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
TVFT L +++ P+KG+A W NL+ +G+ D T H+ CPV+ GS
Sbjct: 444 TTVFTKLKIAVQPKKGSALIWFNLNHAGEPDPLTEHSVCPVVLGS 488
>gi|198466405|ref|XP_001353987.2| GA16752 [Drosophila pseudoobscura pseudoobscura]
gi|198150585|gb|EAL29723.2| GA16752 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 87/237 (36%), Positives = 125/237 (52%), Gaps = 40/237 (16%)
Query: 52 CRGDL---TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
CRG + PP L CRY +L L PLK E QP I+LY +V+Y+ E+
Sbjct: 273 CRGQWQRKSSPP-----LACRYNREYSAFLLLAPLKMEVLNQQPLIVLYHEVLYEKELRA 327
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLR-EPEHPVIE-------------RI 154
++ +A + AT+Q+ T R+ ++ EPE V++ I
Sbjct: 328 MRDIAN---KNATMQDGWT--------RMHSDQRVKPEPEDRVLKLHIFQGNSESFSPSI 376
Query: 155 SRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA----RPGEANAFKSLGTGNRVATV 210
+RR+ MTGL L + NYG+GG++ HYD+ RP AN F G G+ +ATV
Sbjct: 377 NRRIADMTGLEVQGNNALHLSNYGLGGYFNAHYDYVELTKRP--ANYFTEWG-GDVLATV 433
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
L Y SDV GGA VF L +S+ P+KG A W NL+++G+ D ++HA CPV+ GS+
Sbjct: 434 LLYASDVRLGGAVVFPKLKISVEPKKGNALIWDNLNNAGNPDKLSKHAVCPVVMGSH 490
>gi|20177086|gb|AAM12247.1| AT28279p [Drosophila melanogaster]
Length = 509
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 78/225 (34%), Positives = 116/225 (51%), Gaps = 8/225 (3%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
K E CRG+ P +L CRY +L+L PLK E +QP I+LY DV+Y++E
Sbjct: 267 KLERGCRGEW--PKKSSPELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEF 324
Query: 107 DLIKKMAQ---PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
++ +A + T ++ R+ K + P I+RR+ M+G
Sbjct: 325 KSMRDIAMYNGSMIDGWTYVDFDKKGNPKQQDRVVKMIAFQGTTAPFTLSINRRMADMSG 384
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEA--NAFKSLGTGNRVATVLFYMSDVAQGG 221
L L + NYG+GGH+ H D+ + + F G G+R+AT L Y SD+ GG
Sbjct: 385 LEMRDNMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFG-GDRIATALIYASDIPLGG 443
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
TVFT L +++ P+KG A W NL+ +G+ D T H+ CPV+ GS
Sbjct: 444 TTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGS 488
>gi|195471732|ref|XP_002088156.1| GE14021 [Drosophila yakuba]
gi|194174257|gb|EDW87868.1| GE14021 [Drosophila yakuba]
Length = 265
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 114/218 (52%), Gaps = 16/218 (7%)
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
E CRG+ PP QL CRY P++R+ PLKEEE +P I LY DV+YDSEI
Sbjct: 43 EQGCRGNF--PPH--PQLVCRYNSTTTPFMRIAPLKEEEISKEPLIWLYHDVIYDSEIAQ 98
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
+ + + + T NY T + + + + + + + R+ ++GL
Sbjct: 99 LTNLTREEMILGTTNNYTTPDRVNRLFHVKVT---NDDGGQLDRTLVNRMADISGLDMGN 155
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
L +NYG+GG+++ H D+ A L T +SDV GGAT+F +
Sbjct: 156 TTSLARINYGLGGYFQEHSDYVDIKLHPASSLLPTS---------ISDVPVGGATIFPAA 206
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
L++ P+KG+A FW+NLH++GD + TRHA CP + GS
Sbjct: 207 KLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGS 244
>gi|194760358|ref|XP_001962408.1| GF14452 [Drosophila ananassae]
gi|190616105|gb|EDV31629.1| GF14452 [Drosophila ananassae]
Length = 498
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 77/217 (35%), Positives = 115/217 (52%), Gaps = 12/217 (5%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG+ +V CRY P++R+ PLKEEE P I LY DV++DSE+ L+ K
Sbjct: 271 CRGEYPNQSRLV----CRYNTTTTPFMRIAPLKEEEISKDPLIWLYHDVLFDSEMALLTK 326
Query: 112 MAQPRLRRATVQNYKTGELE-IANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
R +Q Y + YRI + + + R+ ++GL
Sbjct: 327 NLT---REEMIQGYTNNQTTPDKGYRIFQVKVYEGDGGKLDRTLVNRMTDISGLDVGNHT 383
Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATVLFYMSDVAQGGATVFTSLN 229
L NYG+G H++ H D+ E LG+ G+R+ T LFY SDV GGAT+F + N
Sbjct: 384 YLARANYGLGTHFQEHSDYVDLREN---PDLGSEGDRLFTFLFYASDVEMGGATIFPAAN 440
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+S+ P+KG+A FW+NLH+ + + +RHA CP++ G+
Sbjct: 441 ISIKPKKGSALFWYNLHNDWEPNPLSRHAVCPMVLGN 477
>gi|195441323|ref|XP_002068462.1| GK20483 [Drosophila willistoni]
gi|194164547|gb|EDW79448.1| GK20483 [Drosophila willistoni]
Length = 550
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 115/218 (52%), Gaps = 9/218 (4%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI-DLIK 110
CRG +V CRY P+L+L P+K EE L P I+ Y DV+ D+EI DL +
Sbjct: 312 CRGMFRQHTNLV----CRYNFTTSPFLQLAPMKLEEISLDPYIVQYHDVLSDNEIEDLKR 367
Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
+ + + + E + I + P +++RI+RR+ MTG ++
Sbjct: 368 EGIKGTMINGWTSLKSSNATENESRTIVARVAIMSPSLEIVQRINRRIIDMTGFNIEESK 427
Query: 171 ELQVVNYGIGGHYEPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
+Q+ + +GG + PHYD+ R + + K LG +RVA+V+FY DV +GGAT F
Sbjct: 428 TIQLAAFSVGGFFMPHYDYLYDRLLDTDVLKKLG--DRVASVIFYAGDVTEGGATNFPRN 485
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
L + P+KG+A FW+N G D + H+ CPV+ GS
Sbjct: 486 QLVVQPKKGSALFWYNKFDDGSPDPRSLHSICPVVVGS 523
>gi|148684485|gb|EDL16432.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III [Mus musculus]
Length = 396
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 147/300 (49%), Gaps = 24/300 (8%)
Query: 3 FPTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
F ++R N L Y+ L ++ ++ E P L+ R+ YE LC+ + P
Sbjct: 98 FQDNKRMARNVLKYERLLAENGHQMAAETAIQRPNVPHLQT--RDTYEGLCQTLGSQPTH 155
Query: 62 I-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
+ L C Y + PYL L P ++E +L+P I LY D + D E I+++A+P L+R+
Sbjct: 156 YQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRS 215
Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQVVNY
Sbjct: 216 VV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNY 272
Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATV-------LFYMSDVAQGGATVFTSLNL 230
GIGGHYEPH+D A + S+ G A + + +S V GGAT F N
Sbjct: 273 GIGGHYEPHFDHATVTMGSMLSSVEAGGATAFIYGNFSVPVVKLSSVEAGGATAFIYGNF 332
Query: 231 SL----WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLT---GSNSLHSTCPCGLRRGLQR 283
S+ WP G+ + N T A L+ G+ ++ P L+ GLQ+
Sbjct: 333 SVPVVKWPTSGSTSMDRNSEDPAAPTLKTETLADGSLSEKPGAKAMGRGEPTLLKEGLQQ 392
>gi|194751829|ref|XP_001958226.1| GF23628 [Drosophila ananassae]
gi|190625508|gb|EDV41032.1| GF23628 [Drosophila ananassae]
Length = 484
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 81/272 (29%), Positives = 136/272 (50%), Gaps = 33/272 (12%)
Query: 11 GNKLYYQEALNKSPELK-DEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQ---- 65
N+ Y++A NK LK EP +++ V +L +E+ + P+++A
Sbjct: 205 NNETIYKQASNK---LKVSEPKEIDKVVYSLLTQWKEESHNATN---STEPSLIAHYTGC 258
Query: 66 ---------LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
L CRY P+L+L PLK EE L P I+LY +V+ D EI+ +K +
Sbjct: 259 RNQFPKQNNLVCRYNATTTPFLKLAPLKLEEVSLDPYIVLYHNVISDREIEEMKGL---- 314
Query: 117 LRRATVQNYKTGELEIANYR--ISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQV 174
+ G ++ R +S+ WL + E +R++ R+ +TG LQ+
Sbjct: 315 -----IDEMDNGWTDLNESREIVSRLVWLTK-ESRFRKRLNLRIRDITGFNVDEIRGLQI 368
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
N+G+GG ++PHYD+ ++ G+R+A+++FY+ DV GG TVF + +++ P
Sbjct: 369 ANFGVGGQFKPHYDYFTERILRLNNTI-LGDRIASIIFYVGDVVHGGQTVFPDIQIAVKP 427
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+KG++ FW N D + H+ CPVL G
Sbjct: 428 QKGSSLFWFNTFDDATPDPRSLHSVCPVLIGD 459
>gi|194765144|ref|XP_001964687.1| GF22917 [Drosophila ananassae]
gi|190614959|gb|EDV30483.1| GF22917 [Drosophila ananassae]
Length = 529
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 78/241 (32%), Positives = 124/241 (51%), Gaps = 16/241 (6%)
Query: 35 NVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRI 94
+V+ + T Y+ CR T P +L CRY P+L++ PLK EE L P I
Sbjct: 265 DVSRDIYETLSNNYQATCRSSHTPNPT---RLHCRYNSTTTPFLKIAPLKMEEISLDPYI 321
Query: 95 ILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE--------P 146
++Y DV+ D +I + ++++ +L A V + + +R + +WL + P
Sbjct: 322 VVYHDVLPDGDISEVLRLSETKLEPAQVVSTPRTSNNV-KFRTALGSWLPDYEEVVKGPP 380
Query: 147 EHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNR 206
+ P+ R+ + +TGL + QV+ Y G HY H+D+ + ++ G+R
Sbjct: 381 KGPLYGRLRNILRDVTGLVIWDYQFFQVLKYQFGAHYAQHHDYFN---MSLKSTVLQGDR 437
Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTG 265
+ATVLFY++D GGATVF LN+ + EKG FW+NL + D D T H ACP+ G
Sbjct: 438 IATVLFYLNDAPHGGATVFPMLNVKVPAEKGKILFWYNLKGETHDFDEKTLHGACPIFHG 497
Query: 266 S 266
+
Sbjct: 498 T 498
>gi|195591298|ref|XP_002085379.1| GD14755 [Drosophila simulans]
gi|194197388|gb|EDX10964.1| GD14755 [Drosophila simulans]
Length = 515
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 114/207 (55%), Gaps = 16/207 (7%)
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
+ L CRY +L+L PLK EE P I+L+ +++ D EI+ +K +
Sbjct: 295 SNLVCRYNSSTNAFLQLAPLKMEEVSRDPYIVLFHEMISDKEIEEMK---------GEIT 345
Query: 124 NYKTGELEIANYR--ISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
+ G + + + +S+ W+R+ E +RI++R+ MTG +Q+ N+G+GG
Sbjct: 346 EMENGWTSLGDSKEIVSRVYWIRK-ESSFSKRINQRISDMTGFKLEEFPAIQLANFGVGG 404
Query: 182 HYEPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
+++PHYD+ R E + +LG +R+ +++FY +V+QGG TVF L +++ P+KG A
Sbjct: 405 YFKPHYDYYTDRLKEVDVNNTLG--DRIGSIIFYAGEVSQGGQTVFPDLKVAVEPKKGNA 462
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
FW N D T H+ CPV+ GS
Sbjct: 463 LFWFNAFDDSSPDPRTLHSVCPVIVGS 489
>gi|386771382|ref|NP_649044.3| CG18233 [Drosophila melanogaster]
gi|383291998|gb|AAF49254.3| CG18233 [Drosophila melanogaster]
Length = 515
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 72/219 (32%), Positives = 118/219 (53%), Gaps = 20/219 (9%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG P + L CRY +L+L PLK EE P I+++ +V+ D +I+ +K
Sbjct: 287 CRG--LFPKK--SNLVCRYNSSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKDIEEMK- 341
Query: 112 MAQPRLRRATVQNYKTGELEIANYR--ISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
+ + G + + + +S+ W+R+ E +RI++R+ MTG
Sbjct: 342 --------GEITEMENGWTSLGDPKEIVSRVYWIRK-ESSFSKRINQRISDMTGFKLEEF 392
Query: 170 EELQVVNYGIGGHYEPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
+Q+ N+G+GG+++PHYDF R E + +LG +R+ +++FY +V+QGG TVF
Sbjct: 393 PAIQLANFGVGGYFKPHYDFYTDRLKEVDVNNTLG--DRIGSIIFYAGEVSQGGQTVFPD 450
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
L +++ P+KG A FW N D + H+ CPVL GS
Sbjct: 451 LKVAVEPKKGNALFWFNAFDDSTPDPRSLHSVCPVLVGS 489
>gi|443705944|gb|ELU02240.1| hypothetical protein CAPTEDRAFT_227850 [Capitella teleta]
Length = 475
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 85/226 (37%), Positives = 119/226 (52%), Gaps = 20/226 (8%)
Query: 84 KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
K E + P I L+ D + DSEI +K MA+P+ + + V + GE R+S +A++
Sbjct: 176 KTELLHANPEIYLFHDFISDSEIQRLKDMAEPQFQSSAVLDDTGGESFFDVSRLSSTAFV 235
Query: 144 REPEHPVIERISRRVEHMTGLTT------STAEELQVVNYGIGGHYEPHYDFARPGEANA 197
+ + ++ ++RRV +TGL T S +E LQV+ YG GG Y PHYD EA+
Sbjct: 236 ND-SNDLVASLNRRVSKLTGLQTEVLDSFSESESLQVLRYGPGGLYTPHYD-TLGSEADL 293
Query: 198 FKSLG-TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTR 256
+ TG+R+AT + Y+ GGATVF L +S+ +KG AAFW NLH G D T
Sbjct: 294 PPYIQHTGDRIATFILYLDIATAGGATVFPLLPMSIPIQKGAAAFWFNLHPDGSLDRRTL 353
Query: 257 HAACPVLTGS-----------NSLHSTCPCGLRRGLQRSGIICTLV 291
HAACPV+ G+ S H G RR IIC L+
Sbjct: 354 HAACPVIRGTKWECVIVSNDMTSDHEMFTVGKRRTEIVRLIICILL 399
>gi|347966278|ref|XP_003435891.1| AGAP013377-PA [Anopheles gambiae str. PEST]
gi|333470133|gb|EGK97522.1| AGAP013377-PA [Anopheles gambiae str. PEST]
Length = 290
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 70/217 (32%), Positives = 124/217 (57%), Gaps = 5/217 (2%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
+ Y LCRG PP++ + L C Y RN + + P K E P + L+ + ++D E
Sbjct: 46 DPYMDLCRGVYVPPPSLTSSLYCWYDVRNAHSV-ISPSKVEALSNDPFVALFHEFVHDGE 104
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
I ++ + ++++ N + N+ ++ L + +HPV+ER+++R+E TGL+
Sbjct: 105 IAQLQALGSMHIKQSGPSNDSWLPVFYENH---QTYTLHDRDHPVVERLTKRIERRTGLS 161
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
TAE+L+V+ +G D E +A + G+R+AT+LF++SDV GG T+F
Sbjct: 162 CDTAEDLKVIYNEVGAFKTAALDAIHKKE-DAQRFAYAGDRLATMLFFLSDVTNGGYTIF 220
Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPV 262
L +++ P+KGTAAFW+NL +G+G+ +++ CP+
Sbjct: 221 PKLRVAIRPQKGTAAFWYNLKDTGEGNVQMKYSICPL 257
>gi|195145314|ref|XP_002013641.1| GL24244 [Drosophila persimilis]
gi|194102584|gb|EDW24627.1| GL24244 [Drosophila persimilis]
Length = 496
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 77/225 (34%), Positives = 119/225 (52%), Gaps = 11/225 (4%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C+G +P + + L+C Y +LRL PL+ E P + LY +V+ +E +
Sbjct: 271 CQGRSRLP--VQSSLRCHYSAEGSAFLRLAPLRMELLSRDPLVALYHEVVSAAEQRHLML 328
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+++ +L+R Y R SA + P +E++ RR+E +TGL + +E
Sbjct: 329 LSESQLQRQRGHQYD-------KIRTFASASVAANATPTVEQLHRRLEDITGLDLAESEP 381
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
L+++NYGIGG Y H D +P + R+ATVL Y+SDV GG T F +L L
Sbjct: 382 LRILNYGIGGQYYIHVDCEQP--QTHVEPYPKEYRLATVLLYLSDVRLGGFTSFPALGLG 439
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCG 276
+ P +G+A WHN +++G+ DY HAACPVL G+ + S G
Sbjct: 440 IRPNRGSALVWHNANNAGNCDYRALHAACPVLLGTRWVASKWISG 484
>gi|195352176|ref|XP_002042590.1| GM14977 [Drosophila sechellia]
gi|194124474|gb|EDW46517.1| GM14977 [Drosophila sechellia]
Length = 485
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 105/201 (52%), Gaps = 14/201 (6%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
L C Y +LR+ PLK E L+P I+LY DV+Y+SEI IK ++ P L+
Sbjct: 290 LSCHYEQNTSEFLRIAPLKVETLSLKPHIVLYHDVIYESEISKIKNISLPSLKSPL---- 345
Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
++ +Y + + +P+ P +S R++ MTG + Q+ NYGI G
Sbjct: 346 --RIIDAVDYNLKLAQIREDPQSP----LSLRIKDMTGEDVKEDTDFQIDNYGICGFRNF 399
Query: 186 HYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL 245
H D + A G+R+ ++LF+M+DV QGGA F +LNL++WP KG+A W NL
Sbjct: 400 HTDNIEIQDQTA----ELGDRLTSILFFMNDVVQGGAFAFPNLNLTIWPHKGSALVWRNL 455
Query: 246 HSSGDGDYYTRHAACPVLTGS 266
+ H +CPV+ GS
Sbjct: 456 DHRMQPNKDLLHVSCPVVVGS 476
>gi|195494561|ref|XP_002094890.1| GE19962 [Drosophila yakuba]
gi|194180991|gb|EDW94602.1| GE19962 [Drosophila yakuba]
Length = 539
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/220 (34%), Positives = 114/220 (51%), Gaps = 8/220 (3%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG+ PP +L CRY +L+L PLK E +QP I+LY DV+Y++E ++
Sbjct: 302 CRGEW--PPKSSPELICRYNRDTSAFLKLAPLKLEILSVQPVILLYHDVLYENEFKSMRD 359
Query: 112 MA---QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
A + T ++ R+ K+ + P I+RR+ +M+GL
Sbjct: 360 AAIFNASMIDGWTYYDFDQKGNPKWQDRVVKTIGFQGTTAPFTLSINRRLGYMSGLEMRE 419
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEA--NAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
L + NYG+GG++ H+D+ + N F G G+ +AT + Y SDV GG TVF+
Sbjct: 420 NMMLYLTNYGLGGNFRKHFDYVELAKRPPNFFADSG-GDHIATAVLYASDVPLGGTTVFS 478
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
L L++ P+KG A W NL+ G D T H+ CPV+ GS
Sbjct: 479 KLKLAVQPKKGNALVWFNLNHDGKPDPLTEHSVCPVVLGS 518
>gi|242001766|ref|XP_002435526.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215498862|gb|EEC08356.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 559
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 74/203 (36%), Positives = 114/203 (56%), Gaps = 12/203 (5%)
Query: 44 EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
E Y LCRG++ P + ++L+CRY + L P+K EE L+P II+ RDV+ +
Sbjct: 291 EELNYRRLCRGEVLRTPQMDSKLRCRYYKGQDGFFTLHPIKLEEINLKPYIIVMRDVVQE 350
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
+I+ + A+PRL+R+T Y + R S +AWL + E P+ R++ + + G
Sbjct: 351 RDIEDLMAFAEPRLQRSTT--YTGDGNAPSTRRTSSNAWLWDDEAPIANRMNWYLRALVG 408
Query: 164 LTT----STAEELQVVNYGIGGHYEPHYDF------ARPGEANAFKSLGTGNRVATVLFY 213
L T AE Q+ NYG GG++ PHYD+ A A+ + G+R+AT++ Y
Sbjct: 409 LGTLGSEYEAEAYQLANYGSGGYFLPHYDYLQDTLHAHNSTADYYLQNNEGDRLATLMIY 468
Query: 214 MSDVAQGGATVFTSLNLSLWPEK 236
M+DV +GGATVF L + L P+K
Sbjct: 469 MTDVEEGGATVFPRLGVRLVPKK 491
>gi|195113247|ref|XP_002001179.1| GI22114 [Drosophila mojavensis]
gi|193917773|gb|EDW16640.1| GI22114 [Drosophila mojavensis]
Length = 487
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 144/274 (52%), Gaps = 27/274 (9%)
Query: 9 AQGNKLYYQEALNKSPELKDEPPK---------VNNVA--PTLEVTEREKYEMLCRGDLT 57
A G++ + AL + P L+D+ + V N+ P L++ ++E E +
Sbjct: 215 AAGDEELSRAALLEEPSLRDQVEQFLLDYRNYNVTNIEDHPYLDIMDKEFIEFCGSSYMP 274
Query: 58 VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
P +V C Y + +L L P K E P ++++ DV+Y+SEI+ + ++++P L
Sbjct: 275 QPTRLV----CSYKTKPSKFLYLAPFKMELLSEDPYMVVFHDVIYESEIEHLNRISKPFL 330
Query: 118 RRATVQNYKTGELEIANYRISKSAWL-REPEHP----VIERISRRVEHMTGLTTSTAEEL 172
+RATV E + +R + A+L R+ P ++ERI +R+ M+ L + +
Sbjct: 331 QRATVVVEDNSEDTLIKFRTANGAFLYRDKISPKDVQLVERIFQRMRDMSDLQIND-DAF 389
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
+ + Y GGHY+ H D+ N T +R AT + Y++DVA+GGATVF + +++
Sbjct: 390 EYLKYDFGGHYDIHADYF-----NYTDDQFTDDRFATFVIYLNDVARGGATVFPDVEIAV 444
Query: 233 WPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTG 265
PE+G W+N++ S D + ++ H ACPVL G
Sbjct: 445 HPERGKVIHWYNMNPKSFDYELHSYHGACPVLIG 478
>gi|221512810|ref|NP_649043.3| CG18234 [Drosophila melanogaster]
gi|66771545|gb|AAY55084.1| IP12246p [Drosophila melanogaster]
gi|220902636|gb|AAF49255.4| CG18234 [Drosophila melanogaster]
Length = 515
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 106/201 (52%), Gaps = 14/201 (6%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
L C Y +LR+ PLK E L+P I+LY DV+YDSEI +K ++ P L+ Y
Sbjct: 290 LSCHYEKNTSEFLRIAPLKVETLSLKPHIVLYHDVIYDSEISKVKNISLPSLKSPLRILY 349
Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
+Y + K A +RE +S R++ MTG + Q+ NYGI G
Sbjct: 350 AI------DYNL-KFAKIREDHQ---SPLSLRIKDMTGEDVQEDTDFQIDNYGICGFRNF 399
Query: 186 HYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL 245
H D + A G+R+ +++F+M+DVAQGGA F +LNL++WP+KG+A W NL
Sbjct: 400 HTDNIELQDQTA----ELGDRLTSIMFFMNDVAQGGALAFPNLNLTIWPQKGSALVWRNL 455
Query: 246 HSSGDGDYYTRHAACPVLTGS 266
+ H +CPV+ GS
Sbjct: 456 DHRMQPNQDLLHVSCPVVVGS 476
>gi|195128347|ref|XP_002008625.1| GI13597 [Drosophila mojavensis]
gi|193920234|gb|EDW19101.1| GI13597 [Drosophila mojavensis]
Length = 457
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 77/225 (34%), Positives = 108/225 (48%), Gaps = 42/225 (18%)
Query: 49 EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
E CRG P L CRYV+ N YL+L P+K E+ L+P + LY DV+YDSEI
Sbjct: 260 ERACRG--LWPERKTDHLSCRYVYENSAYLKLAPMKLEQLSLEPVVQLYHDVLYDSEIKA 317
Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
IK M+ P + V+ I++RV MTG
Sbjct: 318 IKNMSVPEAKAKRVE----------------------------LNINQRVADMTGYGMME 349
Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
+L V+N+ +G + AR +R+AT++FY +DVA GGAT+F L
Sbjct: 350 HNKLHVLNFALGQGADTKSCKAR------------ADRIATIVFYANDVAIGGATIFPKL 397
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC 273
L + P +GTA W+NL++ G D +HA CPV+ GS + C
Sbjct: 398 RLLVQPRRGTALLWYNLNADGAADPLAKHAVCPVVLGSRWAITKC 442
>gi|195494570|ref|XP_002094894.1| GE19958 [Drosophila yakuba]
gi|194180995|gb|EDW94606.1| GE19958 [Drosophila yakuba]
Length = 498
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 107/202 (52%), Gaps = 14/202 (6%)
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
L C Y LR+ PLK E L+P I+LY DV+YDSEI +K ++ P L+
Sbjct: 290 NLSCHYEKHTSDLLRIAPLKVETLSLKPHIVLYHDVIYDSEISKVKNISLPSLKSP---- 345
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
+ E N +++K + E H ++ R++ MTG + Q+ NYGI G
Sbjct: 346 LRILHAEDHNLKLAK---ISEDYHS---PLNLRIKDMTGEDVKEDTDFQIDNYGICGFRY 399
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
H D + A G+R+ +++F+M+DVAQGGA VF LNL++WP+KG+A W N
Sbjct: 400 YHTDNLESQDQTA----ELGDRLTSIMFFMNDVAQGGAFVFLHLNLTIWPQKGSALVWRN 455
Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
L + HA+CPV+ GS
Sbjct: 456 LDHRMQPNEDLLHASCPVIVGS 477
>gi|195591296|ref|XP_002085378.1| GD14754 [Drosophila simulans]
gi|194197387|gb|EDX10963.1| GD14754 [Drosophila simulans]
Length = 508
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 105/202 (51%), Gaps = 14/202 (6%)
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
L C Y +LR+ PLK E L+P I+LY DV+YDSEI +K ++ P L+
Sbjct: 288 HLSCHYEQNTSEFLRIAPLKVETLSLKPHIVLYHDVIYDSEISKVKNISLPSLKSPL--- 344
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
++ +Y + + + + P +S R++ MTG + Q+ NYGI G
Sbjct: 345 ---RIIDAVDYNLKLAQIRDDHQSP----LSLRIKDMTGEDVQEDSDFQIDNYGICGFRN 397
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
H D + A G+R+ ++LF+M+DV QGGA F +LNL++WP+KG+A W N
Sbjct: 398 FHTDNIEMQDQTA----ELGDRLTSILFFMTDVVQGGAFAFPNLNLTIWPQKGSALVWRN 453
Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
L + H +CPV+ GS
Sbjct: 454 LDHRMQPNKDLLHVSCPVVVGS 475
>gi|242003035|ref|XP_002436120.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215499456|gb|EEC08950.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 173
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 67/143 (46%), Positives = 86/143 (60%), Gaps = 14/143 (9%)
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
+AWL + HPV++++SRR+ TGL+TS+AE LQVVNYG+GGHY PH+DF+ +
Sbjct: 3 AAWLSDHHHPVVKKLSRRIAAATGLSTSSAEHLQVVNYGVGGHYSPHFDFSTKDKPLRGW 62
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL-------------H 246
G R AT L Y+S V +GGAT+F L + + PE G A FWHNL H
Sbjct: 63 ETFAGQRQATWLVYLSSVERGGATLFKRLRVRVQPEAGMALFWHNLPPGSTNSLPSCCVH 122
Query: 247 SSGDGDYYTRHAACPVLTGSNSL 269
S GD T H ACPVL GS +
Sbjct: 123 RS-VGDERTEHGACPVLVGSKWI 144
>gi|195494568|ref|XP_002094893.1| GE19959 [Drosophila yakuba]
gi|194180994|gb|EDW94605.1| GE19959 [Drosophila yakuba]
Length = 486
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 118/217 (54%), Gaps = 16/217 (7%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG +V CRY +L+L PLK EE P I+++ +V+ D EI+ +K
Sbjct: 239 CRGLFPRKTNLV----CRYNSSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKEIEEMKG 294
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ ++N TG LE +S W+RE E +RI++R+ MTG
Sbjct: 295 DIRE------MENGWTG-LEDPKEIVSSVYWIRE-ETSFSKRINQRISDMTGFKLEEFVA 346
Query: 172 LQVVNYGIGGHYEPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
+Q+ N+G+GG+++PH+D+ R +A +LG +R+A+++FY +V+QGG TVF L
Sbjct: 347 IQLANFGVGGYFKPHFDYYTERLRGVDANNTLG--DRIASIIFYAGEVSQGGQTVFPDLK 404
Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ + P++G A FW N D + H+ CPV+ GS
Sbjct: 405 VVVEPKRGNALFWFNKLDDSSPDPRSLHSVCPVIVGS 441
>gi|194871364|ref|XP_001972834.1| GG13661 [Drosophila erecta]
gi|190654617|gb|EDV51860.1| GG13661 [Drosophila erecta]
Length = 506
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 110/201 (54%), Gaps = 17/201 (8%)
Query: 68 CRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT-VQNYK 126
C Y +LR+ PLK E ++P I+LY DV+YDSEI +K ++ P LR + + +
Sbjct: 293 CHYEKNTSDFLRIAPLKVETLSVKPHIVLYHDVIYDSEISKVKNISLPSLRSPSRILRAE 352
Query: 127 TGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPH 186
L++A R +P P +S R++ MTG +LQ+ NYGI G H
Sbjct: 353 DHNLKLAKIR-------EDPRSP----LSLRIKDMTGEDVEEDTDLQIENYGICGFRFYH 401
Query: 187 YDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL- 245
D + A G+R+ ++LF+M+DVA GGA VF + NL+++P+KG+A W NL
Sbjct: 402 NDNLESQDQTA----KLGDRLTSILFFMNDVALGGAFVFLNANLTIFPQKGSALVWRNLD 457
Query: 246 HSSGDGDYYTRHAACPVLTGS 266
HS + +H +CPV+ GS
Sbjct: 458 HSLQPKEDLLQHLSCPVIVGS 478
>gi|196011906|ref|XP_002115816.1| hypothetical protein TRIADDRAFT_59903 [Trichoplax adhaerens]
gi|190581592|gb|EDV21668.1| hypothetical protein TRIADDRAFT_59903 [Trichoplax adhaerens]
Length = 444
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 73/194 (37%), Positives = 106/194 (54%), Gaps = 11/194 (5%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
Y LCR ++ LKC Y +++ P L P+ EE P I LY D++ E +
Sbjct: 238 YTKLCRSHKNYQTSLNNGLKCYYFNQS-PLLHFNPVAVEEISYSPVIRLYHDIISHQEAE 296
Query: 108 LIKKMAQPRLR--RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
++K ++ +L R VQ YR +K AWL + ++ V+ R+S E +TGL
Sbjct: 297 ILKNISSKKLTVARTFVQIMPNNSEAEGEYRFAKHAWLGDIDNQVVRRLSVLSEELTGLD 356
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN-RVATVLFYMSDVAQGGATV 224
S AE+LQV NYG+GGHY PHYD A + TG R+AT++FY+SDV GGATV
Sbjct: 357 LSYAEKLQVANYGVGGHYSPHYDSASIDD-------DTGKPRLATIMFYLSDVDIGGATV 409
Query: 225 FTSLNLSLWPEKGT 238
F + +++P K +
Sbjct: 410 FPDIGKAIFPRKTS 423
>gi|195156517|ref|XP_002019146.1| GL25581 [Drosophila persimilis]
gi|194115299|gb|EDW37342.1| GL25581 [Drosophila persimilis]
Length = 206
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 104/202 (51%), Gaps = 22/202 (10%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID-LIKKMAQPRLRRATVQN 124
L CRY H P+LRL PLKEEE P I LY DV+YDSE + L + + + + N
Sbjct: 2 LVCRYNHTTTPFLRLAPLKEEEVSRDPLIWLYHDVLYDSEFEQLTVNLTRAEMVQGYTDN 61
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
Y T E E RI + + R+ ++GL T +L VNYG+G H+
Sbjct: 62 YTTTEKE----RIFYVNIFEGSGEKLDRDLVNRMADISGLLTGEHTQLGTVNYGLGSHFP 117
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
H D++ +AN M+DV GGAT+F +NL++ P+KG+A FW+N
Sbjct: 118 EHGDYSDI-KANP----------------MTDVPLGGATIFPKINLTIQPKKGSALFWYN 160
Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
+H+ + TRHA CP + G+
Sbjct: 161 IHNDWEPHVLTRHAVCPTIEGN 182
>gi|198428011|ref|XP_002120302.1| PREDICTED: similar to prolyl 4-hydroxylase alpha-2 subunit, partial
[Ciona intestinalis]
Length = 233
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 74/204 (36%), Positives = 107/204 (52%), Gaps = 11/204 (5%)
Query: 65 QLKCRYVHRNV--PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
+LKC Y H P L + P+K EE P ++ + DV+ D + + I ++A P + R+ V
Sbjct: 7 KLKC-YFHNGWKNPRLLIQPIKSEELCDSPHVVRFYDVLSDRDSEEIIRLAAPLMFRSGV 65
Query: 123 QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGH 182
+ R+ K+AWL PV+ RV +TGL LQV NYGIGGH
Sbjct: 66 TGDDGAINDNPMERVGKNAWL--DNSPVVNNFMTRVADITGLNVGAEIYLQVANYGIGGH 123
Query: 183 YEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
++PH D E ++++ R+AT L Y SDV GG T F + P KG+A FW
Sbjct: 124 FDPHID-----ETGGYENI-MERRIATFLTYFSDVEYGGNTPFVYQEVVAEPIKGSAIFW 177
Query: 243 HNLHSSGDGDYYTRHAACPVLTGS 266
+++ + G D T HAACPV+ G+
Sbjct: 178 YDVFNDGSADERTEHAACPVVLGN 201
>gi|198471971|ref|XP_002133305.1| GA28042 [Drosophila pseudoobscura pseudoobscura]
gi|198139547|gb|EDY70707.1| GA28042 [Drosophila pseudoobscura pseudoobscura]
Length = 203
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 104/202 (51%), Gaps = 22/202 (10%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID-LIKKMAQPRLRRATVQN 124
L CRY H P+LRL PLKEEE P I LY DV+YDSE + L + + + + N
Sbjct: 2 LVCRYNHTTTPFLRLAPLKEEEVSRDPLIWLYHDVLYDSEFEQLTVNLTRAEMVQGYTDN 61
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
Y T E E RI + + R+ ++GL T +L VNYG+G H+
Sbjct: 62 YTTTEKE----RIFYVNIFEGSGEKLDRDLVNRMADISGLLTGEHTQLGTVNYGLGSHFP 117
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
H D++ +AN M+DV GGAT+F +NL++ P+KG+A FW+N
Sbjct: 118 EHGDYSDI-KANP----------------MTDVPLGGATIFPKINLTIQPKKGSALFWYN 160
Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
+H+ + TRHA CP + G+
Sbjct: 161 IHNDWEPHVLTRHAVCPTIEGN 182
>gi|321466285|gb|EFX77281.1| hypothetical protein DAPPUDRAFT_106233 [Daphnia pulex]
Length = 128
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 63/117 (53%), Positives = 77/117 (65%)
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
STAE LQ VNYGIG HYEPH+D+AR AFK LG GNR+AT LFYMSDV G ATVF
Sbjct: 2 STAEVLQFVNYGIGWHYEPHFDYARKETTEAFKELGWGNRIATCLFYMSDVEAGSATVFP 61
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQR 283
++WP KG+AAF +NL+ + G+ +TRHA PV+ S + +T RR R
Sbjct: 62 PTGAAVWPRKGSAAFCYNLYPNDKGNEFTRHATFPVIFLSKWVSNTWIHEHRREFHR 118
>gi|194871344|ref|XP_001972830.1| GG13666 [Drosophila erecta]
gi|190654613|gb|EDV51856.1| GG13666 [Drosophila erecta]
Length = 539
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 114/227 (50%), Gaps = 12/227 (5%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
K E CRG+ P +L CRY +L+L PLK E +QP I LY DV+Y+ E
Sbjct: 297 KLERGCRGEW--PKKSSPELICRYSRDTSAFLKLAPLKLEFLSVQPMIHLYHDVLYEKEF 354
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEI---ANYRISKSAWLREPEHPVIERISRRVEHMTG 163
++ +A + Y +I R+ K ++ P I+RR+ M+G
Sbjct: 355 KSMRDVAVFNATMIDGRTYFDFHKKIKPKTQDRVVKMIDFKDTTAPYTLSINRRIADMSG 414
Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFA----RPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
L L + NYG+GG + H D+ RP + F + G+R+AT + Y SDV
Sbjct: 415 LEMRENMVLYLSNYGLGGDFGKHVDYVELAKRPSD---FFADFKGDRIATAVLYASDVPL 471
Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
GG TVF L +++ P+KG A W NL+ +G+ D T H+ CP++ GS
Sbjct: 472 GGTTVFPKLKIAVQPKKGNALVWFNLNHAGEPDPLTEHSVCPIVLGS 518
>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
Length = 454
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 5/177 (2%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR+ L++ ++ D+E D + +A+ RL R+ V N TG+ + R S A + EHP+I
Sbjct: 132 PRVTLFQQLLTDAECDALVALARGRLARSPVINPDTGDENLIEARTSLGAMFQVGEHPLI 191
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
ERI + +TG+ E LQ++NY GG Y+PHYDF RPGEA K G RV
Sbjct: 192 ERIEDCIAAVTGIAAERGEGLQILNYKPGGEYQPHYDFFNPQRPGEARQLKV--GGQRVG 249
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
T++ Y++ GGAT F L L + P KG A ++ S G D T HA PV G
Sbjct: 250 TLVIYLNSPLAGGATAFPKLGLEVAPVKGNAVYFSYRKSDGALDERTLHAGLPVEAG 306
>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CMR15]
Length = 289
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L++ + D E D + + + RL+R+ V N +TGE + + R S+ A + EHP+I
Sbjct: 97 PRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
RI R+ TG+ E QV++Y GG Y+PH+D+ PG + + L G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y++ V GGAT F L L + P KG A F+ G D T HA PV G
Sbjct: 217 VIYLNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDKTLHAGLPVERGEK 273
>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Ralstonia solanacearum GMI1000]
Length = 289
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L++ + D E D + + + RL+R+ V N +TGE + + R S+ A + EHP++
Sbjct: 97 PRIVLFQHFLSDEECDQLIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLV 156
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
RI R+ TG+ E QV++Y GG Y+PH+D+ PG + + L G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y++ V GGAT F L L + P KG A F+ G D T HA PV G
Sbjct: 217 VIYLNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 273
>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
Length = 319
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 68/177 (38%), Positives = 99/177 (55%), Gaps = 5/177 (2%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI L++ ++ E + + +++ RL R+ V N TG+ + + R S A + EHP+I
Sbjct: 127 PRIALFQRLLMPDECEALIALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVGEHPLI 186
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
ER+ R+ +TG+ E LQ++NY G Y+PHYDF RPGEA + G R+A
Sbjct: 187 ERLEARIAAVTGVPVEHGEGLQILNYKPGAEYQPHYDFFNPQRPGEARQLRV--GGQRMA 244
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
T++ Y++DV GGAT F L L + P +G A F+ L G D T HA PV G
Sbjct: 245 TLVIYLNDVPAGGATAFPKLGLRVNPVQGNAVFFAYLGEDGSLDERTLHAGLPVEQG 301
>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
bacterium R229]
Length = 289
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L++ + D E D + + + RL+R+ V N +TGE + + R S+ A + EHP+I
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
RI R+ TG+ E QV++Y GG Y+PH+D+ PG + + L G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y++ V GGAT F L L + P KG A F+ G D T HA PV G
Sbjct: 217 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 273
>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
PSI07]
Length = 289
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L++ + D E D + + + RL+R+ V N +TGE + + R S+ A + EHP+I
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
RI R+ TG+ E QV++Y GG Y+PH+D+ PG + + L G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y++ V GGAT F L L + P KG A F+ G D T HA PV G
Sbjct: 217 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 273
>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 289
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L++ + D E D + + + RL+R+ V N +TGE + + R S+ A + EHP+I
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
RI R+ TG+ E QV++Y GG Y+PH+D+ PG + + L G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y++ V GGAT F L L + P KG A F+ G D T HA PV G
Sbjct: 217 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 273
>gi|194871359|ref|XP_001972833.1| GG13662 [Drosophila erecta]
gi|190654616|gb|EDV51859.1| GG13662 [Drosophila erecta]
Length = 515
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/205 (33%), Positives = 111/205 (54%), Gaps = 16/205 (7%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
L CRY +L+L PLK EE P I+++ +V+ D EI+ +K ++
Sbjct: 297 LVCRYNFSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKEIEEMK---------GEIKQM 347
Query: 126 KTG--ELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
+ G LE +S W+ + E +RI+ R+ MTG +Q+ N+G+GG++
Sbjct: 348 ENGWTSLEEPKEIVSHIYWITK-ESSFSKRINDRISDMTGFKVEEFPAIQLANFGVGGYF 406
Query: 184 EPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
+PHYD+ R E +A +LG +R+A+++ Y +V+QGG TVF + +++ P+KG A F
Sbjct: 407 KPHYDYYTERLKELDANNTLG--DRLASIIIYAGEVSQGGQTVFPDIKVAVEPKKGKALF 464
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
W N D + H+ CPV+ GS
Sbjct: 465 WFNDFDDSSPDPRSLHSVCPVIVGS 489
>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum IPO1609]
Length = 280
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L++ + D E D + + + RL+R+ V N +TGE + + R S+ A + EHP++
Sbjct: 88 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 147
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
RI R+ TG+ E QV++Y GG Y+PH+D+ PG + + L G RVAT+
Sbjct: 148 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 207
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y++ V GGAT F L L + P KG A F+ G D T HA PV G
Sbjct: 208 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 264
>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
Length = 289
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L++ + D E D + + + RL+R+ V N +TGE + + R S+ A + EHP++
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 156
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
RI R+ TG+ E QV++Y GG Y+PH+D+ PG + + L G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y++ V GGAT F L L + P KG A F+ G D T HA PV G
Sbjct: 217 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 273
>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
Length = 292
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L++ + D E D + + + RL+R+ V N +TGE + + R S+ A + EHP++
Sbjct: 100 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 159
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
RI R+ TG+ E QV++Y GG Y+PH+D+ PG + + L G RVAT+
Sbjct: 160 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 219
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y++ V GGAT F L L + P KG A F+ G D T HA PV G
Sbjct: 220 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 276
>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
Length = 288
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L++ + D E D + + + RL+R+ V N +TGE + + R S+ A + EHP++
Sbjct: 96 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 155
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
RI R+ TG+ E QV++Y GG Y+PH+D+ PG + + L G RVAT+
Sbjct: 156 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLDVGGQRVATL 215
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y++ V GGAT F L L + P KG A F+ G D T HA PV G
Sbjct: 216 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 272
>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
Length = 288
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/175 (37%), Positives = 95/175 (54%), Gaps = 1/175 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L++ + D+E D + + + RL+R+ V N TGE + + R S+ + EHP+I
Sbjct: 96 PRIVLFQHFLSDAECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQVGEHPLI 155
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
+I R+ G+ E QV+NY GG Y+PH+DF PG + + L G RVAT+
Sbjct: 156 AKIEVRIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEARQLEVGGQRVATM 215
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ Y++ V GGAT F L L + P KG A F+ G D T HA PV G
Sbjct: 216 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERG 270
>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
Length = 288
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/175 (37%), Positives = 94/175 (53%), Gaps = 1/175 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L++ + D E D + + + RL+R+ V N TGE + + R S+ + EHP+I
Sbjct: 96 PRIVLFQHFLSDQECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQVGEHPLI 155
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
+I R+ G+ E QV+NY GG Y+PH+DF PG + + L G RVAT+
Sbjct: 156 AKIEARIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEARQLEVGGQRVATM 215
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ Y++ V GGAT F L L + P KG A F+ G D T HA PV G
Sbjct: 216 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERG 270
>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
CFBP2957]
gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CFBP2957]
Length = 289
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L++ + D E D + + + RL+R+ V N +TGE + + R S+ A + EHP++
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 156
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
RI R+ TG+ E QV++Y GG Y+PH+D+ PG + + L G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y++ V GGAT F L L + P KG A F+ G D T HA PV G
Sbjct: 217 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 273
>gi|149068803|gb|EDM18355.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III [Rattus
norvegicus]
Length = 266
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 78/208 (37%), Positives = 108/208 (51%), Gaps = 33/208 (15%)
Query: 45 REKYEMLCRGDLTVPPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
R+ YE LC+ + P + L C Y + PYL L P ++E +L+P + LY D + D
Sbjct: 36 RDTYEGLCQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVIHLRPLVALYHDFVSD 95
Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMT 162
E I+++A+P L+R+ V +GE ++ YRISKSAWL++ PV+ + RR+ +T
Sbjct: 96 EEAQKIRELAEPWLQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPVLVTLDRRIAALT 152
Query: 163 GLTTST--AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
GL AE LQVVNYGIGGHYEPH+D A +S V G
Sbjct: 153 GLDIQPPYAEYLQVVNYGIGGHYEPHFDHAT----------------------LSSVEAG 190
Query: 221 GATVFTSLNLSL----WPEKGTAAFWHN 244
GAT F N S+ WP G + N
Sbjct: 191 GATAFIYGNFSVPVVKWPTSGYTSMDRN 218
>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
Length = 282
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 74/210 (35%), Positives = 107/210 (50%), Gaps = 13/210 (6%)
Query: 59 PPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
P A Q R R VP L + P I LY+ ++ D+E D + ++A+ RL
Sbjct: 65 PDASATQPAPRLARREVPVLFSL--------QSPSIRLYQHLLSDAECDALVELARGRLA 116
Query: 119 RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYG 178
R+ V N TG+ + + R S A + EH +I+RI R+ + G+ E LQ++NY
Sbjct: 117 RSPVINPDTGDENLIDARTSMGAMFQVGEHTLIQRIEDRIAAVLGVPVDHGEGLQILNYK 176
Query: 179 IGGHYEPHYDF---ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
GG Y+PH+DF RPGEA + G R AT++ Y++ GGAT F + L + P
Sbjct: 177 PGGEYQPHFDFFNPKRPGEARQLRV--GGQRTATLVIYLNTPQAGGATAFPRIGLEVAPV 234
Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
KG A ++ L G D T HA PV +G
Sbjct: 235 KGNAVYFSYLQPDGKLDERTLHAGLPVQSG 264
>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
Length = 283
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 96/177 (54%), Gaps = 1/177 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L++ + D E D + + + RL+R+ V N +TGE + + R S+ A + EHP++
Sbjct: 91 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 150
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
RI R+ TG+ E QV++Y GG Y+PH+D+ PG + L G RVAT+
Sbjct: 151 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRGGEARQLEVGGQRVATL 210
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y++ V GGAT F L L + P KG A F+ G D T HA PV G
Sbjct: 211 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGMLDDNTLHAGLPVERGEK 267
>gi|390178051|ref|XP_002137433.2| GA30144 [Drosophila pseudoobscura pseudoobscura]
gi|388859305|gb|EDY67991.2| GA30144 [Drosophila pseudoobscura pseudoobscura]
Length = 546
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 71/210 (33%), Positives = 112/210 (53%), Gaps = 11/210 (5%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C+G +P + + L+C Y +LRL PL+ E P + +Y +V+ +E +
Sbjct: 272 CQGRSRLP--VQSSLRCHYSAEGSAFLRLAPLRMELLSRDPLVAVYHEVVSAAEQRHLML 329
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+++ +L+R Y R SA + P +E++ RR+E +TGL + +E
Sbjct: 330 LSESQLQRQRGHQYD-------KIRTFASASVAANATPTVEQLHRRLEDITGLDLAESEP 382
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
L+++NYGIGG Y H D +P + R+ATVL Y+SDV GG T F +L L
Sbjct: 383 LRILNYGIGGQYYIHVDCEQP--QTHVEPYPKEYRLATVLLYLSDVRLGGFTSFPALGLG 440
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACP 261
+ P +G+A WHN +++G+ DY HAACP
Sbjct: 441 IRPNRGSALVWHNANNAGNCDYRALHAACP 470
>gi|195341582|ref|XP_002037385.1| GM12897 [Drosophila sechellia]
gi|194131501|gb|EDW53544.1| GM12897 [Drosophila sechellia]
Length = 467
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 79/244 (32%), Positives = 120/244 (49%), Gaps = 33/244 (13%)
Query: 25 ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLK 84
+ K P K + +P L E Y+ LCR + P+ +L CRY +L L LK
Sbjct: 244 QFKANPYKAVDRSPKL----GEDYKRLCRSSFSPTPS---KLHCRYNSTTSRFLILASLK 296
Query: 85 EEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLR 144
EE L+P I+ Y D++ D +I + +A+P L+ V + E + ++ R S
Sbjct: 297 MEEISLEPYIVAYHDILPDKDIQQLITLAEPLLKPIEVFDENKNEAKSSD-RTSLGG--- 352
Query: 145 EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTG 204
P+++R++ R+ +TGL + ++ YG G H E G G
Sbjct: 353 ----PLLDRLTERMRDITGLQIPQGNPINIIKYGFGAHSETE---------------GYG 393
Query: 205 NRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDG-DYYTRHAACPVL 263
+R+ATV+FY++D GGATVF LN+ + E+G W+NL+ GD D T HA CPV
Sbjct: 394 DRMATVMFYLNDAPYGGATVFPRLNVKVPAERGKVLLWYNLN--GDSQDVTTVHAVCPVF 451
Query: 264 TGSN 267
GS
Sbjct: 452 HGSK 455
>gi|241778760|ref|XP_002399787.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215508519|gb|EEC17973.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 427
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 78/246 (31%), Positives = 123/246 (50%), Gaps = 28/246 (11%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
P R + +K Y E + P+ E + + Y+ LCRG+ +
Sbjct: 139 PVRDRMKRSKEYKAELFQEDPQ---------------EYQDSQNYKRLCRGEQLRTLKMD 183
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT-- 121
+QL+CRY + +L P+K EE L+P I++ DV+ D +++ + A+PR R
Sbjct: 184 SQLRCRYYKGQDGFFKLQPIKLEEFNLKPYIVVLHDVIQDRDLEDLIAFAKPRARNTIPL 243
Query: 122 VQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST----AEELQVVN 176
+N K L+ ++ S WL E + R++R + + G+ TS AE Q+ N
Sbjct: 244 FRNVKWCTFLKRFCSLLAASTWLFEQNATIASRLNRYLTALLGMGTSDSNFEAEPYQLAN 303
Query: 177 YGIGGHYEPHYD-----FARPGEANAFKSLGT-GNRVATVLFYMSDVAQGGATVFTSLNL 230
YG GGHY PH+D + E + F + G+R+AT++ YMSDV +GGATVF L +
Sbjct: 304 YGTGGHYLPHHDYLYDVYEDSDETDDFSQFPSYGDRLATLMIYMSDVEEGGATVFPKLGV 363
Query: 231 SLWPEK 236
L P+K
Sbjct: 364 RLTPKK 369
>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
Length = 293
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 107/209 (51%), Gaps = 13/209 (6%)
Query: 60 PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
P + R+ R +P L + PRI+L ++++ D+E D + +A+ RL+R
Sbjct: 77 PTVTGGNAFRHKDREMPVLFRLE--------SPRILLLQNLLDDAECDAVVALARDRLQR 128
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
+ V N TG+ + + R S A + EH +++RI R+ +TG E QV+NY
Sbjct: 129 SPVVNPDTGDENLIDARTSMGAMFQVGEHALLQRIEARIAAVTGWPVEHGEGFQVLNYKP 188
Query: 180 GGHYEPHYDF---ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
GG Y+PH+DF RPGEA + G RVAT++ Y++ A GGAT F + L + P K
Sbjct: 189 GGEYQPHFDFFNPKRPGEARQLRV--GGQRVATMVIYLNSPASGGATAFPRIGLEVAPVK 246
Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
G A + G D T HA PV G
Sbjct: 247 GNAVLFSYGLPDGALDERTLHAGLPVEAG 275
>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
Length = 293
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 98/177 (55%), Gaps = 5/177 (2%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+L ++++ D+E D + +A+ RL+R+ V N TG+ + + R S A + EH ++
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQVGEHALL 160
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
+RI R+ +TG E QV+NY GG Y+PH+DF RPGEA + G RVA
Sbjct: 161 QRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRV--GGQRVA 218
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
T++ Y++ A GGAT F + L + P KG A + G D T HA PV G
Sbjct: 219 TMVIYLNSPASGGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAG 275
>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
Length = 498
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/235 (30%), Positives = 120/235 (51%), Gaps = 21/235 (8%)
Query: 38 PTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILY 97
P L++ E + + C P +L C Y + +L L P K E P I+++
Sbjct: 241 PYLDIMEND-FIKFCGSSYMPQPT---RLVCSYKTKPSKFLYLAPFKMELLSEDPYIVVF 296
Query: 98 RDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLR----EPEHP-VIE 152
DV+YDSEI ++ A+P L R+ V+ E ++ R +K A++ PE V++
Sbjct: 297 HDVIYDSEIKHLRNTAEPLLHRSYVKK-SNNESVVSKVRTAKGAFMHADRLSPESAQVVQ 355
Query: 153 RISRRVEHMTGLTTSTA--EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
R+ +R+ ++ L E+Q +NY G HY H D+ ++ +R+AT
Sbjct: 356 RLKQRMGDLSDLNIKREGYNEMQYLNYDFGDHYLLHMDYF---------NISMNDRIATF 406
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
L Y++DV +GG T+F + ++ PEKG W+N++S+ D + + H ACPVL G
Sbjct: 407 LIYLNDVTRGGGTIFPQVKQAVHPEKGKLILWYNMNSNLDYELASLHGACPVLIG 461
>gi|405967005|gb|EKC32220.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 303
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 94/289 (32%), Positives = 137/289 (47%), Gaps = 44/289 (15%)
Query: 1 MIFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNV-APTLEVTEREKYEMLCRGDLTVP 59
M F QR Q + L +EA S D +N+ AP + LCRG
Sbjct: 1 MEFTDFQRFQAHNLVIKEATRSSISQDD----INSFFAPP------NTFMKLCRGPAK-S 49
Query: 60 PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
+ ++L+C +P + KEE PRI L+ DV+ + +I +KK +L
Sbjct: 50 KIVESKLRCYLRKTAIP---IYMAKEEVVNYTPRISLFHDVISNDDIRQLKKAGTKKLTH 106
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHP-VIERISRRVEHMTGLTT------STAEEL 172
+ +TG + R+S++ W+ + P V R++RR+ ++ L T S E
Sbjct: 107 S-----RTGGGYVTRLRVSQTGWVYDQAIPQVSRRLARRIANIVNLDTTFRSKASPVEPW 161
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAF-----------KSLG---TGNRVATVLFYMSDVA 218
QV++Y GG+Y H D P + F ++L TG R+AT +FY+SDV
Sbjct: 162 QVLSYTTGGYYGEHID---PDIGDEFLWNMTEAVQGPRALWRKHTGQRIATWMFYLSDVE 218
Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
GGATVF L + KG AAFW+NL SG D T+HA CPV+ GS
Sbjct: 219 AGGATVFPKLEARVPVVKGAAAFWYNLTPSGKIDRRTQHAGCPVILGSK 267
>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
Length = 283
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 98/175 (56%), Gaps = 3/175 (1%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P ++ V+ E D + +++ RL+ + V + +GE + R SKS R E+ +
Sbjct: 95 KPFVLHLDQVLSSEECDELISLSRSRLQPSLVVDRGSGEERAGSGRTSKSMAFRLKENEL 154
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ERI R+ +TG E LQ++NYG+G Y+PH+DF P A+A K G RV T
Sbjct: 155 VERIETRIAELTGYPAENGEGLQILNYGLGEEYKPHFDFFPPHMADASKG---GQRVGTF 211
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
L Y++DV GG TVF+ LS P+KG A ++H ++ G D + H++ PV G
Sbjct: 212 LIYLNDVEDGGETVFSKAGLSFVPKKGAAIYFHYGNAQGQLDRLSVHSSVPVRKG 266
>gi|156352046|ref|XP_001622583.1| predicted protein [Nematostella vectensis]
gi|156209154|gb|EDO30483.1| predicted protein [Nematostella vectensis]
Length = 497
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 86/242 (35%), Positives = 121/242 (50%), Gaps = 40/242 (16%)
Query: 34 NNVAPTLEVTEREK--------YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKE 85
+N+ + V R+K YE LCRG I QL+C Y + P LRL P K
Sbjct: 262 DNLPSRVNVGNRDKGKEDHAFDYERLCRGQPN-KVRIPKQLRC-YYKSSHPLLRLKPAKI 319
Query: 86 EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE 145
E +I+L RDV+ +S++ IK++A P++ + E R S SAWL +
Sbjct: 320 EVLDPDRQILLLRDVINESQMQFIKELAAPKVSSLHLSPTNRSPSE---RRFSSSAWLGD 376
Query: 146 PEHPVIERISRRVEHMTGL--TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
+ I +SRR+E +T T +AE LQVV++GIGGH+EP Y + NA
Sbjct: 377 ADGAPIAALSRRIEAITDFHVTGDSAESLQVVHFGIGGHFEPRYGY------NA------ 424
Query: 204 GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVL 263
++ V GG+ VF LS+ P+KG+A FW N+ SG T HAACPV+
Sbjct: 425 ----------LNFVDAGGSNVFLDSELSVSPQKGSAVFWLNMRRSGKE---TLHAACPVI 471
Query: 264 TG 265
G
Sbjct: 472 VG 473
>gi|344253558|gb|EGW09662.1| Glucose 1,6-bisphosphate synthase [Cricetulus griseus]
Length = 904
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 83/239 (34%), Positives = 120/239 (50%), Gaps = 40/239 (16%)
Query: 3 FPTHQRAQGNKLYY-----QEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
+P ++R N L Y Q L + E + P V N+ R+ YE LC+ +
Sbjct: 649 YPDNKRMARNVLKYERLLSQNTLQMATETVIQRPNVPNL------QTRDTYEGLCQTLGS 702
Query: 58 VPPAIV-AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
P +L C Y + PYL L P ++E +L+P + LY D + D+E I+++A+P
Sbjct: 703 QPTHYQNPRLYCSYETNSSPYLLLQPARKEVIHLRPFVALYHDFVSDAEAQKIRELAEPW 762
Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
L+R+ V +GE ++ YRISKSAWL++ P++ + R+ +TGL AE LQ
Sbjct: 763 LQRSVV---ASGEKQLPVEYRISKSAWLKDTVDPMLGTLDHRIAALTGLDIQPPYAEYLQ 819
Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
VVNYGIGGHYEPH+D A +S V GGAT F N S+
Sbjct: 820 VVNYGIGGHYEPHFDHAT----------------------LSAVEAGGATAFIYANFSV 856
>gi|260787668|ref|XP_002588874.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
gi|229274045|gb|EEN44885.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
Length = 151
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 66/150 (44%), Positives = 88/150 (58%), Gaps = 15/150 (10%)
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYGIGGHYEPHYDFARPGEANA 197
S WL + EH VI ++SRRVE++TGL + E QV+NYG+GG YEPH D+ R +
Sbjct: 1 SGWLFDTEHTVIAKLSRRVEYITGLDVNWPYGEAFQVLNYGLGGFYEPHVDYFRDEQP-- 58
Query: 198 FKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRH 257
L G R+ T LFY+SDV GGATVFT LNL++ K +A +H+L S + + + H
Sbjct: 59 -ALLTNGQRIVTFLFYLSDVEAGGATVFTRLNLTVPAVKNSAVLFHDLKRSLEFEKDSEH 117
Query: 258 AACPVLTGSNSLHST----------CPCGL 277
A CPVL GS + + PCGL
Sbjct: 118 AGCPVLMGSKWIANKWIHAHGNEFRWPCGL 147
>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
Length = 297
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 100/177 (56%), Gaps = 5/177 (2%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
P++ L++ ++ D E D + +++ RL R+ V N TG+ + + R S A + EHP+I
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHPLI 164
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
RI R+ +TG+ E LQ++NY GG Y+PH+D+ RPGEA S+G G R+A
Sbjct: 165 TRIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQL-SVG-GQRIA 222
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
T++ Y++ GGAT F + L + P KG A ++ L G D T HA PV G
Sbjct: 223 TLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGALDERTLHAGLPVAFG 279
>gi|195575137|ref|XP_002105536.1| GD21536 [Drosophila simulans]
gi|194201463|gb|EDX15039.1| GD21536 [Drosophila simulans]
Length = 465
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 75/222 (33%), Positives = 111/222 (50%), Gaps = 29/222 (13%)
Query: 46 EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
E Y+ LCR + P +L CRY P+L L PLK EE L+P I++Y D++ D +
Sbjct: 261 EDYKRLCRSSFSPTPL---KLHCRYNSTTSPFLILAPLKMEEISLEPYIVMYHDILPDKD 317
Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
I + +A+P L K E+ N +KS+ +++R++ R+ +TGL
Sbjct: 318 IQQLITLAEPLL--------KPTEMFDENKNEAKSSDRPALGGLLLDRLNERMGDITGLQ 369
Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
+ ++ Y G H E G G+R+ TV+FY++D GGATVF
Sbjct: 370 IPQGNPINIIKYAFGAHSETE---------------GYGDRMDTVMFYLNDAPYGGATVF 414
Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGD-GDYYTRHAACPVLTGS 266
LN+ + E+G W+NL +GD D T HAACPV GS
Sbjct: 415 PHLNVKVPAERGKVLLWYNL--NGDTQDVTTVHAACPVFHGS 454
>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
Length = 297
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/177 (36%), Positives = 100/177 (56%), Gaps = 5/177 (2%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
P++ L++ ++ D E D + +++ RL R+ V N TG+ + + R S A + EH +I
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHALI 164
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
RI R+ +TG+ E LQ++NY GG Y+PH+D+ RPGEA S+G G R+A
Sbjct: 165 ARIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQL-SVG-GQRIA 222
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
T++ Y++ GGAT F + L + P KG A ++ L G D T HA PV +G
Sbjct: 223 TLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDERTLHAGLPVASG 279
>gi|195113263|ref|XP_002001187.1| GI10646 [Drosophila mojavensis]
gi|193917781|gb|EDW16648.1| GI10646 [Drosophila mojavensis]
Length = 471
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 96/202 (47%), Gaps = 33/202 (16%)
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
L CRY + P+LR+ PLK EE L P I+LY +Y+SEI+ + K + L
Sbjct: 277 HLHCRYNYWMTPFLRIAPLKLEELSLDPLIVLYHKAIYNSEIETLLKRQEFNLISGKDNM 336
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
+T I RV M+GL +E L V+N GH++
Sbjct: 337 DRT--------------------------IHERVADMSGLNLDRSEVLSVINNDNNGHFQ 370
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
D E +R+ATVLFY+ DV GAT+F LNL++ PEKGTA WHN
Sbjct: 371 LQEDAPETTE-------RPQDRIATVLFYLEDVELVGATIFPRLNLTIKPEKGTALLWHN 423
Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
L S G +AACPV++ S
Sbjct: 424 LESCGSSHPKALYAACPVISSS 445
>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
19424]
gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
taiwanensis LMG 19424]
Length = 296
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/177 (36%), Positives = 99/177 (55%), Gaps = 5/177 (2%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
P++ L++ ++ D E D + +++ RL R+ V N TG+ + + R S A + EH +I
Sbjct: 104 PQVQLFQQLLSDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHALI 163
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
RI R+ +TG+ E LQ++NY GG Y+PH+D+ RPGEA S+G G R+A
Sbjct: 164 ARIEARIAAVTGVPADHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQL-SVG-GQRIA 221
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
T++ Y++ GGAT F + L + P KG A ++ L G D T HA PV G
Sbjct: 222 TLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDDRTLHAGLPVAAG 278
>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 286
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 99/178 (55%), Gaps = 1/178 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P+++++ DV+ +E + + ++ RL+R+T N TG ++ R S+ W R E +
Sbjct: 96 RPQLVVFADVLSAAECAELIERSRHRLKRSTTVNPLTGREDVIRNRTSEGVWYRRGEDQL 155
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
I R+ RR+ +T E LQV++YG G Y PH+DF P + +A + G RVAT
Sbjct: 156 IARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPHFDFFAPDQPGSAVHTTQGGQRVAT 215
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
++ Y++DVA GG TVF + LS+ + G A ++ +++ D T H PVL G
Sbjct: 216 LIIYLNDVADGGETVFPTAGLSVAAQAGGAVYFRYMNAERQLDPSTLHGGAPVLAGDK 273
>gi|312385117|gb|EFR29691.1| hypothetical protein AND_01144 [Anopheles darlingi]
Length = 295
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/216 (34%), Positives = 118/216 (54%), Gaps = 8/216 (3%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C+G P + + L+C Y RN + + P K E +P + L+ DV++DSEI +++
Sbjct: 45 CKGTYQRPVGLTSWLRCWYDARN-DHSVIGPRKVEMLNYEPFVALFYDVIHDSEITRLQE 103
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ ++ V T Y ++ L+ + PV++R+S+R E M+GL+ TAE+
Sbjct: 104 LGDGVIK---VSGATTDGWLPVYYENHQTYTLQNRDDPVVKRLSQRTERMSGLSCDTAED 160
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDV--AQGGATV-FTSL 228
L+V+ Y G Y+ + + A + G R+ATVLF+MSDV A+GG + F L
Sbjct: 161 LKVI-YNEVGAYKSFIVDGKKKSSVAQQFAFAGKRLATVLFFMSDVDGAEGGGRIAFPYL 219
Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLT 264
LS+ P+KG A FW+NLH SG D ++ CP+L
Sbjct: 220 GLSVLPQKGAALFWYNLHDSGRPDERMTYSICPLLA 255
>gi|195069799|ref|XP_001997030.1| GH12979 [Drosophila grimshawi]
gi|193891499|gb|EDV90365.1| GH12979 [Drosophila grimshawi]
Length = 517
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 115/221 (52%), Gaps = 12/221 (5%)
Query: 48 YEMLCRGDLTVPPAIVAQL--KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
Y LC+G +P Q +C Y +L PLK E+ L P I +Y V+ D++
Sbjct: 280 YVRLCQGK-RLPEIKTNQSSPRCYLDSNRHAYFKLSPLKVEQVNLDPDINIYYGVLNDNQ 338
Query: 106 IDLIKKMA-QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
I I +++ + R+T + Y I++ RIS+ WL P++ + V ++G
Sbjct: 339 IKSILRLSDELDSFRSTHRKYV-----ISDMRISQQVWLNYSS-PIMRTYRQLVGAISGF 392
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
+ E +Q+ NYGIGGHYEPH D+ G G+R++T + Y+SDV QGG TV
Sbjct: 393 NMTNVEIMQLANYGIGGHYEPHIDYM--GSPLPPYYAKRGDRISTSMIYLSDVQQGGYTV 450
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
F + N+ + P KG+ W+N S + D+ T HA C V+ G
Sbjct: 451 FPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIEG 491
>gi|195055777|ref|XP_001994789.1| GH14121 [Drosophila grimshawi]
gi|193892552|gb|EDV91418.1| GH14121 [Drosophila grimshawi]
Length = 517
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 115/221 (52%), Gaps = 12/221 (5%)
Query: 48 YEMLCRGDLTVPPAIVAQL--KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
Y LC+G +P Q +C Y +L PLK E+ L P I +Y V+ D++
Sbjct: 280 YVRLCQGK-RLPEIKTNQSSPRCYLDSNRHAYFKLSPLKVEQVNLDPDINIYYGVLNDNQ 338
Query: 106 IDLIKKMA-QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
I I +++ + R+T + Y I++ RIS+ WL P++ + V ++G
Sbjct: 339 IKSILRLSDELDSFRSTHRKYV-----ISDMRISQQVWLNYSS-PIMRTYRQLVGAISGF 392
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
+ E +Q+ NYGIGGHYEPH D+ G G+R++T + Y+SDV QGG TV
Sbjct: 393 NMTNVEIMQLANYGIGGHYEPHIDYM--GSPLPPYYAKRGDRISTSMIYLSDVQQGGYTV 450
Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
F + N+ + P KG+ W+N S + D+ T HA C V+ G
Sbjct: 451 FPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIEG 491
>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
Length = 281
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 61/135 (45%), Positives = 85/135 (62%), Gaps = 8/135 (5%)
Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQVVNYGIGGHYEPHYDFARPG 193
RIS+ AWL + + ++ R+S+R+ +TGL T+ + E LQV+NYG+GG YEPH+D+
Sbjct: 126 RISQQAWLHDKDDEIVARVSKRIGLLTGLNTTPTSTELLQVLNYGLGGQYEPHHDYMTAE 185
Query: 194 EANAFKSLGT--GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDG 251
E K GT GNR+AT L Y+SDV GGATVF N+++ K + +L SG G
Sbjct: 186 E----KMWGTILGNRMATFLMYLSDVTAGGATVFPVANVTVPVVKNAGLLFMDLLRSGRG 241
Query: 252 DYYTRHAACPVLTGS 266
D + HA CPV+ GS
Sbjct: 242 DVNSLHAGCPVVIGS 256
>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 286
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 98/176 (55%), Gaps = 1/176 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
QP + + V+ E D + + A +L+R+T+ + TG+ E R S+ +
Sbjct: 94 QPVLAVLDGVLSHEECDELIRRAAAKLQRSTIVDPTTGKHETIADRSSEGTFFEINADDF 153
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
I R+ RR+ + L E LQ+++YG GG Y+PH+DF PG+ + + T G RV+T
Sbjct: 154 IARLDRRISALMNLPVDHGEGLQILHYGPGGEYKPHFDFFPPGDPGSAVQMATGGQRVST 213
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++ Y+++V GGAT+F L LS+ P+KG+A ++ +S G D T H PVL G
Sbjct: 214 LVMYLNEVEDGGATIFPELGLSVLPKKGSAVYFEYTNSRGQLDPRTLHGGAPVLRG 269
>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
Length = 277
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 97/177 (54%), Gaps = 3/177 (1%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
P ++++ +++ DSE + + ++AQPRL R+ N KTG E R S+ + E+P++
Sbjct: 90 PDLVVFGNLLSDSECEALMEVAQPRLARSLTVNIKTGGEERNRDRTSQGMFFARGENPLV 149
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
+R+ R+ + G E LQV+ Y G Y+PHYD+ P E L G RVAT+
Sbjct: 150 QRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHYDYFDPAEPGTPAILQRGGQRVATL 209
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y+++ QGGATVF + L + P +GTA F+ + + + TRH PV G
Sbjct: 210 IMYLNEPEQGGATVFPDIGLQVTPRRGTAVFFS--YPAANPASLTRHGGEPVKAGEK 264
>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
Length = 280
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 62/176 (35%), Positives = 95/176 (53%), Gaps = 1/176 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI++ +V+ D E D I M++ R R+T + +G + R S+SA ++ E +I
Sbjct: 92 PRIVVLGNVLSDDECDAIAAMSRTRFARSTTIDNASGINRFDDSRTSESAHIQRGETELI 151
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-GTGNRVATV 210
RI R+ ++G E LQ+ Y G Y PH+D+ P A K L +G R+AT+
Sbjct: 152 ARIDARLAALSGWPVDHGEPLQLQKYQAGNEYRPHFDWFDPALAGTAKHLEKSGQRLATI 211
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ Y++DV +GG T F + L + P+KG A F+ N G D T+HA PV G+
Sbjct: 212 ILYLTDVEEGGGTSFPGIGLDVHPQKGGALFFRNTTPYGVPDRKTQHAGLPVEKGT 267
>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
Length = 296
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 102/200 (51%), Gaps = 20/200 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K + +PR LY+ M +E D + KMA+ +L+++ V + ++G+ ++N R S
Sbjct: 39 PTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSSGM 98
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
+L + + VI RI R+ T L E +QV+ Y G YEPHYD+ + +
Sbjct: 99 FLSKGQDEVINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFH----DKYNQA 154
Query: 202 GTGNRVATVLFYMSDVAQGGATVF-----TSLNLSLW-----------PEKGTAAFWHNL 245
G+R+ATVL Y+SDV +GG TVF T++ W P KG A +++L
Sbjct: 155 LGGHRIATVLMYLSDVVKGGETVFPSSEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYSL 214
Query: 246 HSSGDGDYYTRHAACPVLTG 265
H D + H CPV+ G
Sbjct: 215 HPDATPDESSLHGGCPVIEG 234
>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Alteromonas sp. S89]
Length = 294
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 100/192 (52%), Gaps = 6/192 (3%)
Query: 80 LMPLKEEE-----AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIAN 134
++PL +++ A QP I+L+ + + + E D + +M++P L + V N + G E+
Sbjct: 86 VIPLGDQQVEARFAIRQPNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQHGAFELKP 145
Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE 194
R S E P+I I R+ + + + E LQ+++Y + G Y PHYDF P +
Sbjct: 146 SRTSGGTHFARGETPLIADIEARIASLLKVPEAHGEPLQILHYPVSGEYRPHYDFFDPEK 205
Query: 195 ANAFKSLGT-GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
+ L G RV T++ Y+SDV GGATVF + L + P+KG A F+ + G D
Sbjct: 206 PGNQEVLAAGGQRVGTLIMYLSDVESGGATVFPRVGLEVQPQKGAALFFSYVGEHGKLDL 265
Query: 254 YTRHAACPVLTG 265
+ H PVL G
Sbjct: 266 QSLHGGSPVLAG 277
>gi|195166671|ref|XP_002024158.1| GL22696 [Drosophila persimilis]
gi|194107513|gb|EDW29556.1| GL22696 [Drosophila persimilis]
Length = 491
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 105/207 (50%), Gaps = 32/207 (15%)
Query: 60 PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
P V + CRY+ R+ P+L+L P+++E + + LY D+ EI+ +K +A+PRL+R
Sbjct: 295 PRKVNDVHCRYL-RSTPFLQLAPIRQENLDNEAHVYLYHDLFNHEEIEALKSLARPRLKR 353
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
+ + T K A L +I ++RR++ ++G+ + E LQVVNYGI
Sbjct: 354 QKISSNFT----------CKIAQLSNSAQDIIRTVNRRIQDVSGMDMNEKEVLQVVNYGI 403
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
G Y+ + AT L +MS+V QGG TVF L+L + P+KG+
Sbjct: 404 AGRYDLD---------------DSAGSAATALIFMSNVQQGGETVFPFLSLRVKPQKGSL 448
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
W N D+ H +CP++ G+
Sbjct: 449 LLWRN------TDWSVLHNSCPLIIGN 469
>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
Length = 283
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 101/201 (50%), Gaps = 21/201 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K + +PR LY+ M +E D + KMA+ +L+++ V + ++G+ ++N R S
Sbjct: 25 PTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSSGM 84
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
+L + + VI RI R+ T L E +QV+ Y G YEPHYD+ + +
Sbjct: 85 FLSKGQDEVINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFH----DKYNQA 140
Query: 202 GTGNRVATVLFYMSDVAQGGATVF------TSLNLSLW-----------PEKGTAAFWHN 244
G+R+ATVL Y+SD +GG TVF T++ W P KG A +++
Sbjct: 141 LGGHRIATVLMYLSDAVKGGETVFPSSEEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYS 200
Query: 245 LHSSGDGDYYTRHAACPVLTG 265
LH D + H CPV+ G
Sbjct: 201 LHPDATPDESSLHGGCPVIEG 221
>gi|194751827|ref|XP_001958225.1| GF23630 [Drosophila ananassae]
gi|190625507|gb|EDV41031.1| GF23630 [Drosophila ananassae]
Length = 431
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 68/222 (30%), Positives = 110/222 (49%), Gaps = 40/222 (18%)
Query: 48 YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
YE+ CRG + +L C+Y P+L++ PLK+E L P I ++ +V+Y+ E+
Sbjct: 244 YELGCRGLFPLK----NKLFCQYNFHTTPFLKIAPLKQEILSLDPFISMFHEVLYEYELH 299
Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
+K+ + ++ + YK + R S+R+ +TGL S
Sbjct: 300 GLKEDLKNPIKS---KKYKKN---------------------ITNRFSQRLTDITGLHFS 335
Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
+++ + NYG+ E HY++ G V +LF++SD QGGATVF
Sbjct: 336 KRDQINIDNYGLENQAEVHYNYK-----------DIGGPVGAILFFISDDVQGGATVFPK 384
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
L +S++P+KG+ W+N+ G D T H+ CPVL G NSL
Sbjct: 385 LKVSVFPKKGSCLVWYNIKDDGRLDPRTTHSICPVLEG-NSL 425
>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 292
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 97/180 (53%), Gaps = 1/180 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P++I++ DV+ E + + ++ RL+R+T N TG+ ++ R S+ W + E P
Sbjct: 102 RPQVIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQRGEDPF 161
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
IER+ RR+ + E LQ+++YG G Y PH+D+ P + +A + G RVAT
Sbjct: 162 IERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQRVAT 221
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
++ Y++DV GG T+F +S+ +G A ++ ++ D T H PVL G +
Sbjct: 222 LVIYLNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNDRRQLDPLTLHGGAPVLAGDKWI 281
>gi|386766694|ref|NP_651648.5| CG11828 [Drosophila melanogaster]
gi|383293009|gb|AAF56834.5| CG11828 [Drosophila melanogaster]
Length = 458
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 67/216 (31%), Positives = 106/216 (49%), Gaps = 16/216 (7%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG +P + L+CRY+ P+LR+ P+K E+ ++P + L+ D + +E +
Sbjct: 239 CRGKNLLPSK--SYLRCRYLRDGSPFLRMAPVKLEQLNIEPFVGLFHDAISPAEQKDLLH 296
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ RL ++ + ++ +A +H + RI +R+E +TG +E
Sbjct: 297 LTDSRL------EHRKKDSSSVEAKVDTNA----SDH--VRRIHQRIEDITGFDLEESEP 344
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
L V NYGIGG H D +P E + R A+ +FY+SDV GG F L
Sbjct: 345 LTVSNYGIGGQDFIHLDCEQPKEFIGY--YPKEYRSASAMFYLSDVQMGGYASFPDLGFG 402
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
P +G+A WHN +SG+ D + A CPVL G+
Sbjct: 403 FKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQ 438
>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
Length = 286
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 103/176 (58%), Gaps = 1/176 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P I++ + M E + + + ++ +L + + + +TG+ ++ R S+ + + E P+
Sbjct: 95 RPDIVVVDEFMSGEECEQLIEQSRRKLTPSAIVDPQTGKFQVIADRSSEGTYFQRGESPL 154
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEAN-AFKSLGTGNRVAT 209
I R+ RR+ + E +Q+++YG+G Y+PH+D+ E+ A + +G RVAT
Sbjct: 155 ISRLDRRISELMNWPEDHGEGIQILHYGVGAQYKPHFDYFLENESGGALQMTQSGQRVAT 214
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++ Y+++V +GG TVF + +S+ P++G+AA++ +S G D T H PVLTG
Sbjct: 215 LVMYLNEVTEGGETVFPDVGISITPKRGSAAYFAYCNSLGQVDPATLHGGAPVLTG 270
>gi|198466393|ref|XP_001353986.2| GA18007 [Drosophila pseudoobscura pseudoobscura]
gi|198150579|gb|EAL29722.2| GA18007 [Drosophila pseudoobscura pseudoobscura]
Length = 455
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 64/207 (30%), Positives = 105/207 (50%), Gaps = 32/207 (15%)
Query: 60 PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
P V + CRY+ R+ P+L+L P+++E + + LY D+ EI+ +K +A+P+L+R
Sbjct: 259 PRKVNDVHCRYL-RSTPFLQLAPIRQENLDNEAHVYLYHDLFNHEEIEALKSLARPKLKR 317
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
+ + T K A L +I ++RR++ ++G+ + E LQVVNYGI
Sbjct: 318 QKISSNFT----------CKIAQLSNSAQDIIRTVNRRIQDVSGMDMNEKEMLQVVNYGI 367
Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
G Y+ + AT L +MS+V QGG TVF L+L + P+KG+
Sbjct: 368 AGRYDLD---------------DSAGSAATALIFMSNVQQGGETVFPFLSLRVKPQKGSL 412
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
W N D+ H +CP++ G+
Sbjct: 413 LLWRN------TDWSVLHNSCPLIIGN 433
>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
Length = 305
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 99/182 (54%), Gaps = 1/182 (0%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P++I++ DV+ E D + + A+ RL+R+T N ++G ++ R S+ W + E
Sbjct: 113 FERPQVIVFDDVLSRDECDELIERARHRLKRSTTVNPESGREDVIQLRTSEGFWFQRCED 172
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANA-FKSLGTGNRV 207
IER+ RR+ + E LQ+++Y GG Y PH+D+ P ++ + + G RV
Sbjct: 173 AFIERLDRRISALMNWPLEHGEGLQILHYTKGGEYRPHFDYFPPSQSGSVLHTSRGGQRV 232
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
AT++ Y+SDVA GG TVF + L++ +G A ++ L+ D T H PV G
Sbjct: 233 ATLIVYLSDVAGGGETVFPNAGLAVMARQGGAIYFRYLNGHRQLDPLTLHGGAPVTNGEK 292
Query: 268 SL 269
+
Sbjct: 293 WI 294
>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
Length = 150
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 58/126 (46%), Positives = 73/126 (57%), Gaps = 2/126 (1%)
Query: 142 WLREPEHPVIERISRRVEHMTGLTTST-AEELQVVNYGIGGHYEPHYDFARPGE-ANAFK 199
WLR +++SRRV T L AE QV YGIGGHYEPH+DF++ N
Sbjct: 2 WLRSENSASADKLSRRVSSATKLDAEKYAELFQVSTYGIGGHYEPHFDFSKVKYFTNPVL 61
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAA 259
+ G+R+AT + Y++DV GG TVF LNL + P K +A FWHNL G D T H A
Sbjct: 62 NEQMGDRIATFMIYLNDVEAGGRTVFPRLNLVIEPIKNSAVFWHNLLDDGQQDDRTIHGA 121
Query: 260 CPVLTG 265
CPV+ G
Sbjct: 122 CPVVLG 127
>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Collimonas fungivorans Ter331]
gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
[Collimonas fungivorans Ter331]
Length = 289
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 96/176 (54%), Gaps = 1/176 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR IL+ +V+ E D + +++ +L R+ V +++TG ++ +R S + P
Sbjct: 99 KPRAILFGNVLSHDECDQLIALSKTKLLRSGVVDHQTGNTKLHEHRTSSGTFFHRGTTPF 158
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVAT 209
I I +R+ + + S E LQ++NY +GG Y PHYD+ RP + K L G R AT
Sbjct: 159 IAMIDKRLAALMQVPESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLARGGQRTAT 218
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++ Y++DV GG T+F LS+ P KG+A ++ ++ D + H PV+ G
Sbjct: 219 LIIYLNDVDGGGETIFPRNGLSIVPAKGSAIYFSYTNAENQLDSLSFHGGSPVIEG 274
>gi|170591594|ref|XP_001900555.1| prolyl 4-hydroxylase 2 precursor [Brugia malayi]
gi|158592167|gb|EDP30769.1| prolyl 4-hydroxylase 2 precursor, putative [Brugia malayi]
Length = 405
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 59/159 (37%), Positives = 97/159 (61%), Gaps = 7/159 (4%)
Query: 4 PTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVP 59
P H RA+GN +Y++ L + +++ + P +NN P + ++ YE LCR ++ +
Sbjct: 239 PDHPRAKGNVRWYEDLLEDEGIRRADMRRKVPPMNN--PRDKSNLKDTYEALCRQEVPIN 296
Query: 60 PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
++L C Y + PYLRL P K E + P ++L+RD++ D E+ +I+ +A P+L R
Sbjct: 297 TKAQSRLYC-YYKMDRPYLRLAPFKVEIVHQNPLVVLFRDIVSDEEMRIIEMLAVPKLAR 355
Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
ATV N TG +E A YR S+S+WL EH V++RI++R+
Sbjct: 356 ATVHNVVTGNIETAFYRTSQSSWLGSTEHEVVKRINKRL 394
>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
Length = 293
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 74/228 (32%), Positives = 114/228 (50%), Gaps = 14/228 (6%)
Query: 49 EMLCRGDLTVPPAIVAQLKCR---YVHR--NVPYLRLMPLKEEEAYL-----QPRIILYR 98
E L RG+ PPA A LK + YV +P ++P + + + P I +
Sbjct: 47 EALARGEQ--PPA-AAPLKAQATGYVADAPRLPAGNVIPTHDRDVRVLLRVATPTIAVLD 103
Query: 99 DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
V+ D E D + + + +L+R+T + G E+ R S+ + I R+ RR+
Sbjct: 104 QVLDDEECDELIRRSADKLQRSTTVDPVNGGYEVIAARSSEGTFFPVNADDFIARLDRRI 163
Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATVLFYMSDV 217
+ E LQV++YG GG Y+PH+D+ PG+ + + G RV+T+L Y++DV
Sbjct: 164 AELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVGGQRVSTLLIYLNDV 223
Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
AQGGATVF +L L + P KG A ++ + G D T H PV G
Sbjct: 224 AQGGATVFPTLGLRVLPRKGMAVYFEYSNRDGQVDPLTLHGGEPVEKG 271
>gi|198449528|ref|XP_002136919.1| GA26870 [Drosophila pseudoobscura pseudoobscura]
gi|198130648|gb|EDY67477.1| GA26870 [Drosophila pseudoobscura pseudoobscura]
Length = 491
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 76/234 (32%), Positives = 121/234 (51%), Gaps = 22/234 (9%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG+ + L CR+ R Y RL K EE L P I+LY DV+ E++L+K
Sbjct: 279 CRGEYPWK----STLHCRFSWRPSFYARL---KVEEVLLDPYIVLYHDVVSGKEMELLKD 331
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ L T ++G L + + +S P+++ + +R+ MTGL+ + +E
Sbjct: 332 YGRTNL---THDPLRSG-LSAKHCALPESL-------PLVQSLHQRLWDMTGLSLNGSES 380
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
+ NYGIGG H D+ E + L NR+ T+ ++S+V+QGG TVF +L ++
Sbjct: 381 WLITNYGIGGFLGLHKDYFDEIE----EELQGDNRLFTIQIFLSNVSQGGYTVFPNLEVA 436
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQRSG 285
+ P+ GTA ++NL S GD TRH CPV+ G+ + + + L+R G
Sbjct: 437 VKPQAGTALVFYNLLDSLVGDTRTRHFGCPVIDGNKWIATKFLSAKEQTLRRRG 490
>gi|195159168|ref|XP_002020454.1| GL13504 [Drosophila persimilis]
gi|194117223|gb|EDW39266.1| GL13504 [Drosophila persimilis]
Length = 491
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 76/234 (32%), Positives = 120/234 (51%), Gaps = 22/234 (9%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG+ + L CR+ R Y RL K EE L P I+LY DV+ E++L+K
Sbjct: 279 CRGEYPWK----STLHCRFSWRPSFYARL---KVEEVLLDPYIVLYHDVVSGKEMELLKD 331
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ L T ++G L + + +S P+++ + +R+ MTGL+ + +E
Sbjct: 332 YGRTNL---THDPLRSG-LSAKHCALPESL-------PLVQSLHQRLWDMTGLSLNGSES 380
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
+ NYGIGG H D+ E + L NR+ T+ ++S+V+QGG TVF +L ++
Sbjct: 381 WLITNYGIGGFLGLHKDYFDEIE----EELQGDNRLFTIQIFLSNVSQGGYTVFPNLEVA 436
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQRSG 285
+ P+ GTA ++NL S GD TRH CPV+ G + + + L+R G
Sbjct: 437 VKPQAGTALVFYNLLDSLVGDTRTRHFGCPVIDGDKWIATKFLSAKEQTLRRRG 490
>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 215
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 97/175 (55%), Gaps = 10/175 (5%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P I+ + ++ D E + + A PRLR + + N E+ R S+ + E E+P
Sbjct: 29 EPLIMRFERLLTDDECRQLIEAAAPRLRESKLVNKVVSEI-----RTSRGMFFEEEENPF 83
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I RI +R+ + + AE LQV++YG G Y+ HYDF P +A + NR++T+
Sbjct: 84 IHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFFGPNSPSA-----SNNRISTL 138
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ Y++DV GG TVF L+L + PE+G+A ++ + + + T H++ PV+ G
Sbjct: 139 IIYLNDVEAGGETVFPLLDLEVKPERGSALYFEYFYRQQELNNLTLHSSVPVVRG 193
>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
Length = 278
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 97/182 (53%), Gaps = 4/182 (2%)
Query: 84 KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
K ++ +PR +Y + D E D + +A+ L+R+ V + GE ++++ R S ++
Sbjct: 37 KVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFI 96
Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
+ + P++ I ++ T L E+LQV+ Y G Y+ H+D+ + N +
Sbjct: 97 SKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHD-KVNIARG--- 152
Query: 204 GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVL 263
G+R+ATVL Y+S+V +GG TVF + L P+KG A + NL D ++ H CPV+
Sbjct: 153 GHRIATVLLYLSNVTKGGETVFPDAQVCLKPKKGNALLFFNLQQDAIPDPFSLHGGCPVI 212
Query: 264 TG 265
G
Sbjct: 213 EG 214
>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
Length = 302
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 95/178 (53%), Gaps = 1/178 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P +L + E + ++A+PRL R+TV + TG +A +R S + R E P+
Sbjct: 101 RPAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNIVAGHRSSDGMFFRLGETPL 160
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
I RI +R+ +TG E LQ+++Y G PH D+ PG ANA +G RV T
Sbjct: 161 ISRIEQRIAALTGFPVENGEGLQMLHYEAGAESTPHVDYLVPGNPANAESIARSGQRVGT 220
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+L Y++DV GG T+F + S+ P +G A ++ + SG D + HA+ P+ +G
Sbjct: 221 LLMYLNDVESGGETLFPQVGCSVVPRRGQAFYFEYGNGSGRSDPASLHASSPIGSGDK 278
>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 292
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 96/180 (53%), Gaps = 1/180 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P++I++ DV+ E + + ++ RL+R+T N TG+ ++ R S+ W + E P
Sbjct: 102 RPQMIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQRGEDPF 161
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
IER+ RR+ + E LQ++ YG G Y PH+D+ P + + + G RVAT
Sbjct: 162 IERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQGGQRVAT 221
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
++ Y++DV GG T+F +S+ +G A ++ ++ D T H PVL+G +
Sbjct: 222 LVIYLNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNGRRQLDPLTLHGGAPVLSGDKWI 281
>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
gi|194697650|gb|ACF82909.1| unknown [Zea mays]
gi|194708468|gb|ACF88318.1| unknown [Zea mays]
gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
Length = 308
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 102/207 (49%), Gaps = 21/207 (10%)
Query: 76 PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
P + P + +PR+ LY+ + D E + + +A+ L+R+ V + +G+ ++
Sbjct: 42 PAAVVYPHHSRQISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEV 101
Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
R S +LR+ + P++E I ++ T L E++QV+ Y G YEPHYD+
Sbjct: 102 RTSSGTFLRKGQDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYF----T 157
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF-----------------TSLNLSLWPEKGT 238
+ ++ G+R ATVL Y++DV +GG TVF +++ P KG
Sbjct: 158 DNVNTVRGGHRYATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGD 217
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + NL+ G D + H CPV+ G
Sbjct: 218 ALLFFNLNPDGTTDSVSLHGGCPVIKG 244
>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
Length = 224
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 91/179 (50%), Gaps = 7/179 (3%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR++++ ++ + E D + +AQPRL R+ + TG E+ R S + E P+I
Sbjct: 37 PRVVVFGGLLSEQECDELVALAQPRLLRSETVDNSTGGSEVNAARTSDGMFFERGETPLI 96
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
ERI RR+ + E LQV++Y G Y+PH+DF A PG AN + G RV
Sbjct: 97 ERIERRIAELVHWPVERGEGLQVLHYRPGAQYKPHHDFFDPAHPGTANILRR--GGQRVG 154
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
TV+ Y++ A GGAT F + L + P KG A F+ + T H PVL G
Sbjct: 155 TVVIYLNTPAGGGATTFPEVGLEVQPIKGNAVFFS--YERPLASTRTLHGGAPVLDGEK 211
>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
IL144]
gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Rubrivivax gelatinosus IL144]
Length = 279
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/198 (34%), Positives = 97/198 (48%), Gaps = 15/198 (7%)
Query: 71 VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGEL 130
+ R V L +M L PR++++ ++ D E D + +A+PRL R+ + TG
Sbjct: 79 LDREVRVLAVMSL--------PRVVVFGGLLSDEECDELVALARPRLARSETVDNSTGGS 130
Query: 131 EIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF- 189
E+ R S + E P+IERI RR+ + E LQV+ Y G Y+PH+DF
Sbjct: 131 EVNAARTSDGMFFERGEKPLIERIERRIAELVRWPVERGEGLQVLRYRPGAQYKPHHDFF 190
Query: 190 --ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHS 247
A PG AN + G RV TV+ Y++ A GGAT F + L + P KG A F+ +
Sbjct: 191 DPAHPGTANILRR--GGQRVGTVVMYLNTPAGGGATTFPEVGLEVQPVKGNAVFFS--YE 246
Query: 248 SGDGDYYTRHAACPVLTG 265
T H PVL G
Sbjct: 247 RPLASTRTLHGGAPVLDG 264
>gi|390352104|ref|XP_003727818.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Strongylocentrotus purpuratus]
Length = 121
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 50/100 (50%), Positives = 68/100 (68%), Gaps = 5/100 (5%)
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
+ E LQ+ NYG+GGHY PH+DF R + GNR+A++LFY+SDVA+GG TVF
Sbjct: 2 NATEFLQIANYGLGGHYLPHFDFTRDVATHK-----NGNRIASMLFYLSDVAKGGDTVFI 56
Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ PEKG+A FW+NL +G D T+HA+CPV++GS
Sbjct: 57 DAGAKIKPEKGSAIFWYNLFKNGKVDERTKHASCPVISGS 96
>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 318
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 58/153 (37%), Positives = 91/153 (59%), Gaps = 3/153 (1%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRA-TVQNYKTGELEIANYRISKSAWLREPEHPV 150
PRI L+ DV+ D+E D + ++ RL+R+ V N +GE + + R S A+ + E+ +
Sbjct: 126 PRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSGEF-VDDTRTSYGAYFNKGENSL 184
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
+ I RR+ +T + AE LQ++NYG+GG Y PH+D+ P + L + G R+AT
Sbjct: 185 VATIQRRIAELTRWPLTHAEPLQILNYGLGGEYLPHFDYFEPQQPGLPSPLESGGQRIAT 244
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
V+ Y++DV GG T+F LNL P KG A ++
Sbjct: 245 VVMYLNDVEAGGGTIFPHLNLETRPRKGGAIYF 277
>gi|198417608|ref|XP_002125299.1| PREDICTED: similar to prolyl-4-hydroxylase-alpha EFB CG31022-PA
[Ciona intestinalis]
Length = 471
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/178 (36%), Positives = 88/178 (49%), Gaps = 47/178 (26%)
Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE 194
YRIS +AWL + + ++R+S+R+ +TGLT S+ E LQV NYG+ GHY H+D E
Sbjct: 265 YRISNTAWLDDKDSSSVKRLSQRLADVTGLTGSS-ELLQVANYGMAGHYIAHFDAMTREE 323
Query: 195 ANAFKSL----------------------------------------------GTGNRVA 208
+ KSL TG R+A
Sbjct: 324 EDYVKSLSNRQTVLSNITEDDLLDDKSIIGSADNKTVGTTQQPDDRNENYEYGNTGQRIA 383
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
T L Y+S+V +GG+T F N+ P KG+A FW+NL+ SG D T HAACPVL G+
Sbjct: 384 TALVYLSEVQKGGSTAFFYPNIVAEPIKGSAVFWYNLYPSGALDKRTLHAACPVLIGN 441
>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
Length = 295
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 97/180 (53%), Gaps = 1/180 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P++I++ DV+ E + + ++ RL+R+T N +TG+ ++ R S+ W + E
Sbjct: 105 RPQVIVFGDVLSPDECAEMIERSRHRLKRSTTVNPETGKEDVIRNRTSEGIWYQRGEDAF 164
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
IER+ RR+ + E LQ+++YG G Y PH+D+ P + +A + G RVAT
Sbjct: 165 IERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQRVAT 224
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
++ Y++DV GG T+F +S+ +G A ++ ++ D T H PVL G +
Sbjct: 225 LVIYLNDVPDGGETIFPEAGISVAARQGGAVYFRYMNGQRQLDPLTLHGGAPVLGGDKWI 284
>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
Length = 300
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 99/180 (55%), Gaps = 1/180 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P++I++ +V+ E D + + ++ RL+R+T+ + TG+ + R S+ W + E
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQRGEDAF 169
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
IER+ RR+ + E LQ+++YG G Y PH+D+ P + +A + G RVAT
Sbjct: 170 IERLDRRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQRVAT 229
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
++ Y++DVA GG T+F + LS+ ++G A ++ ++ D T H PV G +
Sbjct: 230 LVVYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVRAGDKWI 289
>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
Length = 307
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 60/193 (31%), Positives = 100/193 (51%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
QPRI +Y+ + D+E D + +A+ +++R+ V + ++G+ ++ R S +L + + PV
Sbjct: 50 QPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNQSGKSVMSEVRTSSGMFLNKRQDPV 109
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI R+ T L AE +Q++ Y G YEPH+D+ + + G+R ATV
Sbjct: 110 VSRIEERIAAWTFLPQENAENMQILRYEHGQKYEPHFDYFH----DKINQVRGGHRYATV 165
Query: 211 LFYMSDVAQGGATVFTSLN------------------LSLWPEKGTAAFWHNLHSSGDGD 252
L Y+S V +GG TVF + L++ P KG A + +LH G D
Sbjct: 166 LMYLSTVDKGGETVFPNAKGWESQPKDDTFSECAHQGLAVKPVKGDAVLFFSLHVDGVPD 225
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 226 PLSLHGSCPVIQG 238
>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
Length = 256
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 102/206 (49%), Gaps = 23/206 (11%)
Query: 81 MPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKS 140
MP+ E QPR ++ + + E D + ++AQP ++R+ V + +TG+ + + R S
Sbjct: 42 MPVWTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSG 101
Query: 141 AWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKS 200
+LR + +I RI R+ T + E LQV++Y +G Y+ H+D+ + +
Sbjct: 102 TFLRRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFH----DKVNT 157
Query: 201 LGTGNRVATVLFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAF 241
G RVATVL Y+SDV +GG TVF S +S+ P KG A
Sbjct: 158 KNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGVSVKPRKGDALL 217
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGSN 267
+ ++ + D ++ H CPV+ G+
Sbjct: 218 FWSMSPDAELDPFSLHGGCPVIKGNK 243
>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
Length = 487
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 102/193 (52%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR+ +Y+ + D E D + K+ + +++R+ V + K+G+ ++ R S +L + + PV
Sbjct: 63 RPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLDKRQDPV 122
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI +R+ T L AE +Q++ Y G YEPH+D+ ++LG G+R ATV
Sbjct: 123 VSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHD---KVNQALG-GHRYATV 178
Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
L Y+S V +GG TVF + L++ P KG A + +LH G D
Sbjct: 179 LMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDAVLFFSLHIDGVPD 238
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 239 PLSLHGSCPVIEG 251
>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
Length = 216
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 54/176 (30%), Positives = 95/176 (53%), Gaps = 9/176 (5%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P I++ +V+ D E D + + ++ R++R+ V N LE+ R S S + E E+ +
Sbjct: 37 EPLIVILGNVLSDEECDQLIQQSKDRMQRSKVAN----SLEVDELRTSSSTFFHEGENEI 92
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI +R+ + + E LQ++NY IG Y+ H+DF A + R++T+
Sbjct: 93 VARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFFSSTSRAA-----SNPRISTL 147
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ Y++DV QGG T F LN S+ P+KG A ++ ++ + + T H PV+ G
Sbjct: 148 VMYLNDVEQGGETYFPKLNFSVSPQKGMAVYFEYFYNDQNLNDLTLHGGAPVVMGD 203
>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
Length = 256
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 102/206 (49%), Gaps = 23/206 (11%)
Query: 81 MPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKS 140
MP+ E QPR ++ + + E D + ++AQP ++R+ V + +TG+ + + R S
Sbjct: 42 MPVWTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSG 101
Query: 141 AWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKS 200
+LR + +I RI R+ T + E LQV++Y +G Y+ H+D+ + +
Sbjct: 102 TFLRRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFH----DKVNT 157
Query: 201 LGTGNRVATVLFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAF 241
G RVATVL Y+SDV +GG TVF S +S+ P KG A
Sbjct: 158 KNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGVSVKPRKGDALL 217
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGSN 267
+ ++ + D ++ H CPV+ G+
Sbjct: 218 FWSMSPDAELDPFSLHGGCPVIKGNK 243
>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
Length = 319
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 103/207 (49%), Gaps = 25/207 (12%)
Query: 80 LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
+ P + +PR+ LY+ + D E + + +A+ L+R+ V + +G+ E+++ R S
Sbjct: 53 VYPHHSRQISWKPRVFLYQHFLSDDEANHLVSLARAELKRSAVADNLSGKSELSDARTSS 112
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
++R+ + P++ I ++ T L E++QV+ Y G YE HYD+ ++
Sbjct: 113 GTFIRKSQDPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYF----SDNVN 168
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGT 238
+L G+R+ATVL Y++DVA+GG TVF +++ P KG
Sbjct: 169 TLRGGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDSTLSECAKKGVAVKPRKGD 228
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + NL D + HA CPV+ G
Sbjct: 229 ALLFFNLSPDASKDSLSLHAGCPVIKG 255
>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 280
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 100/201 (49%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P L +++V + KTG+ + R S +L+ + +
Sbjct: 75 EPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKSTESRVRTSSGMFLKRGKDKI 134
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I+ I RR+ T + E LQV++YG+G YEPHYD+ + F + G RVATV
Sbjct: 135 IQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYF----LDEFNTKNGGQRVATV 190
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LSL P+ G A + ++
Sbjct: 191 LMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMRPDATL 250
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 251 DASSLHGGCPVIVG-NKWSST 270
>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
Group]
gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 103/207 (49%), Gaps = 25/207 (12%)
Query: 80 LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
+ P + +PR+ LY+ + D E + + +A+ L+R+ V + +G+ E+++ R S
Sbjct: 53 VYPHHSRQISWKPRVFLYQHFLSDDEANHLVSLARTELKRSAVADNLSGKSELSDARTSS 112
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
++R+ + P++ I ++ T L E++QV+ Y G YE HYD+ ++
Sbjct: 113 GTFIRKSQDPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYF----SDNVN 168
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGT 238
+L G+R+ATVL Y++DVA+GG TVF +++ P KG
Sbjct: 169 TLRGGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDSTLSECAKKGVAVKPRKGD 228
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + NL D + HA CPV+ G
Sbjct: 229 ALLFFNLSPDASKDSLSLHAGCPVIKG 255
>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
Length = 308
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 102/207 (49%), Gaps = 21/207 (10%)
Query: 76 PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
P + P + +PR+ LY+ + D E + + +A+ L+R+ V + +G+ +++
Sbjct: 42 PAAVVYPHHSRQISWKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSDV 101
Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
R S +LR+ + P++E I ++ T L E++QV+ Y G YEPHYD+
Sbjct: 102 RTSSGTFLRKGQDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYF----T 157
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF-----------------TSLNLSLWPEKGT 238
+ ++ G+R ATVL Y++DVA+GG TVF +++ P KG
Sbjct: 158 DNVNTIRGGHRYATVLLYLTDVAEGGETVFPLAEEVDDAKDATFSECAQKGIAVKPRKGD 217
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + NL G D + H C V+ G
Sbjct: 218 ALLFFNLKPDGTTDPVSLHGGCAVIRG 244
>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 68/205 (33%), Positives = 103/205 (50%), Gaps = 24/205 (11%)
Query: 79 RLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRIS 138
R++ L E +P+ LYR + +E D IK+ A+P+L ++TV + KTG+ +N R S
Sbjct: 4 RIVKLSE-----KPKAYLYRGFLRQAECDYIKERAKPKLEKSTVVDNKTGQSVPSNIRTS 58
Query: 139 KSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAF 198
+ E +IE I RR+ T + E +QV+ Y +G YEPH D A + N
Sbjct: 59 DGMFFDRHEDDIIEDIERRIAEWTNVPWENGEGIQVLRYEVGQKYEPHLD-AFSDKFNTE 117
Query: 199 KSLGTGNRVATVLFYMSDVAQGGATVF-----------------TSLNLSLWPEKGTAAF 241
+S G G R+ATVL Y+SDV +GG TVF +++ KG A
Sbjct: 118 ESKG-GQRMATVLMYLSDVEEGGETVFPRSVDKPHKGDPKWSECAQRGVAVKARKGDALL 176
Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
+ +L + D + H CPV+ G+
Sbjct: 177 FWSLDIDSNVDELSLHGGCPVIKGT 201
>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
Length = 344
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 95/192 (49%), Gaps = 25/192 (13%)
Query: 93 RIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIE 152
RI LY + + D E D I K+A+P + R+ V +G+ +I N R SK +L VI
Sbjct: 71 RIFLYHNFLTDEECDHIIKLAEPTMARSGVVETDSGKSKIDNVRTSKGTFLNRGHDSVIA 130
Query: 153 RISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEANAFKSLGTGNRVATV 210
I R+ T + E LQV+ Y G YE HYD F + G AN GNR TV
Sbjct: 131 DIEARIAKWTLMPAGNGEGLQVLKYEHGQEYEGHYDYFFHKAGTANG------GNRYLTV 184
Query: 211 LFYMSDVAQGGATVFTSLN-----------------LSLWPEKGTAAFWHNLHSSGDGDY 253
L Y++DV +GG T F ++ L+ P+KG A +H++ +G+ +
Sbjct: 185 LMYLNDVEEGGETCFPNIPSPNGDNGPEFSECARKVLAAKPKKGNAVLFHSIKPTGELER 244
Query: 254 YTRHAACPVLTG 265
+ H ACPV+ G
Sbjct: 245 RSLHTACPVIKG 256
>gi|241710333|ref|XP_002412045.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215505100|gb|EEC14594.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 440
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/210 (33%), Positives = 109/210 (51%), Gaps = 18/210 (8%)
Query: 21 NKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRL 80
N S E K + P + + EV E + Y+ LCRG+L P + +QL+CRY + L
Sbjct: 235 NISNEPKHKVPVRDPTKHSAEVIEHQNYKRLCRGELLRSPKMDSQLRCRYYKGQDGFFTL 294
Query: 81 MPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYR---- 136
P+K EE L+P II+ +V+ D +I + A+PR R+ + L+ N +
Sbjct: 295 QPVKLEEVNLKPYIIVMHNVVQDRDIKDMIDFAEPRARKTPALYF----LKKGNTKTHIL 350
Query: 137 --ISKSAWLREPEHPVIERISRRVEHMTGLTTS----TAEELQVVNYGIGGHYEPHYDFA 190
I + AWL E P+ R++R + + G++ S AE Q+ NYGIGG Y PH D+
Sbjct: 351 LPIYQRAWLGEDSAPIANRMNRYLRALVGMSASGSNLDAEPYQLANYGIGGQYLPHNDYL 410
Query: 191 RPG-EANA---FKSLGTGNRVATVLFYMSD 216
+ AN + G+RVAT++ Y+S+
Sbjct: 411 QDALHANTSEYYVHHKAGDRVATLMIYVSE 440
>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 328
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/198 (33%), Positives = 96/198 (48%), Gaps = 20/198 (10%)
Query: 86 EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE 145
E +P +YR + E D +K +A P L R+TV + G ++ R S +L
Sbjct: 57 ERVSWRPHAEVYRGFLTREECDHLKALATPSLGRSTVVDASNGGSVPSDIRTSSGMFLLR 116
Query: 146 PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
E V+ I RR+ T + S E QV+ Y G Y PH+D+ + E N + G G
Sbjct: 117 GEDDVVASIERRIASWTHVPESHGEGFQVLRYEFGQEYRPHFDYFQD-EFNQKREKG-GQ 174
Query: 206 RVATVLFYMSDVAQGGATVF------------------TSLNLSLWPEKGTAAFWHNLHS 247
RVATVL Y++DV +GG T+F + L++ P KG A F+ +LH
Sbjct: 175 RVATVLMYLTDVEEGGETIFPDAEAGANPGGGDDASSCAAGKLAVKPRKGDALFFRSLHH 234
Query: 248 SGDGDYYTRHAACPVLTG 265
+G D + HA CPV+ G
Sbjct: 235 NGTSDAMSSHAGCPVVKG 252
>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
Length = 285
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 93/176 (52%), Gaps = 1/176 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P+I+++ +V+ E D + + + +L ++T N +TG E+ +R S W + E +
Sbjct: 95 RPQIVVFGNVLDQDECDEMIQRSMHKLEQSTTVNAETGTQEVIRHRTSHGTWFQNGEDAL 154
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
I RI R+ + E LQV+ Y GG Y HYD+ +P A + + T G RVAT
Sbjct: 155 IRRIETRLAALMNCPVENGEGLQVLRYTPGGEYRSHYDYFQPTAAGSLTHVRTGGQRVAT 214
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++ Y++DV GG TVF +S+ P +G A ++ ++ D T HA PV G
Sbjct: 215 LIVYLNDVPSGGETVFPEAGISVVPRRGDAVYFRYMNRLRQLDPATLHAGAPVRDG 270
>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
Length = 297
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 94/176 (53%), Gaps = 1/176 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P +L + + SE D + +A+PRL R+TV + TG A +R S + R E P+
Sbjct: 101 RPAAVLLDEFLTGSECDQLIALARPRLSRSTVVDPVTGRDVAAGHRSSDGTFFRLAETPL 160
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVAT 209
+ R+ R+ +TGL E LQ++ Y G PH D+ G +S+ +G RV T
Sbjct: 161 VARLEMRIAALTGLAAENGEGLQLLRYQPGAESTPHVDYLVAGNETNRESIARSGQRVGT 220
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+L Y++DV GG TVF + S+ P +G A ++ + +G D + HA+ P+ +G
Sbjct: 221 LLMYLNDVEGGGETVFPQVGCSVVPRRGQALYFEYCNRAGVCDPASLHASTPLRSG 276
>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
Length = 286
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 105/195 (53%), Gaps = 4/195 (2%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
QP I L DV+ D+E D + ++ + ++R++V + +G+ R S+ A++ +
Sbjct: 93 QPVIALVADVLDDTECDRLIEIGREHVQRSSVVDPDSGKEITIEERRSEGAFVNASTDAL 152
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
+E I RR+ + E+L ++ YG+GG Y PHYD+ +A + + G R+AT
Sbjct: 153 VETIDRRIAELFRQPVENGEDLHILRYGMGGEYRPHYDYFPEEQAGSKHHMQRGGQRIAT 212
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
V+ Y+++V QGG T F + L++ P +G+A ++ ++ G D T HA PV G +
Sbjct: 213 VILYLNEVEQGGDTTFPDIGLAIHPRRGSALYFEYVNELGQSDPKTLHAGTPVEKGEKWI 272
Query: 270 HSTCPCGLRRGLQRS 284
+ +RRG R+
Sbjct: 273 ATKW---IRRGRFRA 284
>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
Length = 300
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 99/180 (55%), Gaps = 1/180 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P++I++ +V+ E D + + ++ RL+R+T+ + TG+ + R S+ W + E
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQRGEDAF 169
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
IER+ +R+ + E LQ+++YG G Y PH+D+ P + +A + G RVAT
Sbjct: 170 IERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQRVAT 229
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
++ Y++DVA GG T+F + LS+ ++G A ++ ++ D T H PV G +
Sbjct: 230 LVVYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVHAGDKWI 289
>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
Length = 487
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/193 (31%), Positives = 101/193 (52%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR+ +Y+ + D E D + K+ + +++R+ V + K+G+ ++ R S +L + + PV
Sbjct: 63 RPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLDKRQDPV 122
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI +R+ T L AE +Q++ Y G YEPH+D+ ++LG G+R ATV
Sbjct: 123 VSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHD---KVNQALG-GHRYATV 178
Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
L Y+S V +GG TVF + L++ P KG + +LH G D
Sbjct: 179 LMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDTVLFFSLHIDGVPD 238
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 239 PLSLHGSCPVIEG 251
>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
Length = 272
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 89/178 (50%), Gaps = 1/178 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
QP+IIL +V+ D E D I R R+TV G + R S+ A+++ E V
Sbjct: 82 QPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHEGRTSEMAFIQRGEAEV 141
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVAT 209
ERI RR+ + +E Q+ Y Y PHYD+ P + L G R+AT
Sbjct: 142 AERIERRLAALAHWPAECSEPFQLQKYDATQEYRPHYDWLDPDSSGHRSHLARGGQRLAT 201
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y+SDV QGG TVF L L ++P+KG+A ++ N + D T H PV+ G+
Sbjct: 202 FILYLSDVEQGGGTVFPGLGLEVYPKKGSALWFLNTDINHQPDKRTLHGGAPVVRGTK 259
>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
Length = 279
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 92/181 (50%), Gaps = 1/181 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
P +++ + + E + +A+ ++ ATV + TGE R S +A EHP+I
Sbjct: 91 PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQDRTSMNAAFARAEHPLI 150
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTG-NRVATV 210
R+ R+ E +QV+ Y GG Y+ H+D+ K++ TG RV T
Sbjct: 151 ARLEARIAAAIHWPAENGEGMQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQTGGQRVGTF 210
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLH 270
L Y+ DV GGAT F +LN + P+KG A F+ N +G+G+ T HA PV++G L
Sbjct: 211 LVYLCDVDAGGATRFPALNFEIRPKKGMALFFANTLPNGEGNPLTLHAGVPVVSGVKYLA 270
Query: 271 S 271
S
Sbjct: 271 S 271
>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
Length = 309
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 93/183 (50%), Gaps = 7/183 (3%)
Query: 88 AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
A QPR++L+ +++ E D I A+PR+ R+ +TG E+ + R S + + E
Sbjct: 118 AMAQPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDDRTSNGMFFQREE 177
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNR 206
+PV+ R+ R+ + E LQV++Y G Y+PHYD+ P E L G R
Sbjct: 178 NPVVARLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILRRGGQR 237
Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPVLT 264
VAT++ Y++D +GG T F ++L + P +G A F + H S T H PV+
Sbjct: 238 VATIVIYLNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERPHPS----TRTLHGGAPVVA 293
Query: 265 GSN 267
G
Sbjct: 294 GDK 296
>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 60/193 (31%), Positives = 101/193 (52%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR+ +Y+ + D E D + K+ + +++R+ V + K+G+ ++ R S +L + + PV
Sbjct: 63 RPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLDKRQDPV 122
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI +R+ T L AE +Q++ Y G YEPH+D+ ++LG G+R ATV
Sbjct: 123 VSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHD---KVNQALG-GHRYATV 178
Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
L Y+S V +GG TVF + L++ P KG + +LH G D
Sbjct: 179 LMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDTVLFFSLHIDGVPD 238
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 239 PLSLHGSCPVIEG 251
>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
Length = 313
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 60/193 (31%), Positives = 101/193 (52%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR+ +Y+ + D E D + K+ + +++R+ V + K+G+ ++ R S +L + + PV
Sbjct: 57 RPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLDKRQDPV 116
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI +R+ T L AE +Q++ Y G YEPH+D+ ++LG G+R ATV
Sbjct: 117 VSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHD---KVNQALG-GHRYATV 172
Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
L Y+S V +GG TVF + L++ P KG + +LH G D
Sbjct: 173 LMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDTVLFFSLHIDGVPD 232
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 233 PLSLHGSCPVIEG 245
>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
Length = 299
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 97/193 (50%), Gaps = 23/193 (11%)
Query: 92 PRIILYRDVMYDSEID-LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
PR+ LY + D+E + LI Q R+ R+TV N K+GE ++ R S +L + V
Sbjct: 44 PRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFLIRKQDEV 103
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI R+ T E +Q++ YG G YEPH+D+ R +A+A G+R+ATV
Sbjct: 104 VARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARG----GHRIATV 159
Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
L Y+S+V GG TVF L W P KG+A + +L+ + D
Sbjct: 160 LMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPNATFD 219
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 220 PGSLHGSCPVIQG 232
>gi|341893180|gb|EGT49115.1| CBN-PHY-4 protein [Caenorhabditis brenneri]
Length = 282
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 78/260 (30%), Positives = 123/260 (47%), Gaps = 32/260 (12%)
Query: 41 EVTE-REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRD 99
E+ E EK C +L A QL C + +++ + ++ L LQP I+ Y +
Sbjct: 21 EIIEFNEKMWEKCGKELRGNSATNPQLVCFQIKKHLLFRKMEILS-----LQPFIVQYHN 75
Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNY-------KTGELEIANYRISKSAWLREPEHPVIE 152
+++ +++A+ +R + V KT E + R + WL
Sbjct: 76 LVH-------RRLAKRAVRESEVLQLEQLKISGKTETPEKSQVRAANGTWLMHTNRLNFA 128
Query: 153 RISRRVE-HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVL 211
RI R ++ ++ L +TAE Q+++Y G+Y PHYDF P E N GNR+ATVL
Sbjct: 129 RIFRNLQLNIDALDLTTAEPWQILSYNSDGYYAPHYDFLNP-ETNRQLVDSRGNRIATVL 187
Query: 212 FYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN---- 267
+ +GG TVF +NL++ P+ G W N SSG+ D T HAACP+ G+
Sbjct: 188 VILQIAKKGGTTVFPKINLNIRPKAGDVIVWLNTLSSGESDPQTLHAACPIKEGNKIGAT 247
Query: 268 -SLHS-----TCPCGLRRGL 281
+HS + PC L+ +
Sbjct: 248 LWVHSKGQELSLPCSLQENV 267
>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
Length = 313
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 53/178 (29%), Positives = 98/178 (55%), Gaps = 1/178 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P++I++ +V+ E + + ++ RL+R+T+ + TG ++ R S+ W + E +
Sbjct: 123 RPQVIVFGNVLSPDECAEMIERSRHRLKRSTIVDPATGREDVIRNRTSEGIWYQRGEDAL 182
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
IER+ +R+ + E LQ+++YG G Y PH+D+ P + +A + G RVAT
Sbjct: 183 IERLDQRIASLMNWPLENGEGLQILHYGPSGEYRPHFDYFPPDQPGSAVHTARGGQRVAT 242
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
++ Y++DV GG T+F LS+ ++G A ++ ++ D T H PVL+G
Sbjct: 243 LVVYLNDVPDGGETIFPEAGLSVAAQQGGAVYFRYMNGRRQLDPLTLHGGAPVLSGDK 300
>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 217
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 53/177 (29%), Positives = 96/177 (54%), Gaps = 12/177 (6%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P I++ +V+ D E D + ++++ R+ R+ + N + N R S S ++ E E+ +
Sbjct: 38 EPLIVVLGNVLSDEECDELIRLSKDRINRSKIAN-----ANVDNMRTSSSTFIEENENII 92
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVAT 209
+ RI +R+ + + T E LQ++NY +G Y+ H+D F+ P A R++T
Sbjct: 93 VSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFSSPHNA------INNPRIST 146
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
++ Y+SDV QGG T F L+ S+ P+KG A ++ ++ + T H PV+ G
Sbjct: 147 LVMYLSDVEQGGETYFPKLHFSVSPQKGMAVYFEYFYNDQTLNELTLHGGAPVIVGD 203
>gi|260806885|ref|XP_002598314.1| hypothetical protein BRAFLDRAFT_204780 [Branchiostoma floridae]
gi|229283586|gb|EEN54326.1| hypothetical protein BRAFLDRAFT_204780 [Branchiostoma floridae]
Length = 282
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 84/150 (56%), Gaps = 15/150 (10%)
Query: 142 WLREPEHPVIERISRRVEHMTGLTTS--TAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
W+ + E V+ ++SR V H+TGL T+ T + QV+NYG+GG YEPHYD + +
Sbjct: 134 WVPDTEDLVVAKLSRMVAHITGLNTTFPTGDNFQVLNYGLGGQYEPHYDHLK---EEVSR 190
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAA 259
+L NR+ T LFY+S+V GGATVFT N+++ K +A + N + + + HA
Sbjct: 191 TLMAANRILTFLFYLSEVEAGGATVFTEANIAVPVVKNSAVLFENTNKALVRSRASVHAG 250
Query: 260 CPVLTGSNSLHSTC----------PCGLRR 279
CPVL GS + + PCGL +
Sbjct: 251 CPVLIGSKWVANKWIHEVGNELQRPCGLTQ 280
>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 274
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 60/193 (31%), Positives = 98/193 (50%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
PRI +Y+ + D+E D + +A+ +++R+ V + ++G+ + R S +L + + PV
Sbjct: 51 HPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSEVRTSSGMFLDKRQDPV 110
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI R+ T L AE +QV+ Y G YEPH+D+ + G+R ATV
Sbjct: 111 VSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFH----DRVNQARGGHRYATV 166
Query: 211 LFYMSDVAQGGATVFTSLN------------------LSLWPEKGTAAFWHNLHSSGDGD 252
L Y+S V +GG TVF + L++ P KG A + +LH+ G D
Sbjct: 167 LMYLSTVREGGETVFPNAKGWESQPKDATFSECAHKGLAVKPVKGDAVLFFSLHADGTPD 226
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 227 PLSLHGSCPVIRG 239
>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
gi|255641119|gb|ACU20838.1| unknown [Glycine max]
Length = 297
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 105/207 (50%), Gaps = 25/207 (12%)
Query: 80 LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
+ P K ++ +PR +Y + D E D + +A+ L+R+ V + +GE ++++ R S
Sbjct: 31 INPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSS 90
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
++ + + P++ I ++ T L E++QV Y G Y+PHYD+ +
Sbjct: 91 GMFISKNKDPIVAGIEDKISSWTFLPKENGEDIQVSRYEHGQKYDPHYDYF----TDKVN 146
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGT 238
G+R+ATVL Y++DVA+GG TVF TS +LS + P +G
Sbjct: 147 IARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAVKPRRGD 206
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + +LH++ D + HA CPV+ G
Sbjct: 207 ALLFFSLHTNATPDTSSLHAGCPVIEG 233
>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
Length = 285
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 55/182 (30%), Positives = 94/182 (51%), Gaps = 1/182 (0%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P++I + DV+ E + + A+ RL+R+T N + G ++ R S+ W + E
Sbjct: 93 FERPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQRCED 152
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRV 207
IER+ R+ + E LQ+++Y GG Y PH+D+ PG+ + + G RV
Sbjct: 153 AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRV 212
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
AT++ Y+SDV GG TVF L++ +G A ++ ++ D T H PV +G
Sbjct: 213 ATLIVYLSDVEGGGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDK 272
Query: 268 SL 269
+
Sbjct: 273 WI 274
>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
Length = 307
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 97/180 (53%), Gaps = 1/180 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P++I++ +V+ E D + + ++ RL+R+T+ + TG+ ++ R S+ W + E
Sbjct: 117 RPQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEDVIRNRTSEGIWYQRGEDAF 176
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAF-KSLGTGNRVAT 209
IER+ +R+ + E LQ+++YG G Y PH+D+ P + + + G RVAT
Sbjct: 177 IERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSMVHTARGGQRVAT 236
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
++ Y++DV GG T+F LS+ ++G A ++ ++ D T H PV G +
Sbjct: 237 LVIYLNDVPDGGETIFPEAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVRAGDKWI 296
>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
Length = 282
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 55/182 (30%), Positives = 94/182 (51%), Gaps = 1/182 (0%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P++I + DV+ E + + A+ RL+R+T N + G ++ R S+ W + E
Sbjct: 90 FERPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQRCED 149
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRV 207
IER+ R+ + E LQ+++Y GG Y PH+D+ PG+ + + G RV
Sbjct: 150 AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRV 209
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
AT++ Y+SDV GG TVF L++ +G A ++ ++ D T H PV +G
Sbjct: 210 ATLIVYLSDVEGGGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDK 269
Query: 268 SL 269
+
Sbjct: 270 WI 271
>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
Length = 299
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 96/193 (49%), Gaps = 23/193 (11%)
Query: 92 PRIILYRDVMYDSEID-LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
PR+ LY + D E + LI Q R+ R+TV N K+GE ++ R S +L + V
Sbjct: 44 PRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFLIRKQDEV 103
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI R+ T E +Q++ YG G YEPH+D+ R +A+A G+R+ATV
Sbjct: 104 VARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARG----GHRIATV 159
Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
L Y+S+V GG TVF L W P KG+A + +L+ + D
Sbjct: 160 LMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPNATFD 219
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 220 PGSLHGSCPVIQG 232
>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 291
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 97/195 (49%), Gaps = 23/195 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR ++Y + + + E + + +A+P + ++TV + KTG + + R S +LR V
Sbjct: 86 EPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLRRGHDEV 145
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+E I +R+ T + E LQV++Y +G YEPHYD+ + F + G R+ATV
Sbjct: 146 VEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYF----LDEFNTKNGGQRIATV 201
Query: 211 LFYMSDVAQGGATVFTSL-------------------NLSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV GG TVF + LS+ P+K A + N+
Sbjct: 202 LMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASL 261
Query: 252 DYYTRHAACPVLTGS 266
D + H CPV+ G+
Sbjct: 262 DPSSLHGGCPVVKGN 276
>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 263
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 101/201 (50%), Gaps = 21/201 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + ++ +PR LY + + D+E D + +A+ +L ++ V + ++G+ + R S
Sbjct: 3 PTRVKQLSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSEIRTSSGM 62
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
+L + + +I RI R+ T L E +QV+ Y G YEPH+D+ A L
Sbjct: 63 FLMKGQDDIISRIEDRIAAWTFLPKENGEAIQVLRYQDGEKYEPHFDYFHDKNNQA---L 119
Query: 202 GTGNRVATVLFYMSDVAQGGATVFTS-----------------LNLSLWPEKGTAAFWHN 244
G G+R+ATVL Y+SDV +GG TVF S +++ P KG A + +
Sbjct: 120 G-GHRIATVLMYLSDVVKGGETVFPSSEDRGGPKDDSWSACGKTGVAVKPRKGDALLFFS 178
Query: 245 LHSSGDGDYYTRHAACPVLTG 265
LH S D + H CPV+ G
Sbjct: 179 LHPSAVPDESSLHTGCPVIEG 199
>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 306
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 96/194 (49%), Gaps = 23/194 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR LY + + E + + +A+P ++++TV + TG + + R S +LR + V
Sbjct: 101 EPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGTFLRRGQDKV 160
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y +G YEPH+D+ + F + G R+AT+
Sbjct: 161 IRTIEKRISDFTFIPAENGEGLQVLHYEVGQKYEPHFDYFH----DDFNTKNGGQRIATL 216
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF S +S+ P+ G A + ++ G
Sbjct: 217 LMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALLFWSMRPDGTL 276
Query: 252 DYYTRHAACPVLTG 265
D + H CPV+ G
Sbjct: 277 DPTSLHGGCPVIKG 290
>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
Arabidopsis thaliana
gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
thaliana]
gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 291
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 97/195 (49%), Gaps = 23/195 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR ++Y + + + E + + +A+P + ++TV + KTG + + R S +LR V
Sbjct: 86 EPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLRRGHDEV 145
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+E I +R+ T + E LQV++Y +G YEPHYD+ + F + G R+ATV
Sbjct: 146 VEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYF----LDEFNTKNGGQRIATV 201
Query: 211 LFYMSDVAQGGATVFTSL-------------------NLSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV GG TVF + LS+ P+K A + N+
Sbjct: 202 LMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASL 261
Query: 252 DYYTRHAACPVLTGS 266
D + H CPV+ G+
Sbjct: 262 DPSSLHGGCPVVKGN 276
>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 297
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 102/205 (49%), Gaps = 25/205 (12%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K ++ +PR +Y + D E D + +A+ L+R+ V + ++G+ +++ R S
Sbjct: 33 PSKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSSGM 92
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + P+I I ++ T L E+LQV+ Y G Y+PHYD+ A+
Sbjct: 93 FIAKGKDPIIAGIEEKISTWTFLPKENGEDLQVLRYEHGQKYDPHYDYF----ADKINIA 148
Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSL---------------------NLSLWPEKGTAA 240
G+R+ATVL Y+SDV +GG TVF + +S+ P +G A
Sbjct: 149 RGGHRMATVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDLSECAKKGISVKPRRGDAL 208
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
+ +LH + D + HA CPV+ G
Sbjct: 209 LFFSLHPTAIPDPNSLHAGCPVIEG 233
>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 300
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/196 (28%), Positives = 96/196 (48%), Gaps = 23/196 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P ++++TV + TG + + R S +LR + +
Sbjct: 95 EPRAFIYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGTFLRRGQDKI 154
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I +R+ T + E LQV++Y +G YEPH+D+ + F + G R+ATV
Sbjct: 155 VRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYEPHFDYFH----DDFNTKNGGQRIATV 210
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF S +S+ P+ G A + ++ G
Sbjct: 211 LMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALLFWSMRPDGTL 270
Query: 252 DYYTRHAACPVLTGSN 267
D + H CPV+ G
Sbjct: 271 DPTSLHGGCPVIKGDK 286
>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 296
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/173 (34%), Positives = 95/173 (54%), Gaps = 4/173 (2%)
Query: 94 IILYRD-VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIE 152
IL+ D + + E D + +M++ RL+ +TV + KTGE + A R SK E+ I+
Sbjct: 110 FILHLDYFLSEEECDQLIEMSRERLKPSTVIDPKTGEEKAATGRTSKGMSFYLQENEFIK 169
Query: 153 RISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLF 212
++ +R+ + E LQV+NYGIG Y+ H+D+ + K G RV T L
Sbjct: 170 KVEKRIAELIEFPVENGEGLQVLNYGIGEEYKSHFDYFPQSKVVPEKG---GQRVGTFLI 226
Query: 213 YMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
Y++DV GG TVF +S+ P+KG+A ++ +S G+ D + H++ PV G
Sbjct: 227 YLNDVPAGGETVFPKAGVSIVPKKGSAVYFQYGNSKGEVDRMSLHSSIPVSEG 279
>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
Length = 350
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 97/198 (48%), Gaps = 21/198 (10%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
QPR + ++ + E + I ++A+P ++R+TV + TGE++ R SK +L ++PV
Sbjct: 86 QPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEIKTDPIRTSKQTFLARGKYPV 145
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK-SLGTGNRVAT 209
+ R+ R+ T L E++Q+++YG+G Y H+D + + S G RVAT
Sbjct: 146 VTRVEERLSRFTMLPWYNGEDMQILSYGVGEKYSAHHDVGEKNTKSGQQLSADGGQRVAT 205
Query: 210 VLFYMSDVAQGGATVF--------------------TSLNLSLWPEKGTAAFWHNLHSSG 249
VL Y+ D +GG T F ++ P++G + ++ G
Sbjct: 206 VLLYLQDTEEGGETAFPDSEWIEPESEYAQQKFSECAKNGVAFKPKRGDGLLFFSITPEG 265
Query: 250 DGDYYTRHAACPVLTGSN 267
D D + HA CPV+ G+
Sbjct: 266 DIDQKSMHAGCPVVKGTK 283
>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
Length = 302
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 92/181 (50%), Gaps = 7/181 (3%)
Query: 88 AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
A QPRI+++ +++ E D + AQPRL R+ KTG EI + R S + + +
Sbjct: 111 AMAQPRIVVFGNLLSPEECDALIADAQPRLARSLTVATKTGGEEINDDRTSDGMFFQRGQ 170
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNR 206
P+I+RI R+ + E LQV++Y G Y+PHYD+ P E + G R
Sbjct: 171 SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIVNRGGQR 230
Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPVLT 264
V T++ Y++ +GG T F ++L + P++G A F + H S T H PV+
Sbjct: 231 VGTLVMYLNTPEKGGGTTFPDVHLEVAPQRGNAVFFSYERPHPS----TRTLHGGAPVIA 286
Query: 265 G 265
G
Sbjct: 287 G 287
>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
Length = 318
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 105/213 (49%), Gaps = 22/213 (10%)
Query: 71 VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGEL 130
++R ++ P + + PR LY+ + D E D + +A+ +L ++ V + ++G+
Sbjct: 42 LNRGGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKS 101
Query: 131 EIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
++ R S +L + + ++ I R+ T L E +Q+++Y G YEPH+D+
Sbjct: 102 IMSEVRTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDYF 161
Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL-------W---------- 233
A + +G G+R+ATVL Y+SDV +GG T+F++ L W
Sbjct: 162 HD---KANQVMG-GHRIATVLMYLSDVEKGGETIFSNAKAKLLQPKDESWSECAHKGYAV 217
Query: 234 -PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P KG A + +LH D + H +CPV+ G
Sbjct: 218 KPRKGDALLFFSLHLDASTDNKSLHGSCPVIEG 250
>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 290
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 104/207 (50%), Gaps = 24/207 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + ++A+P++ +++V + KTG+ + R S +L+ + +
Sbjct: 85 EPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTGKSTESRVRTSSGMFLKRGKDKI 144
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
++ I +R+ T + E LQ+++Y +G YEPHYD+ + F + G R+ATV
Sbjct: 145 VQNIEKRIADFTFIPEENGEGLQILHYEVGQKYEPHYDYF----LDEFNTKNGGQRIATV 200
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + N LS+ P+ G A + ++
Sbjct: 201 LMYLSDVEEGGETVFPAANANFSSVPWWNDLSQCARKGLSVKPKMGDALLFWSMRPDATL 260
Query: 252 DYYTRHAACPVLTGSNSLHSTCPCGLR 278
D + H CPV+ G N ST LR
Sbjct: 261 DPSSLHGGCPVIKG-NKWSSTKWMHLR 286
>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
KWC4]
Length = 215
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 96/175 (54%), Gaps = 10/175 (5%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P I+ + ++ D E + + A PRL+ + + N +++ R S+ + E E P
Sbjct: 29 EPLIVRFERLLSDDECRQLIETAAPRLKESKLVNKV-----VSDIRTSRGMFFEEEESPF 83
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I RI RR+ + + AE LQV++YG G Y+ H+DF PG A NR++T+
Sbjct: 84 IHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEYKAHHDFFAPGSPAA-----RNNRISTL 138
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ Y++DV +GG TVF L +++ P++G A ++ + + + T H++ PV+ G
Sbjct: 139 IVYLNDVEEGGETVFPLLGIAMKPKRGAALYFEYFYRNQALNDLTLHSSVPVVRG 193
>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
Length = 217
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/176 (30%), Positives = 98/176 (55%), Gaps = 9/176 (5%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P I++ +V+ D E + + +M++ +L+R+ + N +T + + R S S + E E+ +
Sbjct: 38 EPLIVILGNVLSDEECEGLIRMSEDKLKRSKIGNTRT----VDDIRTSSSMFFEEGENEL 93
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI RR+ + + E LQ++NY IG Y+ H+DF A R++T+
Sbjct: 94 VARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDFFSSSSRAASNP-----RISTL 148
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ Y++DV +GG T F LN S+ P+KG+A ++ + + D + T H PV+ GS
Sbjct: 149 VMYLNDVEEGGETYFPKLNFSVNPQKGSAVYFEYFYDNQDLNDLTLHGGAPVIKGS 204
>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
Length = 212
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 96/176 (54%), Gaps = 13/176 (7%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P I++ +V+ D E D + +++ +L+R+ + N + + R S S ++ E E V
Sbjct: 36 EPLIVVLGNVLSDEECDALIGLSKDKLKRSKIGNTRNEN----DMRTSSSTFMEEGESEV 91
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ R+ +R+ + + E LQ++NY IG Y+ H+DF FK+ + R++T+
Sbjct: 92 VTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDF--------FKN-ASNPRISTL 142
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ Y++DV +GG T F LN S+ P+KG A ++ + + + + T H PV+ G
Sbjct: 143 VMYLNDVEEGGETYFPKLNFSVSPQKGMAVYFEYFYDNQELNDLTLHGGAPVIIGD 198
>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
C-169]
Length = 222
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 93/193 (48%), Gaps = 21/193 (10%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR LY + + ++E D + + +P + ++ V + +TG+ + R S +L E V
Sbjct: 7 EPRAYLYHNFLTEAEADYLVQKGKPHMEKSEVVDNETGKSAPSKVRTSSGMFLNRGEDDV 66
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
IERI R+ T + E LQ+++Y Y PH+D+ + F + G R+AT+
Sbjct: 67 IERIEARIAKYTAIPKENGEGLQILHYQASEEYRPHFDYFH----DNFNTQNGGQRIATM 122
Query: 211 LFYMSDVAQGGATVF-----------------TSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
L Y+SDV GG TVF + P+KG A F+++L G D
Sbjct: 123 LMYLSDVEDGGETVFPESSDKPNVGNTKFSQCAQAGAAAKPKKGDALFFYSLTPDGRMDE 182
Query: 254 YTRHAACPVLTGS 266
+ HA CPV+ G
Sbjct: 183 KSLHAGCPVMKGD 195
>gi|297268736|ref|XP_001115675.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Macaca
mulatta]
Length = 567
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 13/180 (7%)
Query: 56 LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
L P +V KC + + + M LK L + L ++ + +D + +
Sbjct: 258 LIAAPKLVPHAKCPLIREELASVMFM-LKGGAYLLNDKPFL----LHHNHLDFVL-LPSH 311
Query: 116 RLRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEEL 172
+L+R+ V +GE ++ YRISKSAWL++ P++ ++ R+ +TGL AE L
Sbjct: 312 QLQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYL 368
Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
QVVNYGIGGHYEPH+D A + ++ + +GNRVAT + Y+S V GGAT F NLS+
Sbjct: 369 QVVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSV 427
>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
gi|255645457|gb|ACU23224.1| unknown [Glycine max]
Length = 298
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 104/205 (50%), Gaps = 25/205 (12%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K ++ +PR +Y + D E D + +A+ L+R+ V + +GE ++++ R S
Sbjct: 34 PSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSSGM 93
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + P+I I ++ T L E++QV+ Y G Y+PHYD+ +
Sbjct: 94 FISKNKDPIISGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----TDKVNIA 149
Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGTAA 240
G+R+ATVL Y+++V +GG TVF TS +LS + P +G A
Sbjct: 150 RGGHRIATVLMYLTNVTKGGETVFPSAEEPPRRRGTETSSDLSECAKKGIAVKPHRGDAL 209
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
+ +LH++ D + HA CPV+ G
Sbjct: 210 LFFSLHTNATPDTSSLHAGCPVIEG 234
>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
Length = 280
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/219 (27%), Positives = 106/219 (48%), Gaps = 28/219 (12%)
Query: 71 VHRNVPYLRLMPLKEEEAYLQ------PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
+HRN PY +L+ L P + LY++ + D+E D + +A+ +L+++ V +
Sbjct: 1 MHRNFPYYKLVQLLALTRLELLSCLGIPGLFLYKNFLTDAECDHLIFLARDKLQKSMVAD 60
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
++G+ ++ R S +L + + ++ + R+ T L E +QV++Y +G YE
Sbjct: 61 NESGKSVMSEIRTSSGMFLNKAQDEIVASVEDRIAAWTFLPIENGEAMQVLHYELGQKYE 120
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL---------------- 228
PH+D+ A G+R+ATVL Y+SDV +GG TVF +
Sbjct: 121 PHFDYFHDKINQAM----GGHRIATVLMYLSDVVKGGETVFPNAETKDSQPKDDSWSECA 176
Query: 229 --NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
S+ P KG A + +L D + H +CPV+ G
Sbjct: 177 KGGYSVKPNKGDALLFFSLRPDATTDQSSLHGSCPVIEG 215
>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
Length = 297
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 97/193 (50%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR LY + D+E D I +A+ + ++ V + +G+ + R S +L + E +
Sbjct: 41 RPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLAKREDEI 100
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I +RV T L AE LQV+ Y G Y+ H+D+ + N K LG G RVATV
Sbjct: 101 VSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFH--DRNNLK-LG-GQRVATV 156
Query: 211 LFYMSDVAQGGATVF------------------TSLNLSLWPEKGTAAFWHNLHSSGDGD 252
L Y++DV +GG TVF + L++ P+KG A + NLH + D
Sbjct: 157 LMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNATAD 216
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 217 TGSLHGSCPVIEG 229
>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
Length = 298
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/183 (31%), Positives = 93/183 (50%), Gaps = 7/183 (3%)
Query: 88 AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
A QPR++L+ +++ E D I A+PR+ R+ +TG E+ + R S + + E
Sbjct: 107 AMAQPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDDRTSNGMFFQREE 166
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNR 206
+P++ ++ R+ + E LQV++Y G Y+PHYD+ P E L G R
Sbjct: 167 NPMVAKLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPTEPGTPTILRRGGQR 226
Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPVLT 264
VAT++ Y++D +GG T F ++L + P +G A F + H S T H PV+
Sbjct: 227 VATIVIYLNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERPHPS----TRTLHGGAPVVA 282
Query: 265 GSN 267
G
Sbjct: 283 GDK 285
>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
Length = 311
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/239 (26%), Positives = 117/239 (48%), Gaps = 17/239 (7%)
Query: 31 PKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQL-KCRYVHRNVPYLRLMPLKEEEAY 89
P+ A L+ R++Y+ P +++QL + R V +M
Sbjct: 74 PEDERAAAGLQGARRQRYQ-------ASPIRLISQLPRFTVADREVELAAVMS------- 119
Query: 90 LQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHP 149
P I + R ++ D E D + ++++ +++ + V + ++G ++ R S+ + E+
Sbjct: 120 -NPNIAVIRGLLSDEECDEVIRLSRGKMKTSQVVDRESGGSYESSVRKSEGSHFERGENE 178
Query: 150 VIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVA 208
++ RI R+ + L + E LQ+++YG GG Y+ H DF P + +A + G R+
Sbjct: 179 LVRRIEARLSALVDLPVNRGEPLQILHYGPGGEYKAHQDFFEPKDPGSAVLTRVGGQRIG 238
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
TV+ Y++DV +GG T F + S P KG+A ++ ++ G DY HA PV+ G
Sbjct: 239 TVVMYLNDVPEGGETAFPDIGFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPVIRGDK 297
>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
Length = 307
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 98/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E D + +A+P ++++TV + TG + + R S +LR + +
Sbjct: 102 EPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGASKDSRVRTSSGMFLRRGQDKI 161
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I+ I +R+ T + E LQV++Y +G YEPH+D+ + + + G R+AT+
Sbjct: 162 IQTIEKRIADFTFIPVEHGEGLQVLHYEVGQKYEPHFDYFH----DDYNTKNGGQRIATL 217
Query: 211 LFYMSDVAQGGATVF-------------------TSLNLSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV GG TVF LS+ P+ G A + ++ G
Sbjct: 218 LMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWSMKPDGSM 277
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 278 DSTSLHGGCPVIKG-NKWSST 297
>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 291
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 96/195 (49%), Gaps = 23/195 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR ++Y + + + E + + +A+P + ++TV + KTG + + R S +LR V
Sbjct: 86 EPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLRRGHDEV 145
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+E I +R+ T + E LQV++Y +G YEPHYD+ + F + G R+ATV
Sbjct: 146 VEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYF----LDEFNTKNGGQRIATV 201
Query: 211 LFYMSDVAQGGATVFTSL-------------------NLSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV GG TVF + LS+ P+ A + N+
Sbjct: 202 LMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKXRDALLFWNMRPDASL 261
Query: 252 DYYTRHAACPVLTGS 266
D + H CPV+ G+
Sbjct: 262 DPSSLHGGCPVVKGN 276
>gi|328710203|ref|XP_001949232.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
pisum]
Length = 500
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/210 (30%), Positives = 107/210 (50%), Gaps = 17/210 (8%)
Query: 62 IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
I + KCRY H YL + PL+EE L P + LY +V+YD EI IK++A P+L + +
Sbjct: 294 IYPKFKCRYYHGGRKYLMIGPLREEIVSLIPSMKLYHNVLYDDEIKKIKELANPKLEKLS 353
Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL-TTSTAEELQVVNYGIG 180
+ T E + + K A R+ V E I R+ ++ TT+ ++ V NYGIG
Sbjct: 354 ID---TNE----DISLRKVASFRKHNDQVFETIHHRLAQISSKPTTNIVDKYVVTNYGIG 406
Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
GHY PH + + + R A V+ +M DV +GGATV ++ + KG+A
Sbjct: 407 GHYLPHTKYIDDNHL-----INSKRRDAIVIIHMDDVPEGGATVLPNVEFCVPSVKGSAL 461
Query: 241 FWHNLHSS----GDGDYYTRHAACPVLTGS 266
++ ++ + + ++ +CP++ G
Sbjct: 462 VIYSTRNTLPPIKELFEFAQYGSCPIVYGD 491
>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 318
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 104/213 (48%), Gaps = 22/213 (10%)
Query: 71 VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGEL 130
++R ++ P + + PR LY+ + D E D + +A+ +L ++ V + ++G+
Sbjct: 42 LNRGGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKS 101
Query: 131 EIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
++ R S +L + + ++ I R+ T L E +Q+++Y G YEPH+D+
Sbjct: 102 IMSEVRTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDYF 161
Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL-------W---------- 233
A + +G G+R+ATVL Y+SDV +GG T+F + L W
Sbjct: 162 HD---KANQVMG-GHRIATVLMYLSDVEKGGETIFPNAKAKLLQPKDESWSECAHKGYAV 217
Query: 234 -PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P KG A + +LH D + H +CPV+ G
Sbjct: 218 KPRKGDALLFFSLHLDASTDNKSLHGSCPVIEG 250
>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 296
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 92/176 (52%), Gaps = 1/176 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P IL D + +E + + +A+PRL R+TV + TG +A +R S + R E P+
Sbjct: 101 RPAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNVVAGHRSSDGMFFRLGETPL 160
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
I R+ R+ +TGL E LQ+++Y +G PH D+ G AN +G RV T
Sbjct: 161 IARLEARIAELTGLPVENGEGLQLLHYEVGAESTPHVDYLIAGNPANQESIARSGQRVGT 220
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+L Y++DV GG T+F S+ P +G A ++ + G D + H + P+ G
Sbjct: 221 LLMYLNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLRVG 276
>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
Length = 299
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 55/179 (30%), Positives = 92/179 (51%), Gaps = 3/179 (1%)
Query: 88 AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
A +PRI+++ +++ E D + A PR+ R+ KTG E+ + R S + + E
Sbjct: 108 AIAKPRIVVFGNLLSAEECDALIAAAAPRMARSLTVATKTGGEEVNDDRTSDGMFFQRGE 167
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNR 206
+PV++RI R+ + E LQV++Y G Y+PHYD+ PGE L G R
Sbjct: 168 NPVVQRIEERIARLLDWPIENGEGLQVLHYRPGAEYKPHYDYFDPGEPGTPTILKRGGQR 227
Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
V T++ Y++ +GG T F +++ + P++G A F+ + T H PV+ G
Sbjct: 228 VGTLVMYLNTPEKGGGTTFPDVHVEVAPQRGNAVFFS--YERAHPATRTLHGGAPVIAG 284
>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
Length = 306
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 93/176 (52%), Gaps = 3/176 (1%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
PR++++ +++ D E D I A+PR+RR+ + ++G + + R S + + E+ +
Sbjct: 118 HPRVVVFGNLLSDEECDAIIAAARPRMRRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDL 177
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
I + +R+ + E +QV++Y G Y+PHYD+ P E L G RV T
Sbjct: 178 ISLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGT 237
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++ Y+++ A+GGAT F + L + P +G A F+ ++ D T H PVL G
Sbjct: 238 LVMYLNEPARGGATTFPDVGLQIVPRRGNAVFFS--YNRPDPATKTLHGGAPVLEG 291
>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 318
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 99/201 (49%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+PR+ ++TV + TG+ + + R S +LR V
Sbjct: 113 EPRAFVYHNFLSKEECEYLIGLAKPRMEKSTVVDSTTGKSKDSRVRTSSGMFLRRGRDKV 172
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I RR+ T + E LQV++Y +G YEPH+D+ + F + G R+AT+
Sbjct: 173 IRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRMATI 228
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG T+F N L++ P+ G A + +++
Sbjct: 229 LMYLSDVEEGGETIFPDANVNSSSLPWHNELSECARKGLAVKPKMGDALLFWSMNPDATL 288
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 289 DPLSLHGGCPVIRG-NKWSST 308
>gi|195503448|ref|XP_002098656.1| GE23815 [Drosophila yakuba]
gi|194184757|gb|EDW98368.1| GE23815 [Drosophila yakuba]
Length = 472
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 68/225 (30%), Positives = 103/225 (45%), Gaps = 34/225 (15%)
Query: 42 VTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVM 101
+ ER + CRG +PP+ + L+CRY P+LR LK E+ ++P + L+ D +
Sbjct: 239 IAERLVHVDNCRGK-NLPPS-KSFLRCRYFREGSPFLRWAALKLEQLNIEPFVGLFHDAI 296
Query: 102 YDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHM 161
+E + + ++ + RL +Y AN + S +H + RI +R+E +
Sbjct: 297 SPAEQEDLLRLTETRLEHRKKDSYSVE----ANVDTNGS------DH--VRRIHQRIEDI 344
Query: 162 TGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
TG +E L V NYGIGG H D +P +SDV GG
Sbjct: 345 TGFDLEDSEPLTVSNYGIGGQESIHLDCEQPK--------------------LSDVQMGG 384
Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F L P +G+A WHN S+G+ D + A CPVL G+
Sbjct: 385 YASFPDLGFGFKPSRGSALVWHNTDSAGNCDTRSLQATCPVLLGN 429
>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
Length = 211
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 94/175 (53%), Gaps = 10/175 (5%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P I+ + +V+ D E + A RL R+ K + EI++ R S + E E+P+
Sbjct: 29 EPLIVKFLNVLSDEECQNLIDCASSRLERS-----KLAKKEISSIRTSSGMFFEENENPL 83
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ + L AE LQV++Y G ++PH+DF P ++ + NR+ T+
Sbjct: 84 ISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKPHFDFFGPNHPSS-----SNNRICTL 138
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ Y++DV +GG T F +L + P+KGTA ++ ++ + T H+ PV+ G
Sbjct: 139 VVYLNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHSGEPVIQG 193
>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
Length = 283
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 55/195 (28%), Positives = 103/195 (52%), Gaps = 4/195 (2%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P + L DV+ E D + ++ + R+RR++V + +G + + R S+ A++ P+
Sbjct: 90 EPVVALLADVLSPRECDRLIEIGRERVRRSSVVDPDSGGEVLIDARKSEGAFVNGSTDPL 149
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
+ I RR+ + E+L ++ YG GG Y PH+D+ +A + + G R+AT
Sbjct: 150 VATIDRRIAELVQQPVENGEDLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQRGGQRIAT 209
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
++ Y++ V +GG T F + L++ P +G A ++ +++ G D T HA PV G +
Sbjct: 210 LILYLNQVEEGGDTTFPDIGLTIHPRRGAALYFEYVNALGQTDPRTLHAGMPVERGEKWI 269
Query: 270 HSTCPCGLRRGLQRS 284
+ +RRG R+
Sbjct: 270 ATKW---MRRGRFRA 281
>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 319
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 105/213 (49%), Gaps = 22/213 (10%)
Query: 71 VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGEL 130
++R ++ P + + PR LY+ + + E D + +A+ +L ++ V + +G+
Sbjct: 43 LNRGGSSVKFDPTRVTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSGKS 102
Query: 131 EIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
+++ R S +L + + ++ I R+ T L E +Q+++Y G YEPH+D+
Sbjct: 103 IMSDIRTSSGMFLNKAQDEIVAGIEARIAAWTFLPVENGESMQILHYENGQKYEPHFDYF 162
Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL-------W---------- 233
A + +G G+R+ATVL Y+SDV +GG T+F + L W
Sbjct: 163 HD---KANQVMG-GHRIATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGYAV 218
Query: 234 -PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P+KG A + +LH D + H +CPV+ G
Sbjct: 219 KPQKGDALLFFSLHLDASTDTKSLHGSCPVIEG 251
>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
Length = 310
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 57/193 (29%), Positives = 96/193 (49%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PRI Y+ + D E D + K+ + +L+R+ V + ++G+ ++ R S +L + + PV
Sbjct: 54 KPRIFFYKGFLSDDECDHLVKLGKEKLKRSMVADNESGKSVMSEVRTSSGMFLDKQQDPV 113
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I R+ T L AE +Q++ Y G Y+PH+D+ + + L G+R ATV
Sbjct: 114 VSGIEERIAAWTLLPQENAENIQILRYENGQKYDPHFDYFQ----DKVNQLQGGHRYATV 169
Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
L Y+S V +GG TVF + L++ KG + + NL G D
Sbjct: 170 LTYLSTVEKGGETVFPNAEGWESQPKDDSFSDCAKKGLAVKAVKGDSVLFFNLQPDGTPD 229
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 230 PLSLHGSCPVIEG 242
>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
Length = 299
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 93/176 (52%), Gaps = 3/176 (1%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
PR++++ +++ D E D I A+PR+RR+ + ++G + + R S + + E+ +
Sbjct: 111 HPRVVVFGNLLSDEECDAIIAAARPRMRRSLTVDNQSGGEAVNDDRTSNGMFFQRGENEL 170
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
I + +R+ + E +QV++Y G Y+PHYD+ P E L G RV T
Sbjct: 171 ISLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGT 230
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++ Y+++ A+GGAT F + L + P +G A F+ ++ D T H PVL G
Sbjct: 231 LVMYLNEPARGGATTFPDVGLQVVPRRGNAVFFS--YNRPDPATKTLHGGAPVLEG 284
>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 293
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 100/193 (51%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR LY + +E D + K+A+ RL+++ V + +G+ ++ R S +L + E +
Sbjct: 37 RPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVADNDSGKSVMSQVRTSSGTFLNKHEDEI 96
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +RV T L AE +QV++Y +G Y+ H+D+ + LG G+RVATV
Sbjct: 97 ISGIEKRVAAWTFLPEENAESIQVLHYEVGQKYDAHFDYFHDKNN---QKLG-GHRVATV 152
Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
L Y++DV +GG TVF + L++ P KG A + +LH + D
Sbjct: 153 LMYLTDVKKGGETVFPNAEGRHLQHKDETWSECARSGLAVKPRKGDALLFFSLHINATTD 212
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 213 PSSLHGSCPVIEG 225
>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
Length = 309
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/192 (33%), Positives = 95/192 (49%), Gaps = 19/192 (9%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR L + + D E + I A+PR+ +++V + +G+ + R S AWL + E +I
Sbjct: 61 PRAFLLKGFLSDEECEHIIAKAKPRMVKSSVVDNASGKSVDSEIRTSTGAWLAKGEDEII 120
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATV 210
RI +RV +T + E LQV++Y G YEPHYD F P NA G G RV TV
Sbjct: 121 SRIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDP--VNASPEHG-GQRVVTV 177
Query: 211 LFYMSDVAQGGATVFTSLN---------------LSLWPEKGTAAFWHNLHSSGDGDYYT 255
L Y++ V +GG TV + L++ P KG A +++L G D +
Sbjct: 178 LMYLTTVEEGGETVLPHADQKVSGEGWSECAKRGLAVKPVKGDALMFYSLKPDGSNDPAS 237
Query: 256 RHAACPVLTGSN 267
H +CP L G
Sbjct: 238 LHGSCPTLKGDK 249
>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
Length = 303
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 59/185 (31%), Positives = 93/185 (50%), Gaps = 11/185 (5%)
Query: 88 AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
A QPRI+++ +++ E D + A+PR+ R+ KTG EI R S + + +
Sbjct: 112 AMAQPRIVVFGNLLSPEECDALIAAAEPRMARSLTVATKTGGEEINADRTSDGMFFQRGQ 171
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTG 204
P+I+RI R+ + E LQV++Y G Y+PHYD+ A PG + K G
Sbjct: 172 SPLIQRIEERIARLLQWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIIKR--GG 229
Query: 205 NRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPV 262
RV T++ Y++ +GG T F ++L + P++G A F + H S T H PV
Sbjct: 230 QRVGTLVMYLNTPDKGGGTTFPDVHLEVAPQRGNAVFFSYERPHPS----TRTLHGGAPV 285
Query: 263 LTGSN 267
+ G
Sbjct: 286 IAGDK 290
>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
Length = 296
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 65/199 (32%), Positives = 100/199 (50%), Gaps = 6/199 (3%)
Query: 75 VPYLRLMPLKEEEAYL-----QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGE 129
VP L+PL E + + +P + D + E + + +AQPRL R+TV + TG
Sbjct: 80 VPDGPLIPLGERKVRVLSRLQRPAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGR 139
Query: 130 LEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF 189
+A +R S + R E P+I RI R+ +TG E LQ+++Y G PH D+
Sbjct: 140 NVVAGHRSSHGMFFRLGETPLIVRIEARIAALTGTPVENGEGLQMLHYEEGAESTPHVDY 199
Query: 190 ARPG-EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSS 248
G EAN +G R+ T+L Y+ DV GG TVF + S+ P++G A ++ +
Sbjct: 200 LITGNEANRESIARSGQRMGTLLMYLKDVEGGGETVFPQIGWSVAPQRGHALYFEYGNRF 259
Query: 249 GDGDYYTRHAACPVLTGSN 267
G D + HA+ P+ G
Sbjct: 260 GLCDPSSLHASTPLRVGDK 278
>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
Length = 316
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 99/202 (49%), Gaps = 22/202 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + + PR+ LY + D E D K+A+ +L ++ V + +GE + R S
Sbjct: 53 PTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 112
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
+L + + ++ + ++ T L E +Q+++Y G YEPH+D+ +AN L
Sbjct: 113 FLSKRQDDIVNNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHD-QANL--EL 169
Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------TSLNLSLW-----------PEKGTAAFWH 243
G G+R+ATVL Y+S+V +GG TVF T L W P KG A +
Sbjct: 170 G-GHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFF 228
Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
NLH + D + H +CPV+ G
Sbjct: 229 NLHPNATTDSNSLHGSCPVVEG 250
>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
Length = 297
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/193 (32%), Positives = 97/193 (50%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR LY + D+E D + +A+ + ++ V + +G+ + R S +L + E +
Sbjct: 41 RPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLAKREDEI 100
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I +RV T L AE LQV+ Y G Y+ H+D+ + N K LG G RVATV
Sbjct: 101 VSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFH--DRNNLK-LG-GQRVATV 156
Query: 211 LFYMSDVAQGGATVF------------------TSLNLSLWPEKGTAAFWHNLHSSGDGD 252
L Y++DV +GG TVF + L++ P+KG A + NLH + D
Sbjct: 157 LMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNATAD 216
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 217 TGSLHGSCPVIEG 229
>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 316
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 99/202 (49%), Gaps = 22/202 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + + PR+ LY + D E D K+A+ +L ++ V + +GE + R S
Sbjct: 53 PTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 112
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
+L + + ++ + ++ T L E +Q+++Y G YEPH+D+ +AN L
Sbjct: 113 FLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHD-QANL--EL 169
Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------TSLNLSLW-----------PEKGTAAFWH 243
G G+R+ATVL Y+S+V +GG TVF T L W P KG A +
Sbjct: 170 G-GHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFF 228
Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
NLH + D + H +CPV+ G
Sbjct: 229 NLHPNATTDSNSLHGSCPVVEG 250
>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/230 (26%), Positives = 110/230 (47%), Gaps = 23/230 (10%)
Query: 56 LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
L+ P A R H + L+ E QPR LY + + E + + +A P
Sbjct: 45 LSTPHANANSSVSRNTHIEAEEDDQVALRMEVISWQPRAFLYHNFLTKEECEYLINIATP 104
Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
++++TV + ++G+ + + R S A+L + ++ I +R+ +T + E + V+
Sbjct: 105 HMQKSTVADNQSGQSVVHDVRKSTGAFLDRGQDEIVRNIEKRIADVTFIPIENGEPIYVI 164
Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF---------- 225
+Y +G +Y+PHYD+ + F G R+AT+L Y+S+V +GG T+F
Sbjct: 165 HYEVGQYYDPHYDYF----IDDFNIENGGQRIATMLMYLSNVEEGGETMFPRAKANFSSV 220
Query: 226 ---------TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ LS+ P+ G A + ++ + D T H+ACPV+ G+
Sbjct: 221 PWWNELSNCGKMGLSIKPKMGDALLFWSMKPNATLDALTLHSACPVIKGN 270
>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
Length = 299
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 93/175 (53%), Gaps = 3/175 (1%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR++++ +++ D E D I A PR++R+ + ++G + + R S + + E+ +I
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAAGPRMQRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 171
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
R+ +R+ + E +QV++Y G Y+PHYD+ P E L G RV T+
Sbjct: 172 CRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 231
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ Y+++ A+GGAT F + L + P +G A F+ ++ D T H PVL G
Sbjct: 232 VMYLNEPARGGATTFPDVGLQVVPRRGNAVFFS--YNRPDPATKTLHGGAPVLEG 284
>gi|351714551|gb|EHB17470.1| Prolyl 4-hydroxylase subunit alpha-1 [Heterocephalus glaber]
Length = 388
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/147 (38%), Positives = 88/147 (59%), Gaps = 11/147 (7%)
Query: 4 PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
P HQRA GN Y++ + K S + D+ + ++ + ER+KYEMLCRG
Sbjct: 241 PEHQRANGNLKYFEYIMAKEKDANKSASDDQSDQKSTLRKKGIAVDYLPERQKYEMLCRG 300
Query: 55 D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
+ + + P +L CRY N P L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 301 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 360
Query: 113 AQPRLRRATVQNYKTGELEIANYRISK 139
A+PRL RATV + +TG+L A YR+SK
Sbjct: 361 AKPRLSRATVHDPETGKLTTAQYRVSK 387
>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 364
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 97/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E D + +A+P ++++TV + TG + + R S +LR + +
Sbjct: 159 EPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGMFLRRGQDKI 218
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y +G YEPH+D+ + + + G R+AT+
Sbjct: 219 IRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFH----DDYNTKNGGQRIATL 274
Query: 211 LFYMSDVAQGGATVF-------------------TSLNLSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV GG TVF LS+ P+ G A + ++ G
Sbjct: 275 LMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWSMKPDGSL 334
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 335 DPTSLHGGCPVIKG-NKWSST 354
>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 483
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 86 EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV--QNYKTGELEIANYRISKSAWL 143
E L+P ++ + D E D I ++A P+++ ++V ++ G+ + + +R S+SA+L
Sbjct: 262 ETLSLRPLVVSVEGFLSDEECDYIAEIASPQVKYSSVSLKDADKGK-DSSEWRTSQSAFL 320
Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-- 201
+ V+ I RV +T + + E +QV+ YG G Y+ H+D+ P + KS
Sbjct: 321 SARDDEVLTEIDHRVASLTRIPRNHQEYVQVLRYGAGEKYDSHHDYFDPSAYRSDKSTLR 380
Query: 202 ----GTGNRVATVLFYMSDVAQGGATVF--------------TSLNLSLWPEKGTAAFWH 243
G NR ATV +Y++DV GG T+F S+ L + P+KG ++
Sbjct: 381 LIENGKKNRYATVFWYLTDVHDGGETIFPRYGGAPAPRSHKDCSIGLKVKPQKGKVVIFY 440
Query: 244 NLHSSGDGDYYTRHAACPVLTGSNSL 269
+L +SG+ D ++ H ACPV G N+L
Sbjct: 441 SLDASGEMDPFSLHGACPV--GENNL 464
>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
Length = 308
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 97/193 (50%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR L++ + D+E + + +A+ +L ++ V + ++G+ ++ R S +L + + V
Sbjct: 51 RPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQDEV 110
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI R+ T L E +Q+++Y G YEPHYD+ A LG G+R+ATV
Sbjct: 111 VARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 166
Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
L Y+SDV +GG T+F L W P KG A + +LH D
Sbjct: 167 LMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDATTD 226
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 227 SDSLHGSCPVIEG 239
>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 308
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/203 (28%), Positives = 97/203 (47%), Gaps = 21/203 (10%)
Query: 80 LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
+ P + PR LY + D E + + +A+ L+R+ V + +G+ +++ R S
Sbjct: 46 VYPHHSRQISWHPRAFLYPHFLSDDEANHLVSLARAELKRSAVADETSGKSQLSEVRTSS 105
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
++ + + P++ I ++ T L E++QV+ Y G YEPHYDF ++
Sbjct: 106 GTFISKGKDPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKYEPHYDFF----TDSVN 161
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF-----------------TSLNLSLWPEKGTAAFW 242
++ G+RVATVL Y++DVA+GG TVF +++ P KG A +
Sbjct: 162 TILGGHRVATVLLYLTDVAEGGETVFPLAKGRKGSHHKGLSECAQKGIAVKPRKGDALLF 221
Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
NL D + H C V+ G
Sbjct: 222 FNLRPDAATDPTSLHGGCEVIKG 244
>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
Length = 291
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 101/201 (50%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + +E + + +A+PR++++TV + TG+ + + R S +L +
Sbjct: 86 KPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDSSTGKSKDSKVRTSSGTFLPRGRDKI 145
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I +R+ + + E LQ+++Y +G YEPH+D+ + + + G R+ATV
Sbjct: 146 VRDIEKRIADFSFIPVEHGEGLQILHYEVGQRYEPHFDYF----MDEYNTKNGGQRIATV 201
Query: 211 LFYMSDVAQGGATVFTSL-------------------NLSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF S LS+ P+ G A + +++ G
Sbjct: 202 LMYLSDVEEGGETVFPSAEGNISAVPWWNELSECGKGGLSVKPKMGDALLFWSMNPDGSP 261
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 262 DPSSLHGGCPVIRG-NKWSST 281
>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 303
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/205 (27%), Positives = 100/205 (48%), Gaps = 25/205 (12%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K ++ PR +Y + D E D + +A+ L+R++V + +G+ +++ R S A
Sbjct: 38 PAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSSGA 97
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + P++ I ++ T L E++QV+ Y G Y+ H+D+ A+
Sbjct: 98 FIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYF----ADKVNIA 153
Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSL---------------------NLSLWPEKGTAA 240
G+R+ATVL Y+SDV +GG TVF S +++ P KG A
Sbjct: 154 RGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDAL 213
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
+ +LH + D + H CPV+ G
Sbjct: 214 LFFSLHPNAIPDTSSLHGGCPVIEG 238
>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
Length = 308
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 97/193 (50%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR L++ + D+E + + +A+ +L ++ V + ++G+ ++ R S +L + + V
Sbjct: 51 RPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQDEV 110
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI R+ T L E +Q+++Y G YEPHYD+ A LG G+R+ATV
Sbjct: 111 VARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 166
Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
L Y+SDV +GG T+F L W P KG A + +LH D
Sbjct: 167 LMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDATTD 226
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 227 SDSLHGSCPVIEG 239
>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 332
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 99/202 (49%), Gaps = 22/202 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + + PR+ LY + D E D K+A+ +L ++ V + +GE + R S
Sbjct: 69 PTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 128
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
+L + + ++ + ++ T L E +Q+++Y G YEPH+D+ +AN L
Sbjct: 129 FLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHD-QANL--EL 185
Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------TSLNLSLW-----------PEKGTAAFWH 243
G G+R+ATVL Y+S+V +GG TVF T L W P KG A +
Sbjct: 186 G-GHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFF 244
Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
NLH + D + H +CPV+ G
Sbjct: 245 NLHPNATTDSNSLHGSCPVVEG 266
>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 297
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 102/214 (47%), Gaps = 26/214 (12%)
Query: 74 NVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIA 133
N +L P K + +PR +Y + D E D + +A+ L+R+ V + ++G+ +++
Sbjct: 26 NDSIFKLNPSKVRQISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVADNESGKSQVS 85
Query: 134 NYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG 193
R S A++ + + +++RI ++ T L E++QV+ Y G YE H+DF
Sbjct: 86 EVRTSSGAFISKAKDAIVQRIEEKLATWTFLPIENGEDIQVLRYEEGQKYENHFDFF--- 142
Query: 194 EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL----------------------S 231
++ G+R ATVL Y+S+V +GG TVF + L S
Sbjct: 143 -SDKVNIARGGHRYATVLMYLSNVEKGGDTVFPNAELSERQKAAIAANDDLSECAKRGIS 201
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ P KG A + +L + D + H CPV+ G
Sbjct: 202 VKPRKGDALLFFSLTPTATPDQLSLHGGCPVIEG 235
>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
Length = 267
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/204 (30%), Positives = 101/204 (49%), Gaps = 19/204 (9%)
Query: 77 YLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYR 136
+LRL +K E PRII++ + + E D ++ +A+PRL+ +TV + TG+ +N R
Sbjct: 53 FLRLGLVKPEVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSNVR 112
Query: 137 ISKSAWLREPEH--PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE 194
S ++ E PVI+ I +R+ + + E +QV+ Y +Y PH+D+
Sbjct: 113 TSSGMFVSSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYF---- 168
Query: 195 ANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTAAF 241
++ F G RVAT+L Y++D +GG T F L + P KG A
Sbjct: 169 SDTFNIKRGGQRVATMLMYLTDGVEGGETHFPQAGDGECSCGGKMVKGLCVKPNKGDAVL 228
Query: 242 WHNLHSSGDGDYYTRHAACPVLTG 265
+ ++ G+ D + H CPVL G
Sbjct: 229 FWSMGLDGETDSNSIHGGCPVLEG 252
>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 267
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 102/208 (49%), Gaps = 19/208 (9%)
Query: 73 RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEI 132
+ +LRL +K E PRII++ + + E D ++ +A+PRL+ +TV + TG+
Sbjct: 49 QEAAFLRLGLVKPEVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVK 108
Query: 133 ANYRISKSAWLREPEH--PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
+N R S ++ E PVI+ I +R+ + + E +QV+ Y +Y PH+D+
Sbjct: 109 SNVRTSSGMFVSSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYF 168
Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKG 237
++ F G RVAT+L Y++D +GG T F L + P KG
Sbjct: 169 ----SDTFNIKRGGQRVATMLMYLTDGVEGGETHFPQAGDGECSCGGKMVKGLCVKPNKG 224
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + ++ G+ D + H CPVL G
Sbjct: 225 DAVLFWSMGLDGETDSNSIHGGCPVLEG 252
>gi|195352178|ref|XP_002042591.1| GM14978 [Drosophila sechellia]
gi|194124475|gb|EDW46518.1| GM14978 [Drosophila sechellia]
Length = 467
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/184 (31%), Positives = 94/184 (51%), Gaps = 32/184 (17%)
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
+ L CRY +L+L PLK EE P I+++ +V+ D EI+ +K
Sbjct: 296 SNLVCRYNSSTNAFLQLAPLKMEEVSRDPYIVMFHEVVSDKEIEEMK------------- 342
Query: 124 NYKTGEL-EIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGH 182
GE+ E+ N + E +RI++R+ MTG +Q N+G+GG+
Sbjct: 343 ----GEITEMENGK----------ESSFSKRINQRISDMTGFKLEEFPAIQSANFGVGGY 388
Query: 183 YEPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
++PHYD+ R E + +LG +R+ +++FY +V+QGG TVF + + P+KG A
Sbjct: 389 FKPHYDYYTDRLKEVDVNNTLG--DRIGSIIFYAGEVSQGGQTVFPDSKVMVEPKKGNAL 446
Query: 241 FWHN 244
W N
Sbjct: 447 LWFN 450
>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 683
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/188 (31%), Positives = 97/188 (51%), Gaps = 17/188 (9%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR +Y + + E + + +A+P + R+ V + TGE++ ++ R S +L + ++
Sbjct: 119 PRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESSSRTSSGMFLDRGKDKIV 178
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVL 211
+ I RR+ +T + E L V++YG+G EPHYD+ G G RVATVL
Sbjct: 179 QNIERRIADITSVPIENGEGLHVIHYGVGQKCEPHYDYTSDGVVTK----NGGPRVATVL 234
Query: 212 FYMSDVAQGGATV-------FTSLN------LSLWPEKGTAAFWHNLHSSGDGDYYTRHA 258
Y+SDV +GG TV FTS++ LS+ P+ G A + ++ G D + H
Sbjct: 235 MYLSDVEEGGETVFPDAQPNFTSVSKCSGDGLSVKPKMGDALLFWSMKPDGTLDTSSLHG 294
Query: 259 ACPVLTGS 266
PV+ G+
Sbjct: 295 GSPVIRGN 302
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/175 (24%), Positives = 76/175 (43%), Gaps = 32/175 (18%)
Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
E + + +A+P + R+ V + TG+ ++ R S +L + +++ I +R+ +T +
Sbjct: 377 ECEHLINLAKPFMTRSLVVDGLTGKGRESSARTSSGRFLERGKDKIVQNIEQRIADITSI 436
Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
+ + G+ + G G RVATVL Y+SDV +GG TV
Sbjct: 437 PRMARDFMLFTAGGV---------VTKNG----------GPRVATVLMYLSDVEEGGETV 477
Query: 225 FTSLN-------------LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
F + LS+ P+ G A + ++ G D + H PV+ G+
Sbjct: 478 FPNAKPNINSVSKYPEKGLSVKPKMGDALLFRSMKPDGTLDTSSLHGGSPVIRGN 532
>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 315
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 100/201 (49%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + ++A+PR+ ++TV + +TG+ + + R S +L+ V
Sbjct: 110 EPRAFVYHNFLSKEECEYLIELAKPRMVKSTVVDSETGKSKDSRVRTSSGMFLQRGRDKV 169
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I RR+ T + E LQV++Y +G YEPH+D+ + F + G R+AT+
Sbjct: 170 IRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRMATI 225
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SD+ +GG T+F N L++ P+ G A + ++
Sbjct: 226 LMYLSDIEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATL 285
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 286 DPLSLHGGCPVIKG-NKWSST 305
>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 214
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/196 (30%), Positives = 96/196 (48%), Gaps = 23/196 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR LY + + E + + ++A+P L ++TV + TG+ + + R S +L + PV
Sbjct: 9 EPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSRLRTSSGTFLMRGQDPV 68
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I+RI +R+ T + E LQV+ Y YEPHYD+ +A+ + G R+ATV
Sbjct: 69 IKRIEKRIADFTFIPAEQGEGLQVLQYKESEKYEPHYDYFH----DAYNTKNGGQRIATV 124
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+S+V +GG TVF + LS+ P G A + ++
Sbjct: 125 LMYLSNVEEGGETVFPAAQVNKTEVPDWDKLSECAQKGLSVRPRMGDALLFWSMKPDATL 184
Query: 252 DYYTRHAACPVLTGSN 267
D + H CPV+ G+
Sbjct: 185 DSTSLHGGCPVIKGTK 200
>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/193 (31%), Positives = 97/193 (50%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR L++ + D+E D + +A+ +L ++ V + K+G+ + R S +L + + V
Sbjct: 41 RPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGMFLEKKQDEV 100
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI R+ T L E +Q+++Y G YEPHYD+ A LG G+R+ATV
Sbjct: 101 VTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 156
Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
L Y+S+V +GG T+F + L W P KG A + +LH D
Sbjct: 157 LMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTD 216
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 217 SDSLHGSCPVIEG 229
>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
gi|194693016|gb|ACF80592.1| unknown [Zea mays]
gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 307
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 97/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E D + +A+P ++++TV + TG + + R S +LR + +
Sbjct: 102 EPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGMFLRRGQDKI 161
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y +G YEPH+D+ + + + G R+AT+
Sbjct: 162 IRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFH----DDYNTKNGGQRIATL 217
Query: 211 LFYMSDVAQGGATVF-------------------TSLNLSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV GG TVF LS+ P+ G A + ++ G
Sbjct: 218 LMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWSMKPDGSL 277
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 278 DPTSLHGGCPVIKG-NKWSST 297
>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 306
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 101/207 (48%), Gaps = 21/207 (10%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR-LRRATVQNYKTGELEIANYR 136
+R P + +PR LY+ + ++E D + +A+ L+++ V + +TG+ ++ R
Sbjct: 31 VRFDPTRAVHVSWRPRAFLYKGFLTEAECDHLVALAEEGGLQKSMVVDRQTGKSVMSEVR 90
Query: 137 ISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEAN 196
S +L + + V+ I R+ T L E +QV+ Y G YEPH DF R A
Sbjct: 91 TSSGTFLAKKQDQVVATIEARIAAWTLLPQENGESIQVLRYENGQKYEPHVDFIRHA-AK 149
Query: 197 AFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL------------------SLWPEKGT 238
S G G+RVATVL Y+SDV GG TVF + + ++ P KG
Sbjct: 150 GHHSRG-GHRVATVLMYLSDVKMGGETVFPNSDAKTLQPKDDTQSECARRGYAVKPVKGD 208
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + +LH +G D + H CPV+ G
Sbjct: 209 AVLFFSLHPNGTTDRDSLHGGCPVIEG 235
>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 296
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 92/176 (52%), Gaps = 1/176 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P +L D + +E + + +A+PRL R+TV + TG +A +R S + R E P+
Sbjct: 101 RPAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNVVAGHRSSDGMFFRLGETPL 160
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVAT 209
I R+ R+ +TGL E LQ+++Y G PH D+ G +S+ +G RV T
Sbjct: 161 IARLEARIAELTGLPVENGEGLQLLHYEAGAESTPHVDYLIAGNPANRESIARSGQRVGT 220
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+L Y++DV GG T+F S+ P +G A ++ + G D + H + P+ G
Sbjct: 221 LLMYLNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLRAG 276
>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
Length = 222
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 97/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR LY + + E + + +A+P ++++TV + TG + + R S +L + +
Sbjct: 17 EPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTSSGMFLGRGQDKI 76
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y +G YEPH+D+ + F + G R+AT+
Sbjct: 77 IRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFH----DEFNTKNGGQRIATL 132
Query: 211 LFYMSDVAQGGATVF-------------------TSLNLSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG T+F L++ P+ G A + ++ G
Sbjct: 133 LMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWSMRPDGSL 192
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 193 DATSLHGGCPVIKG-NKWSST 212
>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
sativa Japonica Group]
gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
Length = 321
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 61/231 (26%), Positives = 107/231 (46%), Gaps = 31/231 (13%)
Query: 61 AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
A + L+ R + P+ ++ +PR LY + + E + + +A+P ++++
Sbjct: 93 AFESGLEMRGGEKGEPWTEVLSW-------EPRAFLYHNFLSKEECEYLISLAKPHMKKS 145
Query: 121 TVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
TV + TG + + R S +L + +I I +R+ T + E LQV++Y +G
Sbjct: 146 TVVDASTGGSKDSRVRTSSGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVG 205
Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF--------------- 225
YEPH+D+ + F + G R+AT+L Y+SDV +GG T+F
Sbjct: 206 QKYEPHFDYFH----DEFNTKNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNE 261
Query: 226 ----TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHST 272
L++ P+ G A + ++ G D + H CPV+ G N ST
Sbjct: 262 LSECAKKGLAVKPKMGDALLFWSMRPDGSLDATSLHGGCPVIKG-NKWSST 311
>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
Length = 309
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 97/194 (50%), Gaps = 23/194 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR L++ + D+E + + +A+ +L ++ V + ++G+ ++ R S +L + + V
Sbjct: 51 RPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQDEV 110
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI R+ T L E +Q+++Y G YEPHYD+ A LG G+R+ATV
Sbjct: 111 VARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 166
Query: 211 LFYMSDVAQGGATVFTSLNL--------SLW-----------PEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG T+F + W P KG A + +LH
Sbjct: 167 LMYLSDVGKGGETIFPEAEVGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDATT 226
Query: 252 DYYTRHAACPVLTG 265
D + H +CPV+ G
Sbjct: 227 DSDSLHGSCPVIEG 240
>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
Length = 313
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 101/206 (49%), Gaps = 22/206 (10%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
++ P + + PR LY++ + D E D + ++++ +L ++ V + ++G+ + R
Sbjct: 44 VKFDPTRVTQLSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMVADNESGKSIQSEVRT 103
Query: 138 SKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANA 197
S +L + + ++ I R+ T L E +QV++Y G YEPH+DF A
Sbjct: 104 SSGMFLNKQQDEIVSGIEARIAAWTFLPVENGESMQVLHYMNGEKYEPHFDFFHD---KA 160
Query: 198 FKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTA 239
+ LG G+RVATVL Y+S+V +GG T+F L W P KG A
Sbjct: 161 NQRLG-GHRVATVLMYLSNVEKGGETIFPHAEGKLSQPKDESWSECAHKGYAVKPRKGDA 219
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTG 265
+ +LH D + H +CPV+ G
Sbjct: 220 LLFFSLHLDATTDSKSLHGSCPVIEG 245
>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
Length = 328
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 95/184 (51%), Gaps = 16/184 (8%)
Query: 86 EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE 145
E +PR ++ + M + E D I +A+P ++R+TV +E R S +L+
Sbjct: 33 EPVSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAGGASVE-DQIRTSYGTFLKR 91
Query: 146 PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
+ P++ + +R+ T L S E++Q++ YGIG Y HYD SL +
Sbjct: 92 LQDPIVTAVEQRLATWTKLNVSHQEDMQILRYGIGQKYGAHYD-----------SLDNDS 140
Query: 206 -RVATVLFYMSDVAQ--GGATVFTSL-NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACP 261
RV TVL Y+SDV GG T F + +L+P+KG A +++L G D Y+ H CP
Sbjct: 141 PRVCTVLLYLSDVPADGGGETAFPGVRRQALYPKKGDALLFYSLKPDGTSDAYSLHTGCP 200
Query: 262 VLTG 265
+++G
Sbjct: 201 IISG 204
>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 99/201 (49%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + K+A+P ++++TV + TG+ + + R S +L + +
Sbjct: 83 EPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTGKSKDSRVRTSSGTFLTRGQDKI 142
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T L E LQ+++Y +G YEPHYD+ + + + G R+ATV
Sbjct: 143 IRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYF----LDDYNTKNGGQRMATV 198
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LS+ P+ G A + ++
Sbjct: 199 LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKEGLSVKPKMGDALLFWSMKPDASL 258
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 259 DPSSLHGGCPVIKG-NKWSST 278
>gi|194906709|ref|XP_001981416.1| GG11627 [Drosophila erecta]
gi|190656054|gb|EDV53286.1| GG11627 [Drosophila erecta]
Length = 462
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 97/215 (45%), Gaps = 34/215 (15%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG +PP+ + L+CRY P+LRL LK E+ ++P + L+ D + +E + + +
Sbjct: 239 CRGK-NLPPS-KSSLRCRYFREGSPFLRLAALKLEQLNIEPFVGLFHDAILQAEQEDLLR 296
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ + RL +I + R+ +H + RI +R+E +TG +E
Sbjct: 297 LTESRLEHK----------KIESSRVEAKVDTNASDH--VRRIHQRIEDITGFDLEGSEP 344
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
L V N+GIGG H D +P ++DV GG F L
Sbjct: 345 LTVSNHGIGGQEAIHLDCGQPK--------------------LNDVQMGGYASFPDLGFG 384
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
P +G+A WHN + G+ D A CPVL G+
Sbjct: 385 FKPVRGSALVWHNTDNCGNCDIRGLQATCPVLLGN 419
>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
Length = 297
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 99/202 (49%), Gaps = 22/202 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + + +PR LY + D+E D + +A+ + ++ V + +G+ ++ R S A
Sbjct: 32 PARVTQLSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVADNDSGKSLMSQVRTSSGA 91
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
+L + E ++ I +RV T L AE +QV+ Y IG Y+ H+D+ + N K
Sbjct: 92 FLAKHEDEIVSAIEKRVAAWTFLPEENAESMQVLRYEIGQKYDAHFDYFH--DKNNVKH- 148
Query: 202 GTGNRVATVLFYMSDVAQGGATVF------------------TSLNLSLWPEKGTAAFWH 243
G R ATVL Y++DV +GG TVF + L++ P+KG A +
Sbjct: 149 -GGQRFATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFF 207
Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
LH + D + H +CPV+ G
Sbjct: 208 GLHLNATTDTSSLHGSCPVIEG 229
>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
Length = 288
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 99/201 (49%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + K+A+P ++++TV + TG+ + + R S +L + +
Sbjct: 83 EPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTGKSKDSRVRTSSGTFLTRGQDKI 142
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T L E LQ+++Y +G YEPHYD+ + + + G R+ATV
Sbjct: 143 IRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYF----LDDYNTKNGGQRMATV 198
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LS+ P+ G A + ++
Sbjct: 199 LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSXCGKEGLSVKPKMGDALLFWSMKPDASL 258
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 259 DPSSLHGGCPVIKG-NKWSST 278
>gi|194745802|ref|XP_001955376.1| GF16267 [Drosophila ananassae]
gi|190628413|gb|EDV43937.1| GF16267 [Drosophila ananassae]
Length = 385
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 67/210 (31%), Positives = 97/210 (46%), Gaps = 34/210 (16%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG TVP L+CRY P+L+L PLK E+ L P I ++ DV+ E +
Sbjct: 51 CRGRNTVPKKFY--LRCRYFTEGDPFLQLAPLKLEQLNLDPFIGIFHDVISIGEQKNLIN 108
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ + RLR +QN + +E A ++ S +ERI RR+E MTGL +
Sbjct: 109 LTRNRLR---LQNPQRAVME-AEVELNASK--------EVERIHRRIEDMTGLNLEESPP 156
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
L ++NYGIGG + H D + F +SDV GG F L
Sbjct: 157 LTILNYGIGGQHPIHLDCEQ--------------------FMLSDVQMGGYASFPELGFG 196
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACP 261
P +G+A HN+ ++ + D + A CP
Sbjct: 197 FKPSRGSALVVHNMDNAANCDIRSLQATCP 226
>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 304
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 102/206 (49%), Gaps = 26/206 (12%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K ++ PR +Y + D E D + +A+ L+R++V + +G+ +++ R S A
Sbjct: 38 PAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSSGA 97
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + P++ I ++ T L E++QV+ Y G Y+ H+D+ A+
Sbjct: 98 FIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYF----ADKVNIA 153
Query: 202 GTGNRVATVLFYMSDVAQGGATVF--------------TSLNLS--------LWPEKGTA 239
G+R+ATVL Y+SDV +GG TVF T+ +LS + P KG A
Sbjct: 154 RGGHRMATVLMYLSDVEKGGETVFLLRRSESQRRQASETNEDLSDCAKKGIAVKPRKGDA 213
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTG 265
+ +LH + D + H CPV+ G
Sbjct: 214 LLFFSLHPNAIPDTSSLHGGCPVIEG 239
>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 196
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 84/157 (53%), Gaps = 5/157 (3%)
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ Q LR ATV + +TG+ R+S+ AW + +HP+++ ++ + +TG+ E
Sbjct: 31 IGQSLLRPATVTDEQTGQEVAHGERVSEMAWPKRDDHPILQSLAEGIAQLTGIPIDCQEP 90
Query: 172 LQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
LQ+++Y GG Y+PHYD FA A+A GNR T++ Y++ V +GG T F L L
Sbjct: 91 LQILHYRPGGEYKPHYDAFA----ADAPTLRQGGNRQGTLILYLNAVEEGGETAFPELGL 146
Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ P G F+ NL+ G + HA PV G
Sbjct: 147 QVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRKGEK 183
>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
Length = 287
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 61/205 (29%), Positives = 101/205 (49%), Gaps = 17/205 (8%)
Query: 76 PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
P L K E+ +PR +Y + + D E + +K++A+ RL ++TV + KTG+ +
Sbjct: 29 PPQELWRGKVEQVSWRPRAFVYHNFLSDEECEHLKELARKRLTKSTVVDNKTGKSMDSTV 88
Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
R S +L E V+ I +R+ +T + E +Q++ Y G YEPH D+ +
Sbjct: 89 RTSSGTFLARGEDEVVRAIEKRISLVTMIPEENGEAIQILKYVDGQKYEPHTDYFH--DK 146
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF----TSLNLSLWPE-----------KGTAA 240
++ G RVAT+L Y+S +GG TVF + W E KG+A
Sbjct: 147 YNSRTENGGQRVATILMYLSTPEEGGETVFPYAEKKVEGEGWSECARKGLAVKAVKGSAL 206
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
+++L +G+ D + H +CP L G
Sbjct: 207 LFYSLKPNGEEDQASTHGSCPTLAG 231
>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
Length = 297
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 96/192 (50%), Gaps = 19/192 (9%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR L ++ + D E D I + A+P++ +++V + ++G+ + R S W + E VI
Sbjct: 49 PRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWFAKGEDSVI 108
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATV 210
+I +RV +T + E LQV++Y G YEPHYD F P NA G G RV T+
Sbjct: 109 SKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDP--VNAGPEHG-GQRVVTM 165
Query: 211 LFYMSDVAQGGATVFTSL---------------NLSLWPEKGTAAFWHNLHSSGDGDYYT 255
L Y++ V +GG TV + L++ P KG A +++L G D +
Sbjct: 166 LMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGSNDPAS 225
Query: 256 RHAACPVLTGSN 267
H +CP L G
Sbjct: 226 LHGSCPTLKGDK 237
>gi|194765140|ref|XP_001964685.1| GF23318 [Drosophila ananassae]
gi|190614957|gb|EDV30481.1| GF23318 [Drosophila ananassae]
Length = 412
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 94/206 (45%), Gaps = 47/206 (22%)
Query: 65 QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
+L C Y P+LR+ P K E+ L P ++++ DV+ EI + + +L +A N
Sbjct: 221 RLMCYYNSSTTPFLRIAPFKTEQIGLDPYVVVFHDVLSPREISKLISLTDRKLVQAVTVN 280
Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
K+ + + R +K+ W+ + +RI RR+ M+G + AE Q
Sbjct: 281 KKSFKEMV---RTAKAHWVYRGYQELTKRIYRRIHDMSGFELADAENFQ----------- 326
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN----LSLWPEKGTAA 240
+SDV QGGATVF ++ +++P GTAA
Sbjct: 327 -----------------------------LSDVEQGGATVFPGISADSAYTVYPRAGTAA 357
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGS 266
W+NLH+ G GD T H ACPV+ GS
Sbjct: 358 MWYNLHTDGLGDPTTLHVACPVIVGS 383
>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
Length = 303
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 58/183 (31%), Positives = 92/183 (50%), Gaps = 11/183 (6%)
Query: 88 AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
A QPR++++ +++ E D + A PR+ R+ KTG EI + R S + + +
Sbjct: 112 AIAQPRVVVFGNLLSPEECDALIADAAPRMARSLTVATKTGGEEINDDRTSDGMFFQRGQ 171
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTG 204
P+I+RI R+ + E LQV++Y G Y+PHYD+ A PG K G
Sbjct: 172 SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTIVKR--GG 229
Query: 205 NRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPV 262
RV T++ Y++ +GG T F +++ + P++G A F + H S T H PV
Sbjct: 230 QRVGTLVMYLNTPEKGGGTTFPDVHVEVAPQRGNAVFFSYERPHPS----TRTLHGGAPV 285
Query: 263 LTG 265
L G
Sbjct: 286 LAG 288
>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
Length = 329
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 105/211 (49%), Gaps = 30/211 (14%)
Query: 74 NVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIA 133
+VP R++ L QPR+ LY+ ++ E D + K+AQ RL R+ V + TGE ++
Sbjct: 44 DVPDSRMVVLS-----WQPRVFLYKGILTQEECDYLIKIAQGRLERSGVSDATTGEGGVS 98
Query: 134 NYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARP 192
+ R S + E+ V++RI R+ T L E +QV+ Y Y+PH+D F+
Sbjct: 99 DIRTSSGMFYTRGENDVVKRIETRLAMWTMLPVENGEGIQVLRYEKTQKYDPHHDYFSFE 158
Query: 193 G-EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-----------------NLSLWP 234
G +AN GNR+ATVL Y++ +GG TVF + L++ P
Sbjct: 159 GRDANG------GNRMATVLMYLATPEEGGETVFPKIPVPAGQTRANFSECGMKGLAVKP 212
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
KG A + ++ G + + H +CPV+ G
Sbjct: 213 VKGDAVLFWSIRPDGRFEPGSLHGSCPVIRG 243
>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
Length = 299
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 94/175 (53%), Gaps = 3/175 (1%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR++++ +++ + E D I A+PR++R+ + ++G + + R S + + E+ +I
Sbjct: 112 PRVVVFGNLLSNEECDAIIAAARPRMQRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 171
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
R+ +R+ + E +QV++Y G Y+PHYD+ P E L G RV T+
Sbjct: 172 SRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 231
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ Y+++ A+GGAT F + L + P +G A F+ ++ + T H PVL G
Sbjct: 232 VMYLNEPARGGATTFPDVGLQVVPRRGNAVFFS--YNRPEPATKTLHGGAPVLEG 284
>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 301
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 102/205 (49%), Gaps = 25/205 (12%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K ++ +PR +Y+ + D E D + +A+ L+R+ V + +GE +++ R S
Sbjct: 37 PTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGM 96
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + ++ I ++ T L E++QV+ Y G Y+PHYD+ A+
Sbjct: 97 FISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVNIA 152
Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGTAA 240
G+RVATVL Y+++V +GG TVF T +LS + P +G A
Sbjct: 153 RGGHRVATVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDLSECGKKGVAVKPRRGDAL 212
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
+ +LH + D + HA CPV+ G
Sbjct: 213 LFFSLHPNAIPDTLSLHAGCPVIEG 237
>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 316
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 61/202 (30%), Positives = 99/202 (49%), Gaps = 22/202 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + + PR LY+ + D E D K+A+ +L ++ V + +GE + R S
Sbjct: 53 PTRVTQLSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 112
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
+L + + ++ + ++ T + E +Q+++Y G YEPH+D+ +AN L
Sbjct: 113 FLSKRQDDIVANVEAKLAAWTFIPEENGESMQILHYENGQKYEPHFDYFHD-QANL--EL 169
Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------TSLNLSLW-----------PEKGTAAFWH 243
G G+R+ATVL Y+S+V +GG TVF T L W P KG A +
Sbjct: 170 G-GHRIATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTECAKQGYAVKPRKGDALLFF 228
Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
NLH + D + H +CPV+ G
Sbjct: 229 NLHPNATTDSNSLHGSCPVVEG 250
>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 103/207 (49%), Gaps = 25/207 (12%)
Query: 80 LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
+ P K ++ +PR +Y + D E D + +A+ L+R+ V + ++G+ +++ R S
Sbjct: 34 INPAKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSS 93
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
++ + + P++ I ++ T L E++QV+ Y G Y+PHYD+ ++
Sbjct: 94 GMFITKAKDPIVAGIEDKIATWTFLPRENGEDIQVLRYEHGQKYDPHYDYF----SDKVN 149
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSL---------------------NLSLWPEKGT 238
G+RVATVL Y++DV +GG TVF S +++ P +G
Sbjct: 150 IARGGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKPRRGD 209
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + +L+ + D + HA CPV+ G
Sbjct: 210 ALLFFSLYPTAVPDTSSIHAGCPVIEG 236
>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 303
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 101/207 (48%), Gaps = 27/207 (13%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K ++ +PR +Y+ + D E D + +A+ L+R+ V + +GE +++ R S
Sbjct: 37 PTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGM 96
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + ++ I ++ T L E++QV+ Y G Y+PHYD+ A+
Sbjct: 97 FISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVNIA 152
Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSLNL-----------------------SLWPEKGT 238
G+RVATVL Y+++V +GG TVF + L ++ P +G
Sbjct: 153 RGGHRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDLSECGKKGVAVKPRRGD 212
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + +LH + D + HA CPV+ G
Sbjct: 213 ALLFFSLHPNAIPDTLSLHAGCPVIEG 239
>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
Length = 299
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 103/207 (49%), Gaps = 25/207 (12%)
Query: 80 LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
+ P K ++ PR +Y+ + D E D + +A+ L+R+ V + +G+ ++++ R S
Sbjct: 32 INPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSS 91
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
++ + + P++ I R+ T L E++QV+ Y G Y+PHYD+ A+
Sbjct: 92 GMFISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVN 147
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGT 238
+ G+R+ATVL Y+++V +GG TVF +++ P +G
Sbjct: 148 IVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRGD 207
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + +L ++ D + HA CPVL G
Sbjct: 208 ALLFFSLDTNAIPDTNSLHAGCPVLEG 234
>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 318
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 97/202 (48%), Gaps = 22/202 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + + +PR +YR+ + D E D +A+ +L ++ V + ++G+ + R S
Sbjct: 59 PTRVTQISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESGKSVESEVRTSSGM 118
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
+ R+ + V+ + R+ T L E +Q+++Y G YEPH+D+ + L
Sbjct: 119 FFRKAQDQVVANVEARIAAWTFLPEENGESIQILHYEHGQKYEPHFDYFHD---KVNQEL 175
Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------TSLNLSLW-----------PEKGTAAFWH 243
G G+RVATVL Y+SDV +GG TVF T W P KG A +
Sbjct: 176 G-GHRVATVLMYLSDVEKGGETVFPNSEAKKTQAKGDDWSDCAKKGYAVKPRKGDALLFF 234
Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
+LH D + H +CPV+ G
Sbjct: 235 SLHPDATTDPLSLHGSCPVIEG 256
>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
gi|224031897|gb|ACN35024.1| unknown [Zea mays]
gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
Length = 299
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 97/193 (50%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR L++ + D+E D + +A+ +L ++ V + ++G+ + R S +L + V
Sbjct: 42 RPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSEVRTSSGMFLERKQDEV 101
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI R+ T L E +Q+++Y G YEPHYD+ + A LG G+R+ATV
Sbjct: 102 VTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQA---LG-GHRIATV 157
Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
L Y+S+V +GG T+F + L W P KG A + +LH D
Sbjct: 158 LMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDATTD 217
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 218 SDSLHGSCPVIEG 230
>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
Length = 294
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 91/175 (52%), Gaps = 3/175 (1%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+++ +++ E D I A+PR+ R+ ++G EI + R S + + E ++
Sbjct: 107 PRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQRGETGIV 166
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
++ R+ + E LQV++YG G Y+PH+D+ PGE L G RV T+
Sbjct: 167 SQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRVGTL 226
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ Y+++ +GGAT+F + L + P +G A F+ + D T H PVL G
Sbjct: 227 VIYLNEPERGGATIFPEVPLQVVPRRGNAVFFS--YERPDPSTRTLHGGAPVLAG 279
>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 299
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 103/207 (49%), Gaps = 25/207 (12%)
Query: 80 LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
+ P K ++ PR +Y+ + D E D + +A+ L+R+ V + +G+ ++++ R S
Sbjct: 32 INPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSS 91
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
++ + + P++ I R+ T L E++QV+ Y G Y+PHYD+ A+
Sbjct: 92 GMFISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVN 147
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGT 238
+ G+R+ATVL Y+++V +GG TVF +++ P +G
Sbjct: 148 IVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRGD 207
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + +L ++ D + HA CPVL G
Sbjct: 208 ALLFFSLDTNAIPDTNSLHAGCPVLEG 234
>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
Length = 294
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 91/175 (52%), Gaps = 3/175 (1%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PRI+++ +++ E D I A+PR+ R+ ++G EI + R S + + E ++
Sbjct: 107 PRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQRGETGIV 166
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
++ R+ + E LQV++YG G Y+PH+D+ PGE L G RV T+
Sbjct: 167 SQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRVGTL 226
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ Y+++ +GGAT+F + L + P +G A F+ + D T H PVL G
Sbjct: 227 VIYLNEPERGGATIFPEVPLQVVPRRGNAVFFS--YERPDPSTRTLHGGAPVLAG 279
>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
Length = 290
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 99/201 (49%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P + +++V + +TG+ + + R S +L +
Sbjct: 85 EPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDSETGKSKDSRVRTSSGTFLARGRDKI 144
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I +R+ H + + E LQV++Y +G YEPHYD+ + F + G R+ATV
Sbjct: 145 VRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYF----LDDFNTKNGGQRIATV 200
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y++DV +GG TVF + LS+ P++G A + ++
Sbjct: 201 LMYLTDVEEGGETVFPAAKGNFSSVPWWNELSECGKKGLSIKPKRGDALLFWSMKPDATL 260
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 261 DPSSLHGGCPVIKG-NKWSST 280
>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
gi|194694488|gb|ACF81328.1| unknown [Zea mays]
gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 96/193 (49%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR L++ + D+E D + +A+ +L ++ V + K+G+ + R S +L + + V
Sbjct: 41 RPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGMFLEKKQDEV 100
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI R+ T L E +Q+++Y G YEPHYD+ A LG G+R+ATV
Sbjct: 101 VTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 156
Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
L Y+S+V +GG T+F + L W P KG A + +LH D
Sbjct: 157 LMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTD 216
Query: 253 YYTRHAACPVLTG 265
+ H +CP + G
Sbjct: 217 SDSLHGSCPAIEG 229
>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
Length = 211
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 94/175 (53%), Gaps = 10/175 (5%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P I+ + +V+ D E + A RL R+ K + EI++ R S + E E+P+
Sbjct: 29 EPLIVKFLNVLSDEECQNLIDCASSRLERS-----KLAKKEISSIRTSSGMFFEENENPL 83
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ + L AE LQV++Y G ++ H+DF P ++ + NR++T+
Sbjct: 84 ISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKAHFDFFGPNHPSS-----SNNRISTL 138
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ Y++DV +GG T F +L + P+KGTA ++ ++ + T H+ PV+ G
Sbjct: 139 VVYLNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHSGEPVIQG 193
>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
Length = 209
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 95/175 (54%), Gaps = 11/175 (6%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P I++ +V+ +E DL+ +A R++RA + + +++ R S S + E E+
Sbjct: 31 EPLILILDNVLSWAECDLLIDLASARMQRAKIGSSH----DVSEVRTSSSMFFEESENEC 86
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I ++ RV + + S AE LQV+ Y G Y PH+D+ G + NR++T+
Sbjct: 87 IGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFTQGSS-------MNNRISTL 139
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ Y++DV +GG T F SL+ S+ P+KG+A ++ ++ + T HA PV G
Sbjct: 140 VMYLNDVEEGGETYFPSLHFSVTPKKGSAVYFEYFYNDTRLNELTLHAGHPVEAG 194
>gi|195574593|ref|XP_002105269.1| GD21390 [Drosophila simulans]
gi|194201196|gb|EDX14772.1| GD21390 [Drosophila simulans]
Length = 478
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 97/216 (44%), Gaps = 34/216 (15%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG +P + L+CRY+ P+LRL P+K E+ +P + L+ D + +E + +
Sbjct: 255 CRGKNLLPSK--SYLRCRYLRDGSPFLRLAPVKLEQLNFEPFVGLFHDAISPAEQEDLLH 312
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ RL + E ++ +A +H + R+ +R+E +TG +E
Sbjct: 313 LTDSRLE------HTRKESSSVEAKVDTNA----SDH--VRRMHQRIEDITGFEMEESEP 360
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
L V NYGIGG H D +P +SDV GG F L
Sbjct: 361 LTVFNYGIGGQELIHLDCEQPE--------------------LSDVQMGGYASFPDLGFG 400
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
P +G+A WHN +SG+ D + A CPVL G+
Sbjct: 401 FKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQ 436
>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
Length = 233
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 96/192 (50%), Gaps = 19/192 (9%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR L ++ + D E D I + A+P++ +++V + ++G+ + R S W + E VI
Sbjct: 29 PRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWFAKGEDSVI 88
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATV 210
+I +RV +T + E LQV++Y G YEPHYD F P NA G G RV T+
Sbjct: 89 SKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDP--VNAGPEHG-GQRVVTM 145
Query: 211 LFYMSDVAQGGATVFTSL---------------NLSLWPEKGTAAFWHNLHSSGDGDYYT 255
L Y++ V +GG TV + L++ P KG A +++L G D +
Sbjct: 146 LMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGSNDPAS 205
Query: 256 RHAACPVLTGSN 267
H +CP L G
Sbjct: 206 LHGSCPTLKGDK 217
>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 287
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 98/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P + ++TV + +TG+ + + R S +LR +
Sbjct: 82 EPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGRDKI 141
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I+ I +R+ T + E LQV++Y G YEPHYD+ + F + G R+AT+
Sbjct: 142 IKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYF----VDEFNTKNGGQRMATM 197
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + N LS+ P G A + ++
Sbjct: 198 LMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATL 257
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 258 DPTSLHGGCPVIRG-NKWSST 277
>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
from Gallus gallus gi|212530 [Arabidopsis thaliana]
gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 287
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 98/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P + ++TV + +TG+ + + R S +LR +
Sbjct: 82 EPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGRDKI 141
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I+ I +R+ T + E LQV++Y G YEPHYD+ + F + G R+AT+
Sbjct: 142 IKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYF----VDEFNTKNGGQRMATM 197
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + N LS+ P G A + ++
Sbjct: 198 LMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATL 257
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 258 DPTSLHGGCPVIRG-NKWSST 277
>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
Length = 225
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 96/192 (50%), Gaps = 19/192 (9%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR L ++ + D E D I + A+P++ +++V + ++G+ + R S W + E VI
Sbjct: 21 PRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWFAKGEDSVI 80
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATV 210
+I +RV +T + E LQV++Y G YEPHYD F P NA G G RV T+
Sbjct: 81 SKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDP--VNAGPEHG-GQRVVTM 137
Query: 211 LFYMSDVAQGGATVFTSL---------------NLSLWPEKGTAAFWHNLHSSGDGDYYT 255
L Y++ V +GG TV + L++ P KG A +++L G D +
Sbjct: 138 LMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGSNDPAS 197
Query: 256 RHAACPVLTGSN 267
H +CP L G
Sbjct: 198 LHGSCPTLKGDK 209
>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
Length = 224
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 96/192 (50%), Gaps = 19/192 (9%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR L ++ + D E D I + A+P++ +++V + ++G+ + R S W + E VI
Sbjct: 20 PRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWFAKGEDSVI 79
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATV 210
+I +RV +T + E LQV++Y G YEPHYD F P NA G G RV T+
Sbjct: 80 SKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDP--VNAGPEHG-GQRVVTM 136
Query: 211 LFYMSDVAQGGATVFTSL---------------NLSLWPEKGTAAFWHNLHSSGDGDYYT 255
L Y++ V +GG TV + L++ P KG A +++L G D +
Sbjct: 137 LMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGSNDPAS 196
Query: 256 RHAACPVLTGSN 267
H +CP L G
Sbjct: 197 LHGSCPTLKGDK 208
>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 248
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 54/155 (34%), Positives = 85/155 (54%), Gaps = 5/155 (3%)
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ Q LR ATV + +TG+ R+S+ AW + ++P+++ ++ + +TG+ E
Sbjct: 83 IGQSLLRPATVTDEQTGQEVAHGERVSEMAWPKRDDYPILQSLAEGIAQLTGIPIDCQEP 142
Query: 172 LQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
LQ+++Y GG Y+PHYD FA A+A GNR AT++ Y++ V +GG T F L L
Sbjct: 143 LQILHYRPGGEYKPHYDAFA----ADAPTLRQGGNRQATLILYLNAVEEGGETAFPELGL 198
Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ P G F+ NL+ G + HA PV G
Sbjct: 199 QVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRKG 233
>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
gi|255647110|gb|ACU24023.1| unknown [Glycine max]
Length = 289
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 98/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P + ++TV + +TG+ + + R S +L +
Sbjct: 84 EPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSRVRTSSGTFLARGRDKI 143
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I +++ T + E LQV++Y +G YEPHYD+ + F + G R+ATV
Sbjct: 144 VRNIEKKISDFTFIPVEHGEGLQVLHYEVGQKYEPHYDYF----LDDFNTKNGGQRIATV 199
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y++DV +GG TVF + LS+ P++G A + ++
Sbjct: 200 LMYLTDVEEGGETVFPAAKGNFSFVPWWNELFECGKKGLSIKPKRGDALLFWSMKPDASL 259
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 260 DPSSLHGGCPVIKG-NKWSST 279
>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 289
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 98/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P + ++TV + +TG+ + + R S +L +
Sbjct: 84 EPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSRVRTSSGTFLARGRDKI 143
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I +++ T + E LQV++Y +G YEPHYD+ + F + G R+ATV
Sbjct: 144 VRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYF----LDEFNTKNGGQRIATV 199
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y++DV +GG TVF + LS+ P++G A + ++
Sbjct: 200 LMYLTDVEEGGETVFPAAKGNFSNVPWYNELSDCGKKGLSIKPKRGDALLFWSMKPDATL 259
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 260 DASSLHGGCPVIKG-NKWSST 279
>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Glycine max]
Length = 297
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 101/201 (50%), Gaps = 21/201 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K ++ +PR +Y + + E D + +A+ L+R+ V + +GE +++ R S
Sbjct: 37 PSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGM 96
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + P++ + ++ T L E++QV+ Y G Y+PHYD+ A+
Sbjct: 97 FIPKNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVNIA 152
Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSLNL-----------------SLWPEKGTAAFWHN 244
G+RVATVL Y++DV +GG TVF + L ++ P +G A + +
Sbjct: 153 RGGHRVATVLMYLTDVTKGGETVFPNAELKSSETKEDLSECAQKGIAVKPRRGDALLFFS 212
Query: 245 LHSSGDGDYYTRHAACPVLTG 265
L+ + D + HA CPV+ G
Sbjct: 213 LYPNAIPDTMSLHAGCPVIEG 233
>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
Length = 1062
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 58/202 (28%), Positives = 101/202 (50%), Gaps = 22/202 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + + +PR LY + E D + +A+ R+ ++ V + +G+ ++ R S
Sbjct: 34 PARVTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGT 93
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
+L + E ++ I +RV T L AE +Q+++Y +G Y+ H+D+ + N K
Sbjct: 94 FLSKHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFH--DKNNLKR- 150
Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWH 243
G+RVATVL Y++DV +GG TVF + L++ P+KG A +
Sbjct: 151 -GGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFF 209
Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
+LH + D + H +CPV+ G
Sbjct: 210 SLHVNATTDPASLHGSCPVIEG 231
>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 211
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 61/196 (31%), Positives = 97/196 (49%), Gaps = 23/196 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR LY + E + + ++A+P L ++TV + TG+ + + R S +L + +
Sbjct: 8 EPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSRVRTSSGTFLVRGQDHI 67
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I+RI +R+ T + E LQV+ Y YEPHYD+ +AF + G R+ATV
Sbjct: 68 IKRIEKRIADFTFIPVEQGEGLQVLQYRESEKYEPHYDYFH----DAFNTKNGGQRIATV 123
Query: 211 LFYMSDVAQGGATVF--TSLN-----------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + +N LS+ P G A + ++
Sbjct: 124 LMYLSDVEKGGETVFPASKVNASEVPDWDQRSECAKRGLSVRPRMGDALLFWSMKPDAKL 183
Query: 252 DYYTRHAACPVLTGSN 267
D + H ACPV+ G+
Sbjct: 184 DPTSLHGACPVIQGTK 199
>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
Length = 307
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 98/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P + ++TV + TG+ + + R S +L+ + V
Sbjct: 102 EPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRNKV 161
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y +G YEPH+D+ + F + G R+AT+
Sbjct: 162 IRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRIATL 217
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG T+F N LS+ P+ G A + ++
Sbjct: 218 LMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDATL 277
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 278 DPLSLHGGCPVIKG-NKWSST 297
>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
Length = 263
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 61/203 (30%), Positives = 100/203 (49%), Gaps = 19/203 (9%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
LRL +K E PRII++ + + E D + +A+PRL+ +TV + TG+ ++ R
Sbjct: 50 LRLRYVKPEVISWTPRIIIFHNFLSSEECDYLMAIARPRLQMSTVVDVATGKGVKSDVRT 109
Query: 138 SKSAWLREPEH--PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
S ++ E PVI+ I +R+ + + E +QV+ Y +Y PH+D+ +
Sbjct: 110 SSGMFVNSEERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYF----S 165
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTAAFW 242
+ F G RVAT+L Y++D +GG T F L + P KG A +
Sbjct: 166 DTFNLKRGGQRVATMLMYLTDGVEGGETHFLQAGDGECSCGGNVVKGLCVKPNKGDAVLF 225
Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
++ G+ D + H+ CPVL G
Sbjct: 226 WSMGLDGNTDPNSIHSGCPVLKG 248
>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 98/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P + ++TV + +TG+ + + R S +LR +
Sbjct: 82 EPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGRDKI 141
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I+ I +R+ T + E LQ+++Y G YEPHYD+ + F + G R+AT+
Sbjct: 142 IKTIEKRIADYTFIPADHGEGLQILHYEAGQKYEPHYDYF----VDEFNTKNGGQRMATM 197
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + N LS+ P G A + ++
Sbjct: 198 LMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATL 257
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 258 DPTSLHGGCPVIRG-NKWSST 277
>gi|195069793|ref|XP_001997027.1| GH12976 [Drosophila grimshawi]
gi|193891496|gb|EDV90362.1| GH12976 [Drosophila grimshawi]
Length = 83
Score = 97.4 bits (241), Expect = 8e-18, Method: Composition-based stats.
Identities = 43/53 (81%), Positives = 46/53 (86%)
Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
MSDV QGGATVFTSL +LWP+KGTAAFW NLH SG+GD TRHAACPVLTGS
Sbjct: 1 MSDVQQGGATVFTSLRTALWPKKGTAAFWMNLHRSGEGDARTRHAACPVLTGS 53
>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
Length = 289
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 82/152 (53%), Gaps = 1/152 (0%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR+++ ++ D E D + ++++PRLRR+T + +TG ++ R S+ + HPV
Sbjct: 102 PRVVVLGGLLSDEECDALVELSRPRLRRSTTVDAQTGGSQVHADRTSRGTFFERGAHPVC 161
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-GTGNRVATV 210
I R+ + E LQV++Y G + PHYD+ P E A L G RVATV
Sbjct: 162 ATIEARIARLLEWPVENGEGLQVLHYPPGAEFRPHYDYFDPDEPGAEVLLRQGGQRVATV 221
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
+ Y++ A+GGAT F +L + KG A F+
Sbjct: 222 VMYLNTPARGGATTFPDAHLEVAAVKGNAVFF 253
>gi|226479086|emb|CAX73038.1| Proline HYdroxylase [Schistosoma japonicum]
Length = 437
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 60/148 (40%), Positives = 85/148 (57%), Gaps = 8/148 (5%)
Query: 4 PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
PT++RA N+ YY E L++ P+ + A + E E YE LCR + P
Sbjct: 291 PTNERAINNEAYYVEQLDRGEGRLGPNPR--SQATSKHDQETELYESLCRDENPFPTVPS 348
Query: 64 AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
L CRY + Y R+ P+KEE Y PRI+++ D+++ SEI+ IK +A PRLRRATV+
Sbjct: 349 HYLTCRYYTPHAFY-RIGPVKEETLYPDPRIVMWYDLIFPSEIEKIKDLATPRLRRATVK 407
Query: 124 NYKTGELEIANYRISKS-----AWLREP 146
N TG LE+A YR SK+ W++ P
Sbjct: 408 NPITGNLEVAFYRTSKALGFRILWMKSP 435
>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
expressed [Oryza sativa Japonica Group]
gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
Length = 299
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 58/202 (28%), Positives = 101/202 (50%), Gaps = 22/202 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + + +PR LY + E D + +A+ R+ ++ V + +G+ ++ R S
Sbjct: 34 PARVTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGT 93
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
+L + E ++ I +RV T L AE +Q+++Y +G Y+ H+D+ + N K
Sbjct: 94 FLSKHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFH--DKNNLKR- 150
Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWH 243
G+RVATVL Y++DV +GG TVF + L++ P+KG A +
Sbjct: 151 -GGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFF 209
Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
+LH + D + H +CPV+ G
Sbjct: 210 SLHVNATTDPASLHGSCPVIEG 231
>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 288
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 98/193 (50%), Gaps = 23/193 (11%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRA-TVQNYKTGELEIANYRISKSAWLREPEHPV 150
PR LY+ + D E D + K+A+ +L ++ V + +GE E + R S +L + + +
Sbjct: 39 PRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDI 98
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ + ++ T L E LQ+++Y G Y+PH+D+ +A LG G+R+ATV
Sbjct: 99 VANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKA---LELG-GHRIATV 154
Query: 211 LFYMSDVAQGGATVFTS-------LNLSLW-----------PEKGTAAFWHNLHSSGDGD 252
L Y+S+V +GG TVF + L W P KG A + NLH +G D
Sbjct: 155 LMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTD 214
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 215 PNSLHGSCPVIEG 227
>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
Length = 283
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 103/208 (49%), Gaps = 19/208 (9%)
Query: 73 RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEI 132
++ LR+ +K E PRII+ D + E + +K +A+PRL+ +TV + KTG+
Sbjct: 65 KDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVK 124
Query: 133 ANYRISKSAWLR--EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
++ R S +L E +P+I+ I +R+ + + E +QV+ Y Y+PH+D+
Sbjct: 125 SDVRTSSGMFLTHVERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYEPKQFYKPHHDYF 184
Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLSLWPEKG 237
A+ F G RVAT+L Y++D +GG T F +S+ P KG
Sbjct: 185 ----ADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMKGISVKPTKG 240
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + ++ G D + H C VL+G
Sbjct: 241 DAVLFWSMGLDGQSDPRSIHGGCEVLSG 268
>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
Length = 299
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 102/207 (49%), Gaps = 25/207 (12%)
Query: 80 LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
+ P K ++ PR +Y+ + D E D + +A+ L+R+ V + +G+ ++++ R S
Sbjct: 32 INPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSS 91
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
+ + + P++ I R+ T L E++QV+ Y G Y+PHYD+ A+
Sbjct: 92 GMLISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVN 147
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGT 238
+ G+R+ATVL Y+++V +GG TVF +++ P +G
Sbjct: 148 IVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRGD 207
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + +L ++ D + HA CPVL G
Sbjct: 208 ALLFFSLDTNAIPDTNSLHAGCPVLEG 234
>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
Length = 283
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 103/208 (49%), Gaps = 19/208 (9%)
Query: 73 RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEI 132
++ LR+ +K E PRII+ D + E + +K +A+PRL+ +TV + KTG+
Sbjct: 65 KDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVK 124
Query: 133 ANYRISKSAWLR--EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
++ R S +L E +P+I+ I +R+ + + E +QV+ Y Y+PH+D+
Sbjct: 125 SDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDYF 184
Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLSLWPEKG 237
A+ F G RVAT+L Y++D +GG T F +S+ P KG
Sbjct: 185 ----ADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMKGISVKPTKG 240
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + ++ G D + H C VL+G
Sbjct: 241 DAVLFWSMGLDGQSDPRSIHGGCEVLSG 268
>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
Length = 288
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 94/195 (48%), Gaps = 23/195 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR LY + + E + + +A+P + ++TV + KTG + + R S +LR V
Sbjct: 83 EPRAFLYHNFLSKEECEYLINLAKPHMMKSTVVDSKTGRSKDSRVRTSSGMFLRRGRDRV 142
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ + + E LQV++Y +G YE H+D+ + F + G R AT+
Sbjct: 143 IREIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEAHFDYF----LDEFNTKNGGQRTATL 198
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + N LSL P+ G A + +
Sbjct: 199 LMYLSDVEEGGETVFPAANMNISAVPWWNELSECAKQGLSLKPKMGNALLFWSTRPDATL 258
Query: 252 DYYTRHAACPVLTGS 266
D + H +CPV+ G+
Sbjct: 259 DPSSLHGSCPVIRGN 273
>gi|195341061|ref|XP_002037130.1| GM12749 [Drosophila sechellia]
gi|194131246|gb|EDW53289.1| GM12749 [Drosophila sechellia]
Length = 467
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 95/215 (44%), Gaps = 34/215 (15%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
CRG +P + L+CRY P+LRL P+K E+ +P + L D + +E + +
Sbjct: 255 CRGKNLLPNK--SSLRCRYFRGGSPFLRLAPVKLEQLNFEPFVGLVHDAISQAEQEDLLH 312
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
+ RL + E ++ +A +H + RI +R+E +TG +E
Sbjct: 313 LTDSRLE------HTRKESSSVEAKVDTNA----SDH--VRRIHQRIEDITGFDMEESEP 360
Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
L V NYGIGG H D +P +SDV GG F L
Sbjct: 361 LIVSNYGIGGQELIHLDCEQPK--------------------LSDVQMGGYASFPDLGFG 400
Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
P +G+A WHN +SG+ D + A CPVL G+
Sbjct: 401 FKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGN 435
>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 253
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 98/193 (50%), Gaps = 23/193 (11%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRA-TVQNYKTGELEIANYRISKSAWLREPEHPV 150
PR LY+ + D E D + K+A+ +L ++ V + +GE E + R S +L + + +
Sbjct: 4 PRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDI 63
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ + ++ T L E LQ+++Y G Y+PH+D+ +A LG G+R+ATV
Sbjct: 64 VANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKA---LELG-GHRIATV 119
Query: 211 LFYMSDVAQGGATVFTS-------LNLSLW-----------PEKGTAAFWHNLHSSGDGD 252
L Y+S+V +GG TVF + L W P KG A + NLH +G D
Sbjct: 120 LMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTD 179
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 180 PNSLHGSCPVIEG 192
>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
Length = 307
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 97/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P + ++TV + TG+ + + R S +L+ V
Sbjct: 102 EPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKV 161
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y +G YEPH+D+ + F + G R+AT+
Sbjct: 162 IRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRIATL 217
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG T+F N LS+ P+ G A + ++
Sbjct: 218 LMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDATL 277
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 278 DPLSLHGGCPVIKG-NKWSST 297
>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 294
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 101/217 (46%), Gaps = 25/217 (11%)
Query: 70 YVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGE 129
+V + + P K ++ +PR +Y + D E + + +A+ L+R+ V + ++G
Sbjct: 18 FVRESSSSAIINPSKAKQISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVADNESGN 77
Query: 130 LEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF 189
+ + R S ++ + + P++ I ++ T L EE+QV+ Y G YEPHYD+
Sbjct: 78 SKTSEVRTSSGMFIPKAKDPIVSGIEEKIATWTFLPKENGEEIQVLRYEEGQKYEPHYDY 137
Query: 190 ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS------------------ 231
+ G+R+ATVL Y+++V +GG TVF S
Sbjct: 138 F----VDKVNIARGGHRLATVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKK 193
Query: 232 ---LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ P KG A +++LH + D + H CPV+ G
Sbjct: 194 GIPVKPRKGDALLFYSLHPNATPDPLSLHGGCPVIQG 230
>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
Length = 307
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 97/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P + ++TV + TG+ + + R S +L+ V
Sbjct: 102 EPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKV 161
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y +G YEPH+D+ + F + G R+AT+
Sbjct: 162 IRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRIATL 217
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG T+F N LS+ P+ G A + ++
Sbjct: 218 LMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPGATL 277
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 278 DPLSLHGGCPVIKG-NKWSST 297
>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
Length = 220
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 100/179 (55%), Gaps = 15/179 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGEL-EIANYRISKSAWLREPE 147
Y +P +++ +V+ DSE D + + ++ RL+R+ K GE + + R S + + E
Sbjct: 38 YEEPLVVVLGNVLSDSECDELIEHSRERLQRS-----KIGEDGSVNSIRTSSGVFCEQTE 92
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF-ARPGEANAFKSLGTGNR 206
I RI +R+ + + + LQV+ Y G Y+PHYDF A A+ T NR
Sbjct: 93 --TITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS------TNNR 144
Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++T++ Y++DV QGG TVF L+LS++P KG A ++ +S+ + + +T HA V+ G
Sbjct: 145 ISTLVMYLNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYSNQELNDFTLHAGTQVIHG 203
>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
Length = 313
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 93/187 (49%), Gaps = 23/187 (12%)
Query: 96 LYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERIS 155
++ + + + E D I +A+P L R+ V + TG EI++ R SK +L + I
Sbjct: 43 IFINFLTEEECDHIVALAKPHLERSGVVDTATGGSEISDIRTSKGMFLERGHDDTVAAIE 102
Query: 156 RRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
R+ T L E LQV+NY G Y+ ++ GE+N GNR ATVL Y++
Sbjct: 103 ERIARWTLLPVGNGEGLQVLNYHPGEKYDDYFFDKVNGESNG------GNRYATVLMYLN 156
Query: 216 DVAQGGATVFTSL-----------------NLSLWPEKGTAAFWHNLHSSGDGDYYTRHA 258
V +GG TVF ++ +L+ P KG+A +H++ SGD + + H
Sbjct: 157 TVEEGGETVFPNIPAPGGDNGPTFTECARRHLAAKPTKGSAVLFHSIKPSGDLERRSLHT 216
Query: 259 ACPVLTG 265
ACPV+ G
Sbjct: 217 ACPVVKG 223
>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
Length = 299
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 98/203 (48%), Gaps = 25/203 (12%)
Query: 84 KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
K ++ +PR +Y + D E D + +A+ L+R+ V + GE ++++ R S ++
Sbjct: 37 KVKQVSAKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFI 96
Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
+ + P++ I ++ T L E+LQV+ Y G Y+ H+D+ + N +
Sbjct: 97 SKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEPGQKYDAHFDYFHD-KVNIARG--- 152
Query: 204 GNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGTAAFW 242
G+R+ATVL Y+S+V +GG TVF +++ P+KG A +
Sbjct: 153 GHRIATVLLYLSNVTKGGETVFPDAQEYSRRSLSENKDDLSDCAKKGIAVKPKKGNALLF 212
Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
NL D ++ H CPV+ G
Sbjct: 213 FNLQQDAIPDPFSLHGGCPVIEG 235
>gi|313215430|emb|CBY42983.1| unnamed protein product [Oikopleura dioica]
Length = 469
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 90/170 (52%), Gaps = 10/170 (5%)
Query: 34 NNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPY--LRLMPLKEEEAYLQ 91
N P ++YE LCR P + LKC Y P L+ P+K EE +
Sbjct: 256 NLTRPEAHYESMQEYERLCR---EFSPPHKSSLKCFYWTGPSPLSPLQWAPVKTEELHDD 312
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE---- 147
P ++ + +V+ D E I+ +A L RAT+Q+ TG+L A+YRI K+AWL E E
Sbjct: 313 PLVVQFYEVISDEEERAIQFLAGEHLNRATIQDPATGKLVNADYRIQKTAWLTEFEKFDV 372
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF-ARPGEAN 196
+ I + + ++ +TGL AE +QV NYG+ G YEPH+D + PG N
Sbjct: 373 NGTIAKYNEKLTKITGLDADYAELVQVGNYGVAGQYEPHWDHQSYPGAEN 422
>gi|193209070|ref|NP_001123049.1| Protein PHY-4, isoform b [Caenorhabditis elegans]
gi|172051527|emb|CAQ35068.1| Protein PHY-4, isoform b [Caenorhabditis elegans]
Length = 282
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 105/216 (48%), Gaps = 7/216 (3%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C +L + +L C +H+++ ++ L E LQ ++R + + +++
Sbjct: 36 CGKELRGDSSRDGRLVCYRLHKHLLIRKVEILSSEPFILQYHNQVHRRLAKRA----VQE 91
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVE-HMTGLTTSTAE 170
RL + + + T E + R + WL P RI ++ ++ L STAE
Sbjct: 92 AEALRLEQLKISGFTTTP-EKSQVRAANGTWLIHTGRPSFARIFEGLQANINSLDLSTAE 150
Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
Q+++Y G+Y PHYD+ P N G GNR+ATVL + +GG TVF LNL
Sbjct: 151 PWQILSYNADGYYAPHYDYLNPA-TNVQLVEGRGNRIATVLVILQIAKKGGTTVFPRLNL 209
Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
++ P+ G W N S+G+ + T HAACP+ G+
Sbjct: 210 NIRPKAGDVIVWLNTLSTGESNSQTLHAACPIHEGT 245
>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 296
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 55/168 (32%), Positives = 88/168 (52%), Gaps = 1/168 (0%)
Query: 99 DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
DV E + + +A+PRL +T + TG + R S + R E+ + R+ R+
Sbjct: 106 DVFSAEECEALIALARPRLAPSTSVDPLTGRNRLGAQRSSLGMFFRLRENAFVARLDERL 165
Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDV 217
+ L E LQV++Y G PH+DF P A SL +G RV+T++ Y+++V
Sbjct: 166 SELMNLPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLVAYLNEV 225
Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+GG TVF S+ P++G A ++ +S G D+ + HA PVL+G
Sbjct: 226 EEGGETVFPETGWSVSPQRGGAVYFEYCNSLGQVDHASLHAGAPVLSG 273
>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 326
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 94/198 (47%), Gaps = 23/198 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR LY + + E + + +A+P + ++ V + +TG + R S A+L+ +
Sbjct: 121 EPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSRERTSSGAFLKRGSDRI 180
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
++ I RR+ T + E V++Y +G YEPHYD+ + F + G R+AT+
Sbjct: 181 VKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYF----MDTFSTTYAGQRIATM 236
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LS+ P+ G A + ++
Sbjct: 237 LMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAILFWSMKPDATL 296
Query: 252 DYYTRHAACPVLTGSNSL 269
D + H ACPV+ G L
Sbjct: 297 DPSSLHGACPVIKGDKWL 314
>gi|193209068|ref|NP_001123048.1| Protein PHY-4, isoform a [Caenorhabditis elegans]
gi|172051526|emb|CAQ35067.1| Protein PHY-4, isoform a [Caenorhabditis elegans]
Length = 278
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 105/216 (48%), Gaps = 7/216 (3%)
Query: 52 CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
C +L + +L C +H+++ ++ L E LQ ++R + + +++
Sbjct: 36 CGKELRGDSSRDGRLVCYRLHKHLLIRKVEILSSEPFILQYHNQVHRRLAKRA----VQE 91
Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVE-HMTGLTTSTAE 170
RL + + + T E + R + WL P RI ++ ++ L STAE
Sbjct: 92 AEALRLEQLKISGFTTTP-EKSQVRAANGTWLIHTGRPSFARIFEGLQANINSLDLSTAE 150
Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
Q+++Y G+Y PHYD+ P N G GNR+ATVL + +GG TVF LNL
Sbjct: 151 PWQILSYNADGYYAPHYDYLNPA-TNVQLVEGRGNRIATVLVILQIAKKGGTTVFPRLNL 209
Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
++ P+ G W N S+G+ + T HAACP+ G+
Sbjct: 210 NIRPKAGDVIVWLNTLSTGESNSQTLHAACPIHEGT 245
>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
Length = 299
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 98/203 (48%), Gaps = 25/203 (12%)
Query: 84 KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
K ++ +PR +Y + D E D + +A+ L+R+ V + GE ++++ R S ++
Sbjct: 37 KVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFI 96
Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
+ + P++ I ++ T L E+LQV+ Y G Y+ H+D+ + N +
Sbjct: 97 SKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHD-KVNIARG--- 152
Query: 204 GNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGTAAFW 242
G+R+ATVL Y+S+V +GG TVF +++ P+KG A +
Sbjct: 153 GHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLF 212
Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
NL D ++ H CPV+ G
Sbjct: 213 FNLQQDAIPDPFSLHGGCPVIEG 235
>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 266
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/203 (30%), Positives = 100/203 (49%), Gaps = 19/203 (9%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
LRL +K E PRII++ + + E D +K++A+PRL +TV + TG+ ++ R
Sbjct: 53 LRLGYVKPEVISWTPRIIVFHNFLSSEECDFLKEIARPRLEISTVVDVATGKGVKSDVRT 112
Query: 138 SKSAWLREPEH--PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
S ++ E PVI+ I +R+ + + E +QV+ Y +Y PH+D+ +
Sbjct: 113 SSGMFVNSEERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEPSQYYRPHHDYF----S 168
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTAAFW 242
+ F G RVAT+L Y++D +GG T F L + P KG A +
Sbjct: 169 DTFNLKRGGQRVATMLMYLTDGVEGGETHFPQAGDGECSCGGRIVRGLCVKPNKGDAVLF 228
Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
++ G+ D + H+ C VL G
Sbjct: 229 WSMGLDGNTDSNSIHSGCAVLKG 251
>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
Length = 302
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 103/207 (49%), Gaps = 27/207 (13%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K ++ +PR +Y+ + + E D + +A+ L+R+ V + +G+ ++++ R S
Sbjct: 38 PSKVKQVSWKPRAFVYKGFLTELECDHLISLAKSELKRSAVADNLSGDSKLSDVRTSSGM 97
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + P++ I ++ T L E++QV+ Y G Y+PHYDF A+
Sbjct: 98 FISKNKDPIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDFF----ADKVNIA 153
Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSL-----------------------NLSLWPEKGT 238
G+RVATVL Y+++V +GG TVF + +++ P +G
Sbjct: 154 RGGHRVATVLMYLTNVTRGGETVFPNAEVEEFPRHRGSETIDDLSECAKKGIAVKPRRGD 213
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + +L+ + D + HA CPV+ G
Sbjct: 214 ALLFFSLYPNAVPDTMSLHAGCPVIEG 240
>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
Length = 286
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/186 (30%), Positives = 91/186 (48%), Gaps = 11/186 (5%)
Query: 87 EAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREP 146
+A PR++++ ++ D E + + +A+PRL R+ KTG E+ R S + +
Sbjct: 94 QAMYNPRVVVFGSLLSDQECEQLIGLAKPRLARSLTVATKTGGEEVNEDRTSSGMFFQRG 153
Query: 147 EHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGT 203
E+ ++ RI R+ + E LQV++Y G Y+PHYD+ A PG K
Sbjct: 154 ENELVARIEARIARLVNWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILKR--G 211
Query: 204 GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACP 261
G RV T++ Y+ + +GG T F ++L + P++G F + H S T H P
Sbjct: 212 GQRVGTLVMYLGEPEKGGGTTFPDVHLEVAPKRGHGVFFSYERPHPS----TRTLHGGAP 267
Query: 262 VLTGSN 267
VL G
Sbjct: 268 VLAGEK 273
>gi|194871369|ref|XP_001972835.1| GG15736 [Drosophila erecta]
gi|190654618|gb|EDV51861.1| GG15736 [Drosophila erecta]
Length = 476
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/201 (30%), Positives = 101/201 (50%), Gaps = 27/201 (13%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
L CRYV P+L+L PLK EE ++ I ++ V+ +ID +K +++P+L+R +
Sbjct: 294 LVCRYVDW-TPFLKLAPLKMEELSMETHISIFYGVLRQKDIDELKNVSRPKLQRIE---H 349
Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
+G +S S+ H V+ +++ + +TG + + L+V+NYGI G+Y P
Sbjct: 350 LSGNCSCKIGNLSSSS------HDVVRKVNELILDITGFPSKGNQMLEVINYGIAGNYNP 403
Query: 186 HYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL 245
D ARP + N A L ++ + +GG VF S +L + P KG+ W NL
Sbjct: 404 D-DTARPRKQNK----------ANALIFLDNAERGGEIVFPSRHLKVRPRKGSMLVWMNL 452
Query: 246 HSSGDGDYYTRHAACPVLTGS 266
S + CP+L G+
Sbjct: 453 ERS------VIYHQCPILKGN 467
>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 297
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 98/203 (48%), Gaps = 25/203 (12%)
Query: 84 KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
K ++ +PR +Y + D E D + +A+ L+R+ V + GE ++++ R S ++
Sbjct: 35 KVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFI 94
Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
+ + P++ I ++ T L E+LQV+ Y G Y+ H+D+ + N +
Sbjct: 95 SKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHD-KVNIARG--- 150
Query: 204 GNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGTAAFW 242
G+R+ATVL Y+S+V +GG TVF +++ P+KG A +
Sbjct: 151 GHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLF 210
Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
NL D ++ H CPV+ G
Sbjct: 211 FNLQQDAIPDPFSLHGGCPVIEG 233
>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
Length = 229
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 89/176 (50%), Gaps = 10/176 (5%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P I+L +V+ + E D + +++ R+ R+ + N +L R S S + + E+ V
Sbjct: 43 EPLIVLLGNVLSEEECDQLISLSKDRIERSKISNKSVHDL-----RTSSSMFFDDAENDV 97
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ + +RV + + E +Q++NY IG Y+ HYD+ G + R++T+
Sbjct: 98 VSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAHYDYFSSGNSKV-----NNPRISTL 152
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+ Y++DV GG T F LN + P+KG A ++ ++ + T H PV+ G
Sbjct: 153 VMYLNDVEAGGETYFPKLNFYVAPKKGMAVYFEYFYNDTTLNELTLHGGAPVVIGD 208
>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 316
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 104/216 (48%), Gaps = 23/216 (10%)
Query: 69 RYVHRNVPY-LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKT 127
R NVP + + P + +PR LY + E D + MA+ +L ++ V + ++
Sbjct: 38 RLKSENVPSSVGVDPSHVTQLSWKPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVADNES 97
Query: 128 GELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHY 187
G+ + R S +L++ + V+ I R+ T L E +Q+++Y G YEPH+
Sbjct: 98 GKSIPSEVRTSSGMFLQKAQDDVVAAIEARIAAWTFLPIENGEAMQILHYERGQKYEPHF 157
Query: 188 DFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF----TSLNL------------- 230
D+ + LG G+R+ATVL Y+S+V +GG TVF L L
Sbjct: 158 DYFHD---KVNQQLG-GHRIATVLMYLSNVEEGGETVFPNAEAKLQLANNESLSDCAKGG 213
Query: 231 -SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
S+ P+KG A + +LH D + H +CPV+ G
Sbjct: 214 YSVKPKKGDALLFFSLHPDASTDSLSLHGSCPVIEG 249
>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 296
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 90/178 (50%), Gaps = 1/178 (0%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P + + + E + + +AQPRL R+ V + TG IA +R S + R E P+
Sbjct: 101 RPAAVHLANFLSADECEQLIALAQPRLDRSAVVDPVTGRDVIATHRSSHGMFFRLGETPL 160
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG-EANAFKSLGTGNRVAT 209
I RI R+ +T E LQ+++Y G PH D+ G EAN +G R+ T
Sbjct: 161 IARIEARIAELTATPVENGEGLQMLHYEEGAESTPHVDYLMTGNEANRESIARSGQRMGT 220
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+L Y+ DV GG TVF + S+ P++G A ++ + G D + HA+ P+ TG
Sbjct: 221 LLMYLKDVEGGGETVFPQVGWSIVPQRGHALYFEYGNRYGMCDPSSLHASTPLRTGDK 278
>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
Length = 215
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/178 (30%), Positives = 99/178 (55%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
Y +P +++ +V+ DSE D + + ++ RL+R+ + ++ + + R S + + E
Sbjct: 33 YEEPLVVVLGNVLSDSECDELIEHSRERLQRSKIGEDRS----VNSIRTSSGVFCEQTE- 87
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF-ARPGEANAFKSLGTGNRV 207
I RI +R+ + + + LQV+ Y G Y+PHYDF A A+ T NR+
Sbjct: 88 -TITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS------TNNRI 140
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV QGG TVF L+LS++P KG A ++ + + + + +T HA V+ G
Sbjct: 141 STLVMYLNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYRNQEVNEFTLHAGAQVIHG 198
>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
Length = 308
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 98/202 (48%), Gaps = 22/202 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + + PR LY+ + D E D + +A+ +L ++ V + ++G+ + R S
Sbjct: 43 PTRVTQLSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESEVRTSSGM 102
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + +++ I R+ T L E +Q+++Y G YEPH+D+ A + L
Sbjct: 103 FIGKSQDEIVDDIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHD---KANQEL 159
Query: 202 GTGNRVATVLFYMSDVAQGGATVF------------------TSLNLSLWPEKGTAAFWH 243
G G+RV TVL Y+S+V +GG TVF ++ P+KG A +
Sbjct: 160 G-GHRVVTVLMYLSNVGKGGETVFPNSEGKTIQPKDDSWSDCAKNGYAVKPQKGDALLFF 218
Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
+LH D + H +CPV+ G
Sbjct: 219 SLHPDATTDTNSLHGSCPVIEG 240
>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
Length = 318
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 87/177 (49%), Gaps = 3/177 (1%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR++++ +++ E + + A PR+ R+ +TG E+ + R S + + E P++
Sbjct: 131 PRVVVFGNLLSPEECEALIAAAAPRMARSLTVATQTGGEEVNDDRTSHGMFFQRGESPLV 190
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
+RI R+ + E LQV++Y G Y+PHYD+ P E + G RV T+
Sbjct: 191 QRIEERIASLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTVIQRGGQRVGTL 250
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y++ QGG T F + + P++G AAF+ + T H PVL G
Sbjct: 251 VMYLNTPEQGGGTTFPDAQIEVAPQRGNAAFFS--YERPTPSTRTLHGGAPVLAGDK 305
>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
Length = 299
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 98/203 (48%), Gaps = 25/203 (12%)
Query: 84 KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
K ++ +PR +Y + D E D + +A+ L+R+ V + GE ++++ R S ++
Sbjct: 37 KVKQVSSKPRAFVYGGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFI 96
Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
+ + P++ I ++ T L E+LQV+ Y G Y+ H+D+ + N +
Sbjct: 97 SKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHD-KVNIARG--- 152
Query: 204 GNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGTAAFW 242
G+R+ATVL Y+S+V +GG TVF +++ P+KG A +
Sbjct: 153 GHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLF 212
Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
NL D ++ H CPV+ G
Sbjct: 213 FNLQQDAIPDPFSLHGGCPVIEG 235
>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
Length = 287
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 63/203 (31%), Positives = 100/203 (49%), Gaps = 19/203 (9%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
LR+ +K E PRII+ D + E D ++ +A+PRLR +TV + KTG+ + R
Sbjct: 74 LRIGYVKPEIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVKTGKGIESKVRT 133
Query: 138 SKSAWLREPE--HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
S +L E + V++ I +R+ + + E +QV+ Y +Y+PH+D+ +
Sbjct: 134 SSGMFLSSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQYYKPHHDYF----S 189
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLSLWPEKGTAAFW 242
+ F G RVAT+L Y+SD +GG T F LS+ P KG A +
Sbjct: 190 DTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGSGKCSCGGKVVDGLSVKPIKGNAVLF 249
Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
++ G D + H C VL+G
Sbjct: 250 WSMGLDGQSDPSSIHGGCEVLSG 272
>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
sativus]
Length = 313
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 99/202 (49%), Gaps = 22/202 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + + QPR LY+ + D+E D + +A+ +L ++ V + +G+ + R S
Sbjct: 50 PTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGM 109
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
+LR+ + V+ + R+ T L E +Q+++Y G YEPH+DF + L
Sbjct: 110 FLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHD---KVNQEL 166
Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSLNL------------------SLWPEKGTAAFWH 243
G G+R+ATVL Y+S+V +GG T+F + ++ +KG A +
Sbjct: 167 G-GHRIATVLMYLSNVEKGGETIFPNSEFKESQAKDESWSDCSRKGYAVKAQKGDALLFF 225
Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
+L+ D + H +CPV+ G
Sbjct: 226 SLNLDATTDERSLHGSCPVIAG 247
>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 289
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/240 (29%), Positives = 105/240 (43%), Gaps = 26/240 (10%)
Query: 40 LEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMP---------LKEEEAYL 90
LE+ R+ E R + PPA V + P+L P ++ A
Sbjct: 51 LEIVLRDIVEAGTRQKVLPPPARVPE----------PFLDGAPATLWAHDREVRVVMAMR 100
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
PR+I++ ++ D+E D I +A RL R+ + TG E+ R S + EHPV
Sbjct: 101 DPRVIVFSGLLSDAECDEIVALAGARLARSHTVDTATGASEVNAARTSDGMFFTRGEHPV 160
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-GTGNRVAT 209
R R+ + E LQV++Y G Y+PHYD+ P + L G RVAT
Sbjct: 161 CARFEARIAALLNWPVENGEGLQVLHYRPGAEYKPHYDYFDPDQPGTPAVLRRGGQRVAT 220
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPVLTGSN 267
++ Y++ +GG T F + L + P KG A F + H S + H PVL G
Sbjct: 221 LVTYLNTPTRGGGTTFPDIGLEVTPLKGHAVFFSYDRPHPS----TRSLHGGAPVLEGDK 276
>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 296
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/168 (32%), Positives = 87/168 (51%), Gaps = 1/168 (0%)
Query: 99 DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
DV E + + +A+PRL +T + +G + R S + R E+ I R+ +RV
Sbjct: 106 DVFDPQECEELIALARPRLAPSTTVDPLSGRDLVGEQRSSLGMFFRLRENAFIARLDQRV 165
Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDV 217
+ L E LQV+ Y G PH+DF P A SL +G RV+T++ Y+++V
Sbjct: 166 SELMNLPVENGEGLQVLCYPAGAQSMPHFDFLVPSNAANKASLARSGQRVSTLVSYLNEV 225
Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+GG T+F S+ P +G+A ++ +S G D+ + HA PVL G
Sbjct: 226 EEGGETIFPECGWSVPPRRGSAVYFEYCNSLGQVDHASLHAGGPVLHG 273
>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
Length = 307
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 97/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P + ++TV + TG+ + + R S +L+ V
Sbjct: 102 EPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKV 161
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y +G YEPH+D+ + F + G R+AT+
Sbjct: 162 IRAIEKRIADYTFIPADHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRMATL 217
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG T+F N LS+ P+ G A + ++
Sbjct: 218 LMYLSDVEEGGETIFPDANVNASSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDATL 277
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 278 DPLSLHGGCPVIRG-NKWSST 297
>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
Length = 308
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 97/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P + ++TV + TG+ + + R S +L+ V
Sbjct: 103 EPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKV 162
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y +G YEPH+D+ + F + G R+AT+
Sbjct: 163 IRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRMATL 218
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG T+F N LS+ P+ G A + ++
Sbjct: 219 LMYLSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDATL 278
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 279 DPLSLHGGCPVIRG-NKWSST 298
>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Glycine max]
Length = 301
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 102/205 (49%), Gaps = 25/205 (12%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K ++ +PR +Y + + E D + +A+ L+R+ V + +GE +++ R S
Sbjct: 37 PSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGM 96
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + P++ + ++ T L E++QV+ Y G Y+PHYD+ A+
Sbjct: 97 FIPKNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVNIA 152
Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGTAA 240
G+RVATVL Y++DV +GG TVF T +LS + P +G A
Sbjct: 153 RGGHRVATVLMYLTDVTKGGETVFPNAEESPRHRGSETKEDLSECAQKGIAVKPRRGDAL 212
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
+ +L+ + D + HA CPV+ G
Sbjct: 213 LFFSLYPNAIPDTMSLHAGCPVIEG 237
>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 100/211 (47%), Gaps = 29/211 (13%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
L P + + QPR LY+ + D+E D + +A+ +L ++ V + +G+ + R
Sbjct: 25 LIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRT 84
Query: 138 SKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANA 197
S +LR+ + V+ + R+ T L E +Q+++Y G YEPH+DF
Sbjct: 85 SSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHD---KV 141
Query: 198 FKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW-----------------------P 234
+ LG G+R+ATVL Y+S+V +GG T+F N +W
Sbjct: 142 NQELG-GHRIATVLMYLSNVEKGGETIFP--NSEVWYGSESQAKDESWSDCSRKGYAVKA 198
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+KG A + +L+ D + H +CPV+ G
Sbjct: 199 QKGDALLFFSLNLDATTDERSLHGSCPVIAG 229
>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 55/195 (28%), Positives = 94/195 (48%), Gaps = 23/195 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + + E + + +A+P + ++ V + KTG+ + R S +L+ +
Sbjct: 86 EPRAFVYHNFLTNEECEHLISLAKPSMVKSKVVDVKTGKSIDSRVRTSSGTFLKRGHDEI 145
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+E I R+ T + E LQV++Y +G YEPH+D+ + F G R+ATV
Sbjct: 146 VEEIENRISDFTFIPIENGEGLQVLHYEVGQKYEPHHDYF----FDEFNVRKGGQRIATV 201
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LS+ P+K A + ++
Sbjct: 202 LMYLSDVDEGGETVFPAAKGNISDVPWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASL 261
Query: 252 DYYTRHAACPVLTGS 266
D + H CPV+ G+
Sbjct: 262 DPSSLHGGCPVIKGN 276
>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
Length = 210
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 91/178 (51%), Gaps = 11/178 (6%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P + + +V+ D E D + +++ R+ R+ + + ++ R S S +L E
Sbjct: 30 FHEPFVAVLGNVLSDEECDELISLSKDRMNRSKIAGNQENDI-----RTSTSVFLPEDAS 84
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
V++R+ +R+ + + E LQ++NY IG Y+ H+DF P K L R++
Sbjct: 85 EVVQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSP------KKLIENPRIS 138
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
T++ Y++DV +GG T F +L LS+ P KG A ++ + + T H PV G
Sbjct: 139 TLVLYLNDVEEGGDTYFPNLKLSVSPHKGMAVYFEYFYDDPMLNELTLHGGAPVTIGD 196
>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
Length = 219
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 95/176 (53%), Gaps = 13/176 (7%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ-NYKTGELEIANYRISKSAWLREPEHP 149
+P I++ +V+ D E + + +M++ +++R+ + + KT ++ R S A+L E E
Sbjct: 41 EPLIVVLANVLSDEECETLIEMSKNKMKRSKIGISRKTNDI-----RTSSGAFLEESE-- 93
Query: 150 VIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVAT 209
+ RI RR+ + + E LQ++ Y +G Y+ HYDF A A + NR++T
Sbjct: 94 ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFVENSAAA-----SNNRMST 148
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++ Y++ V +GG T F LNLS+ P+KG A ++ + + T H PV+ G
Sbjct: 149 LVMYLNHVEEGGETFFPKLNLSVSPKKGMAVYFEYFYQDESINKLTLHGGAPVIKG 204
>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
Length = 219
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 95/176 (53%), Gaps = 13/176 (7%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ-NYKTGELEIANYRISKSAWLREPEHP 149
+P I++ +V+ D E + + +M++ +++R+ + + KT ++ R S A+L E E
Sbjct: 41 EPLIVVLANVLSDEECETLIEMSKNKMKRSKIGVSRKTNDI-----RTSSGAFLEESE-- 93
Query: 150 VIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVAT 209
+ RI RR+ + + E LQ++ Y +G Y+ HYDF A A + NR++T
Sbjct: 94 ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFVENSAAA-----SNNRMST 148
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++ Y++ V +GG T F LNLS+ P+KG A ++ + + T H PV+ G
Sbjct: 149 LVMYLNHVEEGGETFFPKLNLSVSPKKGMAVYFEYFYQDESINKLTLHGGAPVIKG 204
>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 295
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 96/207 (46%), Gaps = 34/207 (16%)
Query: 76 PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
P + P + +PR+ LY+ + D E + + +A+ L+R+ V + +G+ ++
Sbjct: 42 PAAVVYPHHSRQISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLS-- 99
Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
E P++E I ++ T L E++QV+ Y G YEPHYD+
Sbjct: 100 -----------EDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYF----T 144
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF-----------------TSLNLSLWPEKGT 238
+ ++ G+R ATVL Y++DV +GG TVF +++ P KG
Sbjct: 145 DNVNTVRGGHRYATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGD 204
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + NL+ G D + H CPV+ G
Sbjct: 205 ALLFFNLNPDGTTDSVSLHGGCPVIKG 231
>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii
Length = 233
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 94/191 (49%), Gaps = 19/191 (9%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR L ++ + D E D I + A+P+ +++V + ++G+ + R S W + E VI
Sbjct: 29 PRAFLLKNFLSDEECDYIVEKARPKXVKSSVVDNESGKSVDSEIRTSTGTWFAKGEDSVI 88
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATV 210
+I +RV +T + E LQV++Y G YEPHYD F P NA G G RV T
Sbjct: 89 SKIEKRVAQVTXIPLENHEGLQVLHYHDGQKYEPHYDYFHDP--VNAGPEHG-GQRVVTX 145
Query: 211 LFYMSDVAQGGATVFTSL---------------NLSLWPEKGTAAFWHNLHSSGDGDYYT 255
L Y++ V +GG TV + L++ P KG A +++L G D +
Sbjct: 146 LXYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALXFYSLKPDGSNDPAS 205
Query: 256 RHAACPVLTGS 266
H +CP L G
Sbjct: 206 LHGSCPTLKGD 216
>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
Length = 303
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 96/193 (49%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR L++ + D+E D + +A+ +L ++ V + ++G+ + R S +L + + V
Sbjct: 46 RPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSVQSEVRTSSGMFLEKKQDEV 105
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I R+ T L E +Q+++Y G YEPHYD+ A LG G+R+ATV
Sbjct: 106 VRGIEERIAAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 161
Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
L Y+S+V +GG T+F + L W P KG A + +LH D
Sbjct: 162 LMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDATTD 221
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 222 SESLHGSCPVIEG 234
>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
Length = 296
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 88/175 (50%), Gaps = 3/175 (1%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR+++ +++ E D I + A+P+L R+ TG E+ R S + + P +
Sbjct: 109 PRVVVLGNLLSAEECDAIIESAKPKLARSLTVQTATGGEELNADRTSSGMFFTRGQTPEV 168
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
+ RR+ + G E LQV++Y G Y+PHYD+ P EA L G RVAT+
Sbjct: 169 TAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEAGTPTILKRGGQRVATL 228
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ Y+++ A+GG T F + L + P KG+A F+ + + H PVL G
Sbjct: 229 VMYLNEPARGGGTTFPDVGLEVAPVKGSAVFFS--YDRPHPTTRSLHGGAPVLEG 281
>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 298
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 98/193 (50%), Gaps = 22/193 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR L++ + + E D + ++A+ +L ++ V + ++G+ + R S +L + + V
Sbjct: 41 RPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSEVRTSSGMFLEKRQDEV 100
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ RI R+ T L + E +Q+++Y G YEPHYD+ A LG G+R+ATV
Sbjct: 101 VARIEERIAAWTFLPSENGESIQILHYKNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 156
Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
L Y+S+V +GG T+F + ++ P KG A + +LH D
Sbjct: 157 LMYLSNVEKGGETIFPNAEGKLTQHKDETASECAKNGYAVKPMKGDALLFFSLHPDATTD 216
Query: 253 YYTRHAACPVLTG 265
+ H +CPV+ G
Sbjct: 217 PDSLHGSCPVIEG 229
>gi|195352174|ref|XP_002042589.1| GM14934 [Drosophila sechellia]
gi|194124473|gb|EDW46516.1| GM14934 [Drosophila sechellia]
Length = 438
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 103/204 (50%), Gaps = 33/204 (16%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
L CRYV +L+L PLK EE ++P I ++ + +I+++K ++P+L+R +
Sbjct: 256 LVCRYVDW-TQFLKLAPLKMEELSMKPHISIFYGFLGQKDIEVLKNASRPKLQRVK---H 311
Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
+G +S S+ H V+ +++ + +TG + + L+V+NYGI G+Y P
Sbjct: 312 LSGNCSCKIGNLSSSS------HDVVRKVNELILDITGFPSKGNQMLEVINYGIAGNYNP 365
Query: 186 HYDFARP---GEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
D A+P +ANAF ++ + +GG VF S +L + P KG+ FW
Sbjct: 366 E-DTAKPKIHNKANAF-------------IFLENAGKGGEIVFPSRHLKVRPRKGSMLFW 411
Query: 243 HNLHSSGDGDYYTRHAACPVLTGS 266
NL +S + CP+L G+
Sbjct: 412 ENLKNS------VIYHQCPILKGN 429
>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
Length = 310
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 97/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E D + +A+P + ++TV + TG+ + + R S +L+ V
Sbjct: 105 EPRAFVYHNFLSKEECDYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKV 164
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y +G YEPH+D+ + + + G R+AT+
Sbjct: 165 IRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYF----LDEYNTKNGGQRMATL 220
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG T+F N L++ P+ G A + ++
Sbjct: 221 LMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATL 280
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 281 DPLSLHGGCPVIKG-NKWSST 300
>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 278
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 52/194 (26%), Positives = 95/194 (48%), Gaps = 23/194 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR LY + + E + + A+P +++++V + +TG+ + ++ R S +L +
Sbjct: 73 EPRAFLYHNFLTKKECEHLINTAKPSMQKSSVVDNETGKSKDSSVRTSSGTFLDRGGDEI 132
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I +R+ T + E V+ Y +G Y+PH D+ A+ + ++ G R+AT+
Sbjct: 133 VRNIEKRIADFTFIPVENGESFNVLRYEVGQKYDPHLDYF----ADDYNTVNGGQRIATM 188
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LS+ P+ G A + ++ G
Sbjct: 189 LMYLSDVEEGGETVFPAAKGNISSVPWWNELSDCGKKGLSIKPKMGDALLFWSMKPDGTL 248
Query: 252 DYYTRHAACPVLTG 265
D + H ACPV+ G
Sbjct: 249 DPSSLHGACPVIKG 262
>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 98/201 (48%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P ++++TV + +TG + + R S +LR +
Sbjct: 83 EPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETGRSKDSRVRTSSGMFLRRGRDKI 142
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y +G Y+ HYD+ + F + G R+AT+
Sbjct: 143 IRDIEKRIADFTFIPVEHGEGLQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRIATL 198
Query: 211 LFYMSDVAQGGATVF--TSLN-----------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF T N LS+ P+ G A + ++
Sbjct: 199 LMYLSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSVKPKMGDALLFWSMRPDATL 258
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 259 DPSSLHGGCPVIKG-NKWSST 278
>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
Length = 287
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 87/181 (48%), Gaps = 11/181 (6%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR++++ + E D + +AQPRL R+ + TG E+ R S+ + E +I
Sbjct: 100 PRVVVFGGFLSHDECDALVALAQPRLARSETVDNDTGGSEVNEARTSQGMFFMRGEGELI 159
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
RI R+ + E +QV++Y G Y+PHYD+ A+PG K G RV
Sbjct: 160 SRIEARIAALLDWPLENGEGVQVLHYRPGAEYKPHYDYFDPAQPGTPTILKR--GGQRVG 217
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPVLTGS 266
T++ Y++ +GG T F +NL + P KG A F + H S + H PVL G
Sbjct: 218 TLVMYLNTPERGGGTTFPDVNLEVAPIKGNAVFFSYERAHPS----TRSLHGGAPVLAGE 273
Query: 267 N 267
Sbjct: 274 K 274
>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
Length = 321
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 97/213 (45%), Gaps = 44/213 (20%)
Query: 91 QPRIILYRDVMYDSEID-LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHP 149
+PR LY + D+E D LI Q ++ ++TV + ++GE + R S +L + +
Sbjct: 48 RPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESVTSKVRTSSGMFLDKKQDE 107
Query: 150 VIERISRRVEHMTGLTTS-----------------TAEELQVVNYGIGGHYEPHYDF--A 190
V+ RI R+ T L T E +Q++ YG G YEPH+D+
Sbjct: 108 VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPHFDYISG 167
Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL-------W---------- 233
R G S G+RVATVL Y+S+V GG T+F L W
Sbjct: 168 RQG------STREGDRVATVLMYLSNVKMGGETIFPDCEARLSQPKDETWSDCAEQGFAV 221
Query: 234 -PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
P KG+A + +LH + D + H +CPV+ G
Sbjct: 222 KPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEG 254
>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 298
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 97/207 (46%), Gaps = 25/207 (12%)
Query: 80 LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
+ P K ++ +PR +Y + + E D + +A+ L+R+ V + +GE + + R S
Sbjct: 32 INPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSS 91
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
++ + + P++ I ++ T L E++QV+ Y G Y+ H+D+ +
Sbjct: 92 GTFIPKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFH----DKVN 147
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGT 238
+ G+R+ATVL Y+S+V +GG TVF +++ P KG
Sbjct: 148 IVRGGHRIATVLMYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPRKGD 207
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + NLH D + H CPV+ G
Sbjct: 208 ALLFFNLHPDAIPDPLSLHGGCPVIEG 234
>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 300
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 64/240 (26%), Positives = 111/240 (46%), Gaps = 27/240 (11%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
K++ L L + + + + C Y + P K ++ +PR +Y + D E
Sbjct: 3 KFDNLLFIFLILTSSFIRESTCSYA--GSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
D + +A+ L+R+ V + +G+ +++ R S ++ + + P++ I ++ T L
Sbjct: 61 DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF- 225
E++QV+ Y G YE HYD+ + G+R+ATVL Y+S+V QGG TVF
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYF----VDKVNIAWGGHRLATVLMYLSNVTQGGETVFP 176
Query: 226 ------------TSLNLS--------LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
T +LS + P+KG A + +L + D + H CPVL G
Sbjct: 177 LAEKPSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEG 236
>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
Length = 839
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 102/205 (49%), Gaps = 25/205 (12%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K ++ +PR +Y + + E D + +A+ L+R+ V + +GE +++ R S
Sbjct: 575 PSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGM 634
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + ++ I ++ T L E++QV+ Y G Y+PHYD+ A+
Sbjct: 635 FIPKNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVNIA 690
Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGTAA 240
G+RVATVL Y++DV +GG TVF T+ NLS + P +G A
Sbjct: 691 RGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAVKPRRGDAL 750
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
+ +L+ + D + HA CPV+ G
Sbjct: 751 LFFSLYPNAIPDTLSLHAGCPVIEG 775
>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 290
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 95/201 (47%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + + E + + +A+P + ++ V + KTG+ + R S +L +
Sbjct: 86 EPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRVRTSSGTFLNRGHDEI 145
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+E I R+ T + E LQV++Y +G YEPH+D+ + F G R+ATV
Sbjct: 146 VEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYF----FDEFNVRKGGQRIATV 201
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LS+ P+K A + ++
Sbjct: 202 LMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASL 261
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H CPV+ G N ST
Sbjct: 262 DPSSLHGGCPVIKG-NKWSST 281
>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
Length = 301
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 102/205 (49%), Gaps = 25/205 (12%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K ++ +PR +Y + + E D + +A+ L+R+ V + +GE +++ R S
Sbjct: 37 PSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGM 96
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + ++ I ++ T L E++QV+ Y G Y+PHYD+ A+
Sbjct: 97 FIPKNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVNIA 152
Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGTAA 240
G+RVATVL Y++DV +GG TVF T+ NLS + P +G A
Sbjct: 153 RGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAVKPRRGDAL 212
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
+ +L+ + D + HA CPV+ G
Sbjct: 213 LFFSLYPNAIPDTLSLHAGCPVIEG 237
>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 263
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 99/203 (48%), Gaps = 19/203 (9%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
LRL +K E PRII++ + + E D + +A+PRL+ +TV + TG+ ++ R
Sbjct: 50 LRLGYVKPEVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRT 109
Query: 138 SKSAWLREPEH--PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
S ++ E PV++ I +R+ + + E +QV+ Y +Y PH+D+ +
Sbjct: 110 SSGMFVNSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYF----S 165
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTAAFW 242
+ F G RVAT+L Y++D GG T F L + P KG A +
Sbjct: 166 DTFNLKRGGQRVATMLMYLTDGVVGGETHFPQAGDGECSCGGNVVKGLCVKPNKGDAVLF 225
Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
++ G+ D + H+ CPVL G
Sbjct: 226 WSMGLDGNTDPNSIHSGCPVLKG 248
>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
Length = 216
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 103/196 (52%), Gaps = 16/196 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+T+ + + ++ + R S A+L E E
Sbjct: 36 FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR----DVNDIRTSSGAFLEENE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + + E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203
Query: 268 SLHSTCPCGLRRGLQR 283
+ + +RRG R
Sbjct: 204 WIATQW---VRRGTYR 216
>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 311
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 98/202 (48%), Gaps = 22/202 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + + PR LY+ + E D + +A+ +L ++ V + ++G+ + R S
Sbjct: 46 PTRVTQLSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESEVRTSSGM 105
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + ++ I R+ T L E +Q+++Y G YEPH+D+ A + L
Sbjct: 106 FIAKAQDEIVADIEARIAAWTFLPEENGESMQILHYEHGQKYEPHFDYFHD---KANQEL 162
Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWH 243
G G+RVATVL Y+S+V +GG TVF + L W PEKG A +
Sbjct: 163 G-GHRVATVLMYLSNVEKGGETVFPNAEGKLSQPKEDSWSDCAKGGYAVKPEKGDALLFF 221
Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
+LH D + H +CPV+ G
Sbjct: 222 SLHPDATTDSDSLHGSCPVIEG 243
>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
Length = 216
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 103/196 (52%), Gaps = 16/196 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+T+ + + ++ + R S A+L E E
Sbjct: 36 FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR----DVNDIRTSSGAFLEENE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + + E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203
Query: 268 SLHSTCPCGLRRGLQR 283
+ + +RRG R
Sbjct: 204 WIATQW---VRRGTYR 216
>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
Length = 297
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 103/207 (49%), Gaps = 25/207 (12%)
Query: 80 LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
+ P K ++ +PR +Y + E D + +A+ L+R+ V + G+ +++ R S
Sbjct: 31 INPSKVKQVSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVADNLPGDSKLSEVRTSS 90
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
++ + + P++ I ++ T L E++QV+ Y G Y+PHYD+ +
Sbjct: 91 GMFISKKKDPIVAGIEDKISAWTFLPKENGEDMQVLRYEHGQKYDPHYDYF----TDKVN 146
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGT 238
+ G+R+ATVL Y+++V +GG TVF T+ +LS + P +G
Sbjct: 147 IVRGGHRMATVLLYLTNVTRGGETVFPVAEEPPRRRGLETNSDLSECAKKGIAVKPRRGD 206
Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + +LH++ D + HA CPV+ G
Sbjct: 207 ALLFFSLHTTAIPDTDSLHAGCPVIEG 233
>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 313
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 101/211 (47%), Gaps = 25/211 (11%)
Query: 76 PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
P + P + +PR+ LY+ + D E + + +A+ L+R+ V + +G+ ++
Sbjct: 43 PASVVYPHHSRQISWKPRVFLYQHFLSDDEANHLLSLARAELKRSAVADNTSGKSTLSEV 102
Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
R S ++ + + P++ I ++ T L E++QV+ Y G EP +DF
Sbjct: 103 RTSYGTFISKGKDPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKDEPQFDFF----T 158
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF---------------TSLN------LSLWP 234
+ ++ G+RVATVL Y++DVA+GG TVF T+L+ +++ P
Sbjct: 159 DTVNTVRGGHRVATVLLYLTDVAEGGETVFPLAKDFTDTGLHDKDTTLSECAQKGIAVKP 218
Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
KG A + NL D + H C V+ G
Sbjct: 219 RKGDALLFFNLRPDAATDPLSLHGGCTVIKG 249
>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
Length = 269
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 92/193 (47%), Gaps = 24/193 (12%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
RI L+R + E D I+ A+ RL R+ V + +G +++ R S + E +
Sbjct: 42 DARIYLWRGFLTPEECDYIRMKAEKRLERSGVVDTASGSSVVSDIRTSDGMFFERGEDAI 101
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPH--YDFARPGEANAFKSLGTGNRVA 208
+E + +R+ T E LQV+ Y Y+ H Y F + G AN GNR A
Sbjct: 102 LEAVEQRLADWTMTPIWAGEALQVLRYRKDQKYDSHVNYFFHKEGSANG------GNRWA 155
Query: 209 TVLFYMSDVAQGGATVFTSL----------------NLSLWPEKGTAAFWHNLHSSGDGD 252
TVL Y++D +GG TVF + NL++ P KG A +H++ ++G +
Sbjct: 156 TVLTYLTDTEEGGETVFPKIPAPGGVNVGFSECAKYNLAVKPRKGDAILFHSMKTNGQLE 215
Query: 253 YYTRHAACPVLTG 265
+ H ACPV+ G
Sbjct: 216 ERSLHGACPVIKG 228
>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 96/196 (48%), Gaps = 23/196 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P ++++TV + +TG+ + + R S +L
Sbjct: 82 EPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSRVRTSSGTFLPRGRDKT 141
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I +R+ + + E LQV++Y +G YEPH+D+ + + + G R+ATV
Sbjct: 142 VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYF----LDEYNTKNGGQRIATV 197
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LS+ P++G A + ++
Sbjct: 198 LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASL 257
Query: 252 DYYTRHAACPVLTGSN 267
D + H CPV+ G+
Sbjct: 258 DPSSLHGGCPVIKGNK 273
>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 96/196 (48%), Gaps = 23/196 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P ++++TV + +TG+ + + R S +L
Sbjct: 82 EPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSRVRTSSGTFLPRGRDKT 141
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I +R+ + + E LQV++Y +G YEPH+D+ + + + G R+ATV
Sbjct: 142 VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYF----LDEYNTKNGGQRIATV 197
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LS+ P++G A + ++
Sbjct: 198 LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASL 257
Query: 252 DYYTRHAACPVLTGSN 267
D + H CPV+ G+
Sbjct: 258 DPSSLHGGCPVIKGNK 273
>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Ectocarpus siliculosus]
Length = 404
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 100/194 (51%), Gaps = 22/194 (11%)
Query: 90 LQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ--NYKTGELEIANYRISKSAWLREPE 147
++P + R+ + D E I++ A P ++ + V ++ G+ + N+R S + ++
Sbjct: 198 MEPLVFEARNFLLDEECKHIREKADPHMKPSPVSLMDHDKGKPD-TNWRTSTTYFMPSTR 256
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTG--N 205
P+++ I RRVE T + S E++QV+ Y G Y H+DF + +++ G N
Sbjct: 257 DPLLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQRYTAHHDFL---DERTMRNMDGGRKN 313
Query: 206 RVATVLFYMSDVAQGGATVF--------------TSLNLSLWPEKGTAAFWHNLHSSGDG 251
R+ TV +Y+SDV +GG T+F + L + P +G A +++L G
Sbjct: 314 RMITVFWYLSDVEEGGETIFPRYGGRTGRVDFSDCTTGLKVKPVEGKVAMFYSLKPDGQF 373
Query: 252 DYYTRHAACPVLTG 265
D ++ H ACPV+TG
Sbjct: 374 DDFSLHGACPVITG 387
>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
Length = 275
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 92/194 (47%), Gaps = 23/194 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR LY + + E + + +A+P + ++ V + KTG+ ++ R S +L +
Sbjct: 72 EPRAFLYHNFLTKEECEHLINIAKPSMHKSEVIDEKTGKSLNSSIRTSSGTFLDREGDEI 131
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I +R+ T + E V++Y +G YEPHYD+ + F + G R+AT+
Sbjct: 132 VSNIEKRIADFTFIPVEHGESFNVLHYEVGQKYEPHYDYF----LDTFSTRHAGQRIATM 187
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LS+ P+ G A + ++
Sbjct: 188 LMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAILFWSMKPDATL 247
Query: 252 DYYTRHAACPVLTG 265
D + H ACPV+ G
Sbjct: 248 DPSSLHGACPVIKG 261
>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 298
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/205 (25%), Positives = 96/205 (46%), Gaps = 25/205 (12%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K ++ +PR +Y + + E D + +A+ L+R+ V + +GE + + R S
Sbjct: 34 PSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGT 93
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + P++ I ++ T L E++QV+ Y G Y+ H+D+ + +
Sbjct: 94 FISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFH----DKVNIV 149
Query: 202 GTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGTAA 240
G+R+AT+L Y+S+V +GG TVF +++ P KG A
Sbjct: 150 RGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDAL 209
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
+ NLH D + H CPV+ G
Sbjct: 210 LFFNLHPDAIPDPLSLHGGCPVIEG 234
>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 287
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 93/195 (47%), Gaps = 23/195 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P ++++TV + +TG + + R S +L
Sbjct: 82 EPRAFVYHNFLTKEECEYLINLAKPNMQKSTVVDSETGRSKDSRVRTSSGTFLSRGRDKK 141
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ + + E LQV++Y +G YEPH+D+ + F + G RVAT+
Sbjct: 142 IRDIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFN----DEFNTKNGGQRVATL 197
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LS+ P G A + ++
Sbjct: 198 LMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSVKPNMGDALLFWSMKPDATL 257
Query: 252 DYYTRHAACPVLTGS 266
D + H CPV+ G+
Sbjct: 258 DPSSLHGGCPVINGN 272
>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
Length = 232
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 102/193 (52%), Gaps = 16/193 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R SK A+L + E
Sbjct: 52 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 106
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ E+I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 159
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 160 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 219
Query: 268 SLHSTCPCGLRRG 280
+ + +RRG
Sbjct: 220 WIATQW---VRRG 229
>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
Length = 298
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/205 (25%), Positives = 96/205 (46%), Gaps = 25/205 (12%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P K ++ +PR +Y + + E D + +A+ L+R+ V + +GE + + R S
Sbjct: 34 PSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGT 93
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + + P++ I ++ T L E++QV+ Y G Y+ H+D+ + +
Sbjct: 94 FISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFH----DKVNIV 149
Query: 202 GTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGTAA 240
G+R+AT+L Y+S+V +GG TVF +++ P KG A
Sbjct: 150 RGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENEEDLSDCAKRGIAVKPRKGDAL 209
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
+ NLH D + H CPV+ G
Sbjct: 210 LFFNLHPDAIPDPLSLHGGCPVIEG 234
>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
Length = 216
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 102/193 (52%), Gaps = 16/193 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R SK A+L + E
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ E+I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203
Query: 268 SLHSTCPCGLRRG 280
+ + +RRG
Sbjct: 204 WIATQW---VRRG 213
>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
Length = 215
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 98/178 (55%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
Y +P I++ +V+ + E D + + ++ RL+R+ K GE N +I S+ + E+
Sbjct: 33 YEEPLIVILGNVLSNEECDELIEHSKERLQRS-----KIGEERSVN-QIRTSSGVFCEEN 86
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF-ARPGEANAFKSLGTGNRV 207
+ +I +R+ + + + LQV+ Y G Y+PH+DF A A+A NR+
Sbjct: 87 ETVAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFADTSRASA------NNRI 140
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS++P KG A ++ +S+ + + T HA PV G
Sbjct: 141 STLVMYLNDVEEGGETTFPMLNLSVFPSKGMAVYFEYFYSNHELNERTLHAGAPVRKG 198
>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
Length = 232
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 101/192 (52%), Gaps = 14/192 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R SK A+L + E
Sbjct: 52 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 106
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
+ E+I +R+ + + S E L ++NY + Y+ HYD+ +A NR++
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-----ANNRIS 160
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNS 268
T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
Query: 269 LHSTCPCGLRRG 280
+ + +RRG
Sbjct: 221 IATQW---VRRG 229
>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
Length = 216
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ ++ R+ V + + ++ + R S A+L E E
Sbjct: 36 FEEPLIVVLANVLSDEECDELIELSKSKMERSKVGSSR----DVNDIRTSSGAFLEENE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ +T + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201
>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
Length = 253
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/198 (29%), Positives = 98/198 (49%), Gaps = 25/198 (12%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR + M E D I ++A+PR+RR+TV + TG+ ++ R S+ +L ++
Sbjct: 5 PRAFHLHNFMSHEECDRILEIARPRVRRSTVIDSVTGQSKVDPIRTSEQTFLNRGTWDIV 64
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT--GNRVAT 209
++ R+ +T L E++Q++ YG+G Y+ H+D A+ K L G+RVAT
Sbjct: 65 TKVEERLAVVTQLPAYHGEDMQILKYGLGQKYDAHHDVGELTSASG-KQLAAEGGHRVAT 123
Query: 210 VLFYMSDVAQGGATVF----------------------TSLNLSLWPEKGTAAFWHNLHS 247
VL Y+SDV +GG T F N+++ P KG + ++++
Sbjct: 124 VLLYLSDVEEGGETAFPDSEWMTPELRKWAEGQKWSDCAEGNVAVKPRKGDGLLFWSVNN 183
Query: 248 SGDGDYYTRHAACPVLTG 265
D ++ HA CPV+ G
Sbjct: 184 ENAIDPHSMHAGCPVIRG 201
>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
Group]
Length = 343
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/199 (27%), Positives = 95/199 (47%), Gaps = 26/199 (13%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR LY + + E + + +A+P ++++TV + TG + + R S +L + +
Sbjct: 116 EPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTSSGMFLGRGQDKI 175
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y +G YEPH+D+ + F + G R+AT+
Sbjct: 176 IRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFH----DEFNTKNGGQRIATL 231
Query: 211 LFYMSDVAQGGATVF-------------------TSLNLSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG T+F L++ P+ G A + ++ G
Sbjct: 232 LMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWSMRPDGSL 291
Query: 252 DYYTRHAACPV---LTGSN 267
D + H P+ LT SN
Sbjct: 292 DATSLHGEIPILWLLTNSN 310
>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 281
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 98/203 (48%), Gaps = 19/203 (9%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
LRL +K E PRIIL + + E D ++ +A PRL+ +TV + TG+ ++ R
Sbjct: 68 LRLGYVKPEVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDVRT 127
Query: 138 SKSAWL--REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
S +L E ++P+I I +R+ + + E +QV+ Y +Y PH+D+ +
Sbjct: 128 SSGMFLSHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYF----S 183
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTAAFW 242
+ F G R+AT+L Y+ D +GG T F S L + P KG A +
Sbjct: 184 DTFNLKRGGQRIATMLMYLGDNVEGGETHFPSAGSDECSCGGKLTKGLCVKPVKGNAVLF 243
Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
++ G D + H CPVL G
Sbjct: 244 WSMGLDGQSDPDSVHGGCPVLAG 266
>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
Length = 216
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 102/193 (52%), Gaps = 16/193 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +++R+ V + + ++ + R S A+L E E
Sbjct: 36 FEEPLIVVLANVLSDEECDELIELSKSKMKRSKVGSSR----DVNDIRTSSGAFLEENE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ +T + + E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203
Query: 268 SLHSTCPCGLRRG 280
+ + +RRG
Sbjct: 204 WIATQW---VRRG 213
>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
Length = 216
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 102/193 (52%), Gaps = 16/193 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+T+ + + ++ + R S A+L E E
Sbjct: 36 FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR----DVNDIRTSSGAFLEENE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + + E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203
Query: 268 SLHSTCPCGLRRG 280
+ + +RRG
Sbjct: 204 WIATQW---VRRG 213
>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
Length = 244
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 94/191 (49%), Gaps = 25/191 (13%)
Query: 93 RIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIE 152
RI L + D E D I ++++ RL R+ V G E + R S +L E PV++
Sbjct: 1 RIFLIEHFLTDEEADHIVQVSERRLERSGVVATNGGSEE-SQIRTSFGVFLERGEDPVVK 59
Query: 153 RISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEANAFKSLGTGNRVATV 210
+ R+ +T + E LQV+ Y Y+ H+D F + G AN GNR ATV
Sbjct: 60 GVEERISALTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGIANG------GNRYATV 113
Query: 211 LFYMSDVAQGGATVFTSL----------------NLSLWPEKGTAAFWHNLHSSGDGDYY 254
L Y+ D +GG TVF ++ +L+ P+KGTA +H++ +G+ +
Sbjct: 114 LMYLVDTEEGGETVFPNIAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERK 173
Query: 255 TRHAACPVLTG 265
+ H ACPV+ G
Sbjct: 174 SLHTACPVIKG 184
>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 289
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/195 (28%), Positives = 93/195 (47%), Gaps = 23/195 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P + ++TV + KTG + + R S +LR +
Sbjct: 84 EPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSKTGRSKDSRVRTSSGMFLRRGRDKI 143
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ + + E LQV++Y +G YE HYD+ + F + G R AT+
Sbjct: 144 IRNIEKRIADFSFIPIEHGEGLQVLHYEVGQKYEAHYDYF----LDEFNTKNGGQRTATL 199
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LS+ P+ G A + +
Sbjct: 200 LMYLSDVEEGGETVFPAAKANISNVPSWNELSECARQGLSVKPKMGNALLFWSTRPDATL 259
Query: 252 DYYTRHAACPVLTGS 266
D + H +CPV+ G+
Sbjct: 260 DPASLHGSCPVIRGN 274
>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 291
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 95/190 (50%), Gaps = 18/190 (9%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + + +A+P ++R+ V + TG+ + + R S +L + +
Sbjct: 91 EPRASMYHNFLSKEECEHLINLAKPFMQRSLVVDGVTGQGILNSVRTSSGTFLERGKDKI 150
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
++ + RR+ +T + E LQ+++Y +G +EPHYD+ N + G RVATV
Sbjct: 151 VQNVERRIADITSIPIENGEGLQIIHYEVGQKFEPHYDY----NFNWRITNNGGPRVATV 206
Query: 211 LFYMSDVAQGGATVFTSLN--------------LSLWPEKGTAAFWHNLHSSGDGDYYTR 256
L Y+SDV +GG TVF + L + P+ G A + ++ G D +
Sbjct: 207 LMYLSDVEEGGETVFPNAKPNFNSVSKYHPGKGLVVKPKMGDALLFWSVKPDGSLDTASL 266
Query: 257 HAACPVLTGS 266
H PV+ GS
Sbjct: 267 HGGSPVIRGS 276
>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
Length = 216
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 102/193 (52%), Gaps = 16/193 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +++R+ V + + ++ + R S A+L E E
Sbjct: 36 FEEPLIVVLANVLSDEECDELIELSKNKMKRSKVGSSR----DVNDIRTSSGAFLEENE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ +T + + E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203
Query: 268 SLHSTCPCGLRRG 280
+ + +RRG
Sbjct: 204 WIATQW---VRRG 213
>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
Length = 303
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 96/205 (46%), Gaps = 34/205 (16%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY-----------RISK 139
+PR ILY + + E + + +A+P + ++TV + TG+ + + + R S
Sbjct: 87 EPRAILYHNFLNKEECEYLINLAKPHMAKSTVVDSATGKSKDSRFVHRWKSNDSRVRTSS 146
Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
+L + I I +R+ T + E LQV++Y +G YEPH+D+ + F
Sbjct: 147 GMFLNRGQDKTIRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYEPHFDYF----LDEFN 202
Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF-----TSLNLSLW--------------PEKGTAA 240
+ G R+ATVL Y+SDV +GG TVF S ++ W P G A
Sbjct: 203 TKNGGQRIATVLMYLSDVEKGGETVFPASKVNSSSVPWWDELSECAKAGISVRPRMGDAL 262
Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
+ ++ + D + HA CPV+ G
Sbjct: 263 LFWSMRPDAELDPSSLHAGCPVIQG 287
>gi|198466397|ref|XP_002135180.1| GA23908 [Drosophila pseudoobscura pseudoobscura]
gi|198150581|gb|EDY73807.1| GA23908 [Drosophila pseudoobscura pseudoobscura]
Length = 403
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 93/199 (46%), Gaps = 27/199 (13%)
Query: 68 CRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKT 127
C Y +LRL PLK E L P I +Y DV+Y+ EI + +A L+ + K
Sbjct: 215 CHYESTRTAFLRLAPLKVEMLSLDPYIAIYHDVIYEREIARVMTLALSSLK-GPGRYSKR 273
Query: 128 GELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHY 187
E I KS + E E+ ++++R MTG ++ ++ N GIGG+ H
Sbjct: 274 REHNI------KSVTVYEEENS---QLNQRTRDMTGEQVKEDKDFRIYNSGIGGYIRYHM 324
Query: 188 DFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHS 247
D E +++V GGA F L ++WP KG+A WHNL++
Sbjct: 325 DNLAKEEQQ-----------------LNEVPHGGAISFPQLEFTVWPRKGSALVWHNLNN 367
Query: 248 SGDGDYYTRHAACPVLTGS 266
+ + DY H +CPV+ GS
Sbjct: 368 NLELDYRVAHISCPVIVGS 386
>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
Length = 283
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 85/156 (54%), Gaps = 5/156 (3%)
Query: 90 LQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHP 149
L PR+I++ +++ E D + +A+ +++R+ V + TG+ + R S+ + +P
Sbjct: 94 LHPRVIVFGNLLAAEECDALIALARRQIKRSPVFDPDTGQDQQHQARTSEGMFFGRGANP 153
Query: 150 VIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNR 206
+ R+ R+ + E LQV+ YG G YEPHYD+ ARPG A + G R
Sbjct: 154 LCARVEARIAALLNWPLENGEGLQVLRYGPGAQYEPHYDYFDPARPGAEVALRR--GGQR 211
Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
VA+++ Y++ QGGAT F +L + P KG A ++
Sbjct: 212 VASLVIYLNTPTQGGATTFPDAHLEVAPIKGNAVYF 247
>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 266
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/203 (29%), Positives = 100/203 (49%), Gaps = 19/203 (9%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
LRL +K E PRII++ + + E D ++++A+PRL +TV + TG+ ++ R
Sbjct: 53 LRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSDVRT 112
Query: 138 SKSAWLREPEH--PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
S ++ E PVI+ I +R+ + + E +QV+ Y +Y PH+D+ +
Sbjct: 113 SSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYF----S 168
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTAAFW 242
+ F G RVAT+L Y++D +GG T F L + P KG A +
Sbjct: 169 DTFNLKRGGQRVATMLMYLTDGVEGGETHFPQAGDGECICGGRLVRGLCVKPNKGDAVLF 228
Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
++ G+ D + H+ C V+ G
Sbjct: 229 WSMGLDGNTDSNSLHSGCAVVKG 251
>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
Length = 287
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 101/208 (48%), Gaps = 19/208 (9%)
Query: 73 RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEI 132
++ LR+ +K E PRIIL + E D ++ MA+P L+ +TV + +TG+
Sbjct: 69 KDADILRIGYVKPEILNWSPRIILLHSFLSSEECDYLRAMAEPLLQISTVVDAQTGKGIQ 128
Query: 133 ANYRISKSAWLR--EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
++ R S +L + +P++ I +R+ + + E +QV+ Y Y+PH+D+
Sbjct: 129 SDVRTSSGMFLSPDDSTYPIVRAIEKRISVYSQVPVENGELIQVLRYKKSQFYKPHHDYF 188
Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLSLWPEKG 237
+++F G RVAT+L Y+SD +GG T F + LS+ P KG
Sbjct: 189 ----SDSFNLKRGGQRVATMLIYLSDNVEGGETYFPMAGSGFCRCGGKSVRGLSVAPVKG 244
Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
A + ++ G D + H C VL G
Sbjct: 245 NAVLFWSMGLDGQSDPNSIHGGCEVLAG 272
>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 294
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/190 (32%), Positives = 90/190 (47%), Gaps = 18/190 (9%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
P +YR + ++E + I+++A L+ +TV + TG + R S +L E VI
Sbjct: 33 PHAEVYRGFLTEAECEHIERLATAELKPSTVVDASTGGDASSEIRTSSGMFLGRAEDDVI 92
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVL 211
E I R+ T + S E QV+ Y Y HYD+ + N + G G R+ TVL
Sbjct: 93 EAIEARIAAWTHVPESHGEGFQVLRYEKHQEYRAHYDYFHD-KFNVKREKG-GQRMGTVL 150
Query: 212 FYMSDVAQGGATVFTSL----------------NLSLWPEKGTAAFWHNLHSSGDGDYYT 255
Y+SDV +GG TVF L++ P KG A F+ +L G D ++
Sbjct: 151 MYLSDVEEGGETVFPKFEDGTPAGSEASECARNKLAVRPRKGDALFFRSLRHDGVPDTFS 210
Query: 256 RHAACPVLTG 265
HA CPV+ G
Sbjct: 211 EHAGCPVIRG 220
>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
Length = 290
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/203 (30%), Positives = 99/203 (48%), Gaps = 19/203 (9%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
LRL +K E PRII+ + + E D +K +A RL +TV + KTG+ +++R
Sbjct: 75 LRLGYVKPEVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRT 134
Query: 138 SKSAWL--REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
S +L E P+++ I +R+ + + E +QV+ Y Y+PH+D+ +
Sbjct: 135 SSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYF----S 190
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLSLWPEKGTAAFW 242
+ F G R+AT+L Y+S+ +GG T F T LS+ P KG A +
Sbjct: 191 DTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLF 250
Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
++ G D + H C VL+G
Sbjct: 251 WSMGLDGQSDPKSIHGGCEVLSG 273
>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
Length = 248
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+ V + + ++ + R S A+L + E
Sbjct: 68 FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 122
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233
>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
Length = 273
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 93/191 (48%), Gaps = 24/191 (12%)
Query: 93 RIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIE 152
RI L++ + E D I+ A+ RL R+ V + +G +++ R S + E +IE
Sbjct: 44 RIYLWKGFLTPEECDYIRMKAEKRLERSGVVDTGSGGSVVSDIRTSDGMFFERGEDAIIE 103
Query: 153 RISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEANAFKSLGTGNRVATV 210
+ +R+ T E LQV+ Y Y+ H+D F + G +N GNR ATV
Sbjct: 104 AVEQRLADWTMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNG------GNRWATV 157
Query: 211 LFYMSDVAQGGATVFTSL----------------NLSLWPEKGTAAFWHNLHSSGDGDYY 254
L Y+++ +GG TVF + NL++ P KG A +H++ +G+ +
Sbjct: 158 LLYLTETEEGGETVFPKIPAPNGINVGFSECAKYNLAVKPHKGDALLFHSMKPTGELEER 217
Query: 255 TRHAACPVLTG 265
+ H ACPV+ G
Sbjct: 218 SMHGACPVIRG 228
>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
Length = 248
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+ V + + ++ + R S A+L + E
Sbjct: 68 FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 122
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233
>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
Length = 248
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+ V + + ++ + R S A+L + E
Sbjct: 68 FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 122
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233
>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
Length = 433
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/197 (30%), Positives = 94/197 (47%), Gaps = 31/197 (15%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE----PE 147
PR ++ + + E DL+ + A+P + ++ V + G +N R S +++
Sbjct: 166 PRAFMHIGFLSERECDLLVEYARPNMYKSGVVDASNGGSSFSNIRTSTGSFVPTVFPLGM 225
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEANAFKSLGTGN 205
+ V+ RI RR+ T + + E +QV+ Y IG Y+ H+D F G N N
Sbjct: 226 NDVVRRIERRIAAWTQIPAAHGEPIQVLRYQIGQEYQSHFDYFFHEGGMKN--------N 277
Query: 206 RVATVLFYMSDVAQGGATVFTSL-----------------NLSLWPEKGTAAFWHNLHSS 248
R+ATVL Y+SDV GG TVF S +++ P+KG A + N+
Sbjct: 278 RIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHACAKNGITVIPKKGDAILFWNMKVG 337
Query: 249 GDGDYYTRHAACPVLTG 265
GD D + HA CPV+ G
Sbjct: 338 GDLDGGSTHAGCPVVLG 354
>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
Length = 216
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 101/193 (52%), Gaps = 16/193 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ ++ R+ V + + ++ + R S A+L E E
Sbjct: 36 FEEPLIVVLANVLSDEECDELIELSKSKMERSKVGSSR----DVNDIRTSSGAFLEENE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ +T + + E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203
Query: 268 SLHSTCPCGLRRG 280
+ + +RRG
Sbjct: 204 WIATQW---VRRG 213
>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
Length = 216
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/196 (28%), Positives = 101/196 (51%), Gaps = 16/196 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R S A+L + E
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSSGAFLDDNE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ H + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFHQDQSLNELTLHGGAPVTKGEK 203
Query: 268 SLHSTCPCGLRRGLQR 283
+ + +RRG R
Sbjct: 204 WIATQW---VRRGTYR 216
>gi|308467521|ref|XP_003096008.1| CRE-PHY-4 protein [Caenorhabditis remanei]
gi|308244157|gb|EFO88109.1| CRE-PHY-4 protein [Caenorhabditis remanei]
Length = 198
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 73/142 (51%), Gaps = 2/142 (1%)
Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVE-HMTGLTTSTAEELQVVNYGIGGHYE 184
KT E + R + WL P ++ R ++ + L STAE Q+++Y G+Y
Sbjct: 25 KTETPEKSEIRAANGTWLIHENRPNFAKMFRNLQTDIAALDLSTAEPWQILSYNSDGYYA 84
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
HYDF P + N GNR+ATVL + +GG TVF +NL++ P+ G W N
Sbjct: 85 HHYDFLNP-DTNKQLVEARGNRIATVLVILQIAKKGGTTVFPKINLNIRPKAGDVVVWLN 143
Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
SG+ D T HAACP+ G+
Sbjct: 144 TLPSGESDSQTLHAACPIKEGT 165
>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
Length = 248
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+ V + + ++ + R S A+L + E
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 122
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233
>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 289
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 94/201 (46%), Gaps = 24/201 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + ++A+P + ++TV + KTG+ + R S +L
Sbjct: 84 EPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARGRDKT 143
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y IG YEPHYD+ + + + G R+ATV
Sbjct: 144 IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYF----MDEYNTRNGGQRIATV 199
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LS+ P+ G A + ++
Sbjct: 200 LMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMTPDATL 259
Query: 252 DYYTRHAACPVLTGSNSLHST 272
D + H C V+ G N ST
Sbjct: 260 DPSSLHGGCAVIKG-NKWSST 279
>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
vinifera]
Length = 296
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/194 (28%), Positives = 95/194 (48%), Gaps = 23/194 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E D + +A+ L+R+ V + +G+ ++ R S ++ + + P+
Sbjct: 43 KPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGKDPI 102
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I ++ T L E++QV+ Y G Y+ HYD+ + G+R+ATV
Sbjct: 103 VAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHYDYF----VDKVNIARGGHRIATV 158
Query: 211 LFYMSDVAQGGATVF-----------TSLNLS--------LWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF T+ +LS + P KG A + +LH +
Sbjct: 159 LMYLSDVVKGGETVFPMAEVSSSTLPTNDDLSECARKGIAVKPRKGDALLFFSLHPTAIP 218
Query: 252 DYYTRHAACPVLTG 265
D + H CPV+ G
Sbjct: 219 DPMSLHGGCPVIEG 232
>gi|268562483|ref|XP_002638619.1| Hypothetical protein CBG05671 [Caenorhabditis briggsae]
Length = 520
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 74/142 (52%), Gaps = 2/142 (1%)
Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVE-HMTGLTTSTAEELQVVNYGIGGHYE 184
KT E + R + WL + P +I ++ ++ L STAE Q+++Y G+Y
Sbjct: 92 KTETPEKSQVRAANGTWLIHTKRPNFAKIFWNLQVNIRALDLSTAEPWQILSYNSEGYYA 151
Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
PHYDF P E N GNR+ATVL + +GG TVF +N+++ P+ G W N
Sbjct: 152 PHYDFLNP-ETNKVLVESRGNRIATVLVILQIAKKGGTTVFPKININIRPKIGDVVVWLN 210
Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
G+ D T HAACP+ G+
Sbjct: 211 TVPDGESDSQTLHAACPIKEGT 232
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 88/198 (44%), Gaps = 5/198 (2%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEI-DLIKKMAQPRLRRATVQNYKTGELEIANYR 136
+ +K E P +++YRD+ ++ D I+ M V N G + YR
Sbjct: 294 ISFQAVKVEVISWSPGLVIYRDMFTKKQVLDYIEIMKHQDFEEQQVVN-DDGTEYYSKYR 352
Query: 137 ISKSAWLREPEHPVIERISRRVEHMT-GLTTSTAEELQVVNYGIGGHYEPHYDFAR-PGE 194
+ + P+ P I + V+ + L ++E++ ++Y GGHY H+DF P E
Sbjct: 353 KANGTQIIAPDFPAALSIWKTVKILIPTLNIESSEDIVALSYIRGGHYAAHHDFLEYPSE 412
Query: 195 ANAFKSLGT-GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
+ GNR T++ GGAT+F SLN ++ P G A FW N + +
Sbjct: 413 KEWDGWMKDYGNRFGTLIMAFETAELGGATIFPSLNAAIRPNTGDAFFWFNAMGNTKQED 472
Query: 254 YTRHAACPVLTGSNSLHS 271
+ H CP+ G S+ +
Sbjct: 473 LSDHGGCPIYEGKKSIST 490
>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
Length = 248
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 49/177 (27%), Positives = 94/177 (53%), Gaps = 11/177 (6%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+ V + + ++ + R S A+L + E
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 122
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
+ +I +R+ + + S E L ++NY + Y+ HYD+ +A NR++
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-----VNNRIS 176
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233
>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
Length = 211
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 91/175 (52%), Gaps = 10/175 (5%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P I++ +V+ D E D + ++A +++R+ + + E R S S ++ + E+ +
Sbjct: 32 EPLIVVLGNVLSDEECDELIQLAGDKVKRSKIGTTR----EENELRTSSSMFIEDDENLI 87
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ R+ +R+ + + E LQ++ Y G Y+ H+DF S T NR++T+
Sbjct: 88 VTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSS------DSKITNNRISTL 141
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+ Y++DV QGG T F L S+ P KG A ++ +S + +T H PV+ G
Sbjct: 142 VMYLNDVEQGGETFFPHLKFSVSPRKGMAVYFEYFYSDQTLNDFTLHGGAPVVEG 196
>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
Length = 216
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 103/196 (52%), Gaps = 16/196 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E + + +M++ +++R+T+ + + ++ + R S A+L E E
Sbjct: 36 FEEPLIVVLGNVISDEECNELIEMSKNKIKRSTIGSAR----DVNDIRTSSGAFLEENE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + + E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203
Query: 268 SLHSTCPCGLRRGLQR 283
+ + +RRG R
Sbjct: 204 WIATQW---VRRGTYR 216
>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
Length = 248
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+ V + + ++ + R S A+L + E
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSAR----DVNDIRTSSGAFLEDNE- 122
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233
>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
Length = 216
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/178 (29%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R SK A+L + E
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSR----DVNDIRTSKGAFLDDNEL 91
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
V +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 92 TV--KIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201
>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 287
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 101/203 (49%), Gaps = 19/203 (9%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
LRL +K E PRIIL + + E D ++ +A PRL + V + KTG+ ++ R
Sbjct: 74 LRLGYVKPEVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSDVRT 133
Query: 138 SKSAWL--REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
S +L +E ++P+++ I +R+ + + E +QV+ Y +Y+PH+D+ +
Sbjct: 134 SSGMFLNPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYF----S 189
Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLSLWPEKGTAAFW 242
+ F G R+AT+L Y+SD +GG T F LS+ P KG A +
Sbjct: 190 DTFNLKRGGQRIATMLMYLSDNIEGGETYFPLAGSGECSCGGKLVKGLSVKPIKGNAVLF 249
Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
++ G D + H C V++G
Sbjct: 250 WSMGLDGQSDPNSVHGGCEVISG 272
>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
Length = 248
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+ V + + ++ + R S A+L + E
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDSEL 123
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 124 TL--KIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233
>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
Length = 316
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 97/202 (48%), Gaps = 22/202 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + + +PR LY+ + + E D + +A+ +L ++ V + ++G+ ++ R S
Sbjct: 51 PTRVTQLSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVADNESGKSIMSEVRTSSGM 110
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
+L + + ++ I R+ T L E +Q+++Y G YEPH+D+ + L
Sbjct: 111 FLLKAQDEIVADIEARIAAWTFLPVENGESIQILHYENGEKYEPHFDYFH----DKVNQL 166
Query: 202 GTGNRVATVLFYMSDVAQGGATVF------------------TSLNLSLWPEKGTAAFWH 243
G+R+ATVL Y++ V +GG TVF ++ P+KG A +
Sbjct: 167 LGGHRIATVLMYLATVEEGGETVFPNSEGRFSQPKDDSWSDCAKKGYAVNPKKGDALLFF 226
Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
+LH D + H +CPV+ G
Sbjct: 227 SLHPDATTDPSSLHGSCPVIAG 248
>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus thuringiensis HD-771]
gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-771]
gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
Length = 216
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+ V + + ++ + R S A+L + E
Sbjct: 36 FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201
>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
Length = 216
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ ++R+ V + + ++ + R S A+L E E
Sbjct: 36 FEEPLIVVLANVLSDEECDKLIELSKNNMKRSKVGSSR----DVNDIRTSSGAFLEENE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ +T + + E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201
>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
Length = 269
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE---IAN 134
LR+ +K E PRIIL + E D + +A PRL ++TV + TG+ +
Sbjct: 53 LRIGLVKPEVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIESK 112
Query: 135 YRISKSAWLR--EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARP 192
R S +L + +P+I+ I RR+ + + E LQV+ Y +Y+PH+D+
Sbjct: 113 VRTSTGMFLSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYF-- 170
Query: 193 GEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTA 239
++ F G RVATVL Y+SDV +GG T+F S+ L + P KG A
Sbjct: 171 --SDQFNLKRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDA 228
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTG 265
+ + G+ D + H C VL G
Sbjct: 229 ILFWSAALDGNVDSNSLHGGCSVLRG 254
>gi|195159162|ref|XP_002020451.1| GL14001 [Drosophila persimilis]
gi|194117220|gb|EDW39263.1| GL14001 [Drosophila persimilis]
Length = 452
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 27/179 (15%)
Query: 47 KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
++ +CR P+ +L CRY P+LRL PL+ EE L P I +Y +V+ D+EI
Sbjct: 281 EFIQICRSSHQNKPS---RLHCRYNATTTPFLRLAPLRMEELSLDPYIAVYHNVLSDAEI 337
Query: 107 DLIKKMAQPRLRR----------ATVQNYKTGEL--EIANYRISKSAWLREPEHPVIERI 154
++++ +P L+R T +TG I NY A PVIER+
Sbjct: 338 AEVERVIEPLLKRIGRYDEMPNSMTPSKRRTGFTGPHIDNYMHVSGA-------PVIERV 390
Query: 155 SRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY 213
R + MTGL + L ++ YG+GGH + HYDF A+ + G+R+ATVLFY
Sbjct: 391 HRHIRDMTGLFMNV--HLMMIKYGLGGHCDQHYDFLN---ASYPSTHAMGDRMATVLFY 444
>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 255
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 91/193 (47%), Gaps = 27/193 (13%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR +Y + D E D I +++ L ++ V + KTG ++ R S ++ P I
Sbjct: 1 PRAFVYEGFLTDEECDHILALSKGHLHKSGVVDAKTGGSTTSDIRTSTGTFISRAHDPTI 60
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEANAFKSLGTGNRVAT 209
I R+E + + E LQV+ Y G Y+ H+D F + G+ N NR+AT
Sbjct: 61 TAIEERIELWSQIPVDHGEALQVLRYENGQEYKAHFDYFFHKGGKRN--------NRIAT 112
Query: 210 VLFYMSDVAQGGATVFTSLNL-----------------SLWPEKGTAAFWHNLHSSGDGD 252
VL Y+SDV +GG TVF + ++ S+ KG A + ++ G+ D
Sbjct: 113 VLLYLSDVEEGGETVFPNTDVPTDRDRSQYSECGNGGKSVKARKGDALLFWSMKPGGELD 172
Query: 253 YYTRHAACPVLTG 265
+ HA CPV+ G
Sbjct: 173 PGSSHAGCPVIKG 185
>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
vinifera]
gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 95/196 (48%), Gaps = 25/196 (12%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E D + +A+ L+R+ V + +G+ ++ R S ++ + + P+
Sbjct: 43 KPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGKDPI 102
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I ++ T L E++QV+ Y G Y+ HYD+ + G+R+ATV
Sbjct: 103 VAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHYDYF----VDKVNIARGGHRIATV 158
Query: 211 LFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGTAAFWHNLHSSG 249
L Y+SDV +GG TVF T+ +LS + P KG A + +LH +
Sbjct: 159 LMYLSDVVKGGETVFPMAEEPSRRKPLPTNDDLSECARKGIAVKPRKGDALLFFSLHPTA 218
Query: 250 DGDYYTRHAACPVLTG 265
D + H CPV+ G
Sbjct: 219 IPDPMSLHGGCPVIEG 234
>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
Length = 248
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+ V + + ++ + R S A+L + E
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDSEL 123
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 124 TL--KIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233
>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
konkukian str. 97-27]
gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
konkukian str. 97-27]
Length = 232
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 94/177 (53%), Gaps = 11/177 (6%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R S A+L + E
Sbjct: 52 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSSGAFLDDNE- 106
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
+ E+I +R+ + + S E L ++NY + Y+ HYD+ +A NR++
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-----ANNRIS 160
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 217
>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
Length = 232
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R S A+L + E
Sbjct: 52 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSSGAFLDDNE- 106
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ E+I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 159
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 160 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 217
>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
G9842]
gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
G9842]
Length = 216
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/177 (27%), Positives = 94/177 (53%), Gaps = 11/177 (6%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+ V + + ++ + R S A+L + E
Sbjct: 36 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
+ +I +R+ + + S E L ++NY + Y+ HYD+ +A NR++
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-----VNNRIS 144
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201
>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 272
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 73/271 (26%), Positives = 123/271 (45%), Gaps = 42/271 (15%)
Query: 7 QRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQL 66
QR QG K+Y L + DE V+ +++ E+ K +L L ++ L
Sbjct: 15 QRLQGLKIYETSDLIQHINTFDELVG-EQVSVDVKIEEKTKDMIL----LCSLSPLLTTL 69
Query: 67 KCRYVH----RNVPYLRLMPLKEEEAYLQPRIILYRDVM--------YDSEIDLIKKMAQ 114
C V P R + + +E PR +Y + + + E D + +A+
Sbjct: 70 TCSMVKVAASLRFPNERWLEVITKE----PRAFVYHNFLALFFKICKTNEECDHLISLAK 125
Query: 115 PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQV 174
P + R+ V+N TG E ++ R S ++R +++ I +R+ T + E LQV
Sbjct: 126 PSMARSKVRNALTGLGEESSSRTSSGTFIRSGHDKIVKEIEKRISEFTFIPQENGETLQV 185
Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF-------TS 227
+NY +G +EPH+D F+ R+ATVL Y+SDV +GG TVF +
Sbjct: 186 INYEVGQKFEPHFD--------GFQ------RIATVLMYLSDVDKGGETVFPEAKGIKSK 231
Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHA 258
+S+ P+KG A + ++ G D ++H
Sbjct: 232 KGVSVRPKKGDALLFWSMRPDGSRDPSSKHG 262
>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
Length = 216
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+ V + + ++ + R S A+L + E
Sbjct: 36 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSAR----DVNDIRTSSGAFLEDNE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201
>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 296
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 92/195 (47%), Gaps = 23/195 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PRI LY + + E + + +A+P +R++TV +TG + R S +L +
Sbjct: 91 EPRIFLYHNFLTKEECEHLINIAKPNMRKSTVIESETGMSIESRVRTSSGTFLARGRDKI 150
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+ I R+ T + EELQV++Y +G Y PH+D+ + + G+R+AT+
Sbjct: 151 VRNIENRIADFTFIPVDNGEELQVLHYQVGEKYVPHHDYF----MDDINTANGGDRIATM 206
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF LS+ P+ A + ++
Sbjct: 207 LMYLSDVEEGGETVFPDAKGNFSSMPGWNELSVCGKKGLSIKPKMRNALLFWSIKPDATY 266
Query: 252 DYYTRHAACPVLTGS 266
D + H +CPV+ G+
Sbjct: 267 DPLSLHGSCPVIKGN 281
>gi|195166675|ref|XP_002024160.1| GL22879 [Drosophila persimilis]
gi|194107515|gb|EDW29558.1| GL22879 [Drosophila persimilis]
Length = 484
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 93/199 (46%), Gaps = 27/199 (13%)
Query: 68 CRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKT 127
C Y ++RL PLK E L P I +Y DV+Y+ EI + +A L+ + K
Sbjct: 296 CHYESTRTAFVRLAPLKVEMLSLDPYIAIYHDVIYEREIARVMTLALSSLK-GPGRYSKR 354
Query: 128 GELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHY 187
E I KS + E E+ ++++R MTG ++ ++ N GIGG+ H
Sbjct: 355 REHNI------KSVTVYEEENS---QLNQRTRDMTGEQVKEDKDFRIYNSGIGGYIRYHM 405
Query: 188 DFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHS 247
D E +++V GGA F L ++WP KG+A WHNL++
Sbjct: 406 DNLAKEEQQ-----------------LNEVPHGGAISFPQLEFTVWPRKGSALVWHNLNN 448
Query: 248 SGDGDYYTRHAACPVLTGS 266
+ + DY H +CPV+ GS
Sbjct: 449 NLELDYRVAHISCPVIVGS 467
>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
Length = 289
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 81/154 (52%), Gaps = 5/154 (3%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR++L+ +++ E I AQPR+ R+ TG E+ R S + + E PV+
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARSLTVQTTTGGEEVNADRTSDGMFFQRGETPVV 161
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
+R+ R+ + E LQV++Y G Y+PHYD+ +PG + + G RVA
Sbjct: 162 QRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRR--GGQRVA 219
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
T++ Y+++ +GG T F + L + P +G A F+
Sbjct: 220 TLVIYLNNPRKGGGTTFPDVPLEVAPRQGNAVFF 253
>gi|195494572|ref|XP_002094895.1| GE22068 [Drosophila yakuba]
gi|194180996|gb|EDW94607.1| GE22068 [Drosophila yakuba]
Length = 438
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 98/201 (48%), Gaps = 27/201 (13%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
L C YV P+L+L PLK EE ++P I ++ + +I+++K +++P+L+R
Sbjct: 256 LVCHYVDW-TPFLKLAPLKMEELSMKPHISIFYGFLGPKDIEVLKNVSRPKLQR------ 308
Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
E AN K L H V+ +++ + +TG + E ++V+NYGI G+Y P
Sbjct: 309 --NEHLSANCS-CKIGNLFSSSHDVVRKVNELILDITGFPSKGNEMVEVINYGIAGNYNP 365
Query: 186 HYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL 245
D A+P + N A ++ + +GG VF S +L + P KG+ W NL
Sbjct: 366 D-DTAQPRKHNK----------ANAFIFLGNAGKGGEIVFPSRDLKIRPRKGSMIVWENL 414
Query: 246 HSSGDGDYYTRHAACPVLTGS 266
S + CP+L G+
Sbjct: 415 KKS------VIYHQCPILKGN 429
>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
Length = 216
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+ V + + ++ + R S A+L + E
Sbjct: 36 FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDSEL 91
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 92 TL--KIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201
>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
Length = 282
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 88/162 (54%), Gaps = 7/162 (4%)
Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
DLI+ +A+PRL+RA + G+ +I R S+ + R E P++ I +R+ + G+
Sbjct: 109 DLIE-LARPRLQRALTVD-SDGKQQIDQRRTSEGMFFRAGETPLVAAIEQRLAQLLGVPA 166
Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFK-SLGTGNRVATVLFYMSDVAQGGATVF 225
S E LQ+++YG G YEPHYD+ P K + G R+A+V+ Y++ +GG T F
Sbjct: 167 SHGEGLQILHYGPGQEYEPHYDWFDPALPGYDKLTARAGQRIASVVMYLNTPERGGGTAF 226
Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ L++ +G A ++ + GD + HA PVL G
Sbjct: 227 PEIGLTVTARRGAAVYF----AYEGGDQSSLHAGLPVLQGEK 264
>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
Length = 289
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 81/154 (52%), Gaps = 5/154 (3%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR++L+ +++ E I AQPR+ R+ TG E+ R S + + E PV+
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARSLTVQTTTGGEEVNADRTSDGMFFQRGETPVV 161
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
+R+ R+ + E LQV++Y G Y+PHYD+ +PG + + G RVA
Sbjct: 162 QRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRR--GGQRVA 219
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
T++ Y+++ +GG T F + L + P +G A F+
Sbjct: 220 TLVIYLNNPLKGGGTTFPDVPLEVAPRQGNAVFF 253
>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
Length = 201
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 57/185 (30%), Positives = 87/185 (47%), Gaps = 22/185 (11%)
Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
D E D + +A PRLRR++V + KTG + + R S A+LR ++ I R+ +T
Sbjct: 9 DDECDHLIGLALPRLRRSSVIDEKTGLGKDSRNRTSWGAFLRRDHDNIVSGIEDRISSIT 68
Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
+ E LQVV Y G +EPH D+ + E N G+R+ T+L Y+++V GG
Sbjct: 69 FIPKEYGESLQVVRYKTGQKFEPHQDYYKLTENNN----NGGHRIGTLLLYLTNVENGGE 124
Query: 223 TVF------------------TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLT 264
TVF T + + P +G + SG+ D ++ H CPV+
Sbjct: 125 TVFPRALANVINDYSTNTSECTKKGIVIRPRRGDGLLFWITRPSGEIDPFSFHGGCPVVK 184
Query: 265 GSNSL 269
G L
Sbjct: 185 GEKWL 189
>gi|195591294|ref|XP_002085377.1| GD12338 [Drosophila simulans]
gi|194197386|gb|EDX10962.1| GD12338 [Drosophila simulans]
Length = 438
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 59/204 (28%), Positives = 101/204 (49%), Gaps = 33/204 (16%)
Query: 66 LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
L CRY P+L+L PLK EE ++P I ++ + +I+++K + +P+L+R +
Sbjct: 256 LVCRYADW-TPFLKLAPLKMEELSMKPHISIFYVFLGQKDIEVLKNVFRPKLQRIE---H 311
Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
+G +S S+ H V+ +++ + +TG + + L+V+NYGI G+Y P
Sbjct: 312 LSGNCSCKIGNLSSSS------HDVVRKVNELILDITGFPSKGNQMLEVINYGIAGNYNP 365
Query: 186 HYDFARP---GEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
D A+P +ANAF ++ +GG VF S +L + P KG+ W
Sbjct: 366 E-DTAKPKIHNKANAF-------------IFLESAGKGGEIVFPSRHLKVRPRKGSMLVW 411
Query: 243 HNLHSSGDGDYYTRHAACPVLTGS 266
NL +S + CP+L G+
Sbjct: 412 ENLKNS------VIYHQCPILKGN 429
>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
Length = 232
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 94/177 (53%), Gaps = 11/177 (6%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R SK A+L + E
Sbjct: 52 FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 106
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
+ +I +R+ + + S E L ++NY + Y+ HYD+ +A NR++
Sbjct: 107 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-----ANNRIS 160
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 217
>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
Length = 216
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 101/193 (52%), Gaps = 16/193 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ ++ R+ V + + ++ + R S A+L E E
Sbjct: 36 FEEPLIVVLANVLSDEECDELIELSKSKMERSKVGSSR----DVNDIRTSSGAFLEENE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ +T + + E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEK 203
Query: 268 SLHSTCPCGLRRG 280
+ + +RRG
Sbjct: 204 WIATQW---VRRG 213
>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
Length = 216
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 53/193 (27%), Positives = 102/193 (52%), Gaps = 16/193 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E + + ++++ +++R+ V + + ++ + R S A+L E E
Sbjct: 36 FEEPLIVVLANVLSDEECEELIELSKNKMKRSKVGSSR----DVNDIRTSSGAFLEENE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ +T + + E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203
Query: 268 SLHSTCPCGLRRG 280
+ + +RRG
Sbjct: 204 WIATQW---VRRG 213
>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 279
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 86/177 (48%), Gaps = 3/177 (1%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR++++ +++ E + + A+ RL R+ +TG + R S+ + E+ ++
Sbjct: 92 PRVVVFGNLVSPEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSEGMFFERGENDIV 151
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
R+ +R+ + E LQ++ Y G Y PHYD+ PGE L G RVAT+
Sbjct: 152 ARLEQRIAALLRWPVEFGEGLQILRYAPGAQYRPHYDYFDPGEPGTPTILKRGGQRVATL 211
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+ Y+ + QGGAT F + L + P +GT F+ + D T H PVL G
Sbjct: 212 VMYLQEPGQGGATTFPDVGLEVAPVRGTGVFFS--YEEPDPATRTLHGGAPVLAGEK 266
>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
Length = 254
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 51/192 (26%), Positives = 98/192 (51%), Gaps = 14/192 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ ++ R+ + + + + + R S A+L E E
Sbjct: 74 FEEPLIVVLANVLSDEECDELIELSKNKMERSKIGSSRN----VNDIRTSSGAFLEENE- 128
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
+I +R+ +T + + E L ++NY + Y+ HYD+ +A NR++
Sbjct: 129 -FTSKIEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYFAEHSRSA-----ANNRIS 182
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNS 268
T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 183 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 242
Query: 269 LHSTCPCGLRRG 280
+ + +RRG
Sbjct: 243 IATQW---MRRG 251
>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
Length = 264
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 78 LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE---IAN 134
LR+ +K E PRI L + E D + +A PRL ++TV + TG+ +
Sbjct: 52 LRIGLVKPEVLNWSPRITLLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIESK 111
Query: 135 YRISKSAWLR--EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARP 192
R S +L + +P+IE I RR+ + + E LQV+ Y +Y+PH+D+
Sbjct: 112 VRTSTGMFLSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYF-- 169
Query: 193 GEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTA 239
++ F G RVATVL Y+SDV +GG T+F S+ L + P KG A
Sbjct: 170 --SDQFNLKRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDA 227
Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTG 265
+ + G+ D + H C VL G
Sbjct: 228 ILFWSAALDGNVDSNSLHGGCSVLRG 253
>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
Length = 285
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 89/181 (49%), Gaps = 7/181 (3%)
Query: 88 AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
+ L PR+++ D + D+E D + +AQPRL R+ + G + R S S L+ +
Sbjct: 92 SLLLPRVVVLGDFLSDAECDALIALAQPRLARSRTVDNDNGAQIVHAARTSDSMCLQLGQ 151
Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-GTGNR 206
+ +RI R+ + E LQV+ Y G Y+PHYD+ P A L G R
Sbjct: 152 DALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYQPHYDYFDPTAAGTPVLLQAGGQR 211
Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTR--HAACPVLT 264
+A+++ Y++ +GGAT F ++L + KG A F+ S TR HA PVL
Sbjct: 212 LASLVMYLNTPERGGATRFPDVHLDVAAVKGNAVFF----SYDRPHPMTRSLHAGAPVLA 267
Query: 265 G 265
G
Sbjct: 268 G 268
>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
Length = 216
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 102/196 (52%), Gaps = 16/196 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E + +M++ +++R+T+ + + ++ + R S A+L E E
Sbjct: 36 FEEPLIVVLGNVISDEECGELIEMSKNKIKRSTIGSSR----DVNDIRTSSGAFLEENE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + + E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203
Query: 268 SLHSTCPCGLRRGLQR 283
+ + +RRG R
Sbjct: 204 WIATQW---VRRGTYR 216
>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
Length = 239
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 61/202 (30%), Positives = 95/202 (47%), Gaps = 22/202 (10%)
Query: 82 PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
P + + QPR +Y+ + D E D + +A+ +L ++ V N +TGE + R S
Sbjct: 15 PTRAAQLSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSMVANDETGESMESQERTSSGM 74
Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
++ + E ++ I R+ T L E +Q++ Y G YE H D+ +AN +
Sbjct: 75 FIFKTEDEIVNGIEARIAAWTFLPEENGEPIQILRYEHGQKYEAHIDYF-VDKANQEEG- 132
Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------TSLNLSLW-----------PEKGTAAFWH 243
G+R ATVL Y+SDV +GG TVF + W P KG A +
Sbjct: 133 --GHRAATVLMYLSDVKKGGETVFPTSEAEGSQAKDDSWSDCAKKGYAVKPNKGDALLFF 190
Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
+LH D + HA+CPV+ G
Sbjct: 191 SLHPDATPDPGSLHASCPVIEG 212
>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 239
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 78/135 (57%), Gaps = 4/135 (2%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR+ LY+ + D E + + +A+ L+R+ V + +G+ ++ R S +LR+ + P+
Sbjct: 57 KPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEVRTSSGTFLRKGQDPI 116
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
+E I ++ T L E++QV+ Y G YEPHYD+ + ++ G+R ATV
Sbjct: 117 VEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYF----TDNVNTVRGGHRYATV 172
Query: 211 LFYMSDVAQGGATVF 225
L Y++DV +GG TVF
Sbjct: 173 LLYLTDVPEGGETVF 187
>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
Length = 216
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 102/196 (52%), Gaps = 16/196 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ +++R+T+ + + ++ + R S A+L E E
Sbjct: 36 FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR----DVNDIRTSSGAFLEENE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + + E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H V G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGASVTKGEK 203
Query: 268 SLHSTCPCGLRRGLQR 283
+ + +RRG R
Sbjct: 204 WIATQW---VRRGTYR 216
>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 266
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/163 (32%), Positives = 83/163 (50%), Gaps = 12/163 (7%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + ++A+P + ++TV + KTG+ + R S +L
Sbjct: 83 EPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARGRDKT 142
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y IG YEPHYD+ + + + G R+ATV
Sbjct: 143 IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYF----MDEYNTRNGGQRIATV 198
Query: 211 LFYMSDVAQGGATVFTSL--NLSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + N S P +W+ L G G
Sbjct: 199 LMYLSDVEEGGETVFPAAKGNYSAVP------WWNELSECGKG 235
>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
Length = 322
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 88/187 (47%), Gaps = 7/187 (3%)
Query: 86 EEAYLQPRIILYRDVMYDSEIDLIKKMA-QPRLRRATVQNYKTGELEIANYRISKSAWLR 144
E + PRI + +++ + E D + +A Q L + + Y T +L + R +K AWL
Sbjct: 76 ETVSVDPRIFIVHNLLTEEECDHLVSLALQKGLSASLITPYGTNKLVESTTRTNKQAWLD 135
Query: 145 EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTG 204
+ V++R+ ++ +T T E LQV++Y + H+D+ P G
Sbjct: 136 FQQDDVVKRVEDKIAKLTKTTPEQGENLQVLHYAKSQQFTEHHDYFDPATDPPENYEKGG 195
Query: 205 NRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDG------DYYTRHA 258
NR+ TV+ Y+ +GG T F + NL L KG A ++NL DG D T HA
Sbjct: 196 NRLITVIVYLQAAEEGGETHFGAANLKLTAAKGDAVMFYNLKHGCDGIDPTCVDKQTLHA 255
Query: 259 ACPVLTG 265
P + G
Sbjct: 256 GLPPIKG 262
>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
thaliana]
Length = 267
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/163 (32%), Positives = 83/163 (50%), Gaps = 12/163 (7%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + ++A+P + ++TV + KTG+ + R S +L
Sbjct: 84 EPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARGRDKT 143
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQV++Y IG YEPHYD+ + + + G R+ATV
Sbjct: 144 IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYF----MDEYNTRNGGQRIATV 199
Query: 211 LFYMSDVAQGGATVFTSL--NLSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + N S P +W+ L G G
Sbjct: 200 LMYLSDVEEGGETVFPAAKGNYSAVP------WWNELSECGKG 236
>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
Length = 229
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R SK A+L + E
Sbjct: 49 FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 103
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 104 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 156
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 157 STLVMYLNDVEEGGETFFPKLNLSVNPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 214
>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
Length = 248
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 94/178 (52%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ ++ R+ + + + ++ + R S A+L + E
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR----DVNDIRTSSGAFLEDNE- 122
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233
>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
Length = 205
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 87/180 (48%), Gaps = 10/180 (5%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
P + + + + D E + +M + ++ RA V E E R + WL V
Sbjct: 16 DPIVYVVNNFLSDDECEAFVEMGKGKMERAKV--ISDDESEFHASRTNDFCWLEHSASDV 73
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF----ARPGEANAFKSLGTGNR 206
I +S+R + + + AE+ Q+V YG G Y+PH+D + G+ N F G R
Sbjct: 74 IHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAFDKTTKEGQNNWFPG---GQR 130
Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN-LHSSGDGDYYTRHAACPVLTG 265
+ T L Y++DV +GGAT F +N+S+ P KG +HN + + + + H PV+ G
Sbjct: 131 MVTALAYLNDVEEGGATDFPKINVSVKPNKGDVVVFHNCIEGTTEINPQALHGGSPVVAG 190
>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
Length = 216
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R SK A+L + E
Sbjct: 36 FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201
>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 296
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 91/169 (53%), Gaps = 1/169 (0%)
Query: 99 DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
+V+ E + +MA+PRL +T+ + +G +++ R S + R E+ ++ R+ RR+
Sbjct: 107 NVVDAHECKALIEMAKPRLAPSTLVDPMSGRDVVSDKRASWGMFFRLCENDLVARLDRRL 166
Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDV 217
+ L E L ++ Y G EPH+D+ P A +S+ +G RV+T++ Y++D
Sbjct: 167 SALMNLPLENGEGLHLLYYPTGAGSEPHHDYLAPTNAANRESIARSGQRVSTLVTYLNDA 226
Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
+GG TVF L L++ P +G A ++ +G D + HA+ PV G
Sbjct: 227 PEGGQTVFPQLGLAVSPIRGNACYFEYCDGNGRVDARSLHASAPVTRGD 275
>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
Length = 216
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R SK A+L + E
Sbjct: 36 FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201
>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
Length = 216
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 94/178 (52%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R S A+L + E
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSR----DVNDIRTSSGAFLEDNEL 91
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
V +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 92 TV--KIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201
>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
Length = 286
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/177 (32%), Positives = 87/177 (49%), Gaps = 7/177 (3%)
Query: 92 PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
PR+++ + D+E D + +AQPRL R+ + G + R S S L+ + +
Sbjct: 96 PRVMVLGGFLSDAECDAMIALAQPRLARSRTVDNANGAHVVHAARTSDSMCLQLGQDALC 155
Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-GTGNRVATV 210
+RI R+ + E LQV+ YG G Y+PHYD+ P A L G RVA++
Sbjct: 156 QRIEARIARLLDWPVENGEGLQVLRYGTGAEYQPHYDYFDPDAAGTPVLLQAGGQRVASL 215
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTR--HAACPVLTG 265
+ Y++ +GGAT F ++L + KG A F+ S TR HA PVL G
Sbjct: 216 VMYLNTPDRGGATRFPDVHLDIAAIKGNAVFF----SYDRPHPMTRSLHAGAPVLAG 268
>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
Length = 219
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 96/178 (53%), Gaps = 11/178 (6%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P +++ +V+ + E D + ++++ +++R+ + E E+ + R S + E E+
Sbjct: 36 FEEPLVLVLGNVLSNEECDELIQLSKDKMQRSKI----GAEREVNSIRTSSGMFFEESEN 91
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
++ +I RR+ + G + AE LQ++ Y Y+ H+D F +A+ NR+
Sbjct: 92 ELVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASK------NNRI 145
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F L LS+ P KG A ++ +S + + T H PV+ G
Sbjct: 146 STLVMYLNDVEEGGETYFPKLGLSISPTKGMAVYFEYFYSDAELNDRTLHGGAPVIKG 203
>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
Length = 220
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 91/176 (51%), Gaps = 13/176 (7%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+P I++ +V+ D E + + ++++ ++R+ + + E+ N R S +L E E
Sbjct: 42 EPLIVVLENVLSDEECESLIELSKDSMKRSKIGASR----EVDNIRTSSGTFLEENETVA 97
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVAT 209
I I +RV + + E L ++ Y G Y+ HYD FA A NR++T
Sbjct: 98 I--IEKRVSSIMNIPVEHGEGLHILKYTPGQEYKAHYDYFAEHSRA------AENNRIST 149
Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
++ Y++DV +GG T F LNLS+ P+KG+A ++ ++ + T H PV+ G
Sbjct: 150 LVMYLNDVEEGGETFFPKLNLSIAPKKGSAVYFEYFYNDKSLNELTLHGGAPVIKG 205
>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
Length = 216
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 101/196 (51%), Gaps = 16/196 (8%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R S A+L + E
Sbjct: 36 FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSR----DVNDIRTSSGAFLDDDE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203
Query: 268 SLHSTCPCGLRRGLQR 283
+ + +RRG R
Sbjct: 204 WIATQW---VRRGTYR 216
>gi|407698902|ref|YP_006823689.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
'Black Sea 11']
gi|407248049|gb|AFT77234.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
'Black Sea 11']
Length = 263
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 89/179 (49%), Gaps = 7/179 (3%)
Query: 93 RIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIE 152
++ Y D + E D I + + +L + + G + R S + L + +++
Sbjct: 81 QLFAYDDFLSSQECDDIVALTKDKLAPSKL----AGAASADDIRTSSTCELAFLGNKLVK 136
Query: 153 RISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKS--LGTGNRVATV 210
+ R+ L E +Q +Y +G +Y+PHYDF PG +K+ L G R T
Sbjct: 137 DVDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGSPQ-YKTHCLSRGQRTWTC 195
Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
+ Y++D GG T FT L++++ P+KG A FW+NL SGD + + H A PV G ++
Sbjct: 196 MIYLNDECDGGHTRFTKLDIAVRPKKGMALFWNNLLPSGDPNLNSIHFAEPVTRGHKTV 254
>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
Length = 248
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 93/178 (52%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + +M++ ++ R+ + + + ++ + R S A+L + E
Sbjct: 68 FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR----DVNDIRTSSGAFLEDNE- 122
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 123 -FTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233
>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
Length = 216
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +++R+ V + + ++ + R S A+L + E
Sbjct: 36 FEEPLIVVLGNVLSDEECDELIELSKSKMKRSKVGSSR----DVNDIRTSSGAFLDDNE- 90
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
+ +I +R+ + + S E L ++NY + Y+ HYD FA + A NR+
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143
Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
+T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201
>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 93/196 (47%), Gaps = 23/196 (11%)
Query: 91 QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
+PR +Y + + E + +A+P + ++TV + KTGE + R S +L + +
Sbjct: 80 EPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKI 139
Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
I I +R+ T + E LQ+++Y +G Y+ HYD+ + + G R+AT+
Sbjct: 140 IRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYF----VDEYNIKKGGQRMATL 195
Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
L Y+SDV +GG TVF + LS+ P+ G A + ++
Sbjct: 196 LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATL 255
Query: 252 DYYTRHAACPVLTGSN 267
D + H ACPV+ G+
Sbjct: 256 DPTSLHGACPVIRGNK 271
>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
Length = 232
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 100/195 (51%), Gaps = 14/195 (7%)
Query: 89 YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
+ +P I++ +V+ D E D + ++++ +L R+ V + + ++ + R S A+L + E
Sbjct: 52 FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSR----DVNDIRTSSGAFLDDNE- 106
Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
+ +I +R+ + + S E L ++NY + Y+ HYD+ +A NR++
Sbjct: 107 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-----ANNRIS 160
Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNS 268
T++ Y++DV +GG T F LNLS+ P KG A ++ + + T H PV G
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220
Query: 269 LHSTCPCGLRRGLQR 283
+ + +RRG R
Sbjct: 221 IATQW---VRRGTYR 232
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.411
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,179,375,595
Number of Sequences: 23463169
Number of extensions: 217742668
Number of successful extensions: 465570
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1424
Number of HSP's successfully gapped in prelim test: 422
Number of HSP's that attempted gapping in prelim test: 461153
Number of HSP's gapped (non-prelim): 2037
length of query: 312
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 170
effective length of database: 9,027,425,369
effective search space: 1534662312730
effective search space used: 1534662312730
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)