BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy8177
         (312 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
           vitripennis]
          Length = 556

 Score =  383 bits (984), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 176/282 (62%), Positives = 218/282 (77%), Gaps = 17/282 (6%)

Query: 2   IFPTHQRAQGNKLYYQEALNK---------SPELKDEPPKVNN--------VAPTLEVTE 44
           + PTHQRA GN+ YYQE + K           +  ++ P  +         +    E+TE
Sbjct: 242 LVPTHQRALGNRAYYQEEIQKRTNESRRKRGEDGSEDTPAADQHFTVTEKKIKSVSEMTE 301

Query: 45  REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           RE+YEMLCRG++ +P +I  +L+CRYV R +P+L++ P KEEEAYL PRI++Y DV+YD 
Sbjct: 302 RERYEMLCRGEIKMPLSIQKELRCRYVDRGIPFLKIAPFKEEEAYLDPRIVIYHDVIYDD 361

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           EI+ IK+MAQPR +RATVQNYKTGELEIANYRISKSAWL+E EH  +  +S+RVEHMT +
Sbjct: 362 EIETIKRMAQPRFKRATVQNYKTGELEIANYRISKSAWLQEHEHKHVRAVSQRVEHMTSM 421

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
           +  TAEELQVVNYGIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 422 SIETAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRIATVLYYMSDVEQGGGTV 481

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           FT +N+SLWP+KG+AAFW+NL  +G+GDY TRHAACPVLTGS
Sbjct: 482 FTKINISLWPKKGSAAFWYNLKPNGEGDYKTRHAACPVLTGS 523


>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
          Length = 537

 Score =  383 bits (983), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 175/282 (62%), Positives = 220/282 (78%), Gaps = 17/282 (6%)

Query: 2   IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
           + PTH+RA GN+ YYQ+ +    ++S + + E  + +   P               E+TE
Sbjct: 223 LVPTHERALGNRAYYQKEIQSKASQSKKKRGEDGQDDTAVPAQHFTVVEERVKTLDEMTE 282

Query: 45  REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           RE+YEMLCRG++T+PP +   LKCRYV R +P+L++ P KEEEAYL PRI++Y +V+YD 
Sbjct: 283 RERYEMLCRGEVTIPPEVQKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDD 342

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH  +  +SRRVEHMT +
Sbjct: 343 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSRRVEHMTSM 402

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
           T  TAEELQVVNYGIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 403 TVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 462

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           FT++N++LWP+KG+AAFW+NL  +G+GD+ TRHAACPVLTGS
Sbjct: 463 FTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGS 504


>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           impatiens]
          Length = 557

 Score =  383 bits (983), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 176/282 (62%), Positives = 219/282 (77%), Gaps = 17/282 (6%)

Query: 2   IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
           + PTH+RA GN+ YYQ+ +    N+S + + E  + +   P               E+TE
Sbjct: 243 LVPTHERALGNRAYYQKEIQSKANQSKKKRGEDGQDDTAVPAQHFTVAEEKMKTWEEMTE 302

Query: 45  REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           RE+YEMLCRG++++PP I   L CRYV R +P+L++ P KEEEAYL PRI++Y +V+YD 
Sbjct: 303 RERYEMLCRGEVSIPPEIQKNLVCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDE 362

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH  +  +SRRVEHMT +
Sbjct: 363 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHEHVAAVSRRVEHMTSM 422

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
           T  TAEELQVVNYGIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 423 TVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 482

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           FT++N+SLWP+KG+AAFW+NL  +G+GD+ TRHAACPVLTGS
Sbjct: 483 FTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGS 524


>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           terrestris]
          Length = 557

 Score =  383 bits (983), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 176/282 (62%), Positives = 219/282 (77%), Gaps = 17/282 (6%)

Query: 2   IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
           + PTH+RA GN+ YYQ+ +    N+S + + E  + +   P               E+TE
Sbjct: 243 LVPTHERALGNRAYYQKEIQSKANQSKKKRGEDGQDDTAVPAQHFTVAEEKMKTWEEMTE 302

Query: 45  REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           RE+YEMLCRG++++PP I   L CRYV R +P+L++ P KEEEAYL PRI++Y +V+YD 
Sbjct: 303 RERYEMLCRGEVSIPPEIQKNLVCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDE 362

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH  +  +SRRVEHMT +
Sbjct: 363 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHEHVAAVSRRVEHMTSM 422

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
           T  TAEELQVVNYGIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 423 TVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 482

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           FT++N+SLWP+KG+AAFW+NL  +G+GD+ TRHAACPVLTGS
Sbjct: 483 FTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGS 524


>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
           rotundata]
          Length = 550

 Score =  382 bits (982), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 176/276 (63%), Positives = 216/276 (78%), Gaps = 11/276 (3%)

Query: 2   IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTLE-------VTEREKYEM 50
           + PTH+RA GN+ YYQ+ +    N+S + + E  + +   P  E       +TERE+YEM
Sbjct: 242 LVPTHERALGNRAYYQKEIQSKANQSKKKRGEDGQDDTAVPAQEKVKTWEEMTERERYEM 301

Query: 51  LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
           LCRG++++PP I   LKCRYV R +P+L++ P KEEEAYL PRI++Y +V+YD EI+ IK
Sbjct: 302 LCRGEVSIPPEIQKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVIYHNVIYDEEIETIK 361

Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
           +MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH  +  +S+RVEHMT L   TAE
Sbjct: 362 RMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSLNVETAE 421

Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
           ELQVVNYGIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVL+YMSDV QGG TVFT++N+
Sbjct: 422 ELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINI 481

Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           SLWP KG+AAFW NL  +G+GD  TRHAACPVLTGS
Sbjct: 482 SLWPRKGSAAFWFNLKPNGEGDLRTRHAACPVLTGS 517


>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
          Length = 415

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 175/282 (62%), Positives = 220/282 (78%), Gaps = 17/282 (6%)

Query: 2   IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
           + PTH+RA GN+ YYQ+ +    ++S + + E  + +   P               E+TE
Sbjct: 101 LVPTHERALGNRAYYQKEIQSKASQSKKKRGEDGQDDTAVPAQHFTVAEERVKTLDEMTE 160

Query: 45  REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           RE+YEMLCRG++T+PP +   LKCRYV R +P+L++ P KEEEAYL PRI++Y +V+YD 
Sbjct: 161 RERYEMLCRGEVTIPPEVQKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDD 220

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH  +  +SRRVEHMT +
Sbjct: 221 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSRRVEHMTSM 280

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
           T  TAEELQVVNYGIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 281 TVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 340

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           FT++N++LWP+KG+AAFW+NL  +G+GD+ TRHAACPVLTGS
Sbjct: 341 FTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGS 382


>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
          Length = 415

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 175/282 (62%), Positives = 219/282 (77%), Gaps = 17/282 (6%)

Query: 2   IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
           + PTH+RA GN+ YYQ+ +    ++S + + E  K +   P               E+TE
Sbjct: 101 LVPTHERALGNRAYYQKEIQSKASQSKKKRGEDGKDDTAIPEQNFTVAEERVKTWEEMTE 160

Query: 45  REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           RE+YEMLCRG++++PP +   LKCRYV R +P+L++ P KEEEAYL PRI++Y +V+YD 
Sbjct: 161 RERYEMLCRGEVSIPPEVEKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDE 220

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH  +  +S+RVEHMT +
Sbjct: 221 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSM 280

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
           +  TAEELQVVNYGIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 281 SVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 340

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           FT++N+SLWP KG+AAFWHNL  +G+GD+ TRHAACPVLTGS
Sbjct: 341 FTAINISLWPRKGSAAFWHNLKPNGEGDFKTRHAACPVLTGS 382


>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
          Length = 476

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 173/282 (61%), Positives = 219/282 (77%), Gaps = 17/282 (6%)

Query: 2   IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
           + PTH+RA GN+ YYQ+ +    ++S + + E  + +   P               E+TE
Sbjct: 162 LVPTHERALGNRAYYQKEIQSKASQSKKKRGEDGQDDTAIPEQNFTVAEERVKTWEEMTE 221

Query: 45  REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           RE+YEMLCRG++++P  +   LKCRYV R +P+L++ PLKEEEAYL PRI++Y +V+YD 
Sbjct: 222 RERYEMLCRGEVSIPREVEKNLKCRYVDRGIPFLKIAPLKEEEAYLDPRIVVYHNVIYDE 281

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH  +  +S+RVEHMT +
Sbjct: 282 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSM 341

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
           +  TAEELQVVNYGIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 342 SIETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 401

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           FT++N+SLWP KG+AAFW+NL  +G+GD+ TRHAACPVLTGS
Sbjct: 402 FTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLTGS 443


>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
          Length = 415

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 172/282 (60%), Positives = 217/282 (76%), Gaps = 17/282 (6%)

Query: 2   IFPTHQRAQGNKLYYQEAL----NKSPELKDEPPKVNNVAPTL-------------EVTE 44
           + PTH+RA GN+ YYQ+ +    ++S + + E  + +   P               E+TE
Sbjct: 101 LVPTHERALGNRAYYQKEIQSKASQSKKKRGEDGQDDTAIPEQNFTVAEERVKTWEEMTE 160

Query: 45  REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           RE+YEMLCRG++++P  +   LKCRYV R +P+L++ P KEEEAYL PRI+ Y +V+YD 
Sbjct: 161 RERYEMLCRGEVSIPLEVEKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVFYHNVIYDE 220

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           EI+ IK+MAQPR +RATVQNYKTG LEIANYRISKSAWL+E EH  +  +S+RVEHMT +
Sbjct: 221 EIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSM 280

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
           +  TAEELQVVNYGIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVL+YMSDV QGG TV
Sbjct: 281 SVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTV 340

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           FT++N+SLWP KG+AAFW+NL  +G+GD+ TRHAACPVLTGS
Sbjct: 341 FTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLTGS 382


>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
 gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
          Length = 536

 Score =  357 bits (916), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 176/292 (60%), Positives = 211/292 (72%), Gaps = 14/292 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK----DEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
           I P+H RA GNK+YY++ L KS   K    D         P ++   RE YE LCRG+++
Sbjct: 238 ILPSHPRALGNKIYYEDELQKSVNTKKKGDDGGEPEGESKPYVDPYGREFYEQLCRGEIS 297

Query: 58  VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           +P    ++LKC Y+ RN P+L++ P K EEA+ +P I ++RDV+ DSEI  IK+MAQPR 
Sbjct: 298 LPVEKASKLKCFYLSRNQPFLKIAPFKVEEAHHRPDIFIFRDVLADSEIATIKRMAQPRF 357

Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
           +RATVQN  TGELEIA YRISKSAWL+E EH  I  +S+RV  MTGLT STAEELQVVNY
Sbjct: 358 KRATVQNTDTGELEIAQYRISKSAWLKEEEHKHIADVSQRVSDMTGLTMSTAEELQVVNY 417

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVLFYMSDV QGGATVF S+ +SLWP+KG
Sbjct: 418 GIGGHYEPHFDFARRDERNAFKSLGTGNRIATVLFYMSDVEQGGATVFPSIQVSLWPQKG 477

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRR 279
           +AAFW+NLH SGDGD  TRHAACPVLTGS  + +            PC L R
Sbjct: 478 SAAFWYNLHPSGDGDKMTRHAACPVLTGSKWVSNKWIHERGQEFRRPCTLER 529


>gi|157114985|ref|XP_001658091.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108877086|gb|EAT41311.1| AAEL007038-PA [Aedes aegypti]
          Length = 545

 Score =  356 bits (914), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 172/281 (61%), Positives = 209/281 (74%), Gaps = 16/281 (5%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK-------------DEPPKVNNVAPTLEV---TER 45
           + P H+RA GNK+YY++ L K  + K             D   K+        V   +ER
Sbjct: 236 LVPDHERAVGNKVYYEKELEKEAKQKALRGDDGSVDVPVDTTTKIRTSTSNPHVYDSSER 295

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           + YE LCRG+     A  ++LKCRYV    P+L++ PLK EEA L+P I++Y DV+ ++E
Sbjct: 296 KLYEQLCRGEAERSVAETSKLKCRYVTNKSPFLKIAPLKLEEANLKPYIVIYHDVISEAE 355

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           ++L+K++A+PR RRATVQNYKTGELE+ANYRISKSAWL++ EHP I+ I  RVE MTGLT
Sbjct: 356 MELVKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLKDHEHPYIKAIGERVEDMTGLT 415

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
            STAEELQVVNYGIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVLFYMSDV QGGATVF
Sbjct: 416 MSTAEELQVVNYGIGGHYEPHFDFARREETNAFKSLGTGNRIATVLFYMSDVTQGGATVF 475

Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            SL L+LWP+KG AAFW NLH+SG GDY TRHAACPVLTG+
Sbjct: 476 PSLRLALWPKKGAAAFWFNLHASGQGDYSTRHAACPVLTGT 516


>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
 gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
          Length = 545

 Score =  354 bits (908), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 170/280 (60%), Positives = 215/280 (76%), Gaps = 15/280 (5%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK----DEPPKVNNVAPTLEV-----------TERE 46
           + P H+RA GNK YY++ L K    K    D+  +   V  T+++           TER 
Sbjct: 237 LVPNHERAVGNKAYYEKELEKEARQKALRGDDGSEDVPVDTTIQIKKETSSLVYDSTERV 296

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
            YE LCRG+     A +A+L+CRYV  + P+L++ PLK EEA+L+P I++Y +VM D+EI
Sbjct: 297 LYEQLCRGEAHRAEADLAKLRCRYVTNSSPFLKIAPLKLEEAHLEPYIVIYHEVMSDAEI 356

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
           ++IK++A+PR RRATVQNYKTGELE+ANYRISKSAWL++ EH V+  + +RVE MTGLT 
Sbjct: 357 EVIKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLKDEEHSVVRTVGQRVEDMTGLTM 416

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
           +TAEELQVVNYGIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVLFYMSDV+QGGATVF 
Sbjct: 417 TTAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRIATVLFYMSDVSQGGATVFP 476

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           S+ ++L P+KGTAAFW+NLH+SG GDY TRHAACPVLTG+
Sbjct: 477 SIRVALRPKKGTAAFWYNLHASGHGDYATRHAACPVLTGT 516


>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
 gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
          Length = 553

 Score =  353 bits (907), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 181/299 (60%), Positives = 210/299 (70%), Gaps = 22/299 (7%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK-----DEPPKVNNVAPTLEVT-------EREKYE 49
           + P H+RA  NK YY + L K  + K     D   +V     T E T       ER+ YE
Sbjct: 247 LVPDHERAVSNKAYYVKELQKEAQQKILRGDDGSEEVPVDTTTKEATPHVYDTNERKLYE 306

Query: 50  MLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
            LCRG+   P  + +QL CRY   + P+LR+ PLK EEAYL+P I++Y DVM D EI+ I
Sbjct: 307 QLCRGEQQPPIELRSQLVCRYTTNSSPFLRIGPLKLEEAYLRPYIVIYHDVMSDREIERI 366

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K  A+PR RRATVQNYKTGELE ANYRISKSAWL++ E  +I  IS+RVE MTGLT  TA
Sbjct: 367 KHYARPRFRRATVQNYKTGELEFANYRISKSAWLKDAEDEMIRTISQRVEDMTGLTMETA 426

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           EELQVVNYGIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVLFYMSDV QGGATVF SLN
Sbjct: 427 EELQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRIATVLFYMSDVTQGGATVFPSLN 486

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLR 278
           L+LWP KGTAAFW NLH+SG GDY TRHAACPVLTG+  + +            PCGL+
Sbjct: 487 LALWPRKGTAAFWFNLHASGRGDYATRHAACPVLTGTKWVSNKWIHERGQEFRRPCGLQ 545


>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Acyrthosiphon pisum]
          Length = 534

 Score =  353 bits (906), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 176/269 (65%), Positives = 211/269 (78%), Gaps = 6/269 (2%)

Query: 2   IFPTHQRAQGNKLYYQEAL-NKSPEL--KDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           I P H+RA GN  YY+ A+ N + E+   ++PPK   V  TL+  ERE+Y MLCR +  +
Sbjct: 237 ILPNHERALGNLAYYEAAIKNGTTEIGKSEQPPKA--VTATLDPEERERYHMLCRNENLM 294

Query: 59  PPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
              I +QL+CRY + N  P L + PLKEEEA+  PRIILYRDV+YD+EI++IK+MAQPRL
Sbjct: 295 SIQISSQLRCRYTNNNRNPLLLIAPLKEEEAFFSPRIILYRDVLYDNEIEVIKRMAQPRL 354

Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
           +RATVQNYKTGELE A+YRISKSAWL+E E  V+  +++RVE MTGLTT TAEELQVVNY
Sbjct: 355 KRATVQNYKTGELEFADYRISKSAWLKEHEDVVVANVAKRVEVMTGLTTETAEELQVVNY 414

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           G+GGHY+PHYDFAR  E NAFKSLGTGNR+ATVLFYMSDVAQGGATVF  L ++L P KG
Sbjct: 415 GVGGHYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGGATVFPWLGVALQPVKG 474

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           TAA W NL+ SG+GD  TRHAACPVL GS
Sbjct: 475 TAAVWFNLYPSGNGDLRTRHAACPVLQGS 503


>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Acyrthosiphon pisum]
          Length = 552

 Score =  353 bits (906), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 176/269 (65%), Positives = 211/269 (78%), Gaps = 6/269 (2%)

Query: 2   IFPTHQRAQGNKLYYQEAL-NKSPEL--KDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           I P H+RA GN  YY+ A+ N + E+   ++PPK   V  TL+  ERE+Y MLCR +  +
Sbjct: 255 ILPNHERALGNLAYYEAAIKNGTTEIGKSEQPPKA--VTATLDPEERERYHMLCRNENLM 312

Query: 59  PPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
              I +QL+CRY + N  P L + PLKEEEA+  PRIILYRDV+YD+EI++IK+MAQPRL
Sbjct: 313 SIQISSQLRCRYTNNNRNPLLLIAPLKEEEAFFSPRIILYRDVLYDNEIEVIKRMAQPRL 372

Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
           +RATVQNYKTGELE A+YRISKSAWL+E E  V+  +++RVE MTGLTT TAEELQVVNY
Sbjct: 373 KRATVQNYKTGELEFADYRISKSAWLKEHEDVVVANVAKRVEVMTGLTTETAEELQVVNY 432

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           G+GGHY+PHYDFAR  E NAFKSLGTGNR+ATVLFYMSDVAQGGATVF  L ++L P KG
Sbjct: 433 GVGGHYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGGATVFPWLGVALQPVKG 492

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           TAA W NL+ SG+GD  TRHAACPVL GS
Sbjct: 493 TAAVWFNLYPSGNGDLRTRHAACPVLQGS 521


>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
           humanus corporis]
 gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
           humanus corporis]
          Length = 534

 Score =  342 bits (877), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 161/291 (55%), Positives = 210/291 (72%), Gaps = 15/291 (5%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTE-----REKYEMLCRGDL 56
           ++P H+RAQGNK+YY++AL +S   K +    + +             R  YE LCR ++
Sbjct: 239 LYPNHERAQGNKIYYEDALQQSKGQKKKGDDGDEIVIEKNTNSKYYKGRGMYEKLCRNEV 298

Query: 57  TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
            +   + A+LKCRYV    P+L L  +KEEEA+L PRI+LY DV+ D EI  I+++A PR
Sbjct: 299 GLSEKMKAKLKCRYVDFGRPFLMLAKVKEEEAFLDPRIVLYHDVLSDREIKTIQQLAVPR 358

Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
            +RATVQN +TG+LE+A+YRISKSAWL + +HP + ++S+RVE +TGL  +TAE LQVVN
Sbjct: 359 FKRATVQNSETGKLEVAHYRISKSAWLEDVDHPYVAKVSQRVEDITGLNMATAESLQVVN 418

Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
           YGIGGHYEPH+DFAR  E NAF+SLGTGNR+AT+LFYMSDV+QGGATVF  + +SLWP+K
Sbjct: 419 YGIGGHYEPHFDFARKEEKNAFQSLGTGNRIATILFYMSDVSQGGATVFPGIKVSLWPKK 478

Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
           GTAAFW+NL  +G+GDY TRHAACPVLTGS  + +            PCGL
Sbjct: 479 GTAAFWYNLRKNGEGDYLTRHAACPVLTGSKWVCNKWIHERGQEFRRPCGL 529


>gi|195055779|ref|XP_001994790.1| GH14110 [Drosophila grimshawi]
 gi|193892553|gb|EDV91419.1| GH14110 [Drosophila grimshawi]
          Length = 487

 Score =  342 bits (877), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 166/278 (59%), Positives = 201/278 (72%), Gaps = 13/278 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPEL--------KDEPP----KVNNVAP-TLEVTEREKY 48
           + P H+RA GNK +Y++ +    E+         DE P     V    P   ++TER  Y
Sbjct: 180 LLPDHERANGNKKFYEKEIAHQMEMGKMKGDDGSDEMPVSDLHVTKTDPGVFDLTERTAY 239

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           EMLCRG+L   PA +  L+CRYV+ NV +LRL PLK EEA++ P I++Y D MYDSEI++
Sbjct: 240 EMLCRGELKPSPAEIRPLRCRYVNNNVDFLRLAPLKLEEAFMDPYIVIYHDAMYDSEIEV 299

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           +K+MA+PR RRATVQN  TG LE ANYRISKSAWL+ PEH +I  + +R   MTGL   +
Sbjct: 300 LKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTPEHEIIGTVVQRTADMTGLDMDS 359

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQVVNYGIGGHYEPH+DFAR  E  AF+ L  GNR+AT+LFYMSDV QGGATVFTSL
Sbjct: 360 AEELQVVNYGIGGHYEPHFDFARREEKLAFEGLNLGNRIATMLFYMSDVQQGGATVFTSL 419

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             +LWP+KGTAAFW NLH SG+GD  TRHAACPVLTGS
Sbjct: 420 RTALWPKKGTAAFWMNLHRSGEGDARTRHAACPVLTGS 457


>gi|195391754|ref|XP_002054525.1| GJ24502 [Drosophila virilis]
 gi|194152611|gb|EDW68045.1| GJ24502 [Drosophila virilis]
          Length = 487

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 168/278 (60%), Positives = 203/278 (73%), Gaps = 13/278 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK--------DEPP----KVNNVAP-TLEVTEREKY 48
           + P H+RA GNK +Y++ +    EL+        DE P     V    P   ++TER+ Y
Sbjct: 180 LLPNHERANGNKKFYEKEIAHLKELQKMKGDDGTDEMPVSDLPVAKSDPGVFDMTERKAY 239

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           EMLCRG+L   P+ +  L+CRYV+ NV +LRL PLK EEAY+ P I++Y D MYDSEI++
Sbjct: 240 EMLCRGELKPSPSELRPLRCRYVNNNVAFLRLAPLKLEEAYMDPYIVIYHDAMYDSEIEI 299

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK+MA+PR RRATVQN  TG LE ANYRISKSAWL+  EH VI  + +R   MTGL   +
Sbjct: 300 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTAEHRVIGTVVQRTADMTGLDMDS 359

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQVVNYGIGGHYEPH+DFAR  E  AF+ L  GNR+AT+LFYMSDV QGGATVFTSL
Sbjct: 360 AEELQVVNYGIGGHYEPHFDFARREEKRAFEGLNLGNRIATMLFYMSDVEQGGATVFTSL 419

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + +LWP+KGTAAFW NLH SG+GD  TRHAACPVLTGS
Sbjct: 420 HAALWPKKGTAAFWMNLHRSGEGDVRTRHAACPVLTGS 457


>gi|195452726|ref|XP_002073473.1| GK14136 [Drosophila willistoni]
 gi|194169558|gb|EDW84459.1| GK14136 [Drosophila willistoni]
          Length = 550

 Score =  340 bits (871), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 168/278 (60%), Positives = 202/278 (72%), Gaps = 13/278 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK--------DEPP----KVNNVAPTL-EVTEREKY 48
           + P H+RA GNK +Y++ +    E+K        DE P     V    P + ++TER+ Y
Sbjct: 243 LLPYHERANGNKKFYEKEIAHLKEMKRMKGDDGSDEMPVSDLPVAKSDPGVYDITERKAY 302

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           EMLCRG+L   PA +  L+CRYV  NVP+LRL PLK EEA++ P I++Y D MYDSE+DL
Sbjct: 303 EMLCRGELKPSPADLRPLRCRYVTNNVPFLRLGPLKLEEAHMDPYIVIYHDAMYDSEMDL 362

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK+MA+PR RRATVQN  TG LE ANYRISKSAWL+  E  VI  + +R   MTGL   +
Sbjct: 363 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDQVIGTVVQRTADMTGLDMDS 422

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQVVNYGIGGHYEPH+DFAR  E  AF+ L  GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 423 AEELQVVNYGIGGHYEPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 482

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + +LWP+KGTAAFW NLH  G+GD  TRHAACPVLTG+
Sbjct: 483 HAALWPKKGTAAFWMNLHRDGEGDVRTRHAACPVLTGT 520


>gi|194905436|ref|XP_001981196.1| GG11753 [Drosophila erecta]
 gi|190655834|gb|EDV53066.1| GG11753 [Drosophila erecta]
          Length = 550

 Score =  339 bits (870), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 167/278 (60%), Positives = 201/278 (72%), Gaps = 13/278 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPK----VNNVAPTL-EVTEREKY 48
           + P H+RA GNK +Y++ + +  +L         DE PK    V    P + ++TER  Y
Sbjct: 243 LLPHHERANGNKRFYEKEIAQQLQLSKMKGDDGTDEMPKSDLPVAKSDPAIFDMTERRAY 302

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           EMLCRG+L   P+ +  L+CRYV   VP+LRL PLK EEA+  P I+++ D MYD EIDL
Sbjct: 303 EMLCRGELKPSPSDLRSLRCRYVTNGVPFLRLGPLKLEEAHADPYIVIFHDAMYDGEIDL 362

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK+MA+PR RRATVQN  TG LE ANYRISKSAWL+ PEH VIE + +R   MTGL   +
Sbjct: 363 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTPEHRVIETVVQRTADMTGLDMDS 422

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQVVNYGIGGHYEPH+DFAR  E  AF+ L  GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 423 AEELQVVNYGIGGHYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 482

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + +L+P+KGTAAFW NLH  G GD  TRHAACPVLTG+
Sbjct: 483 HTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGT 520


>gi|312383453|gb|EFR28539.1| hypothetical protein AND_03427 [Anopheles darlingi]
          Length = 341

 Score =  338 bits (868), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 182/325 (56%), Positives = 211/325 (64%), Gaps = 49/325 (15%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK-------------DEPPKVNNVAPTLEV---TER 45
           + P H+RA  NK YY + L K    K             D   K++    +  V   TER
Sbjct: 8   LVPDHERAVSNKAYYVKELEKEALQKILRGDDGSEEVPVDTSTKIHKGEASPHVYDKTER 67

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           + YE LCRG+   P  + +QL CRY     P+LRL PLK EEAY QP I++Y DVM D E
Sbjct: 68  KLYEQLCRGEQEPPIELRSQLVCRYATNRSPFLRLAPLKLEEAYRQPDIVIYHDVMSDRE 127

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           I+LIK  A+PR RRATVQNYKTGELE ANYRISKSAWL++ EH VI  +++RVE MTGLT
Sbjct: 128 IELIKHYARPRFRRATVQNYKTGELEFANYRISKSAWLKDTEHEVIRTVNQRVEDMTGLT 187

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY------------ 213
            +TAEELQVVNYGIGGHYEPH+DFAR  E NAFKSLGTGNR+ATVLFY            
Sbjct: 188 MATAEELQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRIATVLFYVSDLCLCHTSHT 247

Query: 214 -----------MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPV 262
                      MSDV QGGATVF SLNL+L P KGTAAFWHNLH+SG+GDY TRHAACPV
Sbjct: 248 NADFRFLSVGQMSDVTQGGATVFPSLNLALRPRKGTAAFWHNLHASGNGDYATRHAACPV 307

Query: 263 LTGSNSLHSTC----------PCGL 277
           LTG+  + +            PCGL
Sbjct: 308 LTGTKWVSNKWIHERGQEFRRPCGL 332


>gi|195110919|ref|XP_002000027.1| GI24860 [Drosophila mojavensis]
 gi|193916621|gb|EDW15488.1| GI24860 [Drosophila mojavensis]
          Length = 487

 Score =  338 bits (866), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 165/278 (59%), Positives = 203/278 (73%), Gaps = 13/278 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPE-LK------------DEPPKVNNVAPTLEVTEREKY 48
           + P H+RA GNK +Y++ + +  E LK             + P V +     ++TER+ Y
Sbjct: 180 LLPDHERANGNKKFYEKEIAQLKEKLKVKGDDGSDATPVSDLPVVKSDPGVFDMTERKAY 239

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           EMLCRG+L + P+++  L+CRYV  NVP+LRL PLK EEA+L P I++Y D M+DSEI++
Sbjct: 240 EMLCRGELKLSPSVLRPLRCRYVSNNVPFLRLAPLKLEEAFLDPYIVIYHDAMFDSEIEV 299

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           +K+MA+PR RRATVQN  TG LE ANYRISKSAWL+  EH VI  + +R   MTGL   +
Sbjct: 300 LKRMARPRFRRATVQNAVTGALETANYRISKSAWLKTAEHRVIGTVVQRTADMTGLDMDS 359

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQVVNYGIGGHYEPH+DFAR  E  AF+ L  GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 360 AEELQVVNYGIGGHYEPHFDFARREEIRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 419

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +  L P+KGTAAFW NLH SG+GD  TRHAACPVLTGS
Sbjct: 420 HAVLKPKKGTAAFWMNLHRSGEGDVRTRHAACPVLTGS 457


>gi|194765194|ref|XP_001964712.1| GF22904 [Drosophila ananassae]
 gi|190614984|gb|EDV30508.1| GF22904 [Drosophila ananassae]
          Length = 547

 Score =  338 bits (866), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 167/278 (60%), Positives = 203/278 (73%), Gaps = 13/278 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPK----VNNVAPT-LEVTEREKY 48
           + P H+RA GNK +Y++ +    +L+        DE PK    V    P+  ++TER+ Y
Sbjct: 240 LLPHHERANGNKRFYEKEIANQQQLRKMKGDDGSDEMPKSDLPVAKSDPSVFDMTERKAY 299

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           EMLCRG+L   PA +  L+CRYV  NVP+LRL PLK EEA+ +P I++Y D MYDSEI+L
Sbjct: 300 EMLCRGELKPSPADLRPLRCRYVTNNVPFLRLGPLKLEEAHQEPYIVIYHDAMYDSEIEL 359

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK+MA+PR RRATVQN  TG LE ANYRISKSAWL+  E  VI  + +R   MTGL   +
Sbjct: 360 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDHVIGTVVQRTADMTGLDMDS 419

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQVVNYGIGGHYEPH+DFAR  E  AF+ L  GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 420 AEELQVVNYGIGGHYEPHFDFARKEEKRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 479

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + +L+P+KGTAAFW NLH  G+GD  TRHAACPVLTG+
Sbjct: 480 HTALFPKKGTAAFWMNLHRDGEGDVRTRHAACPVLTGT 517


>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
 gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
          Length = 550

 Score =  335 bits (860), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 166/278 (59%), Positives = 200/278 (71%), Gaps = 13/278 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPK----VNNVAPTL-EVTEREKY 48
           + P H+RA GNK +Y++ + +  +L+        DE PK    V    P + ++TER  Y
Sbjct: 243 LLPHHERANGNKRFYEKEIAQQLQLRKMKGDDGTDEMPKSDLPVAKSDPAIFDMTERRAY 302

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           EMLCRG+L   P+ +  L+CRYV   VP+LRL PLK EE +  P I++Y D MYDSEIDL
Sbjct: 303 EMLCRGELKPSPSDLRSLRCRYVTNRVPFLRLGPLKLEEVHADPYIVIYHDAMYDSEIDL 362

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK+MA+PR RRATVQN  TG LE ANYRISKSAWL+  E  VIE + +R   MTGL   +
Sbjct: 363 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDS 422

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQVVNYGIGGHYEPH+DFAR  E  AF+ L  GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 423 AEELQVVNYGIGGHYEPHFDFARKEEERAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 482

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + +L+P+KGTAAFW NLH  G GD  TRHAACPVLTG+
Sbjct: 483 HTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGT 520


>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
           [Drosophila melanogaster]
 gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
 gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
          Length = 550

 Score =  335 bits (859), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 166/278 (59%), Positives = 200/278 (71%), Gaps = 13/278 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPK----VNNVAPTL-EVTEREKY 48
           + P H+RA GNK +Y++ + +  +L+        DE PK    V    P + ++TER  Y
Sbjct: 243 LLPHHERANGNKRFYEKEIAQQLQLRKMKGDDGTDEMPKSDLPVAKSDPAIFDMTERRAY 302

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           EMLCRG+L   P+ +  L+CRYV   VP+LRL PLK EE +  P I++Y D MYDSEIDL
Sbjct: 303 EMLCRGELKPSPSDLRSLRCRYVTNRVPFLRLGPLKLEEVHADPYIVIYHDAMYDSEIDL 362

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK+MA+PR RRATVQN  TG LE ANYRISKSAWL+  E  VIE + +R   MTGL   +
Sbjct: 363 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDS 422

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQVVNYGIGGHYEPH+DFAR  E  AF+ L  GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 423 AEELQVVNYGIGGHYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 482

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + +L+P+KGTAAFW NLH  G GD  TRHAACPVLTG+
Sbjct: 483 HTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGT 520


>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
 gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
          Length = 550

 Score =  334 bits (857), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 165/278 (59%), Positives = 200/278 (71%), Gaps = 13/278 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPK----VNNVAPTL-EVTEREKY 48
           + P H+RA GNK +Y++ + +  +L+        DE PK    V    P + ++TER  Y
Sbjct: 243 LLPHHERANGNKRFYEKEIAQQLQLRKMKGDDGTDEMPKSDLPVAKSDPAIFDMTERRAY 302

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           EMLCRG+L   P+ +  L+CRYV   VP+LRL PLK EE +  P I++Y D MYDSEIDL
Sbjct: 303 EMLCRGELKPSPSDLRSLRCRYVTNRVPFLRLGPLKLEEVHADPYIVIYHDAMYDSEIDL 362

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK+MA+PR RRATVQN  TG LE ANYRISKSAWL+  E  VIE + +R   MTGL   +
Sbjct: 363 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDS 422

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQVVNYGIGGHYEPH+DFAR  E  AF+ +  GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 423 AEELQVVNYGIGGHYEPHFDFARKEEERAFEGINLGNRIATVLFYMSDVEQGGATVFTSL 482

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + +L+P+KGTAAFW NLH  G GD  TRHAACPVLTG+
Sbjct: 483 HTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGT 520


>gi|125772807|ref|XP_001357662.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
 gi|54637394|gb|EAL26796.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score =  334 bits (857), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 162/278 (58%), Positives = 197/278 (70%), Gaps = 13/278 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPKVN-----NVAPTLEVTEREKY 48
           + P H+RA GNK +Y++ +    E+K        DE P  +     +    L V ER+ Y
Sbjct: 242 LLPNHERANGNKRFYEKEIAHQKEMKKMKGDDGTDEMPVSDLPVARSDTGELGVKERKSY 301

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           EMLCRG+L   P  +  L+CRYV  NVP+LRL PLK EEA+  P I++Y D MYDSE+DL
Sbjct: 302 EMLCRGELKPSPTYMRSLRCRYVTNNVPFLRLGPLKLEEAHKDPYIVIYHDAMYDSEMDL 361

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK+MA+PR RRATVQN  TG LE ANYRISKSAWL+  E  VI ++ +R   MTGL   +
Sbjct: 362 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDSVIAKVVQRTADMTGLDMES 421

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQVVNYGIGGHY PH+DFAR  E  AF+ L  GNR+ATVLFYMSDV QGGATVFT+L
Sbjct: 422 AEELQVVNYGIGGHYAPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQGGATVFTTL 481

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             +LWP++GTAAFW NLH  G+GD  T+HAACPVLTG+
Sbjct: 482 RTALWPKRGTAAFWMNLHRDGEGDKRTQHAACPVLTGT 519


>gi|195159323|ref|XP_002020531.1| GL13463 [Drosophila persimilis]
 gi|194117300|gb|EDW39343.1| GL13463 [Drosophila persimilis]
          Length = 487

 Score =  333 bits (854), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 162/278 (58%), Positives = 197/278 (70%), Gaps = 13/278 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPKVN-----NVAPTLEVTEREKY 48
           + P H+RA GNK +Y++ +    E+K        DE P  +     +    L V ER+ Y
Sbjct: 180 LLPNHERANGNKRFYEKEIAHQKEMKKMKGDDGTDEMPVSDLPVARSDTGELGVKERKSY 239

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           EMLCRG+L   P  +  L+CRYV  NVP+LRL PLK EEA+  P I++Y D MYDSE+DL
Sbjct: 240 EMLCRGELKPSPTYMRSLRCRYVTNNVPFLRLGPLKLEEAHKDPYIVIYHDAMYDSEMDL 299

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK+MA+PR RRATVQN  TG LE ANYRISKSAWL+  E  VI ++ +R   MTGL   +
Sbjct: 300 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDSVIAKVVQRTADMTGLDMES 359

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQVVNYGIGGHY PH+DFAR  E  AF+ L  GNR+ATVLFYMSDV QGGATVFT+L
Sbjct: 360 AEELQVVNYGIGGHYAPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQGGATVFTTL 419

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             +LWP++GTAAFW NLH  G+GD  T+HAACPVLTG+
Sbjct: 420 RTALWPKRGTAAFWMNLHRDGEGDKRTQHAACPVLTGT 457


>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
          Length = 545

 Score =  332 bits (852), Expect = 9e-89,   Method: Compositional matrix adjust.
 Identities = 165/303 (54%), Positives = 210/303 (69%), Gaps = 26/303 (8%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK-------------DEPPKVNNVA---PTLEVTER 45
           + P HQRA GNK YY++ L +   ++             DEP    N+    P+ ++ ER
Sbjct: 238 LVPFHQRALGNKKYYEDLLRQQGVIQRRGETGDEDNVVMDEPFNTANLKLTKPSDQLPER 297

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           E YE LCRG+  + P I  +L+CRYV  NVPYL + P+K EEA+ +P I++Y +V+ D E
Sbjct: 298 ENYEKLCRGEKLMDPKIEGRLRCRYVTNNVPYLYIQPVKMEEAFHKPLIVIYHNVINDDE 357

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           I+ +KKMAQPR +RATVQN  TG LE ANYRISKSAWL+  EH  + +++RRV  +TGL 
Sbjct: 358 IETVKKMAQPRFKRATVQNSVTGNLEPANYRISKSAWLKSEEHDHVFKVTRRVGDVTGLD 417

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
            +TAE+LQVVNYGIGGHYEPH+D+AR  E NAFK LG GNRVAT LFYMS+V  GGATVF
Sbjct: 418 MATAEDLQVVNYGIGGHYEPHFDYARKEEVNAFKDLGWGNRVATWLFYMSEVEAGGATVF 477

Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PC 275
             LNL+LWP+KG+AAFW+NLH +G+G+  TRHAACPVLTGS  + +            PC
Sbjct: 478 PKLNLALWPQKGSAAFWYNLHPNGEGNELTRHAACPVLTGSKWVSNKWIHERNQEFRHPC 537

Query: 276 GLR 278
           GLR
Sbjct: 538 GLR 540


>gi|195505190|ref|XP_002099397.1| GE10881 [Drosophila yakuba]
 gi|194185498|gb|EDW99109.1| GE10881 [Drosophila yakuba]
          Length = 487

 Score =  329 bits (843), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 165/278 (59%), Positives = 198/278 (71%), Gaps = 13/278 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK--------DEPPK----VNNVAPTL-EVTEREKY 48
           + P H+RA GNK +Y++ + +  +L         DE PK    V    P + ++TER  Y
Sbjct: 180 LLPHHERANGNKRFYEKEIAQQLQLSKMKGDDGTDEMPKSDLPVAKSDPAIFDMTERRAY 239

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           EMLCRG+L   P+ +  L+CRYV   VP+LRL PLK EEA+  P I++Y D MYDSEID+
Sbjct: 240 EMLCRGELKPSPSELRPLRCRYVTNGVPFLRLGPLKLEEAHADPYIVIYHDAMYDSEIDV 299

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK+MA+PR RRATVQN  TG LE ANYRISKSAWL+  E  VI  + +R   MTGL   +
Sbjct: 300 IKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTHEDRVIGTVVQRTADMTGLDMES 359

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQVVNYGIGGHYEPH+DFAR  E  AF+ L  GNR+ATVLFYMSDV QGGATVFTSL
Sbjct: 360 AEELQVVNYGIGGHYEPHFDFARKEEERAFEGLNLGNRIATVLFYMSDVEQGGATVFTSL 419

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + +L+P KGTAAFW NLH  G GD  TRHAACPVLTG+
Sbjct: 420 HTALFPRKGTAAFWMNLHRDGQGDVRTRHAACPVLTGT 457


>gi|240974259|ref|XP_002401836.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215491070|gb|EEC00711.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 490

 Score =  327 bits (837), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 171/297 (57%), Positives = 206/297 (69%), Gaps = 23/297 (7%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELK-----DEPPKVNNVA-----PTLEVTEREKYEMLCR 53
           P H RA GNK YY++A++K+   K     D P     V      P  + +ER  YE LCR
Sbjct: 192 PDHPRAPGNKRYYEDAISKTELHKRGDDGDVPMDEAAVGKKHHGPDAD-SERGIYERLCR 250

Query: 54  GD-LTVPPAIVAQ-LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+   VPP    + L C+Y     P+L L P KEE  + +PRI++Y DVM   E+D++K 
Sbjct: 251 GEKFPVPPLYKDKDLTCQYRTNGSPFLLLQPAKEEVMFPKPRIVIYHDVMSKHEMDVVKL 310

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +AQPRL+RATVQNYK+GELE+ANYRISKSAWLR  EH VI R++RR+EH+TGL+  TAEE
Sbjct: 311 LAQPRLKRATVQNYKSGELEVANYRISKSAWLRNEEHGVIARVTRRIEHITGLSADTAEE 370

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQVVNYGIGGHYEPH+DFAR  E NAF+SLGTGNR+AT L YMSDV  GGATVF  L L+
Sbjct: 371 LQVVNYGIGGHYEPHFDFARREEKNAFQSLGTGNRIATWLNYMSDVPAGGATVFPQLRLT 430

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS----------TCPCGLR 278
           LWPEKG AAFW+NLH SG+GD  TRHAACPVL GS  + +          T PCG R
Sbjct: 431 LWPEKGAAAFWYNLHRSGEGDMLTRHAACPVLAGSKWVSNKWFHERGQEFTRPCGTR 487


>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
          Length = 249

 Score =  320 bits (820), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 154/218 (70%), Positives = 180/218 (82%), Gaps = 1/218 (0%)

Query: 50  MLCRGDLTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           MLCR +  +   I +QL+CRY + N  P L + PLKEEEA+  PRIILYRDV+YD+EI++
Sbjct: 1   MLCRNENLMSIQISSQLRCRYTNNNRNPLLLIAPLKEEEAFFSPRIILYRDVLYDNEIEV 60

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK+MAQPRL+RATVQNYKTGELE A+YRISKSAWL+E E  V+  +++RVE MTGLTT T
Sbjct: 61  IKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEHEDVVVANVAKRVEVMTGLTTET 120

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQVVNYG+GGHY+PHYDFAR  E NAFKSLGTGNR+ATVLFYMSDVAQGGATVF  L
Sbjct: 121 AEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGGATVFPWL 180

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            ++L P KGTAA W NL+ SG+GD  TRHAACPVL GS
Sbjct: 181 GVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGS 218


>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
          Length = 537

 Score =  315 bits (806), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 157/295 (53%), Positives = 199/295 (67%), Gaps = 19/295 (6%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK------DEPPKVNNVA---PTLEVTEREKYEMLC 52
           I P HQRA GNK +Y++ L K   L        +P    N+    P   + ER+KYE LC
Sbjct: 237 IVPYHQRAIGNKKHYEDVLRKEGILLPIEMILTKPFNTANLKLKKPVDNLEERDKYEKLC 296

Query: 53  RGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           RG+  + P I   L+CRY+  NVP+  + P+K EEA L+P I++Y DVM D EI+ +KKM
Sbjct: 297 RGEKLMDPKIEGHLRCRYITNNVPFFFIQPIKMEEALLKPMIVVYHDVMSDDEIETVKKM 356

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PR +RAT++N KTGELE ANYRISKSAWL+  EH  I +++RRV  +TGL  STAE+L
Sbjct: 357 AKPRFKRATIRNSKTGELEPANYRISKSAWLKSEEHDHILKVTRRVGDITGLDMSTAEDL 416

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QVVNYGIGGHYEPH+D+AR     AFK LG GNR+AT LFYMSDV  GGATVF     ++
Sbjct: 417 QVVNYGIGGHYEPHFDYARTETTEAFKELGWGNRIATWLFYMSDVEAGGATVFPPTGAAV 476

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
           WP KG+AAFW+NL+ +G G+  TRHAACPVL+GS  + +            PCGL
Sbjct: 477 WPRKGSAAFWYNLYPNGKGNELTRHAACPVLSGSKWVSNRWIHEHRQEFRRPCGL 531


>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
           occidentalis]
          Length = 525

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 150/269 (55%), Positives = 184/269 (68%), Gaps = 6/269 (2%)

Query: 4   PTHQRAQGNKLYYQEALNKSP------ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
           P H RA GNK YY   L K+       + K+ P + +         ER  YE LCRG+  
Sbjct: 232 PDHPRASGNKRYYLSELGKNQSGEGRGDTKEAPVETHIKRQDSLSDERIMYERLCRGEPV 291

Query: 58  VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
             P +   L C Y H N PY+ L P K E  + +P + L+ D+M D EI  + +++ PRL
Sbjct: 292 EKPFLRKNLHCTYFHNNHPYMILQPSKLEVIHERPYLALFHDIMSDDEIQTVIELSAPRL 351

Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
           +RATVQN K+GELE+ANYRISKSAWL+  +H V+ER+S R E++TGLT  TAEELQVVNY
Sbjct: 352 KRATVQNAKSGELEVANYRISKSAWLKNHDHEVVERLSFRFEYLTGLTHLTAEELQVVNY 411

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGGHYE H+DFAR  E +AFK LGTGNR+AT + YMSDV  GGATVF  L L++WPEKG
Sbjct: 412 GIGGHYEAHFDFARRDEKDAFKQLGTGNRIATWINYMSDVKAGGATVFPRLGLTVWPEKG 471

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +AAFW NLH SG+GD  TRHAACPVL GS
Sbjct: 472 SAAFWWNLHRSGEGDILTRHAACPVLAGS 500


>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
 gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 539

 Score =  299 bits (766), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 152/298 (51%), Positives = 202/298 (67%), Gaps = 16/298 (5%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
           I P H RA+GN  +Y++ L     + D PP VN       + ER+ YE LCRG+  +PP 
Sbjct: 235 IAPNHPRAKGNVKWYEDMLQGKDMVGDLPPIVNKRVEFDGIVERDAYEALCRGE--IPPV 292

Query: 62  ---IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
                ++L+C Y+ R+ P+L++ P+K E     P  +L+++V+ DSEI++IK++A P+L+
Sbjct: 293 EEKWKSKLRC-YLKRDKPFLKIAPIKVEILRFDPLAVLFKNVISDSEIEVIKELASPKLK 351

Query: 119 RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYG 178
           RATVQN KTGELE A YRISKSAWL+    PVI+R++RR+E  TGL  +T+EELQV NYG
Sbjct: 352 RATVQNSKTGELEHATYRISKSAWLKGDLDPVIDRVNRRIEDFTGLNQATSEELQVANYG 411

Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           +GGHY+PH+DFAR  E NAFK+L TGNR+ATVLFYMS   +GGATVF  L  +++P K  
Sbjct: 412 LGGHYDPHFDFARKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVFNHLGTAVFPSKND 471

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHS-----TCPCGLRRGLQRSGI 286
           A FW+NL   G+GD  TRHAACPVL G    SN  +H      T PCGL  G+Q + I
Sbjct: 472 ALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQEFTRPCGLEEGVQENFI 529


>gi|112984520|ref|NP_001037195.1| prolyl 4-hydroxylase alpha subunit precursor [Bombyx mori]
 gi|37543673|gb|AAM21932.1| prolyl 4-hydroxylase alpha subunit [Bombyx mori]
          Length = 550

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 163/305 (53%), Positives = 201/305 (65%), Gaps = 25/305 (8%)

Query: 4   PTHQRAQGNKLYYQEAL-NKSPELK--------DEPPKVNNVAPTLE--VTEREKYEMLC 52
           P H RA+GN  +YQ+ +  +  ELK        DEP + +     L     ER+ YE LC
Sbjct: 238 PKHVRARGNIPHYQKTIAEQEAELKKQQRGETSDEPEEEDGQDYELSEYAKERKVYESLC 297

Query: 53  RGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           RG++ +P  I  +LKC YV    P+L+L P+K E+ Y++P I ++ +VM D EI+ IKK 
Sbjct: 298 RGEMEIPHEITKRLKCWYVTDTHPFLKLAPIKVEQMYVKPDIFMFHEVMTDDEIEFIKKR 357

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PR +RA V + KTGEL  A+YRISKS+WLR+ E PVI RI++RV  MTGL+   AEEL
Sbjct: 358 AKPRFKRAVVHDPKTGELTPAHYRISKSSWLRDEESPVIARITQRVTDMTGLSMLHAEEL 417

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QVVNYGIGGHYEPH+DFAR  E N F   G GNR+ATVLFYMSDVAQGGATVFT L LSL
Sbjct: 418 QVVNYGIGGHYEPHFDFARKRE-NPFTKFG-GNRIATVLFYMSDVAQGGATVFTELGLSL 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
           +P K  AAFW NLH+SG+GD  TRHAACPVL GS  + +            PC L    Q
Sbjct: 476 FPIKRAAAFWLNLHASGEGDLATRHAACPVLRGSKWVSNKWIHQGGQELLRPCDLE--YQ 533

Query: 283 RSGII 287
             GII
Sbjct: 534 EEGII 538


>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
 gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
          Length = 539

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 151/296 (51%), Positives = 200/296 (67%), Gaps = 12/296 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLT-VPP 60
           I P H RA+GN  +Y++ L     + D PP VN       + ER+ YE LCRG++  V P
Sbjct: 235 IAPNHPRAKGNVKWYEDMLQGKDMVGDLPPIVNKRVEYDGIVERDAYEALCRGEIPPVEP 294

Query: 61  AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
               +L+C Y+ R+ P+L+L P+K E     P  +L+++V++DSEI++IK++A P+L+RA
Sbjct: 295 KWKNKLRC-YLKRDKPFLKLAPIKVEILRFDPLAVLFKNVIHDSEIEVIKELASPKLKRA 353

Query: 121 TVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
           TVQN KTGELE A YRISKSAWL+    PVI+R++RR+E  T L  +T+EELQV NYG+G
Sbjct: 354 TVQNSKTGELEHATYRISKSAWLKGDLDPVIDRVNRRIEDFTNLNQATSEELQVANYGLG 413

Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
           GHY+PH+DFAR  E NAFK+L TGNR+ATVLFYMS   +GGATVF  L  +++P K  A 
Sbjct: 414 GHYDPHFDFARKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDAL 473

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHS-----TCPCGLRRGLQRSGI 286
           FW+NL   G+GD  TRHAACPVL G    SN  +H      T PCGL   +Q + I
Sbjct: 474 FWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHEKGQEFTRPCGLEEEVQENFI 529


>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
           gallus]
          Length = 536

 Score =  297 bits (761), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 149/273 (54%), Positives = 192/273 (70%), Gaps = 11/273 (4%)

Query: 4   PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
           P HQRA GN  Y++       EA   S + +D+  K   V     + ER KYEMLCRG+ 
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEVKKKDYLPERRKYEMLCRGEG 299

Query: 56  LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
           L + P    +L CRY   +RN  Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRLRRAT+ N  TG LE A+YRISKSAWL   E PV+ RI+ R++ +TGL  STAEELQ
Sbjct: 359 KPRLRRATISNPITGALETAHYRISKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+W
Sbjct: 419 VANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           P+KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 479 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511


>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
 gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
          Length = 534

 Score =  296 bits (759), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 147/274 (53%), Positives = 192/274 (70%), Gaps = 11/274 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL-KDEPPKVNNVAPTLE--------VTEREKYEMLCRG 54
           P HQRA GN  Y++  + K  E  K      +N   TL+        + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKEANKSASDDQSNQKTTLKKKGVAVDYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EID++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIDIVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ R++ R++ +TGL  STAEEL
Sbjct: 356 AKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRLNMRIQDLTGLDVSTAEEL 415

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASV 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score =  296 bits (758), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 155/332 (46%), Positives = 211/332 (63%), Gaps = 28/332 (8%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PC-----G 276
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+  + +            PC     G
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRPCTFVRIG 534

Query: 277 LRRGLQRSGIICTLVGMVITIRGMLPVLYSLD 308
           +   L      CTL+ ++ T   +    +++D
Sbjct: 535 MTNRLPFFSYCCTLMCLIYTFPSLNFQEFTID 566


>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
           gallus]
          Length = 536

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 148/273 (54%), Positives = 191/273 (69%), Gaps = 11/273 (4%)

Query: 4   PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
           P HQRA GN  Y++       EA   S + +D+  K   V     + ER KYEMLCRG+ 
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEVKKKDYLPERRKYEMLCRGEG 299

Query: 56  LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
           L + P    +L CRY   +RN  Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRLRRAT+ N  TG LE A+YRISKSAWL   E PV+ RI+ R++ +TGL  STAEELQ
Sbjct: 359 KPRLRRATISNPITGALETAHYRISKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+W
Sbjct: 419 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           P+KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 479 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511


>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
 gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
 gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
 gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASGDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score =  295 bits (754), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
           garnettii]
          Length = 534

 Score =  295 bits (754), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSSSDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
 gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
           troglodytes]
 gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
           troglodytes]
 gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
 gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
 gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
 gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
           sapiens]
 gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
           sapiens]
 gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
 gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score =  295 bits (754), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
          Length = 511

 Score =  295 bits (754), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 145/274 (52%), Positives = 193/274 (70%), Gaps = 11/274 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL-KDEPPKVNNVAPTLE--------VTEREKYEMLCRG 54
           P HQRA GN  Y++  + K  ++ K      ++   TL+        + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ RI+ R++ +TGL  STAEEL
Sbjct: 356 AKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASV 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Meleagris gallopavo]
          Length = 536

 Score =  294 bits (752), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 147/273 (53%), Positives = 190/273 (69%), Gaps = 11/273 (4%)

Query: 4   PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
           P HQRA GN  Y++       EA   S + +D+  K         + ER KYEMLCRG+ 
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEFKKKDYLPERRKYEMLCRGEG 299

Query: 56  LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
           L + P    +L CRY   +RN  Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRLRRAT+ N  TG LE A+YRISKSAWL   E PV+ RI+ R++ +TGL  STAEELQ
Sbjct: 359 KPRLRRATISNPITGALETAHYRISKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+W
Sbjct: 419 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           P+KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 479 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511


>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
           [Oryctolagus cuniculus]
          Length = 534

 Score =  293 bits (751), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 191/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  +           K   P+   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSASDGQSDKKTTPRRKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Loxodonta africana]
          Length = 534

 Score =  293 bits (750), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 144/274 (52%), Positives = 189/274 (68%), Gaps = 11/274 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE----LKDEPPKVNNVAPTLEVT-----EREKYEMLCRG 54
           P HQRA GN  Y++  + K  +      D P    +      V      ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMTKEKDSNKSTSDAPSDQKSTVKKKGVAADYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRI+ + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHDIISDAEIEVVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ RI+ R++ +TGL  STAEEL
Sbjct: 356 AKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPDVGASV 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score =  293 bits (750), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 145/275 (52%), Positives = 192/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLRRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
          Length = 507

 Score =  293 bits (749), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 190/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELK----DEP------PKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  +      D+P      PK   +A    + ER KYEMLCR
Sbjct: 209 PEHQRANGNLRYFEYIMTKEKDTNKSASDDPSDQKTTPKKKGIAVDY-LPERRKYEMLCR 267

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 268 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 327

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG LE  +YRISKSAWL   E PV+ RI+ R++ +TGL  STAEE
Sbjct: 328 LAKPRLRRATISNPITGNLETVHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 387

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AF+ LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 388 LQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 447

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 448 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 482


>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
           gallus]
          Length = 536

 Score =  293 bits (749), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 146/273 (53%), Positives = 192/273 (70%), Gaps = 11/273 (4%)

Query: 4   PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
           P HQRA GN  Y++       EA   S + +D+  K   V     + ER KYEMLCRG+ 
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEVKKKDYLPERRKYEMLCRGEG 299

Query: 56  LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
           L + P    +L CRY   +RN  Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRL RATV + +TG+L  A+YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEELQ
Sbjct: 359 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+W
Sbjct: 419 VANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           P+KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 479 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511


>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
          Length = 534

 Score =  293 bits (749), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 193/275 (70%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDE--PPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN +Y++  ++K        S +  D+   PK   +A    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTAPKKKGIAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG LE  +YRISKSAWL   E PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AF+ LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
 gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide [Mus musculus]
 gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
 gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
 gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
           musculus]
          Length = 534

 Score =  293 bits (749), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 193/275 (70%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDE--PPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN +Y++  ++K        S +  D+   PK   +A    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTAPKKKGIAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG LE  +YRISKSAWL   E PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AF+ LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
          Length = 561

 Score =  293 bits (749), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 193/275 (70%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDE--PPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN +Y++  ++K        S +  D+   PK   +A    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTAPKKKGIAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG LE  +YRISKSAWL   E PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AF+ LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
 gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
          Length = 534

 Score =  293 bits (749), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 144/274 (52%), Positives = 191/274 (69%), Gaps = 11/274 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
           P HQRA GN  Y++  + K        S +  D+   +      ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSSSDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEVVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ RI+ R++ +TGL  STAEEL
Sbjct: 356 AKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV  GGATVF  +  S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEVGASV 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Cricetulus griseus]
          Length = 534

 Score =  293 bits (749), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 190/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELK----DEP------PKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  +      D+P      PK   +A    + ER KYEMLCR
Sbjct: 236 PEHQRANGNLRYFEYIMTKEKDTNKSASDDPSDQKTTPKKKGIAVDY-LPERRKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG LE  +YRISKSAWL   E PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLRRATISNPITGNLETVHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AF+ LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
           [Rattus norvegicus]
          Length = 534

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 193/275 (70%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN +Y++  ++K        S +  D+   PK   +A    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTTPKKKGIAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG LE  +YRISKSAWL   E PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AF+ LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
           aries]
          Length = 534

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 144/274 (52%), Positives = 191/274 (69%), Gaps = 11/274 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
           P HQRA GN  Y++  + K        S +  D+   +      ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSSSDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ RI+ R++ +TGL  STAEEL
Sbjct: 356 AKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV  GGATVF  +  S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEVGASV 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 153/332 (46%), Positives = 210/332 (63%), Gaps = 28/332 (8%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PC-----G 276
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+  + +            PC     G
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRPCTFVRIG 534

Query: 277 LRRGLQRSGIICTLVGMVITIRGMLPVLYSLD 308
           +   L      CTL+ ++ T   +    +++D
Sbjct: 535 MTNRLPFFSYCCTLMCLIYTFPSLNFQEFTID 566


>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1
          Length = 516

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 145/273 (53%), Positives = 191/273 (69%), Gaps = 11/273 (4%)

Query: 4   PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
           P HQRA GN  Y++       EA   S + +D+  K   V     + ER KYEMLCRG+ 
Sbjct: 220 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTDKETEVKKKDYLPERRKYEMLCRGEG 279

Query: 56  LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
           L + P    +L CRY   +RN  Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 280 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 338

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRL RATV + +TG+L  A+YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEELQ
Sbjct: 339 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 398

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+W
Sbjct: 399 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 458

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           P+KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 459 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 491


>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
           gallus]
          Length = 536

 Score =  291 bits (745), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 145/273 (53%), Positives = 191/273 (69%), Gaps = 11/273 (4%)

Query: 4   PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
           P HQRA GN  Y++       EA   S + +D+  K   V     + ER KYEMLCRG+ 
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEVKKKDYLPERRKYEMLCRGEG 299

Query: 56  LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
           L + P    +L CRY   +RN  Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRL RATV + +TG+L  A+YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEELQ
Sbjct: 359 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+W
Sbjct: 419 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           P+KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 479 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511


>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
 gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Danio rerio]
          Length = 536

 Score =  291 bits (745), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 147/277 (53%), Positives = 192/277 (69%), Gaps = 15/277 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE------------VTEREKYEML 51
           P H RA GN  Y++  L K  + ++E  K  +    L+            + ER+KYE L
Sbjct: 236 PNHHRANGNLKYFEFQLEKQRKAENEK-KEEDQKRVLDKRDAQRKRSKDPLPERKKYERL 294

Query: 52  CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
           CRG+ + + P   ++L CRY + N  P L L P+K+E+ + +PRI+ Y +++ DSEI+ +
Sbjct: 295 CRGEGIKLTPRRQSRLFCRYSNNNRNPRLLLAPVKQEDEWDRPRIVRYHEIISDSEIETV 354

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K+MA+PRLRRAT+ N  TG LE A YRISKSAWL   EH  IERI++R+E +TGL   TA
Sbjct: 355 KEMAKPRLRRATISNPITGVLETAPYRISKSAWLSGYEHSTIERINQRIEDVTGLEMDTA 414

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           EELQV NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVFT + 
Sbjct: 415 EELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFTDVG 474

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 475 AAVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511


>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
           gallus]
          Length = 489

 Score =  291 bits (745), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 145/273 (53%), Positives = 191/273 (69%), Gaps = 11/273 (4%)

Query: 4   PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
           P HQRA GN  Y++       EA   S + +D+  K   V     + ER KYEMLCRG+ 
Sbjct: 193 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTDKETEVKKKDYLPERRKYEMLCRGEG 252

Query: 56  LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
           L + P    +L CRY   +RN  Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 253 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 311

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRL RATV + +TG+L  A+YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEELQ
Sbjct: 312 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 371

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+W
Sbjct: 372 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 431

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           P+KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 432 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 464


>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Monodelphis domestica]
          Length = 537

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 144/277 (51%), Positives = 186/277 (67%), Gaps = 14/277 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT------------EREKYEML 51
           P HQRA GN  Y++  + K  +      K  +  P  E              ER KYEML
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANTSTTKTADEQPEQETAPKRKGRAKDYLPERRKYEML 295

Query: 52  CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
           CRG+ L + P    +L CRY   N  P   L P K+E+ + +PRI+ + +++ D+EI+++
Sbjct: 296 CRGEGLKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHEIISDAEIEIV 355

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K +A+PRLRRAT+ N  TG LE A+YRISKSAWL   E PV+ RI+ R++ +TGL  STA
Sbjct: 356 KDLAKPRLRRATISNPITGVLETAHYRISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTA 415

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           EELQV NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  + 
Sbjct: 416 EELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVG 475

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            S+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 ASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 512


>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1 [Nomascus leucogenys]
          Length = 502

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 144/275 (52%), Positives = 191/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 204 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 262

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 263 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 322

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 323 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 382

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 383 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 442

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 443 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 477


>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
          Length = 534

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K        S +  D+   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSASDDQSDQKTTPKKKGVAADY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
 gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 144/275 (52%), Positives = 191/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASGDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score =  291 bits (744), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 144/275 (52%), Positives = 191/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score =  291 bits (744), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 144/275 (52%), Positives = 191/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
           garnettii]
          Length = 534

 Score =  291 bits (744), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 144/275 (52%), Positives = 191/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSSSDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
           jacchus]
          Length = 534

 Score =  291 bits (744), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 145/275 (52%), Positives = 192/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE-----LKDEP-----PKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  +     L D+      PK   +A    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSALDDQSDQKTTPKKKGIAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
           sapiens]
 gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
           troglodytes]
 gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I variant [Homo
           sapiens]
 gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
           sapiens]
 gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
           sapiens]
 gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score =  290 bits (743), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 144/275 (52%), Positives = 191/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 548

 Score =  290 bits (743), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 151/307 (49%), Positives = 199/307 (64%), Gaps = 33/307 (10%)

Query: 4   PTHQRAQGNKLYYQEALNKS-----------------PELKDEPPKVNNVAPTLE----V 42
           P HQRA+GN  Y++  L K                  P++ +E  K    + T      +
Sbjct: 238 PEHQRAKGNLKYFEFQLEKQRKDAEEETTKEKEEREEPDITEEKKKKKKKSQTKSTFQLI 297

Query: 43  TEREKYEMLCRGD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDV 100
            ER+KYEMLCRG+ + + P   ++L CRY   N  P   L P+K+++ + +P I+ Y D+
Sbjct: 298 PERKKYEMLCRGEGIKMTPRRQSRLFCRYYDNNHNPKYVLSPVKQQDEWDRPYIVRYIDI 357

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
           + D EI+ +KK+A+PRLRRAT+ N  TG LE A+YRISKSAWL   EHPVIE I++R+E 
Sbjct: 358 ISDKEIETVKKLAKPRLRRATISNPITGVLETASYRISKSAWLTGYEHPVIEIINQRIED 417

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
           +TGL   TAEELQV NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDVA G
Sbjct: 418 LTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVAAG 477

Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------- 273
           GATVF  +  ++WP+KGTA FW+NL ++G+GDY TRHAACPVL G+  + +         
Sbjct: 478 GATVFPDVGAAVWPQKGTAVFWYNLFANGEGDYSTRHAACPVLVGNKWVSNKWIHERGQE 537

Query: 274 ---PCGL 277
              PCGL
Sbjct: 538 WRRPCGL 544


>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
          Length = 488

 Score =  290 bits (743), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 144/276 (52%), Positives = 191/276 (69%), Gaps = 13/276 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 190 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 248

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 249 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 308

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 309 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 368

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 369 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 428

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+ 
Sbjct: 429 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNK 464


>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
          Length = 534

 Score =  290 bits (742), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 145/275 (52%), Positives = 193/275 (70%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN +Y++  ++K        S E  D+   PK   +A    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGERADQKTTPKKKGIAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AF+ LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
           melanoleuca]
          Length = 534

 Score =  290 bits (742), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 143/274 (52%), Positives = 192/274 (70%), Gaps = 11/274 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL-KDEPPKVNNVAPTLE--------VTEREKYEMLCRG 54
           P HQRA GN  Y++  + K  ++ K      ++   TL+        + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASV 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
          Length = 536

 Score =  290 bits (742), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 145/273 (53%), Positives = 190/273 (69%), Gaps = 11/273 (4%)

Query: 4   PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
           P HQRA GN  Y++       EA   S + +D+  K   V     + ER KYEMLCRG+ 
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDSEDQAEKETEVKKKDYLPERRKYEMLCRGEG 299

Query: 56  LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
           L + P    +L CRY   +RN  Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRL RATV + +TG+L  A+YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEELQ
Sbjct: 359 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+W
Sbjct: 419 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           P KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 479 PRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511


>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
           [Papio anubis]
          Length = 379

 Score =  290 bits (742), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 144/276 (52%), Positives = 191/276 (69%), Gaps = 13/276 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 81  PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 139

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 140 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 199

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 200 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 259

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 260 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 319

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+ 
Sbjct: 320 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNK 355


>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
 gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 534

 Score =  290 bits (742), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 144/275 (52%), Positives = 193/275 (70%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDE--PPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN +Y++  ++K        S +  D+   PK   +A    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTAPKKKGIAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AF+ LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
           lupus familiaris]
          Length = 534

 Score =  290 bits (742), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 142/274 (51%), Positives = 191/274 (69%), Gaps = 11/274 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
           P HQRA GN  Y++  + K        S +  D+   +      ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSASDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASV 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Cricetulus griseus]
          Length = 534

 Score =  290 bits (741), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 144/275 (52%), Positives = 190/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELK----DEP------PKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  +      D+P      PK   +A    + ER KYEMLCR
Sbjct: 236 PEHQRANGNLRYFEYIMTKEKDTNKSASDDPSDQKTTPKKKGIAVDY-LPERRKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AF+ LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Meleagris gallopavo]
          Length = 536

 Score =  290 bits (741), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 144/273 (52%), Positives = 190/273 (69%), Gaps = 11/273 (4%)

Query: 4   PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
           P HQRA GN  Y++       EA   S + +D+  K         + ER KYEMLCRG+ 
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEFKKKDYLPERRKYEMLCRGEG 299

Query: 56  LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
           L + P    +L CRY   +RN  Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRL RATV + +TG+L  A+YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEELQ
Sbjct: 359 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+W
Sbjct: 419 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           P+KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 479 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 511


>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
 gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
 gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
           [Rattus norvegicus]
          Length = 534

 Score =  290 bits (741), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 144/275 (52%), Positives = 193/275 (70%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN +Y++  ++K        S +  D+   PK   +A    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTTPKKKGIAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AF+ LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
 gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
          Length = 573

 Score =  290 bits (741), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 154/316 (48%), Positives = 200/316 (63%), Gaps = 34/316 (10%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
           I P H RA+GN  +Y++ L     + D PP VN       + ER+ YE LCRG+  +PP 
Sbjct: 251 IAPNHPRAKGNVKWYEDMLQGKDMVGDLPPIVNKRVEFDGIVERDAYEALCRGE--IPPV 308

Query: 62  ---IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
                 +L+C Y+ R+ P+L++ P+K E     P  +L+++V+ DSEI +IK++A P+L+
Sbjct: 309 EKKWKNKLRC-YLKRDKPFLKIAPIKVEILRFDPLAVLFKNVISDSEIKVIKELASPKLK 367

Query: 119 RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYG 178
           RATVQN KTGELE A YRISKSAWL+   HPVIER++RR+E  TGL   T+EELQV NYG
Sbjct: 368 RATVQNSKTGELEHATYRISKSAWLKGDLHPVIERVNRRIEDFTGLYQGTSEELQVANYG 427

Query: 179 IGGHYEPHYDFARPG------------------EANAFKSLGTGNRVATVLFYMSDVAQG 220
           +GGHY+PH+DFAR                    E NAFK+L TGNR+ATVLFYMS   +G
Sbjct: 428 LGGHYDPHFDFARIANYGLGGHYEPHYDMSLKEEKNAFKTLNTGNRIATVLFYMSQPERG 487

Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHS---- 271
           GATVF  L  +++P K  A FW+NL   G+GD  TRHAACPVL G    SN  +H     
Sbjct: 488 GATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQE 547

Query: 272 -TCPCGLRRGLQRSGI 286
            T PCGL  G+Q + I
Sbjct: 548 FTRPCGLEEGVQENFI 563


>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
           [Oryctolagus cuniculus]
          Length = 534

 Score =  289 bits (740), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 144/275 (52%), Positives = 190/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  +           K   P+   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSASDGQSDKKTTPRRKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 475 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Loxodonta africana]
          Length = 534

 Score =  289 bits (740), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 142/274 (51%), Positives = 188/274 (68%), Gaps = 11/274 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE----LKDEPPKVNNVAPTLEVT-----EREKYEMLCRG 54
           P HQRA GN  Y++  + K  +      D P    +      V      ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMTKEKDSNKSTSDAPSDQKSTVKKKGVAADYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRI+ + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHDIISDAEIEVVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPDVGASV 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
          Length = 545

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 143/281 (50%), Positives = 186/281 (66%), Gaps = 18/281 (6%)

Query: 2   IFPTHQRAQGNKLYYQEAL------NKSPELKDEP----------PKVNNVAPTLEVTER 45
           I P HQRA GNK +Y++ L       +  E+ DE            K+    P       
Sbjct: 240 IVPYHQRALGNKRHYEKLLRQLGVTERRGEIGDEDNIDMSEPFDTTKLKLTKPPGTTEHW 299

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           + YE LCRG+  + P +  +L+CRYV  NVPY  + P+K EEA L+PRI++Y D++ D E
Sbjct: 300 DVYEQLCRGEKLMDPKLEGRLRCRYVTNNVPYFYIQPIKMEEALLKPRIVVYHDIISDEE 359

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           I+ IK++AQPR  RATVQ  ++GE E + YRI+KSAWL+  EH  +  I+ RV  +TGL 
Sbjct: 360 IETIKRLAQPRFERATVQKKESGEREFSRYRIAKSAWLKHEEHDYVSDINFRVGDITGLD 419

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
            +T+E+LQV NYGIGGHYEPHYD+AR GE    +  G G R+AT LFYMSDV  GGATVF
Sbjct: 420 MATSEDLQVCNYGIGGHYEPHYDYARKGEVQ--QDFGWGGRIATWLFYMSDVEAGGATVF 477

Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             LNLSLWP+KG+AAFW NL+ +G+G+  T+HA CPVLTGS
Sbjct: 478 PKLNLSLWPQKGSAAFWFNLYPNGEGNEMTQHAGCPVLTGS 518


>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
           caballus]
          Length = 302

 Score =  289 bits (739), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 146/275 (53%), Positives = 192/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K        S +  D+   PK   VA    + ER+KYEMLCR
Sbjct: 4   PEHQRANGNLKYFEYIMAKEKDDNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 62

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 63  GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 122

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 123 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 182

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 183 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 242

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 243 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 277


>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
 gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
          Length = 526

 Score =  289 bits (739), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 144/275 (52%), Positives = 193/275 (70%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDE--PPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN +Y++  ++K        S +  D+   PK   +A    + ER+KYEMLCR
Sbjct: 228 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTAPKKKGIAVDY-LPERQKYEMLCR 286

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 287 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKY 346

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEE
Sbjct: 347 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 406

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AF+ LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 407 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 466

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 467 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 501


>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
          Length = 534

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 142/274 (51%), Positives = 190/274 (69%), Gaps = 11/274 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
           P HQRA GN  Y++  + K        S +  D+   +      ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSSSDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEVVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV  GGATVF  +  S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEVGASV 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
           porcellus]
          Length = 534

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 142/274 (51%), Positives = 191/274 (69%), Gaps = 11/274 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
           P HQRA GN  Y++  + K        S +  D+   +      ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDDNKSTSGDQSDQKSTLRKKGIAVDYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASV 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
           kowalevskii]
          Length = 533

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 146/282 (51%), Positives = 183/282 (64%), Gaps = 21/282 (7%)

Query: 1   MIFPTHQRAQGNKLYYQEALNK---------------SPELKDEPPKVNNVAPTLEVTER 45
           ++ P H R  GNK Y+++ L K                 E  +    +N+  P     ER
Sbjct: 231 LLDPEHVRGLGNKAYFEQELAKYNRQRGDDADVPGEEEKEFLESHKPLNDYLP-----ER 285

Query: 46  EKYEMLCRGD-LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           E YE LCRG+ + + P    +LKCR    N P+L L P KEE  + +P++I++ D +  +
Sbjct: 286 EAYEALCRGEQVKMSPQRQKKLKCRLRDYNRPFLILQPAKEEVVFDKPKLIIFHDAILTN 345

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           EI  +K +A PRLRRAT+QN  TG LE A YRISKSAWL E +  V+ R++ R+E  TGL
Sbjct: 346 EIRKVKALASPRLRRATIQNSVTGNLEFAEYRISKSAWLSEDDGDVVHRLNHRIEQYTGL 405

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
           T  TAEELQV NYG+GGHYEPH+DFAR  E NAFKSL TGNR+AT LFYMSDV  GGATV
Sbjct: 406 TMDTAEELQVANYGLGGHYEPHFDFARKEEINAFKSLNTGNRIATFLFYMSDVEAGGATV 465

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           F  +   L PEKG+AAFW+NL  +G+GDY TRHAACPVL GS
Sbjct: 466 FPQVGARLIPEKGSAAFWYNLLKNGEGDYSTRHAACPVLVGS 507


>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
           aries]
          Length = 534

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 142/274 (51%), Positives = 190/274 (69%), Gaps = 11/274 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
           P HQRA GN  Y++  + K        S +  D+   +      ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSSSDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV  GGATVF  +  S+
Sbjct: 416 QVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEVGASV 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 509


>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
          Length = 549

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 152/306 (49%), Positives = 197/306 (64%), Gaps = 30/306 (9%)

Query: 2   IFPTHQRAQGNKLYYQEALNK---SPEL-KDEPPKVNNVAP-------------TLEVTE 44
           I P HQRA GNK +Y++ L +    PE  K E   V    P             T  ++ 
Sbjct: 238 IVPYHQRAIGNKKHYEDVLRQLGVIPEHGKTEDSDVGMSEPFNTANLKLKKPPGTFGISN 297

Query: 45  R--EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMY 102
              +KYE LCRG+  + P I   L+CRYV  N PY  + PLK EEA+L+P +++Y DV++
Sbjct: 298 DHWDKYEKLCRGEKLMDPKIEGHLRCRYVTNNEPYFFIQPLKMEEAFLKPLLVIYHDVIF 357

Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
           D EI+ +KK+A PR +R TV N  TG+LE A YRISK+A+L+  EH  + ++SRRV  +T
Sbjct: 358 DEEIETVKKLAHPRFKRTTVMNSATGKLETAKYRISKAAFLKNKEHHHVLKMSRRVGAIT 417

Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAF-KSLGTGNRVATVLFYMSDVAQGG 221
           GL  STAE+LQV NYGIGGHYEPH+D+AR  E   F K  G  NR+AT LFYMSDV  GG
Sbjct: 418 GLDMSTAEDLQVCNYGIGGHYEPHFDYARKNETIGFNKDSGWRNRIATWLFYMSDVEAGG 477

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC-------- 273
           ATVF +LN++LWP+KG+AAFW+NL  +G+G+  TRHAACPVLTGS  + +          
Sbjct: 478 ATVFPALNVALWPQKGSAAFWYNLFPNGEGNELTRHAACPVLTGSKWVANKWIHEKNQEL 537

Query: 274 --PCGL 277
             PCGL
Sbjct: 538 RRPCGL 543


>gi|405965633|gb|EKC30995.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 617

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 144/279 (51%), Positives = 185/279 (66%), Gaps = 15/279 (5%)

Query: 2   IFPTHQRAQGNKLYYQEALNKS------------PELKDEPPKVNNVAPTLEV---TERE 46
           + P H RAQ N+ YY++ L +              E K E P      P  E     E +
Sbjct: 313 LLPHHTRAQNNRKYYEKLLEEQRRKQYRRGEDGGEEDKTEEPNKYTERPLDEYRKSDEFQ 372

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
            YE LCRG+ T    +  +LKCRYVH+N P L L P KEEE YL P I++Y DV+ D EI
Sbjct: 373 TYESLCRGEDTHDYKLKHKLKCRYVHKNNPRLLLKPAKEEEVYLNPWIVIYHDVVSDKEI 432

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
           D IK++A P L RATV N +TG+LE A YR+SKSAWL++ + PVI  ++ R+  +TGL+ 
Sbjct: 433 DTIKRIATPLLSRATVHNPRTGKLETAEYRVSKSAWLKDGDDPVIHNVNNRISDITGLSM 492

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
           +TAEELQ+ NYG+GG YEPH+DFAR  E  AF+ LG+GNR+AT L YM++V  GGATVFT
Sbjct: 493 ATAEELQIANYGLGGQYEPHFDFARREETEAFRDLGSGNRIATWLTYMTNVDAGGATVFT 552

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            + + L+P KG AAFW+NL+ SGDG + TRHAACPVL G
Sbjct: 553 HIGVKLFPIKGAAAFWYNLYRSGDGIFDTRHAACPVLVG 591


>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
           guttata]
          Length = 536

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 144/274 (52%), Positives = 190/274 (69%), Gaps = 11/274 (4%)

Query: 4   PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
           P HQRA GN  Y++       EA   S + +++  K   V     + ER KYEMLCRG+ 
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDSEEQQEKETEVKKKDYLPERRKYEMLCRGEG 299

Query: 56  LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
           L + P    +L CRY   +RN  Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRL RATV + +TG+L  A+YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEELQ
Sbjct: 359 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+W
Sbjct: 419 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 478

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           P KGTA FW+NL  SG+GDY TRHAACPVL G+ 
Sbjct: 479 PRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNK 512


>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
           anatinus]
          Length = 493

 Score =  287 bits (735), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 141/276 (51%), Positives = 186/276 (67%), Gaps = 14/276 (5%)

Query: 6   HQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT------------EREKYEMLCR 53
           HQRA GN  Y++  + K  +     P+ ++  P  E T            ER KYEMLCR
Sbjct: 194 HQRANGNLKYFEYIMAKEKDANKSTPQTSDDQPEQETTPKKKGRVKDYLPERRKYEMLCR 253

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRI+ Y +++ D+EI+ +K 
Sbjct: 254 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRYHEIISDAEIETVKD 313

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEE
Sbjct: 314 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 373

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 374 LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 433

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+ 
Sbjct: 434 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNK 469


>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Monodelphis domestica]
          Length = 537

 Score =  287 bits (734), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 141/277 (50%), Positives = 185/277 (66%), Gaps = 14/277 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT------------EREKYEML 51
           P HQRA GN  Y++  + K  +      K  +  P  E              ER KYEML
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANTSTTKTADEQPEQETAPKRKGRAKDYLPERRKYEML 295

Query: 52  CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
           CRG+ L + P    +L CRY   N  P   L P K+E+ + +PRI+ + +++ D+EI+++
Sbjct: 296 CRGEGLKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHEIISDAEIEIV 355

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K +A+PRL RATV + +TG+L  A YR+SKSAWL   E PV+ RI+ R++ +TGL  STA
Sbjct: 356 KDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTA 415

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           EELQV NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  + 
Sbjct: 416 EELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVG 475

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            S+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 476 ASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 512


>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
          Length = 454

 Score =  287 bits (734), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 143/275 (52%), Positives = 192/275 (69%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDE--PPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN +Y++  ++K        S +  D+   PK   +A    + ER+KYEMLCR
Sbjct: 156 PEHQRANGNLVYFEYIMSKEKDANKSASGDQSDQKTAPKKKGIAVDY-LPERQKYEMLCR 214

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+E +++K 
Sbjct: 215 GEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAENEIVKD 274

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEE
Sbjct: 275 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEE 334

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AF+ LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 335 LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 394

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 395 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 429


>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
           harrisii]
          Length = 385

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 141/278 (50%), Positives = 186/278 (66%), Gaps = 14/278 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE------------VTEREKYEML 51
           P HQRA GN  Y++  + K  +      K     P  E            ++ER KYEML
Sbjct: 84  PEHQRANGNLKYFEYIMAKEKDTNKSTTKSAADQPEQESAPKRKGRAKDYLSERRKYEML 143

Query: 52  CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
           CRG+ L + P    +L CRY   N  P   L P K+E+ + +PRI+ + +++ D+EI+++
Sbjct: 144 CRGEGLKMTPQRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHEIISDAEIEIV 203

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K +A+PRL RATV + +TG+L  A YR+SKSAWL   E PV+ RI+ R++ +TGL  STA
Sbjct: 204 KDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTA 263

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           EELQV NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  + 
Sbjct: 264 EELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVG 323

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            S+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+ 
Sbjct: 324 ASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNK 361


>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 523

 Score =  285 bits (730), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 138/270 (51%), Positives = 187/270 (69%), Gaps = 6/270 (2%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----KDEPPKVNNVAPTLEVTEREKYEMLCRGD-LTV 58
           PTHQRA GN  Y++  L+K  +     + E  +    A    + ER KYE LCRG    +
Sbjct: 230 PTHQRANGNLKYFEYQLSKQKKAVQMNESEEDQKGAQADDEYLLERRKYEQLCRGQGALM 289

Query: 59  PPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
            P  +++L CRY + +  P   + P+K+E+ +  P I+ Y DV  + E++ +K++A+PRL
Sbjct: 290 TPRRLSRLFCRYFNNHGHPNYLIGPVKQEDEWDSPYIVRYHDVASEKEMETVKELAKPRL 349

Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
           RRATV + +TG+L  A YR+SKSAWL   EHP+++RI++R+E +TGL  STAE+LQV NY
Sbjct: 350 RRATVHDPQTGKLTTAQYRVSKSAWLGSHEHPIVDRINQRIEDITGLDVSTAEDLQVANY 409

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           G+GG YEPH+DF R  EA+AF+ LGTGNR+AT L YMSDV  GG TVFT +   +WP+KG
Sbjct: 410 GVGGQYEPHFDFGRKDEADAFEELGTGNRIATWLLYMSDVQAGGNTVFTDIGAVVWPKKG 469

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           TA FW+NLH SG+GDY TRHAACPVL G+ 
Sbjct: 470 TAVFWYNLHRSGEGDYRTRHAACPVLVGNK 499


>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
          Length = 543

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 145/278 (52%), Positives = 193/278 (69%), Gaps = 15/278 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELK--------DEPP----KVNNVAPTLE-VTEREKYEM 50
           P HQRA GN  Y++  + K  E +        DE P    K     P+ + + ER+KYE 
Sbjct: 241 PGHQRANGNLKYFEYIMVKEKEKEANESVTDTDEQPGKKVKTQKRGPSKDYLPERQKYEK 300

Query: 51  LCRGD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           LCRG+ L + P    +L CRY + N  P   L P+++E+ + +PRI+ + D++ + EI+ 
Sbjct: 301 LCRGEGLKMTPRREKKLFCRYYNGNGNPNYILGPVRQEDEWDRPRIVRFLDIISNEEIEK 360

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           +K++++PRLRRAT+ N  TG LE A+YRISKSAWL   E+PV+ RI++R++ +TGL  ST
Sbjct: 361 VKELSKPRLRRATISNPITGVLETAHYRISKSAWLSGYENPVVARINQRIQDLTGLDVST 420

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQV NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDVA GGATVF  +
Sbjct: 421 AEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPEV 480

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             S+WP+KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 481 GASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 518


>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
           niloticus]
          Length = 536

 Score =  285 bits (728), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 150/299 (50%), Positives = 197/299 (65%), Gaps = 25/299 (8%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKD-----EPPKVNNVA------PTLEVTEREKYEM 50
           I P+HQRA GN  Y+++ L K  +L++     +PP    +       P   + ERE YE 
Sbjct: 236 IDPSHQRAGGNLRYFEQLLMK--QLREMNQDYQPPSEEPIQLGTYSRPKDHLPERESYEA 293

Query: 51  LCRGD-LTVPPAIVAQLKCRYVH-RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           LCRG+ + +  A  ++L CRY   +  P+L L P+KEE+ +  P I+ Y D++ D EI+ 
Sbjct: 294 LCRGEGIQMTEARRSRLFCRYHDGKRNPHLLLKPVKEEDEWDSPHIVRYLDLLSDEEIEK 353

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK++A+PRL RATV++ KTG L  ANYR+SKSAWL   E PVI+R+++R+E +TGLT  T
Sbjct: 354 IKELAKPRLARATVRDPKTGVLTTANYRVSKSAWLEGEEDPVIDRVNQRIEAITGLTVET 413

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AE LQV NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF   
Sbjct: 414 AELLQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF 473

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
             ++WP KGT+ FW+NL  SG+GDY TRHAACPVL GS  + +            PCGL
Sbjct: 474 GAAIWPRKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIHERGQEFRRPCGL 532


>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 531

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 139/266 (52%), Positives = 189/266 (71%), Gaps = 4/266 (1%)

Query: 5   THQRAQGNKLYYQEALNKSPEL--KDEPPKVNNVAPTLEVTEREKYEMLCRGD-LTVPPA 61
           THQRA GN+ Y++  L K  ++   ++  +  N  P    +ER+KYE LCRG+ L +   
Sbjct: 241 THQRATGNRKYFEYQLAKQNKVAQSEQGGRDENHQPNDYRSERKKYEQLCRGEGLKMTAR 300

Query: 62  IVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
             +QL CRY      P   + P+K+E+ + +P I+ Y D++ + E++ +K++A+PRLRRA
Sbjct: 301 RQSQLFCRYYDNGRHPKYVIGPVKQEDEWDRPHIVRYHDILSNREMETVKELAKPRLRRA 360

Query: 121 TVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
           TV + +TG+L  A YR+SKSAWL   EHPV++RI++R+E +TGL  STAE+LQV NYG+G
Sbjct: 361 TVHDPQTGQLTTAPYRVSKSAWLGAFEHPVVDRINQRIEDITGLDVSTAEDLQVANYGVG 420

Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
           G YEPHYDF R  E +AFK LGTGNR+AT L YMS+V  GGATVFT +  S+ P+KG+A 
Sbjct: 421 GQYEPHYDFGRKDEPDAFKELGTGNRIATWLLYMSEVQAGGATVFTDIGASVSPKKGSAV 480

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGS 266
           FW+NLH SGDGDY TRHAACPVL G+
Sbjct: 481 FWYNLHPSGDGDYRTRHAACPVLLGN 506


>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
           niloticus]
          Length = 615

 Score =  283 bits (725), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 148/305 (48%), Positives = 198/305 (64%), Gaps = 31/305 (10%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL-KDEPPKVNNVAP--TLE----------------VTE 44
           P H R + N  Y++  L K  +  ++E PK        T E                + E
Sbjct: 307 PEHPRGKSNLKYFEFQLEKQKKAAEEEAPKQKEREKRETAEKKKKKKQKKSKKAFSLIPE 366

Query: 45  REKYEMLCRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMY 102
           REKYEMLCRG+ + + P   ++L CRY   N  P L L P+K+++ + +P I+ Y D++ 
Sbjct: 367 REKYEMLCRGEGIKMTPRRQSRLFCRYYDNNRNPSLLLAPVKQQDEWDRPYIVRYLDIIS 426

Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
           D+EI+ +K++A+PRLRRAT+ N  TG LE A+YRISKSAWL E + P+IE+I+ R+E +T
Sbjct: 427 DAEIERVKQLAKPRLRRATISNPITGVLETASYRISKSAWLTEYDDPMIEKINDRIEGVT 486

Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
           GL   TAEELQV NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGA
Sbjct: 487 GLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGA 546

Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC--------- 273
           TVF  +  ++WP+KGTA FW+NL +SG+GDY TRHAACPVL G+  + +           
Sbjct: 547 TVFPDVGAAVWPQKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEWR 606

Query: 274 -PCGL 277
            PCGL
Sbjct: 607 RPCGL 611


>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
          Length = 541

 Score =  283 bits (725), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 147/295 (49%), Positives = 194/295 (65%), Gaps = 20/295 (6%)

Query: 4   PTHQRAQGNKLYY------QEALNKSPELKDEPPKVNNVAPTLE--VTEREKYEMLCRGD 55
           P HQRA GN  Y+      Q+   K    K+E  K    +   +  + E+ KYE LCRG+
Sbjct: 244 PEHQRALGNLKYFDYQLAKQKKAEKEQSTKEESKKEQETSDGKKEYLPEKRKYEKLCRGE 303

Query: 56  -LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            L + P     L CRY + N  P+  + P+K+E+ + +PRII Y +++ + EI+ IK+++
Sbjct: 304 GLRMTPRRQKHLFCRYFNGNRHPFYTIGPVKQEDEWDRPRIIRYHEIITEQEIEKIKELS 363

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRLRRAT+ N  TG LE A+YRISKSAWL   EHPV++RI++R+E +TGL   TAEELQ
Sbjct: 364 KPRLRRATISNPITGVLETAHYRISKSAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQ 423

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDVA GGATVF  +  ++ 
Sbjct: 424 VANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPEVGAAVK 483

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLR 278
           P KGTA FW+NL  SG+GDY TRHAACPVL G+  + +            PCGL+
Sbjct: 484 PLKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRPCGLK 538


>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
           carolinensis]
          Length = 542

 Score =  283 bits (725), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 141/278 (50%), Positives = 193/278 (69%), Gaps = 16/278 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE------LKDEPPKVNNVAPTLE------VTEREKYEML 51
           P HQRA GN  Y++  ++K  E      L +   K    + + +      + ER+KYEML
Sbjct: 241 PEHQRANGNLKYFEYIMSKEKEKEANKSLSETDEKTGKESKSKKGPSKDYLPERQKYEML 300

Query: 52  CRGD-LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           CRG+ L + P    +L CRY   +RN  Y+ L P+K+E+ + +PRI+ + +++ D EI+ 
Sbjct: 301 CRGEGLKMTPRRQKKLFCRYYDGNRNPKYI-LRPVKQEDEWDRPRIVRFVEIISDEEIET 359

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           +K++A+PRL RATV + +TG+L  A+YR+SKSAWL   E+P++ RI+ R++ +TGL  ST
Sbjct: 360 VKELAKPRLSRATVHDPQTGKLTTAHYRVSKSAWLSGYENPIVARINTRIQDLTGLDVST 419

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AEELQV NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +
Sbjct: 420 AEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV 479

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             S+WP KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 480 GASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 517


>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
           musculus]
 gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
           musculus]
          Length = 537

 Score =  283 bits (724), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 144/272 (52%), Positives = 188/272 (69%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L +      S +         N+   PT  + ER+ YE LCRG+
Sbjct: 240 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLATQENLYERPTDYLPERDVYESLCRGE 299

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 300 GVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 359

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 360 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 419

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 420 VANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 479

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 480 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 511


>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 506

 Score =  283 bits (723), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 144/272 (52%), Positives = 188/272 (69%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L +      S +         N+   PT  + ER+ YE LCRG+
Sbjct: 209 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLATQENLYERPTDYLPERDVYESLCRGE 268

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 269 GVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 328

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 329 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 388

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 389 VANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 448

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 449 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 480


>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
           boliviensis boliviensis]
 gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
           boliviensis boliviensis]
          Length = 535

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 143/272 (52%), Positives = 189/272 (69%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++ L +  E  L ++       P+     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRNDERDAFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
           catus]
          Length = 535

 Score =  282 bits (722), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 142/272 (52%), Positives = 186/272 (68%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVA--------PTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++ L +  E          +A        P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAGLATQESIYERPVDYLPERDIYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
 gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
 gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
          Length = 537

 Score =  282 bits (721), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 143/272 (52%), Positives = 188/272 (69%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L +      S +         N+   PT  + ER+ YE LCRG+
Sbjct: 240 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLATQENLYERPTDYLPERDVYESLCRGE 299

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 300 GVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 359

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 360 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 419

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  + +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 420 VANYGMGGQYEPHFDFSRSDDEDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 479

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 480 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 511


>gi|351706369|gb|EHB09288.1| Prolyl 4-hydroxylase subunit alpha-2 [Heterocephalus glaber]
          Length = 535

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 145/276 (52%), Positives = 185/276 (67%), Gaps = 18/276 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTE------------REKYEML 51
           P+H+RA GN  Y++  L    E +   P  N    TL   E            RE YE L
Sbjct: 238 PSHERAGGNLRYFERLL----EEERRKPLSNQTEATLAAQEGVYDRPMDYLPEREVYESL 293

Query: 52  CRGD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
           CRG+ + + P    +L CRY H N  P L + P KEE+ +  P I+ Y +VM D EID I
Sbjct: 294 CRGEGVKLTPQRQKRLFCRYHHGNRAPELLIAPFKEEDEWDSPHIVRYYNVMSDEEIDRI 353

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++++TGLT  TA
Sbjct: 354 KELAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQYITGLTVQTA 413

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           E LQV NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L 
Sbjct: 414 ELLQVANYGMGGQYEPHFDFSRNHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLG 473

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            +LWP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 474 AALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
 gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_a [Rattus norvegicus]
          Length = 535

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 143/272 (52%), Positives = 187/272 (68%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L +      S +         N+   P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLASQENLYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GIKMTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
          Length = 541

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 145/301 (48%), Positives = 192/301 (63%), Gaps = 17/301 (5%)

Query: 2   IFPTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPT--LEVTEREKYEMLCRGD 55
           I P H RA+ N  +Y++ L     K  + +   P V N  PT  LE  E + YE LCR +
Sbjct: 237 IDPNHPRAKNNIKWYEDLLAEEGLKPIDYRRNIPPVTNPRPTTGLETAEHDIYEALCRNE 296

Query: 56  LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
           + V   + ++L C Y   + P+LRL P K E     P  +L+RDV+ D E+ +I+ +A P
Sbjct: 297 IPVSIKVTSKLYC-YYKMDRPFLRLAPFKVEILRFNPLAVLFRDVITDEEVTMIQMLATP 355

Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
           RLRRATVQN  TGELE A+YR SKSAWL++ EH V+ RI++R++ MT L   T+EELQV 
Sbjct: 356 RLRRATVQNSITGELETASYRTSKSAWLKDEEHEVVHRINKRIDLMTNLEQETSEELQVG 415

Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
           NYGIGGHY+PH+DFAR  E NAF+SL TGNR+AT+LFYM+    GGATVFT +  ++ P 
Sbjct: 416 NYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVFTEVKTTVMPS 475

Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQRSG 285
           K  A FW+NL  SG+GD  TRHAACPVLTG+  + +            PCGL R ++   
Sbjct: 476 KNDALFWYNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQEFRRPCGLSRSVEEQF 535

Query: 286 I 286
           +
Sbjct: 536 V 536


>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
          Length = 541

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 149/306 (48%), Positives = 194/306 (63%), Gaps = 32/306 (10%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKD-------------EPPKVNNVAPT-------LE 41
           I P HQR   N  YY+E L++  E++              EP   + +  T       + 
Sbjct: 232 IVPFHQRGLSNIQYYREILHQQGEIQFQQQHETAGANSTIEPFNTSKLKLTKPSGTAGIP 291

Query: 42  VTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVM 101
             +  KYE LCRG+  + P I A+L+CRYV  NVPY  + P+K E A L+PR+++Y +V+
Sbjct: 292 AEQWNKYERLCRGEKLMDPKIEARLRCRYVTNNVPYFFIQPIKMELASLKPRLVIYHNVV 351

Query: 102 YDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHM 161
            D EI+  KK+AQ RLRR+TVQN  TG  E   YRI+K+A+L+  EH  I +++RR+  +
Sbjct: 352 TDEEIETAKKLAQSRLRRSTVQNSLTGASEPTKYRIAKAAFLQNSEHDHIVKMTRRIGDV 411

Query: 162 TGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
           TGL  +TAEELQV NYGIGGHYEPHYD AR GE    K  G GNR+AT +FYMSDV  GG
Sbjct: 412 TGLDMTTAEELQVCNYGIGGHYEPHYDHARKGEVQ--KDFGWGNRIATWMFYMSDVEAGG 469

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC-------- 273
           ATVF  +NL+LWP+KG+AAFW NLH +G+GD  T+HAACPVLTGS  + +          
Sbjct: 470 ATVFPQINLALWPQKGSAAFWFNLHPNGEGDDLTQHAACPVLTGSKWVSNKWIHERNQEF 529

Query: 274 --PCGL 277
             PCGL
Sbjct: 530 RRPCGL 535


>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
           abelii]
          Length = 535

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 141/272 (51%), Positives = 186/272 (68%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L+   E +   P+     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E + FK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
           troglodytes]
 gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
           troglodytes]
 gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
           paniscus]
 gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
           paniscus]
 gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
          Length = 535

 Score =  281 bits (718), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 141/272 (51%), Positives = 186/272 (68%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L+   E +   P+     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDIYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E + FK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
           leucogenys]
          Length = 537

 Score =  281 bits (718), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 142/272 (52%), Positives = 188/272 (69%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++ L +  E  L ++       P+     P   + ER+ YE LCRG+
Sbjct: 240 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 299

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 300 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 359

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 360 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 419

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E + FK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 420 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 479

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 480 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 511


>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_c [Rattus norvegicus]
          Length = 506

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 143/272 (52%), Positives = 187/272 (68%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L +      S +         N+   P   + ER+ YE LCRG+
Sbjct: 209 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLASQENLYERPVDYLPERDVYESLCRGE 268

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 269 GIKMTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 328

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 329 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 388

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 389 VANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 448

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 449 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 480


>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide precursor [Salmo
           salar]
 gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
          Length = 545

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 141/277 (50%), Positives = 186/277 (67%), Gaps = 16/277 (5%)

Query: 6   HQRAQGNKLYYQEALNKSPELKDEP--------------PKVNNVAPTLEVTEREKYEML 51
           HQRA GN  Y++  L K  +++ E                      P   + ER KYE L
Sbjct: 244 HQRANGNLKYFEYQLAKQKKVEAEEGLKEKEKREREKREASEKKGRPADYLPERRKYEQL 303

Query: 52  CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
           CRG+ + + P   +++ CRY   N  P   L P+K+E+ + +PRII Y DV+ +SEI+ +
Sbjct: 304 CRGEGIKMTPRRQSRMFCRYSDNNRHPLYVLGPVKQEDEWDRPRIIRYHDVLSNSEIEKV 363

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K++A+PRLRRAT+ N  TG LE A+YRISKSAWL   E PV+++I++R+E +TGL   TA
Sbjct: 364 KELAKPRLRRATISNPITGVLETAHYRISKSAWLTAYEDPVVDKINQRIEDITGLNVKTA 423

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           EELQV NYG+GG YEPH+DF R  E +AFK LGTGNR+AT L YMSDV  GGATVFT + 
Sbjct: 424 EELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLIYMSDVPSGGATVFTDVG 483

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            ++WP+KG+A FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 484 AAVWPKKGSAVFWYNLFPSGEGDYSTRHAACPVLVGN 520


>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 577

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 141/272 (51%), Positives = 186/272 (68%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L+   E +   P+     P   + ER+ YE LCRG+
Sbjct: 280 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 339

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 340 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 399

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 400 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 459

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E + FK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 460 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 519

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 520 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 551


>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
           sapiens]
 gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
           sapiens]
 gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
 gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
 gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_b
           [Homo sapiens]
 gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_b
           [Homo sapiens]
          Length = 535

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 141/272 (51%), Positives = 185/272 (68%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L    E +   P+     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E + FK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
           leucogenys]
          Length = 558

 Score =  280 bits (717), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 142/272 (52%), Positives = 188/272 (69%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++ L +  E  L ++       P+     P   + ER+ YE LCRG+
Sbjct: 261 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 320

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 321 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 380

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 381 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 440

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E + FK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 441 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 500

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 501 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 532


>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_f
           [Homo sapiens]
          Length = 567

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 141/272 (51%), Positives = 185/272 (68%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L    E +   P+     P   + ER+ YE LCRG+
Sbjct: 270 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 329

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 330 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 389

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 390 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 449

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E + FK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 450 VANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 509

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 510 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 541


>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Loxodonta africana]
          Length = 536

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 142/273 (52%), Positives = 186/273 (68%), Gaps = 11/273 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNN---VAPTLEVTEREKYEMLCRG 54
           P+H+RA GN  Y++  L +      S +  D  P         P   + ER+ YE LCRG
Sbjct: 238 PSHERAGGNLRYFEHLLEEERKKTLSNQTMDAEPATREGIYERPVDYLPERDVYESLCRG 297

Query: 55  D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++
Sbjct: 298 EGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKQI 357

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ +++RR++H+TGLT  TAE L
Sbjct: 358 AKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAQVNRRMQHITGLTVKTAELL 417

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++
Sbjct: 418 QVANYGMGGQYEPHFDFSRSHEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAI 477

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 510


>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
 gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
 gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
           mulatta]
          Length = 535

 Score =  280 bits (716), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 142/272 (52%), Positives = 188/272 (69%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++ L +  E  L ++       P+     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E + FK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRNDERHTFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 555

 Score =  280 bits (716), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 146/305 (47%), Positives = 197/305 (64%), Gaps = 32/305 (10%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE-------------------VTE 44
           P HQRA GN+ Y++  L K  E +DE  +        E                   + E
Sbjct: 243 PEHQRANGNQKYFEFQLEKQ-EKQDETAEKETQQQDREKRDTTQKKKKKQSQKSLSLIPE 301

Query: 45  REKYEMLCRGD-LTVPPAIVAQLKCRYV-HRNVPYLRLMPLKEEEAYLQPRIILYRDVMY 102
           R+KYEMLCRG+ + +     ++L CRY  +++ P   L P+K+++ + +P I+ Y D++ 
Sbjct: 302 RKKYEMLCRGEGVRMTSRRQSRLFCRYYDNKHNPRFVLAPVKQQDEWDRPYIVRYIDIIS 361

Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
           ++E+D IK++A+PRLRRAT+ N  TG LE A YRISKSAWL   E PV+E+I++R+E +T
Sbjct: 362 EAEMDKIKQLAKPRLRRATISNPVTGVLETAPYRISKSAWLTAYEDPVVEKINQRIEDLT 421

Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
           GL   TAEELQV NYG+GG YEPH+DF R  E +AFK LGTGNR+AT LFYMSDV+ GGA
Sbjct: 422 GLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGA 481

Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC--------- 273
           TVF  +  S+ P+KGTA FW+NL +SG+GDY TRHAACPVL G+  + +           
Sbjct: 482 TVFPDVGASVGPQKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEWR 541

Query: 274 -PCGL 277
            PCGL
Sbjct: 542 RPCGL 546


>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
          Length = 541

 Score =  280 bits (716), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 145/301 (48%), Positives = 191/301 (63%), Gaps = 17/301 (5%)

Query: 2   IFPTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPT--LEVTEREKYEMLCRGD 55
           I P H RA+ N  +Y++ L     K  + +   P V N  P   LE  E + YE LCR +
Sbjct: 237 IDPNHPRAKNNIKWYEDLLAEEGLKPIDYRRNIPPVTNPRPKTGLETAEHDIYEALCRNE 296

Query: 56  LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
           + V   + ++L C Y   + P+LRL P K E     P  +L+RDV+ D EI +I+ +A P
Sbjct: 297 IPVSIKVTSKLYC-YYKMDRPFLRLAPFKVEILRFNPLAVLFRDVITDEEITMIQMLATP 355

Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
           RLRRATVQN  TGELE A+YR SKSAWL++ EH V+ RI++R++ MT L   T+EELQV 
Sbjct: 356 RLRRATVQNSITGELETASYRTSKSAWLKDEEHEVVHRINKRIDLMTNLEQETSEELQVG 415

Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
           NYGIGGHY+PH+DFAR  E NAF+SL TGNR+AT+LFYM+    GGATVFT +  ++ P 
Sbjct: 416 NYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVFTEVKTTVMPS 475

Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQRSG 285
           K  A FW+NL  SG+GD  TRHAACPVLTG+  + +            PCGL R ++   
Sbjct: 476 KNDALFWYNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQEFRRPCGLSRSVEEQF 535

Query: 286 I 286
           +
Sbjct: 536 V 536


>gi|348557542|ref|XP_003464578.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Cavia porcellus]
          Length = 535

 Score =  280 bits (715), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 143/272 (52%), Positives = 186/272 (68%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQEALN--KSPELKDEPPKVNNVA------PTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L   +   L ++   V          P+  + ERE YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEERGKLLSNQTEAVLAAQEGIYERPSDYLPEREVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GIKLTPQRRKRLFCRYHHGNRAPELLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++ +TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEEDDPVVARVNRRMQQITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  +LW
Sbjct: 418 VANYGMGGQYEPHFDFSRSHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAALW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
          Length = 535

 Score =  279 bits (713), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 142/272 (52%), Positives = 184/272 (67%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P H+RA GN  Y++        + L+   E     P      P   + ER+ YE LCRG+
Sbjct: 238 PGHERAGGNLRYFERLLEEEREKMLSNHTEAGPSTPGGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
          Length = 534

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 141/289 (48%), Positives = 196/289 (67%), Gaps = 21/289 (7%)

Query: 4   PTHQRAQGNKLYYQEAL-NKSPELKDEPP----KVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P H RA+GN  +Y++ L + + ++ + PP    ++++  P     ER+ YE LCRG+  V
Sbjct: 238 PDHPRAKGNVKWYEDMLEDDNKDISELPPLKLERLDDGIP-----ERDVYEALCRGEQKV 292

Query: 59  PPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
                +++ C Y+  + P+L+L P+K E     P ++L++ V+ D EI++I+K+A P+L+
Sbjct: 293 NVTAQSEVYC-YLKMDRPFLKLAPIKVEILRFSPLVVLFKQVISDYEIEVIEKLAIPKLK 351

Query: 119 RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYG 178
           RATVQN +TG+LE ANYRISKSAWL+  +HP I+RI++R++ MT L   TAEELQ  NYG
Sbjct: 352 RATVQNARTGDLEYANYRISKSAWLKGTDHPAIDRINKRIDLMTNLNQETAEELQAQNYG 411

Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           IGGHY+PH+DFAR  + NAFK+L TGNR+AT+L YMSDV  GGATVF  L  +++P K  
Sbjct: 412 IGGHYDPHFDFARKEDINAFKTLNTGNRIATILIYMSDVESGGATVFNHLGNAVFPSKYD 471

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHSTC-----PCGL 277
           A FW+NL   G+GD  TRHAACPVLTG    SN  +H        PCGL
Sbjct: 472 ALFWYNLRRDGEGDLRTRHAACPVLTGIKWVSNKWIHDRGQEFRRPCGL 520


>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
           garnettii]
          Length = 540

 Score =  278 bits (711), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 141/272 (51%), Positives = 184/272 (67%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVA--------PTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L +  E          +A        P   + ERE YE LCRG+
Sbjct: 243 PSHERAGGNLRYFEHLLEEEREKMLSNKTEAELATQEGIYERPVDYLPEREVYESLCRGE 302

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 303 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 362

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGL+  TAE LQ
Sbjct: 363 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQ 422

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 423 VANYGVGGQYEPHFDFSRNHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 482

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 483 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 514


>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 3 [Oryctolagus
           cuniculus]
          Length = 535

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 144/273 (52%), Positives = 186/273 (68%), Gaps = 12/273 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE---------VTEREKYEMLCRG 54
           P+H+RA GN  Y++  L +    K    +   VA T E         + ER+ YE LCRG
Sbjct: 238 PSHERAGGNLRYFERLLEEQ-RGKSLLNQTEAVAVTQEGIYERPVDYLPERDVYESLCRG 296

Query: 55  D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++
Sbjct: 297 EGVKLTPRRQKRLFCRYHDGNGAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEI 356

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ RI+RR++H+TGLT  TAE L
Sbjct: 357 AKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARINRRMQHITGLTVKTAELL 416

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++
Sbjct: 417 QVANYGMGGQYEPHFDFSRNNERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAI 476

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 477 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
 gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
          Length = 538

 Score =  277 bits (709), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 150/298 (50%), Positives = 196/298 (65%), Gaps = 22/298 (7%)

Query: 2   IFPTHQRAQGNKLYYQEALNKS-PELK--------DEPPKVNNVA-PTLEVTEREKYEML 51
           I  +HQRA GN  Y+++ L+K   EL         +EP ++     P   + ERE YE L
Sbjct: 237 IDSSHQRAGGNLRYFEKLLSKQLKELNQEVQEPATEEPIQLGTYKRPKDYLPEREIYEGL 296

Query: 52  CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
           CRG+ + +     ++L CRY   N  P L L P+KEE+ +  P I+ Y + + DSEI+ I
Sbjct: 297 CRGEGVKMTSERRSRLYCRYHDGNRNPRLLLQPMKEEDEWDSPHIVRYLNALSDSEIEKI 356

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K++A+PRL RATV++ KTG L  ANYR+SKSAWL   E PVIER+++R+E +TGLTT TA
Sbjct: 357 KELAKPRLARATVRDPKTGVLTTANYRVSKSAWLEGEEDPVIERVNQRIEDITGLTTQTA 416

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           E LQ+ NYG+GG YEPH+DF+R  E +AFK+LGTGNRVAT L YMSDV  GGATVF    
Sbjct: 417 ELLQIANYGVGGQYEPHFDFSRKDEPDAFKTLGTGNRVATFLNYMSDVEAGGATVFPDFG 476

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
            +++P+KGTA FW+NL  SG+GDY TRHAACPVL G   + +            PCGL
Sbjct: 477 AAIYPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWIHERGQEFRRPCGL 534


>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Cricetulus griseus]
          Length = 535

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 141/272 (51%), Positives = 185/272 (68%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++        ++L    E      +     P   + ER+  E LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKSLFNQTEAGLATQENVYERPVDFLPERDVLESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPQRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
           rubripes]
          Length = 538

 Score =  276 bits (707), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 147/298 (49%), Positives = 194/298 (65%), Gaps = 21/298 (7%)

Query: 1   MIFPTHQRAQGNKLYYQEALNKS-PELK-------DEPPKVNNVA-PTLEVTEREKYEML 51
           +I  +H+RA GN  YY+  L K   EL        +EP ++   + P   + ERE YE L
Sbjct: 237 VIDSSHERAGGNLRYYENLLRKQLSELNQDYEPASEEPIQLGTYSRPKDHLPEREAYEAL 296

Query: 52  CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
           CRG+ L +  A  ++L CRY   N  P+L L P+KEE+ +  P I+ Y D + + EI+ I
Sbjct: 297 CRGEGLQMNEARRSRLFCRYQDGNRNPHLLLKPIKEEDEWDSPNIVRYLDFLSNEEIEKI 356

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K++A+P+L RATV++ K+G L  A+YR+SKSAWL   E P+I R+++R+E +TGLT  TA
Sbjct: 357 KELAKPKLARATVRDPKSGVLTTASYRVSKSAWLEGEEDPIIARVNQRIEDLTGLTVKTA 416

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           E LQV NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF    
Sbjct: 417 ELLQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDFG 476

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
            ++WP KGTA FW+NL  SG+GDY TRHAACPVL G+  + +            PCGL
Sbjct: 477 AAIWPRKGTAVFWYNLFKSGEGDYRTRHAACPVLVGNKWVSNKWIHERGQEFRRPCGL 534


>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 542

 Score =  276 bits (707), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 143/297 (48%), Positives = 188/297 (63%), Gaps = 17/297 (5%)

Query: 2   IFPTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPT--LEVTEREKYEMLCRGD 55
           I P H RA+ N  +Y++ L     K  + +   P V N  P   L+ TE + YE LCR +
Sbjct: 238 IDPNHPRARNNIKWYEDLLAEDGVKPIDYRRNIPPVTNPRPKNGLKTTEHDMYEALCRNE 297

Query: 56  LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
           + V     ++L C Y   + P+LRL P K E     P  + +RDV+ D E+ +I+ +A P
Sbjct: 298 VPVSVKATSKLYC-YYKMDRPFLRLAPFKVEILRFSPLAVFFRDVITDEEVTIIQMLATP 356

Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
           RLRRATVQN  TGELE A+YR SKSAWL++ EH ++ RI+RR++ MT L   T+EELQV 
Sbjct: 357 RLRRATVQNSITGELETASYRTSKSAWLKDEEHEIVHRINRRIDLMTNLEQETSEELQVG 416

Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
           NYGIGGHY+PH+DFAR  E NAF+SL TGNR+AT+LFYM+    GGATVFT +  ++ P 
Sbjct: 417 NYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVFTEVKTTVMPS 476

Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
           K  A FW+NL  SG+GD  TRHAACPVL GS  + +            PCGL R ++
Sbjct: 477 KNDALFWYNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQEFRRPCGLSRSVE 533


>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 541

 Score =  276 bits (706), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 143/297 (48%), Positives = 188/297 (63%), Gaps = 17/297 (5%)

Query: 2   IFPTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPT--LEVTEREKYEMLCRGD 55
           I P H RA+ N  +Y++ L     K  + +   P V N  P   L+ TE + YE LCR +
Sbjct: 237 IDPNHPRARNNIKWYEDLLAEDGVKPIDYRRNIPPVTNPRPKNGLKTTEHDMYEALCRNE 296

Query: 56  LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
           + V     ++L C Y   + P+LRL P K E     P  + +RDV+ D E+ +I+ +A P
Sbjct: 297 VPVSVKATSKLYC-YYKMDRPFLRLAPFKVEILRFSPLAVFFRDVITDEEVTIIQMLATP 355

Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
           RLRRATVQN  TGELE A+YR SKSAWL++ EH ++ RI+RR++ MT L   T+EELQV 
Sbjct: 356 RLRRATVQNSITGELETASYRTSKSAWLKDEEHEIVHRINRRIDLMTNLEQETSEELQVG 415

Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
           NYGIGGHY+PH+DFAR  E NAF+SL TGNR+AT+LFYM+    GGATVFT +  ++ P 
Sbjct: 416 NYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVFTEVKTTVMPS 475

Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
           K  A FW+NL  SG+GD  TRHAACPVL GS  + +            PCGL R ++
Sbjct: 476 KNDALFWYNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQEFRRPCGLSRSVE 532


>gi|226874889|ref|NP_001152881.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Bos
           taurus]
 gi|296485624|tpg|DAA27739.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Bos taurus]
          Length = 535

 Score =  276 bits (706), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 141/272 (51%), Positives = 186/272 (68%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++        + L+   E +    +     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLHYFERLLEEEREKMLSNHTEAELASQQGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|426229219|ref|XP_004008688.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Ovis aries]
          Length = 535

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 141/272 (51%), Positives = 185/272 (68%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++        + L    E +    +     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLHYFERLLEEEREKMLTNHTEAELAAQQGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 595

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 152/330 (46%), Positives = 199/330 (60%), Gaps = 58/330 (17%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKD-------------EPPKVNNVA----PTLEVT--- 43
           P HQRA+GN  Y++  L K  + KD             EP           P  + T   
Sbjct: 264 PEHQRAKGNLKYFEFQLEK--QRKDAEEEPPKETEKRVEPDTTEKKKRKKKPQSKATFQL 321

Query: 44  --EREKYEMLCRGD-LTVPPAIVAQLKCRYVH-RNVPYLRLMPLKEEEAYLQPRIILYRD 99
             ER+KYEMLCRG+ + + P   ++L CRY   +  P   L P+K+++ + +P I+ Y D
Sbjct: 322 IPERKKYEMLCRGEGIRLTPRRQSRLFCRYYDSKRHPRYILSPVKQQDEWDRPYIVRYLD 381

Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK-------------------- 139
           ++ D EI+L+K++A+PRLRRAT+ N  TG LE A+YRISK                    
Sbjct: 382 IISDKEIELVKQLAKPRLRRATISNPITGVLETASYRISKRRATVHDPQTGKLTTAQYRV 441

Query: 140 --SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANA 197
             SAWL   EHPVIE I++R+E +TGL   TAEELQV NYG+GG YEPH+DF R  E +A
Sbjct: 442 SKSAWLTGYEHPVIETINQRIEDLTGLEVDTAEELQVANYGVGGQYEPHFDFGRKDEPDA 501

Query: 198 FKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRH 257
           FK LGTGNR+AT LFYMSDVA GGATVF  +  ++WP+KG+A FW+NL +SG+GDY TRH
Sbjct: 502 FKELGTGNRIATWLFYMSDVAAGGATVFPDVGAAVWPQKGSAVFWYNLFTSGEGDYSTRH 561

Query: 258 AACPVLTGSNSLHSTC----------PCGL 277
           AACPVL G+  + +            PCGL
Sbjct: 562 AACPVLVGNKWVSNKWIHERGQEWRRPCGL 591


>gi|281348666|gb|EFB24250.1| hypothetical protein PANDA_000722 [Ailuropoda melanoleuca]
          Length = 505

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 140/272 (51%), Positives = 184/272 (67%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++        + L+   E      +     P   + ER+ YE LCRG+
Sbjct: 227 PSHERAGGNLRYFERLLEEEREKMLSNQTEAGLATQEGIYERPVDYLPERDIYESLCRGE 286

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 287 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 346

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT  TAE LQ
Sbjct: 347 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 406

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 407 VANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 466

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 467 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 498


>gi|301754231|ref|XP_002912939.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Ailuropoda
           melanoleuca]
          Length = 535

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 140/272 (51%), Positives = 184/272 (67%), Gaps = 10/272 (3%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++        + L+   E      +     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKMLSNQTEAGLATQEGIYERPVDYLPERDIYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Sarcophilus harrisii]
          Length = 536

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 142/293 (48%), Positives = 190/293 (64%), Gaps = 21/293 (7%)

Query: 5   THQRAQGNKLYYQEAL---------NKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           +H+RA GN  Y+++ L         NK+ E +          P   + ER+ YE LCRG+
Sbjct: 239 SHERAGGNLRYFEKLLEEERLGKRLNKTSETQPATQGGIYERPPDYLPERDVYEALCRGE 298

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY   N  P L + P KEE+ +  P I+ Y DV+ D EI+ IK++A
Sbjct: 299 GIKLTPRRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVLSDEEIERIKELA 358

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +ANYR+SKS+WL E + PVI +++RR+ ++TGL+  TAE LQ
Sbjct: 359 KPKLARATVRDPKTGVLTVANYRVSKSSWLEEGDDPVIAQLNRRMHYITGLSVKTAELLQ 418

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R GE +AFK LGTGNRVAT L YMSDV  GGATVF     ++W
Sbjct: 419 VANYGMGGQYEPHFDFSRKGEQDAFKHLGTGNRVATFLNYMSDVEAGGATVFPDFGATIW 478

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCG 276
           P+KGT+ FW+NL  SG+GDY TRHAACPVL GS  + +            PCG
Sbjct: 479 PKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFHERGQEFLRPCG 531


>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
           carolinensis]
          Length = 554

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 140/275 (50%), Positives = 188/275 (68%), Gaps = 14/275 (5%)

Query: 4   PTHQRAQGNKLYYQE---------ALNKSPELKDEPPKVNNV--APTLEVTEREKYEMLC 52
           P+H+RA  N  Y+++         AL+ +P    EP   N +   P   + ERE YE LC
Sbjct: 255 PSHERAGSNMQYFEKLLENEQNEKALDDAPNAT-EPSTYNGIYERPPDYLPEREIYEALC 313

Query: 53  RGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
           RG+ + + P    +L CRY + N  P+L + P KEE+ +  P I+ Y +V+ D EI+ IK
Sbjct: 314 RGEGVKMTPRRQKRLFCRYHNGNQNPHLLIAPFKEEDEWDSPHIVRYYNVLSDEEIEKIK 373

Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
           ++A+P+L RATV++ KTG L +ANYR+SKS+WL E +  V+ ++++R+EH+TGLT  TAE
Sbjct: 374 ELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEEDDLVVAKVNQRMEHITGLTVKTAE 433

Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
            LQV NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF     
Sbjct: 434 LLQVANYGMGGQYEPHFDFSRKEEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDFGA 493

Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 494 AIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 528


>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
          Length = 538

 Score =  273 bits (698), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 140/276 (50%), Positives = 184/276 (66%), Gaps = 19/276 (6%)

Query: 5   THQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT-------------EREKYEML 51
           TH+RA  N  Y+++ L K    + E P    VA T  V              ER+ YE L
Sbjct: 242 THERAGSNLRYFEKLLEK----EREKPSNKTVATTEPVVQSGAYERPLDYLPERDIYEAL 297

Query: 52  CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
           CRG+ + + P    +L CRY   N  P+L + P KEE+ +  P I+ Y DVM D EI+ I
Sbjct: 298 CRGEGVKMTPQRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSDEEIEKI 357

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ ++++R++ +TGLT  TA
Sbjct: 358 KQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTA 417

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           E LQV NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF    
Sbjct: 418 ELLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDFG 477

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 AAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 513


>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
          Length = 550

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 143/296 (48%), Positives = 190/296 (64%), Gaps = 19/296 (6%)

Query: 4   PTHQRAQGNKLYYQ-----EALNKSPELKDEPPKVNNVAPT--LEVTEREKYEMLCRGDL 56
           P H  A+GN  +Y+     E +  S   K+ PP + N  P   LE +ER  YE LCR ++
Sbjct: 236 PYHPHARGNVKWYEDLLVEEGVKPSDHRKNIPP-LENRRPDDGLEDSERTIYEALCRNEV 294

Query: 57  TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
            V    ++QL C Y   + P+LRL P K E     P  +L+ D++ D E  +I+++A PR
Sbjct: 295 PVSIKAISQLYC-YYKMDRPFLRLAPFKVEILRFNPLAVLFVDIISDEEAKMIQQIATPR 353

Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
           L+RATVQN KTGELE A YRISKSAWL+  +H +I+RI+RR+E MT L   T+EELQ+ N
Sbjct: 354 LKRATVQNSKTGELETAAYRISKSAWLKGGDHELIDRINRRIELMTNLIQETSEELQIAN 413

Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
           YG+GGHY+PH+DFAR  E  AF+SLGTGNR+ATVLFY+++   GG TVFT L  ++ P K
Sbjct: 414 YGVGGHYDPHFDFARKEEPKAFESLGTGNRLATVLFYLTEPEIGGGTVFTELRTAVMPSK 473

Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
             A FW+NL+ SG+GD  TRHAACPVL G   + +            PCGL+  +Q
Sbjct: 474 NGALFWYNLYRSGEGDLRTRHAACPVLVGIKWVANKWIHERGQEFLRPCGLKPSVQ 529


>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 511

 Score =  272 bits (695), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 135/275 (49%), Positives = 189/275 (68%), Gaps = 11/275 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL--KDEPPKVNNVAPTLEVTEREKYEMLCRGD-LTVPP 60
           PTHQRA GN+ Y++  L K  ++  +++  +     P    +E++KYE LCRG+ L + P
Sbjct: 215 PTHQRATGNRRYFEYQLAKQTKVGKREKGRRQEEHQPDDYQSEKKKYEQLCRGEGLRMTP 274

Query: 61  AIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
              + L CRY      P   + P+K+E+ +  PRI+ Y DV+ + E++ +K++A+PRLRR
Sbjct: 275 QRQSGLFCRYYDNGRHPKYVIGPVKQEDEWDHPRIVRYHDVLSNREMEKVKELARPRLRR 334

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
           ATV + +TG+L  A YR+SKSAWL   EHP++++I++R+E +TGL  STAE+LQV NYG+
Sbjct: 335 ATVHDPRTGQLTTAPYRVSKSAWLGAFEHPIVDQINQRIEDITGLDVSTAEDLQVANYGV 394

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY-------MSDVAQGGATVFTSLNLSL 232
           GG YEPH+DF +  E +AF+ LGTGNR+AT L Y       MSDV  GGATVFT +  S+
Sbjct: 395 GGQYEPHFDFGQKDEPDAFEELGTGNRIATWLLYVSAAVLRMSDVQAGGATVFTDIGASV 454

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            P+KG+A FW+NL  SGDGDY TRHAACPVL G+ 
Sbjct: 455 LPQKGSAVFWYNLRPSGDGDYRTRHAACPVLLGNK 489


>gi|345326417|ref|XP_001510155.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
           [Ornithorhynchus anatinus]
          Length = 888

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 137/274 (50%), Positives = 186/274 (67%), Gaps = 15/274 (5%)

Query: 5   THQRAQGNKLYYQEALNKSPELKDEPPKVNNVA-----------PTLEVTEREKYEMLCR 53
           +H+RA GN  Y+++ L +  E  ++P    + +           P   + ER+ YE LCR
Sbjct: 591 SHERAGGNLRYFEKLLEE--ERMEKPLNRTSASKPATHGGIYERPPDYLPERDVYEGLCR 648

Query: 54  GD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P L + P KEE+ +  P I+ Y DV+ D EI+ IK+
Sbjct: 649 GEGVKLTPRRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVLSDEEIEKIKE 708

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+P+L RATV++ KTG L +ANYR+SKS+WL E + PV+ +++RR++++TGLT  TAE 
Sbjct: 709 LAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEEDDPVVAQVNRRMQYITGLTVKTAEL 768

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF     +
Sbjct: 769 LQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDFGAA 828

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 829 IWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 862


>gi|170649696|gb|ACB21278.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Callicebus moloch]
          Length = 555

 Score =  270 bits (691), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 143/292 (48%), Positives = 189/292 (64%), Gaps = 30/292 (10%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++ L +  E  L ++       P+     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
           V NYG+GG YEPH+DF+R                      E +AFK LGTGNRVAT L Y
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERDAFKHLGTGNRVATFLNY 477

Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           MSDV  GGATVF  L  ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529


>gi|308451420|ref|XP_003088665.1| CRE-PHY-2 protein [Caenorhabditis remanei]
 gi|308246199|gb|EFO90151.1| CRE-PHY-2 protein [Caenorhabditis remanei]
          Length = 609

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 154/352 (43%), Positives = 199/352 (56%), Gaps = 70/352 (19%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
           I P H RA+GN  +Y++ L     + D PP VN       + ER+ YE LCRG+  +PP 
Sbjct: 251 IAPNHPRAKGNVKWYEDMLQGKDMVGDLPPIVNKRVEFDGIVERDAYEALCRGE--IPPV 308

Query: 62  ---IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
                 +L+C Y+ R+ P+L++ P+K E     P  +L+++V+ DSEI +IK++A P+L+
Sbjct: 309 EKKWKNKLRC-YLKRDKPFLKIAPIKVEILRFDPLAVLFKNVISDSEIKVIKELASPKLK 367

Query: 119 RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYG 178
           RATVQN KTGELE A YRISKSAWL+   HPVIER++RR+E  TGL   T+EELQV NYG
Sbjct: 368 RATVQNSKTGELEHATYRISKSAWLKGDLHPVIERVNRRIEDFTGLYQGTSEELQVANYG 427

Query: 179 IGGH------------------YEPHYDFARPG--------------------------- 193
           +GGH                  YEPHYD +  G                           
Sbjct: 428 LGGHYDPHFDFARIANYGLGGHYEPHYDMSLVGYHPIQLTVSLEYFQRGVPEPYGKNGNR 487

Query: 194 ---------EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
                    E NAFK+L TGNR+ATVLFYMS   +GGATVF  L  +++P K  A FW+N
Sbjct: 488 IATVLFYKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDALFWYN 547

Query: 245 LHSSGDGDYYTRHAACPVLTG----SNS-LHS-----TCPCGLRRGLQRSGI 286
           L   G+GD  TRHAACPVL G    SN  +H      T PCGL  G+Q + I
Sbjct: 548 LRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQEFTRPCGLEEGVQENFI 599


>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
          Length = 528

 Score =  268 bits (685), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 130/272 (47%), Positives = 180/272 (66%), Gaps = 7/272 (2%)

Query: 2   IFPTHQRAQGN-----KLYYQEALNKSPELKDEPPKVNNVAPT--LEVTEREKYEMLCRG 54
           I P HQ+A  N     KL  Q+ +N          K+N    T  L     + YE LCRG
Sbjct: 232 IVPYHQQALDNIKHYQKLLLQQGVNTEERFNTTKLKLNKSTGTFGLRRDHWDNYEKLCRG 291

Query: 55  DLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQ 114
           +  + P +  +L+CRYV  NVP+  + P+K EEA L+P +++Y  V++D+EID++KK+AQ
Sbjct: 292 EKLLDPKVEGRLRCRYVTNNVPFFFIQPVKMEEALLKPLLVIYHGVIFDAEIDVVKKLAQ 351

Query: 115 PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQV 174
           PR +R  V +  TG      YRI+K+A+L++ EH +I ++SRRV  +TGL  + +E+LQV
Sbjct: 352 PRFKRTGVTDRDTGRSMPVQYRIAKAAFLKDSEHNLIVKMSRRVGDITGLDMAASEDLQV 411

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
            NYGIGGHY PH+D+AR GE +  + L  GNR+AT LFYMSDV  GGATVF ++  +LWP
Sbjct: 412 CNYGIGGHYVPHFDYARQGEIHGPRDLDWGNRIATWLFYMSDVEAGGATVFPAVGAALWP 471

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +KG+AAFW+NL  +G+GD  T HA CPVLTGS
Sbjct: 472 QKGSAAFWYNLRPNGNGDEDTLHAGCPVLTGS 503


>gi|281183175|ref|NP_001162504.1| prolyl 4-hydroxylase subunit alpha-2 [Papio anubis]
 gi|159461520|gb|ABW96795.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase, alpha
           polypeptide II, isoform 1 (predicted) [Papio anubis]
          Length = 578

 Score =  268 bits (684), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 141/292 (48%), Positives = 186/292 (63%), Gaps = 30/292 (10%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L+   E +   P+     P   + ER+ YE LCRG+
Sbjct: 261 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 320

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 321 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 380

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 381 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 440

Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
           V NYG+GG YEPH+DF+R                      E + FK LGTGNRVAT L Y
Sbjct: 441 VANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERHTFKHLGTGNRVATFLNY 500

Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           MSDV  GGATVF  L  ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 501 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 552


>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
          Length = 538

 Score =  268 bits (684), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 137/275 (49%), Positives = 184/275 (66%), Gaps = 14/275 (5%)

Query: 5   THQRAQGNKLYYQEALN---------KSPELKDEPPKVNNVA---PTLEVTEREKYEMLC 52
           TH+RA  N  Y+++ L           +  +    P V + A   P   + ER+ YE LC
Sbjct: 238 THERAGSNLRYFEKLLEKEREKEQEKSNKTMTTTEPVVQSGAYERPLDYLPERDIYEALC 297

Query: 53  RGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
           RG+ + + P    +L CRY   N  P+L + P KEE+ +  P I+ Y DVM D EI+ IK
Sbjct: 298 RGEGVKMTPRRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSDEEIEKIK 357

Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
           ++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ ++++R++ +TGLT  TAE
Sbjct: 358 QLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTAE 417

Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
            LQV NYG+GG YEPH+DF+R  E +AFK LGTGNRVAT L YMSDV  GGATVF     
Sbjct: 418 LLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDFGA 477

Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 AIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 512


>gi|167045848|gb|ABZ10515.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Callithrix jacchus]
          Length = 555

 Score =  268 bits (684), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 142/292 (48%), Positives = 188/292 (64%), Gaps = 30/292 (10%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++ L +  E  L ++       P+     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N    L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRASQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
           V NYG+GG YEPH+DF+R                      E +AFK LGTGNRVAT L Y
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERDAFKHLGTGNRVATFLNY 477

Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           MSDV  GGATVF  L  ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529


>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
          Length = 244

 Score =  267 bits (683), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 126/219 (57%), Positives = 165/219 (75%), Gaps = 2/219 (0%)

Query: 50  MLCRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           MLCRG+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+
Sbjct: 1   MLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIE 60

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
           ++K +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  S
Sbjct: 61  IVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVS 120

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
           TAEELQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  
Sbjct: 121 TAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPE 180

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +  S+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 181 VGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 219


>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
           (Silurana) tropicalis]
          Length = 526

 Score =  267 bits (683), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 131/225 (58%), Positives = 165/225 (73%), Gaps = 2/225 (0%)

Query: 44  EREKYEMLCRGD-LTVPPAIVAQLKCRYVH-RNVPYLRLMPLKEEEAYLQPRIILYRDVM 101
           E+EKYE LCRG+ + +      +L CRY   +  P L L P K+E+ + +PRI+ Y D++
Sbjct: 277 EKEKYEKLCRGEGVKMTSRRQKRLFCRYFDGKKDPLLILSPTKQEDEWDKPRIVRYHDII 336

Query: 102 YDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHM 161
            D EI  +K++A+PRLRRAT+ N  TG LE A YRI+KSAWL   E PV+ R++RR+E +
Sbjct: 337 SDEEISKVKELAKPRLRRATISNPITGVLETAQYRITKSAWLSGYEDPVVARLNRRIEGV 396

Query: 162 TGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
           TGL  STAEELQV NYGIGG YEPH+DF R  E +AFK LGTGNRVAT LFYMSDV  GG
Sbjct: 397 TGLDMSTAEELQVANYGIGGQYEPHFDFLRKYEPDAFKKLGTGNRVATWLFYMSDVEAGG 456

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           ATVF  +  +++P+KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 457 ATVFPEVGAAVYPKKGTAVFWYNLLESGEGDYSTRHAACPVLVGN 501


>gi|431892682|gb|ELK03115.1| Prolyl 4-hydroxylase subunit alpha-2 [Pteropus alecto]
          Length = 629

 Score =  267 bits (683), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 141/292 (48%), Positives = 187/292 (64%), Gaps = 30/292 (10%)

Query: 4   PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L +      S + + E    + +   P   + ER+ YE LCRG+
Sbjct: 244 PSHERAGGNLRYFERLLEEERDKMVSNQTEAELATQDGIYERPVDYLPERDVYESLCRGE 303

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 304 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEINRIKEIA 363

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 364 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 423

Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
           V NYG+GG YEPH+DF+R                      E + FK LGTGNRVAT L Y
Sbjct: 424 VANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDEQDVFKHLGTGNRVATFLNY 483

Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           MSDV  GGATVF  L  ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 484 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 535


>gi|390459659|ref|XP_002806656.2| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-2 [Callithrix jacchus]
          Length = 579

 Score =  267 bits (682), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 141/292 (48%), Positives = 185/292 (63%), Gaps = 30/292 (10%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L+   E +   P+     P   + ER+ YE LCRG+
Sbjct: 262 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 321

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N    L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 322 GVKLTPRRQKRLFCRYHHGNRASQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 381

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 382 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 441

Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
           V NYG+GG YEPH+DF+R                      E +AFK LGTGNRVAT L Y
Sbjct: 442 VANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERDAFKHLGTGNRVATFLNY 501

Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           MSDV  GGATVF  L  ++WP+KGTA FW+NL  SG GDY TRHAACPVL G
Sbjct: 502 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGXGDYRTRHAACPVLVG 553


>gi|197215651|gb|ACH53042.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Otolemur garnettii]
          Length = 555

 Score =  266 bits (680), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 141/292 (48%), Positives = 184/292 (63%), Gaps = 30/292 (10%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVA--------PTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L +  E          +A        P   + ERE YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEHLLEEEREKMLSNKTEAELATQEGIYERPVDYLPEREVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGL+  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
           V NYG+GG YEPH+DF+R                      E +AFK LGTGNRVAT L Y
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVATFLNYNHERDAFKRLGTGNRVATFLNY 477

Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           MSDV  GGATVF  L  ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529


>gi|308497208|ref|XP_003110791.1| CRE-DPY-18 protein [Caenorhabditis remanei]
 gi|308242671|gb|EFO86623.1| CRE-DPY-18 protein [Caenorhabditis remanei]
          Length = 559

 Score =  266 bits (680), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 137/291 (47%), Positives = 189/291 (64%), Gaps = 17/291 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKS----PELKDEPPKVNNVAP--TLEVTEREKYEMLCRGDLT 57
           P+H RA+GN  +Y++ L +      E++   P++ N  P   L  TER  YE LCR ++ 
Sbjct: 235 PSHPRAKGNVKWYEDLLEQEGVRRSEMRKNLPEIQNRRPDSVLGNTERTMYEALCRNEVP 294

Query: 58  VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           V    +++L C Y  R+ P+L   P+K E     P  +L++DV+ D E+  I+++A+P+L
Sbjct: 295 VSQKDISRLYC-YYKRDRPFLVYAPIKVEIKRFNPLAVLFKDVISDDEVATIQELAKPKL 353

Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
            RATV +  TG+L  A YRISKSAWL+E EH V+ER+++R+E MT L   TAEELQ+ NY
Sbjct: 354 ARATVHDSATGKLVTATYRISKSAWLKEWEHEVVERVNKRIELMTNLEMETAEELQIANY 413

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGGHY+PH+D A+  E+ +F+SLGTGNR+ATVLFYMS  + GG TVFT +  ++ P K 
Sbjct: 414 GIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEVKSTVLPTKN 473

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHSTC-----PCGLR 278
            A FW+NL   GDG+  TRHAACPVL G    SN  +H        PCGL+
Sbjct: 474 DALFWYNLFKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFRRPCGLK 524


>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
          Length = 515

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 128/265 (48%), Positives = 176/265 (66%), Gaps = 12/265 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
           I P H RA  NK YY++      EL++   K       +E  E+E Y+ LCR ++++P A
Sbjct: 243 ILPYHSRALRNKFYYEQ------ELQNPVDKTKKDQDHVEDVEKEVYKKLCRAEISLPEA 296

Query: 62  IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
             ++LKC Y + N P+LR+ P K E+A+L P I+++ +V+ D EI+ +K++AQ RL  A 
Sbjct: 297 KSSKLKCFYQNSNHPFLRIAPFKVEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAV 356

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
            +N  + +LE+  +RISK AWL + EH  +  +++RV HMTGLT STAEE QVVNYGIGG
Sbjct: 357 FENPHSKQLELFPFRISKVAWLEDQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGG 416

Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
           HYEPH+DF    +         G+R+ TVLFY+SDV QGGATVF  + +S+WP+KG+A  
Sbjct: 417 HYEPHFDFQSTVDP------AIGSRIETVLFYLSDVEQGGATVFPEIQVSVWPQKGSAVV 470

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
           W NLH SGDGD  T+HA CPVL GS
Sbjct: 471 WFNLHPSGDGDQRTKHAGCPVLIGS 495


>gi|432109537|gb|ELK33711.1| Prolyl 4-hydroxylase subunit alpha-2 [Myotis davidii]
          Length = 555

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 140/292 (47%), Positives = 182/292 (62%), Gaps = 30/292 (10%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++ L +          E    P       P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEERGKMASNQTEAGQAPQDSIYERPADYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY   N  P L + P KEE+ +  P I+ Y DVM D EI  IK++A
Sbjct: 298 GVKLTPKRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIQRIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
           V NYG+GG YEPH+DF+R                      E + FK LGTGNRVAT L Y
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDEQDVFKHLGTGNRVATFLNY 477

Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           MSDV  GGATVF  L  ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529


>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
          Length = 509

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 128/265 (48%), Positives = 176/265 (66%), Gaps = 12/265 (4%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
           I P H RA  NK YY++      EL++   K       +E  E+E Y+ LCR ++++P A
Sbjct: 237 ILPYHSRALRNKFYYEQ------ELQNPVDKTKKDQDHVEDVEKEVYKKLCRAEISLPEA 290

Query: 62  IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
             ++LKC Y + N P+LR+ P K E+A+L P I+++ +V+ D EI+ +K++AQ RL  A 
Sbjct: 291 KSSKLKCFYQNSNHPFLRIAPFKVEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAV 350

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
            +N  + +LE+  +RISK AWL + EH  +  +++RV HMTGLT STAEE QVVNYGIGG
Sbjct: 351 FENPHSKQLELFPFRISKVAWLEDQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGG 410

Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
           HYEPH+DF    +         G+R+ TVLFY+SDV QGGATVF  + +S+WP+KG+A  
Sbjct: 411 HYEPHFDFQSTVDP------AIGSRIETVLFYLSDVEQGGATVFPEIQVSVWPQKGSAVV 464

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
           W NLH SGDGD  T+HA CPVL GS
Sbjct: 465 WFNLHPSGDGDQRTKHAGCPVLIGS 489


>gi|291387302|ref|XP_002710242.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 2 [Oryctolagus
           cuniculus]
 gi|217273039|gb|ACK28132.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Oryctolagus cuniculus]
          Length = 555

 Score =  265 bits (678), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 144/293 (49%), Positives = 186/293 (63%), Gaps = 32/293 (10%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE---------VTEREKYEMLCRG 54
           P+H+RA GN  Y++  L +    K    +   VA T E         + ER+ YE LCRG
Sbjct: 238 PSHERAGGNLRYFERLLEEQ-RGKSLLNQTEAVAVTQEGIYERPVDYLPERDVYESLCRG 296

Query: 55  D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++
Sbjct: 297 EGVKLTPRRQKRLFCRYHDGNGAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEI 356

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ RI+RR++H+TGLT  TAE L
Sbjct: 357 AKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARINRRMQHITGLTVKTAELL 416

Query: 173 QVVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLF 212
           QV NYG+GG YEPH+DF+R                      E +AFK LGTGNRVAT L 
Sbjct: 417 QVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNNERDAFKRLGTGNRVATFLN 476

Query: 213 YMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           YMSDV  GGATVF  L  ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 477 YMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529


>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
           abelii]
 gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 533

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 135/272 (49%), Positives = 182/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L+   E +   P+     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|184185444|gb|ACC68850.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Rhinolophus ferrumequinum]
          Length = 555

 Score =  263 bits (671), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 141/292 (48%), Positives = 186/292 (63%), Gaps = 30/292 (10%)

Query: 4   PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L +      S + + EP     +   P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKIVSNQTEAEPASQEGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIEKIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E E PV+ R++ R++H+TGL+  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEETEDPVVARLNLRMQHITGLSVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATVLFY 213
           V NYG+GG YEPH+DF+R                      E + FK LGTGNRVAT L Y
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDNGLKTEGNRLATFLNYNDEHDVFKHLGTGNRVATFLNY 477

Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           MSDV  GGATVF  L  ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 MSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529


>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
           boliviensis boliviensis]
 gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
           boliviensis boliviensis]
 gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
           boliviensis boliviensis]
          Length = 533

 Score =  263 bits (671), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 136/272 (50%), Positives = 184/272 (67%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++ L +  E  L ++       P+     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|229368743|gb|ACQ63024.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Dasypus novemcinctus]
          Length = 556

 Score =  263 bits (671), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 142/295 (48%), Positives = 187/295 (63%), Gaps = 35/295 (11%)

Query: 4   PTHQRAQGNKLYYQEAL---------NKSPELKDEPPKVNNV--APTLEVTEREKYEMLC 52
           P+H+RA GN  Y++  L         N++ E   EP     +   P   + ER+ YE LC
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKLLSNQTTEA--EPTTQEGIYERPADYLPERDVYESLC 295

Query: 53  RGD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
           RG+ + + P    +L CRY H N  P L + P KEE+ +  P I+ Y D+M D EI+ IK
Sbjct: 296 RGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDIMSDEEIERIK 355

Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
           ++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ +++RR+EH+TGLT  TAE
Sbjct: 356 EIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEENDDPVVAQVNRRMEHITGLTVKTAE 415

Query: 171 ELQVVNYGIGGHYEPHYDFAR--------------------PGEANAFKSLGTGNRVATV 210
            LQV NYG+GG YEPH+DF+R                      E + FK LGTGNRVAT 
Sbjct: 416 LLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNHEQDVFKHLGTGNRVATF 475

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           L YMSDV  GGATVF  L  ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 LNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 530


>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
           troglodytes]
 gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
           troglodytes]
 gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
           troglodytes]
 gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
           paniscus]
 gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
           paniscus]
 gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
           paniscus]
 gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
          Length = 533

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 135/272 (49%), Positives = 182/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L+   E +   P+     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDIYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
 gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
 gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
          Length = 533

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 136/272 (50%), Positives = 184/272 (67%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++ L +  E  L ++       P+     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
           gorilla]
          Length = 565

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 135/272 (49%), Positives = 182/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L+   E +   P+     P   + ER+ YE LCRG+
Sbjct: 270 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 329

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 330 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 389

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 390 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 449

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 450 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 507

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 508 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 539


>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 575

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 135/272 (49%), Positives = 182/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L+   E +   P+     P   + ER+ YE LCRG+
Sbjct: 280 PSHERAGGNLRYFEQLLEEEREKTLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 339

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 340 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 399

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 400 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 459

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 460 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 517

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 518 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 549


>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
 gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
 gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
 gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
 gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
          Length = 533

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 135/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L    E +   P+     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
           leucogenys]
 gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
           leucogenys]
          Length = 535

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 136/272 (50%), Positives = 184/272 (67%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++ L +  E  L ++       P+     P   + ER+ YE LCRG+
Sbjct: 240 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 299

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 300 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 359

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 360 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 419

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 420 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
           musculus]
          Length = 593

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 137/272 (50%), Positives = 183/272 (67%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L +      S +         N+   PT  + ER+ YE LCRG+
Sbjct: 298 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLATQENLYERPTDYLPERDVYESLCRGE 357

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 358 GVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 417

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 418 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 477

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 478 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 535

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 536 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 567


>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_c
           [Homo sapiens]
          Length = 565

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 135/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L    E +   P+     P   + ER+ YE LCRG+
Sbjct: 270 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 329

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 330 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 389

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 390 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 449

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 450 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 507

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 508 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 539


>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
           leucogenys]
          Length = 556

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 136/272 (50%), Positives = 184/272 (67%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE--LKDEP------PKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++ L +  E  L ++       P+     P   + ER+ YE LCRG+
Sbjct: 261 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 320

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 321 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 380

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 381 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 440

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 441 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 498

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 499 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 530


>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
          Length = 504

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 135/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L    E +   P+     P   + ER+ YE LCRG+
Sbjct: 209 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 268

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 269 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 328

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 329 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 388

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 389 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 446

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 447 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 478


>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
           niloticus]
          Length = 594

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 131/275 (47%), Positives = 185/275 (67%), Gaps = 12/275 (4%)

Query: 4   PTHQRAQGNKLYYQEAL----------NKSPELKDEPPKVNNVAPTLEVTEREKYEMLCR 53
           PTHQRA GN  Y++  L              + ++   +    +    +TER+KYE LCR
Sbjct: 295 PTHQRANGNLKYFEYQLAKQKKVEKVEKVEEKEEETKVRQRRESKDDYLTERKKYEQLCR 354

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G  + + P   ++L CRY   N  P   + P+K+E+ +  P I+ Y +++ + +++ +K+
Sbjct: 355 GQGIKLTPRRQSRLFCRYYDNNRHPRYVIGPVKQEDEWDSPHIVRYHNIVSEKDMEKVKE 414

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG LE A+YRISKSAWL   EHPV+++I++ +E +TGL   TAE+
Sbjct: 415 LAKPRLRRATISNPVTGVLETAHYRISKSAWLGAYEHPVVDKINQLIEDVTGLNVKTAED 474

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DF R  E +AF+ LGTGNR+AT L YM+DV  GGATVFT +  +
Sbjct: 475 LQVANYGLGGQYEPHFDFGRKDEPDAFEELGTGNRIATWLLYMTDVQAGGATVFTDIGAA 534

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + P+KGTA FW+NL+ SG+GDY TRHAACPVL G+
Sbjct: 535 VKPKKGTAVFWYNLYPSGEGDYRTRHAACPVLLGN 569


>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
           musculus]
 gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
 gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
 gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
          Length = 535

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 137/272 (50%), Positives = 183/272 (67%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L +      S +         N+   PT  + ER+ YE LCRG+
Sbjct: 240 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLATQENLYERPTDYLPERDVYESLCRGE 299

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 300 GVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 359

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 360 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 419

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 420 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
          Length = 533

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 135/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L    E +   P+     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLPIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_d
           [Homo sapiens]
          Length = 488

 Score =  261 bits (668), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 135/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQE--------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++         L    E +   P+     P   + ER+ YE LCRG+
Sbjct: 193 PSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCRGE 252

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 253 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 312

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 313 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 372

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 373 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 430

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 431 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 462


>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
           catus]
 gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
           catus]
          Length = 533

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 135/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVA--------PTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y+++ L +  E          +A        P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFEQLLEEEREKMLSNQTEAGLATQESIYERPVDYLPERDIYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|268572523|ref|XP_002641343.1| C. briggsae CBR-DPY-18 protein [Caenorhabditis briggsae]
 gi|94442971|emb|CAJ98658.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 559

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 135/291 (46%), Positives = 187/291 (64%), Gaps = 17/291 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKS----PELKDEPPKVNNVAP--TLEVTEREKYEMLCRGDLT 57
           P H RA+GN  +Y++ L +      E++   P + N  P   L  TER  YE LCR ++ 
Sbjct: 235 PAHPRAKGNIKWYEDLLEQEGVRRSEMRKSIPPIQNRRPDSVLGNTERTMYEALCRNEVP 294

Query: 58  VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           V    +++L C Y  R+ P+L   P+K E     P  +L++DV+ D E+  I+++A+P+L
Sbjct: 295 VSQKDISKLYC-YYKRDRPFLIYAPIKVEIKRFNPLAVLFKDVISDEEVATIQELAKPKL 353

Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
            RATV +  TG+L  A YRISKSAWL+  EH V+ER+++R++ MT L   TAEELQ+ NY
Sbjct: 354 ARATVHDSVTGKLVTATYRISKSAWLKAWEHEVVERVNKRIDLMTNLEMETAEELQIANY 413

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGGHY+PH+D A+  E+ +F+SLGTGNR+ATVLFYMS  + GG TVFT +  ++ P K 
Sbjct: 414 GIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEVKSTVLPTKN 473

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHSTC-----PCGLR 278
            A FW+NL+  GDG+  TRHAACPVL G    SN  +H        PCGL+
Sbjct: 474 DALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFRRPCGLK 524


>gi|37496185|emb|CAE47803.1| Prolyl 4-hydroxylase alpha subunit [Sus scrofa]
          Length = 263

 Score =  261 bits (666), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 131/257 (50%), Positives = 176/257 (68%), Gaps = 11/257 (4%)

Query: 6   HQRAQGNKLYYQEALNKSPEL-KDEPPKVNNVAPTLE--------VTEREKYEMLCRGD- 55
           HQRA GN  Y++  + K  E  K  P   +N   TL+        + E +KYEMLCRG+ 
Sbjct: 7   HQRANGNLKYFEYIMAKEKEANKSAPDDQSNQKTTLKKKGVAVDYLPEGQKYEMLCRGEG 66

Query: 56  LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQ 114
           + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EID++K +A+
Sbjct: 67  IKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIDIVKDLAK 126

Query: 115 PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQV 174
           PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ R++ R++ +TGL  STAEELQV
Sbjct: 127 PRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRLNMRIQDLTGLDVSTAEELQV 186

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
            NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+WP
Sbjct: 187 ANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVWP 246

Query: 235 EKGTAAFWHNLHSSGDG 251
           +KGTA FW+NL + G+G
Sbjct: 247 KKGTAVFWYNLFAGGEG 263


>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
          Length = 522

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 134/269 (49%), Positives = 175/269 (65%), Gaps = 34/269 (12%)

Query: 44  EREKYEMLCRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVM 101
           E+ KYE LCRG+ L + P     L CRY + N  P+  + P+K+E+ + +PRII Y +++
Sbjct: 251 EKRKYEKLCRGEGLKMTPRRQKHLFCRYFNGNRHPFYTIGPVKQEDEWDRPRIIRYHEII 310

Query: 102 YDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK---------------------- 139
            + EI+ IK++++PRLRRAT+ N  TG LE A+YRISK                      
Sbjct: 311 TEQEIEKIKELSKPRLRRATISNPITGVLETAHYRISKRRATVHDPQTGKLTTAQYRVSK 370

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
           SAWL   EHPV++RI++R+E +TGL   TAEELQV NYG+GG YEPH+DF R  E +AFK
Sbjct: 371 SAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFK 430

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAA 259
            LGTGNR+AT LFYMSDVA GGATVF  +  ++ P KGTA FW+NL  SG+GDY TRHAA
Sbjct: 431 ELGTGNRIATWLFYMSDVAAGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAA 490

Query: 260 CPVLTGSNSLHSTC----------PCGLR 278
           CPVL G+  + +            PCGL+
Sbjct: 491 CPVLVGNKWVSNKWIHERGQEFRRPCGLK 519


>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Loxodonta africana]
          Length = 534

 Score =  260 bits (664), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 134/273 (49%), Positives = 182/273 (66%), Gaps = 13/273 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE-------LKDEPPKVNNV--APTLEVTEREKYEMLCRG 54
           P+H+RA GN  Y++  L +  +       +  EP     +   P   + ER+ YE LCRG
Sbjct: 238 PSHERAGGNLRYFEHLLEEERKKTLSNQTMDAEPATREGIYERPVDYLPERDVYESLCRG 297

Query: 55  D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++
Sbjct: 298 EGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKQI 357

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ +++RR++H+TGLT  TAE L
Sbjct: 358 AKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAQVNRRMQHITGLTVKTAELL 417

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++
Sbjct: 418 QVANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAI 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 508


>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
           garnettii]
          Length = 538

 Score =  260 bits (664), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 135/272 (49%), Positives = 179/272 (65%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVA--------PTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L +  E          +A        P   + ERE YE LCRG+
Sbjct: 243 PSHERAGGNLRYFEHLLEEEREKMLSNKTEAELATQEGIYERPVDYLPEREVYESLCRGE 302

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 303 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 362

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGL+  TAE LQ
Sbjct: 363 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQ 422

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNRVAT L YMSDV  GGATVF  L  ++W
Sbjct: 423 VANYGVGGQYEPHFDFSRRPFDSGLKT--EGNRVATFLNYMSDVEAGGATVFPDLGAAIW 480

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 481 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 512


>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Cavia porcellus]
          Length = 533

 Score =  260 bits (664), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 136/272 (50%), Positives = 181/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQEALN--KSPELKDEPPKVNNVA------PTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L   +   L ++   V          P+  + ERE YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEERGKLLSNQTEAVLAAQEGIYERPSDYLPEREVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GIKLTPQRRKRLFCRYHHGNRAPELLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++ +TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEEDDPVVARVNRRMQQITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  +LW
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAALW 475

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
           latipes]
          Length = 532

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 144/300 (48%), Positives = 190/300 (63%), Gaps = 29/300 (9%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKD-----EPPKVNNVA------PTLEVTEREKYEM 50
           I P HQRA GN  Y+++ L K  +LK+     +PP    +       P   + ERE YE 
Sbjct: 234 IDPNHQRAGGNLRYFEQLLMK--QLKESNQDYQPPSEEPIQLGTYTRPKDHLPERETYEA 291

Query: 51  LCRGD-LTVPPAIVAQLKCRYVH-RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           LCRG+ L +  A  ++L CRY   +  P L L P+KEE+ +  P I+ Y +++ D EI+ 
Sbjct: 292 LCRGEGLQLTEARRSRLFCRYHDGKRSPRLLLKPIKEEDEWDNPHIVRYLNILSDQEIEK 351

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK++A+PRL RATV++ KTG L  A YR+SKSAWL   + PVI+R+++R++ +TGLT  T
Sbjct: 352 IKELAKPRLARATVRDPKTGVLTTAPYRVSKSAWLEGEDDPVIDRVNQRIQDITGLTVET 411

Query: 169 AEELQVVNYGIGGHYEPHYDFA-RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
           AE LQV NYG+GG YEPH+DF+ RP ++N       GNR+AT L YMSDV  GGATVF  
Sbjct: 412 AELLQVANYGVGGQYEPHFDFSRRPFDSNLKVD---GNRLATFLNYMSDVEAGGATVFPD 468

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
              S+WP KGTA FW+NL  SG+GDY TRHAACPVL GS  + +            PCGL
Sbjct: 469 FGASIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIHERGQEFRRPCGL 528


>gi|17552840|ref|NP_499464.1| Protein DPY-18 [Caenorhabditis elegans]
 gi|20455505|sp|Q10576.2|P4HA1_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; AltName: Full=Protein dumpy-18; Flags:
           Precursor
 gi|3881011|emb|CAA21045.1| Protein DPY-18 [Caenorhabditis elegans]
 gi|6900013|emb|CAB71298.1| prolyl 4-hydroxylase alpha subunit 1 [Caenorhabditis elegans]
          Length = 559

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 138/292 (47%), Positives = 187/292 (64%), Gaps = 19/292 (6%)

Query: 4   PTHQRAQGNKLYY-----QEALNKSPELKDEPPKVNNVAP--TLEVTEREKYEMLCRGDL 56
           PTH RA+GN  +Y     QE + +S   K+ PP + N  P   L  TER  YE LCR ++
Sbjct: 235 PTHPRAKGNVKWYEDLLEQEGVRRSDMRKNLPP-IQNRRPDSVLGNTERTMYEALCRNEV 293

Query: 57  TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
            V    +++L C Y  R+ P+L   P+K E     P  +L++DV+ D E+  I+++A+P+
Sbjct: 294 PVSQKDISRLYC-YYKRDRPFLVYAPIKVEIKRFNPLAVLFKDVISDDEVAAIQELAKPK 352

Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
           L RATV +  TG+L  A YRISKSAWL+E E  V+E +++R+ +MT L   TAEELQ+ N
Sbjct: 353 LARATVHDSVTGKLVTATYRISKSAWLKEWEGDVVETVNKRIGYMTNLEMETAEELQIAN 412

Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
           YGIGGHY+PH+D A+  E+ +F+SLGTGNR+ATVLFYMS  + GG TVFT    ++ P K
Sbjct: 413 YGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEAKSTILPTK 472

Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHSTC-----PCGLR 278
             A FW+NL+  GDG+  TRHAACPVL G    SN  +H        PCGL+
Sbjct: 473 NDALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFRRPCGLK 524


>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
          Length = 235

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 131/217 (60%), Positives = 160/217 (73%), Gaps = 5/217 (2%)

Query: 74  NVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIA 133
           N P+LRL P++ E  Y  P II++ DV+ D EID IK++AQPR RRATV +  TGEL  A
Sbjct: 4   NHPFLRLAPVRMEYLYRNPDIIVFNDVLSDYEIDYIKRIAQPRFRRATVHDPATGELVPA 63

Query: 134 NYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG 193
           +YRISKSAWL++ E  V+ R+SRRV  +TGL+ +TAEELQVVNYGIGGHY+PH+DFAR  
Sbjct: 64  HYRISKSAWLKDEESAVVARVSRRVADITGLSMTTAEELQVVNYGIGGHYDPHFDFARK- 122

Query: 194 EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
           E NAF+    GNR+ATVLFYMSDVAQGGATVFT L LS++P +G+A FW NLH SG+GD 
Sbjct: 123 EENAFEKF-NGNRIATVLFYMSDVAQGGATVFTELGLSVFPRRGSAVFWLNLHPSGEGDL 181

Query: 254 YTRHAACPVLTGSNSLHSTCPCGLRRGLQRSGIICTL 290
            TRHAACPVL GS  +   C   + +G Q     C L
Sbjct: 182 ATRHAACPVLRGSKWV---CNKWIHQGGQELIRPCNL 215


>gi|390363005|ref|XP_797519.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like
           [Strongylocentrotus purpuratus]
          Length = 579

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 135/289 (46%), Positives = 180/289 (62%), Gaps = 26/289 (8%)

Query: 4   PTHQRAQGNKLYYQEAL----------NKSPELKDEPPKVNNVAPTLEVT---------E 44
           P H+RA+ NK+++   L           +  E+ D+  ++      LE           E
Sbjct: 266 PKHERAKNNKIFFMSELEEKEIKEKPRGEDAEIDDKTGEIVKTQEELEKEKAEQAYSYPE 325

Query: 45  REKYEMLCRGDLTVPPAIVAQL-KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           R++YE LCRGD      +  +L KC+Y H N P+L L P KEE  +  PR++ YR+++ D
Sbjct: 326 RKQYEALCRGDPGALKVVDHRLLKCQYQHYNHPFLYLQPAKEEVIFDDPRLVFYRNILND 385

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            EI  +K++A PRL+RAT+QN  TG LE A+YRISKSAW+++ E  +I  I  RV+  TG
Sbjct: 386 KEIAFVKRLASPRLQRATIQNAITGNLEFADYRISKSAWVKQEEDQLIRSIRFRVQAYTG 445

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
           L   TAE+LQVVNYGIGGHYEPH+DFAR  E NAF+SLGTGNR+AT LFY+S      ++
Sbjct: 446 LELDTAEDLQVVNYGIGGHYEPHFDFARAEETNAFQSLGTGNRIATALFYVSITCPDMSS 505

Query: 224 VFTSLN------LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            +   +      LSL    GTA FW+NL  SG G+Y TRHAACPVL+GS
Sbjct: 506 TYEPRDEIRNGFLSLVYPSGTAVFWYNLRKSGQGNYDTRHAACPVLSGS 554


>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 1 [Oryctolagus
           cuniculus]
          Length = 533

 Score =  257 bits (657), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 137/273 (50%), Positives = 181/273 (66%), Gaps = 14/273 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE---------VTEREKYEMLCRG 54
           P+H+RA GN  Y++  L +    K    +   VA T E         + ER+ YE LCRG
Sbjct: 238 PSHERAGGNLRYFERLLEEQ-RGKSLLNQTEAVAVTQEGIYERPVDYLPERDVYESLCRG 296

Query: 55  D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++
Sbjct: 297 EGVKLTPRRQKRLFCRYHDGNGAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEI 356

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ RI+RR++H+TGLT  TAE L
Sbjct: 357 AKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARINRRMQHITGLTVKTAELL 416

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++
Sbjct: 417 QVANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAI 474

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 475 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|339236271|ref|XP_003379690.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
 gi|316977627|gb|EFV60702.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
          Length = 558

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 134/291 (46%), Positives = 180/291 (61%), Gaps = 27/291 (9%)

Query: 2   IFPTHQRAQGNKLYYQEALN-----KSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDL 56
           + P H RA GN  YYQ+ L+     +  + K  PP  N     L + ER+ YE LCR + 
Sbjct: 242 VDPNHPRASGNLKYYQDLLDPEGKPRKIDPKKLPPPTNRRPDDLSIPERDVYEGLCRSEY 301

Query: 57  TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
            +     A+L C Y  RN PYL+L P+K E  + +P+I+ +R V+ D EI +IK++A P 
Sbjct: 302 PISDKDRAKLYC-YYKRNRPYLKLAPIKVEVMHWKPKIVYFRGVISDEEIAVIKQLASPL 360

Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
           L+RATV N  TG+LE A+YRISKSAWL++ EH V++RIS R++ MT LT  TAE LQ+ N
Sbjct: 361 LKRATVHNADTGQLETASYRISKSAWLKDTEHEVVKRISDRIDMMTDLTMETAELLQIAN 420

Query: 177 YGIGGHYEPHYDFARPGEAN---------------------AFKSLGTGNRVATVLFYMS 215
           YGIGGHY+PH+D +  GE++                     +F+SL  GNR+ATVLFY+S
Sbjct: 421 YGIGGHYDPHFDMSTRGESDPYEEGTGNRIATVLFYTNDPYSFESLNAGNRIATVLFYIS 480

Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
               GG TVFTS  +++ P K  AAFW N+   G+ D  TRHAACPVL G+
Sbjct: 481 QPEAGGGTVFTSHKITVEPSKYDAAFWFNVLQGGEPDMSTRHAACPVLAGT 531


>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
           gallopavo]
          Length = 535

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 131/273 (47%), Positives = 179/273 (65%), Gaps = 14/273 (5%)

Query: 5   THQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE----------VTEREKYEMLCRG 54
           TH+RA  N  Y+++ L K  E       V    P ++          + ER+ YE LCRG
Sbjct: 239 THERAGSNLRYFEKLLEKEREKSSXNKTVATTEPVVQSGAYERPLDYLPERDIYEALCRG 298

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY + N  P+L + P KEE+ +  P I+ Y DVM D EI+ IK++
Sbjct: 299 EGVKMTPRRQKRLFCRYHNGNRNPHLVIAPFKEEDEWDSPHIVRYYDVMSDEEIEKIKQL 358

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ ++++R++ +TGLT  TAE L
Sbjct: 359 AKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELL 418

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DF+R    +  KS   GNR+AT L YMSDV  GGATVF     ++
Sbjct: 419 QVANYGMGGQYEPHFDFSRRPFDSTLKS--EGNRLATFLNYMSDVEAGGATVFPDFGAAI 476

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 477 WPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 509


>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Cricetulus griseus]
          Length = 533

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 134/272 (49%), Positives = 180/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++        ++L    E      +     P   + ER+  E LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKSLFNQTEAGLATQENVYERPVDFLPERDVLESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPQRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
          Length = 535

 Score =  255 bits (652), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 134/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++        + L+   E +    +     P   + ER+ YE LCRG+
Sbjct: 240 PSHERAGGNLHYFERLLEEEREKMLSNHTEAELASQQGIYERPVDYLPERDVYESLCRGE 299

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 300 GVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 359

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT  TAE LQ
Sbjct: 360 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 419

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 420 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 478 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 509


>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
           taurus]
 gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
          Length = 533

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 134/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++        + L+   E +    +     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLHYFERLLEEEREKMLSNHTEAELASQQGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
 gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
          Length = 541

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 139/302 (46%), Positives = 187/302 (61%), Gaps = 38/302 (12%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK-DEPPKVNNVAPT---LEVTERE---------KY 48
           I PTH RA  N  Y+   + K  + + D        +P+    ++ ERE          Y
Sbjct: 207 IDPTHTRATDNVAYFGSEIAKQTKKRGDTGTSRRTKSPSSTFKKLKEREYFHRTKAFQNY 266

Query: 49  EMLCRGDL-TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           E LCRG++ ++      Q+ C  + R+ P   L P + E  +++P ++++R+ + DSEI 
Sbjct: 267 EKLCRGEVRSLTKWEQGQMSCWQI-RDDPLTVLKPGRIERVFVKPEVLIFRNFITDSEIK 325

Query: 108 LIKKMAQPRLRRATVQ----------NYK------------TGELEIANYRISKSAWLRE 145
            IK++A PRL+RATV+          NY+            TG+LE ANYRISKS WLR+
Sbjct: 326 RIKELATPRLKRATVKDPVTGELIFANYRISKRRATIQHPVTGKLEFANYRISKSGWLRD 385

Query: 146 PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
            E  +++RIS RV+  +GL  +T+E+LQVVNYGIGGHYEPHYDFAR GE + F SLGTGN
Sbjct: 386 EEDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHYDFARDGE-DKFTSLGTGN 444

Query: 206 RVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           R+AT L Y+SDV  GG TVFT +  ++WP+KG AAFW+NL  SGDGD  TRHAACPVL G
Sbjct: 445 RIATFLSYLSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVG 504

Query: 266 SN 267
           S 
Sbjct: 505 SK 506


>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
          Length = 487

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 134/272 (49%), Positives = 181/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++        + L+   E +    +     P   + ER+ YE LCRG+
Sbjct: 192 PSHERAGGNLHYFERLLEEEREKMLSNHTEAELASQQGIYERPVDYLPERDVYESLCRGE 251

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 252 GVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 311

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT  TAE LQ
Sbjct: 312 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 371

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 372 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 429

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 430 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 461


>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Ovis aries]
          Length = 487

 Score =  254 bits (650), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 134/272 (49%), Positives = 180/272 (66%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++        + L    E +    +     P   + ER+ YE LCRG+
Sbjct: 192 PSHERAGGNLHYFERLLEEEREKMLTNHTEAELAAQQGIYERPVDYLPERDVYESLCRGE 251

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 252 GVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 311

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT  TAE LQ
Sbjct: 312 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 371

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 372 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 429

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 430 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 461


>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
 gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
          Length = 534

 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 134/276 (48%), Positives = 179/276 (64%), Gaps = 21/276 (7%)

Query: 5   THQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT-------------EREKYEML 51
           TH+RA  N  Y+++ L K    + E P    VA T  V              ER+ YE L
Sbjct: 239 THERAGSNLRYFEKLLEK----EREKPSNKTVATTEPVVQSGAYERPLDYLPERDIYEAL 294

Query: 52  CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
           CRG+ + + P    +L CRY   N  P+L + P KEE+ +  P I+ Y DVM D EI+ I
Sbjct: 295 CRGEGVKMTPRRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSDEEIEKI 354

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ ++++R++ +TGLT  TA
Sbjct: 355 KQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTA 414

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           E LQV NYG+GG YEPH+DF+R    +  KS   GNR+AT L YMSDV  GGATVF    
Sbjct: 415 ELLQVANYGMGGQYEPHFDFSRRPFDSTLKS--EGNRLATFLNYMSDVEAGGATVFPDFG 472

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 473 AAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 508


>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
           lupus familiaris]
          Length = 533

 Score =  252 bits (644), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 133/272 (48%), Positives = 178/272 (65%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++        + L    E      +     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKMLLNQTEAGLATQESIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
           domestica]
          Length = 534

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 129/275 (46%), Positives = 180/275 (65%), Gaps = 13/275 (4%)

Query: 4   PTHQRAQGNKLYYQE---------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRG 54
           P H+RA GN  Y+++          LNK+ E +          P+  + ERE YE LCRG
Sbjct: 238 PNHERAGGNLRYFEKLIEEERLGKTLNKTSETEPATQGAFYQRPSDYLPEREVYEALCRG 297

Query: 55  D-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P L + P KEE+ +  P I+ Y DV+ D EI+ IK++
Sbjct: 298 EGIKLTPQRRKRLFCRYHDSNKTPQLLIAPFKEEDEWDSPHIVRYYDVLSDEEIEKIKEI 357

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           ++P+L RATV++ KTG L + +YRISKS+WL+E + P+I +++RR++++TGL+  TAE L
Sbjct: 358 SKPKLSRATVRDPKTGHLIVVSYRISKSSWLKEDDDPIIAQVNRRMQYITGLSVKTAELL 417

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QV NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF     ++
Sbjct: 418 QVSNYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDFGAAI 475

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           WP+KGT+ FW+NL  SG+ DY TRHAACPVL GS 
Sbjct: 476 WPKKGTSVFWYNLFRSGECDYRTRHAACPVLVGSK 510


>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
          Length = 532

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 133/272 (48%), Positives = 178/272 (65%), Gaps = 12/272 (4%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++        + L    E      +     P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKMLLNQTEAGLATQESIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++ R++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  ++W
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGAAIW 475

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           P+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 476 PKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
           araneus]
          Length = 533

 Score =  251 bits (642), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 134/275 (48%), Positives = 179/275 (65%), Gaps = 18/275 (6%)

Query: 4   PTHQRAQGNKLYY---------QEALNKSPELKDEPPKVNNV--APTLEVTEREKYEMLC 52
           P+H+RA GN  Y+         +  LN++     EP         P+  + ER+ YE LC
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKTVLNQTGA---EPATQEGFYERPSDYLPERDVYESLC 294

Query: 53  RGD-LTVPPAIVAQLKCRYVH-RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
           RG+ + + P    +L CRY H    P L + P KEE+ +  P I+ Y DVM D EI+ IK
Sbjct: 295 RGEGVKLTPRRQKRLFCRYHHGHGAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIK 354

Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
           ++A+P+L RATV++ KTG L  A+YR+SKS+WL E + PV+ R++ R++H+TGLT  TAE
Sbjct: 355 EIAKPKLARATVRDPKTGVLTTASYRVSKSSWLEETDDPVVARVNLRMQHITGLTVKTAE 414

Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
            LQV NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF  L  
Sbjct: 415 LLQVANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDLGA 472

Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 473 AIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 507


>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
           guttata]
          Length = 539

 Score =  251 bits (641), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 135/277 (48%), Positives = 181/277 (65%), Gaps = 18/277 (6%)

Query: 5   THQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT--------------EREKYEM 50
           TH+RA  N  Y+++ L K  E K++   +N    T E                ER+ YE 
Sbjct: 239 THERAGSNLRYFEKLLEKEREEKEKENSMNKTVTTTEAVVQSGAYERPLDYLPERDIYEA 298

Query: 51  LCRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           LCRG+ + + P    +L CRY   N  P+L + P KEE+ +  P I+ Y DVM D EI+ 
Sbjct: 299 LCRGEGVKMTPRRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSDEEIEK 358

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK++A+PRL RATV++ KTG L +A+YR+SKS+WL E + PV+ ++++R++H+TGLT  T
Sbjct: 359 IKQLAKPRLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQHITGLTVKT 418

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AE LQV NYG+GG YEPH+DF+R    +  KS   GNR+AT L YMSDV  GGATVF   
Sbjct: 419 AELLQVANYGMGGQYEPHFDFSRRPFDSTLKS--EGNRLATFLNYMSDVEAGGATVFPDF 476

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
             ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 477 GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 513


>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           laevis]
 gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
          Length = 533

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 142/295 (48%), Positives = 187/295 (63%), Gaps = 26/295 (8%)

Query: 5   THQRAQGNKLYYQE---------ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
            H+RA  N  Y+++           N++ E +   P V N  P   + ER+ YE LCRG+
Sbjct: 239 NHERAGSNLKYFEKMQERQKGELKQNETIETETRQPGVYN-RPLDYLPERDVYEALCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY   N  P L L P+K E+ +  PRI+ Y DV+ D EI+ IK++A
Sbjct: 298 GVKMNPRRQKRLFCRYHDGNRNPRLILGPIKMEDEWDSPRIVRYLDVLSDEEIEKIKELA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRL RATV++ KTG L +ANYR+SKSAWL E + PVI R++ R++ +TGLT  TAE LQ
Sbjct: 358 KPRLARATVRDPKTGVLTVANYRVSKSAWLEEYDDPVIGRVNSRMQAITGLTKDTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFA-RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           V NYG+GG YEPH+DF+ RP ++N  K+   GNR+AT L YMSDV  GGATVF     ++
Sbjct: 418 VANYGMGGQYEPHFDFSRRPFDSN-LKT--EGNRLATYLNYMSDVEAGGATVFPDFGAAI 474

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
           WP KGTA FW+NL  SG+GDY TRHAACPVL GS  + +            PCGL
Sbjct: 475 WPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFHERGQEFLRPCGL 529


>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
          Length = 532

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 135/280 (48%), Positives = 174/280 (62%), Gaps = 17/280 (6%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVT---------------ERE 46
           + P H RAQ N  +Y + + K  + +    K ++ A    +                E +
Sbjct: 232 LVPEHTRAQNNLNHYNQLIAKEEQEEGVRKKGDDGALKDAIMNDRFLNEEDQYRASPEFQ 291

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
            YE LCRG+  VP     +L C+Y   + P   + PL+EE A L+P I +Y  +M D EI
Sbjct: 292 TYEALCRGEDVVPVKDPHKLTCQYRFWH-PMFYINPLREETASLEPWIAVYHQLMNDHEI 350

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
           + IK+MA PRL RATV N  TG+LE A YRISKS WLR+ E P+I RIS R   +T L+ 
Sbjct: 351 ERIKEMATPRLARATVHNSATGQLEHAKYRISKSGWLRDEEDPLIARISERCSALTNLSL 410

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
           +T EELQVVNYGIGG YEPH+DF+R  E  AF+    GNR+ TV++YM+DV  GGATVF 
Sbjct: 411 TTVEELQVVNYGIGGQYEPHFDFSRRSEPTAFEKW-RGNRILTVIYYMTDVEAGGATVFL 469

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
              + ++PEKG+AA WHNL  SG+GD  TRHAACPVLTGS
Sbjct: 470 DAGVKVYPEKGSAAVWHNLLPSGEGDMRTRHAACPVLTGS 509


>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Sarcophilus harrisii]
          Length = 534

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 134/293 (45%), Positives = 184/293 (62%), Gaps = 23/293 (7%)

Query: 5   THQRAQGNKLYYQEAL---------NKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           +H+RA GN  Y+++ L         NK+ E +          P   + ER+ YE LCRG+
Sbjct: 239 SHERAGGNLRYFEKLLEEERLGKRLNKTSETQPATQGGIYERPPDYLPERDVYEALCRGE 298

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY   N  P L + P KEE+ +  P I+ Y DV+ D EI+ IK++A
Sbjct: 299 GIKLTPRRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVLSDEEIERIKELA 358

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +ANYR+SKS+WL E + PVI +++RR+ ++TGL+  TAE LQ
Sbjct: 359 KPKLARATVRDPKTGVLTVANYRVSKSSWLEEGDDPVIAQLNRRMHYITGLSVKTAELLQ 418

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGATVF     ++W
Sbjct: 419 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGATVFPDFGATIW 476

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCG 276
           P+KGT+ FW+NL  SG+GDY TRHAACPVL GS  + +            PCG
Sbjct: 477 PKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFHERGQEFLRPCG 529


>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
          Length = 533

 Score =  250 bits (638), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 132/278 (47%), Positives = 178/278 (64%), Gaps = 15/278 (5%)

Query: 1   MIFPTHQRAQGNKLYYQEALNKSPELKD---------EPPKVNNV--APTLEVTEREKYE 49
           ++ P H+RA  N  Y+++ L    E            +P   N +   P   + ERE YE
Sbjct: 232 ILDPGHERAGSNMQYFEKLLESEKESNQINKLSVNPSDPKTYNGIYERPQDYLPERETYE 291

Query: 50  MLCRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
            LCRG+ + + P     L CRY + N  P+L + P KEE+ +  P I+ Y +V+ D EI+
Sbjct: 292 ALCRGEGVKLTPRRQKGLFCRYHNGNRNPHLIIAPFKEEDEWDSPHIVRYYEVLSDEEIE 351

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
            IK++A+P+L RATV++ KTG L +ANYR+SKS+WL E +  V+ R++ R+E +TGLTT 
Sbjct: 352 KIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEEDDLVVARVNHRMEQITGLTTK 411

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
           TAE LQV NYG+GG YEPH+DF+R       K+   GNR+AT L YMSDV  GGATVF  
Sbjct: 412 TAELLQVANYGMGGQYEPHFDFSRRPFDITLKT--EGNRLATFLNYMSDVEAGGATVFPD 469

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
              ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 470 FGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVG 507


>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           (Silurana) tropicalis]
 gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
          Length = 527

 Score =  249 bits (637), Expect = 9e-64,   Method: Compositional matrix adjust.
 Identities = 134/276 (48%), Positives = 179/276 (64%), Gaps = 16/276 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE----------VTEREKYEMLCR 53
           P H RA  N  Y+++   K      +    N  + T +          + ER+ YE LCR
Sbjct: 238 PNHDRAVNNLKYFEKMQEKQKAELKQNESTNTESATRQPGVYSRPLDYLPERDVYEALCR 297

Query: 54  GD-LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY + N  PYL L P+K E+ +  PRI+ Y + + D EI  IK+
Sbjct: 298 GEGVKMNPRRQRRLFCRYHNGNRSPYLILSPVKVEDEWDSPRIVRYLNALSDEEIAKIKE 357

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+P+L RATV++ KTG L +ANYR+SKSAWL E + PVI R++ R++ +TGLT  TAE 
Sbjct: 358 LAKPKLARATVRDPKTGVLSVANYRVSKSAWLEENDDPVIARVNLRMQAITGLTVDTAEL 417

Query: 172 LQVVNYGIGGHYEPHYDFA-RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
           LQV NYG+GG YEPH+DF+ RP ++N  K+   GNR+AT L YMSDV  GGATVF     
Sbjct: 418 LQVANYGMGGQYEPHFDFSRRPFDSN-LKT--DGNRLATFLNYMSDVEAGGATVFPDFGA 474

Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           ++WP+KGTA FW+NL  SG+GDY TRHAACPVL GS
Sbjct: 475 AIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGS 510


>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
           queenslandica]
          Length = 525

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 136/292 (46%), Positives = 185/292 (63%), Gaps = 22/292 (7%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPK-VNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
           P+H+RA  N+ Y+          ++EP K V++     + +E   YE LCR    +P  +
Sbjct: 242 PSHERAISNREYFNRVS------REEPDKFVDHEGVLDDESEHAVYEKLCREPAPIPSHL 295

Query: 63  VAQLKCRYVH-RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
             +L C Y + +  P L L P+K E A+++P+I ++ D++ D EI+ +K++A P+L RAT
Sbjct: 296 HKKLICYYFNNKRNPRLILSPIKTEVAFVKPKIYIFYDIVTDREIERLKELANPKLNRAT 355

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPV--IERISRRVEHMTGLTTSTAEELQVVNYGI 179
           V   + GEL  A YRISKS WL   + P+  ++RI +R+E +TGLT STAE+LQVVNYGI
Sbjct: 356 VHG-ENGELLHATYRISKSGWLSGSDDPLGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGI 414

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GG YEPHYDFAR GE + F SLG+GNR++T+L YMSDV +GGATVF  +   L P K  A
Sbjct: 415 GGQYEPHYDFARTGE-DTFTSLGSGNRISTLLIYMSDVEKGGATVFPGVGARLVPIKRAA 473

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGL 281
           A+W NL  SGDGDY TRHA CPVL GS  + +            PCGL R +
Sbjct: 474 AYWWNLKRSGDGDYSTRHAGCPVLVGSKWVCNKWIHERGQEFRRPCGLSRDV 525


>gi|297301157|ref|XP_001103971.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Macaca
           mulatta]
          Length = 512

 Score =  247 bits (630), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 129/275 (46%), Positives = 174/275 (63%), Gaps = 35/275 (12%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR                      MSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFAR----------------------MSDVSAGGATVFPEVGAS 452

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 453 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 487


>gi|324510827|gb|ADY44523.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
          Length = 551

 Score =  247 bits (630), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 135/295 (45%), Positives = 182/295 (61%), Gaps = 17/295 (5%)

Query: 4   PTHQRAQGNKLYYQ-----EALNKSPELKDEPPKVN-NVAPTLEVTEREKYEMLCRGDLT 57
           P H RA+GN  +Y+     E + +    ++ PP +N      LE TER+ +E LCR ++ 
Sbjct: 236 PNHPRAKGNLKWYEDLLEDEGVRRVDMRRNIPPLLNPRHDGGLEHTERDIFEALCRHEVP 295

Query: 58  VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           V    +++L C Y   + PYLRL P+K E   L P  +L+  +M D E  +I+ +A P+L
Sbjct: 296 VSTKALSRLYC-YYKMDRPYLRLAPIKVEIMRLNPLAVLFHQIMSDEEAHIIEMLAIPKL 354

Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
            RATVQN  TG LE A+YRISKSAWL+  EH V++R ++R++  T L   TAEELQ+ NY
Sbjct: 355 NRATVQNAMTGGLETASYRISKSAWLKPHEHEVVDRFNKRLDMATNLEMETAEELQIQNY 414

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           G+GGHY+PH+D AR  E NAFK LGTGNRVAT+L YM++   GG TVFT +  S+   K 
Sbjct: 415 GVGGHYDPHFDCARKEEKNAFKELGTGNRVATILVYMTEPEIGGGTVFTEVKTSVACTKN 474

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
            A FW+NL  SG+ D  +RHAACPVLTG   + +            PCGL +  Q
Sbjct: 475 AALFWYNLLRSGEVDMRSRHAACPVLTGVKWVTNKWIHERGQEWRRPCGLNQFDQ 529


>gi|326923465|ref|XP_003207956.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 3
           [Meleagris gallopavo]
          Length = 518

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 131/273 (47%), Positives = 175/273 (64%), Gaps = 29/273 (10%)

Query: 4   PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
           P HQRA GN  Y++       EA   S + +D+  K         + ER KYEMLCRG+ 
Sbjct: 240 PEHQRANGNMKYFEYIMAKEKEANKSSTDAEDQTEKETEFKKKDYLPERRKYEMLCRGEG 299

Query: 56  LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
           L + P    +L CRY   +RN  Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 300 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 358

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRL RATV + +TG+L  A+YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEELQ
Sbjct: 359 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 418

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
                               E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+W
Sbjct: 419 ------------------KDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVW 460

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           P+KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 461 PKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGN 493


>gi|395820528|ref|XP_003783616.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Otolemur
           garnettii]
          Length = 516

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 130/275 (47%), Positives = 175/275 (63%), Gaps = 31/275 (11%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSSSDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQ                    E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQ------------------KDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 456

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 457 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 491


>gi|607947|gb|AAA62207.1| prolyl 4-hydroxylase alpha subunit [Caenorhabditis elegans]
          Length = 558

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 136/292 (46%), Positives = 184/292 (63%), Gaps = 20/292 (6%)

Query: 4   PTHQRAQGNKLYY-----QEALNKSPELKDEPPKVNNVAP--TLEVTEREKYEMLCRGDL 56
           PTH RA+GN  +Y     QE + +S   K+ PP + N  P   L  TER  YE LCR ++
Sbjct: 235 PTHPRAKGNVKWYEDLLEQEGVRRSDMRKNLPP-IQNRRPDSVLGNTERTMYEALCRNEV 293

Query: 57  TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
            V    + +L C Y+     +L   P+K E     P  +L++DV+ D E+  I+++A+P+
Sbjct: 294 PVSRRHL-RLYCYYL-AGPSFLVYAPIKVEIKRFNPLAVLFKDVISDDEVAAIQELAKPK 351

Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
           L RATV +  TG+L  A YRISKSAWL+E E  V+E +++R+ +MT L   TAEELQ+ N
Sbjct: 352 LARATVHDSVTGKLVTATYRISKSAWLKEWEGDVVETVNKRIGYMTNLEMETAEELQIAN 411

Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
           YGIGGHY+PH+D A+  E+ +F+SLGTGNR+ATVLFYMS  + GG TVFT    ++ P K
Sbjct: 412 YGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEAKSTILPTK 471

Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHSTC-----PCGLR 278
             A FW+NL+  GDG+  TRHAACPVL G    SN  +H        PCGL+
Sbjct: 472 NDALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFRRPCGLK 523


>gi|217272851|ref|NP_001136068.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Homo
           sapiens]
 gi|114631189|ref|XP_001140871.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 10 [Pan
           troglodytes]
          Length = 516

 Score =  244 bits (622), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 130/275 (47%), Positives = 175/275 (63%), Gaps = 31/275 (11%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQ                    E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQ------------------KDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 456

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 457 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 491


>gi|350014318|dbj|GAA37183.1| prolyl 4-hydroxylase [Clonorchis sinensis]
          Length = 595

 Score =  244 bits (622), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 129/287 (44%), Positives = 180/287 (62%), Gaps = 14/287 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
           P H+++  N+ +Y+  L K   +    P    ++   E  E E Y+ LCRG+   PP   
Sbjct: 305 PEHEQSLSNEEFYRTRLQKGEGIIGPAPPPEKLSKLDE--ETEIYQALCRGEQLFPPPPD 362

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
            Q+ CRY   + PY ++ P+KEE  Y  PRI+++ DV++ SE+  I+++A PRLRRATV+
Sbjct: 363 DQVYCRYYIPH-PYYKIGPVKEEVLYPDPRIVMWYDVIHPSEVGRIQELALPRLRRATVK 421

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
           N  TG+LE A YR SKSAWL++    V  R+++R+  +TGL   TAE+LQV NYGIGG+Y
Sbjct: 422 NPVTGKLENAYYRTSKSAWLQDGLDEVTHRLNQRIHALTGLAMETAEDLQVGNYGIGGYY 481

Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
            PH+DF R  E +AF+ +  GNR+AT++FY++DV  GGATVF     S+ P +G A FW+
Sbjct: 482 APHFDFGRKREKDAFE-VENGNRIATIIFYLTDVKAGGATVFNRFGASVKPVRGAAGFWY 540

Query: 244 NLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRG 280
           NLH SG+GD  TRH ACPVL GS  + +            PC L RG
Sbjct: 541 NLHPSGEGDLRTRHVACPVLVGSKWVMNVWFHERGQEFRRPCELTRG 587


>gi|291404186|ref|XP_002718473.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 3
           [Oryctolagus cuniculus]
          Length = 516

 Score =  243 bits (620), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 130/275 (47%), Positives = 174/275 (63%), Gaps = 31/275 (11%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  +           K   P+   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSASDGQSDKKTTPRRKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQ                    E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQ------------------KDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 456

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 457 VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 491


>gi|344274276|ref|XP_003408943.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3
           [Loxodonta africana]
          Length = 516

 Score =  242 bits (618), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 128/274 (46%), Positives = 172/274 (62%), Gaps = 29/274 (10%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE----LKDEPPKVNNVAPTLEVT-----EREKYEMLCRG 54
           P HQRA GN  Y++  + K  +      D P    +      V      ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMTKEKDSNKSTSDAPSDQKSTVKKKGVAADYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRI+ + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHDIISDAEIEVVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           Q                    E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S+
Sbjct: 416 Q------------------KDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPDVGASV 457

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 458 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 491


>gi|426255748|ref|XP_004021510.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Ovis
           aries]
          Length = 516

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 128/274 (46%), Positives = 174/274 (63%), Gaps = 29/274 (10%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
           P HQRA GN  Y++  + K        S +  D+   +      ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSSSDDQSDQKTTLKKKGAAVDYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEEL 172
           A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEEL
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEEL 415

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           Q                    E +AFK LGTGNR+AT LFYMSDV  GGATVF  +  S+
Sbjct: 416 Q------------------KDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEVGASV 457

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 458 WPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 491


>gi|341884171|gb|EGT40106.1| CBN-PHY-2 protein [Caenorhabditis brenneri]
          Length = 607

 Score =  240 bits (613), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 143/367 (38%), Positives = 194/367 (52%), Gaps = 86/367 (23%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
           I P H RA+GN  +Y++ L     + D PP VN       + ER+ YE LCRG+  +PP 
Sbjct: 235 IAPNHPRAKGNVKWYEDMLQGKDMVGDLPPIVNKRVEFDGIVERDAYEALCRGE--IPPV 292

Query: 62  ---IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
                 +L+C Y+ R+ P+L++ P+K E     P  +L+++V+ DSEI++IK++A P+L 
Sbjct: 293 EEKWRNKLRC-YLKRDKPFLKIAPIKVEILRFDPLAVLFKNVISDSEIEVIKELASPKLE 351

Query: 119 RATVQNYKTGELEIANYRISK-------------------------------SAWLREPE 147
           RATV+    G L   +YRI+K                               SAWL+   
Sbjct: 352 RATVKG-PDGTLITVDYRIAKRLVNWNTLHIVSPKGGFPKSKKMKNKCLVGFSAWLKGDL 410

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG-------------- 193
            PVI+R++RR+E  TGL  +T+EELQV NYG+GGHY+PH+DFAR                
Sbjct: 411 DPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFARIANYGLGGHYEPHYDM 470

Query: 194 ------------------------EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
                                   E NAFK+L TGNR+ATVLFYMS    GGATVF  L 
Sbjct: 471 SLRGVPEPYGKNGNRIATVLFYKEEKNAFKTLNTGNRIATVLFYMSQPELGGATVFNHLG 530

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHS-----TCPCGLRR 279
            +++P K  A FW+NL   G+GD  TRHAACPVL G    SN  +H      T PCGL  
Sbjct: 531 TAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHEKGQEFTRPCGLEE 590

Query: 280 GLQRSGI 286
           G+Q + +
Sbjct: 591 GVQENFV 597


>gi|156352054|ref|XP_001622587.1| predicted protein [Nematostella vectensis]
 gi|156209158|gb|EDO30487.1| predicted protein [Nematostella vectensis]
          Length = 531

 Score =  240 bits (613), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 127/248 (51%), Positives = 157/248 (63%), Gaps = 17/248 (6%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           E YE LCRG         A+L+C Y     P  R+ PLK EE +  P I + RDVMYDSE
Sbjct: 281 EAYERLCRGISYRSNEEAAKLRCYYDFTRHPMFRIRPLKVEELHSDPPIWMLRDVMYDSE 340

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREP----EHPVIERISRRVEHM 161
           I+ IK+ A P+LRRATV N KTGELE A+YRISKS WL +P    E  ++ R++RR   +
Sbjct: 341 IEYIKRTATPKLRRATVTNLKTGELEFADYRISKSGWLEDPRDDNEEKILNRVNRRTSII 400

Query: 162 TGLTTS--TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
           TGL T+  +AE LQ+VNYG  GHYEPH+D A    ++  K LG GNR+ATVL+YMSDV  
Sbjct: 401 TGLDTTPRSAEALQIVNYGAAGHYEPHFDHATEAVSSILK-LGIGNRIATVLYYMSDVEA 459

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
           GGATVF      + P KG AAFW+NLH +G GD  TRHAACP++ GS  + +        
Sbjct: 460 GGATVFVDAEAIVKPSKGDAAFWYNLHKNGKGDERTRHAACPIIVGSKWVCNKWIHEHGQ 519

Query: 274 ----PCGL 277
               PCGL
Sbjct: 520 EFRRPCGL 527


>gi|431904119|gb|ELK09541.1| Prolyl 4-hydroxylase subunit alpha-1 [Pteropus alecto]
          Length = 507

 Score =  239 bits (611), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 124/245 (50%), Positives = 166/245 (67%), Gaps = 13/245 (5%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K        S +  D+   PK   VA    + ER+KYEMLCR
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDANKSTSDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 294

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 295 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 354

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 355 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 414

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 415 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 474

Query: 232 LWPEK 236
           +WP+K
Sbjct: 475 VWPKK 479


>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
 gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
          Length = 584

 Score =  239 bits (609), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 128/287 (44%), Positives = 177/287 (61%), Gaps = 14/287 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
           PT+ RA  N+ YY E +++        P+   ++   +  E E YE LCR +   P    
Sbjct: 294 PTNTRAINNEAYYVEQIDRGEGRIGPNPRSQAISKHDQ--ETELYESLCRNENPFPTVPS 351

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
             L CRY   +  + ++ P+KEE     PRI+++ D+++ SEI+ IK++A PRLRRATV+
Sbjct: 352 HHLTCRYYTPHA-FFKIGPVKEETLNPDPRIVMWYDLIFPSEIEKIKELATPRLRRATVK 410

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
           N  TG LEIA YR SKSAWL      + ++IS+R+  +TGL+  TAE+LQV NYG+GGHY
Sbjct: 411 NPVTGILEIAFYRTSKSAWLPHSMSEITDQISQRIRAVTGLSLETAEDLQVGNYGLGGHY 470

Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
            PH+DF R  E +AF+ +  GNR+AT++FY+SDV  GGATVF  +   + P+KG A FW 
Sbjct: 471 APHFDFGRKREKDAFE-VKNGNRIATIIFYLSDVQAGGATVFNRIGTRVVPKKGAAGFWF 529

Query: 244 NLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRG 280
           NL  +G+GD  TRHAACPVL GS  + +            PC L RG
Sbjct: 530 NLLPNGEGDLRTRHAACPVLAGSKWVMNLWFHERGQEFRRPCELERG 576


>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
          Length = 535

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 130/296 (43%), Positives = 171/296 (57%), Gaps = 25/296 (8%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKV------------NNVAPTLEVTER-EKYEM 50
           P H RAQ N  ++++A+ +  E   E  ++             +       TE  + YE 
Sbjct: 237 PEHSRAQSNLAHFEQAIKEKEEALAEESRIRVEREAFRNGRFEHDPDAYHATEFFQTYEA 296

Query: 51  LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
           LCRG+  +P     +L C+Y   + P   + PL+EE     P I +Y  +M D +ID IK
Sbjct: 297 LCRGEDVIPIKDAHKLTCQYRVWH-PMFTINPLREETMNFDPWIAVYHQLMSDKDIDDIK 355

Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
            +A PRL RATV N  TGELE A YRISKS WL++ EHP + +IS R   +T L+ ST E
Sbjct: 356 ALATPRLARATVVNSVTGELEFAKYRISKSGWLKDEEHPTVAKISNRCSALTNLSLSTVE 415

Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
           ELQ+ NYGIGGHYEPH+D++R  E  +F     GNR+ TV+FY+SDV  GG TVF +   
Sbjct: 416 ELQIANYGIGGHYEPHFDYSRLAEVTSFDHW-RGNRILTVIFYLSDVEAGGGTVFMTAGT 474

Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS----------TCPCG 276
            L PEKG AA W+NLH  G GD  T+HAACPVLTG+  + +          T PCG
Sbjct: 475 KLRPEKGAAAVWYNLHPDGTGDDETKHAACPVLTGNKWVANKWFHERGQEFTRPCG 530


>gi|55925444|ref|NP_001007286.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Danio rerio]
 gi|49900294|gb|AAH76508.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide 2 [Danio rerio]
 gi|182891794|gb|AAI65288.1| P4ha2 protein [Danio rerio]
          Length = 514

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 134/298 (44%), Positives = 174/298 (58%), Gaps = 44/298 (14%)

Query: 2   IFPTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNV--APTLEVTEREKYEML 51
           I P+HQRA GN  Y++  L+K         PE  DE P   +    P   + ERE YE L
Sbjct: 235 IDPSHQRAGGNLRYFERLLSKELQDSGQTQPEPADERPIQLDTYQRPKDYLPEREAYEAL 294

Query: 52  CRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
           CRG+ + +     ++L CRY   N  P L L P+KEE+ +  P I+ + + + D EI  I
Sbjct: 295 CRGEGVKMTTKRQSRLFCRYRDGNRNPRLLLKPMKEEDEWDSPHIVRFLEALSDEEIQKI 354

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K++A P+L RATV++ KTG L +A+YR+SKSAWL   + PVI R+++R+E +TGLT  TA
Sbjct: 355 KEIATPKLARATVRDPKTGVLTVAHYRVSKSAWLEGEDDPVIARVNQRIEDITGLTVDTA 414

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           E LQV NYG+GG YEPH+DF+R                      MSDV  GGATVF    
Sbjct: 415 ELLQVANYGVGGQYEPHFDFSR----------------------MSDVEAGGATVFPDFG 452

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
            S+WP KGTA FW+NL  SG+GDY TRHAACPVL GS  + +            PCGL
Sbjct: 453 ASVWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIHERGQEFRRPCGL 510


>gi|312092237|ref|XP_003147267.1| hypothetical protein LOAG_11701 [Loa loa]
          Length = 553

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 132/295 (44%), Positives = 179/295 (60%), Gaps = 19/295 (6%)

Query: 2   IFPTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
           I P H RA+GN  +Y++ L     +  +++ + P +NN  P  +    + Y+ LCR ++ 
Sbjct: 236 INPDHPRAKGNVRWYEDLLEDEGVRRADMRRKVPPINN--PRDKSDLNDTYQALCRQEMP 293

Query: 58  VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           V     ++L C Y   + PYLRL P+K E  Y  P  +L+ D+M D E  +I+ +A P+L
Sbjct: 294 VNIKAQSRLYC-YYKMDRPYLRLAPIKVEIVYQNPLAVLFHDIMSDEESRIIEMLAVPKL 352

Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
            RATV N +TG LE A+YRISKSAWLR  EH V+ RI+RR++  T L  +TAEELQV NY
Sbjct: 353 DRATVHNVETGNLETASYRISKSAWLRSTEHEVVNRINRRLDLATNLEIATAEELQVQNY 412

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGGHYEPH D +R  + +AF+  GTGNR+AT+L YM++   GG TVF +L  S+   K 
Sbjct: 413 GIGGHYEPHLDCSR--DEDAFERTGTGNRIATILIYMTEPEIGGRTVFINLKASVPCTKN 470

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
            A FW+NL  SG  D  + HAACPVLTG+    +            PCGL R  Q
Sbjct: 471 AALFWYNLMRSGAVDMRSYHAACPVLTGTKWTANKWFHERGQEWRRPCGLNRFDQ 525


>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
          Length = 522

 Score =  237 bits (605), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 117/266 (43%), Positives = 173/266 (65%), Gaps = 8/266 (3%)

Query: 2   IFPTHQRAQGNKLYYQEAL-NKSPELKDEPPKVNNV-APTLEVTEREKYEMLCRGDLTVP 59
           + P H+     K +Y++ L  +  +++    + N + +    ++    ++ LCRG++   
Sbjct: 237 LVPDHESTLHQKTFYEDILWYQQEQVRTTLFRSNRIPSSKASMSSLTTFKKLCRGEIQRN 296

Query: 60  PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
            +  + LKCRYV     + ++ P K EE +L+P+I+++ DV+ D+EI+L+K++A+P L R
Sbjct: 297 VSETSHLKCRYVSNLSAFSKIGPFKLEEMHLKPKIVIFHDVLSDTEIELLKRLAKPILER 356

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
           AT+ N +TG+ E +  R+SKS+W  +  H  I  I++RV  MTGL+  TAEELQVVNYG+
Sbjct: 357 ATIANQQTGKAERSKDRVSKSSWFPDEYHSTIRTITKRVADMTGLSMDTAEELQVVNYGL 416

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GG Y+PH+DF   G+      L   NR+ATVLFYMSDV+ GGATVF  L ++L   KGTA
Sbjct: 417 GGQYDPHFDFFHWGK------LKEVNRIATVLFYMSDVSIGGATVFPKLGVTLEARKGTA 470

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTG 265
           AFW+NLHSSG+ DY T H ACPVL G
Sbjct: 471 AFWYNLHSSGELDYSTLHGACPVLIG 496


>gi|427795421|gb|JAA63162.1| Putative prolyl-4-hydroxylase-alpha efb, partial [Rhipicephalus
           pulchellus]
          Length = 568

 Score =  237 bits (604), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 122/220 (55%), Positives = 153/220 (69%), Gaps = 10/220 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELK---------DEPPKVNNVAPTLEV-TEREKYEMLCR 53
           P H RA GNK YY++ L K  + K         D+   +   +P  +  +ER  YE LCR
Sbjct: 303 PDHPRAPGNKRYYEDTLAKREQYKRGDDGDISEDDSITLKKRSPLPDADSERGIYERLCR 362

Query: 54  GDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
           G+   P     +L C+Y   N PYL L P KEE  + +PRI++Y DV+ + E+++IK +A
Sbjct: 363 GEKFPPLFHDRELTCQYRTNNRPYLLLQPAKEEVMFPKPRIVIYHDVLSEHEMNVIKTLA 422

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           QPRLRRATVQNYK+GELE A+YRISKSAWL+  EH VI R++RR+E +TGLT  TAEELQ
Sbjct: 423 QPRLRRATVQNYKSGELETASYRISKSAWLKNEEHGVIARVTRRIEDITGLTADTAEELQ 482

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY 213
           VVNYGIGGHYEPH+DFAR  E NAF+SLGTGNR+AT L Y
Sbjct: 483 VVNYGIGGHYEPHFDFARREEKNAFQSLGTGNRIATWLNY 522


>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
          Length = 535

 Score =  236 bits (603), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 125/294 (42%), Positives = 176/294 (59%), Gaps = 20/294 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPK-VNNVA-----PTLEVT---EREKYEMLCRG 54
           P H RAQ NK+ + E + K+  +     + + N       PT E     E + Y+ LC+G
Sbjct: 238 PDHVRAQNNKIDFMERVKKAANVTSRVKRDLTNTTHYVPKPTPEYNSTPELQSYKRLCKG 297

Query: 55  DLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQ 114
               P   ++Q+ CRY H N P L L P+KEEE Y    ++L+ D+  D E+ +IK +A 
Sbjct: 298 LDVKPREKMSQVVCRYRHNNNPRLLLSPIKEEEVYRDANMVLFHDIASDKEMKIIKSLAI 357

Query: 115 PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQV 174
           P+L RATV +  TG+L  A YRI+K+AWL + +H V++R+  R++ +TGL   +A+ LQV
Sbjct: 358 PKLFRATVHDPTTGKLIHAKYRITKTAWLDDRDHLVVDRVQNRIKAVTGLDLDSADALQV 417

Query: 175 VNYGIGGHYEPHYDFA-RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
            NYGIGGHY+PHYDF+ R  +  +      GNR+AT L YM+DV  GGATVF  +++ + 
Sbjct: 418 ANYGIGGHYDPHYDFSTRDDDDTSETEKRDGNRIATFLLYMTDVDAGGATVFPIIDVRVL 477

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
           P+KGTA FW+NL  SG G   TRHAACPVL G+  + +            PCGL
Sbjct: 478 PKKGTAVFWYNLRRSGKGIMETRHAACPVLVGTKWVSNKWIRTRGQEFRRPCGL 531


>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
 gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
          Length = 520

 Score =  236 bits (602), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 120/228 (52%), Positives = 157/228 (68%), Gaps = 6/228 (2%)

Query: 44  EREKYEMLCRGD----LTVPPAIVAQLKCRY-VHRNVPYLRLMPLKEEEAYLQPRIILYR 98
           E   YE+LC+ D      +  + V  LKCRY  + N P L L P++ E+ + +P++ +  
Sbjct: 269 ESRVYELLCQADQPEIFNITSSRVKHLKCRYFTNNNHPRLLLAPIRLEQVFDKPKLWVLH 328

Query: 99  DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
           +++ D E+++IKK+AQPRLRRA V++  TGE E+A+YRISKSAWL + EH VI R+++RV
Sbjct: 329 NILTDPEMEVIKKLAQPRLRRARVESPTTGEGELASYRISKSAWLYDWEHRVIRRVNQRV 388

Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
           E +TGLT  TAE LQVVNYGIGGHYEPH+D A   E  A      G+R+AT+LFYMSDV 
Sbjct: 389 EDVTGLTMETAELLQVVNYGIGGHYEPHFDCATKDEEFALDP-NEGDRIATMLFYMSDVE 447

Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            GGATVF  +   + PEKG  AFW+NL  SG+GD  T HA CPVL GS
Sbjct: 448 AGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGS 495


>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
 gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
          Length = 534

 Score =  232 bits (592), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 123/269 (45%), Positives = 170/269 (63%), Gaps = 9/269 (3%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA-- 61
           P H+RA+ N  YY++ L K+ + K+    ++         E + Y+ LCRG   V     
Sbjct: 244 PKHERAKQNIYYYEKVLTKNSDGKEGEDSLSQDENDWS-HEFDFYKKLCRGGPKVKAGDN 302

Query: 62  --IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
             +   L C Y  R    L   P+  E   LQP I++Y +++ D E++ +K +A P L+R
Sbjct: 303 KMVSNHLTC-YQLRQHARLLFSPINVEVISLQPYILIYHNLLNDLEVEALKTLAAPMLQR 361

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
           ATV N  TG+LE A YRISKSAWL + +HP++ RIS  +E +TGLT  +AE LQ+ NYGI
Sbjct: 362 ATVHNKDTGKLEYATYRISKSAWLNDDDHPLVRRISTLIEDVTGLTMESAEALQIANYGI 421

Query: 180 GGHYEPHYDFA--RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GGHYEPH+D A  R G  + FK+   GNR+AT+L Y+S V  GGATVF+S  + + P +G
Sbjct: 422 GGHYEPHFDHADVRSG-TDVFKTWKGGNRIATMLIYLSSVELGGATVFSSAGVRIEPRQG 480

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +AAFW+NLH +G+G+  TRHAACPVL GS
Sbjct: 481 SAAFWYNLHRNGNGNNLTRHAACPVLIGS 509


>gi|410975458|ref|XP_003994148.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Felis catus]
          Length = 567

 Score =  229 bits (585), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 130/308 (42%), Positives = 177/308 (57%), Gaps = 44/308 (14%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
           P HQRA GN  Y++  + K        S +  D    +      ++ + ER+KYEMLCRG
Sbjct: 236 PEHQRANGNLKYFEYIMAKEKDGNKSASDDQSDRKTTLKKKGVAVDYLPERQKYEMLCRG 295

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 296 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 355

Query: 113 AQPRLRRATVQNYKTGELEIANYRISKS--AWLREPEHPVIERISRRVEH-----MTGLT 165
           A+PRL RATV + +TG+L  A YR+SKS  +W +     +I  +    E        G +
Sbjct: 356 AKPRLSRATVHDPETGKLTTAQYRVSKSLVSWGKVQRALLIRSMQVCCERGPEAAWDGGS 415

Query: 166 TSTAEELQ--------------------------VVNYGIGGHYEPHYDFARPGEANAFK 199
            S  E L                           V NYG+GG YEPH+DFAR  E +AFK
Sbjct: 416 MSAEECLAELSLLAGECSAALVPIGVCESRLGKGVANYGVGGQYEPHFDFARKDEPDAFK 475

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAA 259
            LGTGNR+AT LFYMSDV+ GGATVF  +  S+WP+KGTA FW+NL +SG+GDY TRHAA
Sbjct: 476 ELGTGNRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAA 535

Query: 260 CPVLTGSN 267
           CPVL G+ 
Sbjct: 536 CPVLVGNK 543


>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
 gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
          Length = 519

 Score =  226 bits (577), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 115/228 (50%), Positives = 152/228 (66%), Gaps = 6/228 (2%)

Query: 44  EREKYEMLCRGD----LTVPPAIVAQLKCRY-VHRNVPYLRLMPLKEEEAYLQPRIILYR 98
           E   YE+LC+G+      + P+ V  LKCRY  + N P L L P++ E+ + +P++ +  
Sbjct: 268 ESRVYELLCQGNQPEIFNITPSRVKHLKCRYFTNNNHPRLLLAPIRLEQVFDKPKLWVLH 327

Query: 99  DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
           +++ D E+++IKK+AQPRLR A  QN  TG   +++YRISK+AWL   EH +I R+ +RV
Sbjct: 328 NILSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSYRISKNAWLYYWEHRLINRVKQRV 387

Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
           E  TGLT  TAE LQV+NYGIGGHYEPH+D A   E  A      G+R+AT+LFYMSDV 
Sbjct: 388 EDATGLTMETAEPLQVINYGIGGHYEPHFDCATKDEEFALDP-NEGDRIATMLFYMSDVE 446

Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            GGATVF  +   + PEKG  AFW+NL  SG+GD  T HA CPVL GS
Sbjct: 447 AGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGS 494


>gi|51490656|emb|CAF31507.1| prolyl 4-hydroxylase 2 precursor [Brugia malayi]
          Length = 551

 Score =  226 bits (577), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 123/288 (42%), Positives = 173/288 (60%), Gaps = 19/288 (6%)

Query: 4   PTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVP 59
           P H RA+GN  +Y++ L     +  +++ + P +NN  P  +   ++ YE LCR ++ + 
Sbjct: 239 PDHPRAKGNVRWYEDLLEDEGIRRADMRRKVPPMNN--PRDKSNLKDTYEALCRQEVPIN 296

Query: 60  PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
               ++L C Y   + PYLRL P K E  +  P ++L+RD++ D E+ +I+ +A P+L R
Sbjct: 297 TKAQSRLYC-YYKMDRPYLRLAPFKVEIVHQNPLVVLFRDIVSDEEMRIIEMLAVPKLAR 355

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
           ATV N  TG +E A YR S+S+WL   EH V++RI++R++  T L T TAEELQV NYGI
Sbjct: 356 ATVHNVVTGNIETAFYRTSQSSWLGSTEHEVVKRINKRLDLATNLETETAEELQVQNYGI 415

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GGHYEPHYD +R    N F+    GNR+AT+L YM++   GG TVF  L  S+   K  A
Sbjct: 416 GGHYEPHYDCSR--RENVFEKTKNGNRIATILIYMTEPEIGGGTVFIDLKTSVSCTKNAA 473

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS-----NSLHSTC-----PCGL 277
            FW+NL  SG  D  + HAACPVLTG+        H +      PCGL
Sbjct: 474 LFWYNLMRSGAVDMRSYHAACPVLTGTKWTANKWFHESGQEWRRPCGL 521


>gi|339236275|ref|XP_003379692.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
 gi|316977629|gb|EFV60704.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
          Length = 441

 Score =  224 bits (571), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 132/322 (40%), Positives = 180/322 (55%), Gaps = 52/322 (16%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK----DEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
           I P H RA+GN  +Y + L K    +    D PP VN       + ER+ +E LCRG+  
Sbjct: 122 IKPDHPRAEGNVKWYLDLLAKEGVSRVTDHDLPPIVNARPNDQALPERKDFEALCRGEYL 181

Query: 58  VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           +     ++L C Y  R+ P+L L P+K E  + +P+I+++R V+  +EI ++K +A PRL
Sbjct: 182 LTEKQRSRLYC-YYKRDTPFLSLAPIKVEVMHWKPKIVIFRQVISANEIAVLKTLAYPRL 240

Query: 118 RRATVQNYKTGELEIA---------------------------NYRISKSAWLREPEHPV 150
            RATVQN +TGELE A                           +YRISKSAWL+E EHPV
Sbjct: 241 SRATVQNSETGELETAKYRISKRCRTLRRATVHNKETGQLEHASYRISKSAWLKEHEHPV 300

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           ++RI +R+  MT L   TAE+LQ+ NYG+GGHY+PH+D AR  E + ++  G GNR+AT 
Sbjct: 301 VDRIVKRIHDMTNLNMETAEDLQIANYGLGGHYDPHFDHARRDEVDPYEH-GHGNRIATT 359

Query: 211 LFYMSDVAQGGATVFTSLN----LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           LFY  +V       F SLN    ++     G AAFW NL  +G+GD  TRHAACPVL G 
Sbjct: 360 LFYKEEV-----NAFKSLNTGNRIATVLFYGDAAFWFNLKPNGEGDMSTRHAACPVLAGV 414

Query: 267 NSLHSTC----------PCGLR 278
             + +            PCGLR
Sbjct: 415 KWVANKWIHERGQEFYRPCGLR 436


>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
 gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
          Length = 514

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 118/264 (44%), Positives = 163/264 (61%), Gaps = 11/264 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
           P +QRA  +K    E L K  E K    K + + P +     + Y  LCRGD   P   +
Sbjct: 236 PDNQRALNSK----EPLEKWIEYK----KQHGLPPPVPEPYVKNYPSLCRGDDQRPAKEL 287

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           A+L+CRY H   P+LR+ PLK +E    P I++Y DV+ + EID I  +++P + R+ V 
Sbjct: 288 AKLRCRYEHNRTPFLRISPLKLQEVNHDPMIVMYHDVISNKEIDAIISISKPLMHRSMVG 347

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
           +    E  ++  R S +AWL +  HPV+  +S+R E MT L  + AE LQV NYGIGGHY
Sbjct: 348 D--DHEKAVSKTRTSSNAWLDDVMHPVVRTLSQRTEDMTNLAMTAAERLQVGNYGIGGHY 405

Query: 184 EPHYDFARPGEAN-AFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
            PHYD+A   E    + S+G GNR+ATV++Y+SDVA GGATVF  L L ++P+KG+A FW
Sbjct: 406 LPHYDYAVAEEGKEVYPSIGKGNRIATVMYYLSDVAIGGATVFPQLGLGVFPQKGSAIFW 465

Query: 243 HNLHSSGDGDYYTRHAACPVLTGS 266
           +NLH++G  D+ T H ACPV  GS
Sbjct: 466 YNLHANGTVDHRTLHGACPVFVGS 489


>gi|449673565|ref|XP_002167120.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 571

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 120/268 (44%), Positives = 168/268 (62%), Gaps = 8/268 (2%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELK---DEPPKVNNVAPTLEVTER--EKYEMLCRGDLT- 57
           P  QR   N  Y+ + L+ S       D+  K ++ A T +   +    YE LCRG++  
Sbjct: 280 PNEQRIVENLDYFNKYLHTSRSTSRYGDDGLKDDSSAFTSDNKNKVLNAYEQLCRGEVRP 339

Query: 58  VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           +     A++KC Y  ++ P L+L P K E  ++ P I + R+++ + +I+LIK+ A P L
Sbjct: 340 LTKKEQAKMKCWYSAKD-PVLKLKPQKVERVWVDPEIFILRNIISEKQINLIKEAASPML 398

Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
           RRAT+Q+  TG+L  A+YRISKSAWL   ++  ++ +  R +  TGL  S AE+LQV NY
Sbjct: 399 RRATIQDPITGKLRHADYRISKSAWLSTNKYNFLQALEARTQATTGLDLSYAEQLQVANY 458

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           G+GGHYEPH+D +R  E + F  LG GNR+ATVLFY+SDV  GGATVFT    +++P KG
Sbjct: 459 GLGGHYEPHFDHSRENE-DRFTDLGMGNRIATVLFYLSDVEAGGATVFTVGKTAVFPSKG 517

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            A FW NL  +G G+  TRHAACPVL G
Sbjct: 518 DAVFWFNLKRNGKGNPNTRHAACPVLVG 545


>gi|301613006|ref|XP_002936013.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
           (Silurana) tropicalis]
          Length = 504

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 125/271 (46%), Positives = 159/271 (58%), Gaps = 34/271 (12%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPK------VNNVAPTLEVTEREKYEMLCRGD-L 56
           P HQR  GN  Y++  ++K        P            P   + ER+KYE LCRG+ +
Sbjct: 235 PEHQRGNGNLRYFEYIMSKESNKSSSSPSEGAELGTRKGRPKDHLPERQKYEKLCRGEGV 294

Query: 57  TVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
            +      +L CRY   N  P L L P K+E+ + +PRI+ Y D++ D EI  +K++A+P
Sbjct: 295 KMTSRRQKRLFCRYFDGNKDPLLILSPTKQEDEWDKPRIVRYHDIISDEEISKVKELAKP 354

Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
           RLRRAT+ N  TG LE A YRISK  W                            EL+V 
Sbjct: 355 RLRRATISNPITGVLETAQYRISKR-W-------------------------AIMELEVA 388

Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
           NYG+GG YEPH+DFAR  E +AFK LGTGNRVAT LFYMSDV  GGATVF  +  +++P+
Sbjct: 389 NYGMGGQYEPHFDFARKDEPDAFKELGTGNRVATWLFYMSDVEAGGATVFPEVGAAVYPK 448

Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           KGTA FW+NL  SG+GDY TRHAACPVL G+
Sbjct: 449 KGTAVFWYNLFESGEGDYSTRHAACPVLVGN 479


>gi|443707037|gb|ELU02831.1| hypothetical protein CAPTEDRAFT_181697 [Capitella teleta]
          Length = 538

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 108/229 (47%), Positives = 146/229 (63%), Gaps = 6/229 (2%)

Query: 42  VTEREKYEMLCRGDLTVPPAIVAQ---LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYR 98
            ++ + YE LCRG+ T    +      + C YV R  P   L+P KEE  +L P I +Y 
Sbjct: 287 TSQFDDYERLCRGEETKVGKLSNSHLIMLCNYV-RPHPMFILVPAKEEVMFLDPFIAIYH 345

Query: 99  DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE-IANYRISKSAWLREPEHPVIERISRR 157
           ++M D E D+IK++++P+L R+ V  Y  G  + + +YR SKSAW+ + EHP+I R+S R
Sbjct: 346 NLMTDKEADMIKRISKPKLHRSGVFTYSGGNQKPVQDYRTSKSAWIEDEEHPMIRRVSER 405

Query: 158 VEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDV 217
              +T L+  T E  QVVNYGIGGHYEPH+DFARP E   F     GNR+ TV+FY++  
Sbjct: 406 TSALTDLSLDTVELFQVVNYGIGGHYEPHFDFARPNEIATFDP-EVGNRIITVIFYVAAP 464

Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             GGATVF  L + LWPEKG+ A W NL  +G+GDY T+HA CP +TGS
Sbjct: 465 EAGGATVFPDLGVKLWPEKGSCAVWWNLMRNGEGDYRTKHAGCPTITGS 513


>gi|344252711|gb|EGW08815.1| Prolyl 4-hydroxylase subunit alpha-2 [Cricetulus griseus]
          Length = 584

 Score =  216 bits (551), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 128/302 (42%), Positives = 169/302 (55%), Gaps = 58/302 (19%)

Query: 4   PTHQRAQGNKLYYQ--------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++        ++L    E      +     P   + ER+  E LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKSLFNQTEAGLATQENVYERPVDFLPERDVLESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPQRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY-------------------- 213
                               E +AFK LGTGNRVAT L Y                    
Sbjct: 418 ------------------SDEQDAFKRLGTGNRVATFLNYGDLRTLSCPQGFVALLSLGR 459

Query: 214 ----------MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVL 263
                     MSDV  GGATVF  L  ++WP+KGTA FW+NL  SG+GDY TRHAACPVL
Sbjct: 460 GAKLFALCSQMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVL 519

Query: 264 TG 265
            G
Sbjct: 520 VG 521


>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
 gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
          Length = 516

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 105/220 (47%), Positives = 147/220 (66%), Gaps = 5/220 (2%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           YE LCRGD   PP+  + L CRY     P+LRL PLK+E   L P + +Y D   D+EI+
Sbjct: 276 YEPLCRGDHQRPPSETSNLYCRYHMSTSPFLRLAPLKQEVVNLDPFVAVYHDAASDAEIN 335

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT-GLTT 166
            + ++ +P++ R+ V +    + E++  R S+++WL + +HPV+  +SRR + M  GL  
Sbjct: 336 KVIELGRPQINRSMVGD--AAKKEVSKSRTSQNSWLTDYDHPVVAALSRRTKDMALGLDE 393

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
           +  E LQV NYGIGGHY PHYD++R  E N +  L TGNR+AT++FY+SDV +GGATVF 
Sbjct: 394 TAYESLQVNNYGIGGHYLPHYDWSR--EENPYPELNTGNRIATLMFYLSDVEEGGATVFP 451

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            L + ++P+KGTA FW+NL +SG GD  T H ACPVL GS
Sbjct: 452 HLGVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPVLIGS 491


>gi|444512226|gb|ELV10078.1| Prolyl 4-hydroxylase subunit alpha-1 [Tupaia chinensis]
          Length = 474

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 112/222 (50%), Positives = 149/222 (67%), Gaps = 13/222 (5%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEP--PKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K        S +  D+   PK   VA    + ER+KYEMLCR
Sbjct: 209 PEHQRANGNLKYFEYIMAKEKDTNKSASDDQSDQKTTPKKKGVAVDY-LPERQKYEMLCR 267

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 268 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 327

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 328 LAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 387

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY 213
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFY
Sbjct: 388 LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFY 429


>gi|393903732|gb|EFO16802.2| hypothetical protein LOAG_11701 [Loa loa]
          Length = 531

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 123/295 (41%), Positives = 166/295 (56%), Gaps = 42/295 (14%)

Query: 2   IFPTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
           I P H RA+GN  +Y++ L     +  +++ + P +NN  P  +    + Y+ LCR ++ 
Sbjct: 237 INPDHPRAKGNVRWYEDLLEDEGVRRADMRRKVPPINN--PRDKSDLNDTYQALCRQEMP 294

Query: 58  VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           V     ++L C Y   + PYLRL P+K E  Y  P  +L+ D+M D E  +I+ +A P+L
Sbjct: 295 VNIKAQSRLYC-YYKMDRPYLRLAPIKVEIVYQNPLAVLFHDIMSDEESRIIEMLAVPKL 353

Query: 118 RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
            RATV N +TG LE A+YRISKSAWLR  EH V+ RI+RR++  T L  +TAEELQV NY
Sbjct: 354 DRATVHNVETGNLETASYRISKSAWLRSTEHEVVNRINRRLDLATNLEIATAEELQVQNY 413

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGGHYEPH D +R  + +AF+  GTGNR+AT+L Y                        
Sbjct: 414 GIGGHYEPHLDCSR--DEDAFERTGTGNRIATILIY-----------------------N 448

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQ 282
            A FW+NL  SG  D  + HAACPVLTG+    +            PCGL R  Q
Sbjct: 449 AALFWYNLMRSGAVDMRSYHAACPVLTGTKWTANKWFHERGQEWRRPCGLNRFDQ 503


>gi|426365135|ref|XP_004049642.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Gorilla gorilla
           gorilla]
          Length = 500

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 111/236 (47%), Positives = 152/236 (64%), Gaps = 7/236 (2%)

Query: 33  VNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQ 91
           +N V   L +    KY    +   +  P    +L CRY   N  P   L P K+E+ + +
Sbjct: 245 INTVFKILNILFEAKY---LQSTASFTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDK 301

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRII + D++ D+EI+++K +A+PRL RATV + +TG+L  A YR+SK        +  +
Sbjct: 302 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKRTICLL--YINL 359

Query: 152 ERISRRVEHMTGLTTSTAEEL-QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +R   R+  +  L  +T   + QV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT 
Sbjct: 360 KRYYTRLGFLFLLYNTTCPFVPQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 419

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           LFYMSDV+ GGATVF  +  S+WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 420 LFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 475


>gi|449513594|ref|XP_002191636.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
           [Taeniopygia guttata]
          Length = 346

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 109/221 (49%), Positives = 150/221 (67%), Gaps = 11/221 (4%)

Query: 4   PTHQRAQGNKLYYQ-------EALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD- 55
           P HQRA GN  Y++       EA   S + +++  K   V     + ER KYEMLCRG+ 
Sbjct: 127 PEHQRANGNMKYFEYIMAKEKEANKSSTDSEEQQEKETEVKKKDYLPERRKYEMLCRGEG 186

Query: 56  LTVPPAIVAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
           L + P    +L CRY   +RN  Y+ L P+K+E+ + +PRI+ + D++ D EI+ +K++A
Sbjct: 187 LKMTPRRQKRLFCRYYDGNRNPRYI-LGPVKQEDEWDKPRIVRFLDIISDEEIETVKELA 245

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +PRL RATV + +TG+L  A+YR+SKSAWL   E PV+ RI+ R++ +TGL  STAEELQ
Sbjct: 246 KPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQ 305

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYM 214
           V NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFY+
Sbjct: 306 VANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYV 346


>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
           musculus]
 gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_d [Rattus norvegicus]
          Length = 189

 Score =  208 bits (530), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 98/165 (59%), Positives = 127/165 (76%), Gaps = 2/165 (1%)

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
           M D EI+ IK++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H
Sbjct: 1   MSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQH 60

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
           +TGLT  TAE LQV NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  G
Sbjct: 61  ITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAG 118

Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           GATVF  L  ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 119 GATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 163


>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
 gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
          Length = 537

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 106/243 (43%), Positives = 149/243 (61%), Gaps = 3/243 (1%)

Query: 24  PELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPL 83
           PE  DEP    +        E  +YE +CRG++   P     L+CR    N P+  L PL
Sbjct: 263 PEESDEPLLPRHSDSYSLTHEFAQYEKVCRGEVNPTPRQERNLRCRLSQGNHPFRLLAPL 322

Query: 84  KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
           K EE  L P ++ Y D++   +I  +++MA PR+RR+TV     G+ + + +R+SK+AWL
Sbjct: 323 KLEEHNLDPYVVTYHDMLSAQKIRDLRQMAVPRMRRSTVNPLPGGQNKKSAFRVSKNAWL 382

Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
               HP +E + R ++  TGL T+  E+LQV NYG+GGHYEPH+DF R  + N + +   
Sbjct: 383 AYESHPTMEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHWDFFR--DPNHYPA-EE 439

Query: 204 GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVL 263
           GNR+AT +FY+SDV QGGAT F  L+ ++ P+ G   FW+NLH S D DY T+HA CPVL
Sbjct: 440 GNRIATAIFYLSDVEQGGATAFPFLDFAVKPQLGNVLFWYNLHRSLDMDYRTKHAGCPVL 499

Query: 264 TGS 266
            GS
Sbjct: 500 KGS 502


>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
          Length = 187

 Score =  207 bits (526), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 97/163 (59%), Positives = 126/163 (77%), Gaps = 2/163 (1%)

Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
           D EI+ IK++A+P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+T
Sbjct: 1   DEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHIT 60

Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
           GLT  TAE LQV NYG+GG YEPH+DF+R    +  K+   GNR+AT L YMSDV  GGA
Sbjct: 61  GLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYMSDVEAGGA 118

Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           TVF  L  ++WP+KGTA FW+NL  SG+GDY TRHAACPVL G
Sbjct: 119 TVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 161


>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
 gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
          Length = 520

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 114/251 (45%), Positives = 156/251 (62%), Gaps = 14/251 (5%)

Query: 18  EALNKSPELKDEP-PKVN-NVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNV 75
           EAL ++     +P PKV  + +PTL       YEM CRG    P +  ++L CRY     
Sbjct: 260 EALIRTGTSNQQPQPKVGLSRSPTL-------YEMGCRG--MYPASTDSKLVCRYNSTTT 310

Query: 76  PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
           P+L L PLK E   L P +++Y DV+  +EID +K+MA P L+RATV     G+ E+   
Sbjct: 311 PFLTLAPLKMEIVGLNPYMVIYHDVLSSAEIDEMKEMATPSLKRATVYKASLGKNEVVKT 370

Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           R SK AW  +  + +  R++ R+  MTG   S +E LQ++NYG+GGHY+ HYDF    E 
Sbjct: 371 RTSKVAWFPDSYNSLTLRLNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHYDFFNATEK 430

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYT 255
           ++  SL TG+R+ATVLFYMSDV QGGATVF ++  +++P++GTA  W+NL   G  D  T
Sbjct: 431 SS--SL-TGDRIATVLFYMSDVEQGGATVFPNIYKTVYPQRGTAVMWYNLKDDGQPDEQT 487

Query: 256 RHAACPVLTGS 266
            HAACPVL GS
Sbjct: 488 LHAACPVLVGS 498


>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
          Length = 537

 Score =  206 bits (523), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 102/223 (45%), Positives = 145/223 (65%), Gaps = 5/223 (2%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E  KYE +CRG+  V P +  +L+CRY   N PY  L PLK EE  L P +  + D++  
Sbjct: 285 EFAKYEKVCRGE--VHPIVRQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDILSP 342

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I  +++MA PR+ R+TV     G+L+ + +R+SK+AWL    HP +  + R ++  TG
Sbjct: 343 GKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATG 402

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
           L T+  E+LQV NYG+GGHYEPH+DF R  + N + +   GNR+AT +FY+S+V QGGAT
Sbjct: 403 LDTTFCEQLQVANYGVGGHYEPHWDFFR--DPNHYPA-EEGNRIATAIFYLSEVEQGGAT 459

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            F  L++++ P+ G   FW+NLH S D DY T+HA CPVL GS
Sbjct: 460 AFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGS 502


>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
 gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
          Length = 536

 Score =  205 bits (522), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 101/228 (44%), Positives = 142/228 (62%), Gaps = 13/228 (5%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E   YE +CRG++   PA    L+CRY   N PY +L PLK EE  L P ++ Y D++  
Sbjct: 282 EFAHYEKVCRGEVEPSPAQQRPLRCRYSQGNHPYRQLAPLKMEEHSLDPFVVTYHDMLSP 341

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
           ++I  +++MA P +RR+TV     G+ + +++R+SK+AWL    HP + ++ R +   TG
Sbjct: 342 NKIAQLREMAVPHMRRSTVNPLPGGQNKKSSFRVSKNAWLAYETHPTMGKMLRDLSDTTG 401

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFAR-----PGEANAFKSLGTGNRVATVLFYMSDVA 218
           L  +  E+LQV NYG+GGHYEPH+DF R     P E         GNR+AT ++Y+S+V 
Sbjct: 402 LDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEE--------GNRIATAIYYLSEVE 453

Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           QGGAT F  LN ++ P+ G   FW+NLH S D DY T+HA CPVL GS
Sbjct: 454 QGGATAFPFLNFAVRPQLGNVLFWYNLHRSSDMDYRTKHAGCPVLKGS 501


>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
          Length = 467

 Score =  205 bits (522), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 102/223 (45%), Positives = 145/223 (65%), Gaps = 5/223 (2%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E  KYE +CRG+  V P +  +L+CRY   N PY  L PLK EE  L P +  + D++  
Sbjct: 215 EFAKYEKVCRGE--VHPIVRQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDILSP 272

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I  +++MA PR+ R+TV     G+L+ + +R+SK+AWL    HP +  + R ++  TG
Sbjct: 273 GKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATG 332

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
           L T+  E+LQV NYG+GGHYEPH+DF R  + N + +   GNR+AT +FY+S+V QGGAT
Sbjct: 333 LDTTFCEQLQVANYGVGGHYEPHWDFFR--DPNHYPA-EEGNRIATAIFYLSEVEQGGAT 389

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            F  L++++ P+ G   FW+NLH S D DY T+HA CPVL GS
Sbjct: 390 AFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGS 432


>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
           [Drosophila melanogaster]
          Length = 286

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 102/226 (45%), Positives = 146/226 (64%), Gaps = 5/226 (2%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E  KYE +CRG+  V P +  +L+CRY   N PY  L PLK EE  L P +  + D++  
Sbjct: 34  EFAKYEKVCRGE--VHPIVRQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDILSP 91

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I  +++MA PR+ R+TV     G+L+ + +R+SK+AWL    HP +  + R ++  TG
Sbjct: 92  GKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATG 151

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
           L T+  E+LQV NYG+GGHYEPH+DF R  + N + +   GNR+AT +FY+S+V QGGAT
Sbjct: 152 LDTTFCEQLQVANYGVGGHYEPHWDFFR--DPNHYPA-EEGNRIATAIFYLSEVEQGGAT 208

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
            F  L++++ P+ G   FW+NLH S D DY T+HA CPVL GS  +
Sbjct: 209 AFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWI 254


>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 520

 Score =  203 bits (517), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 101/220 (45%), Positives = 144/220 (65%), Gaps = 5/220 (2%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           YE LCRGD   P  + +QL CRY     P+LRL PLK E   L+P I++Y + + D EI 
Sbjct: 280 YEKLCRGDYERPGEVTSQLFCRYETSATPFLRLAPLKLEVVNLEPLIVVYHEAVSDREIA 339

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG-LTT 166
            + ++A+P ++R+ V + ++ +  I+  RIS++AW      P++E +++R   M G L  
Sbjct: 340 KLIELARPLIKRSAVGDTRSEQ--ISKIRISQNAWFENEHDPIVETLNQRARDMAGGLNE 397

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
            + E LQV NYG+GG Y  HYD++    AN F + G GNR+AT++FY+SDV +GG+TVF 
Sbjct: 398 PSYELLQVNNYGLGGFYSIHYDWSTS--ANPFPNKGMGNRIATLMFYLSDVQEGGSTVFP 455

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            LNL++ P KGTA FW+NLH +G G+  T HAACPVL GS
Sbjct: 456 RLNLAVRPRKGTAIFWYNLHRNGKGNKKTLHAACPVLIGS 495


>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
 gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
          Length = 537

 Score =  203 bits (516), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 102/223 (45%), Positives = 144/223 (64%), Gaps = 5/223 (2%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E  KYE +CRG+  V P    +L+CRY   N PY  L PLK EE  L P +  + D++  
Sbjct: 285 EFAKYEKVCRGE--VHPIARQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDMLNP 342

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I  +++MA PR+ R+TV     G+L+ + +R+SK+AWL    HP +  + R ++  TG
Sbjct: 343 RKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATG 402

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
           L T+  E+LQV NYG+GGHYEPH+DF R  + N + +   GNR+AT +FY+S+V QGGAT
Sbjct: 403 LDTTFCEQLQVANYGVGGHYEPHWDFFR--DPNHYPA-EEGNRIATAIFYLSEVEQGGAT 459

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            F  L++++ P+ G   FW+NLH S D DY T+HA CPVL GS
Sbjct: 460 AFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGS 502


>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
 gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
          Length = 537

 Score =  203 bits (516), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 102/223 (45%), Positives = 144/223 (64%), Gaps = 5/223 (2%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E  KYE +CRG+  V P    +L+CRY   N PY  L PLK EE  L P +  + D++  
Sbjct: 285 EFAKYEKVCRGE--VHPIARQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDMLSP 342

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I  +++MA PR+ R+TV     G+L+ + +R+SK+AWL    HP +  + R ++  TG
Sbjct: 343 RKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATG 402

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
           L T+  E+LQV NYG+GGHYEPH+DF R  + N + +   GNR+AT +FY+S+V QGGAT
Sbjct: 403 LDTTFCEQLQVANYGVGGHYEPHWDFFR--DPNHYPA-EEGNRIATAIFYLSEVEQGGAT 459

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            F  L++++ P+ G   FW+NLH S D DY T+HA CPVL GS
Sbjct: 460 AFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGS 502


>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
           4-dioxygenase (proline 4-hydroxylase), alpha 1
           polypeptide [Ciona intestinalis]
          Length = 195

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 95/165 (57%), Positives = 124/165 (75%), Gaps = 1/165 (0%)

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
           M D E+ +IK +A+PRLRRATVQN  TG LE A+YR+SKSAWL++ +HPVI+R+ +R+  
Sbjct: 1   MSDKEMAMIKSLAKPRLRRATVQNPVTGVLEFAHYRVSKSAWLKDEDHPVIKRVCQRISD 60

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
           +TGL+  TAEELQ+ NYG+GG YEPH+D++R  +   F     GNR+AT L YMS+V QG
Sbjct: 61  VTGLSMETAEELQIANYGVGGQYEPHFDYSRKSDFGKFDD-EVGNRIATFLTYMSNVEQG 119

Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           G+TVF    +++ P KG+A FW+NL  SG GD  TRHAACPVLTG
Sbjct: 120 GSTVFLHPGIAVRPIKGSAVFWYNLLPSGAGDERTRHAACPVLTG 164


>gi|195505207|ref|XP_002099404.1| GE23380 [Drosophila yakuba]
 gi|194185505|gb|EDW99116.1| GE23380 [Drosophila yakuba]
          Length = 540

 Score =  201 bits (511), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 118/296 (39%), Positives = 166/296 (56%), Gaps = 22/296 (7%)

Query: 4   PTHQRAQGNKLYYQEALNK----SPELKDEPPKVNNVAPTLEVTEREKYEM---LCRGDL 56
           P H+ A  NKL Y+  L K    +P  + + P V    P      +E Y++   +CRG+L
Sbjct: 247 PDHEEAHRNKLLYEGQLAKERSFTPRKQVDLPHVAGKEP------KESYKLYTQVCRGEL 300

Query: 57  TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
              P     L+C   H+ VPY RL P K E+  L P +    +V++DSEID+I +  +  
Sbjct: 301 HQTPREQRNLRCWLTHQGVPYYRLAPFKIEQLNLDPYVAYVHEVLWDSEIDMIMEHGKGN 360

Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
           ++R+ V   ++G       R S++ WL    +P + +I +R+E +TGL+T +AE LQ+VN
Sbjct: 361 MKRSMVG--QSGNSTTTEIRTSQNTWLWYDANPWLAKIKQRLEDVTGLSTESAEPLQLVN 418

Query: 177 YGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
           YGIGG YEPH+DF    E +  K  G  GNR+AT LFY++DVA GGAT F  L L++ P 
Sbjct: 419 YGIGGQYEPHFDFM---EDDGQKVFGWKGNRLATALFYLNDVALGGATAFPFLRLAVPPV 475

Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQRSGIICTLV 291
           KG+   W+NLHSS   D+ T+HA CPVL GS  +   C      G Q     C LV
Sbjct: 476 KGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWI---CNEWFHVGAQEFRRPCGLV 528


>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 509

 Score =  201 bits (510), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 114/264 (43%), Positives = 157/264 (59%), Gaps = 17/264 (6%)

Query: 16  YQEALNKSPE-LKDEPPKVNNVAPTLEVTEREKY-----------EMLCRGDLTVPPAIV 63
           Y +AL  + + LK +P     +     + E  KY           E+LCRGD   P +  
Sbjct: 221 YVDALKITNQILKQDPTHAGRLVEKKTIGELMKYLENKLRPEVPHELLCRGDYQRPASET 280

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           + L CRY      +LRL PLKEE   L P I +Y DV  D EI  + ++A+ R+ RAT++
Sbjct: 281 SHLYCRYHTGTSSFLRLAPLKEEVLNLDPFITVYHDVASDREISKLIELAKSRISRATIR 340

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG-LTTSTAEELQVVNYGIGGH 182
           +   GE +++N R S++AWL   +  V+  + RRV  MTG L   + E LQV NYG+GGH
Sbjct: 341 D--DGEPQVSNARTSQNAWLDAGDDRVVTTLDRRVGDMTGGLRQQSYEMLQVNNYGVGGH 398

Query: 183 YEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
           Y  H+D+A   EA  +  L  GNR+ATV+FY+SDV  GGATVF  L L+++P KG+A  W
Sbjct: 399 YVAHHDWAM--EAVPYAGLRVGNRIATVMFYLSDVEIGGATVFPQLGLAVFPRKGSAILW 456

Query: 243 HNLHSSGDGDYYTRHAACPVLTGS 266
           +NL+ +G GD  T HAACPVL+GS
Sbjct: 457 YNLYRNGKGDRRTLHAACPVLSGS 480


>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
 gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
          Length = 537

 Score =  200 bits (508), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 101/223 (45%), Positives = 143/223 (64%), Gaps = 5/223 (2%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E  +YE +CRG+  V P    +L+CRY   + PY  L PLK EE  L P +  Y D++  
Sbjct: 285 EFAQYEKVCRGE--VHPIARQELRCRYSRGSHPYRYLAPLKLEEHSLDPYVATYHDMLSP 342

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I  +++MA PR+RR+TV     G+ + + +R+SK+AWL    HP +  + R ++  TG
Sbjct: 343 RKISQLREMAVPRMRRSTVNPLPGGQHKKSAFRVSKNAWLAYESHPTMVGMLRDLKEATG 402

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
           L T+  E+LQV NYG+GGHYEPH+DF R  + N +     GNR+AT +FY+S+V QGGAT
Sbjct: 403 LDTTYCEQLQVANYGVGGHYEPHWDFFR--DPNHYPE-EEGNRIATAIFYLSEVEQGGAT 459

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            F  L++++ P+ G   FW+NLH S D DY T+HA CPVL GS
Sbjct: 460 AFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGS 502


>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
 gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
          Length = 521

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 108/251 (43%), Positives = 143/251 (56%), Gaps = 14/251 (5%)

Query: 28  DEPPKVNNVAPTLE----------VTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPY 77
           DE   +N   P LE          V E + Y + C G   + P     L+C YV    P+
Sbjct: 228 DEKALLNESKPILEHAPIPEEGEPVDEFQAYSLTCSGHWRLTPKEQRHLRCGYVTETHPF 287

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
           L + PLK EE +  P ++LY DV+Y SEID+I+K+ + RL+RATV  +   E  ++N R 
Sbjct: 288 LWIAPLKAEELFQDPLLVLYHDVIYQSEIDVIRKLTENRLKRATVTGH--NESVVSNVRT 345

Query: 138 SKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEA 195
           S+  ++    H V+  I +RV  MT L    AE+ Q  NYGIGGHY  H D  +    +A
Sbjct: 346 SQFTFIPVSAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTIDA 405

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYT 255
               S   GNR+ATVLFY+SDV+QGG T F  L   L P+K  AAFWHNLH+SG GD  T
Sbjct: 406 GLISSPEMGNRIATVLFYLSDVSQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRT 465

Query: 256 RHAACPVLTGS 266
           +H ACP++ GS
Sbjct: 466 QHGACPIIAGS 476


>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
 gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
          Length = 533

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 102/239 (42%), Positives = 137/239 (57%), Gaps = 3/239 (1%)

Query: 28  DEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEE 87
           DE       A      E   YE +CRG++    A   +L+CRY      Y  L PLK EE
Sbjct: 263 DEGAHSRQAAGYRLTQEFAHYEKVCRGEVGPSAAQQRRLRCRYARGRHAYRLLAPLKLEE 322

Query: 88  AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
             L P ++ Y D++   +I  ++ MA P ++R+TV     G+   + +R+SK+AWL    
Sbjct: 323 HSLDPLVVSYHDMLSPQQIGELRAMAVPHMQRSTVNPLSGGQRMKSAFRVSKNAWLPYST 382

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRV 207
           HP++ R+ R V   TGL  +  E+LQV NYG+GGHYEPH+DF R            GNR+
Sbjct: 383 HPMMGRMLRDVGDATGLDMTYCEQLQVANYGVGGHYEPHWDFFRDSR---HYPAAEGNRI 439

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           AT +FY+SDV QGGAT F  LN ++ P+ G   FW+NLH S D DY T+HA CPVL GS
Sbjct: 440 ATAIFYLSDVEQGGATAFPFLNFAVRPQLGNILFWYNLHRSSDEDYRTKHAGCPVLKGS 498


>gi|20269816|gb|AAM18063.1|AF495541_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG1
           [Drosophila melanogaster]
          Length = 540

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 112/290 (38%), Positives = 161/290 (55%), Gaps = 25/290 (8%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEM---LCRGDLTVPP 60
           P H+ A  NK+ Y+  L +       P K   +    E  ++E Y++   +CRG+L   P
Sbjct: 247 PDHEEALKNKILYEGQLARERSFA--PRKQVELPHIAEKEQKESYKLYTEVCRGELHQSP 304

Query: 61  AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
                L+C   H+ VPY RL P K E+  + P +    +V++DSEID I +  +  + R+
Sbjct: 305 REQRNLRCWLSHQGVPYYRLFPFKIEQLNIDPYVAYVHEVLWDSEIDTIMEHGKGNMERS 364

Query: 121 TV---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
            V   +N  T E+     RIS++ WL    +P + +I +R+E +TGL+T +AE LQ+VNY
Sbjct: 365 KVGQSENSTTSEV-----RISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNY 419

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGG YEPH+DF      + F     GNR+ T LFY++DVA GGAT F  L L++ P KG
Sbjct: 420 GIGGQYEPHFDFVEDDGQSVFS--WKGNRLLTALFYLNDVALGGATAFPFLRLAVPPVKG 477

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
           +   W+NLHSS   D+ T+HA CPVL GS  + +            PCGL
Sbjct: 478 SLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRPCGL 527


>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
 gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
          Length = 537

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 101/225 (44%), Positives = 141/225 (62%), Gaps = 7/225 (3%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E  KYE +CRG+  V P    +L+CRY   + PY  L PLK EE  L P +  + D++  
Sbjct: 285 EFAKYEEVCRGE--VQPIARQELRCRYSRGSHPYRILAPLKLEEHSLDPYVASFHDMLSP 342

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I  +++MA PR++R+TV     G+ + + +R+SK+AWL    HP +  + R ++  TG
Sbjct: 343 RKISQLREMAVPRMQRSTVNPRPGGQHKKSAFRVSKNAWLAYEAHPTMAGMLRDLKDATG 402

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFAR-PGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
           L T+  E+LQV NYG+GGHYEPH+DF R P    A      GNR+AT +FY+S+V QGGA
Sbjct: 403 LDTTFCEQLQVANYGVGGHYEPHWDFFRDPSHYPA----AEGNRIATAIFYLSEVEQGGA 458

Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           T F  L+ ++ P+ G   FW+NLH S D DY T+HA CPVL GS 
Sbjct: 459 TAFPFLDFAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSK 503


>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
 gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
          Length = 547

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 109/257 (42%), Positives = 147/257 (57%), Gaps = 7/257 (2%)

Query: 12  NKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYV 71
           N L  +  LN+S  + +  P      P   V E + Y + C G   + P     L+C YV
Sbjct: 256 NALSEKALLNESKPILEHAPIPEEGEP---VGEFQAYSLTCSGHWRLTPKEQRHLRCGYV 312

Query: 72  HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE 131
               P+L + PLK EE +  P ++LY DV+Y SEID+I+K+ + RL RAT+ ++   E  
Sbjct: 313 TETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSEIDVIRKLTENRLMRATITSH--NESV 370

Query: 132 IANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--F 189
           ++N R S+  ++    H V+  I +RV  MT L    AE+ Q  NYGIGGHY  H D  +
Sbjct: 371 VSNVRTSQFTFIPVTAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430

Query: 190 ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSG 249
               +A    S   GNR+ATVLFY+SDVAQGG T F  L   L P+K  AAFWHNLH+SG
Sbjct: 431 QTTFDAGLVSSPEMGNRIATVLFYLSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASG 490

Query: 250 DGDYYTRHAACPVLTGS 266
            GD  T+H ACP++ GS
Sbjct: 491 VGDVRTQHGACPIIAGS 507


>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
 gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
          Length = 550

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 107/249 (42%), Positives = 144/249 (57%), Gaps = 7/249 (2%)

Query: 20  LNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLR 79
           LN+S  + +  P      P   V E + Y + C G   + P   + L+C YV    P+L 
Sbjct: 261 LNESKPILEHAPIPEEGEP---VGEFQAYSLTCSGHWRLTPKEQSHLRCGYVTETHPFLW 317

Query: 80  LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
           + PLK EE +  P ++LY DV+Y SEID+I+K+ + RL RATV  +   E  ++N R S+
Sbjct: 318 IAPLKAEELFQDPLLVLYHDVIYQSEIDVIRKLTENRLMRATVTGHN--ESLVSNVRTSQ 375

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEANA 197
             ++    H V+  I +RV  MT L    AE+ Q  NYGIGGHY  H D  +    +A  
Sbjct: 376 FTFIPASAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGL 435

Query: 198 FKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRH 257
             S   GNR+ATVLFY+SDV+QGG T F  L   L P+K  AAFWHNLH+SG GD  T+H
Sbjct: 436 VSSPEMGNRIATVLFYLSDVSQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQH 495

Query: 258 AACPVLTGS 266
            ACP++ GS
Sbjct: 496 GACPIIAGS 504


>gi|24651424|ref|NP_733376.1| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
 gi|23172697|gb|AAF57059.2| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
 gi|66772443|gb|AAY55533.1| IP03659p [Drosophila melanogaster]
 gi|220951214|gb|ACL88150.1| PH4alphaSG1-PA [synthetic construct]
 gi|220959938|gb|ACL92512.1| PH4alphaSG1-PA [synthetic construct]
          Length = 540

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 112/290 (38%), Positives = 161/290 (55%), Gaps = 25/290 (8%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEM---LCRGDLTVPP 60
           P H+ A  NK+ Y+  L +       P K   +    E  ++E Y++   +CRG+L   P
Sbjct: 247 PDHEEALKNKILYEGQLARERSFA--PRKQVELPQIAEKEQKESYKLYTQVCRGELHQSP 304

Query: 61  AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
                L+C   H+ VPY RL P K E+  + P +    +V++DSEID I +  +  + R+
Sbjct: 305 REQRNLRCWLYHQGVPYYRLSPFKIEQLNVDPYVAYVHEVLWDSEIDTIMEHGKGNMERS 364

Query: 121 TV---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
            V   +N  T E+     RIS++ WL    +P + +I +R+E +TGL+T +AE LQ+VNY
Sbjct: 365 KVGQSENSTTSEV-----RISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNY 419

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGG YEPH+DF      + F     GNR+ T LFY++DVA GGAT F  L L++ P KG
Sbjct: 420 GIGGQYEPHFDFVEDDGQSVFS--WKGNRLLTALFYLNDVALGGATAFPFLRLAVPPVKG 477

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
           +   W+NLHSS   D+ T+HA CPVL GS  + +            PCGL
Sbjct: 478 SLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRPCGL 527


>gi|66772331|gb|AAY55477.1| IP03959p [Drosophila melanogaster]
 gi|66772361|gb|AAY55492.1| IP03859p [Drosophila melanogaster]
          Length = 541

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 112/290 (38%), Positives = 161/290 (55%), Gaps = 25/290 (8%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEM---LCRGDLTVPP 60
           P H+ A  NK+ Y+  L +       P K   +    E  ++E Y++   +CRG+L   P
Sbjct: 248 PDHEEALKNKILYEGQLARERSFA--PRKQVELPQIAEKEQKESYKLYTQVCRGELHQSP 305

Query: 61  AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
                L+C   H+ VPY RL P K E+  + P +    +V++DSEID I +  +  + R+
Sbjct: 306 REQRNLRCWLYHQGVPYYRLSPFKIEQLNVDPYVAYVHEVLWDSEIDTIMEHGKGNMERS 365

Query: 121 TV---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
            V   +N  T E+     RIS++ WL    +P + +I +R+E +TGL+T +AE LQ+VNY
Sbjct: 366 KVGQSENSTTSEV-----RISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNY 420

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGG YEPH+DF      + F     GNR+ T LFY++DVA GGAT F  L L++ P KG
Sbjct: 421 GIGGQYEPHFDFVEDDGQSVFS--WKGNRLLTALFYLNDVALGGATAFPFLRLAVPPVKG 478

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
           +   W+NLHSS   D+ T+HA CPVL GS  + +            PCGL
Sbjct: 479 SLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRPCGL 528


>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
 gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
          Length = 525

 Score =  197 bits (500), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 100/219 (45%), Positives = 136/219 (62%), Gaps = 5/219 (2%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           YE  CRG         ++L C Y   N P+LRL PLK E   L P ++LY DV+  SEI 
Sbjct: 289 YERGCRGQFPTK----SKLHCVYNSTNSPFLRLAPLKTELLALDPYMVLYHDVITPSEIR 344

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
            ++ +A P L+RATV N K G   +   R SK  WL +  +P+  R++RR+  MTG    
Sbjct: 345 ELQYLAVPTLKRATVFNQKMGRNTVVKTRTSKVTWLTDSLNPLTVRLNRRISDMTGFDLY 404

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
            +E LQV+NYG+GGHY+ H+D+     A     L  G+R+ATVLFY++DV QGGATVF +
Sbjct: 405 GSEMLQVMNYGLGGHYDLHFDYFNATIAKDLTKLN-GDRIATVLFYLTDVEQGGATVFPN 463

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +  +++P+KGTA  W+NL  + DGD  T HAACPV+ GS
Sbjct: 464 IKQAIFPKKGTAVMWYNLRHNNDGDPQTLHAACPVIVGS 502


>gi|194905397|ref|XP_001981189.1| GG11929 [Drosophila erecta]
 gi|190655827|gb|EDV53059.1| GG11929 [Drosophila erecta]
          Length = 538

 Score =  196 bits (499), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 111/289 (38%), Positives = 162/289 (56%), Gaps = 24/289 (8%)

Query: 4   PTHQRAQGNKLYYQEALNKS----PELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVP 59
           P H+ A  NK+ Y+  L K+    P  + +PP+     P       + Y  LCRG+L   
Sbjct: 246 PDHEEALKNKVLYEGQLAKARNVIPRKQVDPPQTAEEEPKESF---QLYTQLCRGELHQS 302

Query: 60  PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
           P     L+C   H+ VPY RL P K E+  L P + L   V++DSE+++I +  +  + R
Sbjct: 303 PREQRNLRCWLSHQGVPYYRLSPFKFEQLNLDPYVALVHHVLWDSEMEMIMQHGRGSMER 362

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
           + V   +  +  IA+ R S++ WL    +P + RI +R+E +TGL+T +AE LQ++NYGI
Sbjct: 363 SKVGQSENSK--IADRRTSQNTWLWYDVNPWLSRIKQRLEDVTGLSTESAEPLQLLNYGI 420

Query: 180 GGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           GG YEPH+DF    E    K  G   +R+ T +FY++DVA GGAT F  L L++ PEKG+
Sbjct: 421 GGQYEPHFDFVEDAE----KIFGWQDDRLMTAIFYINDVALGGATAFPFLRLAVPPEKGS 476

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
              W+NLHSS   DY ++HA CP+L GS  + +            PCGL
Sbjct: 477 LLMWNNLHSSLHKDYRSKHAGCPILQGSKWICTEWFHVGAQELKRPCGL 525


>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
 gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score =  196 bits (499), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 108/271 (39%), Positives = 160/271 (59%), Gaps = 14/271 (5%)

Query: 4   PTHQRAQGNKLYYQ------EALNKSPELKDEPPKVNNVAPTLEVTE-REKYEMLCRGDL 56
           P H+ A  NK+ Y+       +++ SP +K +PP+     P  E+ E +E Y+ +CRG+L
Sbjct: 251 PGHEEAVKNKIVYEALLARERSISSSPRMKLDPPQEAAPEPEPELKESQELYQRVCRGEL 310

Query: 57  TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
              P     L+C   H++VPY RL P K E+    P +  + DV+ D E + I +  + +
Sbjct: 311 RQSPKEQRYLRCWLSHQDVPYQRLSPFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQ 370

Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
           + R+ +   +TG   +++ R S++ WL    +P +  I +R+E +TGL+T TAE LQ+VN
Sbjct: 371 VTRSEIG--QTGNSTVSDIRTSQNTWLWYENNPWLADIKQRLEDITGLSTDTAEPLQLVN 428

Query: 177 YGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
           YGIGG YEPH+DF    E N     G  GNR+ T LFY++DV  GGAT F  L+L++ P 
Sbjct: 429 YGIGGQYEPHFDFMDDAEKN----FGWKGNRLLTALFYLNDVPLGGATAFPFLHLAVPPV 484

Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           KG+   W+NLH S   D+ T+HA CPVL GS
Sbjct: 485 KGSLLVWYNLHRSLHKDFRTKHAGCPVLKGS 515


>gi|443697961|gb|ELT98195.1| hypothetical protein CAPTEDRAFT_181380 [Capitella teleta]
          Length = 530

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 105/276 (38%), Positives = 160/276 (57%), Gaps = 14/276 (5%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE------------VTEREKYE 49
           I P H+RA  N+ YY+  + ++ + + +  K +N AP ++                  YE
Sbjct: 231 IEPGHERAIANRRYYERIIAEADDAERQKLKGDNGAPVVDGKPHRFLTDYTGSKSYSDYE 290

Query: 50  MLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
            LCRG+ T       +L CRY  R  P   + PLKEE     P I +Y DV+ DS+  +I
Sbjct: 291 KLCRGEETHKRPFKHRLVCRY-QRYHPIFYISPLKEEMLNFDPAIYVYHDVLTDSQNAII 349

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K++++P+L R+ V +    +  ++N+R S++AW  +  HP+I R+S++   ++ LT  T 
Sbjct: 350 KEVSRPKLHRSGVFSKTDADTGLSNFRTSQTAWHDDSTHPLIARLSQKASAISNLTLETV 409

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           E LQV+NYGIGG YEPH+DF +  E N F S    NRVAT + Y+S++  GG TV+ ++ 
Sbjct: 410 EHLQVLNYGIGGLYEPHWDFVQGEERNEF-SESDRNRVATFICYLSELEAGGYTVYPTVG 468

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            ++ P K + A W+NL  +G GDY T HAACP+L G
Sbjct: 469 AAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILYG 504


>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
          Length = 546

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 108/257 (42%), Positives = 146/257 (56%), Gaps = 7/257 (2%)

Query: 12  NKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYV 71
           N L  +  LN+S  + +  P      P   V E + Y + C G   + P     L+C YV
Sbjct: 256 NALSEKALLNESKPILEHAPIPEEGEP---VGEFQAYSLTCSGHWRLTPKEQRHLRCGYV 312

Query: 72  HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE 131
               P+L + PLK EE +  P ++LY DV+Y SEID+I+K+ + RL RAT+ ++   E  
Sbjct: 313 TETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSEIDVIRKLTENRLMRATITSH--NESV 370

Query: 132 IANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--F 189
           ++N R S+  ++    H V+  I +RV  MT L    AE+ Q  NYGIGGHY  H D  +
Sbjct: 371 VSNVRTSQFTFIPVTAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430

Query: 190 ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSG 249
               +A    S   GNR+A VLFY+SDVAQGG T F  L   L P+K  AAFWHNLH+SG
Sbjct: 431 QTTFDAGLVSSPEMGNRIAAVLFYLSDVAQGGGTAFPQLRTLLKPKKYAAAFWHNLHASG 490

Query: 250 DGDYYTRHAACPVLTGS 266
            GD  T+H ACP++ GS
Sbjct: 491 VGDVRTQHGACPIIAGS 507


>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
 gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
          Length = 534

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 97/223 (43%), Positives = 132/223 (59%), Gaps = 3/223 (1%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E   YE +CRG++    A    L+CRY      Y  L PLK EE  L P ++ + D++  
Sbjct: 280 EFAHYEKVCRGEVGASAAQQRPLRCRYTRGEHAYRLLAPLKLEEHSLDPLVVTFHDMLSQ 339

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
             I  +++MA P ++R+TV     G+   + +R+SK+AWL    HP + R+ R V   TG
Sbjct: 340 HRIAELREMAVPHMQRSTVNPLPGGQRRKSAFRVSKNAWLPYSTHPTMGRMLRDVSDATG 399

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
           L  +  E+LQV NYG+GGHYEPH+DF R            GNR+AT +FY+SDV QGGAT
Sbjct: 400 LDMTFCEQLQVANYGVGGHYEPHWDFFRDSR---HYPAAEGNRIATAIFYLSDVEQGGAT 456

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            F  LN ++ P+ G   FW+NLH S D D+ T+HA CPVL GS
Sbjct: 457 AFPFLNFAVRPQLGNILFWYNLHRSSDMDFRTKHAGCPVLKGS 499


>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
 gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
          Length = 549

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 108/271 (39%), Positives = 159/271 (58%), Gaps = 14/271 (5%)

Query: 4   PTHQRAQGNKLYYQ------EALNKSPELKDEPPKVNNVAPTLEVTE-REKYEMLCRGDL 56
           P H+ A  NK+ Y+       +++ SP +K +PP+     P  E+ E +E Y+ +CRG+L
Sbjct: 251 PGHEEAVKNKIVYEALLARERSISSSPRMKLDPPQEAAPEPEPELKESQELYQRVCRGEL 310

Query: 57  TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
              P     L+C   H++VPY RL P K E+    P +  + DV+ D E + I +  + +
Sbjct: 311 RQSPKEQRYLRCWLSHQDVPYQRLSPFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQ 370

Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
           + R+ +   +TG   ++  R S++ WL    +P +  I +R+E +TGL+T TAE LQ+VN
Sbjct: 371 VTRSEIG--QTGNSTVSEIRTSQNTWLWYENNPWLADIKQRLEDITGLSTDTAEPLQLVN 428

Query: 177 YGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
           YGIGG YEPH+DF    E N     G  GNR+ T LFY++DV  GGAT F  L+L++ P 
Sbjct: 429 YGIGGQYEPHFDFMDDAEKN----FGWKGNRLLTALFYLNDVPLGGATAFPFLHLAVPPV 484

Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           KG+   W+NLH S   D+ T+HA CPVL GS
Sbjct: 485 KGSLLVWYNLHRSLHKDFRTKHAGCPVLKGS 515


>gi|443697959|gb|ELT98193.1| hypothetical protein CAPTEDRAFT_162820 [Capitella teleta]
          Length = 347

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 109/298 (36%), Positives = 166/298 (55%), Gaps = 24/298 (8%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE------------VTEREKYE 49
           I P H+RA  N+ YY+  + ++ + + +  K +N AP ++                  YE
Sbjct: 48  IEPGHERAIANRRYYERIIAEADDAERQKLKGDNGAPVVDGKPHRFLTDYTGSKSYSDYE 107

Query: 50  MLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI 109
            LCRG+ T       +L CRY  R  P   + PLKEE     P I +Y DV+ DS+  +I
Sbjct: 108 KLCRGEETHKRPFKHRLVCRY-QRYHPIFYISPLKEEMLNFDPAIYVYHDVLTDSQNAII 166

Query: 110 KKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           K++++P+L R+ V +    +  ++N+R S++AW  +  HP+I R+S++   ++ LT  T 
Sbjct: 167 KEVSRPKLHRSGVFSKTDADTGLSNFRTSQTAWHDDSTHPLIARLSQKASAISNLTLETV 226

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           E LQV+NYGIGG YEPH+DF +  E N F S    NRVAT + Y+S++  GG TV+ ++ 
Sbjct: 227 EHLQVLNYGIGGLYEPHWDFVQGEERNEF-SESDRNRVATFICYLSELEAGGYTVYPTVG 285

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
            ++ P K + A W+NL  +G GDY T HAACP+L G   + +            PCGL
Sbjct: 286 AAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILYGYKWVANKWFHEGGQEFVRPCGL 343


>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
 gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
          Length = 581

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 109/271 (40%), Positives = 158/271 (58%), Gaps = 9/271 (3%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTE-REKYEMLCRGDLTVPPA- 61
           P H+ A+     Y++ LN S + K      ++     +  +    Y+ LCRG++      
Sbjct: 260 PEHKTAKKYLNIYEKRLNTSTKEKSTEDLDDDNDDEKDFKQIFNSYKELCRGNVNQKTGD 319

Query: 62  ---IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
              +  QL C   +RN P L   PL  E   LQP I++Y +++ +SE+ L+K +A P L+
Sbjct: 320 DVKLNNQLNCYQDYRN-PRLLFSPLNVEVLSLQPYIVIYHNLLTNSEVVLLKTLASPLLK 378

Query: 119 RATVQNYKTGEL-EIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNY 177
           RA V      E  E   YRISK+AWL + +HP ++RI+  +  + GLT+ TAE LQ+ NY
Sbjct: 379 RAVVVGKPDKEYGEETTYRISKTAWLDKEDHPAVKRITTLIGDIIGLTSETAEPLQIANY 438

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGT--GNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
           GIGGHYEPH DF    +  A     +  GNR+ATVL Y+S+V  GGATVF    + + P 
Sbjct: 439 GIGGHYEPHLDFIESEDKEALSEYTSRIGNRIATVLIYLSNVEAGGATVFPKAGVRVEPR 498

Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +G+AAFW+N+H +G+G+  + HAACPVL GS
Sbjct: 499 QGSAAFWYNMHRNGEGNKLSVHAACPVLIGS 529


>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
 gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
          Length = 526

 Score =  194 bits (494), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 114/270 (42%), Positives = 161/270 (59%), Gaps = 8/270 (2%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELK-DEPPKVNNVAPT-----LEVTEREKYEMLCR-GDL 56
           P     Q N    +  + K P L    PP   N         L   E  +Y  LCR    
Sbjct: 232 PNDTELQSNIRKLKHLIAKQPHLNVTSPPNTANRIEDDGDDELSREEMAEYTRLCRPNSQ 291

Query: 57  TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
           T  P+   QL C Y++++ P L+L P+  E   + P+I L+ +V+ + EI+ + ++A+PR
Sbjct: 292 TRLPSSNKQLTCSYLNKH-PGLKLKPVAMEIVSVNPQITLFHNVLSEMEIEQMLELARPR 350

Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
           LRRA V N +TGE+E  +YRIS+ AWL + +  ++ RI+RRV  +TGL T+T E LQV N
Sbjct: 351 LRRARVNNLETGEIEDVDYRISQIAWLSDSDGDIVRRINRRVGFITGLNTNTGECLQVNN 410

Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
           YG+GGHYEPH+D +   E +   SLG GNR+AT +FY+S+V  GG+TVF    +   P K
Sbjct: 411 YGVGGHYEPHFDHSLDMENSPIASLGQGNRIATFMFYLSEVEAGGSTVFIKTGVKTNPFK 470

Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           G A FW+NL  SG+GD+ + HA CPVL G+
Sbjct: 471 GGAVFWYNLKKSGEGDWDSLHAGCPVLIGN 500


>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
 gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
          Length = 538

 Score =  193 bits (490), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 96/223 (43%), Positives = 136/223 (60%), Gaps = 3/223 (1%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E   YE +CRG+++   A    L+CRY      Y  L PLK EE  L P ++ Y D++  
Sbjct: 284 EFAHYEKVCRGEVSASAAQQRPLRCRYARGQHAYRVLAPLKLEEHSLDPLVVSYHDMLSP 343

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I  +++MA P ++R+TV      + + + +R+SK+AWL    HP++ R+ R +   TG
Sbjct: 344 QQIIELRQMAVPHMKRSTVNPLPGRQSKKSAFRVSKNAWLEYDTHPMMGRMLRDLSDATG 403

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
           L  +  E+LQV NYG+GGHYEPH+DF    +    +    GNR+AT +FY+SDV QGGAT
Sbjct: 404 LDMTYCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEE---GNRIATAIFYLSDVEQGGAT 460

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            F  LN ++ P+ G   FW+NLH S D DY T+HA CPVL GS
Sbjct: 461 AFPFLNFAVRPQLGNILFWYNLHRSLDMDYRTKHAGCPVLKGS 503


>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
          Length = 523

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 100/226 (44%), Positives = 140/226 (61%), Gaps = 8/226 (3%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           +  LCRG+  +   ++A+LKC Y  R+  Y  LMP+K E+   +P I  + DV+ D EI+
Sbjct: 275 FNALCRGERLLNDKLLAELKCWYDTRHQFYFLLMPIKIEQHSFEPAIYTFHDVLSDEEIE 334

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT- 166
            IK++A+P L R+ VQ       E++N R SK+AWL E  HP++ R+SRR+  +TGL T 
Sbjct: 335 TIKELAKPLLARSMVQGKLGVGHEVSNVRTSKTAWLPEGLHPLLNRLSRRIGLITGLKTD 394

Query: 167 ---STAEELQVVNYGIGGHYEPHYDFARPGEANA----FKSLGTGNRVATVLFYMSDVAQ 219
                AE LQV NYGIGGHY PH+D+    +A+      + L  G+R+AT +FY++DV +
Sbjct: 395 PIRDEAELLQVANYGIGGHYSPHHDYLMKDKADFEYMHHRELQAGDRIATFMFYLNDVER 454

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           GG+T F    +++ P KG AAFW NL  SG  D  T H ACPVL G
Sbjct: 455 GGSTAFPRAGVAVKPVKGGAAFWFNLKRSGKPDPLTLHGACPVLLG 500


>gi|442747045|gb|JAA65682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 538

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 108/269 (40%), Positives = 156/269 (57%), Gaps = 16/269 (5%)

Query: 9   AQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKC 68
           +Q +++  Q+ + +S E K +      V P  E  E + Y+ LCRG+L   P + +QL+C
Sbjct: 246 SQTHEVAVQDRIEQSAESKAQ--LFQEVTP--EDQEDQSYKRLCRGELLRSPKMDSQLRC 301

Query: 69  RYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTG 128
           RY      +  L P+K EE  L+P II+  DV+ D +I  +   A+PRL R+T   Y   
Sbjct: 302 RYYKGQDGFFSLQPIKLEEINLKPYIIVMHDVVQDKDIKDLMAYAEPRLERSTT--YTGS 359

Query: 129 ELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST----AEELQVVNYGIGGHYE 184
           E+  +  R S +AWL E E P+  R++  +  + G+ TS     AE  Q+ NYG GG + 
Sbjct: 360 EMVPSPVRTSSTAWLNEDEAPIAVRMNSYLRALLGMGTSDTNEEAEAYQLANYGTGGQFL 419

Query: 185 PHYDFARPG------EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           PH+DF +         A+ +   GTG+R+AT++ YM+DV +GGATVF SL + L P+KG 
Sbjct: 420 PHHDFLQDSLHSYNSSADYYLQYGTGDRLATLMIYMTDVEEGGATVFPSLGIRLTPKKGD 479

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           AAFW NL +SG+GD  T HA CPVL GS 
Sbjct: 480 AAFWWNLKASGEGDRLTTHAGCPVLYGSK 508


>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
 gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
          Length = 534

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 97/223 (43%), Positives = 136/223 (60%), Gaps = 3/223 (1%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E   YE +CRG++   P     L+CRY   + PY  L PLK EE  L P ++ Y D++  
Sbjct: 280 EFAHYEKVCRGEVGPSPRQERPLRCRYSLGSHPYRHLAPLKLEEHSLDPFVVTYHDMLSP 339

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I  ++ MA PR+ R+TV     G+ + +++R+SK+AWL    HP +  +   +   TG
Sbjct: 340 RKIADLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWLAYDSHPTMGGMLSDLSDATG 399

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
           L  +  E+LQV NYG+GGHYEPH+DF R  +    +    GNR+AT +FY+SDV QGGAT
Sbjct: 400 LDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEE---GNRMATAIFYLSDVEQGGAT 456

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            F  LN ++ P+ G   FW+N+H S D DY T+HA CPVL GS
Sbjct: 457 AFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKGS 499


>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
 gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
          Length = 534

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 97/223 (43%), Positives = 136/223 (60%), Gaps = 3/223 (1%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E   YE +CRG++   P     L+CRY   + PY  L PLK EE  L P ++ Y D++  
Sbjct: 280 EFAHYEKVCRGEVGPSPRQERPLRCRYSLGSHPYRHLAPLKLEEHSLDPFVVTYHDMLSP 339

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I  ++ MA PR+ R+TV     G+ + +++R+SK+AWL    HP +  +   +   TG
Sbjct: 340 RKIADLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWLAYDSHPTMGGMLSDLSDATG 399

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
           L  +  E+LQV NYG+GGHYEPH+DF R  +    +    GNR+AT +FY+SDV QGGAT
Sbjct: 400 LDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEE---GNRMATAIFYLSDVEQGGAT 456

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            F  LN ++ P+ G   FW+N+H S D DY T+HA CPVL GS
Sbjct: 457 AFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKGS 499


>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
 gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
          Length = 525

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 94/219 (42%), Positives = 137/219 (62%), Gaps = 4/219 (1%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           Y+M CRG    PP+  ++L C Y     P+L L PLK E   L P ++LY DV+   EI 
Sbjct: 287 YQMGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIT 344

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
            ++ MA P L+RATV    +G  E+   R SK AW  +  +P+  R++ R+  MTG    
Sbjct: 345 ELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLY 404

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
            +E LQ++NYG+GGHY+ HYDF    + N+  +  +G+R+ATVLFY++DV QGGATVF +
Sbjct: 405 GSEMLQLMNYGLGGHYDQHYDFF--NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN 462

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +  +++P++G+   W+NL  +G  D  T HAACPV+ GS
Sbjct: 463 IRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGS 501


>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
          Length = 316

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 93/219 (42%), Positives = 137/219 (62%), Gaps = 4/219 (1%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           Y++ CRG    PP+  ++L C Y     P+L L PLK E   L P ++LY DV+   EI 
Sbjct: 78  YQIGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIK 135

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
            ++ MA P L+RATV    +G  E+   R SK AW  +  +P+  R++ R+  MTG    
Sbjct: 136 ELQGMATPSLKRATVYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLY 195

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
            +E LQ++NYG+GGHY+ HYDF    + N+  +  +G+R+ATVLFY++DV QGGATVF +
Sbjct: 196 GSEMLQLMNYGLGGHYDQHYDFF--NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN 253

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +  +++P++G+   W+NL  +G  D  T HAACPV+ GS
Sbjct: 254 IRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGS 292


>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
 gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
          Length = 528

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 102/238 (42%), Positives = 141/238 (59%), Gaps = 4/238 (1%)

Query: 31  PKVNNVAPTLEVTERE--KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEA 88
           P    +APTL+ T +   +YE  CRG L   P+   +L C Y   N  +LRL PLK E  
Sbjct: 268 PNSIGIAPTLKSTAQPLGEYERGCRG-LFPSPSKDGRLHCVYNSTNSAFLRLAPLKMELV 326

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
            L P ++LY DV+   EI  ++ MA P L+RATV        E+   R SK AW  +  +
Sbjct: 327 GLDPYMVLYHDVISAPEISQLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAWFPDTFN 386

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
            + ER++RR+  MT      +E LQ +NYG+GGHY+ HYDF     A     +  G+R+A
Sbjct: 387 ELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTATNLTQM-NGDRIA 445

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           TVLFY++DV QGGATVF ++  +++P++G+A  W+NL   GD +  T HAACPVL GS
Sbjct: 446 TVLFYLTDVEQGGATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGS 503


>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
 gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
          Length = 545

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 107/268 (39%), Positives = 152/268 (56%), Gaps = 13/268 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
           P  +R   N   +Q   ++   +++ PP   N+    E++E + Y   C G +   PA +
Sbjct: 245 PGSERYINNYKDFQPPSDELNPVEEHPPLPENLT---ELSEFDLYRYTCNGHIKPTPAEL 301

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
            QL+C Y+    P+L L PLK EE    P ++LY DV+Y SEID + K+ + ++ RATV 
Sbjct: 302 RQLRCGYMTETHPFLLLAPLKVEELSHDPLLVLYHDVIYQSEIDTLAKLTKNKIHRATVT 361

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
                   ++N R S+  ++ +  H V+  I +RV  MT L    AE+ Q+ NYGIGGHY
Sbjct: 362 GNNASV--VSNARTSQFTFIPKTRHKVLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHY 419

Query: 184 EPHYDFARPGEANAFKSLGT-----GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
             H D+  P   NAF++        GNR+ATVLFY++DV QGG T F  L   L P+K  
Sbjct: 420 AQHMDWFSP---NAFETKQVANSEMGNRIATVLFYLTDVEQGGGTAFPVLKQLLKPKKYA 476

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           AAFW+NLH+SG GD  T H ACP++ GS
Sbjct: 477 AAFWYNLHASGAGDVRTMHGACPIIVGS 504


>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
 gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score =  190 bits (483), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 105/256 (41%), Positives = 144/256 (56%), Gaps = 13/256 (5%)

Query: 14  LYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHR 73
           LY  + + +   + ++P K++         E E Y + C G   +       L+C Y+  
Sbjct: 259 LYESKTIEEHAPIPEDPSKLD---------EFEAYRLTCSGHSRLTAREERHLRCGYMTE 309

Query: 74  NVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIA 133
             P+L L PLK EE    P ++LY DV+Y SEID+I+++   R+ RA V    T +  ++
Sbjct: 310 THPFLLLAPLKAEELSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMVT--LTNQSTVS 367

Query: 134 NYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARP 192
           N R S+  ++ + EH V++ I RRV  MT L    AE+ Q  NYGIGGHY  H D F   
Sbjct: 368 NVRTSQITFIAKTEHEVLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWFTET 427

Query: 193 GEANAF-KSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDG 251
              N    S   GNR+ATVLFY+SDVAQGG T F  L   L P+K  AAFWHNLH++G G
Sbjct: 428 TFDNGLVSSTEMGNRIATVLFYLSDVAQGGGTAFPYLKQHLRPKKYAAAFWHNLHAAGRG 487

Query: 252 DYYTRHAACPVLTGSN 267
           D  T+H ACP++ GS 
Sbjct: 488 DARTQHGACPIIAGSK 503


>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
 gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
          Length = 493

 Score =  190 bits (483), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 102/238 (42%), Positives = 141/238 (59%), Gaps = 4/238 (1%)

Query: 31  PKVNNVAPTLEVTERE--KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEA 88
           P    +APTL+ T +   +YE  CRG L   P+   +L C Y   N  +LRL PLK E  
Sbjct: 233 PNSIGIAPTLKSTAQPLGEYERGCRG-LFPSPSKDGRLHCVYNSTNSAFLRLAPLKMELV 291

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
            L P ++LY DV+   EI  ++ MA P L+RATV        E+   R SK AW  +  +
Sbjct: 292 GLDPYMVLYHDVISALEISQLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAWFPDTFN 351

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
            + ER++RR+  MT      +E LQ +NYG+GGHY+ HYDF     A     +  G+R+A
Sbjct: 352 ELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTAANLTQM-NGDRIA 410

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           TVLFY++DV QGGATVF ++  +++P++G+A  W+NL   GD +  T HAACPVL GS
Sbjct: 411 TVLFYLTDVEQGGATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGS 468


>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
 gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
          Length = 525

 Score =  190 bits (483), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 93/219 (42%), Positives = 137/219 (62%), Gaps = 4/219 (1%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           Y++ CRG    PP+  ++L C Y     P+L L PLK E   L+P ++LY DV+   EI 
Sbjct: 287 YQVGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLEPYMVLYHDVLSPKEIT 344

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
            ++ MA P L+RATV    +G  E+   R SK AW  +  +P+  R++ R+  MTG    
Sbjct: 345 ELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLY 404

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
            +E LQ++NYG+GGHY+ HYDF      N+  +  +G+R+ATVLFY++DV QGGATVF +
Sbjct: 405 GSEMLQLMNYGLGGHYDQHYDFF--NNTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN 462

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +  +++P++G+   W+NL  +G  D  T HAACPV+ GS
Sbjct: 463 IRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGS 501


>gi|195341548|ref|XP_002037368.1| GM12149 [Drosophila sechellia]
 gi|194131484|gb|EDW53527.1| GM12149 [Drosophila sechellia]
          Length = 537

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 111/288 (38%), Positives = 156/288 (54%), Gaps = 24/288 (8%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREK-YEMLCRGDLTVPPAI 62
           P H+ A  NK+ Y+  L +        P+     P  E+ E  K Y  +CRG+L   P  
Sbjct: 247 PDHEDALKNKILYEGQLARERSF---VPREQAELPQKELKESYKLYTQVCRGELHQSPRE 303

Query: 63  VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
              L+C   H+ V Y  L P K E+  + P +    +V++DSEID I +  +  + R+ V
Sbjct: 304 QRNLRCWLSHQGVLYYHLSPFKIEQLNIDPYVAYVHEVLWDSEIDTIIEHGKGNMERSKV 363

Query: 123 ---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
              +N  T E+     RIS++ WL    +P + +I +R+E +TGL+T +AE LQ+VNYGI
Sbjct: 364 GQIENSTTTEV-----RISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGI 418

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GG YEPH+DF        F     GNR+ T LFY++DVA GGAT F  L L++ P KG+ 
Sbjct: 419 GGQYEPHFDFVEDDGKTVFS--WKGNRLLTALFYLNDVALGGATAFPFLRLAVPPVKGSL 476

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
             W+NLHSS   D+ T+HA CPVL GS  + +            PCGL
Sbjct: 477 LIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVAAQEFRRPCGL 524


>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
 gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
           melanogaster]
 gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
          Length = 525

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 93/219 (42%), Positives = 137/219 (62%), Gaps = 4/219 (1%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           Y++ CRG    PP+  ++L C Y     P+L L PLK E   L P ++LY DV+   EI 
Sbjct: 287 YQIGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIK 344

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
            ++ MA P L+RATV    +G  E+   R SK AW  +  +P+  R++ R+  MTG    
Sbjct: 345 ELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLY 404

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
            +E LQ++NYG+GGHY+ HYDF    + N+  +  +G+R+ATVLFY++DV QGGATVF +
Sbjct: 405 GSEMLQLMNYGLGGHYDQHYDFF--NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN 462

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +  +++P++G+   W+NL  +G  D  T HAACPV+ GS
Sbjct: 463 IRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGS 501


>gi|444517246|gb|ELV11441.1| Prolyl 4-hydroxylase subunit alpha-2 [Tupaia chinensis]
          Length = 466

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 106/243 (43%), Positives = 145/243 (59%), Gaps = 32/243 (13%)

Query: 4   PTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVA----PTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L     K P  + E            P   + ER+ YE LCRG+
Sbjct: 238 PSHERAGGNLRYFERLLEEEREKMPSNQTEAELATQEGIYERPVDYLPERDVYESLCRGE 297

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N  P L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 298 GVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 357

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 358 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYG+GG YEPH+DF+R                      MSDV  GGATVF  L  ++W
Sbjct: 418 VANYGMGGQYEPHFDFSR----------------------MSDVEAGGATVFPDLGAAIW 455

Query: 234 PEK 236
           P+K
Sbjct: 456 PKK 458


>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
 gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
          Length = 513

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 97/220 (44%), Positives = 134/220 (60%), Gaps = 6/220 (2%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
           KYE  CRG     PA  ++L C Y   N  +LRL PLK E   L P ++LY D +   EI
Sbjct: 278 KYEKGCRGQ--YAPATSSRLHCVYNSTNSAFLRLAPLKMELLQLDPYMVLYHDAISPREI 335

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
           + ++ +A PRL+RA V +  T    +   R SK  WL +  +    R+++R+E M+G T 
Sbjct: 336 EDLQFLAMPRLKRAKVVDQVTHRNMMVKERTSKVTWLGDATNAFTMRLNKRIEDMSGFTM 395

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
             +E LQV+NYG+GGHY  HYDF         K+   G+R+ATV+FY+SDV QGGATVF 
Sbjct: 396 YGSEMLQVMNYGLGGHYASHYDFLNATS----KTRLNGDRIATVMFYLSDVEQGGATVFP 451

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            +  +++P++GTA  W+NL  +GD D  T HAACPV+ GS
Sbjct: 452 KIQKAVFPQRGTAIIWYNLKENGDFDTNTIHAACPVIVGS 491


>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
 gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
          Length = 525

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 106/251 (42%), Positives = 147/251 (58%), Gaps = 7/251 (2%)

Query: 19  ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYL 78
           +LN++  +++ PP +     T  +++   Y + C G     P     L+C Y+    P+L
Sbjct: 232 SLNETKAVEEHPP-IPKEGDT--ISDFHGYMLTCSGHFRPTPREQRDLRCGYMDETHPFL 288

Query: 79  RLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRIS 138
            + PLK EE    P +ILY DV+Y SEID I+K+   +L+RAT+ +  T E  ++N R S
Sbjct: 289 WIAPLKAEELSRDPLLILYHDVIYQSEIDTIRKLTTNKLKRATITS--TNESVVSNVRTS 346

Query: 139 KSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPG-EAN 196
           +  +L   E  V+  I RRV  MT      AE+ Q  NYGIGGHY  H D F +P  +A 
Sbjct: 347 QFTFLPVTEDKVLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQHMDWFYQPSFDAG 406

Query: 197 AFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTR 256
              S   GNR+ATVLFY+SDV QGG T F  L + L P+K  AAFW+NLH+SG GD  T+
Sbjct: 407 LVSSPEMGNRIATVLFYLSDVTQGGGTAFPHLRVLLKPKKYAAAFWYNLHASGVGDPRTQ 466

Query: 257 HAACPVLTGSN 267
           H ACP+++GS 
Sbjct: 467 HGACPIISGSK 477


>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
 gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
          Length = 525

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 93/219 (42%), Positives = 133/219 (60%), Gaps = 4/219 (1%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           Y+M CRG    PP+   +L C Y      +L L PLK E   L P ++LY DV+   EI 
Sbjct: 287 YQMGCRGQF--PPSADGKLYCLYNRTTSAFLMLAPLKMELVGLDPYMVLYHDVLSAKEIK 344

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
            ++ MA P L RATV    +G  E+   R SK AW  +  +P+  R++ R+  MTG    
Sbjct: 345 ELQGMATPGLTRATVFQASSGRNEVVKTRTSKVAWFPDSYNPLTVRLNARIADMTGFNLY 404

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
            +E LQ++NYG+GGHY+ HYDF     +N   +  +G+R+ATVLFY++DV QGGATVF +
Sbjct: 405 GSEMLQLMNYGLGGHYDQHYDFFNTINSNL--TAMSGDRIATVLFYLTDVEQGGATVFPN 462

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +  +++P++G+   W+NL  +G  D  T HAACPV+ GS
Sbjct: 463 IRKAVFPQRGSVIMWYNLQDNGQTDNKTLHAACPVIVGS 501


>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
 gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
          Length = 549

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 97/228 (42%), Positives = 138/228 (60%), Gaps = 4/228 (1%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
           ++++ E Y   C G +   P+ + QL+C Y+    P+L L PLK EE    P ++L+ DV
Sbjct: 283 KLSDFELYRHTCNGHIRPTPSELRQLRCGYMTETHPFLLLAPLKVEELSHDPLLVLFHDV 342

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
           +Y SEID + ++A+ ++ RATV  + +    ++N R S+  +L +  H V+  I +RV  
Sbjct: 343 IYQSEIDTLMRLAKNKIHRATVTGHNSSV--VSNARTSQFTFLPKTRHKVLRTIDQRVAD 400

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARP--GEANAFKSLGTGNRVATVLFYMSDVA 218
           MT L    AE+ Q+ NYGIGGHY  H D+  P   E     +   GNR+ TVLFY+SDV 
Sbjct: 401 MTDLHLEYAEDHQLANYGIGGHYAQHMDWFYPITFETKQVSNPEMGNRIGTVLFYLSDVE 460

Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           QGGAT F +L   L P+K  AAFW+NLH+SG GD  T H ACP++ GS
Sbjct: 461 QGGATAFPALKQLLRPKKHAAAFWYNLHASGVGDARTMHGACPIIVGS 508


>gi|326914688|ref|XP_003203656.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Meleagris
           gallopavo]
          Length = 539

 Score =  187 bits (474), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 109/266 (40%), Positives = 153/266 (57%), Gaps = 9/266 (3%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRG-DLTVPPAI 62
           P++QR   N   Y++ L    +    P +  NV    ++  R+ YE LC+G    + P  
Sbjct: 255 PSNQRVTRNVAKYEKLLATHGDRVGRPLQRPNVT---QLQNRDAYEELCQGLGAQMAPEQ 311

Query: 63  VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
            +QL C Y     PYL L P K+E   LQP I+LY D + D+E + IK +A P L+R+ V
Sbjct: 312 PSQLGCSYETNGSPYLLLQPAKKETLRLQPYIVLYHDFVSDAEAETIKGLAGPWLQRSVV 371

Query: 123 QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYGIG 180
            + +  + +   YRISKSAWL++   PV+  +  R+  +TGL      AE LQVVNYG+G
Sbjct: 372 ASGE--KQQKVEYRISKSAWLKDTADPVVRALELRMAAITGLDLRPPYAEYLQVVNYGLG 429

Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
           GHYEPH+D A   ++  ++ + +GNR+ATV+ Y+S V  GG+T F   N S+   K  A 
Sbjct: 430 GHYEPHFDHATSRKSPLYR-MKSGNRIATVMIYLSAVEAGGSTAFIYANFSVPVVKNAAL 488

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGS 266
           FW NL  +GDGD  T HA CPVL G 
Sbjct: 489 FWWNLRRNGDGDGDTLHAGCPVLAGD 514


>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
 gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
          Length = 528

 Score =  186 bits (473), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 98/237 (41%), Positives = 139/237 (58%), Gaps = 7/237 (2%)

Query: 30  PPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAY 89
           P KV   A T    E   Y+M CRG     P+  ++L C Y     P+L L PLK E   
Sbjct: 275 PVKVQAQAQT---AEPSAYQMGCRGQFA--PSADSKLHCLYNRTTSPFLMLAPLKMELVG 329

Query: 90  LQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHP 149
           L P ++LY DV+   EI  ++ MA P L+RATV    +G  E+   R SK AW  +   P
Sbjct: 330 LDPYMVLYHDVLSAKEIKELQGMATPGLKRATVFQAASGRNEVVRTRTSKVAWFPDGYSP 389

Query: 150 VIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVAT 209
           +  R++ R+  MTG     +E LQ++NYG+GGHY+ HYD+     +N   +  +G+R+AT
Sbjct: 390 LTVRLNARITDMTGFNLHGSEMLQLMNYGLGGHYDQHYDYFNTINSNL--TAMSGDRIAT 447

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           VLFY++DV QGGATVF ++  +++P++G+   W+NL   G  D  T HAACPV+ GS
Sbjct: 448 VLFYLTDVEQGGATVFPNIRKAVFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGS 504


>gi|148701599|gb|EDL33546.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_d [Mus
           musculus]
          Length = 545

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 102/222 (45%), Positives = 144/222 (64%), Gaps = 12/222 (5%)

Query: 4   PTHQRAQGNKLYYQEALNK------SPELKDEPPKVNNV--APTLEVTEREKYEMLCRGD 55
           P+H+RA GN  Y++  L +      S +         N+   PT  + ER+ YE LCRG+
Sbjct: 313 PSHERAGGNLRYFERLLEEERGKSLSNQTDAGLATQENLYERPTDYLPERDVYESLCRGE 372

Query: 56  -LTVPPAIVAQLKCRYVHRN-VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMA 113
            + + P    +L CRY H N VP L + P KEE+ +  P I+ Y DVM D EI+ IK++A
Sbjct: 373 GVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIA 432

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           +P+L RATV++ KTG L +A+YR+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQ
Sbjct: 433 KPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQ 492

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
           V NYG+GG YEPH+DF+R    +  K+   GNR+AT L Y+S
Sbjct: 493 VANYGMGGQYEPHFDFSRRPFDSGLKT--EGNRLATFLNYVS 532


>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
          Length = 478

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 113/272 (41%), Positives = 158/272 (58%), Gaps = 19/272 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y++ L +SP     E   + P V    P L+   R+ YE LC+   + 
Sbjct: 192 PDNKRVARNVLKYEKLLAESPNQAVAETVMQRPNV----PHLQT--RDTYEGLCQTLGSQ 245

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y   + PYL L P+++E  +L+P ++LY D + D+E   I+ +A+P L
Sbjct: 246 PTHYRIPSLYCSYETSSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWL 305

Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
           +R+ V    +GE ++   YRISKSAWL++   PV+  +  R+  +TGL      AE LQV
Sbjct: 306 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQV 362

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+  
Sbjct: 363 VNYGIGGHYEPHFDHATSPSSPLYR-MNSGNRVATFMIYLSSVEAGGATAFIYGNFSVPV 421

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            K  A FW NLH SG+GD  T HAACPVL G 
Sbjct: 422 VKNAALFWWNLHRSGEGDGDTLHAACPVLVGD 453


>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
 gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
 gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
          Length = 544

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 113/273 (41%), Positives = 158/273 (57%), Gaps = 19/273 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y++ L +SP     E   + P V    P L+   R+ YE LC+   + 
Sbjct: 258 PDNKRVARNVLKYEKLLAESPNQAVAETVMQRPNV----PHLQT--RDTYEGLCQTLGSQ 311

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y   + PYL L P+++E  +L+P ++LY D + D+E   I+ +A+P L
Sbjct: 312 PTHYRIPSLYCSYETSSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWL 371

Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
           +R+ V    +GE ++   YRISKSAWL++   PV+  +  R+  +TGL      AE LQV
Sbjct: 372 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQV 428

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+  
Sbjct: 429 VNYGIGGHYEPHFDHATSPSSPLYR-MNSGNRVATFMIYLSSVEAGGATAFIYGNFSVPV 487

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            K  A FW NLH SG+GD  T HAACPVL G  
Sbjct: 488 VKNAALFWWNLHRSGEGDGDTLHAACPVLVGDK 520


>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
           aries]
          Length = 514

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 113/273 (41%), Positives = 158/273 (57%), Gaps = 19/273 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y++ L +SP     E   + P V    P L+   R+ YE LC+   + 
Sbjct: 228 PDNKRVARNVLKYEKLLAESPNQAVAETVMQRPNV----PHLQT--RDTYEGLCQTLGSQ 281

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y   + PYL L P+++E  +L+P ++LY D + D+E   I+ +A+P L
Sbjct: 282 PTHYQIPSLYCSYETSSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQKIRGLAEPWL 341

Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
           +R+ V    +GE ++   YRISKSAWL++   PV+  +  R+  +TGL      AE LQV
Sbjct: 342 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQV 398

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+  
Sbjct: 399 VNYGIGGHYEPHFDHATSPSSPLYR-MNSGNRVATFMIYLSSVEAGGATAFIYGNFSVPV 457

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            K  A FW NLH SG+GD  T HAACPVL G  
Sbjct: 458 VKNAALFWWNLHRSGEGDGDTLHAACPVLVGDK 490


>gi|363729586|ref|XP_417248.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Gallus gallus]
          Length = 542

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 110/266 (41%), Positives = 153/266 (57%), Gaps = 11/266 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRG-DLTVPPAI 62
           P++QR   N   Y++ L    +    P +  NV    ++  R+ YE LC+G    + P  
Sbjct: 258 PSNQRVTRNVAKYEKLLATHGDRVGAPLQRPNVT---QLQNRDAYEELCQGLGAQMAPER 314

Query: 63  VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
            + L C Y     PYL L P K+E   LQP I+LY D + D+E + IK +A P L+R+ V
Sbjct: 315 PSHLGCSYETNGSPYLLLQPAKKETLRLQPYIVLYHDFVSDAEAETIKGLAGPWLQRSVV 374

Query: 123 QNYKTGE-LEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYGI 179
               +GE  +   YRISKSAWL++   PV++ +  R+  +TGL      AE LQVVNYG+
Sbjct: 375 ---ASGEKQQKVEYRISKSAWLKDTADPVVQALELRMAAITGLDLRPPYAEYLQVVNYGL 431

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GGHYEPH+D A   ++  ++ + +GNR+ATV+ Y+S V  GG+T F   N S+   K  A
Sbjct: 432 GGHYEPHFDHATSRKSPLYR-MKSGNRIATVMIYLSAVEAGGSTAFIYANFSVPVVKNAA 490

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTG 265
            FW NL  +GDGD  T HA CPVL G
Sbjct: 491 LFWWNLRRNGDGDGDTLHAGCPVLAG 516


>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
          Length = 483

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 112/272 (41%), Positives = 157/272 (57%), Gaps = 19/272 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y++ L +SP     E   + P V    P L+   R+ YE LC+   + 
Sbjct: 197 PDNKRMARNVLKYEKLLAESPTQAVVEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 250

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y   + PYL L P+++E  +L+P ++LY D + D E   I+ +A+P L
Sbjct: 251 PTHYQIPSLHCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDLEAQKIRGLAEPWL 310

Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
           +R+ V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQV
Sbjct: 311 QRSVV---ASGEKQLPVEYRISKSAWLKDTADPMLVTLDHRIAALTGLDVQPPYAEYLQV 367

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+  
Sbjct: 368 VNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 426

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            K  A FW NLH SG+GD  T HAACPVL G 
Sbjct: 427 VKNAALFWWNLHRSGEGDSDTLHAACPVLVGD 458


>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3, partial [Saimiri boliviensis boliviensis]
          Length = 534

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 114/273 (41%), Positives = 160/273 (58%), Gaps = 21/273 (7%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y+  L +SP     E   + P +    P L+   R+ YE LC+  L  
Sbjct: 248 PDNKRMARNVLKYERLLAESPNQVVAEAVIQRPNI----PHLQT--RDTYEGLCQ-TLGS 300

Query: 59  PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
            P +  +  L C Y   + PYL L P+++E  +L+P I LY D + DSE   I+++A+P 
Sbjct: 301 QPTLYQIPSLYCSYEINSNPYLLLQPIQKEVLHLEPYIALYHDFVSDSEAQKIRELAEPW 360

Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
           L+R+ V    +GE ++   YRISKSAWL++   P++  ++ R+  +TGL      AE LQ
Sbjct: 361 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQ 417

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           VVNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   NLS+ 
Sbjct: 418 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVP 476

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             K  A FW NLH SG+GD  T HA CPVL G+
Sbjct: 477 VVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGN 509


>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
 gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
          Length = 511

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 104/252 (41%), Positives = 140/252 (55%), Gaps = 20/252 (7%)

Query: 15  YYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRN 74
           Y    +   P LK  P    N+A          Y + CRG   VP +    L C Y  + 
Sbjct: 255 YVHNMIRNEPNLK--PVAKENIA---------SYSLGCRGQF-VPQS---NLHCEYKMKT 299

Query: 75  VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIAN 134
            P+LRL PLK E   L P I+++ D +   EID ++ +A+P L+R TV  +  G+     
Sbjct: 300 SPFLRLAPLKMEIVLLNPFIVVFHDALSPQEIDYLQNLARPLLKRTTV--HVNGKYVSRR 357

Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE 194
            R SK AWL    + +  RI RRV  MT L+   +E   ++NYG+GGHY  HYDF    +
Sbjct: 358 VRTSKGAWLERDLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAHYDFFNTTK 417

Query: 195 ANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYY 254
               ++  TG+R+ATVLFY+SDV QGGATVF +L L++ PE+G A FW+NL  +G GD  
Sbjct: 418 Q---QTSETGDRIATVLFYLSDVEQGGATVFPNLKLAVSPERGMALFWYNLLDNGTGDTR 474

Query: 255 TRHAACPVLTGS 266
           T H  CPVL GS
Sbjct: 475 TLHGGCPVLVGS 486


>gi|15808763|gb|AAL08488.1| prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
           volvulus]
          Length = 571

 Score =  184 bits (468), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 110/271 (40%), Positives = 157/271 (57%), Gaps = 9/271 (3%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSP----ELKDEPPKVNNVAPTLEVTEREK--YEMLCRGD 55
           I P H RA+ N   Y+  L  +     +L  +   +NN+    E  E  K  YE LCR +
Sbjct: 243 INPDHPRAKDNVKEYEYLLKNNEVQRIDLWRKTFPINNMRNDNEFDEGIKLIYEALCRRE 302

Query: 56  LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
           + V   + +QL C Y   + PYLRL P K E     P  +L+  ++ D +  +I+ +A P
Sbjct: 303 VPVNTKVQSQLYC-YYKTDRPYLRLAPFKVEIVRQNPLNVLFYGIISDEQARIIQMLAVP 361

Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
           +L  + + N  TG  E+ ++RI KSA LR  E+  ++RI +R+E  T L   TAE+L V+
Sbjct: 362 KLNGSRIYNDLTGSFELPSFRILKSARLRSTEYETVKRIDKRLELATNLEIETAEDLAVL 421

Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS-LNLSLWP 234
           NYGIGG +EPH+D A  G+   F+ LGTGNR+AT L Y+++   GG TVFTS L +S+  
Sbjct: 422 NYGIGGQFEPHFDCALKGD-QCFEKLGTGNRIATFLIYLTEPEIGGRTVFTSNLKISVPC 480

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            K  A FW+NL  +G+ D  + HAACPV TG
Sbjct: 481 VKNAALFWYNLMRNGEVDTRSLHAACPVATG 511


>gi|15808767|gb|AAL08490.1|AF369789_1 prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
           volvulus]
          Length = 571

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 110/271 (40%), Positives = 157/271 (57%), Gaps = 9/271 (3%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSP----ELKDEPPKVNNVAPTLEVTEREK--YEMLCRGD 55
           I P H RA+ N   Y+  L  +     +L  +   +NN+    E  E  K  YE LCR +
Sbjct: 243 INPDHPRAKDNVKEYEYLLKNNEVQRIDLWRKTFPINNMRNDNEFDEGIKLIYEALCRRE 302

Query: 56  LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
           + V   + +QL C Y   + PYLRL P K E     P  +L+  ++ D +  +I+ +A P
Sbjct: 303 VPVNTKVQSQLYC-YYKTDRPYLRLAPFKVEIVRQNPLNVLFYGIISDEQARIIEMLAVP 361

Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
           +L  + + N  TG  E+ ++RI KSA LR  E+  ++RI +R+E  T L   TAE+L V+
Sbjct: 362 KLNGSRIYNDLTGSFELPSFRILKSARLRSTEYETVKRIDKRLELATNLEIETAEDLAVL 421

Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS-LNLSLWP 234
           NYGIGG +EPH+D A  G+   F+ LGTGNR+AT L Y+++   GG TVFTS L +S+  
Sbjct: 422 NYGIGGQFEPHFDCALKGD-QCFEKLGTGNRIATFLIYLTEPEIGGRTVFTSNLKISVPC 480

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            K  A FW+NL  +G+ D  + HAACPV TG
Sbjct: 481 VKNAALFWYNLMRNGEVDTRSLHAACPVATG 511


>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
          Length = 522

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 104/266 (39%), Positives = 154/266 (57%), Gaps = 12/266 (4%)

Query: 7   QRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCR----GDLTVPPAI 62
           +R + N  YYQ  + K  EL   P             ER++YE LC+     + T+    
Sbjct: 233 ERIESNWRYYQGKV-KDSELDSFPEDYLERPSHYNPEERQRYEELCQLGYNNEHTIRDNN 291

Query: 63  VAQLKCRYV--HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
              L+C     H +  + +L P K EE   QP ++ + D++ D+EI+ ++++ + +L RA
Sbjct: 292 DDSLRCFLFKGHEDDFFSQLGPWKVEEIAKQPYVVRFFDILNDNEINSLERLGEEKLARA 351

Query: 121 TVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
           TV +  T +L  A+YR+SKSAWL++ +   +E+ +RR+  +TGL    AE+LQ+ NYGIG
Sbjct: 352 TVFDPATHKLVNADYRVSKSAWLKDEDSDTVEKYNRRISRLTGLDLEYAEQLQMSNYGIG 411

Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
           G YEPHYD++R         +    R+AT L Y++ V QGG TVFT L L +   KG+A 
Sbjct: 412 GQYEPHYDYSRRE-----WDIYNNRRIATWLSYLTTVEQGGGTVFTELGLHIRSIKGSAV 466

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGS 266
           FW+NL  +G GD  TRHAACPVL G+
Sbjct: 467 FWYNLLPNGSGDERTRHAACPVLRGN 492


>gi|260802724|ref|XP_002596242.1| hypothetical protein BRAFLDRAFT_117983 [Branchiostoma floridae]
 gi|229281496|gb|EEN52254.1| hypothetical protein BRAFLDRAFT_117983 [Branchiostoma floridae]
          Length = 527

 Score =  184 bits (467), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 100/207 (48%), Positives = 137/207 (66%), Gaps = 19/207 (9%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLE------------VTEREKYE 49
           I P H RA  N  ++++ + KS  L   PPK  +VA ++E            + ERE YE
Sbjct: 252 INPEHTRAINNMKFFEKEMEKSQNLV-APPKDEDVA-SIERGEYKRDLARDYLPEREIYE 309

Query: 50  MLCRGD----LTVPPAIVAQLKCRYV-HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           +LC+ +      + P+    LKCRY  + N P L L P K E+ + +P++ ++ +++ D 
Sbjct: 310 LLCQAEQPDMFNITPSRAKHLKCRYFTNNNHPRLLLAPQKLEQVFDKPKMWIFHNILTDP 369

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           E+ +IK +AQPRLRRAT+QN  TGELE A+YRISKSAWL+  EH VI R+++RVE +TGL
Sbjct: 370 EMKVIKDLAQPRLRRATIQNSITGELEHASYRISKSAWLQGWEHKVIRRVNQRVEDVTGL 429

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFAR 191
           T  TAEELQVVNYG+GGHYEPH+DFAR
Sbjct: 430 TMETAEELQVVNYGMGGHYEPHFDFAR 456


>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
           garnettii]
          Length = 544

 Score =  184 bits (467), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 111/273 (40%), Positives = 158/273 (57%), Gaps = 19/273 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y++ L +SP     E   + P V    P L+   R+ YE LC+   + 
Sbjct: 258 PDNKRMARNVLKYEKLLAESPNQAVAETVMQRPNV----PHLQT--RDTYEGLCQTLGSQ 311

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y   + PYL L P+++E  +L+P + LY D + DSE   I+++A+P L
Sbjct: 312 PTHYQIPSLYCSYETNSSPYLLLQPIRKEVIHLEPFVALYHDFVSDSEAQKIRELAEPWL 371

Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
           +R+ V    +GE ++  +YRISKSAWL++   P++  +  R+  +TGL      AE LQV
Sbjct: 372 QRSVV---ASGEKQLQVDYRISKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQV 428

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+  
Sbjct: 429 VNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 487

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            K  A FW NLH +G+GD  T HA CPVL G  
Sbjct: 488 VKNAALFWWNLHRNGEGDSDTLHAGCPVLVGDK 520


>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
           latipes]
          Length = 517

 Score =  184 bits (466), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 104/228 (45%), Positives = 135/228 (59%), Gaps = 8/228 (3%)

Query: 42  VTEREKYEMLCRGDLTVPPAIVA-QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
           ++ R+ YE LCR   + P      +L C Y   N P L L+P+K E   LQP +++Y + 
Sbjct: 268 LSTRDTYERLCRTQGSQPIHFENPRLYCDYFTNNNPALLLLPVKREVLSLQPYVVIYHNF 327

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELE-IANYRISKSAWLREPEHPVIERISRRVE 159
           + D E + IK  AQP LRR+ V    +GE +    YRISKSAWL+  E  ++ ++ +R+ 
Sbjct: 328 ITDREAEEIKGFAQPALRRSVV---ASGENQATVEYRISKSAWLKGSESCIVGKLDQRIS 384

Query: 160 HMTGLTTS--TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDV 217
            +TGL      AE LQVVNYGIGGHYEPH+D A    +  FK L TGNRVAT + Y+S V
Sbjct: 385 MLTGLNVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPVFK-LKTGNRVATFMIYLSSV 443

Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
             GG+T F   N S+   K  A FW NLH +G GD  T HA CPVL G
Sbjct: 444 EAGGSTAFIYANFSVPVLKKAAIFWWNLHRNGRGDAETLHAGCPVLIG 491


>gi|427783867|gb|JAA57385.1| Putative prolyl 4-hydroxylase subunit alpha-1 [Rhipicephalus
           pulchellus]
          Length = 548

 Score =  184 bits (466), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 99/232 (42%), Positives = 142/232 (61%), Gaps = 11/232 (4%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E + Y+ LCRG+    P + +QL+CRY +    +LRL P+K EEA L+P II + D++ D
Sbjct: 288 ETQNYKRLCRGEQLRTPKMDSQLRCRYYYGRNGFLRLQPVKIEEANLKPYIITFHDIIGD 347

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I+ +   A PRL R+T  +Y     E +  R S +AWL + + PV  R++R VE + G
Sbjct: 348 RDINDLLAYATPRLFRST--HYGEHGTETSLIRTSSTAWLGDQDAPVATRLNRFVESLLG 405

Query: 164 LTTS----TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-----GTGNRVATVLFYM 214
           L +      AE  Q+ NYG+GG Y  H+DF     A+  + L       G+R+AT++FY+
Sbjct: 406 LGSQYLKGEAEYYQLANYGVGGQYIAHHDFLADIYADPNRKLDDFERSAGDRIATLMFYL 465

Query: 215 SDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           SDV +GGATVF  L + L P+KG AAFW NL+S G+G+  T+H  CPVL GS
Sbjct: 466 SDVEEGGATVFPHLGVRLTPKKGNAAFWWNLNSDGEGEQLTKHGGCPVLYGS 517


>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
 gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
          Length = 540

 Score =  184 bits (466), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 94/247 (38%), Positives = 141/247 (57%), Gaps = 10/247 (4%)

Query: 43  TEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMY 102
            E   Y+ +CR +L   PA   +L+CR    N       P + EE +L P +I   D++ 
Sbjct: 282 NEYHMYQQVCREELKPEPATQRKLRCRLHRGNGLRSSYQPYRLEELHLDPYVIQVHDIIS 341

Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
             E  +++++A+P L+R+ V +    E    N+RIS+  +    EHP+++R+S+ +E+++
Sbjct: 342 AEETIVLQQLARPELQRSMVYSLSNSEHISTNFRISQGTFFEYHEHPIMQRMSQHLENIS 401

Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
           GL   +AE+LQV NYGIGGHYEPH D           +  + NRVAT ++Y+S+V  GG 
Sbjct: 402 GLDMRSAEQLQVANYGIGGHYEPHMDSFSENHNYGINTYMSTNRVATGIYYLSNVEAGGG 461

Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC--------- 273
           T F  L L + PE+G+  FW+NLH SGD DY T+HA CPVL GS  + +           
Sbjct: 462 TAFPFLPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLMGSKWIANVWIRLSNQDHI 521

Query: 274 -PCGLRR 279
            PC L+R
Sbjct: 522 RPCDLQR 528


>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
           jacchus]
          Length = 544

 Score =  184 bits (466), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 114/273 (41%), Positives = 159/273 (58%), Gaps = 21/273 (7%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y+  L +SP     E   + P +    P L+   R+ YE LC+  L  
Sbjct: 258 PDNKRMARNVLKYERLLAESPNQVVAEAVIQRPNI----PHLQT--RDTYEGLCQT-LGS 310

Query: 59  PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
            P +  +  L C Y   + PYL L P+++E  +L+P I LY D + DSE   I++ A+P 
Sbjct: 311 QPTLYQIPSLYCSYETNSNPYLVLQPIQKEILHLEPYIALYHDFVSDSEAQKIREFAEPW 370

Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
           L+R+ V    +GE ++   YRISKSAWL++   P++  ++ R+  +TGL      AE LQ
Sbjct: 371 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQ 427

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           VVNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   NLS+ 
Sbjct: 428 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVP 486

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             K  A FW NLH SG+GD  T HA CPVL G+
Sbjct: 487 VVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGN 519


>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
 gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
          Length = 495

 Score =  184 bits (466), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 97/201 (48%), Positives = 123/201 (61%), Gaps = 19/201 (9%)

Query: 96  LYRDVMYDSEIDLIKKMAQ----PRLRRATVQNYKTGELEIANYRISKSAWLREPEH-PV 150
           ++ +  +  E D+ K++ +    P L RATV N  TG LE A+YRISK+ WL   EH  V
Sbjct: 295 VFENFNWHVERDIYKRLCRGEKLPTLNRATVHNPITGHLETAHYRISKNCWLSGREHGEV 354

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I+R+ RR+  MT L   TAE  QV NYG+ G Y+PH+DF+R    ++  SLGTGNR+ATV
Sbjct: 355 IDRVERRIAAMTRLNLETAEGFQVQNYGLAGQYDPHFDFSRDLANSSLGSLGTGNRIATV 414

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG----- 265
           L +MS V  GGATVF  +   + P+KG A FWHNL  SGDGD+ TRHA CPVL+G     
Sbjct: 415 LVWMSQVESGGATVFPYVGARILPQKGDAVFWHNLLRSGDGDFRTRHAGCPVLSGIKWVA 474

Query: 266 -------SNSLHSTCPCGLRR 279
                   N  H   PC LRR
Sbjct: 475 NKWIHEYGNEFHR--PCSLRR 493


>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
           caballus]
          Length = 548

 Score =  183 bits (464), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 111/271 (40%), Positives = 157/271 (57%), Gaps = 19/271 (7%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y++ L +SP     E   + P V    P L+   R+ YE LC+   + 
Sbjct: 262 PDNKRMARNVLKYEKLLAESPNQVVAEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 315

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y   + P+L L P+++E  +L+P ++LY D + DSE   I+ +A+P L
Sbjct: 316 PTHYQIPSLYCSYETNSSPFLLLQPVRKEVIHLEPYVVLYHDFVSDSEAQKIRGLAEPWL 375

Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
           +R+ V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQV
Sbjct: 376 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQV 432

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+  
Sbjct: 433 VNYGIGGHYEPHFDHATSPTSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 491

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            K  A FW NLH SG+GD  T HA CPVL G
Sbjct: 492 VKNAALFWWNLHRSGEGDSDTLHAGCPVLVG 522


>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
 gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
          Length = 535

 Score =  183 bits (464), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 102/247 (41%), Positives = 142/247 (57%), Gaps = 15/247 (6%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E + YE +CRGDL   PA + +L+CR+    + Y    P K EE   +P +     V+  
Sbjct: 281 EFKMYEQVCRGDLNPSPAKLRELRCRFRRSRLGY---APFKLEELSHEPLVFQVHQVVSS 337

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGEL-EIANYRISKSAWLREPEHPVIERISRRVEHMT 162
              + IKKMA+P+++R+TV +   G   + A +R S+ A      +   + +SR V  ++
Sbjct: 338 KSAEFIKKMARPKIKRSTVYSIGGGGGSQAAAFRTSQGASFNYSRNAATKILSRHVGDLS 397

Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
            L  + AEELQV NYGIGGHYEPH+D + P      +    GNR+AT ++Y+SDV  GG 
Sbjct: 398 SLDMNFAEELQVANYGIGGHYEPHWD-SFPENHIYDEGDDRGNRIATGIYYLSDVEAGGG 456

Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL----------HST 272
           T F  L L + PEKG+  FW+NLH SGD DY T+HAACPVL GS  +          H+ 
Sbjct: 457 TAFPFLPLLVTPEKGSLLFWYNLHESGDQDYRTKHAACPVLQGSKWIANVWIRERNQHNV 516

Query: 273 CPCGLRR 279
            PCGL+R
Sbjct: 517 RPCGLQR 523


>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
           africana]
          Length = 544

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 110/268 (41%), Positives = 153/268 (57%), Gaps = 11/268 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
           P ++R   N L Y+  L  SP ++  E        P L+   R+ YE LC+   + P   
Sbjct: 258 PDNKRMARNVLKYERLLADSPKQMVAEAVIQRPNVPHLQT--RDTYEGLCQTLGSQPTHY 315

Query: 63  -VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
            +  L C Y   + PYL L P ++E  +L+P ++LY D + D E   IK +A+P L+R+ 
Sbjct: 316 QIPSLYCSYETNSNPYLLLQPFRKEVIHLEPYVVLYHDFVNDMEAQKIKGLAEPWLQRSV 375

Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYG 178
           V    +GE ++  +YRISKSAWL++   P++  +  R+  +TGL      AE LQVVNYG
Sbjct: 376 V---ASGEKQLQVDYRISKSAWLKDSVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 432

Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           IGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+   K  
Sbjct: 433 IGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSAVEAGGATAFIYANFSMPVVKNA 491

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           A FW NLH SG+GD  T HA CPVL G 
Sbjct: 492 ALFWWNLHRSGEGDGDTLHAGCPVLVGD 519


>gi|417402564|gb|JAA48127.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
          Length = 544

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 112/268 (41%), Positives = 153/268 (57%), Gaps = 13/268 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVN--NVAPTLEVTEREKYEMLCRGDLTVPPA 61
           P ++R   N L Y++ L +SP        +   NV P L+   R  YE LC+   + P  
Sbjct: 258 PDNKRMARNVLKYEKLLAESPSQAAAEAVIQRPNV-PHLQT--RATYEELCQTLGSQPTH 314

Query: 62  IV-AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
                L C Y     PYL L P+++E  +L+P ++LY D + D E   I+  A+P L+R+
Sbjct: 315 YQNPSLHCSYETGASPYLLLQPIRKEVVHLEPYVVLYHDFVNDLEAQKIRGFAEPWLQRS 374

Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
            V    +GE ++   YRISKSAWL++   P++  + RR+  +TGL T    AE LQVVNY
Sbjct: 375 VV---ASGEKQLPVEYRISKSAWLKDTVDPMLVTLDRRIAALTGLDTQPPYAEHLQVVNY 431

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+   K 
Sbjct: 432 GIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKN 490

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            A FW NLH SG+GD  T HA CPVL G
Sbjct: 491 AALFWWNLHRSGEGDGDTLHAGCPVLVG 518


>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
           niloticus]
          Length = 517

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 105/231 (45%), Positives = 133/231 (57%), Gaps = 18/231 (7%)

Query: 45  REKYEMLCRGD------LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYR 98
           R+ YE LCR         T P     QL C Y   N P L LMP + E   LQP ++LY 
Sbjct: 271 RDTYERLCRTQGSQRRHFTNP-----QLFCDYFTNNNPALMLMPARRELVSLQPYVVLYH 325

Query: 99  DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE-IANYRISKSAWLREPEHPVIERISRR 157
           D + D+E + IK +A P LRR+ V     GE +  A+YRISKSAWL+     ++ ++ +R
Sbjct: 326 DFVTDTEAEDIKSLAHPGLRRSVV---AAGEKQATADYRISKSAWLKGSAQSIVGKLDQR 382

Query: 158 VEHMTGLTTST--AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
           +  +TGL       E LQVVNYGIGGHYEPH+D A    +  FK L TGNRVAT + Y+S
Sbjct: 383 ISLLTGLNVKHPYGEYLQVVNYGIGGHYEPHFDHATSPSSPVFK-LKTGNRVATFMIYLS 441

Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            V  GG+T F   N S+   +  A FW NLH +G+GD  T HA CPVL G 
Sbjct: 442 PVEAGGSTAFIYANFSVPVVEKAAIFWWNLHRNGEGDDDTLHAGCPVLIGD 492


>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
          Length = 544

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 111/268 (41%), Positives = 157/268 (58%), Gaps = 13/268 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
           P ++R   N L Y+  L +SP ++  E        P L+   R+ YE LC+  L   P +
Sbjct: 258 PDNKRMARNVLKYERLLAESPNQVVSEAVIQRPNTPHLQT--RDTYEGLCQ-TLGSQPTL 314

Query: 63  --VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
             +  L C Y   +  YL L P+++E  +L+P I LY D + DSE   I+++A+P L+R+
Sbjct: 315 YQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRS 374

Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
            V    +GE ++   YRISKSAWL++   P++  ++ R+  +TGL      AE LQVVNY
Sbjct: 375 VV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNY 431

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   NLS+   + 
Sbjct: 432 GIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRN 490

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            A FW NLH SG+GD  T HA CPVL G
Sbjct: 491 AALFWWNLHRSGEGDSDTLHAGCPVLVG 518


>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
           leucogenys]
          Length = 544

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 112/269 (41%), Positives = 157/269 (58%), Gaps = 13/269 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
           P ++R   N L Y+  L +SP +L  E        P L+   R+ YE LC+  L   P +
Sbjct: 258 PDNKRMARNVLKYERLLAESPNQLVAEAVIQRPNIPHLQT--RDIYEGLCQ-TLGCQPTL 314

Query: 63  --VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
             +  L C Y   +  YL L P+++E  +L+P I LY D + DSE   I+++A+P L+R+
Sbjct: 315 YQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRS 374

Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
            V    +GE ++   YRISKSAWL++   P++  ++ R+  +TGL      AE LQVVNY
Sbjct: 375 VV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNY 431

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   NLS+   + 
Sbjct: 432 GIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRN 490

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            A FW NLH SG+GD  T HA CPVL G 
Sbjct: 491 AALFWWNLHRSGEGDSDTLHAGCPVLVGD 519


>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
 gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
          Length = 529

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 99/233 (42%), Positives = 139/233 (59%), Gaps = 6/233 (2%)

Query: 35  NVAPTLEVT-EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPR 93
           NV P    T + + YE  CRG    P  +  +L C Y      +LRL PLK E   L P 
Sbjct: 276 NVVPKKFFTPQAQAYERGCRGQY--PQNL--KLYCVYNSTTSAFLRLAPLKMELISLDPY 331

Query: 94  IILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIER 153
           +++Y DV+  SEI  ++ +A P L+RATV N ++    +   R SK  WL +  + +  R
Sbjct: 332 MVIYHDVISPSEISELQSLAVPGLKRATVFNQQSMRNHVVKTRTSKVTWLLDTLNQLTIR 391

Query: 154 ISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY 213
           ++RR+  MTG     +E LQV+NYG+GGHY+ HYD+     A     L  G+R+ATVLFY
Sbjct: 392 LNRRITDMTGFDMYGSEMLQVMNYGLGGHYDKHYDYFNSSVAADLTRLN-GDRIATVLFY 450

Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           ++DV QGGATVF ++  +++P+ GTA  W+NL   G+GD  T HAACPV+ GS
Sbjct: 451 LTDVEQGGATVFPNIEKAVFPKSGTAVVWYNLRHDGNGDPQTLHAACPVIVGS 503


>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
 gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
          Length = 542

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 108/289 (37%), Positives = 161/289 (55%), Gaps = 28/289 (9%)

Query: 6   HQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEM---LCRGDLTVPPAI 62
           H+ A  NK+ Y+  L K  E    P K + ++   +   +E Y++   +CRG+L   P  
Sbjct: 252 HEEALRNKVAYEAILAK--ERNHRPRKPSALSEPNKKEAKESYQLYKRVCRGELRQSPRQ 309

Query: 63  VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
             +L+C + H+NV + RL P K E+  L P +  + + +  SE++ I +     + R+ V
Sbjct: 310 QRKLRCLFSHQNVAFYRLAPFKVEQLNLDPYVAYFHEAINSSEMEQIIEKGLGSMERSRV 369

Query: 123 ---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
              QN  T E+     R S + WL   E+P + +I +R+E +TGL+T +AE LQ+VNYGI
Sbjct: 370 GQSQNATTSEI-----RTSANTWLWYNENPWLSKIKQRLEDITGLSTESAEPLQLVNYGI 424

Query: 180 GGHYEPHYDFAR-PGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           GG YEPH+DF   P +   +K    GNR+ T LFY++DVA GGAT F  L L++ P KG+
Sbjct: 425 GGQYEPHFDFVEEPQKVFGWK----GNRMLTALFYINDVALGGATAFPFLQLAVPPVKGS 480

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
              W+NLH S   D+ T+HA CPV+ GS  + +            PCGL
Sbjct: 481 LLVWYNLHRSLHKDFRTKHAGCPVIKGSKWICNEWFHEGTQVFKRPCGL 529


>gi|313241587|emb|CBY33829.1| unnamed protein product [Oikopleura dioica]
          Length = 541

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 100/234 (42%), Positives = 136/234 (58%), Gaps = 6/234 (2%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRD 99
           E  E E YE LCR    +P      LKC Y   N  P+L L P+K EE + +P II + +
Sbjct: 278 EREETEYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYE 337

Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEHPVIERIS 155
           ++ D E+D+I K A+P+   ATVQ+  TG+L  A+YRIS+SAWL       +   + +  
Sbjct: 338 IITDEELDIINKQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFR 397

Query: 156 RRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
           +R+  +TGLT   AE++Q  NYGIGG YEPHYD +   +A  F     GNR+AT L Y++
Sbjct: 398 KRISIITGLTMERAEDIQYSNYGIGGQYEPHYDMSTENDAGKFDE-EDGNRIATWLTYLN 456

Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           +   GG TVF    +   P   +A FW+NL   G  DY TRHAACPVL G  ++
Sbjct: 457 EPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQKTV 510


>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
          Length = 477

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 111/272 (40%), Positives = 156/272 (57%), Gaps = 19/272 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y++ L +SP     E   + P V    P L+   R+ YE LC+   + 
Sbjct: 192 PDNKRMARNVLKYEKLLAESPNQVVAEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 245

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y   + PYL L P+++E  +L+P ++LY D + D E   I+ +A+P L
Sbjct: 246 PIHYQIPSLYCSYETNSSPYLLLQPIRKEVIHLEPYVVLYHDFVSDMEAQKIRGLAEPWL 305

Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
           +R+ V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQV
Sbjct: 306 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPLLVNLDHRIGALTGLDVQPPYAEYLQV 362

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+  
Sbjct: 363 VNYGIGGHYEPHFDHATSPTSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 421

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            K  A FW NLH SG+GD  T HA CPVL G 
Sbjct: 422 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGD 453


>gi|313213106|emb|CBY36968.1| unnamed protein product [Oikopleura dioica]
          Length = 541

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 100/234 (42%), Positives = 136/234 (58%), Gaps = 6/234 (2%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRD 99
           E  E E YE LCR    +P      LKC Y   N  P+L L P+K EE + +P II + +
Sbjct: 278 EREETEYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYE 337

Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEHPVIERIS 155
           ++ D E+D+I K A+P+   ATVQ+  TG+L  A+YRIS+SAWL       +   + +  
Sbjct: 338 IITDEELDIINKQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFR 397

Query: 156 RRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
           +R+  +TGLT   AE++Q  NYGIGG YEPHYD +   +A  F     GNR+AT L Y++
Sbjct: 398 KRISIITGLTMERAEDIQYSNYGIGGQYEPHYDMSTENDAGKFDE-EDGNRIATWLTYLN 456

Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           +   GG TVF    +   P   +A FW+NL   G  DY TRHAACPVL G  ++
Sbjct: 457 EPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQKTV 510


>gi|195452742|ref|XP_002073480.1| GK13123 [Drosophila willistoni]
 gi|194169565|gb|EDW84466.1| GK13123 [Drosophila willistoni]
          Length = 540

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 105/294 (35%), Positives = 158/294 (53%), Gaps = 24/294 (8%)

Query: 4   PTHQRAQGNKLYY--QEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
           P H+ A  +K  Y  Q A  + P L     KV+  + +L++     Y+ +CRG+L   P 
Sbjct: 247 PNHETALKDKPIYETQLAWQRDPRLNVAASKVDESSKSLDL-----YQRVCRGELRQSPR 301

Query: 62  IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
              +L+C Y  R V + RL P K E+  L P +  + +V+ D E D + +    +++R+ 
Sbjct: 302 QQRKLRCFYSDRGVAFYRLGPFKVEQLNLDPYVAYFHNVISDDETDDLIEHGMGQVKRSR 361

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
           V     G   ++  R S++ WL   + P ++ +  R+E +TGL   +AE LQ+VNYGIGG
Sbjct: 362 VGT--VGNSTVSEVRTSQNTWLWYEQQPWLKNLKLRLEDITGLGMESAEPLQLVNYGIGG 419

Query: 182 HYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
           HYEPHYDF      +   + G  GNR+ T L Y+++V  GGAT F  L L++ P KG+  
Sbjct: 420 HYEPHYDFVE----DKVTTFGWKGNRLLTALLYLNEVPMGGATAFPYLKLAVPPVKGSLL 475

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRRGLQRS 284
            W+NLH S D D+ T+HA CPVL GS  + +            PCGL    ++S
Sbjct: 476 VWYNLHRSLDPDFRTKHAGCPVLMGSKWVCNEWFHEGAQEFRRPCGLMNDSKKS 529


>gi|313229343|emb|CBY23930.1| unnamed protein product [Oikopleura dioica]
          Length = 542

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 100/234 (42%), Positives = 136/234 (58%), Gaps = 6/234 (2%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRD 99
           E  E E YE LCR    +P      LKC Y   N  P+L L P+K EE + +P II + +
Sbjct: 279 EREETEYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYE 338

Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEHPVIERIS 155
           ++ D E+D+I K A+P+   ATVQ+  TG+L  A+YRIS+SAWL       +   + +  
Sbjct: 339 IITDEELDIINKQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFR 398

Query: 156 RRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
           +R+  +TGLT   AE++Q  NYGIGG YEPHYD +   +A  F     GNR+AT L Y++
Sbjct: 399 KRISIITGLTMERAEDIQYSNYGIGGQYEPHYDMSTENDAGKFDE-EDGNRIATWLTYLN 457

Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           +   GG TVF    +   P   +A FW+NL   G  DY TRHAACPVL G  ++
Sbjct: 458 EPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQKTV 511


>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
           melanoleuca]
          Length = 539

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 111/273 (40%), Positives = 156/273 (57%), Gaps = 19/273 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y++ L +SP     E   + P V    P L+   R+ YE LC+   + 
Sbjct: 253 PDNKRMARNVLKYEKLLAESPNQVVAEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 306

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y   + PYL L P+++E  +L+P ++LY D + D E   I+ +A+P L
Sbjct: 307 PTHYQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDGEAQKIRGLAEPWL 366

Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
           +R+ V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQV
Sbjct: 367 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQV 423

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+  
Sbjct: 424 VNYGIGGHYEPHFDHATSPTSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 482

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            K  A FW NLH SG+GD  T HA CPVL G  
Sbjct: 483 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDK 515


>gi|281353153|gb|EFB28737.1| hypothetical protein PANDA_003344 [Ailuropoda melanoleuca]
          Length = 456

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 111/273 (40%), Positives = 155/273 (56%), Gaps = 19/273 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y++ L +SP     E   + P V    P L+   R+ YE LC+   + 
Sbjct: 193 PDNKRMARNVLKYEKLLAESPNQVVAEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 246

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y   + PYL L P+++E  +L+P ++LY D + D E   I+ +A+P L
Sbjct: 247 PTHYQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDGEAQKIRGLAEPWL 306

Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQV 174
           +R+ V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQV
Sbjct: 307 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQV 363

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A       ++ + +GNRVAT + Y+S V  GGAT F   N S+  
Sbjct: 364 VNYGIGGHYEPHFDHATVTMGPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 422

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            K  A FW NLH SG+GD  T HA CPVL G  
Sbjct: 423 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDK 455


>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
          Length = 572

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 112/273 (41%), Positives = 155/273 (56%), Gaps = 19/273 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y+  L ++P     E   + P V    P L+   R+ YE LC+   + 
Sbjct: 286 PENKRMVRNVLKYERLLAENPHQAVAETVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 339

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y   + PYL L P+++E  +L+P + LY D + D E   I+K+A+P L
Sbjct: 340 PIHYQIPGLYCSYETNSSPYLLLQPVRKEVIHLEPYVALYHDFVSDPEAQKIRKLAEPWL 399

Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
           +R+ V    +GE ++   YRISKSAWL++   PV+  +  R+  +TGL      AE LQV
Sbjct: 400 QRSVV---ASGEKQLQVEYRISKSAWLKDTADPVLVTLDHRIAALTGLDVQHPYAEYLQV 456

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+  
Sbjct: 457 VNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 515

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            K  A FW NLH SG+GD  T HA CPVL G  
Sbjct: 516 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDK 548


>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
 gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
          Length = 541

 Score =  181 bits (458), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 103/257 (40%), Positives = 140/257 (54%), Gaps = 8/257 (3%)

Query: 16  YQEALNKSPELKDEPPKVNNVAPTLE----VTEREKYEMLCRGDLTVPPAIVAQLKCRYV 71
           Y   ++K  + K  P  +   AP       +++ + Y   C G +         L+C Y+
Sbjct: 250 YNNFISKHLDEKQSPATLEEHAPIPSDPSVMSDFDIYRFTCSGHIKKTAREERHLRCGYL 309

Query: 72  HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE 131
               P+L L PLK EE    P ++LY DV+Y SEID+I+ + +  + RATV   K  E  
Sbjct: 310 TETHPFLNLAPLKVEELNHNPLLVLYHDVIYQSEIDVIRNLTENEISRATVIGAKGSE-- 367

Query: 132 IANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FA 190
           ++  R S+  ++ +  H V++ I +RV  M+ L    AE  Q  NYGIGGHY  H D F 
Sbjct: 368 VSKVRTSQFTFIPKTRHKVLQTIDQRVADMSNLNMDYAELHQFANYGIGGHYAQHNDWFG 427

Query: 191 RPGEANAF-KSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSG 249
           +    N    S   GNR+ATVLFY+SDVAQGG T F  L   L P+K  AAFWHNLH+SG
Sbjct: 428 QDAFDNELVSSPEMGNRIATVLFYLSDVAQGGGTAFPHLKQLLQPKKYAAAFWHNLHASG 487

Query: 250 DGDYYTRHAACPVLTGS 266
            GD  T H ACP++ GS
Sbjct: 488 VGDLRTLHGACPIIAGS 504


>gi|73988166|ref|XP_851718.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Canis lupus
           familiaris]
          Length = 544

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 111/273 (40%), Positives = 156/273 (57%), Gaps = 19/273 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y++ L +SP     E   + P V    P L+   R+ YE LC+   + 
Sbjct: 258 PDNKRMARNVLKYEKLLAESPNQVVAEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 311

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y   + PYL L P+++E  +L+P ++LY D + D E   I+ +A+P L
Sbjct: 312 PTHYQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVNDVEAQKIRGLAEPWL 371

Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
           +R+ V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQV
Sbjct: 372 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQV 428

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+  
Sbjct: 429 VNYGIGGHYEPHFDHATSPTSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPV 487

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            K  A FW NLH SG+GD  T HA CPVL G  
Sbjct: 488 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDK 520


>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
 gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
          Length = 533

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 90/224 (40%), Positives = 136/224 (60%), Gaps = 11/224 (4%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           ++  C G L  P     +L C Y     P+LRL PLK E+  L+P ++LY +V+   EI 
Sbjct: 286 FKTSCNGLLEKP----TRLHCFYNFTTTPFLRLAPLKTEQIGLKPYVVLYHEVLSAREIS 341

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
           ++   A   ++   VQ+ K   +     R +K  WL++  + +  RI+RR+  MTG   +
Sbjct: 342 MLMGKAAQNMKNTRVQSEKA--VNTNRERTAKGYWLKKESNEMTRRITRRIVDMTGFDLA 399

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEAN-----AFKSLGTGNRVATVLFYMSDVAQGGA 222
            +E+ QV+NYGIGGHY  H+D+     +N     +  S+  G+R+ATVLFY++DV QGGA
Sbjct: 400 DSEDFQVINYGIGGHYSLHFDYFGFASSNYTGERSHHSIVLGDRIATVLFYLTDVEQGGA 459

Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           TVF ++  S++P+ GTA FW+NL + G+GD  TRHA+CPV+ GS
Sbjct: 460 TVFGNVGYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPVVVGS 503


>gi|410972729|ref|XP_003992809.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Felis catus]
          Length = 533

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 110/270 (40%), Positives = 157/270 (58%), Gaps = 13/270 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPE--LKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
           P ++R   N L Y++ L +SP   + +   +  NV P L+   R+ YE LC+   + P  
Sbjct: 247 PDNKRMSRNVLKYEKLLAESPTRVVAEAVIRRPNV-PHLQT--RDTYEGLCQTLGSQPTH 303

Query: 62  I-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
             +  L C Y   + PYL L P+++E  +L+P ++LY D + D E   I+ +A+P L+R+
Sbjct: 304 YQIPSLYCSYETNSSPYLLLQPIRKEVIHLEPYVVLYHDFVNDLEAQKIRGLAEPWLQRS 363

Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
            V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQVVNY
Sbjct: 364 VV---ASGEKQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNY 420

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+   K 
Sbjct: 421 GIGGHYEPHFDHATSPTSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKN 479

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            A FW NLH SG+GD  T HA CPVL G  
Sbjct: 480 AALFWWNLHRSGEGDGDTLHAGCPVLVGDK 509


>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3 [Papio anubis]
          Length = 535

 Score =  180 bits (457), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 110/267 (41%), Positives = 151/267 (56%), Gaps = 20/267 (7%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI- 62
           P ++R   N L Y+  L +SP          N      V +R   E LC+  L   P + 
Sbjct: 258 PDNKRMARNVLKYERXLAESP----------NQVVAEAVIQRPNXEGLCQ-TLGSQPTLY 306

Query: 63  -VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
            +  L C Y   +  YL L P+++E  +L+P I LY D + DSE   I++ A+P L+R+ 
Sbjct: 307 QIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSV 366

Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYG 178
           V    +GE ++   YRISKSAWL++   P++  ++ R+  +TGL      AE LQVVNYG
Sbjct: 367 V---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 423

Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           IGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   NLS+   K  
Sbjct: 424 IGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVKNA 482

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A FW NLH SG+GD  T HA CPVL G
Sbjct: 483 ALFWWNLHRSGEGDSDTLHAGCPVLVG 509


>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
           harrisii]
          Length = 521

 Score =  180 bits (457), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 111/272 (40%), Positives = 151/272 (55%), Gaps = 19/272 (6%)

Query: 4   PTHQRAQGNKLYYQEALNK-----SPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N   Y+  L +      PE+  + P V    P L+   R+ YE LC+   + 
Sbjct: 235 PDNKRIARNIRKYERLLEEKSNVTGPEVAIKRPNV----PHLQT--RDTYEGLCQTLGSQ 288

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y     PYL L P+++E  +L+P I+LY D + DSE   I+  A P L
Sbjct: 289 PTHYQIPSLYCAYETNGSPYLLLQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWL 348

Query: 118 RRATVQNYKTGE-LEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQV 174
           +R+ V    +GE  +   YRISKSAWL++   P++  + RR+  +TGL      AE LQV
Sbjct: 349 QRSVV---ASGEKQQQVEYRISKSAWLKDTVDPILVSLDRRIAALTGLNVQPPYAEHLQV 405

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GG+T F   N S+  
Sbjct: 406 VNYGIGGHYEPHFDHATSPSSPLYR-MNSGNRVATFMIYLSSVEAGGSTAFIYANFSVPV 464

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            K  A FW NLH SG GD  T HA CPVL G 
Sbjct: 465 VKNAALFWWNLHRSGQGDGDTLHAGCPVLVGD 496


>gi|116496629|gb|AAI26171.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
          Length = 544

 Score =  180 bits (457), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 112/273 (41%), Positives = 158/273 (57%), Gaps = 21/273 (7%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y+  L +SP     E   + P +    P L+   R+ YE LC+  L  
Sbjct: 258 PDNKRMARNVLKYERLLAESPNHVVAEAVIQRPNI----PHLQT--RDTYEGLCQ-TLGS 310

Query: 59  PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
            P +  +  L C Y   +  YL L P+++E  +L+P I LY D + DSE   I+++A+P 
Sbjct: 311 QPTLYQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPW 370

Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
           L+R+ V    +GE ++   YRISKSAWL++  +P +  ++ R+  +TGL      AE LQ
Sbjct: 371 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVNPKLVTLNHRIAALTGLDVRPPYAEYLQ 427

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           VVNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   NLS+ 
Sbjct: 428 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVP 486

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             +  A FW NLH SG+GD  T HA CPVL G 
Sbjct: 487 VVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGD 519


>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
 gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
          Length = 521

 Score =  180 bits (456), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 95/241 (39%), Positives = 144/241 (59%), Gaps = 7/241 (2%)

Query: 26  LKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKE 85
           +++EP    N+ P         +E  CRG+   P    A+L C Y   + P+LRL PLK 
Sbjct: 265 IRNEP----NIKPKPFNKSVGDFERGCRGEF--PALTDAKLYCIYNTTSSPFLRLAPLKM 318

Query: 86  EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE 145
           E   L P ++LY DV+  +EI  +++MA+P L+RATV N      +    R +K AW  +
Sbjct: 319 ELIGLDPYMVLYHDVISPNEIAELQEMAKPELKRATVYNSTKNTNQFVKTRTAKVAWFLD 378

Query: 146 PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
             + + ER+++R+  MT    + +E LQV+NYG+GG+Y  H+D+      N   S   G+
Sbjct: 379 TFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTT-TNPHISQINGD 437

Query: 206 RVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           R+ATVLFY++DV QGGATVF  +  +++P++G+A  W+NL   G+G+  T HAACPV+ G
Sbjct: 438 RIATVLFYLNDVEQGGATVFPEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAACPVIVG 497

Query: 266 S 266
           S
Sbjct: 498 S 498


>gi|38454288|ref|NP_942070.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Rattus norvegicus]
 gi|81870816|sp|Q6W3E9.1|P4HA3_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962768|gb|AAQ87605.1| collagen prolyl 4-hydroxylase alpha III subunit [Rattus norvegicus]
          Length = 544

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 111/270 (41%), Positives = 154/270 (57%), Gaps = 13/270 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVN--NVAPTLEVTEREKYEMLCRGDLTVPPA 61
           P ++R   N L Y+  L ++  L      +   NV P L+   R+ YE LC+   + P  
Sbjct: 258 PDNKRMARNVLKYERLLAENGHLMAAETAIQRPNV-PHLQT--RDTYEGLCQTLGSQPTH 314

Query: 62  I-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
             +  L C Y   + PYL L P ++E  +L+P + LY D + D E   I+++A+P L+R+
Sbjct: 315 YQIPSLYCSYETNSSPYLLLQPARKEVIHLRPLVALYHDFVSDEEAQKIRELAEPWLQRS 374

Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
            V    +GE ++   YRISKSAWL++   PV+  + RR+  +TGL      AE LQVVNY
Sbjct: 375 VV---ASGEKQLQVEYRISKSAWLKDTVDPVLVTLDRRIAALTGLDIQPPYAEYLQVVNY 431

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGGHYEPH+D A    +  +K + +GNR AT++ Y+S V  GGAT F   N S+   K 
Sbjct: 432 GIGGHYEPHFDHATSPSSPLYK-MKSGNRAATLMIYLSSVEAGGATAFIYGNFSVPVVKN 490

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            A FW NLH SG+GD  T HA CPVL G  
Sbjct: 491 AALFWWNLHRSGEGDDDTLHAGCPVLVGDK 520


>gi|313242424|emb|CBY34571.1| unnamed protein product [Oikopleura dioica]
          Length = 503

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 99/234 (42%), Positives = 136/234 (58%), Gaps = 6/234 (2%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRD 99
           E  E E YE LCR    +P      LKC Y   N  P+L L P+K EE + +P II + +
Sbjct: 240 EREETEYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYE 299

Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEHPVIERIS 155
           ++ D E+D+I + A+P+   ATVQ+  TG+L  A+YRIS+SAWL       +   + +  
Sbjct: 300 IITDEELDIINEQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFR 359

Query: 156 RRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
           +R+  +TGLT   AE++Q  NYGIGG YEPHYD +   +A  F     GNR+AT L Y++
Sbjct: 360 KRISIITGLTMERAEDIQYSNYGIGGQYEPHYDMSTENDAGKFDE-EDGNRIATWLTYLN 418

Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           +   GG TVF    +   P   +A FW+NL   G  DY TRHAACPVL G  ++
Sbjct: 419 EPKHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQKTV 472


>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
          Length = 528

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 112/273 (41%), Positives = 157/273 (57%), Gaps = 21/273 (7%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y+  L +SP     E   + P +    P L+   R+ YE LC+  L  
Sbjct: 242 PDNKRMARNVLKYERLLAESPNHVVAEAVIQRPNI----PHLQT--RDTYEGLCQT-LGS 294

Query: 59  PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
            P +  +  L C Y   +  YL L P+++E  +L+P I LY D + DSE   I+++A+P 
Sbjct: 295 QPTLYQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPW 354

Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
           L+R+ V    +GE ++   YRISKSAWL++   P +  ++ R+  +TGL      AE LQ
Sbjct: 355 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQ 411

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           VVNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   NLS+ 
Sbjct: 412 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVP 470

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             +  A FW NLH SG+GD  T HA CPVL G 
Sbjct: 471 VVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGD 503


>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
           gorilla gorilla]
          Length = 517

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 112/273 (41%), Positives = 157/273 (57%), Gaps = 21/273 (7%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y+  L +SP     E   + P +    P L+   R+ YE LC+  L  
Sbjct: 231 PDNKRMARNVLKYERLLAESPNHVVAEAVIQRPNI----PHLQT--RDTYEGLCQT-LGS 283

Query: 59  PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
            P +  +  L C Y   +  YL L P+++E  +L+P I LY D + DSE   I+++A+P 
Sbjct: 284 QPTLYQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPW 343

Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
           L+R+ V    +GE ++   YRISKSAWL++   P +  ++ R+  +TGL      AE LQ
Sbjct: 344 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPKLVALNHRIAALTGLDVRPPYAEYLQ 400

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           VVNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   NLS+ 
Sbjct: 401 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVP 459

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             +  A FW NLH SG+GD  T HA CPVL G 
Sbjct: 460 VVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGD 492


>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
 gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
 gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
 gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
 gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
 gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III, isoform CRA_b
           [Homo sapiens]
 gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
          Length = 544

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 112/273 (41%), Positives = 157/273 (57%), Gaps = 21/273 (7%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y+  L +SP     E   + P +    P L+   R+ YE LC+  L  
Sbjct: 258 PDNKRMARNVLKYERLLAESPNHVVAEAVIQRPNI----PHLQT--RDTYEGLCQ-TLGS 310

Query: 59  PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
            P +  +  L C Y   +  YL L P+++E  +L+P I LY D + DSE   I+++A+P 
Sbjct: 311 QPTLYQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPW 370

Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
           L+R+ V    +GE ++   YRISKSAWL++   P +  ++ R+  +TGL      AE LQ
Sbjct: 371 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQ 427

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           VVNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   NLS+ 
Sbjct: 428 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSVP 486

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             +  A FW NLH SG+GD  T HA CPVL G 
Sbjct: 487 VVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGD 519


>gi|170029530|ref|XP_001842645.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
 gi|167863229|gb|EDS26612.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
          Length = 522

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 96/220 (43%), Positives = 135/220 (61%), Gaps = 9/220 (4%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           YE LCRG++      +++L+CR   +  P+LRL PLK EE  L+P I LY  V+ D EID
Sbjct: 274 YEPLCRGEVHRFADELSKLRCRLDTKTTPFLRLAPLKVEEVSLEPPIYLYHKVISDEEID 333

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT-GLTT 166
            + ++ + RL RATV     G++ ++  RIS++ WL E   P++  + RR   M+ GL+ 
Sbjct: 334 KLIELGKARLNRATV-----GQM-VSQVRISQNVWLSEEVDPLLGVLQRRTYDMSRGLSM 387

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
              + +QV NYGIGGH  PHYD     E   F     GNR+AT+++Y+SDV  GG TVF 
Sbjct: 388 QGFDMVQVNNYGIGGHNIPHYDC--DSEYPPFPQFNMGNRLATLMYYLSDVEVGGGTVFP 445

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            L+L ++P KG+A FWHN+H +G+ D    HA CP L GS
Sbjct: 446 RLSLGVFPIKGSAIFWHNVHHNGNVDERMLHAGCPTLIGS 485


>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
           domestica]
          Length = 559

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 111/272 (40%), Positives = 151/272 (55%), Gaps = 19/272 (6%)

Query: 4   PTHQRAQGNKLYYQEALNK-----SPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y+  L +      PE   + P V    P L+   R+ YE LC+   + 
Sbjct: 273 PNNKRVARNILKYERLLAEKSSVTGPEAAIKRPNV----PHLQT--RDTYEGLCQTLGSQ 326

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y     PYL L P+++E  +L+P I+LY D + DSE   I+  A P L
Sbjct: 327 PTHYQIPSLYCAYETNASPYLLLQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWL 386

Query: 118 RRATVQNYKTGE-LEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQV 174
           +R+ V    +GE  +   YRISKSAWL++   P++  +  R+  +TGL      AE LQV
Sbjct: 387 QRSVV---ASGEKQQQVEYRISKSAWLKDTVDPMLVSLDHRIAALTGLNVQPPYAEHLQV 443

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GG+T F   N S+  
Sbjct: 444 VNYGIGGHYEPHFDHATSPSSPLYR-MNSGNRVATFMIYLSSVEAGGSTAFIYANFSVPV 502

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            K  A FW NLH SG+GD  T HA CPVL G 
Sbjct: 503 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGD 534


>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
 gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
          Length = 537

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 93/225 (41%), Positives = 127/225 (56%), Gaps = 2/225 (0%)

Query: 42  VTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVM 101
           ++  E Y   C G +   P     L+C Y+    P+L L PLK EE    P ++LY DV+
Sbjct: 285 LSHDEIYRYTCNGYIKKTPPEERNLRCGYMSETHPFLLLAPLKVEELNRNPLLVLYHDVI 344

Query: 102 YDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHM 161
           Y SEID++ K+ + R  RA V    T    ++  R S+  ++    H V+  I +RV  M
Sbjct: 345 YQSEIDVLNKLNRKRYERAGVVINSTST--VSKKRTSQHIFIAATRHKVLRTIDQRVADM 402

Query: 162 TGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
           T L    AE+ Q+ +YGIGGHY  H+D+    +    K    GNR+ATVLFY+SDVAQGG
Sbjct: 403 TNLNMQYAEDHQLADYGIGGHYSQHFDWFGNSDLANSKCDEMGNRIATVLFYLSDVAQGG 462

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            T F  L   L P+K  AAFW+NLH+SG GD+   H  CP++ GS
Sbjct: 463 GTAFPILKQLLKPKKYAAAFWYNLHASGKGDWRNLHGGCPIIVGS 507


>gi|348555277|ref|XP_003463450.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cavia porcellus]
          Length = 584

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 110/269 (40%), Positives = 155/269 (57%), Gaps = 11/269 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSP--ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
           P ++R   N L Y+  L +S   E+ +   +  NV P L+   R+ YE LC+   + P  
Sbjct: 298 PENKRMVRNVLKYERLLAESSHQEVAETVIQRPNV-PHLQT--RDTYEGLCQTLGSQPIH 354

Query: 62  I-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
             +  L C Y   + PYL L P+++E  +L+P + LY D + D E   I+++A+P L+R+
Sbjct: 355 YQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVALYHDFVSDPEAQKIRELAEPWLQRS 414

Query: 121 TVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYG 178
            V +   G+     YRISKSAWL++   P++  ++ R+  +TGL      AE LQVVNYG
Sbjct: 415 VVAS--GGKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 472

Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           IGGHYEPH+D A    +  F+ + +GNRVAT + Y+S V  GGAT F   N S+   K  
Sbjct: 473 IGGHYEPHFDHATSPSSPLFR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKNA 531

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           A FW NLH SG+GD  T HA CPVL G  
Sbjct: 532 ALFWWNLHRSGEGDGDTLHAGCPVLVGDK 560


>gi|198418585|ref|XP_002122034.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1 (4-PH
           alpha-1)
           (Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1) [Ciona intestinalis]
          Length = 525

 Score =  177 bits (449), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 100/263 (38%), Positives = 150/263 (57%), Gaps = 20/263 (7%)

Query: 11  GNKLYYQEALNKSPEL----KDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQL 66
           GN LYY+  L + P L    KDE  +           E ++Y  +C+G   +P  +   L
Sbjct: 246 GNMLYYRMFL-RYPHLFIFHKDENAE----------DEIKQYNQICQGKFKLPHKVSKNL 294

Query: 67  KCR-YVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN- 124
           +C  Y ++N P LR+ P+K EE    P I+ + DV+ + +I+ IKKM++  L RA V   
Sbjct: 295 RCYLYTNKNDPRLRIKPVKVEELCNSPHIVQFYDVINNDDIETIKKMSKKHLSRALVTGP 354

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
             TG +E  + R SK AW ++ +   ++++  R+  MTGL+  T E+LQV NYG+ G Y+
Sbjct: 355 NNTGIVE--DIRTSKVAWFKKNDFTAVKKLYTRISEMTGLSEETFEDLQVANYGLAGEYQ 412

Query: 185 PHYDFAR-PGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
           PH+D+   P           GNR+AT+L Y++DV +GG T F    +   P KG+A FW+
Sbjct: 413 PHFDYTEDPSIYKREDGAEVGNRIATMLLYLNDVKEGGRTAFIEPKIVAKPIKGSAVFWY 472

Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
           NL+ SG GD  TRHA+CPV+ G+
Sbjct: 473 NLYPSGLGDPRTRHASCPVVIGN 495


>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
           rubripes]
          Length = 540

 Score =  177 bits (448), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 102/225 (45%), Positives = 131/225 (58%), Gaps = 8/225 (3%)

Query: 45  REKYEMLCRGDLTVPPAIVA-QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           R+ YE LC+   + P      QL C       P L L P++ E   L+P ++LY D + D
Sbjct: 294 RDTYERLCQTRGSQPVHFENPQLFCDNFANGHPGLLLRPVRREVLSLRPYVVLYHDFISD 353

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELE-IANYRISKSAWLREPEHPVIERISRRVEHMT 162
           SE + IK+ AQ  LRR+ V    TG+ +  A YRISKSAWL+   H  + R+ +++  +T
Sbjct: 354 SESEEIKQHAQLGLRRSVV---ATGDKQATAEYRISKSAWLKGSAHSTVSRLDQKISMLT 410

Query: 163 GLTTST--AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
           GL       E LQVVNYGIGGHYEPH+D A    +  FK L TGNRVAT + Y+S V  G
Sbjct: 411 GLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFK-LKTGNRVATFMIYLSSVEAG 469

Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           G+T F   N S+   K  A FW NLH +G+GD  T HA CPVL G
Sbjct: 470 GSTAFIYANFSVPVMKNAAIFWWNLHRNGEGDADTLHAGCPVLIG 514


>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 532

 Score =  177 bits (448), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 100/262 (38%), Positives = 150/262 (57%), Gaps = 27/262 (10%)

Query: 17  QEALNKSPELK----DEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVH 72
           +E  N+S E K    DE  + + V         E Y+ LCRG+    P + +QL+CRY  
Sbjct: 255 RERANRSTEFKAQLFDEEIEDDQVT--------ENYKRLCRGEQLRTPKMDSQLRCRYYT 306

Query: 73  RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEI 132
               + +L P+K EE  L+P +++ RD++ D +++ +   A+PRL ++  +     + + 
Sbjct: 307 GETGFFKLQPIKLEEFNLKPYVVVLRDLLQDRDLNDMIAFAKPRLEQS--KTLCAADKDG 364

Query: 133 ANYRISKSAWLREPEHPVIERISRRVEHMTGLTT----STAEELQVVNYGIGGHYEPHYD 188
              R S + WL + + PV  R+++ ++ + GL T      AE+ Q+ NYGIGGHY PH+D
Sbjct: 365 PPSRTSSNTWLNDEDAPVAARVNQYLQSLLGLGTLFSRDEAEKYQLANYGIGGHYVPHHD 424

Query: 189 ----FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
               F  P + N F     GNRVAT++ YMSDV +GGATVF SL + + P+KG A FW N
Sbjct: 425 YFEEFQTPSKGNRF-----GNRVATLMIYMSDVEEGGATVFPSLGVRVSPKKGDAVFWWN 479

Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
           + SS +G+  T HA CPVL GS
Sbjct: 480 IMSSWEGEMLTWHAGCPVLYGS 501


>gi|119595340|gb|EAW74934.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III, isoform CRA_a
           [Homo sapiens]
          Length = 657

 Score =  176 bits (447), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 113/275 (41%), Positives = 159/275 (57%), Gaps = 23/275 (8%)

Query: 1   MIF--PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCR 53
           +IF  P ++R   N L Y+  L +SP     E   + P +    P L+   R+ YE LC+
Sbjct: 285 IIFCCPDNKRMARNVLKYERLLAESPNHVVAEAVIQRPNI----PHLQT--RDTYEGLCQ 338

Query: 54  GDLTVPPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
             L   P +  +  L C Y   +  YL L P+++E  +L+P I LY D + DSE   I++
Sbjct: 339 -TLGSQPTLYQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRE 397

Query: 112 MAQPRLRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST-- 168
           +A+P L+R+ V    +GE ++   YRISKSAWL++   P +  ++ R+  +TGL      
Sbjct: 398 LAEPWLQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPY 454

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           AE LQVVNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   
Sbjct: 455 AEYLQVVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYA 513

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVL 263
           NLS+   +  A FW NLH SG+GD  T HA CPVL
Sbjct: 514 NLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVL 548


>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
 gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
          Length = 487

 Score =  176 bits (447), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 91/241 (37%), Positives = 145/241 (60%), Gaps = 7/241 (2%)

Query: 26  LKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKE 85
           +++EP    N+ P         +E  CRG+   P    A+L C Y   + P+LRL PLK 
Sbjct: 228 IRNEP----NIKPKPFNKSVGDFERGCRGEF--PALTDAKLYCIYNTTSSPFLRLAPLKM 281

Query: 86  EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE 145
           E   L P ++LY DV+  +EI  +++MA+P+L+RA V N      +++  R +K AW  +
Sbjct: 282 ELIGLDPYMVLYHDVISPNEIAELQEMAKPQLKRARVYNSTKNTDQLSKTRTAKLAWFLD 341

Query: 146 PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
             + + ER+++R+  MT    + +E LQV+NYG+GG+Y  H+D+    +      +  G+
Sbjct: 342 TFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTKGPHITQIN-GD 400

Query: 206 RVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           R+ATVLFY++DV QGGATVF  +  +++P++G+A  W+NL   G+G+  T HA CPV+ G
Sbjct: 401 RIATVLFYLNDVEQGGATVFPEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAGCPVIVG 460

Query: 266 S 266
           S
Sbjct: 461 S 461


>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
           griseus]
          Length = 509

 Score =  176 bits (447), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 109/272 (40%), Positives = 152/272 (55%), Gaps = 19/272 (6%)

Query: 4   PTHQRAQGNKLYY-----QEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y     Q  L  + E   + P V N+        R+ YE LC+   + 
Sbjct: 223 PDNKRMARNVLKYERLLSQNTLQMATETVIQRPNVPNL------QTRDTYEGLCQTLGSQ 276

Query: 59  PPAIVA-QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P      +L C Y   + PYL L P ++E  +L+P + LY D + D+E   I+++A+P L
Sbjct: 277 PTHYQNPRLYCSYETNSSPYLLLQPARKEVIHLRPFVALYHDFVSDAEAQKIRELAEPWL 336

Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQV 174
           +R+ V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQV
Sbjct: 337 QRSVV---ASGEKQLPVEYRISKSAWLKDTVDPMLGTLDHRIAALTGLDIQPPYAEYLQV 393

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+  
Sbjct: 394 VNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSAVEAGGATAFIYANFSVPV 452

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            K  A FW NLH SG+GD  T HA CPVL G 
Sbjct: 453 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGD 484


>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
 gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
          Length = 496

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 95/215 (44%), Positives = 129/215 (60%), Gaps = 9/215 (4%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG+       ++ L C Y     P+L L P+K E   L P II++ DV+   EID ++K
Sbjct: 267 CRGEFVG----ISNLYCVYKFGTSPFLLLAPIKMEIRLLNPFIIVFHDVLSPREIDELQK 322

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+P L R TV  +K  E +  + R SK  W+    + + +RI RR+  M  L    +E 
Sbjct: 323 LARPLLERTTVVKFKKYEKD--SRRTSKGTWIERDHNNLTKRIERRITDMVELDLRYSEP 380

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
            QV+NYG+GGHY  H DF   G+  A K     +R+ATVLFY++DV QGGATVFT LN +
Sbjct: 381 FQVMNYGLGGHYAAHEDFL--GDTWADKK-EEDDRIATVLFYLTDVEQGGATVFTILNQA 437

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + P++GTA FW+NLH +G GD  T H  CPVL GS
Sbjct: 438 VSPKRGTALFWYNLHRNGTGDTRTLHGGCPVLVGS 472


>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
          Length = 404

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 108/268 (40%), Positives = 152/268 (56%), Gaps = 11/268 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
           P ++R   N L Y+  L ++  ++  E        P L+   R+ YE LC+   + P   
Sbjct: 118 PDNKRMARNVLKYERLLAENGHQMAAETAIQRPNVPHLQT--RDTYEGLCQTLGSQPTHY 175

Query: 63  -VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
            +  L C Y   + PYL L P ++E  +L+P I LY D + D E   I+++A+P L+R+ 
Sbjct: 176 QIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSV 235

Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQVVNYG 178
           V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQVVNYG
Sbjct: 236 V---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 292

Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           IGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+   K  
Sbjct: 293 IGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNA 351

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           A FW NLH SG+GD  T HA CPVL G 
Sbjct: 352 ALFWWNLHRSGEGDGDTLHAGCPVLVGD 379


>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
          Length = 542

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 108/269 (40%), Positives = 152/269 (56%), Gaps = 11/269 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
           P ++R   N L Y+  L ++  ++  E        P L+   R+ YE LC+   + P   
Sbjct: 256 PDNKRMARNVLKYERLLAENGHQMAAETAIQRPNVPHLQT--RDTYEGLCQTLGSQPTHY 313

Query: 63  -VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
            +  L C Y   + PYL L P ++E  +L+P I LY D + D E   I+++A+P L+R+ 
Sbjct: 314 QIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSV 373

Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYG 178
           V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQVVNYG
Sbjct: 374 V---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 430

Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           IGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+   K  
Sbjct: 431 IGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNA 489

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           A FW NLH SG+GD  T HA CPVL G  
Sbjct: 490 ALFWWNLHRSGEGDGDTLHAGCPVLVGDK 518


>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
          Length = 542

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 108/269 (40%), Positives = 152/269 (56%), Gaps = 11/269 (4%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
           P ++R   N L Y+  L ++  ++  E        P L+   R+ YE LC+   + P   
Sbjct: 256 PDNKRMARNVLKYERLLAENGHQMAAETAIQRPNVPHLQT--RDTYEGLCQTLGSQPTHY 313

Query: 63  -VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
            +  L C Y   + PYL L P ++E  +L+P I LY D + D E   I+++A+P L+R+ 
Sbjct: 314 QIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSV 373

Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYG 178
           V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQVVNYG
Sbjct: 374 V---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 430

Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           IGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+   K  
Sbjct: 431 IGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYGNFSVPVVKNA 489

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           A FW NLH SG+GD  T HA CPVL G  
Sbjct: 490 ALFWWNLHRSGEGDGDTLHAGCPVLVGDK 518


>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
          Length = 490

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 100/230 (43%), Positives = 139/230 (60%), Gaps = 19/230 (8%)

Query: 39  TLEVTEREKYEMLCRGDLTVPPAIVAQ-LKCRY-VHRNVPYLRLMPLKEEEAYLQPRIIL 96
           TL       YE LCRG++    +   + L CRY      P L   P+KEEE + +P+II 
Sbjct: 253 TLNTQSNNSYEALCRGEVDERTSKRQRALSCRYSTGGGNPRLMYAPVKEEELWDEPKIIR 312

Query: 97  YRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISR 156
           Y DV+ D+EI+ +K +A+P L R+     +TG   I++ R S+S +L E     + RIS+
Sbjct: 313 YHDVISDTEIETLKDIARPELTRS-----QTGWGVISDIRTSQSVFLEEV--GTVARISQ 365

Query: 157 RVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSD 216
           R+  +TGL+  +AE+L V NYGIGG Y PH+D     E N         R AT L YMSD
Sbjct: 366 RIADITGLSVESAEKLHVQNYGIGGRYTPHFDTG--DEVN--------ERTATFLIYMSD 415

Query: 217 VAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           V  GGATVFT++ +++ PEKG+A FW+NLH +G+ D  T+HA CPVL G+
Sbjct: 416 VEVGGATVFTNVGVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGN 465


>gi|195159319|ref|XP_002020529.1| GL14044 [Drosophila persimilis]
 gi|194117298|gb|EDW39341.1| GL14044 [Drosophila persimilis]
          Length = 536

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 96/247 (38%), Positives = 135/247 (54%), Gaps = 14/247 (5%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E   YE +CRG+LT  P     L+CR   R   Y    P K EE +  P I+   D++  
Sbjct: 283 EFRMYEQVCRGELTPSPTAQRHLRCRLQRRRFDY---APFKLEELHADPPIVQVHDMVSQ 339

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            E   ++  A+PR++R+TV N        A +R S+ A     ++   +R+S+ V  ++G
Sbjct: 340 RESLFLQNAARPRIQRSTVYNQAGAGTTAAAFRTSQGASFNYSQYATTQRLSQHVADLSG 399

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
           L    AE LQ+ NYGIGGHYEPH+D + P      +    GNR+AT ++Y+SDV  GG T
Sbjct: 400 LDMDYAENLQIANYGIGGHYEPHWD-SFPEHHEYPEDDLYGNRLATAIYYLSDVVAGGGT 458

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC---------- 273
            F  L L + PE+G+  FW+NLH SGD D+ T+HAACPVL GS  + +            
Sbjct: 459 AFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDRVR 518

Query: 274 PCGLRRG 280
           PC L+R 
Sbjct: 519 PCDLQRN 525


>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
          Length = 508

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 100/230 (43%), Positives = 139/230 (60%), Gaps = 19/230 (8%)

Query: 39  TLEVTEREKYEMLCRGDLTVPPAIVAQ-LKCRY-VHRNVPYLRLMPLKEEEAYLQPRIIL 96
           TL       YE LCRG++    +   + L CRY      P L   P+KEEE + +P+II 
Sbjct: 271 TLNTQSNNSYEALCRGEVDERTSKRQRALSCRYSTGGGNPRLMYAPVKEEELWDEPKIIR 330

Query: 97  YRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISR 156
           Y DV+ D+EI+ +K +A+P L R+     +TG   I++ R S+S +L E     + RIS+
Sbjct: 331 YHDVISDTEIETLKDIARPELTRS-----QTGWGVISDIRTSQSVFLEEV--GTVARISQ 383

Query: 157 RVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSD 216
           R+  +TGL+  +AE+L V NYGIGG Y PH+D     E N         R AT L YMSD
Sbjct: 384 RIADITGLSVESAEKLHVQNYGIGGRYTPHFDTG--DEVN--------ERTATFLIYMSD 433

Query: 217 VAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           V  GGATVFT++ +++ PEKG+A FW+NLH +G+ D  T+HA CPVL G+
Sbjct: 434 VEVGGATVFTNVGVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGN 483


>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
 gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
          Length = 144

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 81/128 (63%), Positives = 99/128 (77%), Gaps = 4/128 (3%)

Query: 139 KSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAF 198
           +S WLR+ E  +++RIS RV+  +GL  +T+E+LQVVNYGIGGHYEPHYDFAR    + F
Sbjct: 2   RSGWLRDEEDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHYDFAR----DKF 57

Query: 199 KSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHA 258
            SLGTGNR+AT L Y+SDV  GG TVFT +  ++WP+KG AAFW+NL  SGDGD  TRHA
Sbjct: 58  TSLGTGNRIATFLSYLSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHA 117

Query: 259 ACPVLTGS 266
           ACPVL GS
Sbjct: 118 ACPVLVGS 125


>gi|195505244|ref|XP_002099420.1| GE10895 [Drosophila yakuba]
 gi|194185521|gb|EDW99132.1| GE10895 [Drosophila yakuba]
          Length = 533

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 93/229 (40%), Positives = 138/229 (60%), Gaps = 11/229 (4%)

Query: 46  EKYEMLCRG--DLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E +  LCR         +  ++L CRY     P+LRL PL+ EE  L P ++LY +V+ D
Sbjct: 272 ESFNQLCRSVSRRHASESKPSRLHCRYNATTTPFLRLAPLRMEELSLDPYVVLYHNVLSD 331

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEH-PVIERISRRV 158
            EI+ ++ M++P L RA V   + G  EI   R +  AWL     EPE   V+ RI RR+
Sbjct: 332 PEIEKLQLMSEPFLERAKVFRVEKGSDEIGASRAADGAWLPHQETEPEDLEVLNRIGRRI 391

Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
             +TGL+T +  ++Q++ YG GGH+ PH+D+    ++        G+R+ATVLFY+++V 
Sbjct: 392 GDITGLSTRSGRQMQLLKYGFGGHFTPHFDYF---DSKTLYLEKVGDRIATVLFYLNNVE 448

Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
            GGATVF S+NL++  +KG+A FWHNL   S D D  T H ACP+++G+
Sbjct: 449 HGGATVFPSINLAVPTQKGSALFWHNLDGQSYDYDTRTFHGACPLISGT 497


>gi|198449648|ref|XP_001357666.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
 gi|198130700|gb|EAL26801.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
          Length = 536

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 96/247 (38%), Positives = 134/247 (54%), Gaps = 14/247 (5%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E   YE +CRG+LT  P     L+CR   R   Y    P K EE +  P I+   D++  
Sbjct: 283 EFRMYEQVCRGELTPSPTAQRHLRCRLQRRRFDY---APFKLEELHADPPIVQVHDMVSQ 339

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            E   ++  A+PR++R+TV N        A +R S+ A     ++   +R+S+ V  ++G
Sbjct: 340 RESLFLQNAARPRIQRSTVYNQAGAGTTAAAFRTSQGASFNYSQYATTQRLSQHVADLSG 399

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
           L    AE LQ+ NYGIGGHYEPH+D + P      +    GNR+AT ++Y+SDV  GG T
Sbjct: 400 LDMDYAENLQIANYGIGGHYEPHWD-SFPEHHEYPEDDLYGNRLATAIYYLSDVVAGGGT 458

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC---------- 273
            F  L L + PE+G+  FW+NLH SGD D+ T+HAACPVL GS  + +            
Sbjct: 459 AFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDRVR 518

Query: 274 PCGLRRG 280
           PC L R 
Sbjct: 519 PCDLHRN 525


>gi|442747091|gb|JAA65705.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 533

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 89/225 (39%), Positives = 136/225 (60%), Gaps = 6/225 (2%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           E Y+ LCRG+    P + +QL+CRY      + +L P+K EE  L+P +++ RD++ D +
Sbjct: 280 ENYKRLCRGEQLRTPKMDSQLRCRYYTGETGFFKLQPIKLEEYNLKPYVVVLRDLLQDRD 339

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           ++ +   A+PRL ++  +     + +    R S + WL + + PV  R+++ ++ + GL 
Sbjct: 340 LNDMIAFAKPRLEQS--KTLCAADKDGPPPRTSSNTWLDDDDAPVAARVNQYLQSLLGLG 397

Query: 166 T----STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
           T      AE+ Q+ NYGIGGHY PH+D+      ++ K    G+RVAT++ YMSDV +GG
Sbjct: 398 TLYGKDEAEKYQLANYGIGGHYVPHHDYLEESLTSSKKHRLFGDRVATLMIYMSDVEEGG 457

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           ATVF SL + + P KG A FW N+ SS +GD  T HA CPVL GS
Sbjct: 458 ATVFPSLGVRVSPRKGDAVFWWNIKSSWEGDVLTWHAGCPVLYGS 502


>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
          Length = 212

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 85/188 (45%), Positives = 126/188 (67%), Gaps = 3/188 (1%)

Query: 80  LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
           + P K EEA L P I++Y + + D EI+ I ++++P L+R+ V   ++   E++N R S+
Sbjct: 1   IAPFKLEEASLDPLIVIYHNAISDKEIEQIIQVSKPMLKRSMVG--ESFSKEVSNERTSQ 58

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARP-GEANAF 198
           +AWL + +  +++ +S R E MTGL   + E LQV NYGIGG Y PH+D+ R  G    +
Sbjct: 59  NAWLADYDFELVKVLSLRTEDMTGLDRKSYESLQVNNYGIGGFYLPHFDWVRTNGTEEPY 118

Query: 199 KSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHA 258
           K +G GNR+AT+++Y+SDV QGGATVF  + + ++P+KG+A FW+NL   G GD  T H 
Sbjct: 119 KDMGLGNRIATLMYYLSDVEQGGATVFPQIGVGVFPKKGSAIFWYNLLPDGTGDERTLHG 178

Query: 259 ACPVLTGS 266
           ACPVL GS
Sbjct: 179 ACPVLLGS 186


>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 285

 Score =  174 bits (442), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 103/225 (45%), Positives = 128/225 (56%), Gaps = 8/225 (3%)

Query: 45  REKYEMLCRGDLTVPPAIVA-QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           R+ YE LCR   + P      QL C       P L L P + E   LQP ++LY D + D
Sbjct: 39  RDTYERLCRTRGSQPTHFENPQLFCDNFANGHPGLLLRPARRETLSLQPYVVLYHDFISD 98

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMT 162
           +E + IK  AQ  LRR+ V    T + ++ A YRISKSAWL+      + R+ +R+  +T
Sbjct: 99  TEAEEIKHHAQLGLRRSVV---ATRDKQVTAEYRISKSAWLKGSAQSAVSRLDQRISMLT 155

Query: 163 GLTTST--AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
           GL       E LQVVNYGIGGHYEPH+D A    +  FK L TGNRVATV+ Y+S V  G
Sbjct: 156 GLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFK-LKTGNRVATVMIYLSSVEAG 214

Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           G+T F   N S+   K  A FW NLH +G GD  T HA CPVL G
Sbjct: 215 GSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLIG 259


>gi|281362877|ref|NP_733393.3| CG31016, isoform B [Drosophila melanogaster]
 gi|442621939|ref|NP_001263119.1| CG31016, isoform C [Drosophila melanogaster]
 gi|272477249|gb|AAF57071.5| CG31016, isoform B [Drosophila melanogaster]
 gi|440218076|gb|AGB96498.1| CG31016, isoform C [Drosophila melanogaster]
          Length = 536

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 100/251 (39%), Positives = 146/251 (58%), Gaps = 18/251 (7%)

Query: 26  LKDEP-PKVNNVAPTLEVTER-EKYEMLCRGD--LTVPPAIVAQLKCRYVHRNVPYLRLM 81
           L+++P P +N     LE  E  E +  LCR      +  +  ++L CRY     P+L+L 
Sbjct: 258 LRNKPKPSIN-----LESWESDESFNQLCRSSSRRQMGESKPSRLHCRYNTITTPFLKLA 312

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P + EE  L P +I Y +V+ D+EI+ +K M +P L RA V   + G  EI   R +  A
Sbjct: 313 PFRMEELSLDPYVIFYHNVLSDAEIEKLKPMGKPFLERAKVFRVEKGSDEIDPSRSADGA 372

Query: 142 WL----REPEH-PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEAN 196
           WL     +P+   V+ RI RR+E MTGL T +  ++Q + YG GGH+ PHYD+     + 
Sbjct: 373 WLPHQNIDPDDLEVLNRIGRRIEDMTGLNTRSGSKMQFLKYGFGGHFVPHYDYFN---SK 429

Query: 197 AFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL-HSSGDGDYYT 255
            F     G+R+ATVLFY+++V  GGATVF  LNL++  +KG+A FWHN+   S D D  T
Sbjct: 430 TFSLETVGDRIATVLFYLNNVDHGGATVFPKLNLAVPTQKGSALFWHNIDRKSYDYDTRT 489

Query: 256 RHAACPVLTGS 266
            H ACP+++G+
Sbjct: 490 FHGACPLISGT 500


>gi|194905305|ref|XP_001981170.1| GG11767 [Drosophila erecta]
 gi|190655808|gb|EDV53040.1| GG11767 [Drosophila erecta]
          Length = 536

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 96/242 (39%), Positives = 145/242 (59%), Gaps = 22/242 (9%)

Query: 39  TLEVTEREK-YEMLCR-------GDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYL 90
           +LE  E E+ +  LCR       GD     +  ++L CRY     P+LRL+PL+ EE  L
Sbjct: 267 SLECCESEESFNHLCRSVSRRQAGD-----SKPSRLHCRYNTTTRPFLRLVPLRMEELSL 321

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH-- 148
            P ++LY +V+ D EI+ +K M++P L RA V   + G  E+A  R +  AWL +PE   
Sbjct: 322 DPYVVLYHNVLSDPEIEKLKLMSEPFLERAKVYRVEKGSDEVAPSRSADGAWLPDPETEP 381

Query: 149 ---PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
                + RI RR+  +TGL+T +  ++Q++ YG GGH+ PHYD+    ++        G+
Sbjct: 382 EDLETLNRIGRRIGDITGLSTCSGSQMQLLKYGFGGHFVPHYDYF---DSKTSYLEAVGD 438

Query: 206 RVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLT 264
           R+ATVLFY+++V  GGAT F ++NL++  +KG+A FWHNL   S D D  T H ACP+++
Sbjct: 439 RIATVLFYLNNVDHGGATAFPNINLAVPTQKGSALFWHNLDGKSYDYDTRTFHGACPLIS 498

Query: 265 GS 266
           G+
Sbjct: 499 GT 500


>gi|159884097|gb|ABX00727.1| IP12176p [Drosophila melanogaster]
          Length = 538

 Score =  174 bits (441), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 93/229 (40%), Positives = 135/229 (58%), Gaps = 11/229 (4%)

Query: 46  EKYEMLCRGD--LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E +  LCR      +  +  ++L CRY     P+L+L P + EE  L P +I Y +V+ D
Sbjct: 277 ESFNQLCRSSSRRQMGESKPSRLHCRYNTITTPFLKLAPFRMEELSLDPYVIFYHNVLSD 336

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEH-PVIERISRRV 158
           +EI+ +K M +P L RA V   + G  EI   R +  AWL     +P+   V+ RI RR+
Sbjct: 337 AEIEKLKPMGKPFLERAKVFRVEKGSDEIDPSRSADGAWLPHQNIDPDDLEVLNRIGRRI 396

Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
           E MTGL T +  ++Q + YG GGH+ PHYD+     +  F     G+R+ATVLFY+++V 
Sbjct: 397 EDMTGLNTRSGSKMQFLKYGFGGHFVPHYDYFN---SKTFSLETVGDRIATVLFYLNNVD 453

Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNL-HSSGDGDYYTRHAACPVLTGS 266
            GGATVF  LNL++  +KG+A FWHN+   S D D  T H ACP+++G+
Sbjct: 454 HGGATVFPKLNLAVPTQKGSALFWHNIDRKSYDYDTRTFHGACPLISGT 502


>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
 gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
          Length = 534

 Score =  173 bits (439), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 95/229 (41%), Positives = 134/229 (58%), Gaps = 21/229 (9%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           +++ C G    P     +L C Y     P+LRL PLK E+  L P ++LY +V+   EI 
Sbjct: 287 FKLSCNG----PHESSTRLHCFYNFTTTPFLRLAPLKTEQIGLDPYVVLYHEVLSAREIS 342

Query: 108 -LIKKMAQ----PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
            LI K AQ     R+ R T      G       R +K  WL++  + +  RI+RR+  MT
Sbjct: 343 MLISKAAQNMKNTRVHRETKPKTNRG-------RTAKGHWLKKESNELTRRITRRIVDMT 395

Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEAN-----AFKSLGTGNRVATVLFYMSDV 217
           G   + +E+ QV+NYGIGGHY  H D+     +N     + +S   G+R+ATVLFY+SDV
Sbjct: 396 GFDLADSEDFQVINYGIGGHYFLHMDYFDYASSNYTGPRSRQSKVLGDRIATVLFYLSDV 455

Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            QGGATVF ++  S++P+ GTA FW+NL + G+GD  TRHA+CPV+ GS
Sbjct: 456 EQGGATVFGNVGYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPVIVGS 504


>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
 gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
          Length = 242

 Score =  173 bits (439), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 89/230 (38%), Positives = 133/230 (57%), Gaps = 10/230 (4%)

Query: 60  PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
           PA   +L+CR    N       P + EE +L P +I   D++   E  +++++A+P L+R
Sbjct: 1   PATQRKLRCRLHRGNGLRSSYQPYRLEELHLDPYVIQVHDIISAEETIVLQQLARPELQR 60

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
           + V +    E    N+RIS+  +    EHP+++R+S+ +E+++GL   +AE+LQV NYGI
Sbjct: 61  SMVYSLSNSEHISTNFRISQGTFFEYHEHPIMQRMSQHLENISGLDMRSAEQLQVANYGI 120

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GGHYEPH D           +  + NRVAT ++Y+S+V  GG T F  L L + PE+G+ 
Sbjct: 121 GGHYEPHMDSFSENHNYGINTYMSTNRVATGIYYLSNVEAGGGTAFPFLPLLVEPERGSL 180

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGLRR 279
            FW+NLH SGD DY T+HA CPVL GS  + +            PC L+R
Sbjct: 181 LFWYNLHRSGDLDYRTKHAGCPVLMGSKWIANVWIRLSNQDHIRPCDLQR 230


>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
 gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
          Length = 539

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 95/243 (39%), Positives = 141/243 (58%), Gaps = 13/243 (5%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           Y+ +CR +L   PA + +L+CR    N       P K EE +L P II   DV+   +  
Sbjct: 287 YQQVCREELRPAPAALRELRCRLFAGNGRKSTYAPYKLEELHLDPYIIQVHDVISARDTA 346

Query: 108 LIKKMAQPRLRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
            ++ +A+P L+R+ V + +TG   I AN+R S+       +HP+++++S  V  ++GL  
Sbjct: 347 ELQHLARPELQRSQVYS-RTGHEHISANFRTSQGTTFEYTDHPIMQKMSHHVAEISGLDM 405

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
            +AE LQ+ NYGIGGHYEPH D + P   +   ++   NR+AT ++Y+S+V  GG T F 
Sbjct: 406 RSAEPLQIANYGIGGHYEPHMD-SFPDSYDYSLNMYKTNRLATGIYYLSNVEAGGGTAFP 464

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCG 276
            L L + PE+G+  FW+NLH SGD DY T+HAACPVL GS  + +            PC 
Sbjct: 465 FLPLLVTPERGSLLFWYNLHPSGDADYRTKHAACPVLQGSKWIANVWIRLSNQDHVRPCE 524

Query: 277 LRR 279
           L+R
Sbjct: 525 LQR 527


>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
 gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
          Length = 510

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 91/205 (44%), Positives = 124/205 (60%), Gaps = 14/205 (6%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT--VQ 123
           L C Y     P+LRL PLK E     P +++Y DV+ DSEI  I +MA+ R+ R +   Q
Sbjct: 293 LHCCYNFTTTPFLRLAPLKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQ 352

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
             +T     +  R +  AWL+   + +  RI+RRV  M+GL    +E +QV+NYGIGGHY
Sbjct: 353 PNRTS----SPTRTAMGAWLKRSSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHY 408

Query: 184 EPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
            PH D F +  E         GNR+ATVLFY++DV QGGAT+F      + P +GTA FW
Sbjct: 409 VPHKDWFTQHPEV-------MGNRLATVLFYLTDVEQGGATMFNKAEHKVLPRRGTALFW 461

Query: 243 HNLHSSGDGDYYTRHAACPVLTGSN 267
           +NLH+ G+GD+ T HAACP++ GS 
Sbjct: 462 YNLHTDGEGDWSTTHAACPIIVGSK 486


>gi|442751927|gb|JAA68123.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 522

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 98/267 (36%), Positives = 148/267 (55%), Gaps = 22/267 (8%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
           E  E   Y+ LCRG++   P + ++L+CRY      +  L P+K EE  L+P II+ RDV
Sbjct: 258 EDQEEHNYKRLCRGEVLRTPKMDSKLRCRYYKGQDGFFTLRPIKLEEINLKPYIIVMRDV 317

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
           + + +I+ +   A+PRL+R+T   Y       +  R S +AWL + E P+  R++  +  
Sbjct: 318 VQERDIEDLMAFAEPRLQRSTT--YTGDGNAPSTRRTSSNAWLWDDEAPIANRMNWYLRA 375

Query: 161 MTGLTTS----TAEELQVVNYGIGGHYEPHYDF------ARPGEANAFKSLGTGNRVATV 210
           + GL TS     AE  Q+ NYG GG++ PH+D+      A    A+ +     G+R+AT+
Sbjct: 376 LVGLGTSGSDYEAEAYQLANYGSGGYFLPHHDYLQDTLHAHNSTADYYLQNKEGDRLATL 435

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLH 270
           + YM+DV  GGATVF  L + L P+KG AAFW NL +SG+GD  T HA CPVL GS  + 
Sbjct: 436 MIYMTDVEVGGATVFPRLGVRLVPKKGDAAFWWNLKASGEGDTLTMHAGCPVLYGSKWIA 495

Query: 271 ST----------CPCGLRRGLQRSGII 287
           +            PC + R +  + ++
Sbjct: 496 NKWFKSYSNVFRLPCSIDRNVSLAPLV 522


>gi|321461762|gb|EFX72791.1| hypothetical protein DAPPUDRAFT_308081 [Daphnia pulex]
          Length = 561

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 95/234 (40%), Positives = 139/234 (59%), Gaps = 10/234 (4%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
           E  E E YE LCRG+      I A L+CR V R  P L L P+K EE  L P I++  D+
Sbjct: 298 EDEENEHYERLCRGEKLRSANIEAGLRCRLVTRGHPALLLQPIKVEEQSLDPMIVVLHDL 357

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
           + + + ++++++ +P+L   ++     G+   +  R SK+AWL+E E+  +  I  R+E 
Sbjct: 358 ITERQTEILRQLGEPKLA-TSLHRGGEGKFVRSMIRTSKNAWLQEHENASLPAIRHRMEL 416

Query: 161 MTGLT---TSTAEELQVVNYGIGGHYEPHYDFA-----RPGEANAFKSLGTGNRVATVLF 212
            TGL     + +E  Q+ NYGIGG Y+ H D       RP + + + +L  G+R+AT++ 
Sbjct: 417 ATGLIYGPETASEYFQIANYGIGGLYKTHTDNVIHPDVRPEDQDPW-NLYVGDRIATLMV 475

Query: 213 YMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           Y+SDV  GGATVF    ++ WP KG+AAFW NL+ SG+ D  TRH ACPVL GS
Sbjct: 476 YLSDVEAGGATVFPRAGVTCWPRKGSAAFWWNLYKSGEPDLTTRHGACPVLHGS 529


>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
 gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
          Length = 535

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 91/205 (44%), Positives = 124/205 (60%), Gaps = 14/205 (6%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT--VQ 123
           L C Y     P+LRL PLK E     P +++Y DV+ DSEI  I +MA+ R+ R +   Q
Sbjct: 318 LHCCYNFTTTPFLRLAPLKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQ 377

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
             +T     +  R +  AWL+   + +  RI+RRV  M+GL    +E +QV+NYGIGGHY
Sbjct: 378 PNRTS----SPTRTALGAWLKRSSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHY 433

Query: 184 EPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
            PH D F +  E         GNR+ATVLFY++DV QGGAT+F      + P +GTA FW
Sbjct: 434 VPHKDWFTQHPEV-------MGNRLATVLFYLTDVEQGGATMFNKAEHKVLPRRGTALFW 486

Query: 243 HNLHSSGDGDYYTRHAACPVLTGSN 267
           +NLH+ G+GD+ T HAACP++ GS 
Sbjct: 487 YNLHTDGEGDWSTTHAACPIIVGSK 511


>gi|195452730|ref|XP_002073475.1| GK13125 [Drosophila willistoni]
 gi|194169560|gb|EDW84461.1| GK13125 [Drosophila willistoni]
          Length = 539

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 94/244 (38%), Positives = 139/244 (56%), Gaps = 16/244 (6%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           YE +CRG+         +L+CR     + Y     L+ EE +  P ++   +++   +++
Sbjct: 288 YEQVCRGETRPSAKSQRELRCRLQRSRLSY---EVLELEELHQDPFVVQVHNIVSQKDMN 344

Query: 108 LIKKMAQPRLRRATV--QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           L++K+A+P ++R+ V  Q++   E   A YR SK A     EH  +E +SR V  ++GL 
Sbjct: 345 LLQKIARPNIQRSQVYAQDHNANETVAAAYRTSKGATFEYFEHRSMELLSRHVADLSGLD 404

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
            ++AE LQ+ NYGIGGHYEPH+D   P           GNR+AT ++Y+S+V  GG T F
Sbjct: 405 MNSAELLQIANYGIGGHYEPHWD-CFPDHHVYLPDDRDGNRIATGIYYLSEVEAGGGTAF 463

Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PC 275
             L L + PE+G+  FW+NLH SGD DY T+HAACPVL GS  + +            PC
Sbjct: 464 PFLPLLVTPERGSLVFWYNLHRSGDQDYRTKHAACPVLQGSKWIANVWIRQSNQDQIRPC 523

Query: 276 GLRR 279
           GL+R
Sbjct: 524 GLQR 527


>gi|335294484|ref|XP_003357239.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Sus scrofa]
          Length = 545

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 106/273 (38%), Positives = 153/273 (56%), Gaps = 18/273 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y++ L +S      E   + P +    P L+   R+ YE LC+   + 
Sbjct: 258 PDNKRMARNVLKYEKLLAESASQAVAETVIQRPNI----PHLQT--RDTYEGLCQTLGSQ 311

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y   + PYL L P+++E  +L+P ++LY D + D+E   I+ +A+P +
Sbjct: 312 PTHYQIPSLYCSYETSSSPYLLLQPIRKEVIHLEPYVVLYHDFVTDAEAQKIRGLAEPWV 371

Query: 118 RRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQV 174
               +    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQV
Sbjct: 372 TAEIL--VASGEKQLPVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQV 429

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
           VNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+  
Sbjct: 430 VNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYGNFSVPV 488

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            K  A FW NLH SG+GD  T HA CPVL G  
Sbjct: 489 VKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDK 521


>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
           precursor (4-PH alpha-1)
           (Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1) [Ciona intestinalis]
          Length = 527

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 100/267 (37%), Positives = 151/267 (56%), Gaps = 13/267 (4%)

Query: 4   PTHQRAQGNKL-YYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
           P + R  GN++ + + A  ++ E+ D+     +V  T        +  LCRG+ T+    
Sbjct: 236 PENTRILGNRIRFTRHARVQTQEVVDDKFYTFSVDET--------FFKLCRGEQTLTKKK 287

Query: 63  V-AQLKCRYVHRNV--PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
              +L+C Y+  N+  P L + P+K EE    P I+ + DV+ D+ I+ IKK+A+P+L R
Sbjct: 288 QHKKLRC-YLSTNMGNPKLLIRPVKVEELSKSPDIVQFHDVLSDTVINEIKKLAKPQLFR 346

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
           A        +L+ A YRI+K AWL + + P + +I+ R+  +TGLT +T+EE+QV NYG+
Sbjct: 347 AIHAGSDDTDLQKAPYRITKLAWLLDDDGPEVAKITERISDITGLTLNTSEEIQVANYGV 406

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GG Y PH+D     E         G R+AT L Y+SDV  GG T F +  +S  P KG+A
Sbjct: 407 GGEYPPHFDIPTTDEERDDLKSQDGERIATFLIYLSDVEVGGRTAFVNAGVSAKPIKGSA 466

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
            FW+N+  SG+ D  T H ACPV  G+
Sbjct: 467 VFWYNVFPSGEPDLRTYHGACPVAFGN 493


>gi|67084101|gb|AAY66985.1| truncated prolyl 4-hydroxylase alpha subunit [Ixodes scapularis]
          Length = 452

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 99/265 (37%), Positives = 147/265 (55%), Gaps = 24/265 (9%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E   Y  LCRG+    P + ++L+CRY      +  L P+K E+  L+P II+ RDV+ +
Sbjct: 191 EELNYRRLCRGEALRTPQMDSKLRCRYYKGQDGFFTLHPIKLEKINLKPYIIVMRDVVQE 250

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYR-ISKSAWLREPEHPVIERISRRVEHMT 162
            +I+ +   A+PRL+R+T     TG+    + R  S +AWL + E P+  R++  +  + 
Sbjct: 251 RDIENLMAFAEPRLQRSTTY---TGDGNAPSTRQTSSNAWLWDDEAPIANRMNWYLRALV 307

Query: 163 GLTTS----TAEELQVVNYGIGGHYEPHYDF------ARPGEANAFKSLGTGNRVATVLF 212
           GL TS     AE  Q+ NYG GG++ PHYD+      A    A+ +     G+R+AT++ 
Sbjct: 308 GLGTSGSEYEAEAYQLANYGSGGYFLPHYDYLQDTLHAHNSTADYYLQNNEGDRLATLMI 367

Query: 213 YMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHST 272
           YM+DV +GGATVF  L + L P+KG AAFW NL +SG+GD  T HA CPVL GS  + + 
Sbjct: 368 YMTDVKEGGATVFPRLGVRLVPKKGDAAFWWNLKASGEGDTLTMHAGCPVLYGSKWIANK 427

Query: 273 ----------CPCGLRRGLQRSGII 287
                      PC   R L  + ++
Sbjct: 428 WFKSYSNVFRLPCSTDRNLSLAPLV 452


>gi|195575115|ref|XP_002105525.1| GD21527 [Drosophila simulans]
 gi|194201452|gb|EDX15028.1| GD21527 [Drosophila simulans]
          Length = 495

 Score =  171 bits (432), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 93/229 (40%), Positives = 126/229 (55%), Gaps = 27/229 (11%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
           +V E + Y + C G   + P     L+C YV    P+L + PLK EE +  P ++LY DV
Sbjct: 245 QVGEFQAYSLTCSGHWQLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDV 304

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
           +Y SEID+I+K+ + RL RAT+ ++   E  ++N R S+  ++    H V+  I +RV  
Sbjct: 305 IYQSEIDVIRKLTKNRLMRATITSH--NESVVSNVRTSQFTFIPVTAHKVLSTIDQRVAD 362

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY---MSDV 217
           MT L    AE+ Q  NYGIGGHY  H D+                      FY   +SDV
Sbjct: 363 MTNLNMKYAEDHQFANYGIGGHYGQHMDW----------------------FYQTTLSDV 400

Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           AQGG T F  L   L P+K  AAFWHNLH+SG GD  T+H ACP++ GS
Sbjct: 401 AQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGS 449


>gi|292621357|ref|XP_691737.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Danio rerio]
          Length = 538

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 97/228 (42%), Positives = 129/228 (56%), Gaps = 12/228 (5%)

Query: 45  REKYEMLCRGDLTVPPAIVA-QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           R  YE LC+   + P       L C Y     P L L P++ E   LQP ++L+   +  
Sbjct: 292 RNAYEQLCQTKGSQPKHFENPSLFCDYFTNGSPALFLQPIRREIISLQPYVVLFHGFVTQ 351

Query: 104 SEIDLIKKMAQPRLRRATV---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
           +E   I+K A P LRR+ V    N  T E     YRISKSAWL+E  H V+ ++ +R+  
Sbjct: 352 AEAKNIRKYAMPGLRRSVVASGMNQATAE-----YRISKSAWLKESAHEVVGKLDQRITL 406

Query: 161 MTGLTTS--TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
           +TGL      AE LQVVNYGIGGHYEPH+D A    +  ++ L TGNRVAT++ Y+S V 
Sbjct: 407 VTGLNVQPPYAEYLQVVNYGIGGHYEPHFDHATSDSSPLYR-LKTGNRVATIMIYLSPVQ 465

Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            GG+T F   N S+   +  A FW NLH +G G+  T HA CPV+ G+
Sbjct: 466 AGGSTAFIYANFSVPVVQNAALFWWNLHKNGQGNVDTLHAGCPVIVGN 513


>gi|195341560|ref|XP_002037374.1| GM12888 [Drosophila sechellia]
 gi|194131490|gb|EDW53533.1| GM12888 [Drosophila sechellia]
          Length = 501

 Score =  170 bits (430), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 93/229 (40%), Positives = 126/229 (55%), Gaps = 27/229 (11%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
           +V E + Y + C G   + P     L+C YV    P+L + PLK EE +  P ++LY DV
Sbjct: 251 QVGEFQAYSLTCSGHWRLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDV 310

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEH 160
           +Y SEID+I+K+ + RL RAT+ ++   E  ++N R S+  ++    H V+  I +RV  
Sbjct: 311 IYQSEIDVIRKLTKNRLMRATITSH--NESVVSNVRTSQITFIPVTAHKVLSTIDQRVAD 368

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY---MSDV 217
           MT L    AE+ Q  NYGIGGHY  H D+                      FY   +SDV
Sbjct: 369 MTNLNMKYAEDHQFANYGIGGHYGQHMDW----------------------FYQTTLSDV 406

Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           AQGG T F  L   L P+K  AAFWHNLH+SG GD  T+H ACP++ GS
Sbjct: 407 AQGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGS 455


>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
          Length = 535

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 99/255 (38%), Positives = 139/255 (54%), Gaps = 15/255 (5%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
           E  E   YE +CRG+L   P+    L+CR     + Y    P K EE +L P ++    V
Sbjct: 278 ESREFRMYEQVCRGELAPLPSKQRNLRCRLRKSRLGY---APFKLEELHLDPLVVQLHQV 334

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
           +   + D ++K A+PR++R+TV +    G    A +R S+ A      +   + +SR V 
Sbjct: 335 IGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAATKLLSRHVG 394

Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
             +GL    AE+LQV NYGIGGHYEPH+D + P      +    GNR+AT ++Y+SDV  
Sbjct: 395 DFSGLNMDYAEDLQVANYGIGGHYEPHWD-SFPENHIYQEGDLHGNRMATGIYYLSDVEA 453

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
           GG T F  L L + PE+G+  FW+NLH SGD D+ T+HAACPVL GS  + +        
Sbjct: 454 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513

Query: 274 ----PCGLRRGLQRS 284
               PC L RG + S
Sbjct: 514 DNVRPCDLERGQEIS 528


>gi|241598362|ref|XP_002404733.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
 gi|215500464|gb|EEC09958.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
          Length = 340

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 90/207 (43%), Positives = 128/207 (61%), Gaps = 8/207 (3%)

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           +QL+CRY      +  L P+K EE  L+P +I+  DV+ D +I+ +   A+PRL R+T  
Sbjct: 3   SQLRCRYYKGQDGFFSLQPIKLEEINLKPYVIVMHDVVQDKDIEDLMAFAEPRLERSTT- 61

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST----AEELQVVNYGI 179
            Y   E+  +  R S +AWL E E P+  R++  +  + G+ TS     AE  Q+ NYG 
Sbjct: 62  -YTGNEMMPSPERTSSTAWLNEDEAPIAVRMNSYLRALLGMGTSDTDEEAEAYQLANYGT 120

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GGH+ PH+DF +     A  S+ TG+R+AT++ YM+DV +GG TVF +L + L P+KG A
Sbjct: 121 GGHFLPHHDFLQ-DSLQADNSV-TGDRLATLMIYMTDVEEGGTTVFPNLGIRLTPKKGDA 178

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
           AFW NL +SGDG+  T HA CPVL GS
Sbjct: 179 AFWWNLKASGDGERLTTHAGCPVLYGS 205


>gi|195341584|ref|XP_002037386.1| GM12898 [Drosophila sechellia]
 gi|194131502|gb|EDW53545.1| GM12898 [Drosophila sechellia]
          Length = 536

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 91/229 (39%), Positives = 135/229 (58%), Gaps = 11/229 (4%)

Query: 46  EKYEMLCRGD--LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E +  LCR      +  +  ++L CRY     P+L+L P + EE  L P ++LY +V+ D
Sbjct: 275 ESFYQLCRSSSRRQMGESKPSRLHCRYNTTTTPFLKLAPFRMEELSLDPYVVLYHNVLSD 334

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----REPEH-PVIERISRRV 158
            EI+ +K M++P L RA V   + G  EIA  R +  AWL     +P+   V+ RI RR+
Sbjct: 335 PEIEKLKPMSKPFLERAKVFRVEKGSDEIAPSRSADGAWLPHQDTDPDDLEVLRRIGRRI 394

Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
           + +TGL T +  ++Q + YG GGH+ PHYD+     +   +    G+R+ATVLFY+++V 
Sbjct: 395 KDLTGLNTRSGSQMQFLKYGFGGHFVPHYDYFNSKTSYLER---VGDRIATVLFYLNNVD 451

Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNL-HSSGDGDYYTRHAACPVLTGS 266
            GGAT F  LNL +  +KG+A FWHNL   S D D  T H ACP+++G+
Sbjct: 452 HGGATAFPKLNLVVPTQKGSALFWHNLDRKSYDYDTCTFHGACPLISGT 500


>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
 gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
          Length = 535

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 96/255 (37%), Positives = 139/255 (54%), Gaps = 15/255 (5%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
           E  E   YE +CRG+L   P+    L+CR     + Y    P K EE +L P ++    V
Sbjct: 278 ESREFRMYEQVCRGELAPLPSKQRDLRCRLWRSRLGY---APFKLEELHLDPPVVQLHQV 334

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
           +   + + +++ A+PR++R+TV +    G+   A +R S+ A      +   + +S  V 
Sbjct: 335 IGSKDAESLQRTARPRIKRSTVYSLAGNGDSTAAAFRTSQGASFNYSRNAATKLLSHHVG 394

Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
             +GL    AE+LQV NYGIGGHYEPH+D + P      +    GNR+AT ++Y+SDV  
Sbjct: 395 DFSGLNMEYAEDLQVANYGIGGHYEPHWD-SFPDNHVYQEGDLHGNRIATAIYYLSDVEA 453

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
           GG T F  L L + PE+G+  FW+NLH SGD D+ T+HAACPVL GS  + +        
Sbjct: 454 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513

Query: 274 ----PCGLRRGLQRS 284
               PC L RG + S
Sbjct: 514 DNVRPCDLERGQEIS 528


>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
 gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
 gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
          Length = 535

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 98/255 (38%), Positives = 139/255 (54%), Gaps = 15/255 (5%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
           E  E   YE +CRG+L   P+    L+CR     + Y    P K EE +L P ++    V
Sbjct: 278 ESREFRMYEQVCRGELAPLPSKQRNLRCRLRKSRLGY---APFKLEELHLDPLVVQLHQV 334

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
           +   + D ++K A+PR++R+TV +    G    A +R S+ A      +   + +SR V 
Sbjct: 335 IGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAATKLLSRHVG 394

Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
             +GL    AE+LQV NYGIGGHYEPH+D + P      +    GNR+AT ++Y++DV  
Sbjct: 395 DFSGLNMDYAEDLQVANYGIGGHYEPHWD-SFPENHIYQEGDLHGNRMATGIYYLADVEA 453

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
           GG T F  L L + PE+G+  FW+NLH SGD D+ T+HAACPVL GS  + +        
Sbjct: 454 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513

Query: 274 ----PCGLRRGLQRS 284
               PC L RG + S
Sbjct: 514 DNVRPCDLERGQEIS 528


>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
          Length = 409

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 97/251 (38%), Positives = 135/251 (53%), Gaps = 15/251 (5%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
           E  E   YE +CRG+L   P+    L+CR     + Y    P K EE +L P ++    V
Sbjct: 152 ESREFRMYEQVCRGELAPLPSKQRNLRCRLRKSRLGY---APFKLEELHLDPLVVQLHQV 208

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
           +   + D ++K A+PR++R+TV +    G    A +R S+ A      +   + +SR V 
Sbjct: 209 IGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAATKLLSRHVG 268

Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
             +GL    AE+LQV NYGIGGHYEPH+D            L  GNR+AT ++Y++DV  
Sbjct: 269 DFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDL-HGNRMATGIYYLADVEA 327

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
           GG T F  L L + PE+G+  FW+NLH SGD D+ T+HAACPVL GS  + +        
Sbjct: 328 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 387

Query: 274 ----PCGLRRG 280
               PC L RG
Sbjct: 388 DNVRPCDLERG 398


>gi|442762205|gb|JAA73261.1| Putative prolyl 4-hydroxylase alpha subunit, partial [Ixodes
           ricinus]
          Length = 482

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 93/235 (39%), Positives = 137/235 (58%), Gaps = 16/235 (6%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E + Y+ LCRG+L   P + ++L+CRY   +     L P+K EE  L+P I++  DV+ D
Sbjct: 221 EMQNYKRLCRGELLRTPKMDSKLRCRYYKGHGGSFTLHPIKLEEVNLKPYIVVMHDVVQD 280

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I+ ++  A+PRL+  T   Y    +E    R S +AW+ E   PV  ++++ +  + G
Sbjct: 281 RDIEDLRAFAEPRLQ--TSLTYDVPGVESPAVRTSSNAWMDEKNAPVATKLNKFLRSLLG 338

Query: 164 LTTS----TAEELQVVNYGIGGHYEPHYDF--------ARPGEANAFKSLGTGNRVATVL 211
           + TS     AE+ Q+ NYG GGH+  H D+          P E    K +G  +RVAT++
Sbjct: 339 MGTSYSDGEAEKYQLANYGTGGHFLTHPDYLGDLFENDTDPSEFEFHKKVG--DRVATLM 396

Query: 212 FYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            YMSDV +GGATVF  L + L P+KG AAFW NL ++G+G+  T HA CPVL GS
Sbjct: 397 IYMSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCPVLYGS 451


>gi|157114983|ref|XP_001658090.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108877085|gb|EAT41310.1| AAEL007032-PA, partial [Aedes aegypti]
          Length = 448

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 90/219 (41%), Positives = 127/219 (57%), Gaps = 27/219 (12%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           YE LCRG+    PA VA L+CRY  +N  +L++ P K EEA L P I++Y + + D EID
Sbjct: 245 YEPLCRGEYQRTPAQVANLRCRYESKNSSFLKIAPFKLEEASLDPLIVIYHNAISDKEID 304

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
            I ++++P L+R+ V   ++   E++N R +        +  +++ +S R E MTGL   
Sbjct: 305 QIIQVSKPMLKRSMVG--ESFSKEVSNERTNY-------DFELVKVLSLRTEDMTGLDRK 355

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
           + E LQV NYGIGG Y PH+D+ R  E                   +SDV QGGATVF  
Sbjct: 356 SYESLQVNNYGIGGFYLPHFDWVRTNEP------------------ISDVEQGGATVFPQ 397

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + + ++P+KG+A FW+NL   G GD  T H ACPVL GS
Sbjct: 398 IGVGVFPKKGSAIFWYNLLPDGTGDERTLHGACPVLLGS 436


>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
 gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
          Length = 578

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 91/219 (41%), Positives = 127/219 (57%), Gaps = 6/219 (2%)

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           ++ CRG    P   + +L C Y     P+LRL P K E   L P ++LY DV+   E   
Sbjct: 344 QLCCRG--GCPYRDMHRLTCSYNTTAAPFLRLAPFKTELLSLAPYMVLYHDVITPLESLT 401

Query: 109 IKKMAQPRL-RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
           +K +++P + RRA   N +     I + R S S WL   E+ V+ER+ RRV  MT     
Sbjct: 402 LKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWLTSHENAVMERLERRVGVMTNFEME 461

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
            +E  Q++NYGIGGHY+PH D     E    +  G G+R+ATVLFY+SDV QGGAT+F  
Sbjct: 462 NSEVYQLINYGIGGHYKPHTDHF---ETPQLEHRGGGDRIATVLFYLSDVPQGGATLFPR 518

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           LN+S+ P +G A  W+NL+  G G+  T H +CP++ GS
Sbjct: 519 LNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGS 557


>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
          Length = 235

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 92/202 (45%), Positives = 127/202 (62%), Gaps = 18/202 (8%)

Query: 66  LKCRY-VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
           L CRY      P L   P+KEEE + +P+II Y DV+ D+EI+ +K +A+P L R+    
Sbjct: 26  LSCRYSTGGGNPRLMYAPVKEEELWDEPKIIRYHDVISDTEIETLKDIARPELTRS---- 81

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
            +TG   I+  R S+S +L E     + RIS+R+  +TGL+  +AE+L V NYGIGG Y 
Sbjct: 82  -QTGWGVISEIRTSQSVFLDEV--GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYT 138

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
           PH+D    G+ N         R AT L YMSDV  GGATVFT++ +++ PEKG+A FW+N
Sbjct: 139 PHFDAG--GDVN--------ERTATFLIYMSDVEVGGATVFTNVGVAVKPEKGSAVFWNN 188

Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
           LH +G+ D  T+HA CPVL G+
Sbjct: 189 LHKNGELDLKTKHAGCPVLVGN 210


>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
          Length = 534

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 89/225 (39%), Positives = 131/225 (58%), Gaps = 12/225 (5%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           +++ C G    P     +L C Y     P+LRL PLK E+  L P ++LY +V+   EI 
Sbjct: 286 FKLSCNG----PLESSTRLHCFYNFTTTPFLRLAPLKTEQIGLDPYVVLYHEVLSAREIS 341

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANY-RISKSAWLREPEHPVIERISRRVEHMTGLTT 166
           ++   A   ++   +  +K   +   N  R +K  WL++  + + +RI+RR+  MTG   
Sbjct: 342 MLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNELTKRITRRIMDMTGFDL 399

Query: 167 STAEELQVVNYGIGGHYEPH---YDFARPGEANAFK--SLGTGNRVATVLFYMSDVAQGG 221
           + +E  QV+NYGIGGHY  H   +DFA     +     S+  G+R+ATVLFY++DV QGG
Sbjct: 400 ADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDLGDRIATVLFYLTDVEQGG 459

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           ATVF  +   + P+ GTA FW+NL + G+GD  TRHAACPV+ GS
Sbjct: 460 ATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGS 504


>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
 gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
          Length = 534

 Score =  166 bits (421), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 89/225 (39%), Positives = 131/225 (58%), Gaps = 12/225 (5%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           +++ C G    P     +L C Y     P+LRL PLK E+  L P ++LY +V+   EI 
Sbjct: 286 FKLSCNG----PLESSTRLHCFYNFTTTPFLRLAPLKTEQIGLDPYVVLYHEVLSAREIS 341

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANY-RISKSAWLREPEHPVIERISRRVEHMTGLTT 166
           ++   A   ++   +  +K   +   N  R +K  WL++  + + +RI+RR+  MTG   
Sbjct: 342 MLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNELTKRITRRIMDMTGFDL 399

Query: 167 STAEELQVVNYGIGGHYEPH---YDFARPGEANAFK--SLGTGNRVATVLFYMSDVAQGG 221
           + +E  QV+NYGIGGHY  H   +DFA     +     S+  G+R+ATVLFY++DV QGG
Sbjct: 400 ADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDLGDRIATVLFYLTDVEQGG 459

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           ATVF  +   + P+ GTA FW+NL + G+GD  TRHAACPV+ GS
Sbjct: 460 ATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGS 504


>gi|444731524|gb|ELW71877.1| Prolyl 4-hydroxylase subunit alpha-3 [Tupaia chinensis]
          Length = 562

 Score =  166 bits (421), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 111/293 (37%), Positives = 154/293 (52%), Gaps = 39/293 (13%)

Query: 4   PTHQRAQGNKLYYQEALNKS-----PELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y+  L +S      E   + P V    P L+   R+ YE LC+   + 
Sbjct: 256 PDNKRMARNILKYERLLAESSNQAVAEAVIQRPNV----PHLQT--RDTYEGLCQTLGSQ 309

Query: 59  PPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           P    +  L C Y   + PYL L P+++E  +L+P I LY D + DSE   I+ +A+P L
Sbjct: 310 PTHYQIPSLYCSYETNSSPYLLLQPVRKELIHLEPYIALYHDFVSDSEAQKIRALAEPWL 369

Query: 118 RRATVQNYKTGELEI-ANYRISK--------------------SAWLREPEHPVIERISR 156
           +R+ V    +GE ++   YRISK                    SAWL++   P++  +  
Sbjct: 370 QRSVV---ASGEKQLQVEYRISKRRRLVVSGIASLMPQSVVYFSAWLKDTVDPMLVTLDH 426

Query: 157 RVEHMTGLTTST--AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYM 214
           R+  +TGL      AE LQVVNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+
Sbjct: 427 RIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYL 485

Query: 215 SDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           S V  GGAT F   N S+   K  A FW NLH SG+G+  T HA CPVL G  
Sbjct: 486 SSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGNSDTLHAGCPVLVGDK 538


>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
 gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
          Length = 513

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 92/220 (41%), Positives = 128/220 (58%), Gaps = 10/220 (4%)

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           ++ CRG    P   + +L C Y     P+LRL P K E   L P ++LY DV+   E   
Sbjct: 281 QLCCRGG--CPYRDMHRLTCSYNTTAAPFLRLAPFKTEILSLSPYMVLYHDVITPLESLT 338

Query: 109 IKKMAQPRL-RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
           +K +++P + RRA   N +     I + R S S WL   E+ V+ER+ RRV  MT     
Sbjct: 339 LKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWLTSHENAVMERLERRVGVMTNFEME 398

Query: 168 TAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
            +E  Q++NYGIGGHY+PH D F  P      +  G G+R+ATVLFY+SDV QGGAT+F 
Sbjct: 399 NSEVYQLINYGIGGHYKPHTDHFETP------QHRGGGDRIATVLFYLSDVPQGGATLFP 452

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            LN+S+ P +G A  W+NL+  G G+  T H +CP++ GS
Sbjct: 453 RLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGS 492


>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
 gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
          Length = 534

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 85/207 (41%), Positives = 124/207 (59%), Gaps = 6/207 (2%)

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
           +L C Y     P+LRL PLK E+  L P ++LY +V+   EI ++   A   ++   V  
Sbjct: 299 RLHCFYNFTTTPFLRLAPLKIEQIGLDPYVVLYHEVLSAREISMLIGKAAQNMKNTRVHK 358

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
            + G  +    R +K  W ++  + + + I+RR+  MTG   + +E  QV+NYGIGGHY 
Sbjct: 359 -EQGVPKKNRGRTAKGFWFKKESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYL 417

Query: 185 PH---YDFARPGEANAFK--SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
            H   +DFA     +     S+  G+R+ATVLFY++DV QGGATVF  +  S++P+ GTA
Sbjct: 418 LHMDYFDFASSNHTDTRSGYSMDLGDRIATVLFYLTDVEQGGATVFADVGYSVYPQAGTA 477

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
            FW+NL ++G GD  TRHAACPV+ GS
Sbjct: 478 IFWYNLDTNGKGDPRTRHAACPVIVGS 504


>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
 gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
          Length = 517

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 91/221 (41%), Positives = 128/221 (57%), Gaps = 8/221 (3%)

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           ++ CRG    P   + +L C Y     P+LRL P K E   L P ++LY DV+   E   
Sbjct: 281 QLCCRGG--CPYRDMHRLTCSYNTTAAPFLRLAPFKTEILSLSPYMVLYHDVITPLESLT 338

Query: 109 IKKMAQPRLRR---ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           +K +++P ++R     V N K     I + R S S WL   E+ V+ER+ RRV  MT   
Sbjct: 339 LKNLSKPLMKRRAMVMVNNLKVRPF-IDSGRTSNSVWLASHENAVMERLERRVGVMTNFE 397

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
              +E  Q++NYGIGGHY+PH D     +A   +  G G+R+ATVLFY+SDV QGGAT+F
Sbjct: 398 MENSEVYQLINYGIGGHYKPHTDHFETPQAPEHR--GGGDRIATVLFYLSDVPQGGATLF 455

Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             LN+S+ P +G A  W+NL+  G G+  T H +CP++ GS
Sbjct: 456 PRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGS 496


>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
 gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
          Length = 534

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 84/207 (40%), Positives = 124/207 (59%), Gaps = 6/207 (2%)

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
           +L C Y     P+LRL PLK E+  L P ++LY +V+   EI ++   A   ++   V  
Sbjct: 299 RLHCFYNFTTTPFLRLAPLKIEQIGLDPYVVLYHEVLSAREISMLIGKATQNMKNTRVHK 358

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
            + G  +    R +K  W ++  + + + I+RR+  MTG   + +E  QV+NYGIGGHY 
Sbjct: 359 -EQGVPKKNRGRTAKGFWFKKESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYL 417

Query: 185 PH---YDFARPGEANAFKS--LGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
            H   +DFA     +   S  +  G+R+ATVLFY++DV QGGATVF  +  S++P+ GTA
Sbjct: 418 LHMDYFDFASSNHTDTRSSYSMDLGDRIATVLFYLTDVEQGGATVFADVGYSVYPQAGTA 477

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
            FW+NL ++G GD  T+HAACPV+ GS
Sbjct: 478 IFWYNLDTNGKGDPRTKHAACPVIVGS 504


>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 533

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 93/218 (42%), Positives = 125/218 (57%), Gaps = 4/218 (1%)

Query: 51  LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
           LC+G   +    + +L C+YV     Y+ L PLK E  +  P I LY +++ D E   I 
Sbjct: 293 LCQGREKMAQKDINRLFCKYVAPKAHYI-LKPLKMEVLHHDPYIELYYELITDDEAKHII 351

Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
           K A+P LRRA V +  TG+L  A+YR+SK+ W+ E    +  +I RRV  +TGL    AE
Sbjct: 352 KFAKPLLRRAFVHDMVTGDLIYADYRVSKNTWIAEDMDVIAAKIIRRVGDVTGLNMRYAE 411

Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-- 228
            LQV NYGI G YEPH+D +       F   G GNR+AT+L Y+SDV  GG TVFT+   
Sbjct: 412 HLQVANYGIAGQYEPHFDHSTGTRPKHFDRWG-GNRIATMLLYLSDVDWGGRTVFTNTAP 470

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            +   P KG   FW+NL  +G  +  T+HA CPV+ G 
Sbjct: 471 GVGTDPIKGAGVFWYNLLRNGKSNPKTQHAGCPVVLGQ 508


>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
 gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
          Length = 535

 Score =  164 bits (415), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 97/255 (38%), Positives = 138/255 (54%), Gaps = 15/255 (5%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
           E  E   YE +CRG+L    +    L+CR     + Y    P K EE +L P ++    V
Sbjct: 278 ESREFRMYEQVCRGELAPLSSKQRSLRCRLRKSRLGY---APFKLEELHLDPLVVQLHQV 334

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
           +  ++ + ++K A+PR++R+TV +    G    A +R S+ A      +   + +S  V 
Sbjct: 335 IGSNDSESLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAATKLLSHHVG 394

Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
             +GL    AE+LQV NYGIGGHYEPH+D + P      +    GNR+AT ++Y+SDV  
Sbjct: 395 DFSGLNMDYAEDLQVANYGIGGHYEPHWD-SFPENHIYQEGDLHGNRIATGIYYLSDVEA 453

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
           GG T F  L L + PEKG+  FW+NLH SGD D+ T+HAACPVL GS  + +        
Sbjct: 454 GGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513

Query: 274 ----PCGLRRGLQRS 284
               PC L RG + S
Sbjct: 514 DNVRPCDLERGQEIS 528


>gi|195110923|ref|XP_002000029.1| GI22757 [Drosophila mojavensis]
 gi|193916623|gb|EDW15490.1| GI22757 [Drosophila mojavensis]
          Length = 535

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 93/242 (38%), Positives = 134/242 (55%), Gaps = 12/242 (4%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           Y+ +CR +L    A   +L+CRY   +   L  +  K EE +  P II   +V+   E  
Sbjct: 286 YQQVCREELMPTAAAQRELRCRYFSGHGRSLNYLAYKLEELHRDPYIIQLHEVIGAHESV 345

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
            ++ +A+P L+R+ V +   G    A +R S+       EHP+IE++S+ +  ++GL   
Sbjct: 346 QLQHLARPVLQRSEVYSPTNGS-TAATFRTSQGTVFEYDEHPIIEKLSQHMTLISGLDMG 404

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
            AE LQ+ NYGIGGHYEPH D        + +   T NR+AT +FY+S+V  GGAT F  
Sbjct: 405 FAEPLQIANYGIGGHYEPHMDSFPESFDYSLQRFKT-NRIATGIFYLSNVEAGGATAFPF 463

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
           L L + PE+G+  FW+NLH SGD DY T+HA CPVL GS  + +            PC L
Sbjct: 464 LPLLVKPEQGSLLFWYNLHRSGDADYRTKHAGCPVLQGSKWIANVWIRLSHQDHVRPCQL 523

Query: 278 RR 279
           +R
Sbjct: 524 QR 525


>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
 gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
          Length = 535

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 96/250 (38%), Positives = 132/250 (52%), Gaps = 15/250 (6%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
           E  E   YE +CRG+L   PA    L+CR     + Y    P K EE +L P ++    V
Sbjct: 278 ESREFRMYEQVCRGELAPLPAKQRNLRCRLRKSRLGY---APFKLEELHLDPLLVQLHQV 334

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
           +   + + +++ A+PR++R+TV +    G    A +R S+ A          + +S  V 
Sbjct: 335 IGAKDSESLQRTARPRIKRSTVYSLAGNGGSTAAAFRTSQGASFNYSRSAATKLLSHHVG 394

Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
             +GL    AE+LQV NYGIGGHYEPH+D            L  GNR+AT ++Y+SDV  
Sbjct: 395 DFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPENHVYQEGDL-HGNRIATGIYYLSDVEA 453

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
           GG T F  L L + PEKG+  FW+NLH SGD D+ T+HAACPVL GS  + +        
Sbjct: 454 GGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513

Query: 274 ----PCGLRR 279
               PC L R
Sbjct: 514 DKVRPCDLER 523


>gi|195159150|ref|XP_002020445.1| GL13509 [Drosophila persimilis]
 gi|194117214|gb|EDW39257.1| GL13509 [Drosophila persimilis]
          Length = 554

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 84/232 (36%), Positives = 137/232 (59%), Gaps = 12/232 (5%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           E++  +CR      P+   +L CRY     P+LRL PL+ EE  L P I++Y +V+ D+E
Sbjct: 307 EEFNQICRYSHQNKPS---RLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAE 363

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE-----HPVIERISRRVEH 160
           I  ++++ +P L+R+ V + K  ++  +  R +  AWL +         VI+RI RR+  
Sbjct: 364 IAEVERVTEPLLKRSVVFDGKENKMSTSKKRTALGAWLPDDNMDVSGRAVIQRILRRIHE 423

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
           +TGL  +  +++Q++ YG GGHY+ H+D+      ++  +   G+R+ATVLFY++DV  G
Sbjct: 424 LTGLIMNDRQDMQLIKYGYGGHYDIHFDYFN---TSSPITKARGDRMATVLFYLNDVKHG 480

Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSNSLHS 271
           G+T FT L L +  E+G   FW+N+   + D D  T H ACPV+ G+ S+ S
Sbjct: 481 GSTAFTDLQLKVPSERGKVLFWYNMRGETHDLDSRTLHGACPVIDGTKSILS 532


>gi|321458081|gb|EFX69155.1| hypothetical protein DAPPUDRAFT_228756 [Daphnia pulex]
          Length = 570

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 98/263 (37%), Positives = 137/263 (52%), Gaps = 32/263 (12%)

Query: 19  ALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRG--DLTVPPAIVAQLKCRYVHRNVP 76
           A NKSPE               E  E E +  LCR   + + P  +  +LKCR +    P
Sbjct: 285 ATNKSPEWS-------------EWEELEVFFRLCREGEEKSRPTGLKGRLKCRQISHTHP 331

Query: 77  YLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP-RLRRATVQNYKTGELEI-AN 134
           Y  L PLK EE  L P I ++ D M D+E ++ K +A   RL R+   + + G+  + ++
Sbjct: 332 YFILRPLKLEEHSLVPYIAVFHDFMSDAETEIFKSLAMAERLERSAHGSKRPGQGGVTSD 391

Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTS----TAEELQVVNYGIGGHYEPHYD-- 188
            R SK +W+ +  H V+++IS+R+    GL +      +E  QV NYGIGG Y PH D  
Sbjct: 392 KRTSKQSWVEDGSHHVVDQISKRISDSVGLNSQPSNVGSEHYQVANYGIGGRYTPHTDHG 451

Query: 189 -----FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
                   P E + F+    G+R+ T + Y+ DV  GGATVFT   + + P+KG A FW 
Sbjct: 452 VLSKSMGGPSEFDLFR----GDRILTFMTYLDDVEAGGATVFTHAGVVVRPKKGMAVFWW 507

Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
           NL S  +GD  TRH  CPVL GS
Sbjct: 508 NLKSDSNGDTLTRHGGCPVLHGS 530


>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
 gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
          Length = 473

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 89/203 (43%), Positives = 120/203 (59%), Gaps = 7/203 (3%)

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           A+L C Y      +LRL PLK E   L P ++L+ DV+ D +I  I+ +A+  L RA V 
Sbjct: 255 AKLHCLYNTTASYFLRLAPLKMELLSLDPYMVLFHDVVSDKDITSIRNLAKGGLVRA-VT 313

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
             K G  E    R +K  WL E    +I+R+S+  + MT L    A+  QV+NYGIGG+Y
Sbjct: 314 VTKDGSYEEDPARTTKGTWLVE-NSKLIQRLSQLAQDMTNLDIRDADPFQVLNYGIGGYY 372

Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
             H+DF    E   F      NR+AT +FY+SDV QGGAT+F  L LS++P+KG+A  W+
Sbjct: 373 GTHFDFLADTEMGNF-----SNRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWY 427

Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
           NL   GDGD  T H+ACP + GS
Sbjct: 428 NLDHKGDGDNRTAHSACPTIVGS 450


>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
 gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
          Length = 535

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 96/255 (37%), Positives = 138/255 (54%), Gaps = 15/255 (5%)

Query: 41  EVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDV 100
           E  E   YE +CRG+L   P+    L+CR     + Y    P K EE +L P ++    V
Sbjct: 278 ESREFRMYEQVCRGELAPLPSKQRSLRCRLRKSRLGY---APFKLEELHLDPLVVQLHQV 334

Query: 101 MYDSEIDLIKKMAQPRLRRATVQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVE 159
           +  ++ + ++K A+P ++R+TV +    G    A +R S+ A     ++   + +S  V 
Sbjct: 335 IGSNDSESLQKSARPMIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSKNAATKLLSHHVG 394

Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
             + L    AE+LQV NYGIGGHYEPH+D + P      +    GNR+AT ++Y+SDV  
Sbjct: 395 DFSDLNMDYAEDLQVANYGIGGHYEPHWD-SFPENHIYQEGDLHGNRIATGIYYLSDVEA 453

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC------ 273
           GG T F  L L + PEKG+  FW+NLH SGD D+ T+HAACPVL GS  + +        
Sbjct: 454 GGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513

Query: 274 ----PCGLRRGLQRS 284
               PC L RG + S
Sbjct: 514 DNVRPCDLERGQEIS 528


>gi|449485593|ref|XP_004175686.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3 [Taeniopygia guttata]
          Length = 567

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 89/205 (43%), Positives = 123/205 (60%), Gaps = 7/205 (3%)

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
            L C Y   N P+L L P K+E  ++QP + LY D + D+E + IK +A P L+R+ V  
Sbjct: 342 HLSCSYETNNSPFLLLQPAKKEMVWIQPHVALYHDFITDAEAETIKGLAGPWLQRSVV-- 399

Query: 125 YKTGE-LEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT--STAEELQVVNYGIGG 181
             +GE  + A Y ISKS WL++   PV+  + +R+  +TGL      AE LQVVNYG+GG
Sbjct: 400 -ASGEKQQKAEYWISKSTWLKDTVDPVVHALDQRIIAVTGLDLWPPYAEYLQVVNYGLGG 458

Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
           HYEPH+D A   ++  ++ + +GNR ATV+ Y+S V  GG+T     N S+   K  A F
Sbjct: 459 HYEPHFDHATSTKSPLYR-MKSGNRNATVMIYLSAVEAGGSTALIYTNFSVPVVKNAALF 517

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
           W NL  +G+GD  T HA CPVL G 
Sbjct: 518 WWNLRRNGNGDGDTLHAGCPVLAGD 542


>gi|198449504|ref|XP_002136909.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
 gi|198130636|gb|EDY67467.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
          Length = 527

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 135/232 (58%), Gaps = 12/232 (5%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           E++  +CR      P+   +L CRY     P+LRL PL+ EE  L P I++Y +V+ D+E
Sbjct: 280 EEFNQICRSSHQNKPS---RLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAE 336

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE-----HPVIERISRRVEH 160
           I  ++++ +P L+R+ V + K  ++  +  R +  AWL +         VI+RI RR+  
Sbjct: 337 IAEVERVTEPLLKRSVVFDGKGNKMSTSKRRTALGAWLPDDNMDVSGRAVIQRIFRRIHE 396

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
           +TGL  +  +++Q++ YG GGHY+ H+D+          +   G+R+ATVLFY++D+  G
Sbjct: 397 LTGLIINDRQDMQLIKYGYGGHYDIHFDYFNTSTP---ITKARGDRMATVLFYLNDMKHG 453

Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSNSLHS 271
           G+T FT L L +  E+G   FW+N+   + D D  T H ACPV+ G+ ++ S
Sbjct: 454 GSTAFTDLQLKVPSERGKVLFWYNMRGETHDVDSRTLHGACPVINGTKTILS 505


>gi|198449508|ref|XP_002136911.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
 gi|198130638|gb|EDY67469.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
          Length = 516

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 137/232 (59%), Gaps = 12/232 (5%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           E++  +CR      P+   +L CRY     P+LRL PL+ EE  L P I++Y +V+ D+E
Sbjct: 269 EEFNQICRSWHQNKPS---RLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLCDAE 325

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE-----HPVIERISRRVEH 160
           I  ++++ +P L+R+ V + K  ++  +  R +  AWL +         VI+RI RR+  
Sbjct: 326 IAEVERVTEPLLKRSVVFDGKENKMSTSKKRTALGAWLPDDNMDVSGRAVIQRIFRRIHE 385

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
           +TGL  +  +++Q++ YG GGHY+ H+D+      ++  +   G+R+ATVLFY++DV  G
Sbjct: 386 LTGLIINDRQDMQLIKYGYGGHYDIHFDYFN---TSSPITKARGDRMATVLFYLNDVKHG 442

Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSNSLHS 271
           G+T FT L L +  E+G   FW+N+   + D D  T H ACPV+ G+ ++ S
Sbjct: 443 GSTAFTDLQLKVPSERGKVLFWYNMRGETHDLDSRTLHGACPVIDGTKTILS 494


>gi|195159146|ref|XP_002020443.1| GL13510 [Drosophila persimilis]
 gi|194117212|gb|EDW39255.1| GL13510 [Drosophila persimilis]
          Length = 527

 Score =  160 bits (405), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 135/232 (58%), Gaps = 12/232 (5%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           E++  +CR      P+   +L CRY     P+LRL PL+ EE  L P I++Y +V+ D+E
Sbjct: 280 EEFNQICRSWHQNKPS---RLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAE 336

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE-----HPVIERISRRVEH 160
           I  ++++ +P L+R+ V + K  ++  +  R +  AWL +         VI+RI RR+  
Sbjct: 337 IAEVERVTEPLLKRSVVFDGKENKMSTSKKRTALGAWLPDDNMDVSGRAVIQRIFRRIHE 396

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
           +TGL  +  +++Q++ YG GGHY+ H+D+          +   G+R+ATVLFY++D+  G
Sbjct: 397 LTGLIINDRQDMQLIKYGYGGHYDIHFDYFNTSTP---ITKARGDRMATVLFYLNDMKHG 453

Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSNSLHS 271
           G+T FT L L +  E+G   FW+N+   + D D  T H ACPV+ G+ ++ S
Sbjct: 454 GSTAFTDLQLKVPSERGKVLFWYNMRGETHDLDSRTLHGACPVINGTKTILS 505


>gi|443712762|gb|ELU05926.1| hypothetical protein CAPTEDRAFT_153364 [Capitella teleta]
          Length = 491

 Score =  160 bits (405), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 96/230 (41%), Positives = 136/230 (59%), Gaps = 17/230 (7%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVH-RNVPYLRLMPLKEEEAY-LQPRIILYRDVMYDS 104
           KY+ LCRGD+ V  +  + L CRY   R++P    +P+ +EE + + P + ++ DV+ D+
Sbjct: 239 KYQELCRGDMIVEESKKSLLYCRYAKGRDIP----LPIYKEEVHNVDPHVAIFYDVISDA 294

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           E D I + A P + R  V N  +   + ++ RISK  WL +    +I+++S R+  +TGL
Sbjct: 295 EADHIIRHAFPGMFRGLVGN--STLRQSSDQRISKVGWLFDNVDTLIKKLSARIGDVTGL 352

Query: 165 TT------STAEELQVVNYGIGGHYEPHYDFARPGE--ANAFKSL-GTGNRVATVLFYMS 215
            T      S  E +QVVNYGIGG YEPH DF    E   N   SL  TG+R++T LFY+S
Sbjct: 353 NTVYTPVRSPVEAMQVVNYGIGGQYEPHLDFYEDPEMLKNVNPSLQDTGDRISTFLFYLS 412

Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            V  GGATVF  LN+ + P K  AAFW+N   +G+ D  T HA CPV+ G
Sbjct: 413 RVHLGGATVFPKLNVRVPPVKNGAAFWYNARPNGEHDKRTLHAGCPVVLG 462


>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
 gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
 gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
 gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
          Length = 386

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 85/203 (41%), Positives = 120/203 (59%), Gaps = 7/203 (3%)

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           A+L C Y   +  +LRL PLK E   L P ++L+ DV+ D +I  I+ + + +L R TV 
Sbjct: 168 AKLYCLYKTTSSYFLRLAPLKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLAR-TVT 226

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
             K G       R +K  WL E  + +I+R+S+  + MT      A+  QV+NYGIGG Y
Sbjct: 227 VSKDGNYTEDPDRTTKGTWLVE-NNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFY 285

Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
             H+DF    E + F      +R+AT +FY+SDV QGGAT+F  L LS++P+KG+A  W+
Sbjct: 286 GIHFDFLEDAELDNF-----SDRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWY 340

Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
           NL   GDGD  T H+ACP + GS
Sbjct: 341 NLDHKGDGDNRTAHSACPTVVGS 363


>gi|195159297|ref|XP_002020518.1| GL13472 [Drosophila persimilis]
 gi|194117287|gb|EDW39330.1| GL13472 [Drosophila persimilis]
          Length = 526

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 95/261 (36%), Positives = 137/261 (52%), Gaps = 19/261 (7%)

Query: 14  LYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHR 73
           LY  + + +   + ++P K++         E E Y + C G   +       L+C Y+  
Sbjct: 230 LYESKTIEEHAPIPEDPSKLD---------EFEAYRLTCSGHSRLTAREQRHLRCGYMTE 280

Query: 74  NVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIA 133
             P+L L PLK EE    P ++LY DV+Y SEID+I+++   R+ RA V    T +  ++
Sbjct: 281 THPFLLLAPLKAEELSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMVT--LTNQSTVS 338

Query: 134 NYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARP 192
           N R S+  ++ + EH V++ I RRV  MT L    AE+ Q  NYGIGGHY  H D F   
Sbjct: 339 NVRTSQITFIAKTEHEVLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWFTET 398

Query: 193 GEANAF-KSLGTGNRVATVLFYMSDVAQGGATVFTS------LNLSLWPEKGTAAFWHNL 245
              N    S   GNR+ATVLFY   +      + ++      L   L  +K  AAFWHNL
Sbjct: 399 TFDNGLVSSTEMGNRIATVLFYNISLNSSRMWLMSAALTCPYLKQHLRLKKYAAAFWHNL 458

Query: 246 HSSGDGDYYTRHAACPVLTGS 266
           H++G GD  T+H ACP++ GS
Sbjct: 459 HAAGRGDARTQHGACPIIAGS 479


>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
 gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
          Length = 509

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 85/203 (41%), Positives = 120/203 (59%), Gaps = 7/203 (3%)

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           A+L C Y   +  +LRL PLK E   L P ++L+ DV+ D +I  I+ + + +L R TV 
Sbjct: 291 AKLYCLYKTTSSYFLRLAPLKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLAR-TVT 349

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
             K G       R +K  WL E  + +I+R+S+  + MT      A+  QV+NYGIGG Y
Sbjct: 350 VSKDGNYTEDPDRTTKGTWLVE-NNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFY 408

Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
             H+DF    E + F      +R+AT +FY+SDV QGGAT+F  L LS++P+KG+A  W+
Sbjct: 409 GIHFDFLEDAELDNF-----SDRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWY 463

Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
           NL   GDGD  T H+ACP + GS
Sbjct: 464 NLDHKGDGDNRTAHSACPTVVGS 486


>gi|194751825|ref|XP_001958224.1| GF23629 [Drosophila ananassae]
 gi|190625506|gb|EDV41030.1| GF23629 [Drosophila ananassae]
          Length = 523

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 86/226 (38%), Positives = 130/226 (57%), Gaps = 20/226 (8%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
           L CRY     P+LRL PLK EE  L P +++Y +V+Y++EI+ +KK  Q    +    + 
Sbjct: 306 LFCRYNFTTTPFLRLAPLKLEEINLDPYVVMYHEVLYETEIEELKK--QSGHMKNGYADQ 363

Query: 126 KTGELEIANYR--ISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
           K G +    YR  +++ +W  + E P  ERI+RR+  MTGL     + LQV NYG G ++
Sbjct: 364 KNGTM----YRAVVARHSWWSD-ESPTRERINRRIRDMTGLDFPITDTLQVANYGCGTYF 418

Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
           +PH+D+   G      +   G+R+ T++FY SDV QGGATVF  + +S+ P KG++ FW+
Sbjct: 419 KPHFDYTSDGYETP-NADALGDRLGTIIFYASDVLQGGATVFPDIKVSITPRKGSSVFWY 477

Query: 244 NLHSSGDGDYYTRHAACPVLTG-----SNSLH-----STCPCGLRR 279
           NL+  G  D  +RH+ CPV+ G     +  +H        PCG R+
Sbjct: 478 NLYDDGRPDIRSRHSVCPVINGDRWTLTKWIHIFPQMFIIPCGPRK 523


>gi|443721482|gb|ELU10773.1| hypothetical protein CAPTEDRAFT_174752 [Capitella teleta]
          Length = 525

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 99/244 (40%), Positives = 141/244 (57%), Gaps = 23/244 (9%)

Query: 38  PTLEVTEREKYEMLCRGDLTVPPAIVAQ---LKCRYVHRNVPYLRLMPLKEEEAYLQPRI 94
           P L+ T+   YE LCRG+    P + ++   LKCRY    +P++R    KEE    +P I
Sbjct: 264 PKLKSTK--AYEALCRGEQLKLPDVDSEQQALKCRYKPGILPFVRY---KEEMLNRKPHI 318

Query: 95  ILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY-RISKSAWLREPE-HPVIE 152
           +L+ DVM D+E   +K  A  +L RA V + +      A+  RIS+ +WL +   +  I 
Sbjct: 319 VLFHDVMSDAEAKTMKMEAMHKLERAHVADNENKHGHSASAKRISQVSWLWDDHANKTIH 378

Query: 153 RISRRVEHMTGLTTS------TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL----- 201
           ++SRRV  +TGL T       +AE  Q++NYGIGG YEPH D+     +++  SL     
Sbjct: 379 QLSRRVADITGLQTGVVSGLHSAEPFQILNYGIGGQYEPHVDYFAGNHSHS--SLPEHVR 436

Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACP 261
            +GNR+AT +FY++DV  GGATVF  L + + P K  AAFW+N+  +GD D  T HA CP
Sbjct: 437 ASGNRLATFMFYLNDVHAGGATVFPKLKVGIPPTKNGAAFWYNIGLNGDVDPLTEHAGCP 496

Query: 262 VLTG 265
           VL G
Sbjct: 497 VLLG 500


>gi|301626782|ref|XP_002942567.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Xenopus
           (Silurana) tropicalis]
          Length = 716

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 100/268 (37%), Positives = 140/268 (52%), Gaps = 30/268 (11%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTL-EVTEREKYEMLCRGDLTVPPAI 62
           P + R   N   Y++ L +SP    +  ++    P +  +  R+ YE LC+   + P + 
Sbjct: 449 PDNGRLARNIAKYEQILYESPTADADAEEMKLQRPNVTHLKTRDLYEGLCQTLGSQPTSY 508

Query: 63  V-AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
               + C Y   + PYL L P+K+E   L+P+++LY D + D E + IK++A P L R+ 
Sbjct: 509 EDPHMSCMYDTNSHPYLLLQPMKKEIVSLRPQVVLYHDFVSDLEAEKIKELASPWLHRSV 568

Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYG 178
           V    +GE +  A YRISKSAWL++  HP ++ +  R+  +TGL      AE LQVVNYG
Sbjct: 569 V---ASGEKQAEAEYRISKSAWLKDTIHPFVQNLDTRISGVTGLNAHPPYAEYLQVVNYG 625

Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           IGGHYEPH+D A                       +S V  GG+T F   N S    K  
Sbjct: 626 IGGHYEPHFDHAT----------------------LSHVDLGGSTAFVFANFSSPVVKNA 663

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           A FW NLH +G GD  T HA CPV+ GS
Sbjct: 664 AVFWWNLHRNGLGDEDTLHAGCPVIIGS 691


>gi|449668268|ref|XP_002154169.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 531

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 93/227 (40%), Positives = 131/227 (57%), Gaps = 11/227 (4%)

Query: 47  KYEMLCRGDL---TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           +Y   CR D    T+    V  L C Y   N P L L PLK    +  P ++++ +++ +
Sbjct: 295 RYARACRRDQRTKTIAVKDVNNLVCFY-KNNKPRLILKPLKVTRMHDNPDVLVFHEMITE 353

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRR----VE 159
              + I+ +A PRLR + V +    +   A+YR+SK+ +  +     +E ISR+    VE
Sbjct: 354 EVAEKIRDVANPRLRPSEVIDPIIQKHVTASYRVSKNVFFDDAFEEELE-ISRKLRPLVE 412

Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
             T L    +E+LQV NYG+GG YE H DF  PG  +       GNR+AT+L Y+SDV +
Sbjct: 413 DATDLNDDFSEQLQVNNYGLGGQYEFHVDFGDPG--SPLDKHEHGNRIATLLIYLSDVER 470

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           GG TVFT L LSL P+ G AAFWHNL+ +G G Y T HA+CPV++GS
Sbjct: 471 GGDTVFTRLGLSLKPKLGDAAFWHNLYKNGSGIYATEHASCPVVSGS 517


>gi|195575139|ref|XP_002105537.1| GD21537 [Drosophila simulans]
 gi|194201464|gb|EDX15040.1| GD21537 [Drosophila simulans]
          Length = 536

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 94/244 (38%), Positives = 139/244 (56%), Gaps = 14/244 (5%)

Query: 34  NNVAPTLEVTEREK---YEMLCRGD--LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEA 88
           N   P++ +  RE    +  LCR      +  +  ++L CRY     P+LRL P + EE 
Sbjct: 260 NTPKPSINLESRESDESFNQLCRSSSRRQMGESKPSRLHCRYNTTTTPFLRLAPFRMEEL 319

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL----R 144
            L P ++ Y +V+ D EI+ +K M++P L RA V   + G  EIA  R +  AWL     
Sbjct: 320 SLDPYVVFYHNVLSDPEIEKLKPMSEPFLERAKVFRVEKGSDEIAPTRSADGAWLPHQDT 379

Query: 145 EPEH-PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
           +P+   V+ RI RR+  +TGL T +  ++Q + YG GGH+ PHYD+     +   +    
Sbjct: 380 DPDDLEVLRRIGRRIRDITGLNTRSGSQMQFLKYGFGGHFVPHYDYFNSKTSYLER---V 436

Query: 204 GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL-HSSGDGDYYTRHAACPV 262
           G+R+ATVLFY+++V  GGAT F  LNL +  +KG+A FWHNL   S D D  T H ACP+
Sbjct: 437 GDRMATVLFYLNNVDHGGATAFPKLNLVVPTQKGSALFWHNLDRKSYDYDTRTSHGACPL 496

Query: 263 LTGS 266
           ++G+
Sbjct: 497 ISGT 500


>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
          Length = 187

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 76/158 (48%), Positives = 107/158 (67%), Gaps = 3/158 (1%)

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           MA PR+ R+TV     G+L+ + +R+SK+AWL    HP +  + R ++  TGL T+  E+
Sbjct: 1   MAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATGLDTTFCEQ 60

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GGHYEPH+DF R  + N + +   GNR+AT +FY+S+V QGGAT F  L+++
Sbjct: 61  LQVANYGVGGHYEPHWDFFR--DPNHYPA-EEGNRIATAIFYLSEVEQGGATAFPFLDIA 117

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           + P+ G   FW+NLH S D DY T+HA CPVL GS  +
Sbjct: 118 VKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWI 155


>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
 gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
          Length = 517

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 90/218 (41%), Positives = 132/218 (60%), Gaps = 15/218 (6%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG+   P      L C Y +   P+LR+ P K E     P +  Y DV+ DSEI+ +K 
Sbjct: 288 CRGEYKPPKG----LSCYYEYGADPFLRIAPFKVELLNRSPYVAAYYDVLNDSEIEELKL 343

Query: 112 MAQPRLRRATVQNYKTGELEIANY-RISKSAWLREPEHPVIERISRRVEHMTGL--TTST 168
           M+ P++RR+ + N+ T +++ A+  R S S ++ E    ++E IS+R   MT L  T  +
Sbjct: 344 MSSPQIRRSLLYNH-TLDIDQADVDRTSNSVFMEETGITLLETISQRAADMTDLYVTAIS 402

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           +E+LQV+NYG+GG Y PH D+      N       G+R+ATVLFY++DV QGGATVF  L
Sbjct: 403 SEDLQVINYGLGGQYTPHCDYFDENAEN-------GDRLATVLFYLTDVQQGGATVFPFL 455

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            LS +P+KG+A  + NL ++  GD  + H+ACPVL G+
Sbjct: 456 RLSYFPKKGSALIFRNLDNAMSGDKDSTHSACPVLFGN 493


>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
 gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
          Length = 460

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 89/234 (38%), Positives = 126/234 (53%), Gaps = 24/234 (10%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C     +P      L CRY     P+L + PLK EE    P +++Y DV+Y++EI+ +  
Sbjct: 233 CSAKFRLPN----HLHCRYNSSTSPFLHIAPLKMEEISTDPYMVVYHDVIYENEINWL-- 286

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +     R + V     GE +I+  R S+          V+  I +R++ MTGL+   +E+
Sbjct: 287 LDNSDFRTSLV-----GESQISTLRTSQDMPFGANSGEVMRNIEKRIKDMTGLSMDLSED 341

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
             ++NYGIGG Y+ HYDF    E   F     G R+ TVLFY+ DV   G+TVF  LN+S
Sbjct: 342 FMLINYGIGGTYKMHYDFYVYSEPLRFLR---GERIVTVLFYLGDVELSGSTVFPFLNIS 398

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS--------NSLHST--CPC 275
           + P+KG+A  W+NLH+SGD    T+H ACPV+ GS        N LH T   PC
Sbjct: 399 ITPKKGSAVMWYNLHNSGDVHQKTQHCACPVVVGSKYVLTKWINELHQTFITPC 452


>gi|443719426|gb|ELU09607.1| hypothetical protein CAPTEDRAFT_229373 [Capitella teleta]
          Length = 576

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 96/252 (38%), Positives = 141/252 (55%), Gaps = 30/252 (11%)

Query: 48  YEMLCRGD-LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
           Y  LCRG+     P +V  L C Y +  +PYLR     EE     PRI L  +V+ + +I
Sbjct: 324 YMKLCRGEHFDRDPEVVKALYCTYRYGILPYLRY---NEEIFNFNPRIALIYNVIKNRDI 380

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
           +++K  A   L  + V +    + +++N RISK++WL + E   I ++S++V  +TGL+T
Sbjct: 381 NMLKDKATAGLSSSRVGD--PAKSKLSNERISKTSWLWDTEDERIFKLSKQVADITGLST 438

Query: 167 ------STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-----GTGNRVATVLFYMS 215
                 S AE  Q+VNYGIGG Y+PH+D+    E +  +++      TG+RVAT +FY+S
Sbjct: 439 QYSTLHSHAEPFQLVNYGIGGQYQPHFDYY---ENDMLRNVPAFIQDTGDRVATFMFYLS 495

Query: 216 DVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC-- 273
            V  GGATVF  L++ +   KG AAFW N+  SGD +  T+HA CPVL G   + +    
Sbjct: 496 SVKAGGATVFPKLHVRIPAVKGAAAFWFNIRRSGDREPLTQHAGCPVLLGEKWVANKWIR 555

Query: 274 --------PCGL 277
                   PCGL
Sbjct: 556 ELGQEYNRPCGL 567


>gi|380805043|gb|AFE74397.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor, partial
           [Macaca mulatta]
          Length = 128

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 71/128 (55%), Positives = 97/128 (75%)

Query: 76  PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
           P L + P KEE+ +  P I+ Y DVM D EI+ IK++A+P+L RATV++ KTG L +A+Y
Sbjct: 1   PQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASY 60

Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           R+SKS+WL E + PV+ R++RR++H+TGLT  TAE LQV NYG+GG YEPH+DF+R  E 
Sbjct: 61  RVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDER 120

Query: 196 NAFKSLGT 203
           + FK LGT
Sbjct: 121 HTFKHLGT 128


>gi|195452744|ref|XP_002073481.1| GK14140 [Drosophila willistoni]
 gi|194169566|gb|EDW84467.1| GK14140 [Drosophila willistoni]
          Length = 454

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 81/215 (37%), Positives = 124/215 (57%), Gaps = 5/215 (2%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C G+  V      QL C Y  ++  +LR+ P+K E   L P I+L  DV+  SE + +K 
Sbjct: 223 CSGNCEVDREF--QLFCLYNTKDAYFLRIAPVKMEILSLNPYIVLCHDVILPSEQEFLKT 280

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
            +  RL  A   +    E+     R SK+ WL++    V  R+S  +E ++ L ++  + 
Sbjct: 281 QSSKRLEGARALDQVKNEVVFNFIRTSKATWLKKNSDNVTRRLSHWIEDVSNLDSNIGDL 340

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
            Q++NYG+GG +E H D  R  E + +K L   +R+AT +FY+ DV QGGAT+F +LNL+
Sbjct: 341 YQIINYGVGGLFEAHSDTMRKDE-DRWKVL--YDRIATFIFYLQDVPQGGATLFNNLNLT 397

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           ++P+ G A FW NL ++GD D +T H  CPV+ GS
Sbjct: 398 VFPKAGAALFWFNLDNAGDTDLFTVHTGCPVIVGS 432


>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 267

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 91/237 (38%), Positives = 130/237 (54%), Gaps = 15/237 (6%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRY-VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMY 102
           E  +Y  +C  D  V P   ++L C+       P+L L P K E     PRI+++ D + 
Sbjct: 10  ESAEYMSMCVADGDVRPR-QSKLLCKISTIGGHPFLVLQPFKIEVLSEDPRIVVFPDFLN 68

Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
             E ++ + ++Q +L RA V      E   +  R +K AW+ +  HP++ ++SRR+   T
Sbjct: 69  PRECEIFRSISQEKLSRAKVYLGGPPEGGFSLRRTNKVAWMSDDLHPLLGKVSRRIALAT 128

Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
           GLT ++AE  QV NYG+GGHY PH D+A  GEA       +GNR+AT+L Y++DVA GGA
Sbjct: 129 GLTLTSAEMYQVANYGLGGHYIPHPDYAGFGEAQGDIYKSSGNRLATMLIYLADVAGGGA 188

Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGD-------------GDYYTRHAACPVLTGS 266
           T F ++ L++ P  GTA FW+NL                  GD  T H  CPVLTGS
Sbjct: 189 TAFINMRLAVKPTLGTALFWYNLKPYDGPIVNESFWNQRRFGDPRTFHMGCPVLTGS 245


>gi|198477150|ref|XP_002136737.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
 gi|198145042|gb|EDY71754.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
          Length = 508

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/203 (42%), Positives = 115/203 (56%), Gaps = 7/203 (3%)

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           ++L C Y      +LRL PLK E   L P ++LY DV+ D E+ L+K MAQ  L RA   
Sbjct: 291 SRLYCLYNTTATAFLRLAPLKMELLSLDPYVVLYHDVLADREMSLLKLMAQRDLVRAVTY 350

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
           N    +      R +K+ WL +P H +I R+    E M+ L    +E+ QV+NYGIGGHY
Sbjct: 351 NATEKKHSEDPNRTTKAGWL-DPSHNLIRRMGILTEDMSNLDLERSEDFQVLNYGIGGHY 409

Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
             H DF               +RVAT+LFY+SDV  GGATVF  L+LS++P+KG    W+
Sbjct: 410 AVHPDFFEGSNPE------LPDRVATLLFYLSDVPLGGATVFPLLDLSVFPKKGAVLMWY 463

Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
           NL   G G   T H+ACPV+ GS
Sbjct: 464 NLDHKGQGMEKTIHSACPVVVGS 486


>gi|195064500|ref|XP_001996577.1| GH12091 [Drosophila grimshawi]
 gi|193895397|gb|EDV94263.1| GH12091 [Drosophila grimshawi]
          Length = 521

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 91/287 (31%), Positives = 148/287 (51%), Gaps = 31/287 (10%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKY--EMLCRGDLTV- 58
           +F TH       L   + L  S   ++   KVN++  TL  T+ + Y  E+L + D  + 
Sbjct: 220 LFKTHM------LLAMQILQASMNPEEAHEKVNDIFKTLSSTDLDSYVNELLNQDDDQLF 273

Query: 59  ----------PPAIVA---------QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRD 99
                      P  V           L CRY     P+LRL PLK EE    P +++Y +
Sbjct: 274 MELQSMQPIATPEFVGCRGHFPKRHNLSCRYNFTTTPFLRLAPLKLEEINHDPYVVMYHN 333

Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVE 159
           V+YDSEI+ +K+++ P+++   +  YK  + ++ +   ++  WL E   P +ER+++R+ 
Sbjct: 334 VIYDSEIEEMKRLS-PQMQNGYIHGYKANQTKVTDI-AARVNWLVE-NTPFLERMNQRIT 390

Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
            MTG        +QV N+GIG ++E HYD+              G+R+A+++FY SDV  
Sbjct: 391 DMTGFDLKEFPSVQVANFGIGNNFEAHYDYIFGKRVRKEDVGDLGDRLASIIFYSSDVPL 450

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           GGATVF  + +++ P+KG +  W+NL   G  D  + H+ CPV+ GS
Sbjct: 451 GGATVFPDIQVAVQPQKGNSLLWYNLFDDGTPDPRSLHSVCPVVVGS 497


>gi|198466403|ref|XP_002135183.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
 gi|198150584|gb|EDY73810.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
          Length = 534

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 84/223 (37%), Positives = 124/223 (55%), Gaps = 16/223 (7%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           YE+ CRG    P     +L CRY     P+LRL PLK EE    P I++Y +V+ D EI+
Sbjct: 296 YEIGCRG--LFPKR--TKLVCRYNFTTTPFLRLAPLKMEEVNHDPYIVMYHEVLSDREIE 351

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRI----SKSAWLREPEHPVIERISRRVEHMTG 163
            +K       R   + N    + E  + +I     +  W RE +  + ER++RR+  MT 
Sbjct: 352 EMKG------RSGQMSNGWADQKEANSTKIRDIVCRHTWWRE-QSAIKERVNRRISDMTN 404

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
                 E+LQV NYG+G H++PHYD+   G       L  G+R+ +++FY SDV QGGAT
Sbjct: 405 FDFPPQEDLQVANYGLGTHFKPHYDYTSDGYETP-DVLTLGDRLGSIIFYASDVPQGGAT 463

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           VF    +S++P KG++ FW+NL+  G  D  ++H+ CPV+ G 
Sbjct: 464 VFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGD 506


>gi|195390805|ref|XP_002054058.1| GJ23004 [Drosophila virilis]
 gi|194152144|gb|EDW67578.1| GJ23004 [Drosophila virilis]
          Length = 446

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 86/215 (40%), Positives = 122/215 (56%), Gaps = 16/215 (7%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C   L  P    + L CRY +   P+LR+ PLK EE  + P ++LY +V+YDSEI+    
Sbjct: 222 CAASLQRP----SHLHCRYNNWTTPFLRIAPLKMEELSIDPFVVLYHNVIYDSEIEWF-- 275

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           + Q       + +Y       + +R  K+ ++   +  +++ I  RV  M+GL+   +++
Sbjct: 276 LTQSFDYTPALLDYGG----FSAHRSGKNVFIELEKGELVKTIEMRVTDMSGLSMEGSDD 331

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           L ++NYGIGGHY PH+D     E        T +R+AT LFY+SDV  GGAT F  LNL+
Sbjct: 332 LSLINYGIGGHYIPHHDSFSEEENK------TEDRIATALFYLSDVELGGATTFPLLNLT 385

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + PEKGTA  WHNL  SG     T HAACPV+ GS
Sbjct: 386 ISPEKGTAVLWHNLKDSGTPHPKTVHAACPVIVGS 420


>gi|195440206|ref|XP_002067933.1| GK11220 [Drosophila willistoni]
 gi|194164018|gb|EDW78919.1| GK11220 [Drosophila willistoni]
          Length = 459

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 88/239 (36%), Positives = 135/239 (56%), Gaps = 8/239 (3%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E   Y + CRG L +PP    +L CRY     P+LRL PLK+EE  L P I++Y DV++D
Sbjct: 220 EMTSYHLGCRG-LFLPPG---KLVCRYNFTTSPFLRLAPLKQEEINLDPYIVVYHDVLHD 275

Query: 104 SEIDLIKK-MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
            EI  +K+ MA   +  A ++  K  + ++    I + +WL +  +  ++ +++R+  MT
Sbjct: 276 REIAQMKEEMANAHISNAWIEERKANQSQMRQV-IGRVSWLTDSSN-FMDSVNQRIMDMT 333

Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
           G +    E LQV NYG G +++PHYD+   G       L  G+R+A+V+FY S+V  GGA
Sbjct: 334 GFSMKGIESLQVCNYGPGCNFKPHYDYMAEGYEPP-NILTLGDRLASVIFYASEVHLGGA 392

Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGL 281
           TVF  L++++ P+KG    W+N +     D  ++HA CP L GS     T   GL + L
Sbjct: 393 TVFPRLDVAITPKKGAGLVWYNTYDDSTHDQRSQHAVCPTLMGSRWSKKTPHQGLEKCL 451


>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
 gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
          Length = 511

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 81/230 (35%), Positives = 122/230 (53%), Gaps = 13/230 (5%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           +E+ CRG       ++    C Y  ++  +LRL P+K E   L P ++++ DV+   EID
Sbjct: 279 FEIGCRGQYVQQSGLM----CTYKSKSPAFLRLAPIKMEVLVLDPLVVIFHDVLSSREID 334

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
            ++++A+P L R+ V  Y+        +RIS   W+    + +  RI RR+  M  L   
Sbjct: 335 GLQEIARPHLERSMVVKYRANVQ--GKHRISAGTWVERKYNNLTWRIERRIADMVDLNLE 392

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
            +E   V+NYGIGG Y+ H+DF               NR+ATVLFYM+DV QGGATVF  
Sbjct: 393 GSEPFYVINYGIGGQYKAHWDFFGADTVE-------DNRLATVLFYMNDVEQGGATVFPR 445

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGL 277
           L  ++  ++G A FW+N+  +G  D  T H  CP+L GS  + +     L
Sbjct: 446 LGQTVRAKRGNALFWYNMQHNGTVDDRTLHGGCPILVGSKWIFTQWISDL 495


>gi|195166681|ref|XP_002024163.1| GL22882 [Drosophila persimilis]
 gi|194107518|gb|EDW29561.1| GL22882 [Drosophila persimilis]
          Length = 534

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 83/223 (37%), Positives = 123/223 (55%), Gaps = 16/223 (7%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           YE+ CRG       +V    CRY     P+LRL PLK EE    P I++Y +V+ D EI+
Sbjct: 296 YEIGCRGLFPKRTNLV----CRYNFTTTPFLRLAPLKMEEVNHDPYIVMYHEVLSDREIE 351

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRIS----KSAWLREPEHPVIERISRRVEHMTG 163
            +K       R   + N    + E  + +I     +  W RE +  + ER++RR+  MT 
Sbjct: 352 EMKG------RSGQMSNGWADQKEANSTKIRDIVCRHTWWRE-QSAIKERVNRRISDMTN 404

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
                 E+LQV NYG+G H++PHYD+   G       L  G+R+ +++FY SDV QGGAT
Sbjct: 405 FDFPPQEDLQVANYGLGTHFKPHYDYTSDGYETP-DVLTLGDRLGSIIFYASDVPQGGAT 463

Query: 224 VFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           VF    +S++P KG++ FW+NL+  G  D  ++H+ CPV+ G 
Sbjct: 464 VFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGD 506


>gi|313217217|emb|CBY38368.1| unnamed protein product [Oikopleura dioica]
 gi|313239835|emb|CBY17758.1| unnamed protein product [Oikopleura dioica]
          Length = 521

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 91/228 (39%), Positives = 132/228 (57%), Gaps = 11/228 (4%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPY--LRLMPLKEEEAYLQPRIILYRDVMYD 103
           ++YE LCR      P   + LKC Y     P   L+  P+K EE +  P ++ + +V+ D
Sbjct: 268 QEYERLCR---EFSPPHKSNLKCFYWTGPSPLSPLQWAPVKTEELHGDPLVVQFYEVISD 324

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE----HPVIERISRRVE 159
            E   I+ +A   L RAT+Q+  TG+L  A+YRI K+AWL E E    +  I + + ++ 
Sbjct: 325 EEERAIQFLAGEHLNRATIQDPATGKLVNADYRIQKTAWLTEFEKLDVNGTIAKYNEKLT 384

Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDF-ARPGEANAFKSLGTGNRVATVLFYMSDVA 218
            +TGL    AE +QV NYG+ G YEPH+D  + PG  N +  +  G+R+AT L YMS+  
Sbjct: 385 KITGLDADYAELVQVGNYGVAGQYEPHWDHQSYPGAENRWDPI-EGSRIATWLAYMSEPN 443

Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            GG TVF    +   P + +A FW+NL  SG+ D  T+HAACPVL+G+
Sbjct: 444 MGGGTVFIQAGIQARPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGT 491


>gi|195379216|ref|XP_002048376.1| GJ13933 [Drosophila virilis]
 gi|194155534|gb|EDW70718.1| GJ13933 [Drosophila virilis]
          Length = 521

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 82/217 (37%), Positives = 123/217 (56%), Gaps = 11/217 (5%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG    P +    L CRY     P+LRL PLK EE    P I++Y +V+ DSEI+ +K+
Sbjct: 289 CRGLFPKPKS----LSCRYNSTTTPFLRLAPLKLEEISHDPYIVMYHNVLSDSEIEEMKQ 344

Query: 112 MA--QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
           ++        AT +   T  L+I    ++++ WL E   P +ERI+RR+  MTG      
Sbjct: 345 LSVLMENGLSATNKPNNTEPLDI----VARAGWLVEAT-PFLERINRRITDMTGFDVLDM 399

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
             + + NYGIG +++PHYD+   G  +       G R+AT++FY SDVAQGGAT F  + 
Sbjct: 400 WAVLLANYGIGNYFKPHYDYMYGGRVSGEAVAELGERIATLIFYASDVAQGGATNFPDIQ 459

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +++ P+KG + FW+N+   G  D  + H+ CP + GS
Sbjct: 460 VAVQPQKGNSLFWYNMFDDGTPDPRSLHSVCPTIVGS 496


>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
 gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
          Length = 485

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 89/262 (33%), Positives = 129/262 (49%), Gaps = 26/262 (9%)

Query: 6   HQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQ 65
           H   +GNK    + LN+  ++++       + P++  T    Y   CRG       ++  
Sbjct: 224 HALMKGNKTNNSDLLNERAQIEELVGTAPKLRPSIRYTT--DYARGCRGQFVQQTNLI-- 279

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
             C+Y  R  P+LRL PLK E   ++P I+ + DV+   EI  ++++A P L+R TV + 
Sbjct: 280 --CKYKFRPSPFLRLAPLKMEVLVVKPFIVAFHDVLSPHEIGELQQLAMPLLKRTTVYDS 337

Query: 126 KTG-ELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
             G    +   R SK  WL    + + +RI RR+  MTG     +  LQV+NYG+ GHY 
Sbjct: 338 NAGLHGSVKGTRTSKGIWLSRSHNNLTKRIGRRISDMTGFHLEGSTSLQVMNYGLSGHYA 397

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
            H D+    E                   +SDV QGG TVF  +  +  PE+G A  W+N
Sbjct: 398 LHTDYFNTAE-------------------LSDVEQGGDTVFPRIEQAFKPERGKALLWYN 438

Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
           LH +G GD  T H ACPVL GS
Sbjct: 439 LHRNGTGDKRTEHGACPVLVGS 460


>gi|312385412|gb|EFR29925.1| hypothetical protein AND_00803 [Anopheles darlingi]
          Length = 468

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 76/169 (44%), Positives = 111/169 (65%), Gaps = 3/169 (1%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
           K+  LCRG+     + +A+L+CRYV   VP+L++ PLK EE  L P I++Y  V+ D+EI
Sbjct: 279 KFYSLCRGESPRTASEMAKLRCRYVSNRVPFLKIAPLKLEEVSLDPFIVVYHQVISDNEI 338

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
             I ++++  LRRA V +    + E++  R S +AWL +P HP +  +SRR E MTGLT 
Sbjct: 339 KTIIEISRDSLRRAMVGD--VAKQEVSKARTSSNAWLDDPMHPHVRSLSRRTEDMTGLTM 396

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEA-NAFKSLGTGNRVATVLFYM 214
             AE+LQV NYGIGGHY PH+D+  P E    + ++  GNR+ATV++Y+
Sbjct: 397 WAAEQLQVGNYGIGGHYLPHFDYGTPEEGVELYPNIEKGNRIATVMYYV 445


>gi|195575105|ref|XP_002105520.1| GD21524 [Drosophila simulans]
 gi|194201447|gb|EDX15023.1| GD21524 [Drosophila simulans]
          Length = 448

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 84/200 (42%), Positives = 116/200 (58%), Gaps = 7/200 (3%)

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
           +L C Y      +LRL PLK E   L P ++L+ DV+ D +I  I+ MA+ RL RA   +
Sbjct: 256 KLYCLYNTTASYFLRLAPLKMELLSLDPYMVLFHDVVSDKDIVSIRNMAKGRLARAVTVS 315

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
            K G       R +K  WL E    +I+R+S+  + MT      A+  QV+NYGIGG Y 
Sbjct: 316 -KDGNYTEDPDRTTKGTWLVE-NSKLIQRLSQLTQDMTNFEIHDADPFQVLNYGIGGFYG 373

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
            H DF    E + F      +R+AT +FY+SDV QGGAT+F  L LS++P+KG+A  W+N
Sbjct: 374 IHLDFLGEAELDNF-----SDRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLWYN 428

Query: 245 LHSSGDGDYYTRHAACPVLT 264
           L   GDGD  T H+ACP ++
Sbjct: 429 LDHKGDGDNRTAHSACPTVS 448


>gi|195452728|ref|XP_002073474.1| GK14137 [Drosophila willistoni]
 gi|194169559|gb|EDW84460.1| GK14137 [Drosophila willistoni]
          Length = 536

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 98/270 (36%), Positives = 142/270 (52%), Gaps = 23/270 (8%)

Query: 4   PTHQRAQGNK-LYYQEALN-KSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
           PTH   Q  K L ++E L+ K  E+  +P            T    Y  LC+G     P 
Sbjct: 256 PTHSAQQTRKYLLHREMLSTKKVEVASDP------------TWHANYTRLCQGHRLPEPF 303

Query: 62  IVAQLKCRY-VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID-LIKKMAQPRLRR 119
               L C     R+V ++ L PLK E+ ++ P I +Y  V+ D++I+ ++++  Q  + R
Sbjct: 304 TGKSLHCYLDAKRHVSFI-LAPLKVEQVHVDPDINVYHGVLNDAQIEKILQESDQNEMMR 362

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
           + V   K G   IA+ R+S+  WL     P++  +S  +  ++G   + AE++QV NYG+
Sbjct: 363 SAVSGDK-GSATIADLRVSQQTWLNYSS-PIMRSLSNLISDISGFDMAGAEQMQVANYGV 420

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GG YEPH D+        FK    G+R++T +FY+SDV  GG TVF  LN+ L P KG  
Sbjct: 421 GGQYEPHPDYFEVNLPQEFK----GDRISTSMFYLSDVELGGNTVFIKLNVFLPPIKGAM 476

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
             WHNLH S D D  T HA CPVL GS  +
Sbjct: 477 VMWHNLHYSLDVDRRTIHAGCPVLIGSKRI 506


>gi|328718395|ref|XP_003246475.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
           pisum]
          Length = 518

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 85/203 (41%), Positives = 119/203 (58%), Gaps = 7/203 (3%)

Query: 67  KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYK 126
           KCRY   N+ Y  LMP KEE+   +P I +Y DV+YD EI  IK +A   ++ ATV++  
Sbjct: 294 KCRYQTNNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALENMKDATVKSVD 353

Query: 127 -TGELEIANYRISKSAWLREPEH-PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
             G+  I   R  +  W+ + +    ++ +  R+E  TG +T TAE+ Q+VNYG+GGHY 
Sbjct: 354 GKGDSLIEKTRSGQVYWISKVDAVEYLDALDTRIESFTGFSTKTAEQYQIVNYGLGGHYL 413

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
           PH+D      A A   L  GNR+ TVLFY++DV   G T F  LN+    EKG A  W+N
Sbjct: 414 PHHD----SFAKAINCLQFGNRLVTVLFYLTDVQNDGYTSFPLLNIIAPAEKGAALVWNN 469

Query: 245 LH-SSGDGDYYTRHAACPVLTGS 266
           LH S+G   Y + H +CP+L G+
Sbjct: 470 LHMSNGQKFYESLHGSCPLLKGN 492


>gi|156370183|ref|XP_001628351.1| predicted protein [Nematostella vectensis]
 gi|156215325|gb|EDO36288.1| predicted protein [Nematostella vectensis]
          Length = 478

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 95/240 (39%), Positives = 131/240 (54%), Gaps = 7/240 (2%)

Query: 2   IFPTHQRAQGNKLYYQEALNK-SPELKDEPPKVNNVAPTLEVTEREKYEMLCRGD-LTVP 59
           I P H+  + +  +Y +A+    PEL     K N     +E      Y  LCRG+ + V 
Sbjct: 241 IDPRHKSVRESIEHYSKAVKHGEPELSYPQIKANEQRLNIEYFVNSDYSKLCRGEPIKVR 300

Query: 60  PAIVAQLK---CRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
              V   K   C Y +R    L L P K E+    PR++++R ++ D E   IK++A P 
Sbjct: 301 HFQVMSAKSYHCWYDNRGDARLLLKPNKVEQVNDDPRVVIFRGLVTDRETARIKQIASPM 360

Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
           L RATV N  TG LE A+YR+SKSAWL +     I  +++R+  +TGL   TAE+LQ+ N
Sbjct: 361 LNRATVYNIDTGVLEYADYRVSKSAWLEDHLDETIATVNKRIAMVTGLDVQTAEKLQIAN 420

Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
           YG+GG YE H D   P    A   L  GNR+AT+L Y++DVA GGATVF    + + P K
Sbjct: 421 YGMGGQYEQHTDHGEPDSPLANDPL--GNRIATLLIYLNDVALGGATVFLKAGVHVPPTK 478


>gi|195499025|ref|XP_002096772.1| GE25857 [Drosophila yakuba]
 gi|194182873|gb|EDW96484.1| GE25857 [Drosophila yakuba]
          Length = 490

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/239 (36%), Positives = 137/239 (57%), Gaps = 18/239 (7%)

Query: 28  DEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEE 87
           D  P+   VAP+  VT  E+ E+    + TV     ++L CRY     P+ R+ PLK EE
Sbjct: 239 DNKPE--EVAPSHGVTHIEE-ELATVQNCTVVVQKPSRLHCRYNSTTTPFTRIAPLKMEE 295

Query: 88  AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
             L P ++++ DV+YD EI+L+   +   L         T   + +  R SK +++ E +
Sbjct: 296 LSLDPYMVVFHDVIYDREIELMLNSSNFILSL-------TDSGQESEVRASKDSYIVESK 348

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRV 207
                 ++ RV  MTGL+   ++   ++NYGIGGHY  HYD+ +       K    G+R+
Sbjct: 349 -----TLNDRVTDMTGLSMELSDPFSLINYGIGGHYMLHYDYHKYTNTTRAK---YGDRI 400

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           AT+LFY+ +V  GGAT+F  +N+++ P+KG+A FW+NLH+SG     T H+ACPV++GS
Sbjct: 401 ATLLFYLGEVDSGGATIFPRINITVTPKKGSAVFWYNLHNSGALHLETLHSACPVISGS 459


>gi|313243209|emb|CBY39868.1| unnamed protein product [Oikopleura dioica]
          Length = 430

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 90/228 (39%), Positives = 132/228 (57%), Gaps = 11/228 (4%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPY--LRLMPLKEEEAYLQPRIILYRDVMYD 103
           ++YE LCR      P   + LKC Y     P   L+  P+K EE +  P ++ + +V+ D
Sbjct: 177 QEYERLCR---EFSPPHKSNLKCFYWTGPSPVSPLQWAPVKTEELHDDPLVVQFYEVISD 233

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE----HPVIERISRRVE 159
            E   I+ +A   L RAT+Q+  TG+L  A+YRI K+AWL E +    +  I + + ++ 
Sbjct: 234 EEERAIQFLAGEHLNRATIQDPATGKLVNADYRIQKTAWLTEFDKFDVNGTIAKYNAKLT 293

Query: 160 HMTGLTTSTAEELQVVNYGIGGHYEPHYDF-ARPGEANAFKSLGTGNRVATVLFYMSDVA 218
            +TGL    AE +QV NYG+ G YEPH+D  + PG  N +  +  G+R+AT L YMS+  
Sbjct: 294 KITGLDADHAELVQVGNYGVAGQYEPHWDHQSYPGAENRWDPI-EGSRIATWLAYMSEPN 352

Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            GG TVF    +   P + +A FW+NL  SG+ D  T+HAACPVL+G+
Sbjct: 353 MGGGTVFIQAGIQARPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGT 400


>gi|26352077|dbj|BAC39675.1| unnamed protein product [Mus musculus]
          Length = 383

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 101/268 (37%), Positives = 139/268 (51%), Gaps = 32/268 (11%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
           P ++R   N L Y+  L ++  ++  E        P L+   R+ YE LC+   + P   
Sbjct: 118 PDNKRMARNVLKYERLLAENGHQMAAETAIQRPNVPHLQT--RDTYEGLCQTLGSQPTHY 175

Query: 63  -VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
            +  L C Y   + PYL L P ++E  +L+P I LY D + D E   I+++A+P L+R+ 
Sbjct: 176 QIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSV 235

Query: 122 VQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQVVNYG 178
           V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQVVNYG
Sbjct: 236 V---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 292

Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           IGGHYEPH+D A                       +S V  GGAT F   N S+   K  
Sbjct: 293 IGGHYEPHFDHAT----------------------LSSVEAGGATAFIYGNFSVPVVKNA 330

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           A FW NLH SG+GD  T HA CPVL G 
Sbjct: 331 ALFWWNLHRSGEGDGDTLHAGCPVLVGD 358


>gi|194904100|ref|XP_001981000.1| GG23922 [Drosophila erecta]
 gi|190652703|gb|EDV49958.1| GG23922 [Drosophila erecta]
          Length = 490

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 87/241 (36%), Positives = 138/241 (57%), Gaps = 19/241 (7%)

Query: 26  LKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKE 85
           L+++P ++    P+ EV   E+ E+    + T      ++L CRY     P+ R+ PLK 
Sbjct: 238 LENKPEEI---FPSHEVIHFEE-ELATVQNCTAVVQKPSRLHCRYNSSTTPFTRIAPLKM 293

Query: 86  EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE 145
           EE    P +++Y DV+YDSEIDL+   +   L         T   + +  R SK +++ +
Sbjct: 294 EELSSDPYMVVYHDVIYDSEIDLMLNASNFSLSL-------TNSGQKSEVRASKDSYIVD 346

Query: 146 PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
            +      ++ RV  MTGL+   ++   ++NYGIGGHY  HYD+    E +       G+
Sbjct: 347 SK-----TLNDRVTDMTGLSMEMSDPFSMINYGIGGHYMLHYDY---HEYSNMTREKYGD 398

Query: 206 RVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           R+ATVLFY+ +V  GGAT+F  +N+++ P+KG+A FW+NLH+SG     T H+ACPV++G
Sbjct: 399 RIATVLFYLGEVHSGGATIFPRINITVTPKKGSAVFWYNLHNSGAMHSETLHSACPVISG 458

Query: 266 S 266
           S
Sbjct: 459 S 459


>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
 gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
          Length = 536

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 89/250 (35%), Positives = 139/250 (55%), Gaps = 24/250 (9%)

Query: 31  PKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYL 90
           P    + P L+    E++  +CR      P+   +L CRY     P+LRL PL+ EE  L
Sbjct: 278 PVSEEMKPILD----EEFNQICRSSHQNKPS---RLHCRYNATTTPFLRLAPLRMEELSL 330

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE---IANYRISKSAWL-REP 146
            P I++Y +V+ D+EI  ++++A+P L+   V     GE++    +  R +  AW+  E 
Sbjct: 331 DPYIVVYHNVLSDAEIAKVERVAEPLLKSIGV-----GEMDNSKKSKVRTALGAWIPDEN 385

Query: 147 EH----PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG 202
            H    PVI+RI RR+  MTGL     + +Q++ YG GGHY+ H+D+          +  
Sbjct: 386 MHISGWPVIQRIVRRIHDMTGLIIKRGQVVQLIKYGYGGHYDTHFDYLNDSLP---ITQA 442

Query: 203 TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACP 261
            G+R+ATVLFY++DV  GG+TVF  L L +  E+G    W+N+H  + D D  T H +CP
Sbjct: 443 LGDRMATVLFYLNDVKHGGSTVFPVLQLKVPSERGKVLVWYNMHGETHDLDSRTLHGSCP 502

Query: 262 VLTGSNSLHS 271
           V+ G+ ++ S
Sbjct: 503 VIDGAKTVLS 512


>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
 gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
           adhaerens]
          Length = 495

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 88/251 (35%), Positives = 139/251 (55%), Gaps = 17/251 (6%)

Query: 26  LKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQ-------LKCRYVHRNVPYL 78
            +D   ++     T  + E  ++  LCRG++     I+         LKC Y +++ P L
Sbjct: 227 FQDYVKRLGRADSTRRLAENTEFGNLCRGNVKEVNYILCSILLANKTLKCYYSNQS-PLL 285

Query: 79  RLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRIS 138
            L P+  EE  L P I++Y D++ D +I+ IKK++  +  ++         ++    ++S
Sbjct: 286 YLAPIPVEEISLDPFIVIYYDIINDHQIETIKKISPSKSNKSPNHAMLCSGIKSEATQVS 345

Query: 139 K---SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
               S WL +   PV+E+ISR  + +T L  + AE+LQV NYGIGGHY PHYD       
Sbjct: 346 IFCCSTWLEDAYDPVVEKISRLTQELTHLDVNYAEDLQVANYGIGGHYVPHYDSTIIAPE 405

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYT 255
           +  +      R+AT++FY+S+V  GGAT+F  L +++ P+KG+A FW NL  +G  +  T
Sbjct: 406 DPLQ------RLATMMFYLSNVEIGGATIFPRLGVAVRPQKGSALFWINLKRNGLTNRQT 459

Query: 256 RHAACPVLTGS 266
            HAACPV+ GS
Sbjct: 460 LHAACPVVIGS 470


>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
 gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
          Length = 509

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 85/204 (41%), Positives = 115/204 (56%), Gaps = 7/204 (3%)

Query: 63  VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
            A+L C Y      +LRL PLK E   L P ++L+ DV+ D +I  I+ +A+  L RA V
Sbjct: 290 TAKLHCLYNTTASHFLRLAPLKMELLSLDPYVVLFHDVVSDQDILSIRNLAKGGLARA-V 348

Query: 123 QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGH 182
              + G  +    R +K  WL E    +I+R+S+  + MT      A+  QV+NYGIGG 
Sbjct: 349 TVTQDGNDKEDPARTTKGTWLVE-NSKLIQRLSQLSQDMTNFDVRDADPFQVLNYGIGGF 407

Query: 183 YEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
           Y  H+DF    E   F      +R+AT +FY+SDV QGGAT F  L LS++PEKG A  W
Sbjct: 408 YGTHFDFLEDTEMGHF-----SDRIATAVFYLSDVPQGGATTFPDLGLSVFPEKGAALLW 462

Query: 243 HNLHSSGDGDYYTRHAACPVLTGS 266
           +NL   G GD  T H+ACP + GS
Sbjct: 463 YNLDHKGVGDNRTAHSACPTIVGS 486


>gi|195392288|ref|XP_002054791.1| GJ24631 [Drosophila virilis]
 gi|194152877|gb|EDW68311.1| GJ24631 [Drosophila virilis]
          Length = 499

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 82/216 (37%), Positives = 126/216 (58%), Gaps = 7/216 (3%)

Query: 51  LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
           LCRG     P + + L+CRY   + P+LRL PLK E+  L P ++LY DV+  +E + I 
Sbjct: 266 LCRGHSL--PLVSSSLRCRYNTASAPFLRLAPLKLEQLSLDPYMVLYHDVVQANEREHIM 323

Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
           ++A+P LRRA V     G     + R + +A     +    +R+ +R+E M+G   + + 
Sbjct: 324 QLAKPHLRRALV-----GAARAHSQRFAMNAGFSYNDSRQGQRLRQRLEDMSGFDLTNSG 378

Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
           +L V+NYGIGG Y  HYD     +  A  +    NR+AT+L Y++DV  GG T F +L L
Sbjct: 379 QLAVLNYGIGGQYYMHYDCWFSQDDAAQVASIKDNRIATILLYLTDVQLGGLTSFPALGL 438

Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           ++ P  G+A  WHN++++ + D  T HAACP+L G+
Sbjct: 439 AVQPSPGSALIWHNMNNAAECDRRTLHAACPLLLGT 474


>gi|15077349|gb|AAK83137.1| prolyl 4-hydroxylase alpha subunit [Cavia porcellus]
          Length = 141

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 75/141 (53%), Positives = 102/141 (72%), Gaps = 2/141 (1%)

Query: 48  YEMLCRGD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           YEMLCRG+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+E
Sbjct: 1   YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 60

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           I+++K +A+PRLRRAT+ N  TG+LE  +YRISKSAWL   E+PV+ RI+ R++ +TGL 
Sbjct: 61  IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPVVSRINMRIQDLTGLD 120

Query: 166 TSTAEELQVVNYGIGGHYEPH 186
            STAEELQV NYG+GG YEPH
Sbjct: 121 VSTAEELQVANYGVGGQYEPH 141


>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
 gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 87/250 (34%), Positives = 138/250 (55%), Gaps = 24/250 (9%)

Query: 31  PKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYL 90
           P    + P L+    E++  +CR      P+   +L CRY     P+LRL PL+ EE  L
Sbjct: 272 PVSEEMKPILD----EEFNQICRSSHQNKPS---RLHCRYNATTTPFLRLAPLRMEELSL 324

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE---IANYRISKSAWLREPE 147
            P I++Y +V+ D+EI  ++++A+P L+   V     GE++    +  R +  AW+ +  
Sbjct: 325 DPYIVVYHNVLSDAEIAKVERVAEPLLKSIGV-----GEMDNSKKSKVRTALGAWIPDKN 379

Query: 148 H-----PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG 202
                 PVI+RI RR+  MTGL     + +Q++ YG GGHY+ H+D+          +  
Sbjct: 380 MHISGWPVIQRIVRRIHDMTGLIIKHGQVVQLIKYGYGGHYDTHFDYLNDSLP---ITQA 436

Query: 203 TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACP 261
            G+R+ATVLFY++DV  GG+TVF  L L +  E+G    W+N+H  + D D  T H +CP
Sbjct: 437 LGDRMATVLFYLNDVKHGGSTVFPVLKLKVPSERGKVLVWYNMHGETHDLDSRTLHGSCP 496

Query: 262 VLTGSNSLHS 271
           V+ G+ ++ S
Sbjct: 497 VIDGAKTVLS 506


>gi|198449506|ref|XP_002136910.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
 gi|198130637|gb|EDY67468.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
          Length = 543

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 90/269 (33%), Positives = 146/269 (54%), Gaps = 17/269 (6%)

Query: 11  GNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRG------DLTVPPAIVA 64
           G+K +  +A +     ++ PP+ +    +  +TE  K+  LCR       D +   +  A
Sbjct: 249 GDKTFGNKAYHIVSHFQEHPPQQSINIGSRGITE--KFNRLCRSMSRRKTDGSAAHSKPA 306

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
           +L CRY      +LRL PL+ EE  L P I+LY +V+ D E+  ++ M+ P L RA + +
Sbjct: 307 RLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARIFD 366

Query: 125 YKTGELEIANYRISKSAWLREP-----EHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
            +T + +I+  R +    +  P     +  ++E I +R+  +TGL  ++   +Q + YG 
Sbjct: 367 KETKKPKISPVRSADEVGIPNPKLVTEDIQLVECIQKRITDLTGLMLTSMRRIQFLKYGF 426

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GG Y PH+DF      +   S   G+R+ATV+FY++DV  GGAT F +L+L +  E+G  
Sbjct: 427 GGIYVPHHDFF---SVHTPTSRLHGDRIATVIFYLNDVEHGGATAFPNLDLVVPTERGAV 483

Query: 240 AFWHNLH-SSGDGDYYTRHAACPVLTGSN 267
            FWHN+   + D DY T H ACPV+ G+ 
Sbjct: 484 LFWHNMDGETYDLDYRTLHGACPVIVGTK 512


>gi|291224083|ref|XP_002732036.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit-like [Saccoglossus
           kowalevskii]
          Length = 491

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 94/246 (38%), Positives = 139/246 (56%), Gaps = 23/246 (9%)

Query: 26  LKDEPPKVNNVAP-TLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLK 84
           LKD+P    +V P +  + +R+ YE LCRG+    P   +++KC+YV      L L P K
Sbjct: 240 LKDKP----SVRPNSTYLDDRDAYEALCRGERR-KPLDSSKVKCQYVTNGNYRLLLQPAK 294

Query: 85  EEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV----QNYKTGELEIANYRISKS 140
           +E  +  PR++LY DV+ D EI+ + K+A+P+LRR+ V     +        A YR+S  
Sbjct: 295 QEIMHHNPRVVLYHDVISDEEINEVIKLAKPKLRRSLVVTKGSSPSGTGSSDAEYRVSSG 354

Query: 141 AWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKS 200
            WL + +  VI +++RR+  ++GL+T TA E +        H E     A   E +    
Sbjct: 355 GWLEDWDGTVIAKLTRRISDISGLSTLTAPEYR--------HAE-----ALQIENSDVHL 401

Query: 201 LGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAAC 260
            G+ NR+AT +FYMS+V  GG TVF  ++  + P K  A FW+NL +SG+ D  TRHA C
Sbjct: 402 PGSRNRIATWMFYMSEVKAGGYTVFPEVDAFVPPVKNAAVFWYNLKASGESDDLTRHAGC 461

Query: 261 PVLTGS 266
           PVL GS
Sbjct: 462 PVLIGS 467


>gi|195159321|ref|XP_002020530.1| GL13464 [Drosophila persimilis]
 gi|194117299|gb|EDW39342.1| GL13464 [Drosophila persimilis]
          Length = 533

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 85/223 (38%), Positives = 123/223 (55%), Gaps = 7/223 (3%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           Y  LC+G           L+C    +   Y  L PL+ E+ +L P I +Y  ++   +ID
Sbjct: 287 YSRLCQGRRLPEKGSGTSLRCFLDGKRHAYFTLAPLQVEQVHLDPDIDVYHGILTLDQID 346

Query: 108 LIKKMAQPR-LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
            I + A  + + R+ V     G   + + R+S+  WL + E P+++ I+R V  ++G   
Sbjct: 347 SIFEAADKQEMTRSGVAG-DGGTRTVVDLRVSQQTWL-DYESPIMKSIARLVVFISGFDI 404

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
           + AE +QV NYG+GG YEPH D+      + FK    G+R++T +FY+SDV QGG TVFT
Sbjct: 405 AGAEAMQVANYGVGGQYEPHPDYFEVNLPSDFK----GDRISTSMFYLSDVEQGGYTVFT 460

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
            LN+ L P KG    WHNLH S D D  T HA CPV+ GS  +
Sbjct: 461 KLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIVGSKRI 503


>gi|195128345|ref|XP_002008624.1| GI13596 [Drosophila mojavensis]
 gi|193920233|gb|EDW19100.1| GI13596 [Drosophila mojavensis]
          Length = 527

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 86/225 (38%), Positives = 129/225 (57%), Gaps = 14/225 (6%)

Query: 45  REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           +E Y + CRG    P      L CRY     P+LRL P K EE  L P I+LY +V+ DS
Sbjct: 286 QEPYYLGCRG--GYPKR--TNLHCRYNTTTTPFLRLAPFKMEEVSLDPYIVLYHNVISDS 341

Query: 105 EIDLIKKMAQPR---LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHM 161
           EI+ IK+ A      L R  + N  T + +I    +++  W+ E   P  +RI+ R+  +
Sbjct: 342 EIEDIKQHATNFTNGLSRNPLLNV-TDKPQI----VARMQWV-EKMTPFTDRINLRITDI 395

Query: 162 TGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
           TG      + +Q+ NYGIGGH+ PH+D+   G  +   + G G+R AT++FY SD+ QGG
Sbjct: 396 TGFGVDECKTVQIANYGIGGHFIPHFDYTTEGRVSINDTFGIGDRTATIVFYASDM-QGG 454

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           ATVF ++ +++ P+KG+A  W+NL      +  T H+ CPV++GS
Sbjct: 455 ATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLTLHSVCPVISGS 499


>gi|241598357|ref|XP_002404731.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215500462|gb|EEC09956.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 218

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 85/209 (40%), Positives = 116/209 (55%), Gaps = 30/209 (14%)

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           +QL+CRY      +L L  +K EE  L+P II+  DV+ D +I+ + + A+PRL R+T  
Sbjct: 3   SQLRCRYYKGQDGFLALQQIKLEEMNLKPYIIVMHDVVQDKDIEKLMEFAEPRLERSTT- 61

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
            Y   E+     R S +AWL E E P+                       + NYG GGH+
Sbjct: 62  -YNGSEVMPTPQRTSSTAWLNEDEAPI----------------------ALANYGTGGHF 98

Query: 184 EPHYDF------ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
            PH+DF      A    A+ +   G G+R+AT++ YM+DV  GGATVF SL + L P+KG
Sbjct: 99  LPHHDFFQDSLNAYNSSADYYLQHGRGDRIATLMIYMTDVEAGGATVFPSLGIRLTPKKG 158

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            AAFW NL +SG+G+  T HA CPVL GS
Sbjct: 159 DAAFWWNLKASGEGERLTMHAGCPVLYGS 187


>gi|198449650|ref|XP_001357661.2| GA13747 [Drosophila pseudoobscura pseudoobscura]
 gi|198130701|gb|EAL26795.2| GA13747 [Drosophila pseudoobscura pseudoobscura]
          Length = 533

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 85/223 (38%), Positives = 123/223 (55%), Gaps = 7/223 (3%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           Y  LC+G           L+C    +   Y  L PL+ E+ +L P I +Y  ++   +ID
Sbjct: 287 YSRLCQGRRLPEKGSGTSLRCFLDGKRHAYFTLAPLQVEQVHLDPDIDVYHGILTLDQID 346

Query: 108 LIKKMAQPR-LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
            I + A  + + R+ V     G   + + R+S+  WL + E P+++ I+R V  ++G   
Sbjct: 347 SIFEAADKQEMTRSGVAG-DGGTRTVVDLRVSQQTWL-DYESPIMKSIARLVVFISGFDI 404

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
           + AE +QV NYG+GG YEPH D+      + FK    G+R++T +FY+SDV QGG TVFT
Sbjct: 405 AGAEAMQVANYGVGGQYEPHPDYFEVNLPSDFK----GDRISTSMFYLSDVEQGGYTVFT 460

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
            LN+ L P KG    WHNLH S D D  T HA CPV+ GS  +
Sbjct: 461 KLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIVGSKRI 503


>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
 gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
          Length = 502

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 85/218 (38%), Positives = 132/218 (60%), Gaps = 11/218 (5%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG+   P      L C Y  ++ P+L L P K E     P + +Y DV+YD EI+ +K+
Sbjct: 251 CRGEYEHPKG----LSCYYDSKDEPFLFLAPFKVEILNNLPFVAIYHDVLYDREIEELKR 306

Query: 112 MAQPRLRRATVQNY-KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT--ST 168
           +A P + R+T+ +Y K G + + N+R S S +L      +++ + +RV  MT L    ++
Sbjct: 307 LAVPTITRSTIYDYDKEGNVPV-NFRTSNSVFLLNNASYLVDILRQRVADMTHLNVFKNS 365

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           +++LQV+NYG+GG+Y  H+DF    E+    +   G+R+ TVL YM+DV QGGATVF +L
Sbjct: 366 SDDLQVMNYGLGGYYRYHFDFFGKDES---PNKLLGDRIITVLIYMTDVQQGGATVFPAL 422

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            ++ +P+KG+A  + NL ++   D  T HA CPVL GS
Sbjct: 423 RITNFPKKGSALIFRNLDNNISPDPSTLHAGCPVLFGS 460


>gi|403274090|ref|XP_003928822.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saimiri
           boliviensis boliviensis]
          Length = 149

 Score =  150 bits (380), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 67/95 (70%), Positives = 79/95 (83%)

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           LQV NYG+GG YEPH+DFAR  E +AFK LGTGNR+AT LFYMSDV+ GGATVF  +  S
Sbjct: 30  LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGAS 89

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +WP+KGTA FW+NL +SG+GDY TRHAACPVL G+
Sbjct: 90  VWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGN 124


>gi|195159148|ref|XP_002020444.1| GL13996 [Drosophila persimilis]
 gi|194117213|gb|EDW39256.1| GL13996 [Drosophila persimilis]
          Length = 559

 Score =  150 bits (380), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 90/273 (32%), Positives = 147/273 (53%), Gaps = 23/273 (8%)

Query: 10  QGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTER---EKYEMLCRG------DLTVPP 60
           +G+K +  +A +     ++ PP+      ++ +  R   EK+  LCR       D +   
Sbjct: 264 RGDKTFGDKAYHIVSHFQEHPPQ-----QSINIGSRGFTEKFNRLCRSMSRRKTDGSAAH 318

Query: 61  AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
           +  A+L CRY      +LRL PL+ EE  L P I+LY +V+ D E+  ++ M+ P L RA
Sbjct: 319 SKPARLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRA 378

Query: 121 TVQNYKTGELEIANYRISKSAWLREP-----EHPVIERISRRVEHMTGLTTSTAEELQVV 175
            + + +T + +I+  R +    +  P     +  ++E I +R+  +TGL  ++   +Q +
Sbjct: 379 RIFDKETKKPKISPVRSADEVGIPNPKLVTGDIQLVECIQKRITDLTGLMLTSMRRIQFL 438

Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
            YG GG Y PH+DF      +   S   G+R+ATV+FY++DV  GGAT F +L+L +  E
Sbjct: 439 KYGFGGIYVPHHDFF---SVHTPTSRLHGDRIATVIFYLNDVEHGGATAFPNLDLVVPTE 495

Query: 236 KGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSN 267
           +G   FWHN+   + D DY T H ACPV+ G+ 
Sbjct: 496 RGAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 528


>gi|194765172|ref|XP_001964701.1| GF23326 [Drosophila ananassae]
 gi|190614973|gb|EDV30497.1| GF23326 [Drosophila ananassae]
          Length = 885

 Score =  150 bits (380), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 79/201 (39%), Positives = 113/201 (56%), Gaps = 19/201 (9%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
           L C Y  +  P+LRL P+K E     P I ++ DV+Y  E+  I+   +  L  +T  NY
Sbjct: 675 LYCLYNTKTSPFLRLAPIKTELLSKDPYIAIFHDVVYPKELTRIRTACKSHLIASTTINY 734

Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
            +    + +YR SKS W+    + + +RI+  V   TGL  +T+E  QV+NYGIGG +E 
Sbjct: 735 TSNAYSVDSYRTSKSVWIPTDSNNLTQRITNLVGDATGLEMTTSEMFQVINYGIGGLFEA 794

Query: 186 HYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL 245
           H D   P  +NA                +SDV QGGAT+FT LNL+++P+ G+A FW+NL
Sbjct: 795 HMD---PVLSNA----------------LSDVEQGGATIFTKLNLTVFPQSGSALFWYNL 835

Query: 246 HSSGDGDYYTRHAACPVLTGS 266
            + G+ D  T HA CPV+ GS
Sbjct: 836 DNWGNEDKRTEHAGCPVIVGS 856



 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 64/217 (29%), Positives = 105/217 (48%), Gaps = 20/217 (9%)

Query: 16  YQEALNKSP---ELKDEPP-------KVNNVAPTLEVTER----EKYEMLCRGDLTVPPA 61
           YQ AL KSP   +L +E         K+  + P +E  +     E + + C G       
Sbjct: 244 YQVALKKSPPDAKLYEEHQYLESMYLKLFGLDPNIEEYDNSYKSEVFSLCCNGKCQKDKK 303

Query: 62  IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
           I   L C Y  +    L + P+K+E   + P I L+ DV+   E  +++ +++  L  +T
Sbjct: 304 I-QNLYCFYDTKTSNALIIAPVKKEILSVDPYIALFHDVISQKEQKILQSVSKIHLMAST 362

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
             +       + NYRISKS W     + V +R++  +E  TG    ++E  QV+NYG+GG
Sbjct: 363 TIH--NNNKAVKNYRISKSVWYASDYNDVTKRLTTFMEQATGYDMKSSELFQVINYGLGG 420

Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVA 218
            ++ H D+    +    +  GT +R+AT LFY S +A
Sbjct: 421 RFDGHEDYLLTDKT---RFNGTSDRIATTLFYESTLA 454


>gi|198449641|ref|XP_002136935.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
 gi|198130697|gb|EDY67493.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
          Length = 508

 Score =  150 bits (379), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 84/203 (41%), Positives = 115/203 (56%), Gaps = 7/203 (3%)

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           ++L C Y      +LRL PLK E   L P ++LY DV+ D E+ L+K MAQ  L RA+  
Sbjct: 291 SRLYCLYNTTATAFLRLAPLKMELLSLDPYVVLYHDVLADREMSLLKSMAQKDLVRASTY 350

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
           +    +      R +K+ WL +P H +I R+    E MT L     E+ QV+NYGIGGH 
Sbjct: 351 DVMDKKHSEDPNRTTKARWL-DPSHSLIRRMGILTEDMTNLDLERLEDFQVLNYGIGGHD 409

Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
           + H D+               +RVAT+LFY+SDV  GGATVF  L+LS++P++G    W+
Sbjct: 410 DIHPDYYEGSNPE------LPDRVATLLFYLSDVPLGGATVFPLLDLSVFPKRGAVLMWY 463

Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
           NL   G G   T H+ACPV+ GS
Sbjct: 464 NLDHKGQGIEKTVHSACPVVVGS 486


>gi|328718393|ref|XP_001945742.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Acyrthosiphon pisum]
          Length = 511

 Score =  150 bits (378), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 87/209 (41%), Positives = 121/209 (57%), Gaps = 19/209 (9%)

Query: 67  KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYK 126
           KCRY   N+ Y  LMP KEE+   +P I +Y DV+YD EI  IK +A  +++ A V++  
Sbjct: 291 KCRYQTNNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALEKMKDAKVKS-- 348

Query: 127 TGELEIANY------RISKSAWLREPEH-PVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
              ++  NY      R  +  W+ E +     + ++ R+E  TG +T TAE  Q+VNYG+
Sbjct: 349 ---VDGKNYLLEEKTRSGQVYWIFEVDAVEYFDALNTRIESFTGFSTKTAERYQIVNYGL 405

Query: 180 GGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           GGHY PH+D FA+  E   F     GNR+ TVLFY++DV   G T F  LN+    EKG 
Sbjct: 406 GGHYIPHHDSFAKGAENVKF-----GNRLVTVLFYLTDVQNDGYTSFPMLNIIAPAEKGA 460

Query: 239 AAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
           A  W+NLH S+G   Y T H +CP+L G+
Sbjct: 461 ALVWNNLHMSNGQKFYETLHGSCPLLKGN 489


>gi|328718391|ref|XP_003246474.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Acyrthosiphon pisum]
          Length = 514

 Score =  150 bits (378), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 87/209 (41%), Positives = 121/209 (57%), Gaps = 19/209 (9%)

Query: 67  KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYK 126
           KCRY   N+ Y  LMP KEE+   +P I +Y DV+YD EI  IK +A  +++ A V++  
Sbjct: 294 KCRYQTNNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALEKMKDAKVKS-- 351

Query: 127 TGELEIANY------RISKSAWLREPEH-PVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
              ++  NY      R  +  W+ E +     + ++ R+E  TG +T TAE  Q+VNYG+
Sbjct: 352 ---VDGKNYLLEEKTRSGQVYWIFEVDAVEYFDALNTRIESFTGFSTKTAERYQIVNYGL 408

Query: 180 GGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           GGHY PH+D FA+  E   F     GNR+ TVLFY++DV   G T F  LN+    EKG 
Sbjct: 409 GGHYIPHHDSFAKGAENVKF-----GNRLVTVLFYLTDVQNDGYTSFPMLNIIAPAEKGA 463

Query: 239 AAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
           A  W+NLH S+G   Y T H +CP+L G+
Sbjct: 464 ALVWNNLHMSNGQKFYETLHGSCPLLKGN 492


>gi|195572619|ref|XP_002104293.1| GD18524 [Drosophila simulans]
 gi|194200220|gb|EDX13796.1| GD18524 [Drosophila simulans]
          Length = 472

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 78/218 (35%), Positives = 125/218 (57%), Gaps = 15/218 (6%)

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           E+  + + T      ++L CRY     P+ R+ PLK EE  L P ++++ DV+YD+EID 
Sbjct: 239 ELATKQNCTAVVQKPSRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEIDG 298

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           +       L  +      T   + +  R SK +++ + E      ++ RV  MTG +   
Sbjct: 299 M-------LNSSNFGLSLTDSGQKSEVRTSKDSYIVDSE-----SLNERVTDMTGFSMEM 346

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
           ++   ++NYG+GGHY  HYDF         K    G+R+ATVLFY+ +V  GGAT+F  +
Sbjct: 347 SDPFSLINYGLGGHYMLHYDFHEYTNTTRPKQ---GDRIATVLFYLGEVDSGGATIFPKI 403

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           N+++ P+KG+A FW+NLH+SG  +  + H+ACPV++GS
Sbjct: 404 NIAVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGS 441


>gi|260806889|ref|XP_002598316.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
 gi|229283588|gb|EEN54328.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
          Length = 531

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 95/246 (38%), Positives = 137/246 (55%), Gaps = 19/246 (7%)

Query: 45  REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAY-LQPRIILYRDVMYD 103
           R+KYE LCR  +    A  +   CRY  R  PY  L P+K E  +   P I L+ D++ +
Sbjct: 292 RDKYEELCRVGVLQNRAPRSSASCRYF-RPSPYFYLGPIKMEVLHETNPVIHLFHDIVSE 350

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
           SE   +++MA P+  R+ V     G+  I N R+S++AW  + + PV+ ++SRRV++ TG
Sbjct: 351 SEAARMREMAIPKFHRSVVVGDDGGDAIILN-RVSETAWHFDYDDPVVAKLSRRVDYATG 409

Query: 164 LTTS--TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
           L+T+  TAE  QVVNYG+GG Y PH D+         + +  GNRV T L Y+SDV  GG
Sbjct: 410 LSTAEGTAEAFQVVNYGLGGQYIPHTDYFEGDHVT--RHIQNGNRVVTFLLYLSDVDAGG 467

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC-------- 273
           ATVF  +++++ P    A FW ++  SG     + HA CPVL GS  + +          
Sbjct: 468 ATVFPIVDVAV-PINSAAVFW-SMERSGAVVPNSLHAGCPVLIGSKWIANKWIREHGNEF 525

Query: 274 --PCGL 277
             PCGL
Sbjct: 526 RRPCGL 531


>gi|328713119|ref|XP_003244997.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
           pisum]
          Length = 487

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 90/224 (40%), Positives = 122/224 (54%), Gaps = 12/224 (5%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
           ++  LC+  +++   +    KCRY   N+ Y  LMP KEE+   +P I +Y DV+YD EI
Sbjct: 249 EFRNLCKHGVSLR-TLTKYSKCRYQTNNLFYRILMPFKEEDINSEPFIKIYHDVLYDDEI 307

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIE---RISRRVEHMTG 163
             IK M+   +  A V   KT    I   R       R  E   IE    ++ R+E  TG
Sbjct: 308 LKIKTMSLANMSDAKV---KTSNDSILRERSRSGQVYRMNEVDAIEYFDALNTRIESFTG 364

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGAT 223
            +T TAE  Q+VNYG+GGHY PH+D  + G  N    +  GNR+ TVLFY++DV   G T
Sbjct: 365 FSTKTAERYQIVNYGLGGHYFPHFDTFKKGTEN----MEFGNRLVTVLFYLTDVQNDGYT 420

Query: 224 VFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
            F  LN+    EKG+A  W+NLH S G   Y + H ACP+L G+
Sbjct: 421 SFPMLNIIAPAEKGSALVWNNLHMSDGQLCYESLHGACPLLKGN 464


>gi|195572621|ref|XP_002104294.1| GD18523 [Drosophila simulans]
 gi|194200221|gb|EDX13797.1| GD18523 [Drosophila simulans]
          Length = 490

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 80/211 (37%), Positives = 121/211 (57%), Gaps = 31/211 (14%)

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           ++L CRY     P+ R+ PLK EE  L P ++++ DV+YD+EID                
Sbjct: 272 SRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEID---------------- 315

Query: 124 NYKTGELEIANYRISKSAWLREPE-------HPVIER-ISRRVEHMTGLTTSTAEELQVV 175
               G L  +N+ IS+S    + E       H V  + ++ RV  MTGL+   ++   ++
Sbjct: 316 ----GMLNSSNFGISESVSGLKSEVRTSKDSHIVDSKTLNERVTDMTGLSMEMSDPFSLI 371

Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
           NYG+GGH+  H+DF         K    G+R+ATVLFY+ +V  GGAT+F  LN+++ P+
Sbjct: 372 NYGLGGHFILHHDFHEYTNTTRLKQ---GDRIATVLFYLGEVDSGGATIFPMLNITVTPK 428

Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           KG+A FW+NLH+SG  +  T H+ACPV++GS
Sbjct: 429 KGSAVFWYNLHNSGAVNSKTLHSACPVISGS 459


>gi|390176836|ref|XP_003736216.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
 gi|388858809|gb|EIM52289.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
          Length = 567

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 90/269 (33%), Positives = 142/269 (52%), Gaps = 17/269 (6%)

Query: 11  GNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLC------RGDLTVPPAIVA 64
           G+K +  +A +     +  PP+ +        TE  K+  LC      + D +   +  A
Sbjct: 273 GDKTFGNKAYHIVSHFQKHPPQQSINMENGNFTE--KFNRLCSSMSRRKTDGSAAHSKPA 330

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
           +L CRY      +LRL PL+ EE  L P I+LY +V+ D E+  ++ M+ P L RA V +
Sbjct: 331 RLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARVFD 390

Query: 125 YKTGELEIANYRISKSAWLREP-----EHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
               + +I+  R +    +  P     +  ++ERI +R+  +TGL  ++   +Q + YG 
Sbjct: 391 SGIRKPKISPARTADEVQIPNPKLVAEDIQLVERIQKRMTDLTGLVLTSMRRIQFLKYGF 450

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GG Y PH+DF      +   S   G+R+ATV+FY++DV  GGAT F +L+L +  E+G  
Sbjct: 451 GGIYVPHHDFF---SVHTPTSRLHGDRIATVIFYLNDVEHGGATAFPNLDLVVPTERGAV 507

Query: 240 AFWHNLH-SSGDGDYYTRHAACPVLTGSN 267
            FWHN+   + D DY T H ACPV+ G+ 
Sbjct: 508 LFWHNMDGETYDLDYRTLHGACPVIVGTK 536


>gi|195330780|ref|XP_002032081.1| GM23710 [Drosophila sechellia]
 gi|194121024|gb|EDW43067.1| GM23710 [Drosophila sechellia]
          Length = 490

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 81/211 (38%), Positives = 122/211 (57%), Gaps = 31/211 (14%)

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           ++L CRY     P+ R+ PLK EE  L P ++++ DV+YD+EID                
Sbjct: 272 SRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEID---------------- 315

Query: 124 NYKTGELEIANYRISKSAWLREPE-------HPVIER-ISRRVEHMTGLTTSTAEELQVV 175
               G L  +N+ IS+S    + E       H V  + ++ RV  MTGL+   ++   ++
Sbjct: 316 ----GMLNSSNFGISESVSGLKSEVRTSKDSHIVDSKTLNERVTDMTGLSMEMSDPFSLI 371

Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
           NYG+GGH+  H+DF    E      L  G+R+ATVLFY+ +V  GGAT+F  LN+++ P+
Sbjct: 372 NYGLGGHFILHHDFH---EYTNTTRLKRGDRIATVLFYLGEVDSGGATIFPMLNITVTPK 428

Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           KG+A FW+NLH+SG  +  T H+ACPV++GS
Sbjct: 429 KGSAVFWYNLHNSGAVNSKTLHSACPVISGS 459


>gi|195591302|ref|XP_002085381.1| GD14757 [Drosophila simulans]
 gi|194197390|gb|EDX10966.1| GD14757 [Drosophila simulans]
          Length = 525

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 89/252 (35%), Positives = 138/252 (54%), Gaps = 13/252 (5%)

Query: 17  QEALNKSPELKDEPPKVNNVAPTLEVTERE-KYEMLCRGDLTVPPAIVAQLKCRYVHRNV 75
           +E  N   +L+D    V       +V  R   +E+ CRG       +V    CRY     
Sbjct: 259 EEMDNIMSDLRDPHSDVEVEKELYQVKRRSSNFELGCRGLYRQKTNLV----CRYKSTAN 314

Query: 76  PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
            +LRL PLK EE  L P I +Y +V+YDSEI  +K  +   +     +   T   EI + 
Sbjct: 315 TFLRLAPLKLEEISLDPFIAMYHEVLYDSEIHELKGQSMNMVNGYASERNGT---EIRD- 370

Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG-E 194
            +++  W       V ERI++R+  MT    S  E+LQ+ NYG+G +++PH+D++  G E
Sbjct: 371 TVARYDWWSNTS-LVRERINQRIIDMTEFNFSKDEKLQITNYGVGTYFQPHFDYSSDGFE 429

Query: 195 ANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYY 254
                +LG  +R+A++LFY S+V QGGATVF  +N++++P+KG+  +W NLH  G  D  
Sbjct: 430 TPNITTLG--DRLASILFYASEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIR 487

Query: 255 TRHAACPVLTGS 266
           ++H+ CPV+ G 
Sbjct: 488 SKHSVCPVINGD 499


>gi|194765184|ref|XP_001964707.1| GF22906 [Drosophila ananassae]
 gi|190614979|gb|EDV30503.1| GF22906 [Drosophila ananassae]
          Length = 708

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 86/244 (35%), Positives = 125/244 (51%), Gaps = 8/244 (3%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
            Y  LC+G      +    L C    R  PY  L PL+ E  +L P I +Y  ++   +I
Sbjct: 461 NYTRLCQGKKLPEESTGRPLSCYLDGRTNPYFVLAPLQVEPVHLDPDINVYHRMLSQQQI 520

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
           + I + A       +      G+  +A+ R+S+  WL     P+++ ISR ++ ++G   
Sbjct: 521 NSIFEEADKLTMYRSAVAGNAGKSTVADLRVSQQTWLNYTS-PIMKSISRIIQFVSGFDI 579

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
           + AE +QV NYG+GG YEPH D+        F+    G+R++T +FY+S+V QGG TVFT
Sbjct: 580 AGAEFMQVANYGVGGQYEPHPDYFEFNLPQQFQ----GDRISTSMFYLSNVEQGGYTVFT 635

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQRSGI 286
            LN+ L P +G    WHNLH S D D  T HA CPVL GS  + +     +  G Q    
Sbjct: 636 KLNVFLPPIQGAMVMWHNLHRSLDVDARTLHAGCPVLVGSKRIGNIW---MHSGFQEFRR 692

Query: 287 ICTL 290
            C L
Sbjct: 693 PCNL 696


>gi|198449518|ref|XP_002136915.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
 gi|198130643|gb|EDY67473.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
          Length = 543

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 90/269 (33%), Positives = 142/269 (52%), Gaps = 17/269 (6%)

Query: 11  GNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLC------RGDLTVPPAIVA 64
           G+K +  +A +     +  PP+ +        TE  K+  LC      + D +   +  A
Sbjct: 249 GDKTFGNKAYHIVSHFQKHPPQQSINMENGNFTE--KFNRLCSSMSRRKTDGSAAHSKPA 306

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
           +L CRY      +LRL PL+ EE  L P I+LY +V+ D E+  ++ M+ P L RA V +
Sbjct: 307 RLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARVFD 366

Query: 125 YKTGELEIANYRISKSAWLREP-----EHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
               + +I+  R +    +  P     +  ++ERI +R+  +TGL  ++   +Q + YG 
Sbjct: 367 SGIRKPKISPARTADEVQIPNPKLVAEDIQLVERIQKRMTDLTGLVLTSMRRIQFLKYGF 426

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GG Y PH+DF      +   S   G+R+ATV+FY++DV  GGAT F +L+L +  E+G  
Sbjct: 427 GGIYVPHHDFF---SVHTPTSRLHGDRIATVIFYLNDVEHGGATAFPNLDLVVPTERGAV 483

Query: 240 AFWHNLH-SSGDGDYYTRHAACPVLTGSN 267
            FWHN+   + D DY T H ACPV+ G+ 
Sbjct: 484 LFWHNMDGETYDLDYRTLHGACPVIVGTK 512


>gi|24651430|ref|NP_733378.1| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
 gi|23172699|gb|AAF57061.2| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
          Length = 542

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 6/222 (2%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C G   VP  + + L C Y H   P+L+L P+K E   + P ++L  D++   E  LI+ 
Sbjct: 293 CSGRCQVPRNL-SNLYCVYNHVTSPFLQLAPIKTEILSIDPFVVLLHDMISQKESTLIRT 351

Query: 112 MAQPRL--RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
            ++  +     T  +    E ++  YR SKS W     +   ++I+ R+   TGL  ++ 
Sbjct: 352 SSKEHMLPSATTDPDASDDETQVDTYRTSKSVWYSSDFNDTTKKITERLGDATGLDMNST 411

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           E  QV+NYG+GG +E H D     E N F   GT +R+AT LFY+++V QGG T F  LN
Sbjct: 412 EFYQVINYGLGGFFETHLDMLL-SEKNRFN--GTSDRIATTLFYLNEVRQGGGTYFPRLN 468

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS 271
           L+++P+ G+A FW+NL + G+    + H  CPV+ GS  + S
Sbjct: 469 LTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKWVMS 510


>gi|20269814|gb|AAM18062.1|AF495540_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE2
           [Drosophila melanogaster]
 gi|19528175|gb|AAL90202.1| AT27756p [Drosophila melanogaster]
          Length = 542

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 6/222 (2%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C G   VP  + + L C Y H   P+L+L P+K E   + P ++L  D++   E  LI+ 
Sbjct: 293 CSGRCQVPRNL-SNLYCVYNHVTSPFLQLAPIKTEILSIDPFVVLLHDMISQKESTLIRT 351

Query: 112 MAQPRL--RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
            ++  +     T  +    E ++  YR SKS W     +   ++I+ R+   TGL  ++ 
Sbjct: 352 SSKEHMLPSATTDPDASDDETQVDTYRTSKSVWYSSDFNDTTKKITERLGDATGLDMNST 411

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           E  QV+NYG+GG +E H D     E N F   GT +R+AT LFY+++V QGG T F  LN
Sbjct: 412 EFYQVINYGLGGFFETHLDMLL-SEKNRFN--GTSDRIATTLFYLNEVRQGGGTYFPRLN 468

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS 271
           L+++P+ G+A FW+NL + G+    + H  CPV+ GS  + S
Sbjct: 469 LTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKWVMS 510


>gi|195330778|ref|XP_002032080.1| GM23711 [Drosophila sechellia]
 gi|194121023|gb|EDW43066.1| GM23711 [Drosophila sechellia]
          Length = 490

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 81/237 (34%), Positives = 133/237 (56%), Gaps = 28/237 (11%)

Query: 36  VAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRII 95
           +  + EV   E+ E+  + + T      ++L CRY     P+ R+ PLK EE  L P ++
Sbjct: 245 IVASNEVIHFEE-ELATKQNCTAVVQKPSRLHCRYNTSTTPFTRIAPLKMEELSLDPYMV 303

Query: 96  LYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERIS 155
           ++ DV+YD+EID +       L  +      T   + +  R SK +++ + +      ++
Sbjct: 304 VFHDVVYDTEIDGM-------LNSSNFVLSLTDSGQKSEVRTSKDSYIVDAK-----SLN 351

Query: 156 RRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF------ARPGEANAFKSLGTGNRVAT 209
            RV  MTG +   ++   ++NYG+GGHY  HYDF       RP +         G+R+AT
Sbjct: 352 ERVTDMTGFSMEMSDPFSLINYGLGGHYMLHYDFHEYTNTTRPKQ---------GDRIAT 402

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           VLFY+ +V  GGAT+F  +N+++ P+KG+A FW+NLH+SG  +  + H+ACPV++GS
Sbjct: 403 VLFYLGEVDSGGATIFPKINIAVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGS 459


>gi|211938649|gb|ACJ13221.1| FI08532p [Drosophila melanogaster]
          Length = 543

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 6/222 (2%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C G   VP  + + L C Y H   P+L+L P+K E   + P ++L  D++   E  LI+ 
Sbjct: 294 CSGRCQVPRNL-SNLYCVYNHVTSPFLQLAPIKTEILSIDPFVVLLHDMISQKESTLIRT 352

Query: 112 MAQPRL--RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
            ++  +     T  +    E ++  YR SKS W     +   ++I+ R+   TGL  ++ 
Sbjct: 353 SSKEHMLPSATTDPDASDDETQVDTYRTSKSVWYSSDFNDTTKKITERLGDATGLDMNST 412

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           E  QV+NYG+GG +E H D     E N F   GT +R+AT LFY+++V QGG T F  LN
Sbjct: 413 EFYQVINYGLGGFFETHLDMLL-SEKNRFN--GTSDRIATTLFYLNEVRQGGGTYFPRLN 469

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS 271
           L+++P+ G+A FW+NL + G+    + H  CPV+ GS  + S
Sbjct: 470 LTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKWVMS 511


>gi|417402369|gb|JAA48034.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
          Length = 529

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 95/245 (38%), Positives = 137/245 (55%), Gaps = 13/245 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVN--NVAPTLEVTEREKYEMLCRGDLTVPPA 61
           P ++R   N L Y++ L +SP        +   NV P L+   R  YE LC+   + P  
Sbjct: 258 PDNKRMARNVLKYEKLLAESPSQAAAEAVIQRPNV-PHLQT--RATYEELCQTLGSQPTH 314

Query: 62  IV-AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
                L C Y     PYL L P+++E  +L+P ++LY D + D E   I+  A+P L+R+
Sbjct: 315 YQNPSLHCSYETGASPYLLLQPIRKEVVHLEPYVVLYHDFVNDLEAQKIRGFAEPWLQRS 374

Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
            V    +GE ++   YRISKSAWL++   P++  + RR+  +TGL T    AE LQVVNY
Sbjct: 375 VV---ASGEKQLPVEYRISKSAWLKDTVDPMLVTLDRRIAALTGLDTQPPYAEHLQVVNY 431

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKG 237
           GIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   N S+   K 
Sbjct: 432 GIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKC 490

Query: 238 TAAFW 242
           ++  W
Sbjct: 491 SSPRW 495


>gi|198459366|ref|XP_002138685.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
 gi|198136669|gb|EDY69243.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
          Length = 448

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 90/231 (38%), Positives = 125/231 (54%), Gaps = 11/231 (4%)

Query: 40  LEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRD 99
           L++   E Y   CRG L  PP     L C Y     P LRL P K E     P I +Y D
Sbjct: 204 LKIINFEHYVRGCRG-LFDPPK---GLSCHYDFHTHPVLRLAPFKVEPLSQDPYIAMYHD 259

Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVE 159
           V+YDSEI+ +K  A P + R+ V  Y   + +    R S SA+  + ++  + +++RRV 
Sbjct: 260 VIYDSEIEELKDNAFPDMERSKVYTYSDKDGKDTG-RTSMSAFQTDHQYTAVTKVNRRVM 318

Query: 160 HMTG---LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSD 216
           HMTG   L   +++EL V+NY     Y  H D+  P  +   +    G+R+ATVLFY++D
Sbjct: 319 HMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGPAYSEYIQR---GDRIATVLFYLND 375

Query: 217 VAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           V QGG TVF  L +   P KG+A  ++NL+SS  GD  T H  CPVL G+ 
Sbjct: 376 VEQGGKTVFPRLGIFRSPMKGSAVVFYNLNSSLQGDPRTEHGGCPVLVGTK 426


>gi|194871348|ref|XP_001972831.1| GG13664 [Drosophila erecta]
 gi|190654614|gb|EDV51857.1| GG13664 [Drosophila erecta]
          Length = 520

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 87/247 (35%), Positives = 137/247 (55%), Gaps = 23/247 (9%)

Query: 32  KVNNVAPTLEVTEREK-----------YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRL 80
           +++N+   L   E EK           +E+ CRG       +V    CRY      +LRL
Sbjct: 264 ELDNIVSELNDAEVEKELYQVKRSASNFEIGCRGLYRQRTNLV----CRYKSTANTFLRL 319

Query: 81  MPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKS 140
            PLK EE  L P I +Y +V+YDSEI  +K  +   +     Q   T   EI +  +++ 
Sbjct: 320 APLKFEEISLDPFIAVYHEVLYDSEIHALKGKSGNMVNGYARQRNGT---EIRD-TVARY 375

Query: 141 AWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG-EANAFK 199
            W  +      ERI++R+  MTG   +  E+LQ+ NYG+G ++EPH+D++  G E     
Sbjct: 376 DWWSDTS-LTRERINQRIIDMTGFNFTKDEKLQIANYGVGTYFEPHFDYSSDGFETPEVT 434

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAA 259
           +LG  +R+A+++FY  +V QGGATVF  +N++++P+KG+  +W NLH  G  D  ++H+A
Sbjct: 435 TLG--DRLASIIFYAGEVLQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSQHSA 492

Query: 260 CPVLTGS 266
           CPV+ G 
Sbjct: 493 CPVVNGD 499


>gi|281361323|ref|NP_652183.2| CG15864 [Drosophila melanogaster]
 gi|272476864|gb|AAF54202.3| CG15864 [Drosophila melanogaster]
          Length = 490

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 78/210 (37%), Positives = 122/210 (58%), Gaps = 19/210 (9%)

Query: 61  AIVAQ----LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
           A+V Q    L CRY     P+ R+ PLK EE  L P ++++ DV+YD+EID +       
Sbjct: 265 AVVVQKPSRLHCRYNTTTTPFTRIAPLKMEELGLDPYMVVFHDVIYDTEIDGM------- 317

Query: 117 LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVN 176
           L  +      T   + +  R SK +++ + +      ++ RV  MTG +   ++   ++N
Sbjct: 318 LNSSNFGLSLTDSGQKSEVRTSKDSYIVDAK-----TLNERVTDMTGFSMEMSDPFSLIN 372

Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
           YG+GGHY  HYDF         K    G+R+ATVLFY+ +V  GGAT+F  +N+++ P+K
Sbjct: 373 YGLGGHYMLHYDFHEYTNTTRPKQ---GDRIATVLFYLGEVDSGGATIFPMINITVTPKK 429

Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           G+A FW+NLH+SG  +  + H+ACPV++GS
Sbjct: 430 GSAVFWYNLHNSGAMNLKSLHSACPVISGS 459


>gi|195341558|ref|XP_002037373.1| GM12146 [Drosophila sechellia]
 gi|194131489|gb|EDW53532.1| GM12146 [Drosophila sechellia]
          Length = 485

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 90/265 (33%), Positives = 142/265 (53%), Gaps = 18/265 (6%)

Query: 16  YQEALNKSP-------ELKDEPPKVNNVAPTLEVTER-----EKYEM--LCRGDLTVPPA 61
           Y++AL +SP       E ++   +V  ++P+  + E      EK E+   C G    P  
Sbjct: 222 YEDALKQSPHDQEIFQEYQNLKRRVLTLSPSEPMREEPNDDIEKMELPPCCSGRCEGPRK 281

Query: 62  IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
           +  +L C Y     P+LRL P+K E   + P +IL+ D++  +E  LI+  ++ ++  + 
Sbjct: 282 L-KRLYCVYNCVTAPFLRLAPIKTEILSIDPFVILFHDMVSPTEGALIRSSSKNQILPSE 340

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
             N    E E+A +R SKS W     +    ++++R+   TGL    +E  QV+NYGIGG
Sbjct: 341 TVN-AANEFEVAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 399

Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
            +E H+D +   E       G  +R+AT LFY++DV QGGAT F  LN++++P+ GT   
Sbjct: 400 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 457

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
           W+NLH+ G     T H  CPV+ GS
Sbjct: 458 WYNLHTEGLLHVRTMHTGCPVIVGS 482


>gi|78706702|ref|NP_001027154.1| CG18749 [Drosophila melanogaster]
 gi|21429852|gb|AAM50604.1| GH05783p [Drosophila melanogaster]
 gi|23175900|gb|AAN14309.1| CG18749 [Drosophila melanogaster]
 gi|220956638|gb|ACL90862.1| CG18749-PB [synthetic construct]
          Length = 491

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 79/202 (39%), Positives = 120/202 (59%), Gaps = 15/202 (7%)

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
           +L CRY     P+ R+ PLK EE  L P ++++ DV+YD+EID +   +   L  + V  
Sbjct: 274 KLHCRYNTSTTPFTRIAPLKMEELGLDPYMVVFHDVIYDTEIDGMLNSSDFGLSES-VSG 332

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
            K      +  R SK + + + +      ++ RV  MTGL+   ++   ++NYG+GGH+ 
Sbjct: 333 LK------SEVRTSKDSHIVDAK-----TLNERVTDMTGLSMEMSDPFSLINYGLGGHFI 381

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
            H+DF    E      L  G+R+ATVLFY+ +V  GGATVF  LN+++ P+KG+A FW+N
Sbjct: 382 LHHDFH---EYTNTTRLKQGDRIATVLFYLREVDSGGATVFPMLNITVMPKKGSAVFWYN 438

Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
           LH+SG  +  T H ACPV++GS
Sbjct: 439 LHNSGAVNSKTLHTACPVISGS 460


>gi|195109817|ref|XP_001999478.1| GI23043 [Drosophila mojavensis]
 gi|193916072|gb|EDW14939.1| GI23043 [Drosophila mojavensis]
          Length = 491

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 77/216 (35%), Positives = 122/216 (56%), Gaps = 7/216 (3%)

Query: 51  LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
           +CRG   +P  +   L+CRY     P+LRL PLK E+  + P + L  + ++D+E++ I 
Sbjct: 258 ICRGQRQLP--VSDSLRCRYSAEGSPFLRLAPLKLEQLSIDPYVALCHNAIHDNELEYII 315

Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
           + ++P L+RA V      E      R++  A            + +R+E M+G   S + 
Sbjct: 316 EQSRPYLKRALVDQGVVHE-----KRVTMDAAFDLNASTHGRTLRQRLEDMSGFDLSNSG 370

Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
           +L V+NYGIGGHY  H+D     ++ A+++    NR+AT+L Y+++V  GG T F +L L
Sbjct: 371 QLAVLNYGIGGHYSMHFDCWFSSDSAAYEAYIRSNRIATILLYLNEVQMGGITSFPALGL 430

Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            + P KG+A  WHN++   + DY T HAACP L G+
Sbjct: 431 GVQPIKGSALIWHNMNHEIECDYRTLHAACPTLLGN 466


>gi|195352182|ref|XP_002042593.1| GM14980 [Drosophila sechellia]
 gi|194124477|gb|EDW46520.1| GM14980 [Drosophila sechellia]
          Length = 520

 Score =  147 bits (371), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 88/252 (34%), Positives = 136/252 (53%), Gaps = 13/252 (5%)

Query: 17  QEALNKSPELKDEPPKVNNVAPTLEVTERE-KYEMLCRGDLTVPPAIVAQLKCRYVHRNV 75
           +E  N   +L+D    V       +V  R   +E+ CRG       +V    CR+     
Sbjct: 259 EEVDNIMSDLRDPHNDVEVEKELYQVKRRSSNFELGCRGLYRQKTNLV----CRFKSTAN 314

Query: 76  PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
            +LRL PLK EE  L P I +Y +V+YDSEI  +K  +   +     +   T   EI + 
Sbjct: 315 TFLRLAPLKLEEISLDPFIAMYHEVLYDSEIHELKGQSMNMVNGYASERNGT---EIRDT 371

Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG-E 194
            +    W       V ERI++R+  MT    S  E+LQ+ NYG+G +++PH+D++  G E
Sbjct: 372 VVRYDWW--SNISLVRERINQRIIDMTEFNFSKDEKLQIANYGVGTYFQPHFDYSSDGFE 429

Query: 195 ANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYY 254
                +LG  +R+A++LFY S+V QGGATVF  +N++++P+KG+  +W NLH  G  D  
Sbjct: 430 TPNITTLG--DRLASILFYASEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGRPDIR 487

Query: 255 TRHAACPVLTGS 266
           ++H+ CPV+ G 
Sbjct: 488 SKHSVCPVINGD 499


>gi|195438148|ref|XP_002066999.1| GK24258 [Drosophila willistoni]
 gi|194163084|gb|EDW77985.1| GK24258 [Drosophila willistoni]
          Length = 217

 Score =  147 bits (371), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 78/218 (35%), Positives = 119/218 (54%), Gaps = 7/218 (3%)

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           E+ CRG L  P      L C Y     P+LRL P K EE  L P I+L+ + +YD+EI  
Sbjct: 2   ELGCRGHLKAPSN--RNLFCSYNSTTTPFLRLAPFKTEEISLDPFILLFHNAIYDNEISY 59

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
             K+ +  +R A   NY T   +   YRI +          + + +  RV+ ++GL+   
Sbjct: 60  FTKVKRKDMREAHTDNYTTPNEQ---YRIMQVKVYEGIGDKMDKTLLERVKDISGLSAGN 116

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
             EL   NYG+G ++  H D+     +       TG+R+AT+LFY+SDVAQGG T+F   
Sbjct: 117 KSELAAGNYGLGSYFPEHSDYRDIKVSPELNE--TGDRLATILFYLSDVAQGGHTIFPLA 174

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           N+++ P+KG+A FW NLH+ G+ +  + H  CP++ G+
Sbjct: 175 NVTVQPKKGSALFWFNLHNDGEPNIKSLHGVCPIIEGN 212


>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
 gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
          Length = 498

 Score =  147 bits (371), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 84/220 (38%), Positives = 124/220 (56%), Gaps = 15/220 (6%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           Y+  C G    P    + L CRY      + R+ PLK EE    P ++L+ DV+Y+SEID
Sbjct: 263 YKRGCNGVFRAP----SYLHCRYNSTTTAFARIAPLKMEELSHDPYMVLFHDVVYESEID 318

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE-PEHPVIERISRRVEHMTGLTT 166
            +    Q  L+ + V     G+ + +  R SK     E  +  V++ + RR+  MTGL  
Sbjct: 319 FLLNATQ--LKASLV-----GQYQYSPVRTSKEQHFVEYNDTAVVKTLHRRLNDMTGLDM 371

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
             ++ L ++NYG+GGHY+ HYD     EAN    L  G+R+ATVLFY+ +V  GGAT F 
Sbjct: 372 IESDALTLINYGMGGHYDVHYDSHNYSEAN---RLILGDRIATVLFYVGEVDSGGATTFP 428

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            +N+S+ P+KG+A  W+NL ++G  +    HA CPV+ GS
Sbjct: 429 YINVSVTPKKGSAVLWYNLDNAGQMNPKAIHAGCPVIVGS 468


>gi|66770649|gb|AAY54636.1| IP12415p [Drosophila melanogaster]
 gi|66772017|gb|AAY55320.1| IP12615p [Drosophila melanogaster]
          Length = 512

 Score =  147 bits (371), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 91/253 (35%), Positives = 134/253 (52%), Gaps = 14/253 (5%)

Query: 17  QEALNKSPELKDEPPKVNNVAPTLEVTERE--KYEMLCRGDLTVPPAIVAQLKCRYVHRN 74
           QE L+      +EP     V   L   +R     E+ CRG       +V    CRY    
Sbjct: 250 QEELDNIMSDLNEPQNDVEVEKDLYQVKRSPSNCELGCRGLYRQKTNLV----CRYKSTA 305

Query: 75  VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIAN 134
             +LRL PLK EE  L P + +Y +V+YDSEI  +K  +   +     Q   T   EI +
Sbjct: 306 NTFLRLAPLKLEEISLDPFMAMYHEVLYDSEIRELKGQSMNMVNGYASQRNGT---EIRD 362

Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG- 193
             +    W       V ERI++R+  MTG      E+LQ+ NYG+G +++PH+D++  G 
Sbjct: 363 TVVRYDWW--SNTSLVRERINQRIIDMTGFNFLKDEKLQIANYGLGTYFQPHFDYSSDGF 420

Query: 194 EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
           E     +LG  +R+A++LFY S+V QGGATVF  +N++++P+KG+  +W NLH  G  D 
Sbjct: 421 ETPNITTLG--DRLASILFYASEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDI 478

Query: 254 YTRHAACPVLTGS 266
            + H+ CPVL G 
Sbjct: 479 RSLHSVCPVLNGD 491


>gi|195575111|ref|XP_002105523.1| GD16991 [Drosophila simulans]
 gi|194201450|gb|EDX15026.1| GD16991 [Drosophila simulans]
          Length = 542

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 123/222 (55%), Gaps = 6/222 (2%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C G   VP  + + L C Y H   P+L+L P+K E   + P ++L  D++   E  LI+ 
Sbjct: 293 CSGRCAVPRNL-SSLYCVYNHVTSPFLQLAPIKTEILSVDPFVLLLHDMISQKESTLIRN 351

Query: 112 MAQPRL--RRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
            ++  +     T  +    E ++  YR SKS W     +   ++I+ R+   TGL T+  
Sbjct: 352 SSKEHMLPSATTDPDSSDTETQVDTYRTSKSVWYSSDFNDTTKKITERLGDATGLDTNFT 411

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           E  QV+NYG+GG +E H D     E N F   GT +R+AT LFY+++V QGG T F  +N
Sbjct: 412 EFYQVINYGLGGFFETHLDMLL-SEKNRFN--GTRDRIATTLFYLNEVRQGGGTYFPRIN 468

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS 271
           L+++P+ G+A FW+NL ++G+    + H  CPV+ GS  + S
Sbjct: 469 LTVFPQPGSALFWYNLDTNGNDHMGSLHTGCPVIVGSKWVMS 510


>gi|221512818|ref|NP_730346.2| CG32201 [Drosophila melanogaster]
 gi|220902638|gb|AAN11679.2| CG32201 [Drosophila melanogaster]
          Length = 520

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 91/253 (35%), Positives = 134/253 (52%), Gaps = 14/253 (5%)

Query: 17  QEALNKSPELKDEPPKVNNVAPTLEVTERE--KYEMLCRGDLTVPPAIVAQLKCRYVHRN 74
           QE L+      +EP     V   L   +R     E+ CRG       +V    CRY    
Sbjct: 258 QEELDNIMSDLNEPQNDVEVEKDLYQVKRSPSNCELGCRGLYRQKTNLV----CRYKSTA 313

Query: 75  VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIAN 134
             +LRL PLK EE  L P + +Y +V+YDSEI  +K  +   +     Q   T   EI +
Sbjct: 314 NTFLRLAPLKLEEISLDPFMAMYHEVLYDSEIRELKGQSMNMVNGYASQRNGT---EIRD 370

Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG- 193
             +    W       V ERI++R+  MTG      E+LQ+ NYG+G +++PH+D++  G 
Sbjct: 371 TVVRYDWW--SNTSLVRERINQRIIDMTGFNFLKDEKLQIANYGLGTYFQPHFDYSSDGF 428

Query: 194 EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
           E     +LG  +R+A++LFY S+V QGGATVF  +N++++P+KG+  +W NLH  G  D 
Sbjct: 429 ETPNITTLG--DRLASILFYASEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDI 486

Query: 254 YTRHAACPVLTGS 266
            + H+ CPVL G 
Sbjct: 487 RSLHSVCPVLNGD 499


>gi|66771935|gb|AAY55279.1| IP12715p [Drosophila melanogaster]
          Length = 451

 Score =  147 bits (370), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 91/253 (35%), Positives = 134/253 (52%), Gaps = 14/253 (5%)

Query: 17  QEALNKSPELKDEPPKVNNVAPTLEVTERE--KYEMLCRGDLTVPPAIVAQLKCRYVHRN 74
           QE L+      +EP     V   L   +R     E+ CRG       +V    CRY    
Sbjct: 189 QEELDNIMSDLNEPQNDVEVEKDLYQVKRSPSNCELGCRGLYRQKTNLV----CRYKSTA 244

Query: 75  VPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIAN 134
             +LRL PLK EE  L P + +Y +V+YDSEI  +K  +   +     Q   T   EI +
Sbjct: 245 NTFLRLAPLKLEEISLDPFMAMYHEVLYDSEIRELKGQSMNMVNGYASQRNGT---EIRD 301

Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG- 193
             +    W       V ERI++R+  MTG      E+LQ+ NYG+G +++PH+D++  G 
Sbjct: 302 TVVRYDWW--SNTSLVRERINQRIIDMTGFNFLKDEKLQIANYGLGTYFQPHFDYSSDGF 359

Query: 194 EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
           E     +LG  +R+A++LFY S+V QGGATVF  +N++++P+KG+  +W NLH  G  D 
Sbjct: 360 ETPNITTLG--DRLASILFYASEVPQGGATVFPEINVTVFPQKGSMLYWFNLHDDGKPDI 417

Query: 254 YTRHAACPVLTGS 266
            + H+ CPVL G 
Sbjct: 418 RSLHSVCPVLNGD 430


>gi|195145084|ref|XP_002013526.1| GL24185 [Drosophila persimilis]
 gi|194102469|gb|EDW24512.1| GL24185 [Drosophila persimilis]
          Length = 229

 Score =  147 bits (370), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 81/202 (40%), Positives = 118/202 (58%), Gaps = 11/202 (5%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
           L CRY      + R+ PLK EE    P ++L+ DV+Y+SEID +    Q  L+ + V   
Sbjct: 8   LHCRYNSTTTAFARIAPLKMEELSHDPYMVLFHDVVYESEIDFLLNATQ--LKASLV--- 62

Query: 126 KTGELEIANYRISKSAWLRE-PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
             G+ + +  R SK     E  +  V++ + RR+  MTGL    ++ L ++NYG+GGHY+
Sbjct: 63  --GQYQYSPVRTSKEQHFVEYNDTAVVKTLHRRLNDMTGLDMIESDTLTLINYGMGGHYD 120

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
            HYD     EAN    L  G+R+ATVLFY+ +V  GGAT F  +N+S+ P+KG+A  W+N
Sbjct: 121 VHYDSHNYSEAN---RLILGDRIATVLFYVGEVDSGGATTFPYINVSVTPKKGSAVLWYN 177

Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
           L +SG  +    HA CPV+ GS
Sbjct: 178 LDNSGQMNPKAIHAGCPVIVGS 199


>gi|195452770|ref|XP_002073492.1| GK14148 [Drosophila willistoni]
 gi|194169577|gb|EDW84478.1| GK14148 [Drosophila willistoni]
          Length = 444

 Score =  147 bits (370), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 84/229 (36%), Positives = 120/229 (52%), Gaps = 18/229 (7%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           E +  +C+   +  P     L CRY     P+LRL P + EE  L P ++ Y +V+ D E
Sbjct: 220 EGFNAICQS--SHKPKPTKHLYCRYNTTTTPFLRLAPFRMEELSLNPYMVAYHNVLSDEE 277

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIA-NYRISKSAWLREPEHP-------VIERISRR 157
           I  + +M+ P L++A    +    ++I  + R   +AW    E P       +I+RI   
Sbjct: 278 IRQLNRMSAPLLKKA----FPVSAVDIDYDVRTVDTAWFPNSETPHTKENDRLIKRIVNI 333

Query: 158 VEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDV 217
           V  +TGL    A+  Q V YG GGHY PH+D+      +  ++   G+R+ATVLFY++ V
Sbjct: 334 VSDLTGLNADVADSFQAVRYGFGGHYSPHHDYFN---ESIHQTAVNGDRLATVLFYLNTV 390

Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTG 265
             GGATVF  LNL +  EKG   FW+NL   S D D  T H  CPV+ G
Sbjct: 391 KHGGATVFPLLNLKVPAEKGKVLFWYNLDGESLDFDENTEHGVCPVVDG 439


>gi|194751823|ref|XP_001958223.1| GF23631 [Drosophila ananassae]
 gi|190625505|gb|EDV41029.1| GF23631 [Drosophila ananassae]
          Length = 502

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 87/224 (38%), Positives = 122/224 (54%), Gaps = 10/224 (4%)

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           E+ CRG     P+    L CRYV     +L+L PLK E   +QP I+LY DV+Y+ E   
Sbjct: 262 ELGCRGKWPKKPS--PTLTCRYVRETHDFLKLAPLKMEFLNMQPLIVLYHDVLYEGEFKS 319

Query: 109 IKKMAQPRLRRATVQNY----KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           ++ +A           Y    K G+ +  + R+ K    +        RI+RR+  MTGL
Sbjct: 320 MRDIAIFNATMGDGWTYVDFDKKGKPKRQD-RVVKMITFQGTTAEFTLRINRRIADMTGL 378

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEA--NAFKSLGTGNRVATVLFYMSDVAQGGA 222
             +    L + NYG+GGH+  H D+    +   N F  LG G+R+AT L Y SDV  GG 
Sbjct: 379 EMNENMALHLTNYGLGGHFGKHVDYVELAKRPPNFFGDLG-GDRIATALLYASDVPLGGT 437

Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           TVFT L LS+ P+KG+A  W NL+++GD D  + H+ACPV+ GS
Sbjct: 438 TVFTKLKLSIEPKKGSALIWFNLNNAGDPDPMSEHSACPVVLGS 481


>gi|195341556|ref|XP_002037372.1| GM12148 [Drosophila sechellia]
 gi|194131488|gb|EDW53531.1| GM12148 [Drosophila sechellia]
          Length = 542

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 91/273 (33%), Positives = 140/273 (51%), Gaps = 22/273 (8%)

Query: 16  YQEALNKSPE----------LKDEPPKVNNVAPTLEVTEREKYEML-----CRGDLTVPP 60
           YQ AL  SP           L+     ++++ P +E  E   +E L     C G   VP 
Sbjct: 243 YQVALKLSPHDPEIYEEYRILEKRDLTLSDIEP-MEQDEDNSHERLVLPPCCSGRCAVPR 301

Query: 61  AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL--R 118
            + + L C Y H   P+L+L P+K E   + P ++L  D++   E  LI+  ++  +   
Sbjct: 302 NLNS-LYCVYNHVTSPFLQLAPIKTEILSVDPFVVLLHDMISQKESTLIRNSSKEHMLPS 360

Query: 119 RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYG 178
             T  +    E ++  YR SKS W     +   ++I+ R+   TGL  +  E  QV+NYG
Sbjct: 361 ATTDPDASDTETQVDTYRTSKSVWYSSDFNDTTKKITERLGDATGLDMNFTEFYQVINYG 420

Query: 179 IGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGT 238
           +GG +E H D     E N F   GT +R+AT LFY+++V QGG T F  LNL+++P+ G+
Sbjct: 421 LGGFFETHLDMLL-SEKNRFN--GTRDRIATTLFYLNEVRQGGGTYFPRLNLTVFPQPGS 477

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS 271
           A FW+NL + G+    + H  CPV+ GS  + S
Sbjct: 478 ALFWYNLDTKGNDHMDSLHTGCPVIVGSKWVMS 510


>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
 gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
          Length = 455

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 90/231 (38%), Positives = 126/231 (54%), Gaps = 11/231 (4%)

Query: 40  LEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRD 99
           L++   E Y   CRG L  PP     L C Y     P LRL P K E     P I +Y D
Sbjct: 211 LKMINFEHYVRGCRG-LFDPPK---GLSCHYDFHTHPVLRLAPFKVEPLSQDPYIAMYHD 266

Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVE 159
           V+YDSEI+ +K  A P + R+ V  Y + E      R S SA+  + ++  + +++RRV 
Sbjct: 267 VIYDSEIEELKDNAFPDMERSKVYTY-SDEDSKNTGRTSMSAFQTDHQYKAVTKVNRRVM 325

Query: 160 HMTG---LTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSD 216
           HMTG   L   +++EL V+NY     Y  H D+  P  +   + +  G+R+ATVLFY++D
Sbjct: 326 HMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGPAYS---EYIQRGDRIATVLFYLND 382

Query: 217 VAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           V QGG TVF  L +   P KG+A  ++N++SS  GD  T H  CPVL G+ 
Sbjct: 383 VEQGGKTVFPRLGIFRSPMKGSAVVFYNMNSSLQGDPRTEHGGCPVLVGTK 433


>gi|195145080|ref|XP_002013524.1| GL24183 [Drosophila persimilis]
 gi|194102467|gb|EDW24510.1| GL24183 [Drosophila persimilis]
          Length = 296

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 81/209 (38%), Positives = 123/209 (58%), Gaps = 9/209 (4%)

Query: 62  IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
           ++  L CRY  +   + RL PLK E     P +++Y DV+YD+E+  +    + R+ R+ 
Sbjct: 52  MIKNLHCRYHKKGSAFSRLAPLKLEIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRSM 111

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHP-VIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
           VQ Y+  ++EI+  R SK A   E   P +++RI  R++ MTG     +E L ++ Y  G
Sbjct: 112 VQ-YEIRQIEISEQRTSKEAPFTEKNDPQLLKRIYDRLKDMTGCDMLRSEHLSILLYDQG 170

Query: 181 GHYEPHYDFA----RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
           GH++PH D+      P E   ++    G+R A+V+FY++DV  GG TVF  L L + P K
Sbjct: 171 GHHDPHVDYHDLYWHPQE---YEYHPFGDRQASVVFYLNDVEDGGETVFPKLQLVIPPTK 227

Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           G+A  WHNL   G+GD  T+HA+CPVL+G
Sbjct: 228 GSALMWHNLRPWGEGDPRTQHASCPVLSG 256


>gi|328718387|ref|XP_001952104.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
           pisum]
          Length = 293

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 89/221 (40%), Positives = 125/221 (56%), Gaps = 13/221 (5%)

Query: 51  LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
           LC+    V   +    KCRY   N+ Y  LMP KEE+   +P I +Y DV+YD EI  IK
Sbjct: 59  LCKH--GVSRTLTKYSKCRYQTNNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIK 116

Query: 111 KMAQPRLRRATVQNY--KTGELEIANYRISKSAWLREPEH-PVIERISRRVEHMTGLTTS 167
            +A   +  A V++   K   LE    R  +  W+ E +     + ++ R+E  TG +T 
Sbjct: 117 TLALENMNDAHVKSVDGKDDVLE-EKTRSGQVYWISEVDAVEYFDALNTRIESFTGFSTK 175

Query: 168 TAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
           TAE+ Q+VNYG+GGHY PH+D FA+  E   F     GNR+ TVLFY++DV   G T F 
Sbjct: 176 TAEQYQIVNYGLGGHYLPHHDSFAKGTENVEF-----GNRLVTVLFYLTDVQNDGYTSFP 230

Query: 227 SLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
            LN++   +KG A  W+NLH S+G   Y + H +CP+L G+
Sbjct: 231 LLNINAPVDKGAALVWNNLHMSNGQLFYESLHGSCPLLKGN 271


>gi|198452400|ref|XP_002137470.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
 gi|198131917|gb|EDY68028.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
          Length = 348

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 85/225 (37%), Positives = 127/225 (56%), Gaps = 5/225 (2%)

Query: 42  VTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVM 101
           V E+  Y   CRG      +    L CRY ++   + RL PLK E     P +++Y DV+
Sbjct: 101 VLEQRPYFDGCRGAFPTK-SHHHSLHCRYHNKGSAFSRLAPLKLEIFSHDPYVVIYHDVL 159

Query: 102 YDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHP-VIERISRRVEH 160
           YD+E+  +    + R+ R+ VQ Y+  ++EI+  R SK A   E   P +++RI  R++ 
Sbjct: 160 YDAEMQGLIDSTRRRMSRSMVQ-YEIRQIEISEQRTSKEAPFTEKNDPQLLKRIYDRLKD 218

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
           MTG     +E L ++ Y  GGH++PH D+        +   G  +R A+V+FY++DV  G
Sbjct: 219 MTGCDMLRSEHLSILLYDQGGHHDPHVDYHDLYWEYEYHPFG--DRQASVVFYLNDVEDG 276

Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           G TVF  L L + P KG+A  WHNL   G+GD  T+HA+CPVL+G
Sbjct: 277 GETVFPKLQLVIPPTKGSALMWHNLRPWGEGDPRTQHASCPVLSG 321


>gi|194765182|ref|XP_001964706.1| GF22908 [Drosophila ananassae]
 gi|190614978|gb|EDV30502.1| GF22908 [Drosophila ananassae]
          Length = 509

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 84/227 (37%), Positives = 124/227 (54%), Gaps = 13/227 (5%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
            Y  LC+G   +P      LKC    +   +  L PLK E+ +L P I +Y  V+   +I
Sbjct: 262 NYSRLCQGK-RLPEKQDNILKCYLDGKRHAFFTLAPLKVEQVHLDPDITVYHGVLSSKQI 320

Query: 107 DLIKKMAQPRLR-RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
             I   +  + R R+ V      +  + + R+S+  WL     P ++ ++R  E++ GLT
Sbjct: 321 SSIFTESNKKERIRSGVAGENGEDRTVKDIRVSQQTWLNYST-PTMQYVNRINEYICGLT 379

Query: 166 TSTAEELQVVNYGIGGHYEPH---YDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
              AEE+QV NYG+GG YEPH   ++F  P + +       G+R++T +FY+S+V QGG 
Sbjct: 380 MRGAEEMQVANYGVGGQYEPHPDYFEFDLPPDFD-------GDRISTSMFYLSNVQQGGY 432

Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           TVF +LN+ L P KG+   WHNLH S D D  T HA CPV+ GS  +
Sbjct: 433 TVFPNLNVFLPPVKGSMVLWHNLHYSLDVDARTWHAGCPVIVGSKKI 479


>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
 gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 83/219 (37%), Positives = 120/219 (54%), Gaps = 12/219 (5%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           YE+ CRG       +V    CRY     P+LRL PLK EE    P I+LY +V+YD EI+
Sbjct: 296 YEIGCRGLFPKRTNLV----CRYNFTTTPFLRLAPLKMEEVNHDPYIVLYHEVLYDREIE 351

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
            +KK ++  +   +    +    EI    I++ AW  E +     RI +R+  +TG    
Sbjct: 352 ELKKQSKNMINGFSEPQQENKIREI----IARHAWWWE-QTTTRARIYQRITDITGFQLF 406

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
             EEL V NYG+G  + PHYD+      N       G  + T+LFY+SD+ QGGAT+F S
Sbjct: 407 VQEELNVANYGLGTIFGPHYDYT---PENYDIGWFMGGPLGTILFYVSDLQQGGATIFPS 463

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +N+++ P KG+A  W NL+  G+ D  T H++CPV+ G 
Sbjct: 464 INITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGD 502


>gi|403298096|ref|XP_003939871.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saimiri
           boliviensis boliviensis]
          Length = 412

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 81/188 (43%), Positives = 118/188 (62%), Gaps = 13/188 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSPEL----------KDEPPKVNNVAPTLEVTEREKYEMLCR 53
           P HQRA GN  Y++  + K  ++          +   PK   +A    + ER+KYEMLCR
Sbjct: 211 PEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKKGIAVDY-LPERQKYEMLCR 269

Query: 54  GD-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           G+ + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K 
Sbjct: 270 GEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 329

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +A+PRL RATV + +TG+L  A YR+SKSAWL   E+PV+ RI+ R++ +TGL  STAEE
Sbjct: 330 LAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEE 389

Query: 172 LQVVNYGI 179
           LQV N+ I
Sbjct: 390 LQVGNHII 397


>gi|355752458|gb|EHH56578.1| hypothetical protein EGM_06023, partial [Macaca fascicularis]
          Length = 586

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 93/235 (39%), Positives = 136/235 (57%), Gaps = 13/235 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
           P ++R   N L Y+  L +SP ++  E        P L+   R+ YE LC+  L   P +
Sbjct: 245 PDNKRMARNVLKYERLLAESPNQVVAEAVIQRPNIPHLQT--RDTYEGLCQ-TLGSQPTL 301

Query: 63  --VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
             +  L C Y   +  YL L P+++E  +L+P I LY D + DSE   I++ A+P L+R+
Sbjct: 302 YQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRS 361

Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
            V    +GE ++   YRISKSAWL++   P++  ++ R+  +TGL      AE LQVVNY
Sbjct: 362 VV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNY 418

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           GIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   NLS+
Sbjct: 419 GIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSV 472


>gi|66771513|gb|AAY55068.1| IP12095p [Drosophila melanogaster]
          Length = 538

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 91/265 (34%), Positives = 138/265 (52%), Gaps = 18/265 (6%)

Query: 16  YQEALNKSP-------ELKDEPPKVNNVAPTLEVTER-----EKYEM--LCRGDLTVPPA 61
           YQ AL  SP       E ++   +V  ++P+  + E      E+ E+   C G    P  
Sbjct: 241 YQAALKHSPHDLEIFQEYQNLKRRVLTLSPSEPIREEPNDDIEEMELPPCCSGRCEGPRK 300

Query: 62  IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
           +  +L C Y     P+LRL P+K E   + P +IL  D++   E  LI+  ++ ++  + 
Sbjct: 301 L-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE 359

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
             N    E EIA +R SKS W     +    ++++R+   TGL    +E  QV+NYGIGG
Sbjct: 360 TVN-AANEFEIAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 418

Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
            +E H+D +   E       G  +R+AT LFY++DV QGGAT F  LN++++P+ GT   
Sbjct: 419 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 476

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
           W+NLH+ G     T H  CPV+ GS
Sbjct: 477 WYNLHTEGMLHVRTMHTGCPVIVGS 501


>gi|195159160|ref|XP_002020450.1| GL13507 [Drosophila persimilis]
 gi|194117219|gb|EDW39262.1| GL13507 [Drosophila persimilis]
          Length = 543

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 89/269 (33%), Positives = 140/269 (52%), Gaps = 17/269 (6%)

Query: 11  GNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLC------RGDLTVPPAIVA 64
           G+K +  +A +     +  PP+ +        TE  K+  LC      + D +   +  A
Sbjct: 249 GDKTFGDKAYHIVSHFQKHPPQQSINMENGNFTE--KFNRLCSSMSRRKTDGSAAHSKPA 306

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
           +L CRY      +LRL PL+ EE  L P I+LY  V+ D E+  ++ M+ P L RA V +
Sbjct: 307 RLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHSVLSDEEMARLENMSTPLLHRARVFD 366

Query: 125 YKTGELEIANYRISKSAWLREP-----EHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
               + +I+  R +    +  P     +  ++E I +R+  +TGL  ++   +Q + YG 
Sbjct: 367 SGIRKPKISPARTADEVQIPNPKLVAEDIQLVECIQKRITDLTGLMLTSMRRIQFLKYGF 426

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GG Y PH+DF      +   S   G+R+ATV+FY++DV  GGAT F +L+L +  E+G  
Sbjct: 427 GGIYVPHHDFF---SVHTPTSRLHGDRIATVIFYLNDVEHGGATAFPNLDLVVPTERGAV 483

Query: 240 AFWHNLH-SSGDGDYYTRHAACPVLTGSN 267
            FWHN+   + D DY T H ACPV+ G+ 
Sbjct: 484 LFWHNMDGETYDLDYRTLHGACPVIVGTK 512


>gi|355566863|gb|EHH23242.1| hypothetical protein EGK_06672, partial [Macaca mulatta]
          Length = 583

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 93/235 (39%), Positives = 136/235 (57%), Gaps = 13/235 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAI 62
           P ++R   N L Y+  L +SP ++  E        P L+   R+ YE LC+  L   P +
Sbjct: 242 PDNKRMARNVLKYERLLAESPNQVVAEAVIQRPNIPHLQT--RDTYEGLCQ-TLGSQPTL 298

Query: 63  --VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
             +  L C Y   +  YL L P+++E  +L+P I LY D + DSE   I++ A+P L+R+
Sbjct: 299 YQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRS 358

Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
            V    +GE ++   YRISKSAWL++   P++  ++ R+  +TGL      AE LQVVNY
Sbjct: 359 VV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNY 415

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           GIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   NLS+
Sbjct: 416 GIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSV 469


>gi|261245137|gb|ACX54875.1| FI12021p [Drosophila melanogaster]
          Length = 538

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 91/265 (34%), Positives = 138/265 (52%), Gaps = 18/265 (6%)

Query: 16  YQEALNKSP-------ELKDEPPKVNNVAPTLEVTER-----EKYEM--LCRGDLTVPPA 61
           YQ AL  SP       E ++   +V  ++P+  + E      E+ E+   C G    P  
Sbjct: 241 YQAALKHSPHDLEIFQEYQNLKRRVLTLSPSEPIREEPNDDIEEMELPPCCSGRCEGPRK 300

Query: 62  IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
           +  +L C Y     P+LRL P+K E   + P +IL  D++   E  LI+  ++ ++  + 
Sbjct: 301 L-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE 359

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
             N    E EIA +R SKS W     +    ++++R+   TGL    +E  QV+NYGIGG
Sbjct: 360 TVN-AANEFEIAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 418

Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
            +E H+D +   E       G  +R+AT LFY++DV QGGAT F  LN++++P+ GT   
Sbjct: 419 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 476

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
           W+NLH+ G     T H  CPV+ GS
Sbjct: 477 WYNLHTEGMLHVRTMHTGCPVIVGS 501


>gi|116008537|ref|NP_733379.2| CG31524, isoform A [Drosophila melanogaster]
 gi|113194861|gb|AAN14239.2| CG31524, isoform A [Drosophila melanogaster]
          Length = 536

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 91/265 (34%), Positives = 138/265 (52%), Gaps = 18/265 (6%)

Query: 16  YQEALNKSP-------ELKDEPPKVNNVAPTLEVTER-----EKYEM--LCRGDLTVPPA 61
           YQ AL  SP       E ++   +V  ++P+  + E      E+ E+   C G    P  
Sbjct: 239 YQAALKHSPHDLEIFQEYQNLKRRVLTLSPSEPIREEPNDDIEEMELPPCCSGRCEGPRK 298

Query: 62  IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
           +  +L C Y     P+LRL P+K E   + P +IL  D++   E  LI+  ++ ++  + 
Sbjct: 299 L-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE 357

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
             N    E EIA +R SKS W     +    ++++R+   TGL    +E  QV+NYGIGG
Sbjct: 358 TVN-AANEFEIAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 416

Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
            +E H+D +   E       G  +R+AT LFY++DV QGGAT F  LN++++P+ GT   
Sbjct: 417 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 474

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
           W+NLH+ G     T H  CPV+ GS
Sbjct: 475 WYNLHTEGMLHVRTMHTGCPVIVGS 499


>gi|66770643|gb|AAY54633.1| IP12395p [Drosophila melanogaster]
          Length = 538

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 91/265 (34%), Positives = 138/265 (52%), Gaps = 18/265 (6%)

Query: 16  YQEALNKSP-------ELKDEPPKVNNVAPTLEVTER-----EKYEM--LCRGDLTVPPA 61
           YQ AL  SP       E ++   +V  ++P+  + E      E+ E+   C G    P  
Sbjct: 241 YQAALKHSPHDLEIFQEYQNLKRRVLTLSPSEPIREEPNDDIEEMELPPCCSGRCEGPRK 300

Query: 62  IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
           +  +L C Y     P+LRL P+K E   + P +IL  D++   E  LI+  ++ ++  + 
Sbjct: 301 L-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE 359

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
             N    E EIA +R SKS W     +    ++++R+   TGL    +E  QV+NYGIGG
Sbjct: 360 TVN-AANEFEIAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 418

Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
            +E H+D +   E       G  +R+AT LFY++DV QGGAT F  LN++++P+ GT   
Sbjct: 419 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 476

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
           W+NLH+ G     T H  CPV+ GS
Sbjct: 477 WYNLHTEGMLHVRTMHTGCPVIVGS 501


>gi|116008130|ref|NP_001036777.1| CG31524, isoform B [Drosophila melanogaster]
 gi|113194860|gb|ABI31221.1| CG31524, isoform B [Drosophila melanogaster]
          Length = 535

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 91/265 (34%), Positives = 138/265 (52%), Gaps = 18/265 (6%)

Query: 16  YQEALNKSP-------ELKDEPPKVNNVAPTLEVTER-----EKYEM--LCRGDLTVPPA 61
           YQ AL  SP       E ++   +V  ++P+  + E      E+ E+   C G    P  
Sbjct: 238 YQAALKHSPHDLEIFQEYQNLKRRVLTLSPSEPIREEPNDDIEEMELPPCCSGRCEGPRK 297

Query: 62  IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
           +  +L C Y     P+LRL P+K E   + P +IL  D++   E  LI+  ++ ++  + 
Sbjct: 298 L-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE 356

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
             N    E EIA +R SKS W     +    ++++R+   TGL    +E  QV+NYGIGG
Sbjct: 357 TVN-AANEFEIAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 415

Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
            +E H+D +   E       G  +R+AT LFY++DV QGGAT F  LN++++P+ GT   
Sbjct: 416 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 473

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
           W+NLH+ G     T H  CPV+ GS
Sbjct: 474 WYNLHTEGMLHVRTMHTGCPVIVGS 498


>gi|195128343|ref|XP_002008623.1| GI13594 [Drosophila mojavensis]
 gi|193920232|gb|EDW19099.1| GI13594 [Drosophila mojavensis]
          Length = 511

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 82/229 (35%), Positives = 126/229 (55%), Gaps = 22/229 (9%)

Query: 45  REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           +E Y + CRG    P      L CRY     P+LRL P K EE  L P I+LY +V+ D 
Sbjct: 274 QEPYYLGCRG--GYPKR--TNLHCRYNTTTTPFLRLAPFKMEEVSLDPYIVLYHNVISDR 329

Query: 105 EIDLIKKMAQPRLRRATVQNYKTG-----ELEIAN--YRISKSAWLREPEHPVIERISRR 157
           EI+ +K+ A          N+  G     +L + +    +++  W+R+   P  +RI+ R
Sbjct: 330 EIEDMKQHAT---------NFANGLSISPDLNVTDKPQIVARMQWVRKMT-PFTDRINLR 379

Query: 158 VEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDV 217
           +  +TG      + +Q+ NYGIGGH+ PH+D+  P         G G+R AT++FY S+V
Sbjct: 380 ITDITGFEVDEFKAVQIGNYGIGGHFMPHFDYTTPDRLRIEDIYGLGDRTATIVFYASEV 439

Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            QGGATVF ++ +++ P+KG+A  W+NL      +  + H ACPV++GS
Sbjct: 440 -QGGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLSLHTACPVISGS 487


>gi|194373965|dbj|BAG62295.1| unnamed protein product [Homo sapiens]
          Length = 604

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 94/239 (39%), Positives = 137/239 (57%), Gaps = 21/239 (8%)

Query: 4   PTHQRAQGNKLYYQEALNKSP-----ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTV 58
           P ++R   N L Y+  L +SP     E   + P +    P L+   R+ YE LC+  L  
Sbjct: 258 PDNKRMARNVLKYERLLAESPNHVVAEAVIQRPNI----PHLQT--RDTYEGLCQ-TLGS 310

Query: 59  PPAI--VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
            P +  +  L C Y   +  YL L P+++E  +L+P I LY D + DSE   I+++A+P 
Sbjct: 311 QPTLYQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPW 370

Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
           L+R+ V    +GE ++   YRISKSAWL++   P +  ++ R+  +TGL      AE LQ
Sbjct: 371 LQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQ 427

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           VVNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   NLS+
Sbjct: 428 VVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSV 485


>gi|195575113|ref|XP_002105524.1| GD16980 [Drosophila simulans]
 gi|194201451|gb|EDX15027.1| GD16980 [Drosophila simulans]
          Length = 518

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 91/265 (34%), Positives = 140/265 (52%), Gaps = 19/265 (7%)

Query: 16  YQEALNKSP---ELKDEPPKVNNVAPTLEVTE---------REKYEM--LCRGDLTVPPA 61
           Y++AL +SP   E+  E   +  V  TL ++E          E+ E+   C G    P  
Sbjct: 222 YEDALKQSPHDQEIFQEYQHLKKVL-TLSLSEPIREEPNDDNEEMELPHCCSGRCERPQK 280

Query: 62  IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
           +  +L C Y     P+LRL P+K E   + P +IL  D++  +E  LI+  ++ ++  + 
Sbjct: 281 L-KRLYCVYNCITAPFLRLAPIKTEILSVDPFVILLHDMVSPTEGALIRSSSKNQILPSE 339

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
             N    E E+A +R SKS W     +    ++++R+   TGL    +E  QV+NYGIGG
Sbjct: 340 TVN-AANEFEVAKFRTSKSVWFDSDANEATLKLTQRLGEATGLDMKHSEPFQVINYGIGG 398

Query: 182 HYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
            +E H+D +   E       G  +R+AT LFY++DV QGGAT F  LN++++P+ GT   
Sbjct: 399 VFESHFDTSLADEDRFVN--GYIDRLATTLFYLNDVPQGGATHFPGLNITVFPKFGTVLM 456

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
           W+NLH+ G     T H  CPV+ GS
Sbjct: 457 WYNLHTEGLLHVRTMHTGCPVIVGS 481


>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
          Length = 548

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/222 (36%), Positives = 117/222 (52%), Gaps = 11/222 (4%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYL-QPRIILYRDVMYDS 104
           E++  LCRG+    P     L C   H N P+L L P+K E  +  + R+ ++R      
Sbjct: 293 ERFRRLCRGETLYHPQ--RPLTCELKHYNQPHLFLKPIKVEHLHEGRQRLQVFRQFASPE 350

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           E   ++   + RL RA    +  G  +   +RIS +AWL+     +++RI  R+E  T +
Sbjct: 351 ECRHLQHAGKRRLERAVA--WTDGRFQPVEFRISTAAWLQPDHDAIVKRIHGRIEDATQV 408

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
               AE LQ+ NYG+GG YEPH+D +  G      +   G R+AT + Y++ V QGG T 
Sbjct: 409 DIEYAEALQISNYGMGGFYEPHFDHSSRG------TNPDGERLATFMIYLNPVKQGGFTA 462

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           F  L  ++ P  G A FW+NL  SG GD  T H ACPVL GS
Sbjct: 463 FPRLGAAVQPGYGDAVFWYNLQPSGVGDPLTLHGACPVLRGS 504


>gi|195505197|ref|XP_002099400.1| GE10884 [Drosophila yakuba]
 gi|194185501|gb|EDW99112.1| GE10884 [Drosophila yakuba]
          Length = 527

 Score =  143 bits (361), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 87/240 (36%), Positives = 121/240 (50%), Gaps = 15/240 (6%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           Y  LC+G           LKC    +   Y  L PL+ E  +L P I +Y  ++    I 
Sbjct: 281 YTRLCQGRRLPEERSGDPLKCYLDGKRHAYFILAPLQVEPVHLDPDINVYHGMLSSKHIQ 340

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
            I + A  +    +      G   + + R+S+  WL + + PV++ + R +E ++G   +
Sbjct: 341 SIFEEADKKEMVRSAVAGDGGARTVKDLRVSQQTWL-DYKSPVMKSVGRIIEFVSGFDMA 399

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
            AE +QV NYG+GG YEPH D+        F     G+R++T +FY+SDV QGG TVFT 
Sbjct: 400 GAEFMQVANYGVGGQYEPHPDYFEVNLPEEF----IGDRISTSMFYLSDVEQGGYTVFTK 455

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNS-----LHSTC-----PCGL 277
           LN+ L P KG    WHNLH S D D  T HA CPV+ GS       +HS       PCGL
Sbjct: 456 LNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIVGSKRIGNIWMHSGYQEFRRPCGL 515


>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
 gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
          Length = 547

 Score =  143 bits (361), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 80/222 (36%), Positives = 121/222 (54%), Gaps = 6/222 (2%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C G   V   +   L C Y H   P+L+L P+K E   + P ++L+ D++   E  LI+ 
Sbjct: 298 CSGRCEVSRNLTG-LYCVYNHVTSPFLQLAPIKTEILSIDPFVLLFHDMISQKESTLIRS 356

Query: 112 MAQPR-LRRATVQNYKTG-ELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
            ++   L  AT     +G E  +A +R SKS W     +   +RI+ R+   TGL  +  
Sbjct: 357 SSKEHMLPSATTDVDASGSEDHVATFRTSKSVWYSSTSNDTTKRITERLGDATGLDMNFT 416

Query: 170 EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           E  QV+NYG+GG +E H D      +   +  GT +R+AT LFY+++V QGG T F  LN
Sbjct: 417 EYFQVINYGLGGFFETHLDMLLSDRS---RFNGTRDRLATTLFYLNEVRQGGGTHFPRLN 473

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHS 271
           L+++P+ G+A FW+NL + G+    T H  CPV+ GS  + S
Sbjct: 474 LTVFPQPGSALFWYNLDTRGNDHTSTLHTGCPVIVGSKWVMS 515


>gi|339261892|ref|XP_003367679.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
 gi|316962562|gb|EFV48687.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
          Length = 319

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 84/212 (39%), Positives = 118/212 (55%), Gaps = 32/212 (15%)

Query: 2   IFPTHQRAQGNKLYYQEALNKSPELK----DEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
           I P H RA+GN  +Y + L K    +    D PP VN       + ER+ +E LCRG+  
Sbjct: 109 IKPDHPRAEGNVKWYLDLLAKEGVSRVTDHDLPPIVNARPNDQALPERKDFEALCRGEYL 168

Query: 58  VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
           +     ++L C Y  R+ P+L L P+K E  + +P+I+++R V+  +EI ++K +A PRL
Sbjct: 169 LTEKQRSRLYC-YYKRDTPFLSLAPIKVEVMHWKPKIVIFRQVISANEIAVLKTLAYPRL 227

Query: 118 RRATVQNYKTGELEIA---------------------------NYRISKSAWLREPEHPV 150
            RATVQN +TGELE A                           +YRISKSAWL+E EHPV
Sbjct: 228 SRATVQNSETGELETAKYRISKRCRTLRRATVHNKETGQLEHASYRISKSAWLKEHEHPV 287

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGH 182
           ++RI +R+  MT L   TAE+LQ   YG+GG 
Sbjct: 288 VDRIVKRIHDMTNLNMETAEDLQNATYGLGGQ 319


>gi|161076739|ref|NP_001097101.1| CG34345 [Drosophila melanogaster]
 gi|157400090|gb|ABV53635.1| CG34345 [Drosophila melanogaster]
          Length = 504

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 17/222 (7%)

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           E  C+G    PP    QL CRY     P++R+ PLKEEE    P I LY DV+YDSEI  
Sbjct: 275 EQGCQGKF--PPG--PQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHDVIYDSEIAQ 330

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH---PVIERISRRVEHMTGLT 165
           +  + +  +   T  NY T +      R+++   ++  +     + + +  R+  ++GL 
Sbjct: 331 LTNVTREEMILGTTTNYTTPD------RVNRLFHIKVTDDDGGKLDKTLVNRMADISGLD 384

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATVLFYMSDVAQGGATV 224
                 L  +NYG+GG+++ H D+    +   +  L   G+R+ T LFYM+DV  GG T+
Sbjct: 385 VGNTTTLARINYGLGGYFQEHSDYM---DIKLYPELTEEGDRLMTFLFYMTDVPVGGTTI 441

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           F    L++ P+KG+A FW+NLH++GD +  TRHA CP + GS
Sbjct: 442 FPGAQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGS 483


>gi|92109908|gb|ABE73278.1| IP10618p [Drosophila melanogaster]
          Length = 501

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 17/222 (7%)

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           E  C+G    PP    QL CRY     P++R+ PLKEEE    P I LY DV+YDSEI  
Sbjct: 272 EQGCQGKF--PPG--PQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHDVIYDSEIAQ 327

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH---PVIERISRRVEHMTGLT 165
           +  + +  +   T  NY T +      R+++   ++  +     + + +  R+  ++GL 
Sbjct: 328 LTNVTREEMILGTTTNYTTPD------RVNRLFHIKVTDDDGGKLDKTLVNRMADISGLD 381

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATVLFYMSDVAQGGATV 224
                 L  +NYG+GG+++ H D+    +   +  L   G+R+ T LFYM+DV  GG T+
Sbjct: 382 VGNTTTLARINYGLGGYFQEHSDYM---DIKLYPELTEEGDRLMTFLFYMTDVPVGGTTI 438

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           F    L++ P+KG+A FW+NLH++GD +  TRHA CP + GS
Sbjct: 439 FPGAQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGS 480


>gi|195505216|ref|XP_002099408.1| GE23378 [Drosophila yakuba]
 gi|194185509|gb|EDW99120.1| GE23378 [Drosophila yakuba]
          Length = 546

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 81/214 (37%), Positives = 118/214 (55%), Gaps = 8/214 (3%)

Query: 60  PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLI----KKMAQP 115
           P  + +L C Y     P+LRL P+K E   + P I+L  D++   E  L+    K M  P
Sbjct: 299 PRKLKRLYCVYNGVTAPFLRLAPIKTEILSIDPFIVLLHDMVSVEEGALLRTFSKNMISP 358

Query: 116 R--LRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
                 +  +     E E+ ++R SKS WL    +    ++++R+   TGL  S +E  Q
Sbjct: 359 SETAELSDSEEKSIFEFEVGSFRTSKSVWLDNDANEATLKLTQRLGDATGLDISHSEPFQ 418

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V+NYGIGG +E H+D +   E N F   G  +R+AT LFY++DV QGGAT F  LN++++
Sbjct: 419 VINYGIGGIFESHFDTSLQDE-NRFLD-GYMDRLATTLFYLNDVPQGGATHFPGLNITVF 476

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           P+ GTA FW+NL + G     T H  CPV+ GS 
Sbjct: 477 PKFGTALFWYNLDTKGLLRLRTMHTGCPVIVGSK 510


>gi|328707957|ref|XP_001947811.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
           pisum]
          Length = 507

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 84/222 (37%), Positives = 118/222 (53%), Gaps = 12/222 (5%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           +  LCR   +V P +    KCRY  +N PY  +MP KEE+    P I LY D++YD EI 
Sbjct: 267 FRYLCREGKSVRP-LTYDSKCRYQTKNSPYRMIMPFKEEDISSNPNIKLYHDIIYDEEIK 325

Query: 108 LIKKMAQPRLRRATVQNYKTGELE-IANYRISKSAWLREPEHPVI-ERISRRVEHMTGLT 165
            I  MA   L  A    Y  G++  + + R+ +  W  E  +P++  +++ R+E +T  T
Sbjct: 326 TITDMASKDLSDAAY--YFNGKITLLDDQRLGQLKWFSENANPILFGKLNDRIECITEYT 383

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
           T TAE  Q +NYG+GGH+  H D    G          GNR+ T+LFYM+DV   G TVF
Sbjct: 384 TKTAEGYQTINYGLGGHFSVHMDAFTDGPK------LNGNRLVTILFYMTDVPDDGYTVF 437

Query: 226 TSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
            +LN      KG+A  W NL  ++G     T H  CPV+ G+
Sbjct: 438 PNLNYVAHCRKGSALVWLNLRLNNGSVHSGTFHGGCPVIKGN 479


>gi|405964866|gb|EKC30308.1| KRR1 small subunit processome component-like protein [Crassostrea
           gigas]
          Length = 885

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 89/246 (36%), Positives = 133/246 (54%), Gaps = 28/246 (11%)

Query: 43  TEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMY 102
           TE   YE LCR +      + A+L+C      +PY +    KEE    +PRI ++ DV+ 
Sbjct: 614 TEDAMYEALCREEQKSLHEL-AKLRCFLRDTVIPYYKA---KEEVVNYEPRIAIFHDVIS 669

Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTG---ELEIA-----NYRISKSAWLREPEHPVIERI 154
            + I+ +K +A   L R+TV    TG   ++ I      N R+S++ W+R  E+P + R+
Sbjct: 670 STSIEHLKSIASKGLTRSTVFLENTGPNGQVTITYGKQDNIRVSQTCWIRTDEYPELLRL 729

Query: 155 SRRVEHMTGLTT------STAEELQVVNYGIGGHYEPHYDF--------ARPGEANAFKS 200
             R++ +TGL+       S +E+ QVVNYG+GG Y  H+D+        + P ++    +
Sbjct: 730 ENRIQLITGLSAEYKPVRSHSEKFQVVNYGVGGMYTAHHDYTGYKLGIISNPMDSEDIST 789

Query: 201 LGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAAC 260
             +G+R+AT +FYM+D   GGATVF  +   +   KG AAFW NL  SG  D  T H  C
Sbjct: 790 --SGDRMATWMFYMNDAKAGGATVFPEVRTRIPVAKGGAAFWFNLRPSGATDPRTLHGGC 847

Query: 261 PVLTGS 266
           PVL GS
Sbjct: 848 PVLVGS 853


>gi|195452772|ref|XP_002073493.1| GK14149 [Drosophila willistoni]
 gi|194169578|gb|EDW84479.1| GK14149 [Drosophila willistoni]
          Length = 496

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 83/214 (38%), Positives = 109/214 (50%), Gaps = 23/214 (10%)

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV-- 122
           +L CRY     P+LRL P + EE  L P I+ Y +V+ D EI  + ++    L++     
Sbjct: 265 KLHCRYNTTTTPFLRLAPFRMEELSLDPYIVAYYNVLSDQEITQLDRLTATLLKKTFAIG 324

Query: 123 --QNYKTGELEIANYRISKSAWLREPEHP-------VIERISRRVEHMTGLTTSTAEELQ 173
              +Y        N R +  AW    E P       +IERI   V  +TGL    A+  Q
Sbjct: 325 PDDDYDD------NARTADGAWFPNNETPRTEENIQLIERIINLVSDLTGLQGDKADSFQ 378

Query: 174 VVNYGIGGHYEPHYDFARPG-EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
            V YG GGHY PH+D+     +  AF     G+R+ATV FY++ V  GGATVF SLNL +
Sbjct: 379 AVRYGFGGHYTPHFDYLNMSIDQTAF----YGDRLATVFFYLNTVKHGGATVFPSLNLKV 434

Query: 233 WPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTG 265
             EKG   FW+NL   S D D  T H  CPV+ G
Sbjct: 435 PAEKGKVLFWYNLDGESFDFDENTEHGGCPVVDG 468


>gi|198477148|ref|XP_002136736.1| GA29214 [Drosophila pseudoobscura pseudoobscura]
 gi|198145041|gb|EDY71753.1| GA29214 [Drosophila pseudoobscura pseudoobscura]
          Length = 520

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 80/203 (39%), Positives = 110/203 (54%), Gaps = 6/203 (2%)

Query: 65  QLKCRYV-HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
            L C Y+  R  P+L L  ++ E     P I+LY DV+  S++  ++  ++P L  AT  
Sbjct: 293 HLHCFYLTKRGSPFLLLARVRTEILSDDPFIVLYYDVLTHSDMVSLRNTSEPLLHPATTI 352

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
            Y     E++N R +   WL         R  R +  +TGL  S +E  QV NYGIGG +
Sbjct: 353 QYLNAPQELSNSRTAHFVWLEPTITEATRRADRVLWDVTGLNLSNSEMFQVNNYGIGGSF 412

Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
             H D     E N         R+AT +FY+SDV QGGAT+FT LN++++P+ GT  FW+
Sbjct: 413 MRHSDLLH-SERNYL----VRERIATAIFYLSDVPQGGATLFTELNVTVFPQAGTVLFWY 467

Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
           NL  SGD D  TRH  CPV+ GS
Sbjct: 468 NLAHSGDHDMRTRHTGCPVIGGS 490


>gi|241044301|ref|XP_002407178.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215492128|gb|EEC01769.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 554

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 76/202 (37%), Positives = 121/202 (59%), Gaps = 11/202 (5%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E + Y+ LCRG+    P + ++L+CRY      + +L P+K EEA L+P I++  +V+ D
Sbjct: 287 ETQNYKRLCRGEQLRTPKMDSKLRCRYYKGQHGFFKLQPIKVEEANLKPYIVVMHNVIQD 346

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I+ +   A+PRL+R+T  +Y    +E +  R S +AWL + + PV  R++R +  + G
Sbjct: 347 RDIEDLMAFAKPRLQRST--HYGVRGMEASQVRTSSNAWLNDLDAPVATRLNRFLRSLLG 404

Query: 164 LTTS----TAEELQVVNYGIGGHYEPHYDFAR-----PGEANAFKSLGTGNRVATVLFYM 214
           L T+     AE+ Q+ NYGIGG Y  H+D+ +     P          +G+R+AT++ YM
Sbjct: 405 LGTTYLGGEAEQYQLANYGIGGQYMSHHDYLQDTYHIPNRVTDDFEKTSGDRIATLMVYM 464

Query: 215 SDVAQGGATVFTSLNLSLWPEK 236
           SDV +GGATVF SL + L P+K
Sbjct: 465 SDVEEGGATVFPSLGVRLTPKK 486


>gi|195575095|ref|XP_002105515.1| GD21523 [Drosophila simulans]
 gi|194201442|gb|EDX15018.1| GD21523 [Drosophila simulans]
          Length = 527

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 91/267 (34%), Positives = 135/267 (50%), Gaps = 20/267 (7%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
           PTH   Q  K  Y E+      +K+  P           +    Y  LC+G         
Sbjct: 250 PTHSAQQTQK--YLESRVSGKNVKETNP-----------SWFSNYTRLCQGRRLPEERSG 296

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI-DLIKKMAQPRLRRATV 122
             L C    +   Y  L PL+ E  +L P I +Y  ++   +I  + ++  +  + R+ V
Sbjct: 297 DPLSCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSSKQILSIFEEADKEEMVRSAV 356

Query: 123 QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGH 182
                G+  + + R+S+  WL + + PV+  +SR ++ ++G   + AE +QV NYG+GG 
Sbjct: 357 AG-DGGKRTVRDLRVSQQTWL-DYKSPVMNSVSRIIQFVSGFDMAGAEYMQVANYGVGGQ 414

Query: 183 YEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
           YEPH D+    E N  K+   G+R++T +FY+SDV QGG TVFT LN+ L P KG    W
Sbjct: 415 YEPHPDYF---EVNLPKNF-EGDRISTSMFYLSDVEQGGYTVFTKLNVFLPPVKGALVMW 470

Query: 243 HNLHSSGDGDYYTRHAACPVLTGSNSL 269
           HNLH S D D  T HA CPV+ GS  +
Sbjct: 471 HNLHRSLDVDARTLHAGCPVIVGSKRI 497


>gi|241044303|ref|XP_002407179.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215492129|gb|EEC01770.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 456

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 80/217 (36%), Positives = 125/217 (57%), Gaps = 11/217 (5%)

Query: 51  LCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIK 110
           LCRG+     +    L C Y   + PY ++ P+K E+    P ++ + DV++  EI   +
Sbjct: 224 LCRGEKIRNASEEKDLFCLYDVPH-PYFKIGPVKVEQMNKNPYVLQFYDVLWPQEIKAFR 282

Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
           +M  P+L RATV++  T    +++ R+S+ AW+      +++R++ RV  +TGL+     
Sbjct: 283 RMGDPQLERATVRD--TARNTVSHARVSQVAWISPDSDVLLDRVNARVAMLTGLS----H 336

Query: 171 ELQVVN-YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
            L+  N YG GGHYEPH+D+    E +    LG G+R+AT +FY+SDV  GG+TVF    
Sbjct: 337 RLRKYNSYGPGGHYEPHHDYLE--ELDEVDKLG-GDRIATFMFYLSDVNLGGSTVFPYAK 393

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             + P+ G+AAFW+N+   G  D  T H AC VL G+
Sbjct: 394 AGVMPKMGSAAFWYNMREDGSYDRATLHGACSVLHGT 430


>gi|20177113|gb|AAM12259.1| RE23792p [Drosophila melanogaster]
 gi|220948174|gb|ACL86630.1| PH4alphaSG2-PB [synthetic construct]
 gi|220960438|gb|ACL92755.1| PH4alphaSG2-PB [synthetic construct]
          Length = 301

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 81/223 (36%), Positives = 121/223 (54%), Gaps = 7/223 (3%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI- 106
           Y  LC+G           L+C    +   Y  L PL+ E  +L P I +Y  ++   +I 
Sbjct: 55  YTRLCQGRRLPEERSGDPLRCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSSKQIL 114

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
            + ++  +  + R+ V     GE  + + R+S+  WL + + PV+  + R ++ ++G   
Sbjct: 115 SIFEEADKEEMVRSAVAG-SGGEGTVRDLRVSQQTWL-DYKSPVMNSVGRIIQFVSGFDM 172

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
           + AE +QV NYG+GG YEPH D+    E N  K+   G+R++T +FY+SDV QGG TVFT
Sbjct: 173 AGAEHMQVANYGVGGQYEPHPDYF---EVNLPKNF-EGDRISTSMFYLSDVEQGGYTVFT 228

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
            LN+ L P KG    WHNLH S   D  T HA CPV+ GS  +
Sbjct: 229 KLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSKRI 271


>gi|195061021|ref|XP_001995909.1| GH14207 [Drosophila grimshawi]
 gi|193891701|gb|EDV90567.1| GH14207 [Drosophila grimshawi]
          Length = 477

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 126/227 (55%), Gaps = 22/227 (9%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           EKY  LC       P+   +L C Y   + P+LRL PLK E   + P ++++ + +YDSE
Sbjct: 246 EKYTRLCGASHKPKPS---RLICNYKMDSSPFLRLAPLKMEMLSMDPYVVVFHEAIYDSE 302

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE-----PEHPVIERISRRVEH 160
           ID ++++ + RL R  +   K G+ +  + R S   W+ E      +  ++ERI RRV  
Sbjct: 303 IDELRRLCESRLSRTEIA--KQGKNK--SIRSSSGVWIFELDLNRQQLELLERIRRRVAD 358

Query: 161 MTGLTTS-TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
           M+GL     ++E+Q + Y  GGHY PH+DF               +R+ATVLFY++DVA+
Sbjct: 359 MSGLLIDFNSQEVQYMEYVFGGHYYPHWDFKGIPHLE--------DRIATVLFYLNDVAR 410

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTG 265
           GGAT+F  L L + PE+G    WHN+   + D +  + H ACPV+ G
Sbjct: 411 GGATIFPDLELLVQPERGKVLHWHNMDLGTYDLEKRSLHGACPVIMG 457


>gi|390176894|ref|XP_002136933.2| GA26862 [Drosophila pseudoobscura pseudoobscura]
 gi|388858830|gb|EDY67491.2| GA26862 [Drosophila pseudoobscura pseudoobscura]
          Length = 520

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 78/203 (38%), Positives = 112/203 (55%), Gaps = 6/203 (2%)

Query: 65  QLKCRYV-HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
            L C Y+  R  P+L L  ++ E     P I LY DV+  S++  ++  ++P L  AT  
Sbjct: 293 HLHCFYLTKRGSPFLLLARVRTEILSDDPFIALYYDVLTHSDMVSLRNTSEPLLHPATTI 352

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
            Y     E++N R +   WL         R  R +  +TGL  S +E+ QV NYGIGG +
Sbjct: 353 QYLNAPQELSNSRTAHFVWLEPTITEATRRADRVLWDVTGLNLSNSEKFQVNNYGIGGSF 412

Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
             H D       ++ ++     R+AT +FY+SDV QGGAT+FT LN++++P+ GT  FW+
Sbjct: 413 MRHSD-----PLHSERNYLVRERIATAIFYLSDVPQGGATLFTELNVTVFPQAGTVLFWY 467

Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
           NL  SGD D  TRH  CPV+ GS
Sbjct: 468 NLAHSGDHDMRTRHTGCPVIVGS 490


>gi|195159299|ref|XP_002020519.1| GL13471 [Drosophila persimilis]
 gi|194117288|gb|EDW39331.1| GL13471 [Drosophila persimilis]
          Length = 238

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 80/203 (39%), Positives = 109/203 (53%), Gaps = 6/203 (2%)

Query: 65  QLKCRYV-HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
            L C Y+  R  P+L L  ++ E     P I LY DV+  S++  ++  ++P L  AT  
Sbjct: 31  HLHCFYLTKRGSPFLLLARVRTEILSDDPFIALYYDVLTHSDMVSLRNTSEPLLHPATTI 90

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
            Y     E++N R +   WL         R  R +  +TGL  S +E  QV NYGIGG +
Sbjct: 91  QYFNAPQELSNSRTAHFVWLEPTITEATRRADRVLWDVTGLNLSNSEMFQVNNYGIGGSF 150

Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
             H D     E N         R+AT +FY+SDV QGGAT+FT LN++++P+ GT  FW+
Sbjct: 151 MRHSDLLH-SERNYL----VRERIATAIFYLSDVPQGGATLFTELNVTVFPQAGTVLFWY 205

Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
           NL  SGD D  TRH  CPV+ GS
Sbjct: 206 NLAHSGDHDMRTRHTGCPVIVGS 228


>gi|195159303|ref|XP_002020521.1| GL13468 [Drosophila persimilis]
 gi|194117290|gb|EDW39333.1| GL13468 [Drosophila persimilis]
          Length = 415

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 83/222 (37%), Positives = 116/222 (52%), Gaps = 27/222 (12%)

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           ++ CRG    P   + +L C Y     P+LRL P K E   L P ++LY DV+   E   
Sbjct: 196 QLCCRGG--CPYRDMHRLTCSYNTTAAPFLRLAPFKTELLSLSPYMVLYHDVITPLESLT 253

Query: 109 IKKMAQPRLRR---ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           +K +++P ++R     V N K     I + R S S WL   E+ V+ER+ RRV  MT   
Sbjct: 254 LKNLSKPLMKRRAMVMVNNLKVRPF-IDSGRTSNSVWLTSHENAVMERLERRVGVMTNFE 312

Query: 166 TSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
              +E  Q++NYGIGGHY+PH D F  P                     +SDV QGGAT+
Sbjct: 313 MENSEVYQLINYGIGGHYKPHTDHFETPQ--------------------LSDVPQGGATL 352

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           F  LN+S+ P +G A  W+NL+  G G+  T H +CP++ GS
Sbjct: 353 FPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGS 394


>gi|195069738|ref|XP_001997014.1| GH23597 [Drosophila grimshawi]
 gi|193892024|gb|EDV90890.1| GH23597 [Drosophila grimshawi]
          Length = 239

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 84/233 (36%), Positives = 128/233 (54%), Gaps = 22/233 (9%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           EKY  LC       P+   +L C Y   + P+LRL PLK E   + P ++++ + +YDSE
Sbjct: 6   EKYTRLCGASHKPKPS---RLICNYKMDSSPFLRLAPLKMEMLSMDPYVVVFHEAIYDSE 62

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE-----PEHPVIERISRRVEH 160
           ID ++++ + RL R  +   K G+ +  + R S   W+ E      +  ++ERI RRV  
Sbjct: 63  IDELRRLCESRLSRTEIA--KQGKNK--SIRSSSGVWIFELDLNRQQLELLERIRRRVAD 118

Query: 161 MTGLTTS-TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
           M+GL     ++E+Q + Y  GGHY PH+DF               +R+ATVLFY++DVA+
Sbjct: 119 MSGLLIDFNSQEVQYMEYVFGGHYYPHWDFKGIPHLE--------DRIATVLFYLNDVAR 170

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSNSLHS 271
           GGAT+F  L L + PE+G    WHN+   + D +  + H ACPV+ G   + S
Sbjct: 171 GGATIFPDLELLVQPERGKVLHWHNMDLGTYDLEKRSLHGACPVIMGKKEVIS 223


>gi|195338688|ref|XP_002035956.1| GM16188 [Drosophila sechellia]
 gi|194129836|gb|EDW51879.1| GM16188 [Drosophila sechellia]
          Length = 392

 Score =  140 bits (354), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 79/219 (36%), Positives = 118/219 (53%), Gaps = 11/219 (5%)

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           E  C+G    PP    QL CRY     P++R+ PLKEEE    P I LY DV+YDSEI  
Sbjct: 163 EQGCQGKF--PPG--PQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHDVIYDSEITQ 218

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           +  + +  +   T  NY T +     + I  +    +    + + +  R+  ++GL    
Sbjct: 219 LTNLTREEMILGTTTNYTTPDRVNRLFHIKVT---NDDGGKLDKTLVNRMADISGLDMGN 275

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATVLFYMSDVAQGGATVFTS 227
              L  +NYG+GG+++ H D+    +      L   G+R+ T LFYM+DV  GG T+F  
Sbjct: 276 TTTLARINYGLGGYFQEHSDYM---DIKLHPELTEEGDRLMTFLFYMTDVLVGGGTIFPG 332

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             L++ P+KG+A FW+NLH++GD +  TRHA CP + GS
Sbjct: 333 AQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGS 371


>gi|341878860|gb|EGT34795.1| hypothetical protein CAEBREN_10065 [Caenorhabditis brenneri]
          Length = 163

 Score =  140 bits (354), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 71/128 (55%), Positives = 88/128 (68%), Gaps = 10/128 (7%)

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
           MT L   TAEELQ+ NYGIGGHY+PH+D A+  E+ +F+SLGTGNR+ATVLFYMS  + G
Sbjct: 1   MTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHG 60

Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG----SNS-LHSTC-- 273
           G TVFT +  ++ P K  A FW+NL+  GDG+  TRHAACPVL G    SN  +H     
Sbjct: 61  GGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNE 120

Query: 274 ---PCGLR 278
              PCGL+
Sbjct: 121 FRRPCGLK 128


>gi|198466399|ref|XP_002135181.1| GA23909 [Drosophila pseudoobscura pseudoobscura]
 gi|198150582|gb|EDY73808.1| GA23909 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score =  140 bits (354), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 125/234 (53%), Gaps = 24/234 (10%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
           L CRY     P+LRL PLK EE    P I++Y  V+ D E++ +K++A+P      + N 
Sbjct: 306 LVCRYNSTTTPFLRLAPLKMEEVNHDPYIVMYHQVLSDREMEEMKQLARP------MTNG 359

Query: 126 KTGELEIANYR-----ISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
            +G  E+AN       +++ AWL E   P  ER++ R+  MTG   S  + LQ+ N+G+G
Sbjct: 360 MSGS-EMANLTEPLEIVARVAWLIEAS-PFRERLNLRIGDMTGFDVSDFKALQLANFGVG 417

Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
            +++ HYD+ R    N       G+R  +++FY S+V QGGAT+F  + +++ P+KG + 
Sbjct: 418 SYFKAHYDY-RTERVNDLGVTELGDRTGSIIFYASEVPQGGATIFPDIQVTVTPQKGNSL 476

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGS-----NSLHS-----TCPCGLRRGLQRS 284
           FW N       D  + HA CPV+ GS       LH        PC  R G ++S
Sbjct: 477 FWFNTFDDSTPDPRSLHAICPVIAGSRWTITKWLHQWPQMFLKPCSPRAGERKS 530


>gi|195379218|ref|XP_002048377.1| GJ13934 [Drosophila virilis]
 gi|194155535|gb|EDW70719.1| GJ13934 [Drosophila virilis]
          Length = 469

 Score =  140 bits (354), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 84/222 (37%), Positives = 116/222 (52%), Gaps = 29/222 (13%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG    P      L CRYV ++  YLRL PLK E   LQP I LY DV++DSEI+ +K 
Sbjct: 263 CRG--LWPKRQTLPLTCRYVQQHSAYLRLAPLKMEILSLQPLIQLYHDVLHDSEIEAVKN 320

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +      RA  +N  +           K   LR+  H   + + RR+  M+GL  +    
Sbjct: 321 VTN---HRAMAENLAS---------TVKLITLRDAPH--TQNMHRRITDMSGLDMAQNNT 366

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           L ++N+G+GG+                     GNR+ATV+FY SDV  GGAT+F  L L 
Sbjct: 367 LHLLNFGLGGYLGKQLKL-------------QGNRIATVIFYASDVQLGGATIFPRLQLV 413

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC 273
           + P++G+A  W+NL+++G  D  TRHA CPV+ GS    S C
Sbjct: 414 VKPKRGSALLWYNLNAAGKPDPLTRHAVCPVVVGSRWAISKC 455


>gi|194905313|ref|XP_001981171.1| GG11766 [Drosophila erecta]
 gi|190655809|gb|EDV53041.1| GG11766 [Drosophila erecta]
          Length = 496

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 77/223 (34%), Positives = 122/223 (54%), Gaps = 9/223 (4%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           E ++ LCR   +  P+   +L CRY     P+L L PLK E+  L+P I++Y D++ + +
Sbjct: 256 EDFKRLCRSSFSPKPS---KLHCRYNSTTSPFLILAPLKMEQISLEPYIVVYHDILPEGD 312

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           I  +  +A+PRLR          +     +   K   +     PV++R+++R+  +TGL 
Sbjct: 313 IHQLIALAEPRLRATLAFTEDKSDSVFGAFLPFKD--MNSSGEPVLDRLTQRMRDITGLQ 370

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
                 + ++ YG G HY   +DF    E N+ ++ G G+R+ATV+FY++D   GGATVF
Sbjct: 371 IHQRNRINIIKYGFGAHYAARHDFF--NETNS-ETEGYGDRMATVMFYLNDAPNGGATVF 427

Query: 226 TSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSN 267
             +N+ +  E+G   FW+NL   + D D  T HAACPV  GS 
Sbjct: 428 PRINVKVPAERGKVLFWYNLDGETHDVDPKTVHAACPVFHGSK 470


>gi|25012370|gb|AAN71294.1| RE09701p [Drosophila melanogaster]
          Length = 301

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 81/223 (36%), Positives = 121/223 (54%), Gaps = 7/223 (3%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI- 106
           Y  LC+G           L+C    +   Y  L PL+ E  +L P I +Y  ++   +I 
Sbjct: 55  YTRLCQGRRLPEERSGDPLRCYLDGKRHAYFTLAPLQVELVHLDPDINVYHGMLSSKQIL 114

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
            + ++  +  + R+ V     GE  + + R+S+  WL + + PV+  + R ++ ++G   
Sbjct: 115 SIFEEADKEEMVRSAVAG-SGGEGTVRDLRVSQQTWL-DYKSPVMNSVGRIIQFVSGFDM 172

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
           + AE +QV NYG+GG YEPH D+    E N  K+   G+R++T +FY+SDV QGG TVFT
Sbjct: 173 AGAEHMQVANYGVGGQYEPHPDYF---EVNLPKNF-EGDRISTSMFYLSDVEQGGYTVFT 228

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
            LN+ L P KG    WHNLH S   D  T HA CPV+ GS  +
Sbjct: 229 KLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSKRI 271


>gi|195159305|ref|XP_002020522.1| GL13469 [Drosophila persimilis]
 gi|194117291|gb|EDW39334.1| GL13469 [Drosophila persimilis]
          Length = 253

 Score =  140 bits (353), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 81/203 (39%), Positives = 107/203 (52%), Gaps = 23/203 (11%)

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           ++L C Y      +LRL PLK E   L P ++LY DV+ D E+ L+K MAQ  L RA   
Sbjct: 32  SRLYCLYNTTATAFLRLAPLKMELLSLDPYVVLYHDVLADREMSLLKLMAQRDLVRAVTY 91

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
           N    +      R +K+ WL +P H +I R+    E M+ L    +E+ QV+NYGIGGHY
Sbjct: 92  NATEKKHSEDPNRTTKAGWL-DPSHNLIRRMGILTEDMSNLDLERSEDFQVLNYGIGGHY 150

Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
             H DF                      F +SDV  GGATVF  L+LS++P+KG    W+
Sbjct: 151 AVHPDF----------------------FELSDVPLGGATVFPLLDLSVFPKKGAVLMWY 188

Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
           NL   G G   T H+ACPV+ GS
Sbjct: 189 NLDHKGQGMEKTIHSACPVVVGS 211


>gi|195452736|ref|XP_002073477.1| GK14138 [Drosophila willistoni]
 gi|194169562|gb|EDW84463.1| GK14138 [Drosophila willistoni]
          Length = 518

 Score =  140 bits (352), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 76/215 (35%), Positives = 112/215 (52%), Gaps = 21/215 (9%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C G   V   +  QL C Y  ++  +LR+ P+K E   L P I+LY D +   E   +K 
Sbjct: 303 CNGKCQVSKEL--QLYCLYNTKDSYFLRIAPVKMEVLSLNPYIVLYHDFILPREQGSLKA 360

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
            +   L  A      TGE +  + R +K+ W  +    VI RIS+R+E +T L     E 
Sbjct: 361 QSIKYLSVAETIYPDTGEWQADSSRTAKAMWFEDSSAEVISRISQRIEDITNLNPEKGEL 420

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
            Q++NYGIGG YE HYD+    E                   + DV QGGAT+  +++LS
Sbjct: 421 YQIINYGIGGLYETHYDYLYENE-------------------LQDVPQGGATLLNNISLS 461

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           ++P+ G A FW+NL+++GD ++   H ACPV+ GS
Sbjct: 462 VFPKAGAALFWYNLNNAGDTEWNVAHTACPVIVGS 496


>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
 gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
          Length = 300

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 86/228 (37%), Positives = 129/228 (56%), Gaps = 14/228 (6%)

Query: 46  EKYEMLCRGDLTVPPAIVAQ-LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           +K   LC G+  +P       LKC Y + +    R MP   EE    P IILY ++  ++
Sbjct: 58  QKIRELCIGNENLPAKSSGHHLKCYYFYPSSK-TRFMPYAIEEMSRDPLIILYHNLTSNA 116

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANY----RISKSAWLREPEHPVIERISRRVEH 160
           E++ +K +A  +L+ A V  Y T   +  N     RI+K A++ + E  V   I++R++ 
Sbjct: 117 EMESLKALAAKQLQPAGV--YHTTSADNRNLEGYTRIAKMAFILDEESAVASAITQRLQD 174

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVA 218
           +TGL  + +E LQV+NYGI G Y PHYD   A+ G+    +S  + +R+AT + Y+SDV 
Sbjct: 175 VTGLNMNFSEPLQVINYGIAGQYTPHYDTFPAKSGD----RSHPSHDRLATAILYLSDVE 230

Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +GGATVFT++N+ + P KG    W+N    G+    T HA CPVL GS
Sbjct: 231 RGGATVFTNINVRVLPRKGNVIIWYNYLPDGNLHPGTLHAGCPVLVGS 278


>gi|21358309|ref|NP_651801.1| prolyl-4-hydroxylase-alpha SG2 [Drosophila melanogaster]
 gi|20269808|gb|AAM18059.1|AF495537_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG2
           [Drosophila melanogaster]
 gi|10726875|gb|AAG22175.1| prolyl-4-hydroxylase-alpha SG2 [Drosophila melanogaster]
          Length = 527

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 81/223 (36%), Positives = 121/223 (54%), Gaps = 7/223 (3%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI- 106
           Y  LC+G           L+C    +   Y  L PL+ E  +L P I +Y  ++   +I 
Sbjct: 281 YTRLCQGRRLPEERSGDPLRCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSSKQIL 340

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
            + ++  +  + R+ V     GE  + + R+S+  WL + + PV+  + R ++ ++G   
Sbjct: 341 SIFEEADKEEMVRSAVAG-SGGEGTVRDLRVSQQTWL-DYKSPVMNSVGRIIQFVSGFDM 398

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
           + AE +QV NYG+GG YEPH D+    E N  K+   G+R++T +FY+SDV QGG TVFT
Sbjct: 399 AGAEHMQVANYGVGGQYEPHPDYF---EVNLPKNF-EGDRISTSMFYLSDVEQGGYTVFT 454

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
            LN+ L P KG    WHNLH S   D  T HA CPV+ GS  +
Sbjct: 455 KLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSKRI 497


>gi|198449520|ref|XP_002136916.1| GA26928 [Drosophila pseudoobscura pseudoobscura]
 gi|198130644|gb|EDY67474.1| GA26928 [Drosophila pseudoobscura pseudoobscura]
          Length = 532

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 80/231 (34%), Positives = 127/231 (54%), Gaps = 14/231 (6%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
           ++  +CR      P+   +L CRY     P+LRL PL+ EE  L P I++Y +V+ D+EI
Sbjct: 281 EFIQICRSSHQNKPS---RLHCRYNATTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEI 337

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA-----WLREPEHPVIERISRRVEHM 161
             ++++ +P L+R    +     +  +  R   +      ++     PVIER+ R +  M
Sbjct: 338 AEVERVIEPLLQRIGRYDETPNSMSPSKRRTGFTGPHIDDYMHVSGAPVIERVHRHIRDM 397

Query: 162 TGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
           TGL  +  E L +V YG+GGH + HYDF     A+   +   G+R+ATVLFY++DV  GG
Sbjct: 398 TGLFMN--EHLMMVKYGLGGHCDQHYDFLN---ASYPSTHAMGDRMATVLFYLNDVKHGG 452

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGSNSLHS 271
           +T FT L L +  E+G   FW+N+   + + D  T H +CPV+ G+  + S
Sbjct: 453 STAFTDLQLKVPSERGKVLFWYNMRGETHNLDRRTVHGSCPVIDGTKKILS 503


>gi|195577074|ref|XP_002078398.1| GD23422 [Drosophila simulans]
 gi|194190407|gb|EDX03983.1| GD23422 [Drosophila simulans]
          Length = 513

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 77/221 (34%), Positives = 119/221 (53%), Gaps = 15/221 (6%)

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           E  C+G    PP    QL CRY     P++R+ PLKEEE    P I LY +V+YDSEI  
Sbjct: 284 EQGCQGKF--PPG--PQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHNVIYDSEIAQ 339

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH---PVIERISRRVEHMTGLT 165
           +  + +  +   T  NY T +      R+ +   ++  +     + + +  R+  ++GL 
Sbjct: 340 LTNLTREEMILGTTTNYTTPD------RVDRLFHIKVTDDDGGKLDKTLVNRMADISGLD 393

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
                 L  +NYG+GG+++ H D+              G+R+ T LFYM+D+  GGAT+F
Sbjct: 394 VGNTTTLARINYGLGGYFQEHSDYMDIKLHPELTE--EGDRLMTFLFYMTDIPVGGATIF 451

Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
               L++ P+KG+A FW+NLH++GD +  TRHA CP + GS
Sbjct: 452 PGAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGS 492


>gi|195159309|ref|XP_002020524.1| GL13466 [Drosophila persimilis]
 gi|194117293|gb|EDW39336.1| GL13466 [Drosophila persimilis]
          Length = 643

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 79/203 (38%), Positives = 108/203 (53%), Gaps = 6/203 (2%)

Query: 65  QLKCRYV-HRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
            L C Y+  R  P+L L  ++ E     P I LY DV+  S++  ++  ++P L  AT  
Sbjct: 416 HLHCFYLTKRGSPFLLLARVRTEILSDDPFIALYYDVLTHSDMVSLRNTSEPLLHPATTI 475

Query: 124 NYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
            Y     E++N R +   WL         R  R +  +TGL  S +E  QV NYGIGG +
Sbjct: 476 QYFNAPQELSNSRTAHFVWLEPTITEATRRADRVLWDVTGLNLSNSEMFQVNNYGIGGSF 535

Query: 184 EPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWH 243
             H D     E N         R+AT +FY+SDV  GGAT+FT LN++++P+ GT  FW+
Sbjct: 536 MRHSDLLH-SERNYL----VRERIATAIFYLSDVPHGGATLFTELNVTVFPQAGTVLFWY 590

Query: 244 NLHSSGDGDYYTRHAACPVLTGS 266
           NL  SGD D  TRH  CPV+ GS
Sbjct: 591 NLAHSGDHDMRTRHTGCPVIVGS 613


>gi|241029040|ref|XP_002406378.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215491954|gb|EEC01595.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 539

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 84/225 (37%), Positives = 121/225 (53%), Gaps = 20/225 (8%)

Query: 39  TLEVT----EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRI 94
           T EVT    E + Y+ LCRG L   P + +QL+CRY      +  L P+K EE  L+P I
Sbjct: 271 TQEVTPDDQEDQSYKRLCRGKLLRSPKMESQLRCRYYKGQDGFFALQPIKLEEMNLKPYI 330

Query: 95  ILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERI 154
           I+  DV+ D +I  +   A+PR+R+          L I +     SAWL E E P+  R+
Sbjct: 331 IVMHDVLQDKDIKELMAFAEPRVRKT------LPYLFICHIHTFYSAWLNEDEAPIAVRM 384

Query: 155 SRRVEHMTGLTTST----AEELQVVNYGIGGHYEPHYDFARP------GEANAFKSLGTG 204
           +  +  + G+ TS     AE  Q+ NYG GG + PH+DF +         A+ +   GTG
Sbjct: 385 NSYLRALLGMGTSDTDEEAEAYQLANYGTGGQFLPHHDFLQDSFHSYNSSADYYLQYGTG 444

Query: 205 NRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSG 249
           +RVAT++ Y++DV +GGATVF +L L L P+K    F    +S G
Sbjct: 445 DRVATLMIYLTDVEEGGATVFPTLGLRLTPKKVNLFFISLRNSDG 489


>gi|195166677|ref|XP_002024161.1| GL22880 [Drosophila persimilis]
 gi|194107516|gb|EDW29559.1| GL22880 [Drosophila persimilis]
          Length = 507

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 121/224 (54%), Gaps = 18/224 (8%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           +E  CRG       +V    CRY     P+LRL PLK EE    P I++Y  V+ D E++
Sbjct: 235 HESGCRGLFPKRTNLV----CRYNSTTTPFLRLAPLKMEEVNHDPYIVMYHQVLSDREME 290

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYR-----ISKSAWLREPEHPVIERISRRVEHMT 162
            +K++A+P      + N  +G  E+AN       +++ AWL E   P  ER++ R+  MT
Sbjct: 291 EMKQLARP------MTNGMSGS-EMANLTEPLEIVARVAWLIEAS-PFRERLNLRIGDMT 342

Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
           G   S  + LQ+ N+G+G +++ HYD+ R    N       G+R  +++FY S+V QGG 
Sbjct: 343 GFDVSDFKALQLANFGVGSYFKAHYDY-RTERVNDLGVTELGDRTGSIIFYASEVPQGGT 401

Query: 223 TVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           T+F  + +++ P+KG + FW N       D  + HA CPV+ GS
Sbjct: 402 TIFPDIQVTVTPQKGNSLFWFNTFDDSTPDPRSLHAICPVIAGS 445


>gi|195069797|ref|XP_001997029.1| GH12978 [Drosophila grimshawi]
 gi|193891498|gb|EDV90364.1| GH12978 [Drosophila grimshawi]
          Length = 518

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 81/222 (36%), Positives = 121/222 (54%), Gaps = 12/222 (5%)

Query: 48  YEMLCRGDLTVPPAIVAQL--KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           Y  LC+G   +P     Q   +C        Y +L PLK E+  L P I +Y DV+ D++
Sbjct: 281 YVRLCQGK-RLPEIKTNQSSPRCYLDSNQHAYFKLSPLKVEQVNLAPDINIYYDVLNDNQ 339

Query: 106 IDLIKKMA-QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           I  I +++ +    R++V  Y      + + R+S+  WL     P++    + V  ++G 
Sbjct: 340 IKSILELSTEFESFRSSVNKYN-----VTDKRVSQQVWLNYSS-PIMRTYRQLVGAISGF 393

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
             + AE +QV NYGIGG YEPH+DF+    A  + + G  +R++T + Y+SDV QGG TV
Sbjct: 394 NMTNAEIMQVANYGIGGQYEPHHDFSGANLAARYANFG--DRISTNMIYLSDVQQGGYTV 451

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           F + N+ + P KG    WHNL  S DGD  T HA CPV+ G+
Sbjct: 452 FPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIEGT 493


>gi|195110921|ref|XP_002000028.1| GI24861 [Drosophila mojavensis]
 gi|193916622|gb|EDW15489.1| GI24861 [Drosophila mojavensis]
          Length = 508

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 84/247 (34%), Positives = 127/247 (51%), Gaps = 18/247 (7%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           +  LC+G     P     L C       P  RL PLK E+A+L P I +Y DV+ D +I+
Sbjct: 276 FSRLCQGKRLPEPG---SLSCYLDFERHPRFRLSPLKVEQAHLNPDIHIYYDVLTDPQIE 332

Query: 108 LIKKMAQPRLRRATVQNYKTGELE--IANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
            +  +A      + ++++++  L   +   R+S+  WL     P++  +   +  ++GL 
Sbjct: 333 SVLDLA------SQLESFRSKVLGDVVTETRVSQQVWLNYTS-PIMRTVGNLLGAISGLD 385

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
            +  EE+QV NYGIGG Y PH+D+      +  +    GNR+ T +FY+SDV QGG TVF
Sbjct: 386 MTNVEEMQVANYGIGGQYFPHFDYISELREDYIER---GNRITTNMFYLSDVLQGGYTVF 442

Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQRSG 285
             LN+ L P KG+   W N+H S   D    HA CPVL GS  + +      ++  +R  
Sbjct: 443 PFLNVFLRPVKGSLVIWPNVHRSLAPDSRVLHAGCPVLEGSKRIGNIWIHSAQQEFRRP- 501

Query: 286 IICTLVG 292
             CTLV 
Sbjct: 502 --CTLVS 506


>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
          Length = 492

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 94/273 (34%), Positives = 131/273 (47%), Gaps = 18/273 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVN-----NVAP--TLEVTERE--KYEMLCRG 54
           P + R   N  YY   L+K+         V      N  P   L   ERE  K+  LC+G
Sbjct: 208 PDNGRVFKNVEYYTHQLHKASNGTSASGSVRVARKANYRPDNVLGRDERELLKFNKLCQG 267

Query: 55  DLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYL-QPRIILYRDVMYDSEIDLIKKMA 113
                P+    L CR  H N P+L L P++ E  +    R+ ++R+     E   +++  
Sbjct: 268 RKIYKPS--KPLSCRLQHFNKPHLFLKPIRVEYVHEGNNRLQIFRNFASAQECAHLREEG 325

Query: 114 QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQ 173
           + +L RA    +  G      +RIS +AWL+     V+  +  R+   T L    AE LQ
Sbjct: 326 RKKLSRAVA--WTDGAFRPVEFRISTAAWLQPDHDDVVTNLHTRIADATQLDLEFAEALQ 383

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW 233
           V NYGIGG YE HYD      A+  + L  G+R+AT + Y++ V QGG T F  L  ++ 
Sbjct: 384 VSNYGIGGFYETHYDH----HASRERELPEGDRIATFMIYLNQVEQGGYTAFPRLGAAVE 439

Query: 234 PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           P  G A FW+NL   G+ D  T H ACPVL GS
Sbjct: 440 PGHGDAVFWYNLLPDGESDNNTLHGACPVLQGS 472


>gi|194905381|ref|XP_001981186.1| GG11928 [Drosophila erecta]
 gi|190655824|gb|EDV53056.1| GG11928 [Drosophila erecta]
          Length = 543

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 72/207 (34%), Positives = 115/207 (55%), Gaps = 5/207 (2%)

Query: 63  VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
           + +L C Y     P+L+L P+K E   L P ++L  D++   E  LI+  ++  L ++ +
Sbjct: 304 LTRLYCVYNRVTSPFLQLAPIKTEILSLDPFVLLLHDMVRQKESTLIRASSKEHLLQSEI 363

Query: 123 QN--YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
            N    + E  +A +R SKS W     +   ++I+ R+   TGL     E  QV+NYG+G
Sbjct: 364 TNTDASSSEDNVAIFRTSKSVWYSSDFNDTTKKITERLADATGLDMHFTEYFQVINYGLG 423

Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
           G +  H D      ++  +  GT +R+AT +FY++ V QGGAT F  LNL+++P+ G+A 
Sbjct: 424 GFFATHLDMLL---SDKTRFNGTSDRIATTVFYLNGVRQGGATHFPLLNLTVFPQPGSAL 480

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGSN 267
           FW+NL + G+    T H  CPV+ GS 
Sbjct: 481 FWYNLDTKGNDQRSTMHTGCPVIVGSK 507


>gi|194905424|ref|XP_001981193.1| GG11755 [Drosophila erecta]
 gi|190655831|gb|EDV53063.1| GG11755 [Drosophila erecta]
          Length = 527

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 90/267 (33%), Positives = 132/267 (49%), Gaps = 20/267 (7%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
           PTH   Q  K     AL K  + +  P  + N            Y MLC+G         
Sbjct: 250 PTHSAQQTRKYLESRALGKIDQ-ETNPTWLAN------------YTMLCQGRRLPEERSA 296

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI-DLIKKMAQPRLRRATV 122
             LKC    +   Y  L PL+ E  +L P I +Y  ++  ++I  ++ +  + ++ R+ V
Sbjct: 297 DPLKCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSANQILSILDEAEKMQMFRSAV 356

Query: 123 QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGH 182
                G   + + R+S+  WL + +  V++ + R  E ++G   + AE +QV NYG+GG 
Sbjct: 357 SG-NGGNSTVKDLRVSQQTWL-DYKSAVMKSVGRINELVSGFDMAGAEYMQVANYGVGGQ 414

Query: 183 YEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
           YEPH D+        FK    G+R++T +FY+SDV QGG TVF  LN+ L P  G    W
Sbjct: 415 YEPHPDYFGVNLPVEFK----GDRISTSMFYLSDVEQGGYTVFPKLNVFLPPVSGALVMW 470

Query: 243 HNLHSSGDGDYYTRHAACPVLTGSNSL 269
           HNLH S D D  T HA CPV+ GS  +
Sbjct: 471 HNLHRSLDVDARTLHAGCPVIVGSKRI 497


>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
 gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
          Length = 539

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 73/207 (35%), Positives = 113/207 (54%), Gaps = 3/207 (1%)

Query: 60  PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
           P  + +L C Y      +LRL P+K E   + P ++L  D++   E  LI+  ++  +  
Sbjct: 299 PRKLKRLYCVYNCATAAFLRLAPIKTEILSIDPFVVLLHDMVSPKEAALIRSSSKSTIFP 358

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
           +   N    +  ++ +R SKS WL    +    ++++R+   TGL    +E  QV+NYGI
Sbjct: 359 SETVN-AANDFVVSKFRTSKSVWLDRDANEATVKLTQRLADATGLDVKHSEHFQVINYGI 417

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           GG +E H+D     + N F   G  +R+AT LFY++DV QGGAT F  LN++++P  G A
Sbjct: 418 GGVFESHFDTTLE-DTNRFVG-GFIDRIATTLFYLNDVPQGGATHFPGLNITVFPRLGAA 475

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
            FW+NL + G     T H  CPV+ GS
Sbjct: 476 LFWYNLDTQGMLQVRTMHTGCPVIVGS 502


>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 591

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 84/239 (35%), Positives = 131/239 (54%), Gaps = 26/239 (10%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           YE LCR +      + A+L+C      +PY +    KEE    +PRI ++ DV+  + I+
Sbjct: 327 YEALCREEQKSLQEL-AKLRCFLRETVIPYYKA---KEEVVNYEPRIAIFHDVISPTSIE 382

Query: 108 LIKKMAQPRLRRATVQNYKTGEL------EIANYRISKSAWLREPEHPVIERISRRVEHM 161
            +K +A     R+TV    TG        ++ N R+S+++WL   E+P + R+  R++  
Sbjct: 383 HLKSVASKGFTRSTVFLENTGPDGHVTYGKLDNVRVSQTSWLGTDEYPELSRLENRIKLT 442

Query: 162 TGLTT------STAEELQVVNYGIGGHYEPHYDF--------ARPGEANAFKSLGTGNRV 207
           TGL+       S +E+ QV+NYG+GG Y  HYD+        + P +++  ++  +G R+
Sbjct: 443 TGLSAEYKSVRSHSEKFQVLNYGVGGMYTVHYDYTGYMLGIPSNPLDSDDIRT--SGERM 500

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           AT +FY++DV  GGATVF  +   +   KG AAFW+N+  SG  D  T H  CPVL GS
Sbjct: 501 ATWMFYLNDVKAGGATVFPEVKTRIPVAKGGAAFWYNVRPSGATDPRTLHGGCPVLVGS 559


>gi|195575103|ref|XP_002105519.1| GD17002 [Drosophila simulans]
 gi|194201446|gb|EDX15022.1| GD17002 [Drosophila simulans]
          Length = 793

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 80/215 (37%), Positives = 115/215 (53%), Gaps = 14/215 (6%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREK-YEMLCRGDLTVPPAI 62
           P H+ A  NK+ Y+  L +        P+     P  E  E  K Y  +CRG+L   P  
Sbjct: 215 PDHEDALKNKILYEGQLARERSF---VPREQAELPQKEQKESYKLYTQVCRGELHQSPRD 271

Query: 63  VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
              L+C   H+ VPY  L P K E+  + P +    +V++DSEID I +  +  + R+ V
Sbjct: 272 QRNLRCWLSHQGVPYYHLSPFKIEQLNIDPYVAYVHEVLWDSEIDTIMEHGKGNMERSKV 331

Query: 123 ---QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
              +N  T E+     RIS++ WL    +P + +I +R+E +TGL+T +AE LQ+VNYGI
Sbjct: 332 GQIENSTTTEV-----RISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGI 386

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYM 214
           GG YEPH+DF      N F     GNR+ T LFY+
Sbjct: 387 GGQYEPHFDFVEDDGQNVFS--WKGNRLLTALFYL 419


>gi|47204411|emb|CAF95476.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 284

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 76/153 (49%), Positives = 94/153 (61%), Gaps = 7/153 (4%)

Query: 116 RLRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEEL 172
           +LRR+ V    T + ++ A YRISKSAWL+      + R+ +R+  +TGL       E L
Sbjct: 110 KLRRSVV---ATRDKQVTAEYRISKSAWLKGSAQSAVSRLDQRISMLTGLNVQHPHGEYL 166

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QVVNYGIGGHYEPH+D A    +  FK L TGNRVATV+ Y+S V  GG+T F   N S+
Sbjct: 167 QVVNYGIGGHYEPHFDHATSPSSPVFK-LKTGNRVATVMIYLSSVEAGGSTAFIYANFSV 225

Query: 233 WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
              K  A FW NLH +G GD  T HA CPVL G
Sbjct: 226 PVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLIG 258


>gi|195069795|ref|XP_001997028.1| GH12977 [Drosophila grimshawi]
 gi|193891497|gb|EDV90363.1| GH12977 [Drosophila grimshawi]
          Length = 517

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 82/222 (36%), Positives = 120/222 (54%), Gaps = 12/222 (5%)

Query: 48  YEMLCRGDLTVPPAIVAQL--KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           Y  LC+G   +P     Q   +C        Y +L PLK E+  L P I +Y DV+ D++
Sbjct: 280 YVRLCQGK-RLPEIKTNQSSPRCYLDSNQHAYFKLSPLKVEQVNLAPDINIYYDVLNDNQ 338

Query: 106 IDLIKKMA-QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           I  I +++ +    R++V  Y      + + R+S+  WL     P++    + V  ++G 
Sbjct: 339 IKSILELSTEFDSFRSSVNKYN-----VTDKRVSQQVWLNYSS-PIMRTYRQLVGAISGF 392

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
             + AE +QV NYGIGG YEPH+DF   G      S+  G+R++T + Y+SDV QGG TV
Sbjct: 393 NMTNAETMQVANYGIGGQYEPHHDFF--GINLPANSVKRGDRISTNMIYLSDVQQGGYTV 450

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           F + N+ + P KG    WHNL  S DGD  T HA CPV+ G+
Sbjct: 451 FPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIEGT 492


>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
 gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
          Length = 193

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 65/136 (47%), Positives = 90/136 (66%), Gaps = 5/136 (3%)

Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPH---YDFARP 192
           R +K  WL++  + + +RI+RR+  MTG   + +E  QV+NYGIGGHY  H   +DFA  
Sbjct: 28  RTAKGFWLKKESNELTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASS 87

Query: 193 GEANAFK--SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGD 250
              +     S+  G+R+ATVLFY++DV QGGATVF  +   + P+ GTA FW+NL + G+
Sbjct: 88  NHTDTRSRYSIDLGDRIATVLFYLTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGN 147

Query: 251 GDYYTRHAACPVLTGS 266
           GD  TRHAACPV+ GS
Sbjct: 148 GDPRTRHAACPVIVGS 163


>gi|195391756|ref|XP_002054526.1| GJ24503 [Drosophila virilis]
 gi|194152612|gb|EDW68046.1| GJ24503 [Drosophila virilis]
          Length = 519

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 84/246 (34%), Positives = 122/246 (49%), Gaps = 12/246 (4%)

Query: 46  EKYEMLCRGD-LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDS 104
           + Y  LC+G  L+ P    + L C          RL PLK E+  L P I +Y D++ D 
Sbjct: 281 DNYTQLCQGKRLSEPKPNGSALNCYLDFTRHARFRLAPLKVEQVRLNPDIHIYYDLINDD 340

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           +ID I ++      +            + ++R+S+  WL     P++  +S  V  ++G 
Sbjct: 341 QIDDIYEVVD----QFDSFRSSVSSSIVTDWRVSQQVWLNYSS-PILRSVSNLVGAISGF 395

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
               AE++QV NYGIGG Y PH D+      +    +  GNR+AT +FY+SDV  GG TV
Sbjct: 396 DMENAEQMQVANYGIGGQYAPHTDYLSKIPDS---YIPRGNRIATNMFYLSDVLNGGYTV 452

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQRS 284
           F  LN+ L P KG    W+NLH S + D  T HA CPV+ G   + +      R+  +R 
Sbjct: 453 FPKLNVFLKPVKGAMVSWYNLHRSLNKDSRTLHAGCPVIEGVKRIGNIWIHSTRQEFRRP 512

Query: 285 GIICTL 290
              CTL
Sbjct: 513 ---CTL 515


>gi|449284064|gb|EMC90646.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Columba livia]
          Length = 174

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 71/157 (45%), Positives = 97/157 (61%), Gaps = 13/157 (8%)

Query: 133 ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYGIGGHYEPHYDFA 190
           A YRISKSAWL++  HPV++ + +R+  +TGL      AE LQVVNYG+GGHYEPH+D A
Sbjct: 15  AEYRISKSAWLKDTAHPVVQTLEKRMAAVTGLDLRPPYAEYLQVVNYGLGGHYEPHFDHA 74

Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGD 250
              ++  ++ + +GNR+AT++ Y+S V  GG+T F   NLS+   K  A FW NL  +GD
Sbjct: 75  TSRKSPLYR-MKSGNRIATLMIYLSAVGAGGSTAFVHANLSVPVVKNAALFWWNLRRNGD 133

Query: 251 GDYYTRHAACPVLTGSNSLHSTC----------PCGL 277
           GD  T HA CPVL G   + +            PCG+
Sbjct: 134 GDGDTLHAGCPVLAGDKWVANKWIHEHGQEFRRPCGI 170


>gi|195452738|ref|XP_002073478.1| GK14139 [Drosophila willistoni]
 gi|194169563|gb|EDW84464.1| GK14139 [Drosophila willistoni]
          Length = 215

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 68/202 (33%), Positives = 110/202 (54%), Gaps = 19/202 (9%)

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
           +L C Y  ++  +LR+ P+K E   L P I+LY D +  SE + +K  +  RL  A   +
Sbjct: 11  KLYCLYNTKDSYFLRIAPVKMEVLSLDPYIVLYHDFILSSEQEFLKAESIERLSVAETVD 70

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
             TG+      R +K+ W  +    VI RI++R+E +T L     +  Q+++YGIGG ++
Sbjct: 71  PDTGKWYADASRTAKAMWFYDTSSVVIRRINQRIEEITNLDPEKGDLYQIISYGIGGLFQ 130

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
            HYD+    E                   + DV QGGAT+F +++LS++P+ G A FW+N
Sbjct: 131 THYDYLHENE-------------------LQDVPQGGATLFNNISLSVFPKAGAALFWYN 171

Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
           L+++GD ++   H  CPV+ GS
Sbjct: 172 LNNAGDTEWNVAHTGCPVIVGS 193


>gi|195505241|ref|XP_002099419.1| GE10893 [Drosophila yakuba]
 gi|194185520|gb|EDW99131.1| GE10893 [Drosophila yakuba]
          Length = 508

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 76/227 (33%), Positives = 123/227 (54%), Gaps = 13/227 (5%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           E Y+ LCR   +  P+   +L CRY     P+L L P K EE  L+P I++Y D++ D +
Sbjct: 262 EDYKRLCRSSFSPRPS---KLLCRYNSDTSPFLILAPFKMEEISLEPYIVVYHDILPDKD 318

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH-----PVIERISRRVEH 160
           +  +  +A+PRLR   V      E   ++ R +   +L   +      P+++R+++R+  
Sbjct: 319 MQQLIALAEPRLRPTEVFEEDKSEARTSD-RSALGTFLPFKDMNPSGGPLLDRLTQRMRD 377

Query: 161 MTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
           +TG+         ++ YG G  Y  ++DF     +   +  G G+R+ATVLFY++D   G
Sbjct: 378 ITGIQIRHENTFNIIKYGFGSQYATNFDFFNGTNS---EMEGYGDRMATVLFYLNDAPNG 434

Query: 221 GATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTGS 266
           GATVF  +++ +  E+G   FWHNL+  + D +  T HAACPV  GS
Sbjct: 435 GATVFPRIDVKVTAERGKVLFWHNLNGETHDVEPNTLHAACPVFQGS 481


>gi|241598365|ref|XP_002404734.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215500465|gb|EEC09959.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 524

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 73/203 (35%), Positives = 118/203 (58%), Gaps = 10/203 (4%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           E Y+ LCRG+    P + +QL+CRY      + +L P+K EE  L+P +++ RD++ D +
Sbjct: 274 ENYKRLCRGEQLRTPKMDSQLRCRYYSGESGFFKLQPIKLEEYNLKPYVVVLRDLLQDRD 333

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           +  +   A+PR+R+  +       L  + +    S WL + + PV  R+++ ++ + GL 
Sbjct: 334 LADMIAFAKPRVRKLQLSRRI---LVYSKHYCDTSTWLNDDDAPVAARVNQYLQSLLGLG 390

Query: 166 T----STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT---GNRVATVLFYMSDVA 218
           T      AE+ Q+ NYGIGGHY PH+D+      +   S+ T   G+RVAT++ YMSDV 
Sbjct: 391 TLYSKDEAEKYQLANYGIGGHYVPHHDYLEETLTSRHVSIVTRLFGDRVATLMIYMSDVE 450

Query: 219 QGGATVFTSLNLSLWPEKGTAAF 241
           +GGATVF SL + + P+K +  F
Sbjct: 451 EGGATVFPSLGVRVSPKKVSMQF 473


>gi|47191658|emb|CAG13505.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 156

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 72/154 (46%), Positives = 94/154 (61%), Gaps = 26/154 (16%)

Query: 64  AQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR---- 118
           + L CRY   N  P L L P KEE+ +  P I+ Y D + D+EID IK++A+P++R    
Sbjct: 3   SHLFCRYRSGNRNPRLLLKPFKEEDEWDSPHIVRYLDFLSDTEIDKIKELAKPKVRHYSK 62

Query: 119 ---------------------RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRR 157
                                RATV++ KTG L  ANYR+SKSAWL   E PVI R+++R
Sbjct: 63  KKSVCYNVEITRSTFLFFQLARATVRDPKTGVLTTANYRVSKSAWLEGEEDPVIARVNQR 122

Query: 158 VEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFAR 191
           +E +TGLT  TAE LQV NYG+GG YEPH+DF+R
Sbjct: 123 IEDLTGLTVETAELLQVANYGLGGQYEPHFDFSR 156


>gi|402584932|gb|EJW78873.1| hypothetical protein WUBG_10221 [Wuchereria bancrofti]
          Length = 187

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 67/127 (52%), Positives = 83/127 (65%), Gaps = 2/127 (1%)

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
           S+WL   EH V+ RI++R++  T L T TAEELQV NYGIGGHYEPHYD +R    + F+
Sbjct: 7   SSWLGSTEHEVVNRINKRLDLATNLETETAEELQVQNYGIGGHYEPHYDCSR--RESVFE 64

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAA 259
               GNR+AT+L YM+    GG TVF  L  S+   K  A FW+NL  SG  D  + HAA
Sbjct: 65  KTKNGNRIATILIYMTKPEIGGGTVFIDLKTSISCTKNAALFWYNLMRSGAVDIRSYHAA 124

Query: 260 CPVLTGS 266
           CPVLTG+
Sbjct: 125 CPVLTGT 131


>gi|195352184|ref|XP_002042594.1| GM14981 [Drosophila sechellia]
 gi|194124478|gb|EDW46521.1| GM14981 [Drosophila sechellia]
          Length = 539

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 79/225 (35%), Positives = 118/225 (52%), Gaps = 8/225 (3%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
           K E  CRG+ +   +   +L CRY      +L+L PLK E   +QP I+LY DV+Y++E 
Sbjct: 297 KLERGCRGEWSRKSS--PELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEF 354

Query: 107 DLIKKMA---QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
             ++ +A      +   T  ++          R+ K    +    P    I+RR+  M+G
Sbjct: 355 KSMRDLAMYNDSMIDGWTYVDFDKKGNPKQQDRVVKIISFQGTTAPFTLSINRRLADMSG 414

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEA--NAFKSLGTGNRVATVLFYMSDVAQGG 221
           L       L + NYG+GGH+  H D+    +   + F   G G+R+AT LFY SDV  GG
Sbjct: 415 LEMRENMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFG-GDRIATALFYASDVPLGG 473

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            TVFT L +++ P+KG A  W NL+ +G+ D  T H+ CPV+ GS
Sbjct: 474 TTVFTKLKIAVKPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGS 518


>gi|195390825|ref|XP_002054068.1| GJ24233 [Drosophila virilis]
 gi|194152154|gb|EDW67588.1| GJ24233 [Drosophila virilis]
          Length = 533

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 81/210 (38%), Positives = 117/210 (55%), Gaps = 16/210 (7%)

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
           +L C Y      +LRL P K E     P I ++ DV+Y SEI  + ++ +P L+R  VQN
Sbjct: 296 RLTCYYKTNPSEFLRLAPFKLELLSKDPYIAVFHDVIYASEIAELIRIGEPMLKRTAVQN 355

Query: 125 Y-KTGELEIANYRISKSAW-----LREPEHPVIERISRRVEHMTGL--TTSTAEELQVVN 176
             +  +  I+  R +  +W     L + E  +I RI RR+E MTGL  T  + ++LQ++N
Sbjct: 356 ITQNVDTYISKDRTATGSWILNGNLTKLERNMIWRIQRRIEDMTGLLITGFSEQDLQLLN 415

Query: 177 YGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
           Y  GGHY+ HYDF       +F      +R+AT L Y++DV +GGATVF  L+L + PE+
Sbjct: 416 YVFGGHYQSHYDFF---NCPSFPH----DRIATTLIYLNDVVRGGATVFPKLDLVVQPER 468

Query: 237 GTAAFWHN-LHSSGDGDYYTRHAACPVLTG 265
           G    W+N L  + D D  + H  CPVL G
Sbjct: 469 GKVLHWYNMLPDTFDYDRRSLHGGCPVLIG 498


>gi|21358233|ref|NP_651814.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
 gi|20269810|gb|AAM18060.1|AF495538_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE3
           [Drosophila melanogaster]
 gi|15291443|gb|AAK92990.1| GH21465p [Drosophila melanogaster]
 gi|23172714|gb|AAN14251.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
 gi|220945610|gb|ACL85348.1| PH4alphaNE3-PA [synthetic construct]
 gi|220955396|gb|ACL90241.1| PH4alphaNE3-PA [synthetic construct]
          Length = 481

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 80/243 (32%), Positives = 130/243 (53%), Gaps = 19/243 (7%)

Query: 25  ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLK 84
           + K  P +  + +P L     E Y+ LCR   +  P+   +L CRY      +L L PLK
Sbjct: 245 QFKANPYEAIDSSPKL----GEGYKRLCRSSFSPNPS---KLHCRYNSTTSAFLILAPLK 297

Query: 85  EEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLR 144
            EE  L+P I++Y D++ D +I  +  +A+P L        K  E+   N   ++S++  
Sbjct: 298 MEEISLEPHIVVYHDILPDKDIQQLITLAEPLL--------KPTEMFDDNKNEARSSYRT 349

Query: 145 EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTG 204
               P+++ +++R+  +TGL       + ++ YG G  Y  +YDF +   +   +S G G
Sbjct: 350 PLGGPLLDSLTQRMRDITGLQIRQGNPINIIKYGFGAPYTNYYDFFKKRNS---ESKGFG 406

Query: 205 NRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVL 263
           +R+AT +FY++D   GGATVF  LN+ +  E+G   FW+NL+  + D +  T HAACPV 
Sbjct: 407 DRMATFMFYLNDAPYGGATVFPRLNVKVPAERGKVLFWYNLNGDTHDMEPTTMHAACPVF 466

Query: 264 TGS 266
            GS
Sbjct: 467 HGS 469


>gi|195591304|ref|XP_002085382.1| GD14758 [Drosophila simulans]
 gi|194197391|gb|EDX10967.1| GD14758 [Drosophila simulans]
          Length = 509

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 79/225 (35%), Positives = 117/225 (52%), Gaps = 8/225 (3%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
           K E  CRG+   P     +L CRY      +L+L PLK E   +QP I+LY DV+Y++E 
Sbjct: 267 KLERGCRGEW--PRKSSPELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEF 324

Query: 107 DLIKKMA---QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
             ++ +A      +   T  ++          R+ K    +    P    I+RR+  M+G
Sbjct: 325 KSMRDIAMYNDSMIDGWTYVDFDKKGNPKQQDRVVKIISFQGTTAPFTLSINRRLADMSG 384

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEA--NAFKSLGTGNRVATVLFYMSDVAQGG 221
           L       L + NYG+GGH+  H D+    +   + F   G G+R+AT +FY SDV  GG
Sbjct: 385 LEMRENMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFG-GDRIATAVFYASDVPLGG 443

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            TVFT L +++ P+KG A  W NL+ +G+ D  T H+ CPV+ GS
Sbjct: 444 TTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGS 488


>gi|24666354|ref|NP_730347.1| CG32199 [Drosophila melanogaster]
 gi|23093193|gb|AAF49251.3| CG32199 [Drosophila melanogaster]
          Length = 509

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 78/225 (34%), Positives = 117/225 (52%), Gaps = 8/225 (3%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
           K E  CRG+   P     +L CRY      +L+L PLK E   +QP I+LY DV+Y++E 
Sbjct: 267 KLERGCRGEW--PKKSSPELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEF 324

Query: 107 DLIKKMAQ---PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
             ++ +A      +   T  ++          R+ K    +    P    I+RR+  M+G
Sbjct: 325 KSMRDIAMYNGSMIDGWTYVDFDKKGNPKQQDRVVKMIAFQGTTAPFTLSINRRMADMSG 384

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEA--NAFKSLGTGNRVATVLFYMSDVAQGG 221
           L       L + NYG+GGH+  H D+    +   + F   G G+R+AT L Y SD+  GG
Sbjct: 385 LEMRDNMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFG-GDRIATALIYASDIPLGG 443

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            TVFT L +++ P+KG+A  W NL+ +G+ D  T H+ CPV+ GS
Sbjct: 444 TTVFTKLKIAVQPKKGSALIWFNLNHAGEPDPLTEHSVCPVVLGS 488


>gi|198466405|ref|XP_001353987.2| GA16752 [Drosophila pseudoobscura pseudoobscura]
 gi|198150585|gb|EAL29723.2| GA16752 [Drosophila pseudoobscura pseudoobscura]
          Length = 510

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 87/237 (36%), Positives = 125/237 (52%), Gaps = 40/237 (16%)

Query: 52  CRGDL---TVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           CRG     + PP     L CRY      +L L PLK E    QP I+LY +V+Y+ E+  
Sbjct: 273 CRGQWQRKSSPP-----LACRYNREYSAFLLLAPLKMEVLNQQPLIVLYHEVLYEKELRA 327

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLR-EPEHPVIE-------------RI 154
           ++ +A    + AT+Q+  T        R+     ++ EPE  V++              I
Sbjct: 328 MRDIAN---KNATMQDGWT--------RMHSDQRVKPEPEDRVLKLHIFQGNSESFSPSI 376

Query: 155 SRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA----RPGEANAFKSLGTGNRVATV 210
           +RR+  MTGL       L + NYG+GG++  HYD+     RP  AN F   G G+ +ATV
Sbjct: 377 NRRIADMTGLEVQGNNALHLSNYGLGGYFNAHYDYVELTKRP--ANYFTEWG-GDVLATV 433

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           L Y SDV  GGA VF  L +S+ P+KG A  W NL+++G+ D  ++HA CPV+ GS+
Sbjct: 434 LLYASDVRLGGAVVFPKLKISVEPKKGNALIWDNLNNAGNPDKLSKHAVCPVVMGSH 490


>gi|20177086|gb|AAM12247.1| AT28279p [Drosophila melanogaster]
          Length = 509

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 78/225 (34%), Positives = 116/225 (51%), Gaps = 8/225 (3%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
           K E  CRG+   P     +L CRY      +L+L PLK E   +QP I+LY DV+Y++E 
Sbjct: 267 KLERGCRGEW--PKKSSPELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEF 324

Query: 107 DLIKKMAQ---PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
             ++ +A      +   T  ++          R+ K    +    P    I+RR+  M+G
Sbjct: 325 KSMRDIAMYNGSMIDGWTYVDFDKKGNPKQQDRVVKMIAFQGTTAPFTLSINRRMADMSG 384

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFARPGEA--NAFKSLGTGNRVATVLFYMSDVAQGG 221
           L       L + NYG+GGH+  H D+    +   + F   G G+R+AT L Y SD+  GG
Sbjct: 385 LEMRDNMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFG-GDRIATALIYASDIPLGG 443

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            TVFT L +++ P+KG A  W NL+ +G+ D  T H+ CPV+ GS
Sbjct: 444 TTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGS 488


>gi|195471732|ref|XP_002088156.1| GE14021 [Drosophila yakuba]
 gi|194174257|gb|EDW87868.1| GE14021 [Drosophila yakuba]
          Length = 265

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 114/218 (52%), Gaps = 16/218 (7%)

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           E  CRG+   PP    QL CRY     P++R+ PLKEEE   +P I LY DV+YDSEI  
Sbjct: 43  EQGCRGNF--PPH--PQLVCRYNSTTTPFMRIAPLKEEEISKEPLIWLYHDVIYDSEIAQ 98

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           +  + +  +   T  NY T +     + +  +    +    +   +  R+  ++GL    
Sbjct: 99  LTNLTREEMILGTTNNYTTPDRVNRLFHVKVT---NDDGGQLDRTLVNRMADISGLDMGN 155

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
              L  +NYG+GG+++ H D+       A   L T          +SDV  GGAT+F + 
Sbjct: 156 TTSLARINYGLGGYFQEHSDYVDIKLHPASSLLPTS---------ISDVPVGGATIFPAA 206

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            L++ P+KG+A FW+NLH++GD +  TRHA CP + GS
Sbjct: 207 KLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGS 244


>gi|194760358|ref|XP_001962408.1| GF14452 [Drosophila ananassae]
 gi|190616105|gb|EDV31629.1| GF14452 [Drosophila ananassae]
          Length = 498

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 77/217 (35%), Positives = 115/217 (52%), Gaps = 12/217 (5%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG+      +V    CRY     P++R+ PLKEEE    P I LY DV++DSE+ L+ K
Sbjct: 271 CRGEYPNQSRLV----CRYNTTTTPFMRIAPLKEEEISKDPLIWLYHDVLFDSEMALLTK 326

Query: 112 MAQPRLRRATVQNYKTGELE-IANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
                 R   +Q Y   +      YRI +          +   +  R+  ++GL      
Sbjct: 327 NLT---REEMIQGYTNNQTTPDKGYRIFQVKVYEGDGGKLDRTLVNRMTDISGLDVGNHT 383

Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATVLFYMSDVAQGGATVFTSLN 229
            L   NYG+G H++ H D+    E      LG+ G+R+ T LFY SDV  GGAT+F + N
Sbjct: 384 YLARANYGLGTHFQEHSDYVDLREN---PDLGSEGDRLFTFLFYASDVEMGGATIFPAAN 440

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +S+ P+KG+A FW+NLH+  + +  +RHA CP++ G+
Sbjct: 441 ISIKPKKGSALFWYNLHNDWEPNPLSRHAVCPMVLGN 477


>gi|195441323|ref|XP_002068462.1| GK20483 [Drosophila willistoni]
 gi|194164547|gb|EDW79448.1| GK20483 [Drosophila willistoni]
          Length = 550

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 115/218 (52%), Gaps = 9/218 (4%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI-DLIK 110
           CRG       +V    CRY     P+L+L P+K EE  L P I+ Y DV+ D+EI DL +
Sbjct: 312 CRGMFRQHTNLV----CRYNFTTSPFLQLAPMKLEEISLDPYIVQYHDVLSDNEIEDLKR 367

Query: 111 KMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAE 170
           +  +  +         +   E  +  I     +  P   +++RI+RR+  MTG     ++
Sbjct: 368 EGIKGTMINGWTSLKSSNATENESRTIVARVAIMSPSLEIVQRINRRIIDMTGFNIEESK 427

Query: 171 ELQVVNYGIGGHYEPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
            +Q+  + +GG + PHYD+   R  + +  K LG  +RVA+V+FY  DV +GGAT F   
Sbjct: 428 TIQLAAFSVGGFFMPHYDYLYDRLLDTDVLKKLG--DRVASVIFYAGDVTEGGATNFPRN 485

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            L + P+KG+A FW+N    G  D  + H+ CPV+ GS
Sbjct: 486 QLVVQPKKGSALFWYNKFDDGSPDPRSLHSICPVVVGS 523


>gi|148684485|gb|EDL16432.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III [Mus musculus]
          Length = 396

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 98/300 (32%), Positives = 147/300 (49%), Gaps = 24/300 (8%)

Query: 3   FPTHQRAQGNKLYYQEALNKSP-ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPA 61
           F  ++R   N L Y+  L ++  ++  E        P L+   R+ YE LC+   + P  
Sbjct: 98  FQDNKRMARNVLKYERLLAENGHQMAAETAIQRPNVPHLQT--RDTYEGLCQTLGSQPTH 155

Query: 62  I-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
             +  L C Y   + PYL L P ++E  +L+P I LY D + D E   I+++A+P L+R+
Sbjct: 156 YQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRS 215

Query: 121 TVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNY 177
            V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQVVNY
Sbjct: 216 VV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNY 272

Query: 178 GIGGHYEPHYDFARPGEANAFKSLGTGNRVATV-------LFYMSDVAQGGATVFTSLNL 230
           GIGGHYEPH+D A     +   S+  G   A +       +  +S V  GGAT F   N 
Sbjct: 273 GIGGHYEPHFDHATVTMGSMLSSVEAGGATAFIYGNFSVPVVKLSSVEAGGATAFIYGNF 332

Query: 231 SL----WPEKGTAAFWHNLHSSGDGDYYTRHAACPVLT---GSNSLHSTCPCGLRRGLQR 283
           S+    WP  G+ +   N          T   A   L+   G+ ++    P  L+ GLQ+
Sbjct: 333 SVPVVKWPTSGSTSMDRNSEDPAAPTLKTETLADGSLSEKPGAKAMGRGEPTLLKEGLQQ 392


>gi|194751829|ref|XP_001958226.1| GF23628 [Drosophila ananassae]
 gi|190625508|gb|EDV41032.1| GF23628 [Drosophila ananassae]
          Length = 484

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 81/272 (29%), Positives = 136/272 (50%), Gaps = 33/272 (12%)

Query: 11  GNKLYYQEALNKSPELK-DEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQ---- 65
            N+  Y++A NK   LK  EP +++ V  +L    +E+         +  P+++A     
Sbjct: 205 NNETIYKQASNK---LKVSEPKEIDKVVYSLLTQWKEESHNATN---STEPSLIAHYTGC 258

Query: 66  ---------LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
                    L CRY     P+L+L PLK EE  L P I+LY +V+ D EI+ +K +    
Sbjct: 259 RNQFPKQNNLVCRYNATTTPFLKLAPLKLEEVSLDPYIVLYHNVISDREIEEMKGL---- 314

Query: 117 LRRATVQNYKTGELEIANYR--ISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQV 174
                +     G  ++   R  +S+  WL + E    +R++ R+  +TG        LQ+
Sbjct: 315 -----IDEMDNGWTDLNESREIVSRLVWLTK-ESRFRKRLNLRIRDITGFNVDEIRGLQI 368

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWP 234
            N+G+GG ++PHYD+          ++  G+R+A+++FY+ DV  GG TVF  + +++ P
Sbjct: 369 ANFGVGGQFKPHYDYFTERILRLNNTI-LGDRIASIIFYVGDVVHGGQTVFPDIQIAVKP 427

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           +KG++ FW N       D  + H+ CPVL G 
Sbjct: 428 QKGSSLFWFNTFDDATPDPRSLHSVCPVLIGD 459


>gi|194765144|ref|XP_001964687.1| GF22917 [Drosophila ananassae]
 gi|190614959|gb|EDV30483.1| GF22917 [Drosophila ananassae]
          Length = 529

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 78/241 (32%), Positives = 124/241 (51%), Gaps = 16/241 (6%)

Query: 35  NVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRI 94
           +V+  +  T    Y+  CR   T  P    +L CRY     P+L++ PLK EE  L P I
Sbjct: 265 DVSRDIYETLSNNYQATCRSSHTPNPT---RLHCRYNSTTTPFLKIAPLKMEEISLDPYI 321

Query: 95  ILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE--------P 146
           ++Y DV+ D +I  + ++++ +L  A V +       +  +R +  +WL +        P
Sbjct: 322 VVYHDVLPDGDISEVLRLSETKLEPAQVVSTPRTSNNV-KFRTALGSWLPDYEEVVKGPP 380

Query: 147 EHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNR 206
           + P+  R+   +  +TGL     +  QV+ Y  G HY  H+D+      +   ++  G+R
Sbjct: 381 KGPLYGRLRNILRDVTGLVIWDYQFFQVLKYQFGAHYAQHHDYFN---MSLKSTVLQGDR 437

Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTG 265
           +ATVLFY++D   GGATVF  LN+ +  EKG   FW+NL   + D D  T H ACP+  G
Sbjct: 438 IATVLFYLNDAPHGGATVFPMLNVKVPAEKGKILFWYNLKGETHDFDEKTLHGACPIFHG 497

Query: 266 S 266
           +
Sbjct: 498 T 498


>gi|195591298|ref|XP_002085379.1| GD14755 [Drosophila simulans]
 gi|194197388|gb|EDX10964.1| GD14755 [Drosophila simulans]
          Length = 515

 Score =  130 bits (327), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 68/207 (32%), Positives = 114/207 (55%), Gaps = 16/207 (7%)

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           + L CRY      +L+L PLK EE    P I+L+ +++ D EI+ +K           + 
Sbjct: 295 SNLVCRYNSSTNAFLQLAPLKMEEVSRDPYIVLFHEMISDKEIEEMK---------GEIT 345

Query: 124 NYKTGELEIANYR--ISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGG 181
             + G   + + +  +S+  W+R+ E    +RI++R+  MTG        +Q+ N+G+GG
Sbjct: 346 EMENGWTSLGDSKEIVSRVYWIRK-ESSFSKRINQRISDMTGFKLEEFPAIQLANFGVGG 404

Query: 182 HYEPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
           +++PHYD+   R  E +   +LG  +R+ +++FY  +V+QGG TVF  L +++ P+KG A
Sbjct: 405 YFKPHYDYYTDRLKEVDVNNTLG--DRIGSIIFYAGEVSQGGQTVFPDLKVAVEPKKGNA 462

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
            FW N       D  T H+ CPV+ GS
Sbjct: 463 LFWFNAFDDSSPDPRTLHSVCPVIVGS 489


>gi|386771382|ref|NP_649044.3| CG18233 [Drosophila melanogaster]
 gi|383291998|gb|AAF49254.3| CG18233 [Drosophila melanogaster]
          Length = 515

 Score =  130 bits (326), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 72/219 (32%), Positives = 118/219 (53%), Gaps = 20/219 (9%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG    P    + L CRY      +L+L PLK EE    P I+++ +V+ D +I+ +K 
Sbjct: 287 CRG--LFPKK--SNLVCRYNSSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKDIEEMK- 341

Query: 112 MAQPRLRRATVQNYKTGELEIANYR--ISKSAWLREPEHPVIERISRRVEHMTGLTTSTA 169
                     +   + G   + + +  +S+  W+R+ E    +RI++R+  MTG      
Sbjct: 342 --------GEITEMENGWTSLGDPKEIVSRVYWIRK-ESSFSKRINQRISDMTGFKLEEF 392

Query: 170 EELQVVNYGIGGHYEPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
             +Q+ N+G+GG+++PHYDF   R  E +   +LG  +R+ +++FY  +V+QGG TVF  
Sbjct: 393 PAIQLANFGVGGYFKPHYDFYTDRLKEVDVNNTLG--DRIGSIIFYAGEVSQGGQTVFPD 450

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           L +++ P+KG A FW N       D  + H+ CPVL GS
Sbjct: 451 LKVAVEPKKGNALFWFNAFDDSTPDPRSLHSVCPVLVGS 489


>gi|443705944|gb|ELU02240.1| hypothetical protein CAPTEDRAFT_227850 [Capitella teleta]
          Length = 475

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 85/226 (37%), Positives = 119/226 (52%), Gaps = 20/226 (8%)

Query: 84  KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
           K E  +  P I L+ D + DSEI  +K MA+P+ + + V +   GE      R+S +A++
Sbjct: 176 KTELLHANPEIYLFHDFISDSEIQRLKDMAEPQFQSSAVLDDTGGESFFDVSRLSSTAFV 235

Query: 144 REPEHPVIERISRRVEHMTGLTT------STAEELQVVNYGIGGHYEPHYDFARPGEANA 197
            +  + ++  ++RRV  +TGL T      S +E LQV+ YG GG Y PHYD     EA+ 
Sbjct: 236 ND-SNDLVASLNRRVSKLTGLQTEVLDSFSESESLQVLRYGPGGLYTPHYD-TLGSEADL 293

Query: 198 FKSLG-TGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTR 256
              +  TG+R+AT + Y+     GGATVF  L +S+  +KG AAFW NLH  G  D  T 
Sbjct: 294 PPYIQHTGDRIATFILYLDIATAGGATVFPLLPMSIPIQKGAAAFWFNLHPDGSLDRRTL 353

Query: 257 HAACPVLTGS-----------NSLHSTCPCGLRRGLQRSGIICTLV 291
           HAACPV+ G+            S H     G RR      IIC L+
Sbjct: 354 HAACPVIRGTKWECVIVSNDMTSDHEMFTVGKRRTEIVRLIICILL 399


>gi|347966278|ref|XP_003435891.1| AGAP013377-PA [Anopheles gambiae str. PEST]
 gi|333470133|gb|EGK97522.1| AGAP013377-PA [Anopheles gambiae str. PEST]
          Length = 290

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 70/217 (32%), Positives = 124/217 (57%), Gaps = 5/217 (2%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           + Y  LCRG    PP++ + L C Y  RN   + + P K E     P + L+ + ++D E
Sbjct: 46  DPYMDLCRGVYVPPPSLTSSLYCWYDVRNAHSV-ISPSKVEALSNDPFVALFHEFVHDGE 104

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           I  ++ +    ++++   N     +   N+   ++  L + +HPV+ER+++R+E  TGL+
Sbjct: 105 IAQLQALGSMHIKQSGPSNDSWLPVFYENH---QTYTLHDRDHPVVERLTKRIERRTGLS 161

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
             TAE+L+V+   +G       D     E +A +    G+R+AT+LF++SDV  GG T+F
Sbjct: 162 CDTAEDLKVIYNEVGAFKTAALDAIHKKE-DAQRFAYAGDRLATMLFFLSDVTNGGYTIF 220

Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPV 262
             L +++ P+KGTAAFW+NL  +G+G+   +++ CP+
Sbjct: 221 PKLRVAIRPQKGTAAFWYNLKDTGEGNVQMKYSICPL 257


>gi|195145314|ref|XP_002013641.1| GL24244 [Drosophila persimilis]
 gi|194102584|gb|EDW24627.1| GL24244 [Drosophila persimilis]
          Length = 496

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 77/225 (34%), Positives = 119/225 (52%), Gaps = 11/225 (4%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C+G   +P  + + L+C Y      +LRL PL+ E     P + LY +V+  +E   +  
Sbjct: 271 CQGRSRLP--VQSSLRCHYSAEGSAFLRLAPLRMELLSRDPLVALYHEVVSAAEQRHLML 328

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +++ +L+R     Y          R   SA +     P +E++ RR+E +TGL  + +E 
Sbjct: 329 LSESQLQRQRGHQYD-------KIRTFASASVAANATPTVEQLHRRLEDITGLDLAESEP 381

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           L+++NYGIGG Y  H D  +P      +      R+ATVL Y+SDV  GG T F +L L 
Sbjct: 382 LRILNYGIGGQYYIHVDCEQP--QTHVEPYPKEYRLATVLLYLSDVRLGGFTSFPALGLG 439

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCG 276
           + P +G+A  WHN +++G+ DY   HAACPVL G+  + S    G
Sbjct: 440 IRPNRGSALVWHNANNAGNCDYRALHAACPVLLGTRWVASKWISG 484


>gi|195352176|ref|XP_002042590.1| GM14977 [Drosophila sechellia]
 gi|194124474|gb|EDW46517.1| GM14977 [Drosophila sechellia]
          Length = 485

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 70/201 (34%), Positives = 105/201 (52%), Gaps = 14/201 (6%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
           L C Y      +LR+ PLK E   L+P I+LY DV+Y+SEI  IK ++ P L+       
Sbjct: 290 LSCHYEQNTSEFLRIAPLKVETLSLKPHIVLYHDVIYESEISKIKNISLPSLKSPL---- 345

Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
               ++  +Y +  +    +P+ P    +S R++ MTG       + Q+ NYGI G    
Sbjct: 346 --RIIDAVDYNLKLAQIREDPQSP----LSLRIKDMTGEDVKEDTDFQIDNYGICGFRNF 399

Query: 186 HYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL 245
           H D     +  A      G+R+ ++LF+M+DV QGGA  F +LNL++WP KG+A  W NL
Sbjct: 400 HTDNIEIQDQTA----ELGDRLTSILFFMNDVVQGGAFAFPNLNLTIWPHKGSALVWRNL 455

Query: 246 HSSGDGDYYTRHAACPVLTGS 266
                 +    H +CPV+ GS
Sbjct: 456 DHRMQPNKDLLHVSCPVVVGS 476


>gi|195494561|ref|XP_002094890.1| GE19962 [Drosophila yakuba]
 gi|194180991|gb|EDW94602.1| GE19962 [Drosophila yakuba]
          Length = 539

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/220 (34%), Positives = 114/220 (51%), Gaps = 8/220 (3%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG+   PP    +L CRY      +L+L PLK E   +QP I+LY DV+Y++E   ++ 
Sbjct: 302 CRGEW--PPKSSPELICRYNRDTSAFLKLAPLKLEILSVQPVILLYHDVLYENEFKSMRD 359

Query: 112 MA---QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
            A      +   T  ++          R+ K+   +    P    I+RR+ +M+GL    
Sbjct: 360 AAIFNASMIDGWTYYDFDQKGNPKWQDRVVKTIGFQGTTAPFTLSINRRLGYMSGLEMRE 419

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEA--NAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
              L + NYG+GG++  H+D+    +   N F   G G+ +AT + Y SDV  GG TVF+
Sbjct: 420 NMMLYLTNYGLGGNFRKHFDYVELAKRPPNFFADSG-GDHIATAVLYASDVPLGGTTVFS 478

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            L L++ P+KG A  W NL+  G  D  T H+ CPV+ GS
Sbjct: 479 KLKLAVQPKKGNALVWFNLNHDGKPDPLTEHSVCPVVLGS 518


>gi|242001766|ref|XP_002435526.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215498862|gb|EEC08356.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 559

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 74/203 (36%), Positives = 114/203 (56%), Gaps = 12/203 (5%)

Query: 44  EREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           E   Y  LCRG++   P + ++L+CRY      +  L P+K EE  L+P II+ RDV+ +
Sbjct: 291 EELNYRRLCRGEVLRTPQMDSKLRCRYYKGQDGFFTLHPIKLEEINLKPYIIVMRDVVQE 350

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTG 163
            +I+ +   A+PRL+R+T   Y       +  R S +AWL + E P+  R++  +  + G
Sbjct: 351 RDIEDLMAFAEPRLQRSTT--YTGDGNAPSTRRTSSNAWLWDDEAPIANRMNWYLRALVG 408

Query: 164 LTT----STAEELQVVNYGIGGHYEPHYDF------ARPGEANAFKSLGTGNRVATVLFY 213
           L T      AE  Q+ NYG GG++ PHYD+      A    A+ +     G+R+AT++ Y
Sbjct: 409 LGTLGSEYEAEAYQLANYGSGGYFLPHYDYLQDTLHAHNSTADYYLQNNEGDRLATLMIY 468

Query: 214 MSDVAQGGATVFTSLNLSLWPEK 236
           M+DV +GGATVF  L + L P+K
Sbjct: 469 MTDVEEGGATVFPRLGVRLVPKK 491


>gi|195113247|ref|XP_002001179.1| GI22114 [Drosophila mojavensis]
 gi|193917773|gb|EDW16640.1| GI22114 [Drosophila mojavensis]
          Length = 487

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 83/274 (30%), Positives = 144/274 (52%), Gaps = 27/274 (9%)

Query: 9   AQGNKLYYQEALNKSPELKDEPPK---------VNNVA--PTLEVTEREKYEMLCRGDLT 57
           A G++   + AL + P L+D+  +         V N+   P L++ ++E  E      + 
Sbjct: 215 AAGDEELSRAALLEEPSLRDQVEQFLLDYRNYNVTNIEDHPYLDIMDKEFIEFCGSSYMP 274

Query: 58  VPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRL 117
            P  +V    C Y  +   +L L P K E     P ++++ DV+Y+SEI+ + ++++P L
Sbjct: 275 QPTRLV----CSYKTKPSKFLYLAPFKMELLSEDPYMVVFHDVIYESEIEHLNRISKPFL 330

Query: 118 RRATVQNYKTGELEIANYRISKSAWL-REPEHP----VIERISRRVEHMTGLTTSTAEEL 172
           +RATV      E  +  +R +  A+L R+   P    ++ERI +R+  M+ L  +  +  
Sbjct: 331 QRATVVVEDNSEDTLIKFRTANGAFLYRDKISPKDVQLVERIFQRMRDMSDLQIND-DAF 389

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           + + Y  GGHY+ H D+      N      T +R AT + Y++DVA+GGATVF  + +++
Sbjct: 390 EYLKYDFGGHYDIHADYF-----NYTDDQFTDDRFATFVIYLNDVARGGATVFPDVEIAV 444

Query: 233 WPEKGTAAFWHNLH-SSGDGDYYTRHAACPVLTG 265
            PE+G    W+N++  S D + ++ H ACPVL G
Sbjct: 445 HPERGKVIHWYNMNPKSFDYELHSYHGACPVLIG 478


>gi|221512810|ref|NP_649043.3| CG18234 [Drosophila melanogaster]
 gi|66771545|gb|AAY55084.1| IP12246p [Drosophila melanogaster]
 gi|220902636|gb|AAF49255.4| CG18234 [Drosophila melanogaster]
          Length = 515

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 73/201 (36%), Positives = 106/201 (52%), Gaps = 14/201 (6%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
           L C Y      +LR+ PLK E   L+P I+LY DV+YDSEI  +K ++ P L+      Y
Sbjct: 290 LSCHYEKNTSEFLRIAPLKVETLSLKPHIVLYHDVIYDSEISKVKNISLPSLKSPLRILY 349

Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
                   +Y + K A +RE        +S R++ MTG       + Q+ NYGI G    
Sbjct: 350 AI------DYNL-KFAKIREDHQ---SPLSLRIKDMTGEDVQEDTDFQIDNYGICGFRNF 399

Query: 186 HYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL 245
           H D     +  A      G+R+ +++F+M+DVAQGGA  F +LNL++WP+KG+A  W NL
Sbjct: 400 HTDNIELQDQTA----ELGDRLTSIMFFMNDVAQGGALAFPNLNLTIWPQKGSALVWRNL 455

Query: 246 HSSGDGDYYTRHAACPVLTGS 266
                 +    H +CPV+ GS
Sbjct: 456 DHRMQPNQDLLHVSCPVVVGS 476


>gi|195128347|ref|XP_002008625.1| GI13597 [Drosophila mojavensis]
 gi|193920234|gb|EDW19101.1| GI13597 [Drosophila mojavensis]
          Length = 457

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 77/225 (34%), Positives = 108/225 (48%), Gaps = 42/225 (18%)

Query: 49  EMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDL 108
           E  CRG    P      L CRYV+ N  YL+L P+K E+  L+P + LY DV+YDSEI  
Sbjct: 260 ERACRG--LWPERKTDHLSCRYVYENSAYLKLAPMKLEQLSLEPVVQLYHDVLYDSEIKA 317

Query: 109 IKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST 168
           IK M+ P  +   V+                              I++RV  MTG     
Sbjct: 318 IKNMSVPEAKAKRVE----------------------------LNINQRVADMTGYGMME 349

Query: 169 AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL 228
             +L V+N+ +G   +     AR             +R+AT++FY +DVA GGAT+F  L
Sbjct: 350 HNKLHVLNFALGQGADTKSCKAR------------ADRIATIVFYANDVAIGGATIFPKL 397

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTC 273
            L + P +GTA  W+NL++ G  D   +HA CPV+ GS    + C
Sbjct: 398 RLLVQPRRGTALLWYNLNADGAADPLAKHAVCPVVLGSRWAITKC 442


>gi|195494570|ref|XP_002094894.1| GE19958 [Drosophila yakuba]
 gi|194180995|gb|EDW94606.1| GE19958 [Drosophila yakuba]
          Length = 498

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 73/202 (36%), Positives = 107/202 (52%), Gaps = 14/202 (6%)

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
            L C Y       LR+ PLK E   L+P I+LY DV+YDSEI  +K ++ P L+      
Sbjct: 290 NLSCHYEKHTSDLLRIAPLKVETLSLKPHIVLYHDVIYDSEISKVKNISLPSLKSP---- 345

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
            +    E  N +++K   + E  H     ++ R++ MTG       + Q+ NYGI G   
Sbjct: 346 LRILHAEDHNLKLAK---ISEDYHS---PLNLRIKDMTGEDVKEDTDFQIDNYGICGFRY 399

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
            H D     +  A      G+R+ +++F+M+DVAQGGA VF  LNL++WP+KG+A  W N
Sbjct: 400 YHTDNLESQDQTA----ELGDRLTSIMFFMNDVAQGGAFVFLHLNLTIWPQKGSALVWRN 455

Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
           L      +    HA+CPV+ GS
Sbjct: 456 LDHRMQPNEDLLHASCPVIVGS 477


>gi|195591296|ref|XP_002085378.1| GD14754 [Drosophila simulans]
 gi|194197387|gb|EDX10963.1| GD14754 [Drosophila simulans]
          Length = 508

 Score =  127 bits (318), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 69/202 (34%), Positives = 105/202 (51%), Gaps = 14/202 (6%)

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
            L C Y      +LR+ PLK E   L+P I+LY DV+YDSEI  +K ++ P L+      
Sbjct: 288 HLSCHYEQNTSEFLRIAPLKVETLSLKPHIVLYHDVIYDSEISKVKNISLPSLKSPL--- 344

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
                ++  +Y +  +    + + P    +S R++ MTG       + Q+ NYGI G   
Sbjct: 345 ---RIIDAVDYNLKLAQIRDDHQSP----LSLRIKDMTGEDVQEDSDFQIDNYGICGFRN 397

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
            H D     +  A      G+R+ ++LF+M+DV QGGA  F +LNL++WP+KG+A  W N
Sbjct: 398 FHTDNIEMQDQTA----ELGDRLTSILFFMTDVVQGGAFAFPNLNLTIWPQKGSALVWRN 453

Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
           L      +    H +CPV+ GS
Sbjct: 454 LDHRMQPNKDLLHVSCPVVVGS 475


>gi|242003035|ref|XP_002436120.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215499456|gb|EEC08950.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 173

 Score =  127 bits (318), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 67/143 (46%), Positives = 86/143 (60%), Gaps = 14/143 (9%)

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
           +AWL +  HPV++++SRR+   TGL+TS+AE LQVVNYG+GGHY PH+DF+   +     
Sbjct: 3   AAWLSDHHHPVVKKLSRRIAAATGLSTSSAEHLQVVNYGVGGHYSPHFDFSTKDKPLRGW 62

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL-------------H 246
               G R AT L Y+S V +GGAT+F  L + + PE G A FWHNL             H
Sbjct: 63  ETFAGQRQATWLVYLSSVERGGATLFKRLRVRVQPEAGMALFWHNLPPGSTNSLPSCCVH 122

Query: 247 SSGDGDYYTRHAACPVLTGSNSL 269
            S  GD  T H ACPVL GS  +
Sbjct: 123 RS-VGDERTEHGACPVLVGSKWI 144


>gi|195494568|ref|XP_002094893.1| GE19959 [Drosophila yakuba]
 gi|194180994|gb|EDW94605.1| GE19959 [Drosophila yakuba]
          Length = 486

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 118/217 (54%), Gaps = 16/217 (7%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG       +V    CRY      +L+L PLK EE    P I+++ +V+ D EI+ +K 
Sbjct: 239 CRGLFPRKTNLV----CRYNSSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKEIEEMKG 294

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
             +       ++N  TG LE     +S   W+RE E    +RI++R+  MTG        
Sbjct: 295 DIRE------MENGWTG-LEDPKEIVSSVYWIRE-ETSFSKRINQRISDMTGFKLEEFVA 346

Query: 172 LQVVNYGIGGHYEPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN 229
           +Q+ N+G+GG+++PH+D+   R    +A  +LG  +R+A+++FY  +V+QGG TVF  L 
Sbjct: 347 IQLANFGVGGYFKPHFDYYTERLRGVDANNTLG--DRIASIIFYAGEVSQGGQTVFPDLK 404

Query: 230 LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + + P++G A FW N       D  + H+ CPV+ GS
Sbjct: 405 VVVEPKRGNALFWFNKLDDSSPDPRSLHSVCPVIVGS 441


>gi|194871364|ref|XP_001972834.1| GG13661 [Drosophila erecta]
 gi|190654617|gb|EDV51860.1| GG13661 [Drosophila erecta]
          Length = 506

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 73/201 (36%), Positives = 110/201 (54%), Gaps = 17/201 (8%)

Query: 68  CRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT-VQNYK 126
           C Y      +LR+ PLK E   ++P I+LY DV+YDSEI  +K ++ P LR  + +   +
Sbjct: 293 CHYEKNTSDFLRIAPLKVETLSVKPHIVLYHDVIYDSEISKVKNISLPSLRSPSRILRAE 352

Query: 127 TGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPH 186
              L++A  R        +P  P    +S R++ MTG       +LQ+ NYGI G    H
Sbjct: 353 DHNLKLAKIR-------EDPRSP----LSLRIKDMTGEDVEEDTDLQIENYGICGFRFYH 401

Query: 187 YDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL- 245
            D     +  A      G+R+ ++LF+M+DVA GGA VF + NL+++P+KG+A  W NL 
Sbjct: 402 NDNLESQDQTA----KLGDRLTSILFFMNDVALGGAFVFLNANLTIFPQKGSALVWRNLD 457

Query: 246 HSSGDGDYYTRHAACPVLTGS 266
           HS    +   +H +CPV+ GS
Sbjct: 458 HSLQPKEDLLQHLSCPVIVGS 478


>gi|196011906|ref|XP_002115816.1| hypothetical protein TRIADDRAFT_59903 [Trichoplax adhaerens]
 gi|190581592|gb|EDV21668.1| hypothetical protein TRIADDRAFT_59903 [Trichoplax adhaerens]
          Length = 444

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 73/194 (37%), Positives = 106/194 (54%), Gaps = 11/194 (5%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           Y  LCR       ++   LKC Y +++ P L   P+  EE    P I LY D++   E +
Sbjct: 238 YTKLCRSHKNYQTSLNNGLKCYYFNQS-PLLHFNPVAVEEISYSPVIRLYHDIISHQEAE 296

Query: 108 LIKKMAQPRLR--RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           ++K ++  +L   R  VQ           YR +K AWL + ++ V+ R+S   E +TGL 
Sbjct: 297 ILKNISSKKLTVARTFVQIMPNNSEAEGEYRFAKHAWLGDIDNQVVRRLSVLSEELTGLD 356

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN-RVATVLFYMSDVAQGGATV 224
            S AE+LQV NYG+GGHY PHYD A   +        TG  R+AT++FY+SDV  GGATV
Sbjct: 357 LSYAEKLQVANYGVGGHYSPHYDSASIDD-------DTGKPRLATIMFYLSDVDIGGATV 409

Query: 225 FTSLNLSLWPEKGT 238
           F  +  +++P K +
Sbjct: 410 FPDIGKAIFPRKTS 423


>gi|195156517|ref|XP_002019146.1| GL25581 [Drosophila persimilis]
 gi|194115299|gb|EDW37342.1| GL25581 [Drosophila persimilis]
          Length = 206

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 73/202 (36%), Positives = 104/202 (51%), Gaps = 22/202 (10%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID-LIKKMAQPRLRRATVQN 124
           L CRY H   P+LRL PLKEEE    P I LY DV+YDSE + L   + +  + +    N
Sbjct: 2   LVCRYNHTTTPFLRLAPLKEEEVSRDPLIWLYHDVLYDSEFEQLTVNLTRAEMVQGYTDN 61

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
           Y T E E    RI            +   +  R+  ++GL T    +L  VNYG+G H+ 
Sbjct: 62  YTTTEKE----RIFYVNIFEGSGEKLDRDLVNRMADISGLLTGEHTQLGTVNYGLGSHFP 117

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
            H D++   +AN                 M+DV  GGAT+F  +NL++ P+KG+A FW+N
Sbjct: 118 EHGDYSDI-KANP----------------MTDVPLGGATIFPKINLTIQPKKGSALFWYN 160

Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
           +H+  +    TRHA CP + G+
Sbjct: 161 IHNDWEPHVLTRHAVCPTIEGN 182


>gi|198428011|ref|XP_002120302.1| PREDICTED: similar to prolyl 4-hydroxylase alpha-2 subunit, partial
           [Ciona intestinalis]
          Length = 233

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 74/204 (36%), Positives = 107/204 (52%), Gaps = 11/204 (5%)

Query: 65  QLKCRYVHRNV--PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV 122
           +LKC Y H     P L + P+K EE    P ++ + DV+ D + + I ++A P + R+ V
Sbjct: 7   KLKC-YFHNGWKNPRLLIQPIKSEELCDSPHVVRFYDVLSDRDSEEIIRLAAPLMFRSGV 65

Query: 123 QNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGH 182
                   +    R+ K+AWL     PV+     RV  +TGL       LQV NYGIGGH
Sbjct: 66  TGDDGAINDNPMERVGKNAWL--DNSPVVNNFMTRVADITGLNVGAEIYLQVANYGIGGH 123

Query: 183 YEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
           ++PH D     E   ++++    R+AT L Y SDV  GG T F    +   P KG+A FW
Sbjct: 124 FDPHID-----ETGGYENI-MERRIATFLTYFSDVEYGGNTPFVYQEVVAEPIKGSAIFW 177

Query: 243 HNLHSSGDGDYYTRHAACPVLTGS 266
           +++ + G  D  T HAACPV+ G+
Sbjct: 178 YDVFNDGSADERTEHAACPVVLGN 201


>gi|198471971|ref|XP_002133305.1| GA28042 [Drosophila pseudoobscura pseudoobscura]
 gi|198139547|gb|EDY70707.1| GA28042 [Drosophila pseudoobscura pseudoobscura]
          Length = 203

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 73/202 (36%), Positives = 104/202 (51%), Gaps = 22/202 (10%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID-LIKKMAQPRLRRATVQN 124
           L CRY H   P+LRL PLKEEE    P I LY DV+YDSE + L   + +  + +    N
Sbjct: 2   LVCRYNHTTTPFLRLAPLKEEEVSRDPLIWLYHDVLYDSEFEQLTVNLTRAEMVQGYTDN 61

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
           Y T E E    RI            +   +  R+  ++GL T    +L  VNYG+G H+ 
Sbjct: 62  YTTTEKE----RIFYVNIFEGSGEKLDRDLVNRMADISGLLTGEHTQLGTVNYGLGSHFP 117

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
            H D++   +AN                 M+DV  GGAT+F  +NL++ P+KG+A FW+N
Sbjct: 118 EHGDYSDI-KANP----------------MTDVPLGGATIFPKINLTIQPKKGSALFWYN 160

Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
           +H+  +    TRHA CP + G+
Sbjct: 161 IHNDWEPHVLTRHAVCPTIEGN 182


>gi|321466285|gb|EFX77281.1| hypothetical protein DAPPUDRAFT_106233 [Daphnia pulex]
          Length = 128

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 63/117 (53%), Positives = 77/117 (65%)

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
           STAE LQ VNYGIG HYEPH+D+AR     AFK LG GNR+AT LFYMSDV  G ATVF 
Sbjct: 2   STAEVLQFVNYGIGWHYEPHFDYARKETTEAFKELGWGNRIATCLFYMSDVEAGSATVFP 61

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQR 283
               ++WP KG+AAF +NL+ +  G+ +TRHA  PV+  S  + +T     RR   R
Sbjct: 62  PTGAAVWPRKGSAAFCYNLYPNDKGNEFTRHATFPVIFLSKWVSNTWIHEHRREFHR 118


>gi|194871344|ref|XP_001972830.1| GG13666 [Drosophila erecta]
 gi|190654613|gb|EDV51856.1| GG13666 [Drosophila erecta]
          Length = 539

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 114/227 (50%), Gaps = 12/227 (5%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
           K E  CRG+   P     +L CRY      +L+L PLK E   +QP I LY DV+Y+ E 
Sbjct: 297 KLERGCRGEW--PKKSSPELICRYSRDTSAFLKLAPLKLEFLSVQPMIHLYHDVLYEKEF 354

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEI---ANYRISKSAWLREPEHPVIERISRRVEHMTG 163
             ++ +A         + Y     +I      R+ K    ++   P    I+RR+  M+G
Sbjct: 355 KSMRDVAVFNATMIDGRTYFDFHKKIKPKTQDRVVKMIDFKDTTAPYTLSINRRIADMSG 414

Query: 164 LTTSTAEELQVVNYGIGGHYEPHYDFA----RPGEANAFKSLGTGNRVATVLFYMSDVAQ 219
           L       L + NYG+GG +  H D+     RP +   F +   G+R+AT + Y SDV  
Sbjct: 415 LEMRENMVLYLSNYGLGGDFGKHVDYVELAKRPSD---FFADFKGDRIATAVLYASDVPL 471

Query: 220 GGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           GG TVF  L +++ P+KG A  W NL+ +G+ D  T H+ CP++ GS
Sbjct: 472 GGTTVFPKLKIAVQPKKGNALVWFNLNHAGEPDPLTEHSVCPIVLGS 518


>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
 gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
          Length = 454

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 5/177 (2%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR+ L++ ++ D+E D +  +A+ RL R+ V N  TG+  +   R S  A  +  EHP+I
Sbjct: 132 PRVTLFQQLLTDAECDALVALARGRLARSPVINPDTGDENLIEARTSLGAMFQVGEHPLI 191

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
           ERI   +  +TG+     E LQ++NY  GG Y+PHYDF    RPGEA   K    G RV 
Sbjct: 192 ERIEDCIAAVTGIAAERGEGLQILNYKPGGEYQPHYDFFNPQRPGEARQLKV--GGQRVG 249

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           T++ Y++    GGAT F  L L + P KG A ++    S G  D  T HA  PV  G
Sbjct: 250 TLVIYLNSPLAGGATAFPKLGLEVAPVKGNAVYFSYRKSDGALDERTLHAGLPVEAG 306


>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CMR15]
          Length = 289

 Score =  124 bits (310), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 67/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L++  + D E D +  + + RL+R+ V N +TGE  + + R S+ A  +  EHP+I
Sbjct: 97  PRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
            RI  R+   TG+     E  QV++Y  GG Y+PH+D+  PG +   + L   G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y++ V  GGAT F  L L + P KG A F+      G  D  T HA  PV  G  
Sbjct: 217 VIYLNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDKTLHAGLPVERGEK 273


>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
 gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Ralstonia solanacearum GMI1000]
          Length = 289

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L++  + D E D +  + + RL+R+ V N +TGE  + + R S+ A  +  EHP++
Sbjct: 97  PRIVLFQHFLSDEECDQLIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLV 156

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
            RI  R+   TG+     E  QV++Y  GG Y+PH+D+  PG +   + L   G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y++ V  GGAT F  L L + P KG A F+      G  D  T HA  PV  G  
Sbjct: 217 VIYLNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 273


>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
 gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
          Length = 319

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 68/177 (38%), Positives = 99/177 (55%), Gaps = 5/177 (2%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI L++ ++   E + +  +++ RL R+ V N  TG+  + + R S  A  +  EHP+I
Sbjct: 127 PRIALFQRLLMPDECEALIALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVGEHPLI 186

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
           ER+  R+  +TG+     E LQ++NY  G  Y+PHYDF    RPGEA   +    G R+A
Sbjct: 187 ERLEARIAAVTGVPVEHGEGLQILNYKPGAEYQPHYDFFNPQRPGEARQLRV--GGQRMA 244

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           T++ Y++DV  GGAT F  L L + P +G A F+  L   G  D  T HA  PV  G
Sbjct: 245 TLVIYLNDVPAGGATAFPKLGLRVNPVQGNAVFFAYLGEDGSLDERTLHAGLPVEQG 301


>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
           bacterium R229]
          Length = 289

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 67/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L++  + D E D +  + + RL+R+ V N +TGE  + + R S+ A  +  EHP+I
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
            RI  R+   TG+     E  QV++Y  GG Y+PH+D+  PG +   + L   G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y++ V  GGAT F  L L + P KG A F+      G  D  T HA  PV  G  
Sbjct: 217 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 273


>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
 gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           PSI07]
          Length = 289

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 67/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L++  + D E D +  + + RL+R+ V N +TGE  + + R S+ A  +  EHP+I
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
            RI  R+   TG+     E  QV++Y  GG Y+PH+D+  PG +   + L   G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y++ V  GGAT F  L L + P KG A F+      G  D  T HA  PV  G  
Sbjct: 217 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 273


>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
          Length = 289

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 67/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L++  + D E D +  + + RL+R+ V N +TGE  + + R S+ A  +  EHP+I
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
            RI  R+   TG+     E  QV++Y  GG Y+PH+D+  PG +   + L   G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y++ V  GGAT F  L L + P KG A F+      G  D  T HA  PV  G  
Sbjct: 217 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 273


>gi|194871359|ref|XP_001972833.1| GG13662 [Drosophila erecta]
 gi|190654616|gb|EDV51859.1| GG13662 [Drosophila erecta]
          Length = 515

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 68/205 (33%), Positives = 111/205 (54%), Gaps = 16/205 (7%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
           L CRY      +L+L PLK EE    P I+++ +V+ D EI+ +K           ++  
Sbjct: 297 LVCRYNFSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKEIEEMK---------GEIKQM 347

Query: 126 KTG--ELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHY 183
           + G   LE     +S   W+ + E    +RI+ R+  MTG        +Q+ N+G+GG++
Sbjct: 348 ENGWTSLEEPKEIVSHIYWITK-ESSFSKRINDRISDMTGFKVEEFPAIQLANFGVGGYF 406

Query: 184 EPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF 241
           +PHYD+   R  E +A  +LG  +R+A+++ Y  +V+QGG TVF  + +++ P+KG A F
Sbjct: 407 KPHYDYYTERLKELDANNTLG--DRLASIIIYAGEVSQGGQTVFPDIKVAVEPKKGKALF 464

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
           W N       D  + H+ CPV+ GS
Sbjct: 465 WFNDFDDSSPDPRSLHSVCPVIVGS 489


>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
 gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum IPO1609]
          Length = 280

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L++  + D E D +  + + RL+R+ V N +TGE  + + R S+ A  +  EHP++
Sbjct: 88  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 147

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
            RI  R+   TG+     E  QV++Y  GG Y+PH+D+  PG +   + L   G RVAT+
Sbjct: 148 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 207

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y++ V  GGAT F  L L + P KG A F+      G  D  T HA  PV  G  
Sbjct: 208 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 264


>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
 gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
          Length = 289

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L++  + D E D +  + + RL+R+ V N +TGE  + + R S+ A  +  EHP++
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 156

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
            RI  R+   TG+     E  QV++Y  GG Y+PH+D+  PG +   + L   G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y++ V  GGAT F  L L + P KG A F+      G  D  T HA  PV  G  
Sbjct: 217 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 273


>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
 gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
          Length = 292

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L++  + D E D +  + + RL+R+ V N +TGE  + + R S+ A  +  EHP++
Sbjct: 100 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 159

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
            RI  R+   TG+     E  QV++Y  GG Y+PH+D+  PG +   + L   G RVAT+
Sbjct: 160 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 219

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y++ V  GGAT F  L L + P KG A F+      G  D  T HA  PV  G  
Sbjct: 220 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 276


>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
 gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
          Length = 288

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L++  + D E D +  + + RL+R+ V N +TGE  + + R S+ A  +  EHP++
Sbjct: 96  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 155

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
            RI  R+   TG+     E  QV++Y  GG Y+PH+D+  PG +   + L   G RVAT+
Sbjct: 156 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLDVGGQRVATL 215

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y++ V  GGAT F  L L + P KG A F+      G  D  T HA  PV  G  
Sbjct: 216 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 272


>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
 gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
          Length = 288

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 66/175 (37%), Positives = 95/175 (54%), Gaps = 1/175 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L++  + D+E D +  + + RL+R+ V N  TGE  + + R S+    +  EHP+I
Sbjct: 96  PRIVLFQHFLSDAECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQVGEHPLI 155

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
            +I  R+    G+     E  QV+NY  GG Y+PH+DF  PG +   + L   G RVAT+
Sbjct: 156 AKIEVRIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEARQLEVGGQRVATM 215

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + Y++ V  GGAT F  L L + P KG A F+      G  D  T HA  PV  G
Sbjct: 216 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERG 270


>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
 gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
          Length = 288

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 66/175 (37%), Positives = 94/175 (53%), Gaps = 1/175 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L++  + D E D +  + + RL+R+ V N  TGE  + + R S+    +  EHP+I
Sbjct: 96  PRIVLFQHFLSDQECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQVGEHPLI 155

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
            +I  R+    G+     E  QV+NY  GG Y+PH+DF  PG +   + L   G RVAT+
Sbjct: 156 AKIEARIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEARQLEVGGQRVATM 215

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + Y++ V  GGAT F  L L + P KG A F+      G  D  T HA  PV  G
Sbjct: 216 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERG 270


>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
           CFBP2957]
 gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CFBP2957]
          Length = 289

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 97/177 (54%), Gaps = 1/177 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L++  + D E D +  + + RL+R+ V N +TGE  + + R S+ A  +  EHP++
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 156

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
            RI  R+   TG+     E  QV++Y  GG Y+PH+D+  PG +   + L   G RVAT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y++ V  GGAT F  L L + P KG A F+      G  D  T HA  PV  G  
Sbjct: 217 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEK 273


>gi|149068803|gb|EDM18355.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III [Rattus
           norvegicus]
          Length = 266

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 78/208 (37%), Positives = 108/208 (51%), Gaps = 33/208 (15%)

Query: 45  REKYEMLCRGDLTVPPAI-VAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYD 103
           R+ YE LC+   + P    +  L C Y   + PYL L P ++E  +L+P + LY D + D
Sbjct: 36  RDTYEGLCQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVIHLRPLVALYHDFVSD 95

Query: 104 SEIDLIKKMAQPRLRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMT 162
            E   I+++A+P L+R+ V    +GE ++   YRISKSAWL++   PV+  + RR+  +T
Sbjct: 96  EEAQKIRELAEPWLQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPVLVTLDRRIAALT 152

Query: 163 GLTTST--AEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQG 220
           GL      AE LQVVNYGIGGHYEPH+D A                       +S V  G
Sbjct: 153 GLDIQPPYAEYLQVVNYGIGGHYEPHFDHAT----------------------LSSVEAG 190

Query: 221 GATVFTSLNLSL----WPEKGTAAFWHN 244
           GAT F   N S+    WP  G  +   N
Sbjct: 191 GATAFIYGNFSVPVVKWPTSGYTSMDRN 218


>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
 gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
          Length = 282

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 74/210 (35%), Positives = 107/210 (50%), Gaps = 13/210 (6%)

Query: 59  PPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLR 118
           P A   Q   R   R VP L  +          P I LY+ ++ D+E D + ++A+ RL 
Sbjct: 65  PDASATQPAPRLARREVPVLFSL--------QSPSIRLYQHLLSDAECDALVELARGRLA 116

Query: 119 RATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYG 178
           R+ V N  TG+  + + R S  A  +  EH +I+RI  R+  + G+     E LQ++NY 
Sbjct: 117 RSPVINPDTGDENLIDARTSMGAMFQVGEHTLIQRIEDRIAAVLGVPVDHGEGLQILNYK 176

Query: 179 IGGHYEPHYDF---ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPE 235
            GG Y+PH+DF    RPGEA   +    G R AT++ Y++    GGAT F  + L + P 
Sbjct: 177 PGGEYQPHFDFFNPKRPGEARQLRV--GGQRTATLVIYLNTPQAGGATAFPRIGLEVAPV 234

Query: 236 KGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           KG A ++  L   G  D  T HA  PV +G
Sbjct: 235 KGNAVYFSYLQPDGKLDERTLHAGLPVQSG 264


>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
 gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
          Length = 283

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 96/177 (54%), Gaps = 1/177 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L++  + D E D +  + + RL+R+ V N +TGE  + + R S+ A  +  EHP++
Sbjct: 91  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 150

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
            RI  R+   TG+     E  QV++Y  GG Y+PH+D+  PG     + L   G RVAT+
Sbjct: 151 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRGGEARQLEVGGQRVATL 210

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y++ V  GGAT F  L L + P KG A F+      G  D  T HA  PV  G  
Sbjct: 211 VIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGMLDDNTLHAGLPVERGEK 267


>gi|390178051|ref|XP_002137433.2| GA30144 [Drosophila pseudoobscura pseudoobscura]
 gi|388859305|gb|EDY67991.2| GA30144 [Drosophila pseudoobscura pseudoobscura]
          Length = 546

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 71/210 (33%), Positives = 112/210 (53%), Gaps = 11/210 (5%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C+G   +P  + + L+C Y      +LRL PL+ E     P + +Y +V+  +E   +  
Sbjct: 272 CQGRSRLP--VQSSLRCHYSAEGSAFLRLAPLRMELLSRDPLVAVYHEVVSAAEQRHLML 329

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +++ +L+R     Y          R   SA +     P +E++ RR+E +TGL  + +E 
Sbjct: 330 LSESQLQRQRGHQYD-------KIRTFASASVAANATPTVEQLHRRLEDITGLDLAESEP 382

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           L+++NYGIGG Y  H D  +P      +      R+ATVL Y+SDV  GG T F +L L 
Sbjct: 383 LRILNYGIGGQYYIHVDCEQP--QTHVEPYPKEYRLATVLLYLSDVRLGGFTSFPALGLG 440

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACP 261
           + P +G+A  WHN +++G+ DY   HAACP
Sbjct: 441 IRPNRGSALVWHNANNAGNCDYRALHAACP 470


>gi|195341582|ref|XP_002037385.1| GM12897 [Drosophila sechellia]
 gi|194131501|gb|EDW53544.1| GM12897 [Drosophila sechellia]
          Length = 467

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 79/244 (32%), Positives = 120/244 (49%), Gaps = 33/244 (13%)

Query: 25  ELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLK 84
           + K  P K  + +P L     E Y+ LCR   +  P+   +L CRY      +L L  LK
Sbjct: 244 QFKANPYKAVDRSPKL----GEDYKRLCRSSFSPTPS---KLHCRYNSTTSRFLILASLK 296

Query: 85  EEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLR 144
            EE  L+P I+ Y D++ D +I  +  +A+P L+   V +    E + ++ R S      
Sbjct: 297 MEEISLEPYIVAYHDILPDKDIQQLITLAEPLLKPIEVFDENKNEAKSSD-RTSLGG--- 352

Query: 145 EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTG 204
               P+++R++ R+  +TGL       + ++ YG G H E                 G G
Sbjct: 353 ----PLLDRLTERMRDITGLQIPQGNPINIIKYGFGAHSETE---------------GYG 393

Query: 205 NRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDG-DYYTRHAACPVL 263
           +R+ATV+FY++D   GGATVF  LN+ +  E+G    W+NL+  GD  D  T HA CPV 
Sbjct: 394 DRMATVMFYLNDAPYGGATVFPRLNVKVPAERGKVLLWYNLN--GDSQDVTTVHAVCPVF 451

Query: 264 TGSN 267
            GS 
Sbjct: 452 HGSK 455


>gi|241778760|ref|XP_002399787.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215508519|gb|EEC17973.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 427

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 78/246 (31%), Positives = 123/246 (50%), Gaps = 28/246 (11%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
           P   R + +K Y  E   + P+               E  + + Y+ LCRG+      + 
Sbjct: 139 PVRDRMKRSKEYKAELFQEDPQ---------------EYQDSQNYKRLCRGEQLRTLKMD 183

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT-- 121
           +QL+CRY      + +L P+K EE  L+P I++  DV+ D +++ +   A+PR R     
Sbjct: 184 SQLRCRYYKGQDGFFKLQPIKLEEFNLKPYIVVLHDVIQDRDLEDLIAFAKPRARNTIPL 243

Query: 122 VQNYK-TGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST----AEELQVVN 176
            +N K    L+     ++ S WL E    +  R++R +  + G+ TS     AE  Q+ N
Sbjct: 244 FRNVKWCTFLKRFCSLLAASTWLFEQNATIASRLNRYLTALLGMGTSDSNFEAEPYQLAN 303

Query: 177 YGIGGHYEPHYD-----FARPGEANAFKSLGT-GNRVATVLFYMSDVAQGGATVFTSLNL 230
           YG GGHY PH+D     +    E + F    + G+R+AT++ YMSDV +GGATVF  L +
Sbjct: 304 YGTGGHYLPHHDYLYDVYEDSDETDDFSQFPSYGDRLATLMIYMSDVEEGGATVFPKLGV 363

Query: 231 SLWPEK 236
            L P+K
Sbjct: 364 RLTPKK 369


>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
 gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
          Length = 293

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 72/209 (34%), Positives = 107/209 (51%), Gaps = 13/209 (6%)

Query: 60  PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
           P +      R+  R +P L  +          PRI+L ++++ D+E D +  +A+ RL+R
Sbjct: 77  PTVTGGNAFRHKDREMPVLFRLE--------SPRILLLQNLLDDAECDAVVALARDRLQR 128

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
           + V N  TG+  + + R S  A  +  EH +++RI  R+  +TG      E  QV+NY  
Sbjct: 129 SPVVNPDTGDENLIDARTSMGAMFQVGEHALLQRIEARIAAVTGWPVEHGEGFQVLNYKP 188

Query: 180 GGHYEPHYDF---ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEK 236
           GG Y+PH+DF    RPGEA   +    G RVAT++ Y++  A GGAT F  + L + P K
Sbjct: 189 GGEYQPHFDFFNPKRPGEARQLRV--GGQRVATMVIYLNSPASGGATAFPRIGLEVAPVK 246

Query: 237 GTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           G A  +      G  D  T HA  PV  G
Sbjct: 247 GNAVLFSYGLPDGALDERTLHAGLPVEAG 275


>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
 gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
          Length = 293

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 67/177 (37%), Positives = 98/177 (55%), Gaps = 5/177 (2%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+L ++++ D+E D +  +A+ RL+R+ V N  TG+  + + R S  A  +  EH ++
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQVGEHALL 160

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
           +RI  R+  +TG      E  QV+NY  GG Y+PH+DF    RPGEA   +    G RVA
Sbjct: 161 QRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRV--GGQRVA 218

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           T++ Y++  A GGAT F  + L + P KG A  +      G  D  T HA  PV  G
Sbjct: 219 TMVIYLNSPASGGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAG 275


>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
 gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
          Length = 498

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 72/235 (30%), Positives = 120/235 (51%), Gaps = 21/235 (8%)

Query: 38  PTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILY 97
           P L++ E + +   C       P    +L C Y  +   +L L P K E     P I+++
Sbjct: 241 PYLDIMEND-FIKFCGSSYMPQPT---RLVCSYKTKPSKFLYLAPFKMELLSEDPYIVVF 296

Query: 98  RDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLR----EPEHP-VIE 152
            DV+YDSEI  ++  A+P L R+ V+     E  ++  R +K A++      PE   V++
Sbjct: 297 HDVIYDSEIKHLRNTAEPLLHRSYVKK-SNNESVVSKVRTAKGAFMHADRLSPESAQVVQ 355

Query: 153 RISRRVEHMTGLTTSTA--EELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           R+ +R+  ++ L        E+Q +NY  G HY  H D+          ++   +R+AT 
Sbjct: 356 RLKQRMGDLSDLNIKREGYNEMQYLNYDFGDHYLLHMDYF---------NISMNDRIATF 406

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           L Y++DV +GG T+F  +  ++ PEKG    W+N++S+ D +  + H ACPVL G
Sbjct: 407 LIYLNDVTRGGGTIFPQVKQAVHPEKGKLILWYNMNSNLDYELASLHGACPVLIG 461


>gi|405967005|gb|EKC32220.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 303

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 94/289 (32%), Positives = 137/289 (47%), Gaps = 44/289 (15%)

Query: 1   MIFPTHQRAQGNKLYYQEALNKSPELKDEPPKVNNV-APTLEVTEREKYEMLCRGDLTVP 59
           M F   QR Q + L  +EA   S    D    +N+  AP         +  LCRG     
Sbjct: 1   MEFTDFQRFQAHNLVIKEATRSSISQDD----INSFFAPP------NTFMKLCRGPAK-S 49

Query: 60  PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
             + ++L+C      +P   +   KEE     PRI L+ DV+ + +I  +KK    +L  
Sbjct: 50  KIVESKLRCYLRKTAIP---IYMAKEEVVNYTPRISLFHDVISNDDIRQLKKAGTKKLTH 106

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHP-VIERISRRVEHMTGLTT------STAEEL 172
           +     +TG   +   R+S++ W+ +   P V  R++RR+ ++  L T      S  E  
Sbjct: 107 S-----RTGGGYVTRLRVSQTGWVYDQAIPQVSRRLARRIANIVNLDTTFRSKASPVEPW 161

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAF-----------KSLG---TGNRVATVLFYMSDVA 218
           QV++Y  GG+Y  H D   P   + F           ++L    TG R+AT +FY+SDV 
Sbjct: 162 QVLSYTTGGYYGEHID---PDIGDEFLWNMTEAVQGPRALWRKHTGQRIATWMFYLSDVE 218

Query: 219 QGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            GGATVF  L   +   KG AAFW+NL  SG  D  T+HA CPV+ GS 
Sbjct: 219 AGGATVFPKLEARVPVVKGAAAFWYNLTPSGKIDRRTQHAGCPVILGSK 267


>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
 gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
          Length = 283

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 63/175 (36%), Positives = 98/175 (56%), Gaps = 3/175 (1%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P ++    V+   E D +  +++ RL+ + V +  +GE    + R SKS   R  E+ +
Sbjct: 95  KPFVLHLDQVLSSEECDELISLSRSRLQPSLVVDRGSGEERAGSGRTSKSMAFRLKENEL 154

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +ERI  R+  +TG      E LQ++NYG+G  Y+PH+DF  P  A+A K    G RV T 
Sbjct: 155 VERIETRIAELTGYPAENGEGLQILNYGLGEEYKPHFDFFPPHMADASKG---GQRVGTF 211

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           L Y++DV  GG TVF+   LS  P+KG A ++H  ++ G  D  + H++ PV  G
Sbjct: 212 LIYLNDVEDGGETVFSKAGLSFVPKKGAAIYFHYGNAQGQLDRLSVHSSVPVRKG 266


>gi|156352046|ref|XP_001622583.1| predicted protein [Nematostella vectensis]
 gi|156209154|gb|EDO30483.1| predicted protein [Nematostella vectensis]
          Length = 497

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 86/242 (35%), Positives = 121/242 (50%), Gaps = 40/242 (16%)

Query: 34  NNVAPTLEVTEREK--------YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKE 85
           +N+   + V  R+K        YE LCRG       I  QL+C Y   + P LRL P K 
Sbjct: 262 DNLPSRVNVGNRDKGKEDHAFDYERLCRGQPN-KVRIPKQLRC-YYKSSHPLLRLKPAKI 319

Query: 86  EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE 145
           E      +I+L RDV+ +S++  IK++A P++    +        E    R S SAWL +
Sbjct: 320 EVLDPDRQILLLRDVINESQMQFIKELAAPKVSSLHLSPTNRSPSE---RRFSSSAWLGD 376

Query: 146 PEHPVIERISRRVEHMTGL--TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
            +   I  +SRR+E +T    T  +AE LQVV++GIGGH+EP Y +      NA      
Sbjct: 377 ADGAPIAALSRRIEAITDFHVTGDSAESLQVVHFGIGGHFEPRYGY------NA------ 424

Query: 204 GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVL 263
                     ++ V  GG+ VF    LS+ P+KG+A FW N+  SG     T HAACPV+
Sbjct: 425 ----------LNFVDAGGSNVFLDSELSVSPQKGSAVFWLNMRRSGKE---TLHAACPVI 471

Query: 264 TG 265
            G
Sbjct: 472 VG 473


>gi|344253558|gb|EGW09662.1| Glucose 1,6-bisphosphate synthase [Cricetulus griseus]
          Length = 904

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 83/239 (34%), Positives = 120/239 (50%), Gaps = 40/239 (16%)

Query: 3   FPTHQRAQGNKLYY-----QEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLT 57
           +P ++R   N L Y     Q  L  + E   + P V N+        R+ YE LC+   +
Sbjct: 649 YPDNKRMARNVLKYERLLSQNTLQMATETVIQRPNVPNL------QTRDTYEGLCQTLGS 702

Query: 58  VPPAIV-AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR 116
            P      +L C Y   + PYL L P ++E  +L+P + LY D + D+E   I+++A+P 
Sbjct: 703 QPTHYQNPRLYCSYETNSSPYLLLQPARKEVIHLRPFVALYHDFVSDAEAQKIRELAEPW 762

Query: 117 LRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEELQ 173
           L+R+ V    +GE ++   YRISKSAWL++   P++  +  R+  +TGL      AE LQ
Sbjct: 763 LQRSVV---ASGEKQLPVEYRISKSAWLKDTVDPMLGTLDHRIAALTGLDIQPPYAEYLQ 819

Query: 174 VVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           VVNYGIGGHYEPH+D A                       +S V  GGAT F   N S+
Sbjct: 820 VVNYGIGGHYEPHFDHAT----------------------LSAVEAGGATAFIYANFSV 856


>gi|260787668|ref|XP_002588874.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
 gi|229274045|gb|EEN44885.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
          Length = 151

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 66/150 (44%), Positives = 88/150 (58%), Gaps = 15/150 (10%)

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTST--AEELQVVNYGIGGHYEPHYDFARPGEANA 197
           S WL + EH VI ++SRRVE++TGL  +    E  QV+NYG+GG YEPH D+ R  +   
Sbjct: 1   SGWLFDTEHTVIAKLSRRVEYITGLDVNWPYGEAFQVLNYGLGGFYEPHVDYFRDEQP-- 58

Query: 198 FKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRH 257
              L  G R+ T LFY+SDV  GGATVFT LNL++   K +A  +H+L  S + +  + H
Sbjct: 59  -ALLTNGQRIVTFLFYLSDVEAGGATVFTRLNLTVPAVKNSAVLFHDLKRSLEFEKDSEH 117

Query: 258 AACPVLTGSNSLHST----------CPCGL 277
           A CPVL GS  + +            PCGL
Sbjct: 118 AGCPVLMGSKWIANKWIHAHGNEFRWPCGL 147


>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
 gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
          Length = 297

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 100/177 (56%), Gaps = 5/177 (2%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           P++ L++ ++ D E D +  +++ RL R+ V N  TG+  + + R S  A  +  EHP+I
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHPLI 164

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
            RI  R+  +TG+     E LQ++NY  GG Y+PH+D+    RPGEA    S+G G R+A
Sbjct: 165 TRIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQL-SVG-GQRIA 222

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           T++ Y++    GGAT F  + L + P KG A ++  L   G  D  T HA  PV  G
Sbjct: 223 TLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGALDERTLHAGLPVAFG 279


>gi|195575137|ref|XP_002105536.1| GD21536 [Drosophila simulans]
 gi|194201463|gb|EDX15039.1| GD21536 [Drosophila simulans]
          Length = 465

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 75/222 (33%), Positives = 111/222 (50%), Gaps = 29/222 (13%)

Query: 46  EKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           E Y+ LCR   +  P    +L CRY     P+L L PLK EE  L+P I++Y D++ D +
Sbjct: 261 EDYKRLCRSSFSPTPL---KLHCRYNSTTSPFLILAPLKMEEISLEPYIVMYHDILPDKD 317

Query: 106 IDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLT 165
           I  +  +A+P L        K  E+   N   +KS+        +++R++ R+  +TGL 
Sbjct: 318 IQQLITLAEPLL--------KPTEMFDENKNEAKSSDRPALGGLLLDRLNERMGDITGLQ 369

Query: 166 TSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF 225
                 + ++ Y  G H E                 G G+R+ TV+FY++D   GGATVF
Sbjct: 370 IPQGNPINIIKYAFGAHSETE---------------GYGDRMDTVMFYLNDAPYGGATVF 414

Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGD-GDYYTRHAACPVLTGS 266
             LN+ +  E+G    W+NL  +GD  D  T HAACPV  GS
Sbjct: 415 PHLNVKVPAERGKVLLWYNL--NGDTQDVTTVHAACPVFHGS 454


>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
 gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
          Length = 297

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 65/177 (36%), Positives = 100/177 (56%), Gaps = 5/177 (2%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           P++ L++ ++ D E D +  +++ RL R+ V N  TG+  + + R S  A  +  EH +I
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHALI 164

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
            RI  R+  +TG+     E LQ++NY  GG Y+PH+D+    RPGEA    S+G G R+A
Sbjct: 165 ARIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQL-SVG-GQRIA 222

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           T++ Y++    GGAT F  + L + P KG A ++  L   G  D  T HA  PV +G
Sbjct: 223 TLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDERTLHAGLPVASG 279


>gi|195113263|ref|XP_002001187.1| GI10646 [Drosophila mojavensis]
 gi|193917781|gb|EDW16648.1| GI10646 [Drosophila mojavensis]
          Length = 471

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 72/202 (35%), Positives = 96/202 (47%), Gaps = 33/202 (16%)

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
            L CRY +   P+LR+ PLK EE  L P I+LY   +Y+SEI+ + K  +  L       
Sbjct: 277 HLHCRYNYWMTPFLRIAPLKLEELSLDPLIVLYHKAIYNSEIETLLKRQEFNLISGKDNM 336

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
            +T                          I  RV  M+GL    +E L V+N    GH++
Sbjct: 337 DRT--------------------------IHERVADMSGLNLDRSEVLSVINNDNNGHFQ 370

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
              D     E          +R+ATVLFY+ DV   GAT+F  LNL++ PEKGTA  WHN
Sbjct: 371 LQEDAPETTE-------RPQDRIATVLFYLEDVELVGATIFPRLNLTIKPEKGTALLWHN 423

Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
           L S G       +AACPV++ S
Sbjct: 424 LESCGSSHPKALYAACPVISSS 445


>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
           19424]
 gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
           taiwanensis LMG 19424]
          Length = 296

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 65/177 (36%), Positives = 99/177 (55%), Gaps = 5/177 (2%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           P++ L++ ++ D E D +  +++ RL R+ V N  TG+  + + R S  A  +  EH +I
Sbjct: 104 PQVQLFQQLLSDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHALI 163

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
            RI  R+  +TG+     E LQ++NY  GG Y+PH+D+    RPGEA    S+G G R+A
Sbjct: 164 ARIEARIAAVTGVPADHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQL-SVG-GQRIA 221

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           T++ Y++    GGAT F  + L + P KG A ++  L   G  D  T HA  PV  G
Sbjct: 222 TLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDDRTLHAGLPVAAG 278


>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 286

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 60/178 (33%), Positives = 99/178 (55%), Gaps = 1/178 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P+++++ DV+  +E   + + ++ RL+R+T  N  TG  ++   R S+  W R  E  +
Sbjct: 96  RPQLVVFADVLSAAECAELIERSRHRLKRSTTVNPLTGREDVIRNRTSEGVWYRRGEDQL 155

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
           I R+ RR+  +T       E LQV++YG  G Y PH+DF  P +  +A  +   G RVAT
Sbjct: 156 IARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPHFDFFAPDQPGSAVHTTQGGQRVAT 215

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           ++ Y++DVA GG TVF +  LS+  + G A ++  +++    D  T H   PVL G  
Sbjct: 216 LIIYLNDVADGGETVFPTAGLSVAAQAGGAVYFRYMNAERQLDPSTLHGGAPVLAGDK 273


>gi|312385117|gb|EFR29691.1| hypothetical protein AND_01144 [Anopheles darlingi]
          Length = 295

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 74/216 (34%), Positives = 118/216 (54%), Gaps = 8/216 (3%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C+G    P  + + L+C Y  RN  +  + P K E    +P + L+ DV++DSEI  +++
Sbjct: 45  CKGTYQRPVGLTSWLRCWYDARN-DHSVIGPRKVEMLNYEPFVALFYDVIHDSEITRLQE 103

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +    ++   V    T       Y   ++  L+  + PV++R+S+R E M+GL+  TAE+
Sbjct: 104 LGDGVIK---VSGATTDGWLPVYYENHQTYTLQNRDDPVVKRLSQRTERMSGLSCDTAED 160

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDV--AQGGATV-FTSL 228
           L+V+ Y   G Y+      +   + A +    G R+ATVLF+MSDV  A+GG  + F  L
Sbjct: 161 LKVI-YNEVGAYKSFIVDGKKKSSVAQQFAFAGKRLATVLFFMSDVDGAEGGGRIAFPYL 219

Query: 229 NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLT 264
            LS+ P+KG A FW+NLH SG  D    ++ CP+L 
Sbjct: 220 GLSVLPQKGAALFWYNLHDSGRPDERMTYSICPLLA 255


>gi|195069799|ref|XP_001997030.1| GH12979 [Drosophila grimshawi]
 gi|193891499|gb|EDV90365.1| GH12979 [Drosophila grimshawi]
          Length = 517

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 115/221 (52%), Gaps = 12/221 (5%)

Query: 48  YEMLCRGDLTVPPAIVAQL--KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           Y  LC+G   +P     Q   +C        Y +L PLK E+  L P I +Y  V+ D++
Sbjct: 280 YVRLCQGK-RLPEIKTNQSSPRCYLDSNRHAYFKLSPLKVEQVNLDPDINIYYGVLNDNQ 338

Query: 106 IDLIKKMA-QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           I  I +++ +    R+T + Y      I++ RIS+  WL     P++    + V  ++G 
Sbjct: 339 IKSILRLSDELDSFRSTHRKYV-----ISDMRISQQVWLNYSS-PIMRTYRQLVGAISGF 392

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
             +  E +Q+ NYGIGGHYEPH D+   G          G+R++T + Y+SDV QGG TV
Sbjct: 393 NMTNVEIMQLANYGIGGHYEPHIDYM--GSPLPPYYAKRGDRISTSMIYLSDVQQGGYTV 450

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           F + N+ + P KG+   W+N   S + D+ T HA C V+ G
Sbjct: 451 FPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIEG 491


>gi|195055777|ref|XP_001994789.1| GH14121 [Drosophila grimshawi]
 gi|193892552|gb|EDV91418.1| GH14121 [Drosophila grimshawi]
          Length = 517

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 115/221 (52%), Gaps = 12/221 (5%)

Query: 48  YEMLCRGDLTVPPAIVAQL--KCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSE 105
           Y  LC+G   +P     Q   +C        Y +L PLK E+  L P I +Y  V+ D++
Sbjct: 280 YVRLCQGK-RLPEIKTNQSSPRCYLDSNRHAYFKLSPLKVEQVNLDPDINIYYGVLNDNQ 338

Query: 106 IDLIKKMA-QPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           I  I +++ +    R+T + Y      I++ RIS+  WL     P++    + V  ++G 
Sbjct: 339 IKSILRLSDELDSFRSTHRKYV-----ISDMRISQQVWLNYSS-PIMRTYRQLVGAISGF 392

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
             +  E +Q+ NYGIGGHYEPH D+   G          G+R++T + Y+SDV QGG TV
Sbjct: 393 NMTNVEIMQLANYGIGGHYEPHIDYM--GSPLPPYYAKRGDRISTSMIYLSDVQQGGYTV 450

Query: 225 FTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           F + N+ + P KG+   W+N   S + D+ T HA C V+ G
Sbjct: 451 FPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIEG 491


>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
 gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
          Length = 281

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 61/135 (45%), Positives = 85/135 (62%), Gaps = 8/135 (5%)

Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTS--TAEELQVVNYGIGGHYEPHYDFARPG 193
           RIS+ AWL + +  ++ R+S+R+  +TGL T+  + E LQV+NYG+GG YEPH+D+    
Sbjct: 126 RISQQAWLHDKDDEIVARVSKRIGLLTGLNTTPTSTELLQVLNYGLGGQYEPHHDYMTAE 185

Query: 194 EANAFKSLGT--GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDG 251
           E    K  GT  GNR+AT L Y+SDV  GGATVF   N+++   K     + +L  SG G
Sbjct: 186 E----KMWGTILGNRMATFLMYLSDVTAGGATVFPVANVTVPVVKNAGLLFMDLLRSGRG 241

Query: 252 DYYTRHAACPVLTGS 266
           D  + HA CPV+ GS
Sbjct: 242 DVNSLHAGCPVVIGS 256


>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
 gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
          Length = 286

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 61/176 (34%), Positives = 98/176 (55%), Gaps = 1/176 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           QP + +   V+   E D + + A  +L+R+T+ +  TG+ E    R S+  +        
Sbjct: 94  QPVLAVLDGVLSHEECDELIRRAAAKLQRSTIVDPTTGKHETIADRSSEGTFFEINADDF 153

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
           I R+ RR+  +  L     E LQ+++YG GG Y+PH+DF  PG+  +   + T G RV+T
Sbjct: 154 IARLDRRISALMNLPVDHGEGLQILHYGPGGEYKPHFDFFPPGDPGSAVQMATGGQRVST 213

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           ++ Y+++V  GGAT+F  L LS+ P+KG+A ++   +S G  D  T H   PVL G
Sbjct: 214 LVMYLNEVEDGGATIFPELGLSVLPKKGSAVYFEYTNSRGQLDPRTLHGGAPVLRG 269


>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
 gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
          Length = 277

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 97/177 (54%), Gaps = 3/177 (1%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           P ++++ +++ DSE + + ++AQPRL R+   N KTG  E    R S+  +    E+P++
Sbjct: 90  PDLVVFGNLLSDSECEALMEVAQPRLARSLTVNIKTGGEERNRDRTSQGMFFARGENPLV 149

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATV 210
           +R+  R+  + G      E LQV+ Y  G  Y+PHYD+  P E      L   G RVAT+
Sbjct: 150 QRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHYDYFDPAEPGTPAILQRGGQRVATL 209

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y+++  QGGATVF  + L + P +GTA F+   + + +    TRH   PV  G  
Sbjct: 210 IMYLNEPEQGGATVFPDIGLQVTPRRGTAVFFS--YPAANPASLTRHGGEPVKAGEK 264


>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 280

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 62/176 (35%), Positives = 95/176 (53%), Gaps = 1/176 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI++  +V+ D E D I  M++ R  R+T  +  +G     + R S+SA ++  E  +I
Sbjct: 92  PRIVVLGNVLSDDECDAIAAMSRTRFARSTTIDNASGINRFDDSRTSESAHIQRGETELI 151

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-GTGNRVATV 210
            RI  R+  ++G      E LQ+  Y  G  Y PH+D+  P  A   K L  +G R+AT+
Sbjct: 152 ARIDARLAALSGWPVDHGEPLQLQKYQAGNEYRPHFDWFDPALAGTAKHLEKSGQRLATI 211

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + Y++DV +GG T F  + L + P+KG A F+ N    G  D  T+HA  PV  G+
Sbjct: 212 ILYLTDVEEGGGTSFPGIGLDVHPQKGGALFFRNTTPYGVPDRKTQHAGLPVEKGT 267


>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
 gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
          Length = 296

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 66/200 (33%), Positives = 102/200 (51%), Gaps = 20/200 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K  +   +PR  LY+  M  +E D + KMA+ +L+++ V + ++G+  ++N R S   
Sbjct: 39  PTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSSGM 98

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           +L + +  VI RI  R+   T L     E +QV+ Y  G  YEPHYD+      + +   
Sbjct: 99  FLSKGQDEVINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFH----DKYNQA 154

Query: 202 GTGNRVATVLFYMSDVAQGGATVF-----TSLNLSLW-----------PEKGTAAFWHNL 245
             G+R+ATVL Y+SDV +GG TVF     T++    W           P KG A  +++L
Sbjct: 155 LGGHRIATVLMYLSDVVKGGETVFPSSEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYSL 214

Query: 246 HSSGDGDYYTRHAACPVLTG 265
           H     D  + H  CPV+ G
Sbjct: 215 HPDATPDESSLHGGCPVIEG 234


>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Alteromonas sp. S89]
          Length = 294

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 63/192 (32%), Positives = 100/192 (52%), Gaps = 6/192 (3%)

Query: 80  LMPLKEEE-----AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIAN 134
           ++PL +++     A  QP I+L+ + + + E D + +M++P L  + V N + G  E+  
Sbjct: 86  VIPLGDQQVEARFAIRQPNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQHGAFELKP 145

Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE 194
            R S        E P+I  I  R+  +  +  +  E LQ+++Y + G Y PHYDF  P +
Sbjct: 146 SRTSGGTHFARGETPLIADIEARIASLLKVPEAHGEPLQILHYPVSGEYRPHYDFFDPEK 205

Query: 195 ANAFKSLGT-GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
               + L   G RV T++ Y+SDV  GGATVF  + L + P+KG A F+  +   G  D 
Sbjct: 206 PGNQEVLAAGGQRVGTLIMYLSDVESGGATVFPRVGLEVQPQKGAALFFSYVGEHGKLDL 265

Query: 254 YTRHAACPVLTG 265
            + H   PVL G
Sbjct: 266 QSLHGGSPVLAG 277


>gi|195166671|ref|XP_002024158.1| GL22696 [Drosophila persimilis]
 gi|194107513|gb|EDW29556.1| GL22696 [Drosophila persimilis]
          Length = 491

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/207 (31%), Positives = 105/207 (50%), Gaps = 32/207 (15%)

Query: 60  PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
           P  V  + CRY+ R+ P+L+L P+++E    +  + LY D+    EI+ +K +A+PRL+R
Sbjct: 295 PRKVNDVHCRYL-RSTPFLQLAPIRQENLDNEAHVYLYHDLFNHEEIEALKSLARPRLKR 353

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
             + +  T           K A L      +I  ++RR++ ++G+  +  E LQVVNYGI
Sbjct: 354 QKISSNFT----------CKIAQLSNSAQDIIRTVNRRIQDVSGMDMNEKEVLQVVNYGI 403

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
            G Y+                  +    AT L +MS+V QGG TVF  L+L + P+KG+ 
Sbjct: 404 AGRYDLD---------------DSAGSAATALIFMSNVQQGGETVFPFLSLRVKPQKGSL 448

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
             W N       D+   H +CP++ G+
Sbjct: 449 LLWRN------TDWSVLHNSCPLIIGN 469


>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
 gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
          Length = 283

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 101/201 (50%), Gaps = 21/201 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K  +   +PR  LY+  M  +E D + KMA+ +L+++ V + ++G+  ++N R S   
Sbjct: 25  PTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSSGM 84

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           +L + +  VI RI  R+   T L     E +QV+ Y  G  YEPHYD+      + +   
Sbjct: 85  FLSKGQDEVINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFH----DKYNQA 140

Query: 202 GTGNRVATVLFYMSDVAQGGATVF------TSLNLSLW-----------PEKGTAAFWHN 244
             G+R+ATVL Y+SD  +GG TVF      T++    W           P KG A  +++
Sbjct: 141 LGGHRIATVLMYLSDAVKGGETVFPSSEEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYS 200

Query: 245 LHSSGDGDYYTRHAACPVLTG 265
           LH     D  + H  CPV+ G
Sbjct: 201 LHPDATPDESSLHGGCPVIEG 221


>gi|194751827|ref|XP_001958225.1| GF23630 [Drosophila ananassae]
 gi|190625507|gb|EDV41031.1| GF23630 [Drosophila ananassae]
          Length = 431

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 68/222 (30%), Positives = 110/222 (49%), Gaps = 40/222 (18%)

Query: 48  YEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEID 107
           YE+ CRG   +      +L C+Y     P+L++ PLK+E   L P I ++ +V+Y+ E+ 
Sbjct: 244 YELGCRGLFPLK----NKLFCQYNFHTTPFLKIAPLKQEILSLDPFISMFHEVLYEYELH 299

Query: 108 LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTS 167
            +K+  +  ++    + YK                       +  R S+R+  +TGL  S
Sbjct: 300 GLKEDLKNPIKS---KKYKKN---------------------ITNRFSQRLTDITGLHFS 335

Query: 168 TAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTS 227
             +++ + NYG+    E HY++              G  V  +LF++SD  QGGATVF  
Sbjct: 336 KRDQINIDNYGLENQAEVHYNYK-----------DIGGPVGAILFFISDDVQGGATVFPK 384

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           L +S++P+KG+   W+N+   G  D  T H+ CPVL G NSL
Sbjct: 385 LKVSVFPKKGSCLVWYNIKDDGRLDPRTTHSICPVLEG-NSL 425


>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 292

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 56/180 (31%), Positives = 97/180 (53%), Gaps = 1/180 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P++I++ DV+   E   + + ++ RL+R+T  N  TG+ ++   R S+  W +  E P 
Sbjct: 102 RPQVIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQRGEDPF 161

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
           IER+ RR+  +        E LQ+++YG  G Y PH+D+  P +  +A  +   G RVAT
Sbjct: 162 IERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQRVAT 221

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           ++ Y++DV  GG T+F    +S+   +G A ++  ++     D  T H   PVL G   +
Sbjct: 222 LVIYLNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNDRRQLDPLTLHGGAPVLAGDKWI 281


>gi|386766694|ref|NP_651648.5| CG11828 [Drosophila melanogaster]
 gi|383293009|gb|AAF56834.5| CG11828 [Drosophila melanogaster]
          Length = 458

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 67/216 (31%), Positives = 106/216 (49%), Gaps = 16/216 (7%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG   +P    + L+CRY+    P+LR+ P+K E+  ++P + L+ D +  +E   +  
Sbjct: 239 CRGKNLLPSK--SYLRCRYLRDGSPFLRMAPVKLEQLNIEPFVGLFHDAISPAEQKDLLH 296

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +   RL       ++  +      ++  +A     +H  + RI +R+E +TG     +E 
Sbjct: 297 LTDSRL------EHRKKDSSSVEAKVDTNA----SDH--VRRIHQRIEDITGFDLEESEP 344

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           L V NYGIGG    H D  +P E   +       R A+ +FY+SDV  GG   F  L   
Sbjct: 345 LTVSNYGIGGQDFIHLDCEQPKEFIGY--YPKEYRSASAMFYLSDVQMGGYASFPDLGFG 402

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
             P +G+A  WHN  +SG+ D  +  A CPVL G+ 
Sbjct: 403 FKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQ 438


>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
 gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
          Length = 286

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 53/176 (30%), Positives = 103/176 (58%), Gaps = 1/176 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P I++  + M   E + + + ++ +L  + + + +TG+ ++   R S+  + +  E P+
Sbjct: 95  RPDIVVVDEFMSGEECEQLIEQSRRKLTPSAIVDPQTGKFQVIADRSSEGTYFQRGESPL 154

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEAN-AFKSLGTGNRVAT 209
           I R+ RR+  +        E +Q+++YG+G  Y+PH+D+    E+  A +   +G RVAT
Sbjct: 155 ISRLDRRISELMNWPEDHGEGIQILHYGVGAQYKPHFDYFLENESGGALQMTQSGQRVAT 214

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           ++ Y+++V +GG TVF  + +S+ P++G+AA++   +S G  D  T H   PVLTG
Sbjct: 215 LVMYLNEVTEGGETVFPDVGISITPKRGSAAYFAYCNSLGQVDPATLHGGAPVLTG 270


>gi|198466393|ref|XP_001353986.2| GA18007 [Drosophila pseudoobscura pseudoobscura]
 gi|198150579|gb|EAL29722.2| GA18007 [Drosophila pseudoobscura pseudoobscura]
          Length = 455

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 64/207 (30%), Positives = 105/207 (50%), Gaps = 32/207 (15%)

Query: 60  PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
           P  V  + CRY+ R+ P+L+L P+++E    +  + LY D+    EI+ +K +A+P+L+R
Sbjct: 259 PRKVNDVHCRYL-RSTPFLQLAPIRQENLDNEAHVYLYHDLFNHEEIEALKSLARPKLKR 317

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGI 179
             + +  T           K A L      +I  ++RR++ ++G+  +  E LQVVNYGI
Sbjct: 318 QKISSNFT----------CKIAQLSNSAQDIIRTVNRRIQDVSGMDMNEKEMLQVVNYGI 367

Query: 180 GGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTA 239
            G Y+                  +    AT L +MS+V QGG TVF  L+L + P+KG+ 
Sbjct: 368 AGRYDLD---------------DSAGSAATALIFMSNVQQGGETVFPFLSLRVKPQKGSL 412

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTGS 266
             W N       D+   H +CP++ G+
Sbjct: 413 LLWRN------TDWSVLHNSCPLIIGN 433


>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
 gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
          Length = 305

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 58/182 (31%), Positives = 99/182 (54%), Gaps = 1/182 (0%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P++I++ DV+   E D + + A+ RL+R+T  N ++G  ++   R S+  W +  E 
Sbjct: 113 FERPQVIVFDDVLSRDECDELIERARHRLKRSTTVNPESGREDVIQLRTSEGFWFQRCED 172

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANA-FKSLGTGNRV 207
             IER+ RR+  +        E LQ+++Y  GG Y PH+D+  P ++ +   +   G RV
Sbjct: 173 AFIERLDRRISALMNWPLEHGEGLQILHYTKGGEYRPHFDYFPPSQSGSVLHTSRGGQRV 232

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           AT++ Y+SDVA GG TVF +  L++   +G A ++  L+     D  T H   PV  G  
Sbjct: 233 ATLIVYLSDVAGGGETVFPNAGLAVMARQGGAIYFRYLNGHRQLDPLTLHGGAPVTNGEK 292

Query: 268 SL 269
            +
Sbjct: 293 WI 294


>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
          Length = 150

 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 58/126 (46%), Positives = 73/126 (57%), Gaps = 2/126 (1%)

Query: 142 WLREPEHPVIERISRRVEHMTGLTTST-AEELQVVNYGIGGHYEPHYDFARPGE-ANAFK 199
           WLR       +++SRRV   T L     AE  QV  YGIGGHYEPH+DF++     N   
Sbjct: 2   WLRSENSASADKLSRRVSSATKLDAEKYAELFQVSTYGIGGHYEPHFDFSKVKYFTNPVL 61

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAA 259
           +   G+R+AT + Y++DV  GG TVF  LNL + P K +A FWHNL   G  D  T H A
Sbjct: 62  NEQMGDRIATFMIYLNDVEAGGRTVFPRLNLVIEPIKNSAVFWHNLLDDGQQDDRTIHGA 121

Query: 260 CPVLTG 265
           CPV+ G
Sbjct: 122 CPVVLG 127


>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Collimonas fungivorans Ter331]
 gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
           [Collimonas fungivorans Ter331]
          Length = 289

 Score =  110 bits (276), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 57/176 (32%), Positives = 96/176 (54%), Gaps = 1/176 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR IL+ +V+   E D +  +++ +L R+ V +++TG  ++  +R S   +      P 
Sbjct: 99  KPRAILFGNVLSHDECDQLIALSKTKLLRSGVVDHQTGNTKLHEHRTSSGTFFHRGTTPF 158

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVAT 209
           I  I +R+  +  +  S  E LQ++NY +GG Y PHYD+ RP    + K L   G R AT
Sbjct: 159 IAMIDKRLAALMQVPESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLARGGQRTAT 218

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           ++ Y++DV  GG T+F    LS+ P KG+A ++   ++    D  + H   PV+ G
Sbjct: 219 LIIYLNDVDGGGETIFPRNGLSIVPAKGSAIYFSYTNAENQLDSLSFHGGSPVIEG 274


>gi|170591594|ref|XP_001900555.1| prolyl 4-hydroxylase 2 precursor [Brugia malayi]
 gi|158592167|gb|EDP30769.1| prolyl 4-hydroxylase 2 precursor, putative [Brugia malayi]
          Length = 405

 Score =  110 bits (275), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 59/159 (37%), Positives = 97/159 (61%), Gaps = 7/159 (4%)

Query: 4   PTHQRAQGNKLYYQEALN----KSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVP 59
           P H RA+GN  +Y++ L     +  +++ + P +NN  P  +   ++ YE LCR ++ + 
Sbjct: 239 PDHPRAKGNVRWYEDLLEDEGIRRADMRRKVPPMNN--PRDKSNLKDTYEALCRQEVPIN 296

Query: 60  PAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRR 119
               ++L C Y   + PYLRL P K E  +  P ++L+RD++ D E+ +I+ +A P+L R
Sbjct: 297 TKAQSRLYC-YYKMDRPYLRLAPFKVEIVHQNPLVVLFRDIVSDEEMRIIEMLAVPKLAR 355

Query: 120 ATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
           ATV N  TG +E A YR S+S+WL   EH V++RI++R+
Sbjct: 356 ATVHNVVTGNIETAFYRTSQSSWLGSTEHEVVKRINKRL 394


>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
 gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
          Length = 293

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 74/228 (32%), Positives = 114/228 (50%), Gaps = 14/228 (6%)

Query: 49  EMLCRGDLTVPPAIVAQLKCR---YVHR--NVPYLRLMPLKEEEAYL-----QPRIILYR 98
           E L RG+   PPA  A LK +   YV     +P   ++P  + +  +      P I +  
Sbjct: 47  EALARGEQ--PPA-AAPLKAQATGYVADAPRLPAGNVIPTHDRDVRVLLRVATPTIAVLD 103

Query: 99  DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
            V+ D E D + + +  +L+R+T  +   G  E+   R S+  +        I R+ RR+
Sbjct: 104 QVLDDEECDELIRRSADKLQRSTTVDPVNGGYEVIAARSSEGTFFPVNADDFIARLDRRI 163

Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVATVLFYMSDV 217
             +        E LQV++YG GG Y+PH+D+  PG+  +   +   G RV+T+L Y++DV
Sbjct: 164 AELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVGGQRVSTLLIYLNDV 223

Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           AQGGATVF +L L + P KG A ++   +  G  D  T H   PV  G
Sbjct: 224 AQGGATVFPTLGLRVLPRKGMAVYFEYSNRDGQVDPLTLHGGEPVEKG 271


>gi|198449528|ref|XP_002136919.1| GA26870 [Drosophila pseudoobscura pseudoobscura]
 gi|198130648|gb|EDY67477.1| GA26870 [Drosophila pseudoobscura pseudoobscura]
          Length = 491

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 76/234 (32%), Positives = 121/234 (51%), Gaps = 22/234 (9%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG+        + L CR+  R   Y RL   K EE  L P I+LY DV+   E++L+K 
Sbjct: 279 CRGEYPWK----STLHCRFSWRPSFYARL---KVEEVLLDPYIVLYHDVVSGKEMELLKD 331

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
             +  L   T    ++G L   +  + +S        P+++ + +R+  MTGL+ + +E 
Sbjct: 332 YGRTNL---THDPLRSG-LSAKHCALPESL-------PLVQSLHQRLWDMTGLSLNGSES 380

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
             + NYGIGG    H D+    E    + L   NR+ T+  ++S+V+QGG TVF +L ++
Sbjct: 381 WLITNYGIGGFLGLHKDYFDEIE----EELQGDNRLFTIQIFLSNVSQGGYTVFPNLEVA 436

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQRSG 285
           + P+ GTA  ++NL  S  GD  TRH  CPV+ G+  + +       + L+R G
Sbjct: 437 VKPQAGTALVFYNLLDSLVGDTRTRHFGCPVIDGNKWIATKFLSAKEQTLRRRG 490


>gi|195159168|ref|XP_002020454.1| GL13504 [Drosophila persimilis]
 gi|194117223|gb|EDW39266.1| GL13504 [Drosophila persimilis]
          Length = 491

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 76/234 (32%), Positives = 120/234 (51%), Gaps = 22/234 (9%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG+        + L CR+  R   Y RL   K EE  L P I+LY DV+   E++L+K 
Sbjct: 279 CRGEYPWK----STLHCRFSWRPSFYARL---KVEEVLLDPYIVLYHDVVSGKEMELLKD 331

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
             +  L   T    ++G L   +  + +S        P+++ + +R+  MTGL+ + +E 
Sbjct: 332 YGRTNL---THDPLRSG-LSAKHCALPESL-------PLVQSLHQRLWDMTGLSLNGSES 380

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
             + NYGIGG    H D+    E    + L   NR+ T+  ++S+V+QGG TVF +L ++
Sbjct: 381 WLITNYGIGGFLGLHKDYFDEIE----EELQGDNRLFTIQIFLSNVSQGGYTVFPNLEVA 436

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHSTCPCGLRRGLQRSG 285
           + P+ GTA  ++NL  S  GD  TRH  CPV+ G   + +       + L+R G
Sbjct: 437 VKPQAGTALVFYNLLDSLVGDTRTRHFGCPVIDGDKWIATKFLSAKEQTLRRRG 490


>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 215

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 57/175 (32%), Positives = 97/175 (55%), Gaps = 10/175 (5%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P I+ +  ++ D E   + + A PRLR + + N    E+     R S+  +  E E+P 
Sbjct: 29  EPLIMRFERLLTDDECRQLIEAAAPRLRESKLVNKVVSEI-----RTSRGMFFEEEENPF 83

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I RI +R+  +  +    AE LQV++YG G  Y+ HYDF  P   +A     + NR++T+
Sbjct: 84  IHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFFGPNSPSA-----SNNRISTL 138

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + Y++DV  GG TVF  L+L + PE+G+A ++   +   + +  T H++ PV+ G
Sbjct: 139 IIYLNDVEAGGETVFPLLDLEVKPERGSALYFEYFYRQQELNNLTLHSSVPVVRG 193


>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
          Length = 278

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 56/182 (30%), Positives = 97/182 (53%), Gaps = 4/182 (2%)

Query: 84  KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
           K ++   +PR  +Y   + D E D +  +A+  L+R+ V +   GE ++++ R S   ++
Sbjct: 37  KVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFI 96

Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
            + + P++  I  ++   T L     E+LQV+ Y  G  Y+ H+D+    + N  +    
Sbjct: 97  SKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHD-KVNIARG--- 152

Query: 204 GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVL 263
           G+R+ATVL Y+S+V +GG TVF    + L P+KG A  + NL      D ++ H  CPV+
Sbjct: 153 GHRIATVLLYLSNVTKGGETVFPDAQVCLKPKKGNALLFFNLQQDAIPDPFSLHGGCPVI 212

Query: 264 TG 265
            G
Sbjct: 213 EG 214


>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
 gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
          Length = 302

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 60/178 (33%), Positives = 95/178 (53%), Gaps = 1/178 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P  +L    +   E   + ++A+PRL R+TV +  TG   +A +R S   + R  E P+
Sbjct: 101 RPAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNIVAGHRSSDGMFFRLGETPL 160

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
           I RI +R+  +TG      E LQ+++Y  G    PH D+  PG  ANA     +G RV T
Sbjct: 161 ISRIEQRIAALTGFPVENGEGLQMLHYEAGAESTPHVDYLVPGNPANAESIARSGQRVGT 220

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +L Y++DV  GG T+F  +  S+ P +G A ++   + SG  D  + HA+ P+ +G  
Sbjct: 221 LLMYLNDVESGGETLFPQVGCSVVPRRGQAFYFEYGNGSGRSDPASLHASSPIGSGDK 278


>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 292

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 55/180 (30%), Positives = 96/180 (53%), Gaps = 1/180 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P++I++ DV+   E   + + ++ RL+R+T  N  TG+ ++   R S+  W +  E P 
Sbjct: 102 RPQMIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQRGEDPF 161

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
           IER+ RR+  +        E LQ++ YG  G Y PH+D+  P +  +   +   G RVAT
Sbjct: 162 IERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQGGQRVAT 221

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           ++ Y++DV  GG T+F    +S+   +G A ++  ++     D  T H   PVL+G   +
Sbjct: 222 LVIYLNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNGRRQLDPLTLHGGAPVLSGDKWI 281


>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
 gi|194697650|gb|ACF82909.1| unknown [Zea mays]
 gi|194708468|gb|ACF88318.1| unknown [Zea mays]
 gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
 gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
 gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
          Length = 308

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 102/207 (49%), Gaps = 21/207 (10%)

Query: 76  PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
           P   + P    +   +PR+ LY+  + D E + +  +A+  L+R+ V +  +G+  ++  
Sbjct: 42  PAAVVYPHHSRQISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEV 101

Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           R S   +LR+ + P++E I  ++   T L     E++QV+ Y  G  YEPHYD+      
Sbjct: 102 RTSSGTFLRKGQDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYF----T 157

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF-----------------TSLNLSLWPEKGT 238
           +   ++  G+R ATVL Y++DV +GG TVF                     +++ P KG 
Sbjct: 158 DNVNTVRGGHRYATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGD 217

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + NL+  G  D  + H  CPV+ G
Sbjct: 218 ALLFFNLNPDGTTDSVSLHGGCPVIKG 244


>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
 gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
          Length = 224

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 63/179 (35%), Positives = 91/179 (50%), Gaps = 7/179 (3%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR++++  ++ + E D +  +AQPRL R+   +  TG  E+   R S   +    E P+I
Sbjct: 37  PRVVVFGGLLSEQECDELVALAQPRLLRSETVDNSTGGSEVNAARTSDGMFFERGETPLI 96

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
           ERI RR+  +        E LQV++Y  G  Y+PH+DF   A PG AN  +    G RV 
Sbjct: 97  ERIERRIAELVHWPVERGEGLQVLHYRPGAQYKPHHDFFDPAHPGTANILRR--GGQRVG 154

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           TV+ Y++  A GGAT F  + L + P KG A F+   +        T H   PVL G  
Sbjct: 155 TVVIYLNTPAGGGATTFPEVGLEVQPIKGNAVFFS--YERPLASTRTLHGGAPVLDGEK 211


>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
           IL144]
 gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Rubrivivax gelatinosus IL144]
          Length = 279

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 68/198 (34%), Positives = 97/198 (48%), Gaps = 15/198 (7%)

Query: 71  VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGEL 130
           + R V  L +M L        PR++++  ++ D E D +  +A+PRL R+   +  TG  
Sbjct: 79  LDREVRVLAVMSL--------PRVVVFGGLLSDEECDELVALARPRLARSETVDNSTGGS 130

Query: 131 EIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF- 189
           E+   R S   +    E P+IERI RR+  +        E LQV+ Y  G  Y+PH+DF 
Sbjct: 131 EVNAARTSDGMFFERGEKPLIERIERRIAELVRWPVERGEGLQVLRYRPGAQYKPHHDFF 190

Query: 190 --ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHS 247
             A PG AN  +    G RV TV+ Y++  A GGAT F  + L + P KG A F+   + 
Sbjct: 191 DPAHPGTANILRR--GGQRVGTVVMYLNTPAGGGATTFPEVGLEVQPVKGNAVFFS--YE 246

Query: 248 SGDGDYYTRHAACPVLTG 265
                  T H   PVL G
Sbjct: 247 RPLASTRTLHGGAPVLDG 264


>gi|390352104|ref|XP_003727818.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
           [Strongylocentrotus purpuratus]
          Length = 121

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 50/100 (50%), Positives = 68/100 (68%), Gaps = 5/100 (5%)

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFT 226
           +  E LQ+ NYG+GGHY PH+DF R    +       GNR+A++LFY+SDVA+GG TVF 
Sbjct: 2   NATEFLQIANYGLGGHYLPHFDFTRDVATHK-----NGNRIASMLFYLSDVAKGGDTVFI 56

Query: 227 SLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
                + PEKG+A FW+NL  +G  D  T+HA+CPV++GS
Sbjct: 57  DAGAKIKPEKGSAIFWYNLFKNGKVDERTKHASCPVISGS 96


>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 318

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 58/153 (37%), Positives = 91/153 (59%), Gaps = 3/153 (1%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRA-TVQNYKTGELEIANYRISKSAWLREPEHPV 150
           PRI L+ DV+ D+E D +   ++ RL+R+  V N  +GE  + + R S  A+  + E+ +
Sbjct: 126 PRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSGEF-VDDTRTSYGAYFNKGENSL 184

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
           +  I RR+  +T    + AE LQ++NYG+GG Y PH+D+  P +      L + G R+AT
Sbjct: 185 VATIQRRIAELTRWPLTHAEPLQILNYGLGGEYLPHFDYFEPQQPGLPSPLESGGQRIAT 244

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
           V+ Y++DV  GG T+F  LNL   P KG A ++
Sbjct: 245 VVMYLNDVEAGGGTIFPHLNLETRPRKGGAIYF 277


>gi|198417608|ref|XP_002125299.1| PREDICTED: similar to prolyl-4-hydroxylase-alpha EFB CG31022-PA
           [Ciona intestinalis]
          Length = 471

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/178 (36%), Positives = 88/178 (49%), Gaps = 47/178 (26%)

Query: 135 YRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE 194
           YRIS +AWL + +   ++R+S+R+  +TGLT S+ E LQV NYG+ GHY  H+D     E
Sbjct: 265 YRISNTAWLDDKDSSSVKRLSQRLADVTGLTGSS-ELLQVANYGMAGHYIAHFDAMTREE 323

Query: 195 ANAFKSL----------------------------------------------GTGNRVA 208
            +  KSL                                               TG R+A
Sbjct: 324 EDYVKSLSNRQTVLSNITEDDLLDDKSIIGSADNKTVGTTQQPDDRNENYEYGNTGQRIA 383

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           T L Y+S+V +GG+T F   N+   P KG+A FW+NL+ SG  D  T HAACPVL G+
Sbjct: 384 TALVYLSEVQKGGSTAFFYPNIVAEPIKGSAVFWYNLYPSGALDKRTLHAACPVLIGN 441


>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
 gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
          Length = 295

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 55/180 (30%), Positives = 97/180 (53%), Gaps = 1/180 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P++I++ DV+   E   + + ++ RL+R+T  N +TG+ ++   R S+  W +  E   
Sbjct: 105 RPQVIVFGDVLSPDECAEMIERSRHRLKRSTTVNPETGKEDVIRNRTSEGIWYQRGEDAF 164

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
           IER+ RR+  +        E LQ+++YG  G Y PH+D+  P +  +A  +   G RVAT
Sbjct: 165 IERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQRVAT 224

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           ++ Y++DV  GG T+F    +S+   +G A ++  ++     D  T H   PVL G   +
Sbjct: 225 LVIYLNDVPDGGETIFPEAGISVAARQGGAVYFRYMNGQRQLDPLTLHGGAPVLGGDKWI 284


>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
 gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
          Length = 300

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 55/180 (30%), Positives = 99/180 (55%), Gaps = 1/180 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P++I++ +V+   E D + + ++ RL+R+T+ +  TG+  +   R S+  W +  E   
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQRGEDAF 169

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
           IER+ RR+  +        E LQ+++YG  G Y PH+D+  P +  +A  +   G RVAT
Sbjct: 170 IERLDRRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQRVAT 229

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           ++ Y++DVA GG T+F +  LS+  ++G A ++  ++     D  T H   PV  G   +
Sbjct: 230 LVVYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVRAGDKWI 289


>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
 gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
          Length = 307

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 60/193 (31%), Positives = 100/193 (51%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           QPRI +Y+  + D+E D +  +A+ +++R+ V + ++G+  ++  R S   +L + + PV
Sbjct: 50  QPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNQSGKSVMSEVRTSSGMFLNKRQDPV 109

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI  R+   T L    AE +Q++ Y  G  YEPH+D+      +    +  G+R ATV
Sbjct: 110 VSRIEERIAAWTFLPQENAENMQILRYEHGQKYEPHFDYFH----DKINQVRGGHRYATV 165

Query: 211 LFYMSDVAQGGATVFTSLN------------------LSLWPEKGTAAFWHNLHSSGDGD 252
           L Y+S V +GG TVF +                    L++ P KG A  + +LH  G  D
Sbjct: 166 LMYLSTVDKGGETVFPNAKGWESQPKDDTFSECAHQGLAVKPVKGDAVLFFSLHVDGVPD 225

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 226 PLSLHGSCPVIQG 238


>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
 gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
          Length = 256

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 61/206 (29%), Positives = 102/206 (49%), Gaps = 23/206 (11%)

Query: 81  MPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKS 140
           MP+  E    QPR  ++ + +   E D + ++AQP ++R+ V + +TG+ + +  R S  
Sbjct: 42  MPVWTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSG 101

Query: 141 AWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKS 200
            +LR  +  +I RI  R+   T +     E LQV++Y +G  Y+ H+D+      +   +
Sbjct: 102 TFLRRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFH----DKVNT 157

Query: 201 LGTGNRVATVLFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAF 241
              G RVATVL Y+SDV +GG TVF S                     +S+ P KG A  
Sbjct: 158 KNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGVSVKPRKGDALL 217

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGSN 267
           + ++    + D ++ H  CPV+ G+ 
Sbjct: 218 FWSMSPDAELDPFSLHGGCPVIKGNK 243


>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
          Length = 487

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 102/193 (52%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR+ +Y+  + D E D + K+ + +++R+ V + K+G+  ++  R S   +L + + PV
Sbjct: 63  RPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLDKRQDPV 122

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI +R+   T L    AE +Q++ Y  G  YEPH+D+         ++LG G+R ATV
Sbjct: 123 VSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHD---KVNQALG-GHRYATV 178

Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
           L Y+S V +GG TVF +                    L++ P KG A  + +LH  G  D
Sbjct: 179 LMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDAVLFFSLHIDGVPD 238

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 239 PLSLHGSCPVIEG 251


>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
 gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
          Length = 216

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 54/176 (30%), Positives = 95/176 (53%), Gaps = 9/176 (5%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P I++  +V+ D E D + + ++ R++R+ V N     LE+   R S S +  E E+ +
Sbjct: 37  EPLIVILGNVLSDEECDQLIQQSKDRMQRSKVAN----SLEVDELRTSSSTFFHEGENEI 92

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI +R+  +  +     E LQ++NY IG  Y+ H+DF       A     +  R++T+
Sbjct: 93  VARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFFSSTSRAA-----SNPRISTL 147

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + Y++DV QGG T F  LN S+ P+KG A ++   ++  + +  T H   PV+ G 
Sbjct: 148 VMYLNDVEQGGETYFPKLNFSVSPQKGMAVYFEYFYNDQNLNDLTLHGGAPVVMGD 203


>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
 gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
          Length = 256

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 61/206 (29%), Positives = 102/206 (49%), Gaps = 23/206 (11%)

Query: 81  MPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKS 140
           MP+  E    QPR  ++ + +   E D + ++AQP ++R+ V + +TG+ + +  R S  
Sbjct: 42  MPVWTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSG 101

Query: 141 AWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKS 200
            +LR  +  +I RI  R+   T +     E LQV++Y +G  Y+ H+D+      +   +
Sbjct: 102 TFLRRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFH----DKVNT 157

Query: 201 LGTGNRVATVLFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAF 241
              G RVATVL Y+SDV +GG TVF S                     +S+ P KG A  
Sbjct: 158 KNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGVSVKPRKGDALL 217

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGSN 267
           + ++    + D ++ H  CPV+ G+ 
Sbjct: 218 FWSMSPDAELDPFSLHGGCPVIKGNK 243


>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
          Length = 319

 Score =  107 bits (267), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 59/207 (28%), Positives = 103/207 (49%), Gaps = 25/207 (12%)

Query: 80  LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
           + P    +   +PR+ LY+  + D E + +  +A+  L+R+ V +  +G+ E+++ R S 
Sbjct: 53  VYPHHSRQISWKPRVFLYQHFLSDDEANHLVSLARAELKRSAVADNLSGKSELSDARTSS 112

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
             ++R+ + P++  I  ++   T L     E++QV+ Y  G  YE HYD+     ++   
Sbjct: 113 GTFIRKSQDPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYF----SDNVN 168

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGT 238
           +L  G+R+ATVL Y++DVA+GG TVF                         +++ P KG 
Sbjct: 169 TLRGGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDSTLSECAKKGVAVKPRKGD 228

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + NL      D  + HA CPV+ G
Sbjct: 229 ALLFFNLSPDASKDSLSLHAGCPVIKG 255


>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 280

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 100/201 (49%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P L +++V + KTG+   +  R S   +L+  +  +
Sbjct: 75  EPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKSTESRVRTSSGMFLKRGKDKI 134

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I+ I RR+   T +     E LQV++YG+G  YEPHYD+      + F +   G RVATV
Sbjct: 135 IQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYF----LDEFNTKNGGQRVATV 190

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LSL P+ G A  + ++      
Sbjct: 191 LMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMRPDATL 250

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 251 DASSLHGGCPVIVG-NKWSST 270


>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
 gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
           Group]
 gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
 gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
 gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 319

 Score =  107 bits (266), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 59/207 (28%), Positives = 103/207 (49%), Gaps = 25/207 (12%)

Query: 80  LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
           + P    +   +PR+ LY+  + D E + +  +A+  L+R+ V +  +G+ E+++ R S 
Sbjct: 53  VYPHHSRQISWKPRVFLYQHFLSDDEANHLVSLARTELKRSAVADNLSGKSELSDARTSS 112

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
             ++R+ + P++  I  ++   T L     E++QV+ Y  G  YE HYD+     ++   
Sbjct: 113 GTFIRKSQDPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYF----SDNVN 168

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGT 238
           +L  G+R+ATVL Y++DVA+GG TVF                         +++ P KG 
Sbjct: 169 TLRGGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDSTLSECAKKGVAVKPRKGD 228

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + NL      D  + HA CPV+ G
Sbjct: 229 ALLFFNLSPDASKDSLSLHAGCPVIKG 255


>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
 gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
          Length = 308

 Score =  107 bits (266), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 102/207 (49%), Gaps = 21/207 (10%)

Query: 76  PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
           P   + P    +   +PR+ LY+  + D E + +  +A+  L+R+ V +  +G+  +++ 
Sbjct: 42  PAAVVYPHHSRQISWKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSDV 101

Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           R S   +LR+ + P++E I  ++   T L     E++QV+ Y  G  YEPHYD+      
Sbjct: 102 RTSSGTFLRKGQDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYF----T 157

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF-----------------TSLNLSLWPEKGT 238
           +   ++  G+R ATVL Y++DVA+GG TVF                     +++ P KG 
Sbjct: 158 DNVNTIRGGHRYATVLLYLTDVAEGGETVFPLAEEVDDAKDATFSECAQKGIAVKPRKGD 217

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + NL   G  D  + H  C V+ G
Sbjct: 218 ALLFFNLKPDGTTDPVSLHGGCAVIRG 244


>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
 gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
          Length = 267

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 68/205 (33%), Positives = 103/205 (50%), Gaps = 24/205 (11%)

Query: 79  RLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRIS 138
           R++ L E     +P+  LYR  +  +E D IK+ A+P+L ++TV + KTG+   +N R S
Sbjct: 4   RIVKLSE-----KPKAYLYRGFLRQAECDYIKERAKPKLEKSTVVDNKTGQSVPSNIRTS 58

Query: 139 KSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAF 198
              +    E  +IE I RR+   T +     E +QV+ Y +G  YEPH D A   + N  
Sbjct: 59  DGMFFDRHEDDIIEDIERRIAEWTNVPWENGEGIQVLRYEVGQKYEPHLD-AFSDKFNTE 117

Query: 199 KSLGTGNRVATVLFYMSDVAQGGATVF-----------------TSLNLSLWPEKGTAAF 241
           +S G G R+ATVL Y+SDV +GG TVF                     +++   KG A  
Sbjct: 118 ESKG-GQRMATVLMYLSDVEEGGETVFPRSVDKPHKGDPKWSECAQRGVAVKARKGDALL 176

Query: 242 WHNLHSSGDGDYYTRHAACPVLTGS 266
           + +L    + D  + H  CPV+ G+
Sbjct: 177 FWSLDIDSNVDELSLHGGCPVIKGT 201


>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
          Length = 344

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 66/192 (34%), Positives = 95/192 (49%), Gaps = 25/192 (13%)

Query: 93  RIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIE 152
           RI LY + + D E D I K+A+P + R+ V    +G+ +I N R SK  +L      VI 
Sbjct: 71  RIFLYHNFLTDEECDHIIKLAEPTMARSGVVETDSGKSKIDNVRTSKGTFLNRGHDSVIA 130

Query: 153 RISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEANAFKSLGTGNRVATV 210
            I  R+   T +     E LQV+ Y  G  YE HYD  F + G AN       GNR  TV
Sbjct: 131 DIEARIAKWTLMPAGNGEGLQVLKYEHGQEYEGHYDYFFHKAGTANG------GNRYLTV 184

Query: 211 LFYMSDVAQGGATVFTSLN-----------------LSLWPEKGTAAFWHNLHSSGDGDY 253
           L Y++DV +GG T F ++                  L+  P+KG A  +H++  +G+ + 
Sbjct: 185 LMYLNDVEEGGETCFPNIPSPNGDNGPEFSECARKVLAAKPKKGNAVLFHSIKPTGELER 244

Query: 254 YTRHAACPVLTG 265
            + H ACPV+ G
Sbjct: 245 RSLHTACPVIKG 256


>gi|241710333|ref|XP_002412045.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215505100|gb|EEC14594.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 440

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 70/210 (33%), Positives = 109/210 (51%), Gaps = 18/210 (8%)

Query: 21  NKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRL 80
           N S E K + P  +    + EV E + Y+ LCRG+L   P + +QL+CRY      +  L
Sbjct: 235 NISNEPKHKVPVRDPTKHSAEVIEHQNYKRLCRGELLRSPKMDSQLRCRYYKGQDGFFTL 294

Query: 81  MPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYR---- 136
            P+K EE  L+P II+  +V+ D +I  +   A+PR R+     +    L+  N +    
Sbjct: 295 QPVKLEEVNLKPYIIVMHNVVQDRDIKDMIDFAEPRARKTPALYF----LKKGNTKTHIL 350

Query: 137 --ISKSAWLREPEHPVIERISRRVEHMTGLTTS----TAEELQVVNYGIGGHYEPHYDFA 190
             I + AWL E   P+  R++R +  + G++ S     AE  Q+ NYGIGG Y PH D+ 
Sbjct: 351 LPIYQRAWLGEDSAPIANRMNRYLRALVGMSASGSNLDAEPYQLANYGIGGQYLPHNDYL 410

Query: 191 RPG-EANA---FKSLGTGNRVATVLFYMSD 216
           +    AN    +     G+RVAT++ Y+S+
Sbjct: 411 QDALHANTSEYYVHHKAGDRVATLMIYVSE 440


>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 328

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 66/198 (33%), Positives = 96/198 (48%), Gaps = 20/198 (10%)

Query: 86  EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE 145
           E    +P   +YR  +   E D +K +A P L R+TV +   G    ++ R S   +L  
Sbjct: 57  ERVSWRPHAEVYRGFLTREECDHLKALATPSLGRSTVVDASNGGSVPSDIRTSSGMFLLR 116

Query: 146 PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
            E  V+  I RR+   T +  S  E  QV+ Y  G  Y PH+D+ +  E N  +  G G 
Sbjct: 117 GEDDVVASIERRIASWTHVPESHGEGFQVLRYEFGQEYRPHFDYFQD-EFNQKREKG-GQ 174

Query: 206 RVATVLFYMSDVAQGGATVF------------------TSLNLSLWPEKGTAAFWHNLHS 247
           RVATVL Y++DV +GG T+F                   +  L++ P KG A F+ +LH 
Sbjct: 175 RVATVLMYLTDVEEGGETIFPDAEAGANPGGGDDASSCAAGKLAVKPRKGDALFFRSLHH 234

Query: 248 SGDGDYYTRHAACPVLTG 265
           +G  D  + HA CPV+ G
Sbjct: 235 NGTSDAMSSHAGCPVVKG 252


>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
 gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
          Length = 285

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 57/176 (32%), Positives = 93/176 (52%), Gaps = 1/176 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P+I+++ +V+   E D + + +  +L ++T  N +TG  E+  +R S   W +  E  +
Sbjct: 95  RPQIVVFGNVLDQDECDEMIQRSMHKLEQSTTVNAETGTQEVIRHRTSHGTWFQNGEDAL 154

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
           I RI  R+  +        E LQV+ Y  GG Y  HYD+ +P  A +   + T G RVAT
Sbjct: 155 IRRIETRLAALMNCPVENGEGLQVLRYTPGGEYRSHYDYFQPTAAGSLTHVRTGGQRVAT 214

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           ++ Y++DV  GG TVF    +S+ P +G A ++  ++     D  T HA  PV  G
Sbjct: 215 LIVYLNDVPSGGETVFPEAGISVVPRRGDAVYFRYMNRLRQLDPATLHAGAPVRDG 270


>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
 gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
          Length = 297

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 58/176 (32%), Positives = 94/176 (53%), Gaps = 1/176 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P  +L  + +  SE D +  +A+PRL R+TV +  TG    A +R S   + R  E P+
Sbjct: 101 RPAAVLLDEFLTGSECDQLIALARPRLSRSTVVDPVTGRDVAAGHRSSDGTFFRLAETPL 160

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVAT 209
           + R+  R+  +TGL     E LQ++ Y  G    PH D+   G     +S+  +G RV T
Sbjct: 161 VARLEMRIAALTGLAAENGEGLQLLRYQPGAESTPHVDYLVAGNETNRESIARSGQRVGT 220

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +L Y++DV  GG TVF  +  S+ P +G A ++   + +G  D  + HA+ P+ +G
Sbjct: 221 LLMYLNDVEGGGETVFPQVGCSVVPRRGQALYFEYCNRAGVCDPASLHASTPLRSG 276


>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
 gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
          Length = 286

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/195 (30%), Positives = 105/195 (53%), Gaps = 4/195 (2%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           QP I L  DV+ D+E D + ++ +  ++R++V +  +G+      R S+ A++      +
Sbjct: 93  QPVIALVADVLDDTECDRLIEIGREHVQRSSVVDPDSGKEITIEERRSEGAFVNASTDAL 152

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
           +E I RR+  +        E+L ++ YG+GG Y PHYD+    +A +   +   G R+AT
Sbjct: 153 VETIDRRIAELFRQPVENGEDLHILRYGMGGEYRPHYDYFPEEQAGSKHHMQRGGQRIAT 212

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           V+ Y+++V QGG T F  + L++ P +G+A ++  ++  G  D  T HA  PV  G   +
Sbjct: 213 VILYLNEVEQGGDTTFPDIGLAIHPRRGSALYFEYVNELGQSDPKTLHAGTPVEKGEKWI 272

Query: 270 HSTCPCGLRRGLQRS 284
            +     +RRG  R+
Sbjct: 273 ATKW---IRRGRFRA 284


>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
 gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
          Length = 300

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 54/180 (30%), Positives = 99/180 (55%), Gaps = 1/180 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P++I++ +V+   E D + + ++ RL+R+T+ +  TG+  +   R S+  W +  E   
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQRGEDAF 169

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
           IER+ +R+  +        E LQ+++YG  G Y PH+D+  P +  +A  +   G RVAT
Sbjct: 170 IERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQRVAT 229

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           ++ Y++DVA GG T+F +  LS+  ++G A ++  ++     D  T H   PV  G   +
Sbjct: 230 LVVYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVHAGDKWI 289


>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
          Length = 487

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 60/193 (31%), Positives = 101/193 (52%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR+ +Y+  + D E D + K+ + +++R+ V + K+G+  ++  R S   +L + + PV
Sbjct: 63  RPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLDKRQDPV 122

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI +R+   T L    AE +Q++ Y  G  YEPH+D+         ++LG G+R ATV
Sbjct: 123 VSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHD---KVNQALG-GHRYATV 178

Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
           L Y+S V +GG TVF +                    L++ P KG    + +LH  G  D
Sbjct: 179 LMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDTVLFFSLHIDGVPD 238

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 239 PLSLHGSCPVIEG 251


>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
 gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
          Length = 272

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/178 (34%), Positives = 89/178 (50%), Gaps = 1/178 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           QP+IIL  +V+ D E D I      R  R+TV     G   +   R S+ A+++  E  V
Sbjct: 82  QPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHEGRTSEMAFIQRGEAEV 141

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVAT 209
            ERI RR+  +       +E  Q+  Y     Y PHYD+  P  +     L   G R+AT
Sbjct: 142 AERIERRLAALAHWPAECSEPFQLQKYDATQEYRPHYDWLDPDSSGHRSHLARGGQRLAT 201

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            + Y+SDV QGG TVF  L L ++P+KG+A ++ N   +   D  T H   PV+ G+ 
Sbjct: 202 FILYLSDVEQGGGTVFPGLGLEVYPKKGSALWFLNTDINHQPDKRTLHGGAPVVRGTK 259


>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
 gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
          Length = 279

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 60/181 (33%), Positives = 92/181 (50%), Gaps = 1/181 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           P +++  + +   E   +  +A+ ++  ATV +  TGE      R S +A     EHP+I
Sbjct: 91  PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQDRTSMNAAFARAEHPLI 150

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTG-NRVATV 210
            R+  R+           E +QV+ Y  GG Y+ H+D+         K++ TG  RV T 
Sbjct: 151 ARLEARIAAAIHWPAENGEGMQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQTGGQRVGTF 210

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLH 270
           L Y+ DV  GGAT F +LN  + P+KG A F+ N   +G+G+  T HA  PV++G   L 
Sbjct: 211 LVYLCDVDAGGATRFPALNFEIRPKKGMALFFANTLPNGEGNPLTLHAGVPVVSGVKYLA 270

Query: 271 S 271
           S
Sbjct: 271 S 271


>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
 gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
          Length = 309

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 93/183 (50%), Gaps = 7/183 (3%)

Query: 88  AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
           A  QPR++L+ +++   E D I   A+PR+ R+     +TG  E+ + R S   + +  E
Sbjct: 118 AMAQPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDDRTSNGMFFQREE 177

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNR 206
           +PV+ R+  R+  +        E LQV++Y  G  Y+PHYD+  P E      L   G R
Sbjct: 178 NPVVARLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILRRGGQR 237

Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPVLT 264
           VAT++ Y++D  +GG T F  ++L + P +G A F  +   H S      T H   PV+ 
Sbjct: 238 VATIVIYLNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERPHPS----TRTLHGGAPVVA 293

Query: 265 GSN 267
           G  
Sbjct: 294 GDK 296


>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
 gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
 gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 319

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 60/193 (31%), Positives = 101/193 (52%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR+ +Y+  + D E D + K+ + +++R+ V + K+G+  ++  R S   +L + + PV
Sbjct: 63  RPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLDKRQDPV 122

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI +R+   T L    AE +Q++ Y  G  YEPH+D+         ++LG G+R ATV
Sbjct: 123 VSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHD---KVNQALG-GHRYATV 178

Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
           L Y+S V +GG TVF +                    L++ P KG    + +LH  G  D
Sbjct: 179 LMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDTVLFFSLHIDGVPD 238

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 239 PLSLHGSCPVIEG 251


>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
           sativa Japonica Group]
 gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
           sativa Japonica Group]
          Length = 313

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 60/193 (31%), Positives = 101/193 (52%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR+ +Y+  + D E D + K+ + +++R+ V + K+G+  ++  R S   +L + + PV
Sbjct: 57  RPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLDKRQDPV 116

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI +R+   T L    AE +Q++ Y  G  YEPH+D+         ++LG G+R ATV
Sbjct: 117 VSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHD---KVNQALG-GHRYATV 172

Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
           L Y+S V +GG TVF +                    L++ P KG    + +LH  G  D
Sbjct: 173 LMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDTVLFFSLHIDGVPD 232

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 233 PLSLHGSCPVIEG 245


>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
          Length = 299

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 97/193 (50%), Gaps = 23/193 (11%)

Query: 92  PRIILYRDVMYDSEID-LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           PR+ LY   + D+E + LI    Q R+ R+TV N K+GE  ++  R S   +L   +  V
Sbjct: 44  PRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFLIRKQDEV 103

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI  R+   T       E +Q++ YG G  YEPH+D+ R  +A+A      G+R+ATV
Sbjct: 104 VARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARG----GHRIATV 159

Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
           L Y+S+V  GG TVF      L       W           P KG+A  + +L+ +   D
Sbjct: 160 LMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPNATFD 219

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 220 PGSLHGSCPVIQG 232


>gi|341893180|gb|EGT49115.1| CBN-PHY-4 protein [Caenorhabditis brenneri]
          Length = 282

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 78/260 (30%), Positives = 123/260 (47%), Gaps = 32/260 (12%)

Query: 41  EVTE-REKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRD 99
           E+ E  EK    C  +L    A   QL C  + +++ + ++  L      LQP I+ Y +
Sbjct: 21  EIIEFNEKMWEKCGKELRGNSATNPQLVCFQIKKHLLFRKMEILS-----LQPFIVQYHN 75

Query: 100 VMYDSEIDLIKKMAQPRLRRATVQNY-------KTGELEIANYRISKSAWLREPEHPVIE 152
           +++       +++A+  +R + V          KT   E +  R +   WL         
Sbjct: 76  LVH-------RRLAKRAVRESEVLQLEQLKISGKTETPEKSQVRAANGTWLMHTNRLNFA 128

Query: 153 RISRRVE-HMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVL 211
           RI R ++ ++  L  +TAE  Q+++Y   G+Y PHYDF  P E N       GNR+ATVL
Sbjct: 129 RIFRNLQLNIDALDLTTAEPWQILSYNSDGYYAPHYDFLNP-ETNRQLVDSRGNRIATVL 187

Query: 212 FYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN---- 267
             +    +GG TVF  +NL++ P+ G    W N  SSG+ D  T HAACP+  G+     
Sbjct: 188 VILQIAKKGGTTVFPKINLNIRPKAGDVIVWLNTLSSGESDPQTLHAACPIKEGNKIGAT 247

Query: 268 -SLHS-----TCPCGLRRGL 281
             +HS     + PC L+  +
Sbjct: 248 LWVHSKGQELSLPCSLQENV 267


>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
 gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
          Length = 313

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 53/178 (29%), Positives = 98/178 (55%), Gaps = 1/178 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P++I++ +V+   E   + + ++ RL+R+T+ +  TG  ++   R S+  W +  E  +
Sbjct: 123 RPQVIVFGNVLSPDECAEMIERSRHRLKRSTIVDPATGREDVIRNRTSEGIWYQRGEDAL 182

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
           IER+ +R+  +        E LQ+++YG  G Y PH+D+  P +  +A  +   G RVAT
Sbjct: 183 IERLDQRIASLMNWPLENGEGLQILHYGPSGEYRPHFDYFPPDQPGSAVHTARGGQRVAT 242

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           ++ Y++DV  GG T+F    LS+  ++G A ++  ++     D  T H   PVL+G  
Sbjct: 243 LVVYLNDVPDGGETIFPEAGLSVAAQQGGAVYFRYMNGRRQLDPLTLHGGAPVLSGDK 300


>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 217

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 53/177 (29%), Positives = 96/177 (54%), Gaps = 12/177 (6%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P I++  +V+ D E D + ++++ R+ R+ + N       + N R S S ++ E E+ +
Sbjct: 38  EPLIVVLGNVLSDEECDELIRLSKDRINRSKIAN-----ANVDNMRTSSSTFIEENENII 92

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVAT 209
           + RI +R+  +  + T   E LQ++NY +G  Y+ H+D F+ P  A          R++T
Sbjct: 93  VSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFSSPHNA------INNPRIST 146

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           ++ Y+SDV QGG T F  L+ S+ P+KG A ++   ++    +  T H   PV+ G 
Sbjct: 147 LVMYLSDVEQGGETYFPKLHFSVSPQKGMAVYFEYFYNDQTLNELTLHGGAPVIVGD 203


>gi|260806885|ref|XP_002598314.1| hypothetical protein BRAFLDRAFT_204780 [Branchiostoma floridae]
 gi|229283586|gb|EEN54326.1| hypothetical protein BRAFLDRAFT_204780 [Branchiostoma floridae]
          Length = 282

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 57/150 (38%), Positives = 84/150 (56%), Gaps = 15/150 (10%)

Query: 142 WLREPEHPVIERISRRVEHMTGLTTS--TAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
           W+ + E  V+ ++SR V H+TGL T+  T +  QV+NYG+GG YEPHYD  +       +
Sbjct: 134 WVPDTEDLVVAKLSRMVAHITGLNTTFPTGDNFQVLNYGLGGQYEPHYDHLK---EEVSR 190

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAA 259
           +L   NR+ T LFY+S+V  GGATVFT  N+++   K +A  + N + +      + HA 
Sbjct: 191 TLMAANRILTFLFYLSEVEAGGATVFTEANIAVPVVKNSAVLFENTNKALVRSRASVHAG 250

Query: 260 CPVLTGSNSLHSTC----------PCGLRR 279
           CPVL GS  + +            PCGL +
Sbjct: 251 CPVLIGSKWVANKWIHEVGNELQRPCGLTQ 280


>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
 gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
 gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
          Length = 274

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 60/193 (31%), Positives = 98/193 (50%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
            PRI +Y+  + D+E D +  +A+ +++R+ V + ++G+   +  R S   +L + + PV
Sbjct: 51  HPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSEVRTSSGMFLDKRQDPV 110

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI  R+   T L    AE +QV+ Y  G  YEPH+D+      +       G+R ATV
Sbjct: 111 VSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFH----DRVNQARGGHRYATV 166

Query: 211 LFYMSDVAQGGATVFTSLN------------------LSLWPEKGTAAFWHNLHSSGDGD 252
           L Y+S V +GG TVF +                    L++ P KG A  + +LH+ G  D
Sbjct: 167 LMYLSTVREGGETVFPNAKGWESQPKDATFSECAHKGLAVKPVKGDAVLFFSLHADGTPD 226

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 227 PLSLHGSCPVIRG 239


>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
 gi|255641119|gb|ACU20838.1| unknown [Glycine max]
          Length = 297

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 61/207 (29%), Positives = 105/207 (50%), Gaps = 25/207 (12%)

Query: 80  LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
           + P K ++   +PR  +Y   + D E D +  +A+  L+R+ V +  +GE ++++ R S 
Sbjct: 31  INPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSS 90

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
             ++ + + P++  I  ++   T L     E++QV  Y  G  Y+PHYD+      +   
Sbjct: 91  GMFISKNKDPIVAGIEDKISSWTFLPKENGEDIQVSRYEHGQKYDPHYDYF----TDKVN 146

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGT 238
               G+R+ATVL Y++DVA+GG TVF             TS +LS        + P +G 
Sbjct: 147 IARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAVKPRRGD 206

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + +LH++   D  + HA CPV+ G
Sbjct: 207 ALLFFSLHTNATPDTSSLHAGCPVIEG 233


>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
 gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
          Length = 285

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 55/182 (30%), Positives = 94/182 (51%), Gaps = 1/182 (0%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P++I + DV+   E   + + A+ RL+R+T  N + G  ++   R S+  W +  E 
Sbjct: 93  FERPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQRCED 152

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRV 207
             IER+  R+  +        E LQ+++Y  GG Y PH+D+  PG+  +   +   G RV
Sbjct: 153 AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRV 212

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           AT++ Y+SDV  GG TVF    L++   +G A ++  ++     D  T H   PV +G  
Sbjct: 213 ATLIVYLSDVEGGGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDK 272

Query: 268 SL 269
            +
Sbjct: 273 WI 274


>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
 gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
          Length = 307

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 52/180 (28%), Positives = 97/180 (53%), Gaps = 1/180 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P++I++ +V+   E D + + ++ RL+R+T+ +  TG+ ++   R S+  W +  E   
Sbjct: 117 RPQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEDVIRNRTSEGIWYQRGEDAF 176

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAF-KSLGTGNRVAT 209
           IER+ +R+  +        E LQ+++YG  G Y PH+D+  P +  +   +   G RVAT
Sbjct: 177 IERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSMVHTARGGQRVAT 236

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           ++ Y++DV  GG T+F    LS+  ++G A ++  ++     D  T H   PV  G   +
Sbjct: 237 LVIYLNDVPDGGETIFPEAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVRAGDKWI 296


>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
 gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
          Length = 282

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 55/182 (30%), Positives = 94/182 (51%), Gaps = 1/182 (0%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P++I + DV+   E   + + A+ RL+R+T  N + G  ++   R S+  W +  E 
Sbjct: 90  FERPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQRCED 149

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRV 207
             IER+  R+  +        E LQ+++Y  GG Y PH+D+  PG+  +   +   G RV
Sbjct: 150 AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRV 209

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           AT++ Y+SDV  GG TVF    L++   +G A ++  ++     D  T H   PV +G  
Sbjct: 210 ATLIVYLSDVEGGGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDK 269

Query: 268 SL 269
            +
Sbjct: 270 WI 271


>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
          Length = 299

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 96/193 (49%), Gaps = 23/193 (11%)

Query: 92  PRIILYRDVMYDSEID-LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           PR+ LY   + D E + LI    Q R+ R+TV N K+GE  ++  R S   +L   +  V
Sbjct: 44  PRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFLIRKQDEV 103

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI  R+   T       E +Q++ YG G  YEPH+D+ R  +A+A      G+R+ATV
Sbjct: 104 VARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARG----GHRIATV 159

Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
           L Y+S+V  GG TVF      L       W           P KG+A  + +L+ +   D
Sbjct: 160 LMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPNATFD 219

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 220 PGSLHGSCPVIQG 232


>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 291

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 97/195 (49%), Gaps = 23/195 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR ++Y + + + E + +  +A+P + ++TV + KTG  + +  R S   +LR     V
Sbjct: 86  EPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLRRGHDEV 145

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +E I +R+   T +     E LQV++Y +G  YEPHYD+      + F +   G R+ATV
Sbjct: 146 VEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYF----LDEFNTKNGGQRIATV 201

Query: 211 LFYMSDVAQGGATVFTSL-------------------NLSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV  GG TVF +                     LS+ P+K  A  + N+      
Sbjct: 202 LMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASL 261

Query: 252 DYYTRHAACPVLTGS 266
           D  + H  CPV+ G+
Sbjct: 262 DPSSLHGGCPVVKGN 276


>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 263

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 101/201 (50%), Gaps = 21/201 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P + ++   +PR  LY + + D+E D +  +A+ +L ++ V + ++G+   +  R S   
Sbjct: 3   PTRVKQLSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSEIRTSSGM 62

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           +L + +  +I RI  R+   T L     E +QV+ Y  G  YEPH+D+       A   L
Sbjct: 63  FLMKGQDDIISRIEDRIAAWTFLPKENGEAIQVLRYQDGEKYEPHFDYFHDKNNQA---L 119

Query: 202 GTGNRVATVLFYMSDVAQGGATVFTS-----------------LNLSLWPEKGTAAFWHN 244
           G G+R+ATVL Y+SDV +GG TVF S                   +++ P KG A  + +
Sbjct: 120 G-GHRIATVLMYLSDVVKGGETVFPSSEDRGGPKDDSWSACGKTGVAVKPRKGDALLFFS 178

Query: 245 LHSSGDGDYYTRHAACPVLTG 265
           LH S   D  + H  CPV+ G
Sbjct: 179 LHPSAVPDESSLHTGCPVIEG 199


>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 306

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 96/194 (49%), Gaps = 23/194 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  LY + +   E + +  +A+P ++++TV +  TG  + +  R S   +LR  +  V
Sbjct: 101 EPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGTFLRRGQDKV 160

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y +G  YEPH+D+      + F +   G R+AT+
Sbjct: 161 IRTIEKRISDFTFIPAENGEGLQVLHYEVGQKYEPHFDYFH----DDFNTKNGGQRIATL 216

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF S                     +S+ P+ G A  + ++   G  
Sbjct: 217 LMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALLFWSMRPDGTL 276

Query: 252 DYYTRHAACPVLTG 265
           D  + H  CPV+ G
Sbjct: 277 DPTSLHGGCPVIKG 290


>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
           Arabidopsis thaliana
 gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
 gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
           thaliana]
 gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 291

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 97/195 (49%), Gaps = 23/195 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR ++Y + + + E + +  +A+P + ++TV + KTG  + +  R S   +LR     V
Sbjct: 86  EPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLRRGHDEV 145

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +E I +R+   T +     E LQV++Y +G  YEPHYD+      + F +   G R+ATV
Sbjct: 146 VEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYF----LDEFNTKNGGQRIATV 201

Query: 211 LFYMSDVAQGGATVFTSL-------------------NLSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV  GG TVF +                     LS+ P+K  A  + N+      
Sbjct: 202 LMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRPDASL 261

Query: 252 DYYTRHAACPVLTGS 266
           D  + H  CPV+ G+
Sbjct: 262 DPSSLHGGCPVVKGN 276


>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 297

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 102/205 (49%), Gaps = 25/205 (12%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K ++   +PR  +Y   + D E D +  +A+  L+R+ V + ++G+ +++  R S   
Sbjct: 33  PSKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSSGM 92

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + + P+I  I  ++   T L     E+LQV+ Y  G  Y+PHYD+     A+     
Sbjct: 93  FIAKGKDPIIAGIEEKISTWTFLPKENGEDLQVLRYEHGQKYDPHYDYF----ADKINIA 148

Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSL---------------------NLSLWPEKGTAA 240
             G+R+ATVL Y+SDV +GG TVF +                       +S+ P +G A 
Sbjct: 149 RGGHRMATVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDLSECAKKGISVKPRRGDAL 208

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
            + +LH +   D  + HA CPV+ G
Sbjct: 209 LFFSLHPTAIPDPNSLHAGCPVIEG 233


>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 300

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 96/196 (48%), Gaps = 23/196 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P ++++TV +  TG  + +  R S   +LR  +  +
Sbjct: 95  EPRAFIYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGTFLRRGQDKI 154

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I +R+   T +     E LQV++Y +G  YEPH+D+      + F +   G R+ATV
Sbjct: 155 VRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYEPHFDYFH----DDFNTKNGGQRIATV 210

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF S                     +S+ P+ G A  + ++   G  
Sbjct: 211 LMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALLFWSMRPDGTL 270

Query: 252 DYYTRHAACPVLTGSN 267
           D  + H  CPV+ G  
Sbjct: 271 DPTSLHGGCPVIKGDK 286


>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 296

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/173 (34%), Positives = 95/173 (54%), Gaps = 4/173 (2%)

Query: 94  IILYRD-VMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIE 152
            IL+ D  + + E D + +M++ RL+ +TV + KTGE + A  R SK       E+  I+
Sbjct: 110 FILHLDYFLSEEECDQLIEMSRERLKPSTVIDPKTGEEKAATGRTSKGMSFYLQENEFIK 169

Query: 153 RISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLF 212
           ++ +R+  +        E LQV+NYGIG  Y+ H+D+    +    K    G RV T L 
Sbjct: 170 KVEKRIAELIEFPVENGEGLQVLNYGIGEEYKSHFDYFPQSKVVPEKG---GQRVGTFLI 226

Query: 213 YMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           Y++DV  GG TVF    +S+ P+KG+A ++   +S G+ D  + H++ PV  G
Sbjct: 227 YLNDVPAGGETVFPKAGVSIVPKKGSAVYFQYGNSKGEVDRMSLHSSIPVSEG 279


>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
          Length = 350

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/198 (28%), Positives = 97/198 (48%), Gaps = 21/198 (10%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           QPR  +   ++ + E + I ++A+P ++R+TV +  TGE++    R SK  +L   ++PV
Sbjct: 86  QPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEIKTDPIRTSKQTFLARGKYPV 145

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK-SLGTGNRVAT 209
           + R+  R+   T L     E++Q+++YG+G  Y  H+D       +  + S   G RVAT
Sbjct: 146 VTRVEERLSRFTMLPWYNGEDMQILSYGVGEKYSAHHDVGEKNTKSGQQLSADGGQRVAT 205

Query: 210 VLFYMSDVAQGGATVF--------------------TSLNLSLWPEKGTAAFWHNLHSSG 249
           VL Y+ D  +GG T F                        ++  P++G    + ++   G
Sbjct: 206 VLLYLQDTEEGGETAFPDSEWIEPESEYAQQKFSECAKNGVAFKPKRGDGLLFFSITPEG 265

Query: 250 DGDYYTRHAACPVLTGSN 267
           D D  + HA CPV+ G+ 
Sbjct: 266 DIDQKSMHAGCPVVKGTK 283


>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
 gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
          Length = 302

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/181 (32%), Positives = 92/181 (50%), Gaps = 7/181 (3%)

Query: 88  AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
           A  QPRI+++ +++   E D +   AQPRL R+     KTG  EI + R S   + +  +
Sbjct: 111 AMAQPRIVVFGNLLSPEECDALIADAQPRLARSLTVATKTGGEEINDDRTSDGMFFQRGQ 170

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNR 206
            P+I+RI  R+  +        E LQV++Y  G  Y+PHYD+  P E      +   G R
Sbjct: 171 SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIVNRGGQR 230

Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPVLT 264
           V T++ Y++   +GG T F  ++L + P++G A F  +   H S      T H   PV+ 
Sbjct: 231 VGTLVMYLNTPEKGGGTTFPDVHLEVAPQRGNAVFFSYERPHPS----TRTLHGGAPVIA 286

Query: 265 G 265
           G
Sbjct: 287 G 287


>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
          Length = 318

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 105/213 (49%), Gaps = 22/213 (10%)

Query: 71  VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGEL 130
           ++R    ++  P +  +    PR  LY+  + D E D +  +A+ +L ++ V + ++G+ 
Sbjct: 42  LNRGGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKS 101

Query: 131 EIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
            ++  R S   +L + +  ++  I  R+   T L     E +Q+++Y  G  YEPH+D+ 
Sbjct: 102 IMSEVRTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDYF 161

Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL-------W---------- 233
                 A + +G G+R+ATVL Y+SDV +GG T+F++    L       W          
Sbjct: 162 HD---KANQVMG-GHRIATVLMYLSDVEKGGETIFSNAKAKLLQPKDESWSECAHKGYAV 217

Query: 234 -PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            P KG A  + +LH     D  + H +CPV+ G
Sbjct: 218 KPRKGDALLFFSLHLDASTDNKSLHGSCPVIEG 250


>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 290

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 104/207 (50%), Gaps = 24/207 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + + ++A+P++ +++V + KTG+   +  R S   +L+  +  +
Sbjct: 85  EPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTGKSTESRVRTSSGMFLKRGKDKI 144

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           ++ I +R+   T +     E LQ+++Y +G  YEPHYD+      + F +   G R+ATV
Sbjct: 145 VQNIEKRIADFTFIPEENGEGLQILHYEVGQKYEPHYDYF----LDEFNTKNGGQRIATV 200

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF + N                   LS+ P+ G A  + ++      
Sbjct: 201 LMYLSDVEEGGETVFPAANANFSSVPWWNDLSQCARKGLSVKPKMGDALLFWSMRPDATL 260

Query: 252 DYYTRHAACPVLTGSNSLHSTCPCGLR 278
           D  + H  CPV+ G N   ST    LR
Sbjct: 261 DPSSLHGGCPVIKG-NKWSSTKWMHLR 286


>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
 gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
           KWC4]
          Length = 215

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 96/175 (54%), Gaps = 10/175 (5%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P I+ +  ++ D E   + + A PRL+ + + N       +++ R S+  +  E E P 
Sbjct: 29  EPLIVRFERLLSDDECRQLIETAAPRLKESKLVNKV-----VSDIRTSRGMFFEEEESPF 83

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I RI RR+  +  +    AE LQV++YG G  Y+ H+DF  PG   A       NR++T+
Sbjct: 84  IHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEYKAHHDFFAPGSPAA-----RNNRISTL 138

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + Y++DV +GG TVF  L +++ P++G A ++   + +   +  T H++ PV+ G
Sbjct: 139 IVYLNDVEEGGETVFPLLGIAMKPKRGAALYFEYFYRNQALNDLTLHSSVPVVRG 193


>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
 gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
          Length = 217

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/176 (30%), Positives = 98/176 (55%), Gaps = 9/176 (5%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P I++  +V+ D E + + +M++ +L+R+ + N +T    + + R S S +  E E+ +
Sbjct: 38  EPLIVILGNVLSDEECEGLIRMSEDKLKRSKIGNTRT----VDDIRTSSSMFFEEGENEL 93

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI RR+  +  +     E LQ++NY IG  Y+ H+DF       A        R++T+
Sbjct: 94  VARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDFFSSSSRAASNP-----RISTL 148

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + Y++DV +GG T F  LN S+ P+KG+A ++   + + D +  T H   PV+ GS
Sbjct: 149 VMYLNDVEEGGETYFPKLNFSVNPQKGSAVYFEYFYDNQDLNDLTLHGGAPVIKGS 204


>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
 gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
          Length = 212

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 96/176 (54%), Gaps = 13/176 (7%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P I++  +V+ D E D +  +++ +L+R+ + N +       + R S S ++ E E  V
Sbjct: 36  EPLIVVLGNVLSDEECDALIGLSKDKLKRSKIGNTRNEN----DMRTSSSTFMEEGESEV 91

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + R+ +R+  +  +     E LQ++NY IG  Y+ H+DF        FK+  +  R++T+
Sbjct: 92  VTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDF--------FKN-ASNPRISTL 142

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + Y++DV +GG T F  LN S+ P+KG A ++   + + + +  T H   PV+ G 
Sbjct: 143 VMYLNDVEEGGETYFPKLNFSVSPQKGMAVYFEYFYDNQELNDLTLHGGAPVIIGD 198


>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
           C-169]
          Length = 222

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/193 (30%), Positives = 93/193 (48%), Gaps = 21/193 (10%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  LY + + ++E D + +  +P + ++ V + +TG+   +  R S   +L   E  V
Sbjct: 7   EPRAYLYHNFLTEAEADYLVQKGKPHMEKSEVVDNETGKSAPSKVRTSSGMFLNRGEDDV 66

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           IERI  R+   T +     E LQ+++Y     Y PH+D+      + F +   G R+AT+
Sbjct: 67  IERIEARIAKYTAIPKENGEGLQILHYQASEEYRPHFDYFH----DNFNTQNGGQRIATM 122

Query: 211 LFYMSDVAQGGATVF-----------------TSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
           L Y+SDV  GG TVF                      +  P+KG A F+++L   G  D 
Sbjct: 123 LMYLSDVEDGGETVFPESSDKPNVGNTKFSQCAQAGAAAKPKKGDALFFYSLTPDGRMDE 182

Query: 254 YTRHAACPVLTGS 266
            + HA CPV+ G 
Sbjct: 183 KSLHAGCPVMKGD 195


>gi|297268736|ref|XP_001115675.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Macaca
           mulatta]
          Length = 567

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 13/180 (7%)

Query: 56  LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
           L   P +V   KC  +   +  +  M LK     L  +  L    ++ + +D +  +   
Sbjct: 258 LIAAPKLVPHAKCPLIREELASVMFM-LKGGAYLLNDKPFL----LHHNHLDFVL-LPSH 311

Query: 116 RLRRATVQNYKTGELEI-ANYRISKSAWLREPEHPVIERISRRVEHMTGLTTST--AEEL 172
           +L+R+ V    +GE ++   YRISKSAWL++   P++  ++ R+  +TGL      AE L
Sbjct: 312 QLQRSVV---ASGEKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYL 368

Query: 173 QVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL 232
           QVVNYGIGGHYEPH+D A    +  ++ + +GNRVAT + Y+S V  GGAT F   NLS+
Sbjct: 369 QVVNYGIGGHYEPHFDHATSPSSPLYR-MKSGNRVATFMIYLSSVEAGGATAFIYANLSV 427


>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
 gi|255645457|gb|ACU23224.1| unknown [Glycine max]
          Length = 298

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 104/205 (50%), Gaps = 25/205 (12%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K ++   +PR  +Y   + D E D +  +A+  L+R+ V +  +GE ++++ R S   
Sbjct: 34  PSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSSGM 93

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + + P+I  I  ++   T L     E++QV+ Y  G  Y+PHYD+      +     
Sbjct: 94  FISKNKDPIISGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----TDKVNIA 149

Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGTAA 240
             G+R+ATVL Y+++V +GG TVF             TS +LS        + P +G A 
Sbjct: 150 RGGHRIATVLMYLTNVTKGGETVFPSAEEPPRRRGTETSSDLSECAKKGIAVKPHRGDAL 209

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
            + +LH++   D  + HA CPV+ G
Sbjct: 210 LFFSLHTNATPDTSSLHAGCPVIEG 234


>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
          Length = 280

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/219 (27%), Positives = 106/219 (48%), Gaps = 28/219 (12%)

Query: 71  VHRNVPYLRLMPLKEEEAYLQ------PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
           +HRN PY +L+ L              P + LY++ + D+E D +  +A+ +L+++ V +
Sbjct: 1   MHRNFPYYKLVQLLALTRLELLSCLGIPGLFLYKNFLTDAECDHLIFLARDKLQKSMVAD 60

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
            ++G+  ++  R S   +L + +  ++  +  R+   T L     E +QV++Y +G  YE
Sbjct: 61  NESGKSVMSEIRTSSGMFLNKAQDEIVASVEDRIAAWTFLPIENGEAMQVLHYELGQKYE 120

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL---------------- 228
           PH+D+       A      G+R+ATVL Y+SDV +GG TVF +                 
Sbjct: 121 PHFDYFHDKINQAM----GGHRIATVLMYLSDVVKGGETVFPNAETKDSQPKDDSWSECA 176

Query: 229 --NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
               S+ P KG A  + +L      D  + H +CPV+ G
Sbjct: 177 KGGYSVKPNKGDALLFFSLRPDATTDQSSLHGSCPVIEG 215


>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
 gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
 gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
          Length = 297

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 97/193 (50%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  LY   + D+E D I  +A+  + ++ V +  +G+   +  R S   +L + E  +
Sbjct: 41  RPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLAKREDEI 100

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I +RV   T L    AE LQV+ Y  G  Y+ H+D+    + N  K LG G RVATV
Sbjct: 101 VSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFH--DRNNLK-LG-GQRVATV 156

Query: 211 LFYMSDVAQGGATVF------------------TSLNLSLWPEKGTAAFWHNLHSSGDGD 252
           L Y++DV +GG TVF                  +   L++ P+KG A  + NLH +   D
Sbjct: 157 LMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNATAD 216

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 217 TGSLHGSCPVIEG 229


>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
 gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
          Length = 298

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 57/183 (31%), Positives = 93/183 (50%), Gaps = 7/183 (3%)

Query: 88  AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
           A  QPR++L+ +++   E D I   A+PR+ R+     +TG  E+ + R S   + +  E
Sbjct: 107 AMAQPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDDRTSNGMFFQREE 166

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNR 206
           +P++ ++  R+  +        E LQV++Y  G  Y+PHYD+  P E      L   G R
Sbjct: 167 NPMVAKLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPTEPGTPTILRRGGQR 226

Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPVLT 264
           VAT++ Y++D  +GG T F  ++L + P +G A F  +   H S      T H   PV+ 
Sbjct: 227 VATIVIYLNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERPHPS----TRTLHGGAPVVA 282

Query: 265 GSN 267
           G  
Sbjct: 283 GDK 285


>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
 gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
          Length = 311

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 63/239 (26%), Positives = 117/239 (48%), Gaps = 17/239 (7%)

Query: 31  PKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQL-KCRYVHRNVPYLRLMPLKEEEAY 89
           P+    A  L+   R++Y+         P  +++QL +     R V    +M        
Sbjct: 74  PEDERAAAGLQGARRQRYQ-------ASPIRLISQLPRFTVADREVELAAVMS------- 119

Query: 90  LQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHP 149
             P I + R ++ D E D + ++++ +++ + V + ++G    ++ R S+ +     E+ 
Sbjct: 120 -NPNIAVIRGLLSDEECDEVIRLSRGKMKTSQVVDRESGGSYESSVRKSEGSHFERGENE 178

Query: 150 VIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVA 208
           ++ RI  R+  +  L  +  E LQ+++YG GG Y+ H DF  P +  +A  +   G R+ 
Sbjct: 179 LVRRIEARLSALVDLPVNRGEPLQILHYGPGGEYKAHQDFFEPKDPGSAVLTRVGGQRIG 238

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           TV+ Y++DV +GG T F  +  S  P KG+A ++   ++ G  DY   HA  PV+ G  
Sbjct: 239 TVVMYLNDVPEGGETAFPDIGFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPVIRGDK 297


>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
 gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
          Length = 307

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 98/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E D +  +A+P ++++TV +  TG  + +  R S   +LR  +  +
Sbjct: 102 EPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGASKDSRVRTSSGMFLRRGQDKI 161

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I+ I +R+   T +     E LQV++Y +G  YEPH+D+      + + +   G R+AT+
Sbjct: 162 IQTIEKRIADFTFIPVEHGEGLQVLHYEVGQKYEPHFDYFH----DDYNTKNGGQRIATL 217

Query: 211 LFYMSDVAQGGATVF-------------------TSLNLSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV  GG TVF                       LS+ P+ G A  + ++   G  
Sbjct: 218 LMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWSMKPDGSM 277

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 278 DSTSLHGGCPVIKG-NKWSST 297


>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 291

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/195 (30%), Positives = 96/195 (49%), Gaps = 23/195 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR ++Y + + + E + +  +A+P + ++TV + KTG  + +  R S   +LR     V
Sbjct: 86  EPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLRRGHDEV 145

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +E I +R+   T +     E LQV++Y +G  YEPHYD+      + F +   G R+ATV
Sbjct: 146 VEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYF----LDEFNTKNGGQRIATV 201

Query: 211 LFYMSDVAQGGATVFTSL-------------------NLSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV  GG TVF +                     LS+ P+   A  + N+      
Sbjct: 202 LMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKXRDALLFWNMRPDASL 261

Query: 252 DYYTRHAACPVLTGS 266
           D  + H  CPV+ G+
Sbjct: 262 DPSSLHGGCPVVKGN 276


>gi|328710203|ref|XP_001949232.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
           pisum]
          Length = 500

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 65/210 (30%), Positives = 107/210 (50%), Gaps = 17/210 (8%)

Query: 62  IVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRAT 121
           I  + KCRY H    YL + PL+EE   L P + LY +V+YD EI  IK++A P+L + +
Sbjct: 294 IYPKFKCRYYHGGRKYLMIGPLREEIVSLIPSMKLYHNVLYDDEIKKIKELANPKLEKLS 353

Query: 122 VQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL-TTSTAEELQVVNYGIG 180
           +    T E    +  + K A  R+    V E I  R+  ++   TT+  ++  V NYGIG
Sbjct: 354 ID---TNE----DISLRKVASFRKHNDQVFETIHHRLAQISSKPTTNIVDKYVVTNYGIG 406

Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
           GHY PH  +           + +  R A V+ +M DV +GGATV  ++   +   KG+A 
Sbjct: 407 GHYLPHTKYIDDNHL-----INSKRRDAIVIIHMDDVPEGGATVLPNVEFCVPSVKGSAL 461

Query: 241 FWHNLHSS----GDGDYYTRHAACPVLTGS 266
             ++  ++     +   + ++ +CP++ G 
Sbjct: 462 VIYSTRNTLPPIKELFEFAQYGSCPIVYGD 491


>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 318

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 104/213 (48%), Gaps = 22/213 (10%)

Query: 71  VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGEL 130
           ++R    ++  P +  +    PR  LY+  + D E D +  +A+ +L ++ V + ++G+ 
Sbjct: 42  LNRGGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKS 101

Query: 131 EIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
            ++  R S   +L + +  ++  I  R+   T L     E +Q+++Y  G  YEPH+D+ 
Sbjct: 102 IMSEVRTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDYF 161

Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL-------W---------- 233
                 A + +G G+R+ATVL Y+SDV +GG T+F +    L       W          
Sbjct: 162 HD---KANQVMG-GHRIATVLMYLSDVEKGGETIFPNAKAKLLQPKDESWSECAHKGYAV 217

Query: 234 -PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            P KG A  + +LH     D  + H +CPV+ G
Sbjct: 218 KPRKGDALLFFSLHLDASTDNKSLHGSCPVIEG 250


>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 296

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 58/176 (32%), Positives = 92/176 (52%), Gaps = 1/176 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P  IL  D +  +E + +  +A+PRL R+TV +  TG   +A +R S   + R  E P+
Sbjct: 101 RPAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNVVAGHRSSDGMFFRLGETPL 160

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE-ANAFKSLGTGNRVAT 209
           I R+  R+  +TGL     E LQ+++Y +G    PH D+   G  AN      +G RV T
Sbjct: 161 IARLEARIAELTGLPVENGEGLQLLHYEVGAESTPHVDYLIAGNPANQESIARSGQRVGT 220

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +L Y++DV  GG T+F     S+ P +G A ++   +  G  D  + H + P+  G
Sbjct: 221 LLMYLNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLRVG 276


>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
 gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
          Length = 299

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 55/179 (30%), Positives = 92/179 (51%), Gaps = 3/179 (1%)

Query: 88  AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
           A  +PRI+++ +++   E D +   A PR+ R+     KTG  E+ + R S   + +  E
Sbjct: 108 AIAKPRIVVFGNLLSAEECDALIAAAAPRMARSLTVATKTGGEEVNDDRTSDGMFFQRGE 167

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNR 206
           +PV++RI  R+  +        E LQV++Y  G  Y+PHYD+  PGE      L   G R
Sbjct: 168 NPVVQRIEERIARLLDWPIENGEGLQVLHYRPGAEYKPHYDYFDPGEPGTPTILKRGGQR 227

Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           V T++ Y++   +GG T F  +++ + P++G A F+   +        T H   PV+ G
Sbjct: 228 VGTLVMYLNTPEKGGGTTFPDVHVEVAPQRGNAVFFS--YERAHPATRTLHGGAPVIAG 284


>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
 gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
          Length = 306

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 53/176 (30%), Positives = 93/176 (52%), Gaps = 3/176 (1%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
            PR++++ +++ D E D I   A+PR+RR+   + ++G   + + R S   + +  E+ +
Sbjct: 118 HPRVVVFGNLLSDEECDAIIAAARPRMRRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDL 177

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
           I  + +R+  +        E +QV++Y  G  Y+PHYD+  P E      L   G RV T
Sbjct: 178 ISLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGT 237

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           ++ Y+++ A+GGAT F  + L + P +G A F+   ++  D    T H   PVL G
Sbjct: 238 LVMYLNEPARGGATTFPDVGLQIVPRRGNAVFFS--YNRPDPATKTLHGGAPVLEG 291


>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 318

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 60/201 (29%), Positives = 99/201 (49%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+PR+ ++TV +  TG+ + +  R S   +LR     V
Sbjct: 113 EPRAFVYHNFLSKEECEYLIGLAKPRMEKSTVVDSTTGKSKDSRVRTSSGMFLRRGRDKV 172

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I RR+   T +     E LQV++Y +G  YEPH+D+      + F +   G R+AT+
Sbjct: 173 IRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRMATI 228

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG T+F   N                   L++ P+ G A  + +++     
Sbjct: 229 LMYLSDVEEGGETIFPDANVNSSSLPWHNELSECARKGLAVKPKMGDALLFWSMNPDATL 288

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 289 DPLSLHGGCPVIRG-NKWSST 308


>gi|195503448|ref|XP_002098656.1| GE23815 [Drosophila yakuba]
 gi|194184757|gb|EDW98368.1| GE23815 [Drosophila yakuba]
          Length = 472

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 68/225 (30%), Positives = 103/225 (45%), Gaps = 34/225 (15%)

Query: 42  VTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVM 101
           + ER  +   CRG   +PP+  + L+CRY     P+LR   LK E+  ++P + L+ D +
Sbjct: 239 IAERLVHVDNCRGK-NLPPS-KSFLRCRYFREGSPFLRWAALKLEQLNIEPFVGLFHDAI 296

Query: 102 YDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHM 161
             +E + + ++ + RL      +Y       AN   + S      +H  + RI +R+E +
Sbjct: 297 SPAEQEDLLRLTETRLEHRKKDSYSVE----ANVDTNGS------DH--VRRIHQRIEDI 344

Query: 162 TGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGG 221
           TG     +E L V NYGIGG    H D  +P                     +SDV  GG
Sbjct: 345 TGFDLEDSEPLTVSNYGIGGQESIHLDCEQPK--------------------LSDVQMGG 384

Query: 222 ATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
              F  L     P +G+A  WHN  S+G+ D  +  A CPVL G+
Sbjct: 385 YASFPDLGFGFKPSRGSALVWHNTDSAGNCDTRSLQATCPVLLGN 429


>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
 gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
          Length = 211

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 94/175 (53%), Gaps = 10/175 (5%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P I+ + +V+ D E   +   A  RL R+     K  + EI++ R S   +  E E+P+
Sbjct: 29  EPLIVKFLNVLSDEECQNLIDCASSRLERS-----KLAKKEISSIRTSSGMFFEENENPL 83

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+  +  L    AE LQV++Y  G  ++PH+DF  P   ++     + NR+ T+
Sbjct: 84  ISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKPHFDFFGPNHPSS-----SNNRICTL 138

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + Y++DV +GG T F +L +   P+KGTA ++   ++    +  T H+  PV+ G
Sbjct: 139 VVYLNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHSGEPVIQG 193


>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
 gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
          Length = 283

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 55/195 (28%), Positives = 103/195 (52%), Gaps = 4/195 (2%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P + L  DV+   E D + ++ + R+RR++V +  +G   + + R S+ A++     P+
Sbjct: 90  EPVVALLADVLSPRECDRLIEIGRERVRRSSVVDPDSGGEVLIDARKSEGAFVNGSTDPL 149

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
           +  I RR+  +        E+L ++ YG GG Y PH+D+    +A +   +   G R+AT
Sbjct: 150 VATIDRRIAELVQQPVENGEDLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQRGGQRIAT 209

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           ++ Y++ V +GG T F  + L++ P +G A ++  +++ G  D  T HA  PV  G   +
Sbjct: 210 LILYLNQVEEGGDTTFPDIGLTIHPRRGAALYFEYVNALGQTDPRTLHAGMPVERGEKWI 269

Query: 270 HSTCPCGLRRGLQRS 284
            +     +RRG  R+
Sbjct: 270 ATKW---MRRGRFRA 281


>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 319

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 105/213 (49%), Gaps = 22/213 (10%)

Query: 71  VHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGEL 130
           ++R    ++  P +  +    PR  LY+  + + E D +  +A+ +L ++ V +  +G+ 
Sbjct: 43  LNRGGSSVKFDPTRVTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSGKS 102

Query: 131 EIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
            +++ R S   +L + +  ++  I  R+   T L     E +Q+++Y  G  YEPH+D+ 
Sbjct: 103 IMSDIRTSSGMFLNKAQDEIVAGIEARIAAWTFLPVENGESMQILHYENGQKYEPHFDYF 162

Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL-------W---------- 233
                 A + +G G+R+ATVL Y+SDV +GG T+F +    L       W          
Sbjct: 163 HD---KANQVMG-GHRIATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGYAV 218

Query: 234 -PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            P+KG A  + +LH     D  + H +CPV+ G
Sbjct: 219 KPQKGDALLFFSLHLDASTDTKSLHGSCPVIEG 251


>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
 gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
          Length = 310

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 57/193 (29%), Positives = 96/193 (49%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PRI  Y+  + D E D + K+ + +L+R+ V + ++G+  ++  R S   +L + + PV
Sbjct: 54  KPRIFFYKGFLSDDECDHLVKLGKEKLKRSMVADNESGKSVMSEVRTSSGMFLDKQQDPV 113

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I  R+   T L    AE +Q++ Y  G  Y+PH+D+ +    +    L  G+R ATV
Sbjct: 114 VSGIEERIAAWTLLPQENAENIQILRYENGQKYDPHFDYFQ----DKVNQLQGGHRYATV 169

Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
           L Y+S V +GG TVF +                    L++   KG +  + NL   G  D
Sbjct: 170 LTYLSTVEKGGETVFPNAEGWESQPKDDSFSDCAKKGLAVKAVKGDSVLFFNLQPDGTPD 229

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 230 PLSLHGSCPVIEG 242


>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
 gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
          Length = 299

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 53/176 (30%), Positives = 93/176 (52%), Gaps = 3/176 (1%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
            PR++++ +++ D E D I   A+PR+RR+   + ++G   + + R S   + +  E+ +
Sbjct: 111 HPRVVVFGNLLSDEECDAIIAAARPRMRRSLTVDNQSGGEAVNDDRTSNGMFFQRGENEL 170

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT-GNRVAT 209
           I  + +R+  +        E +QV++Y  G  Y+PHYD+  P E      L   G RV T
Sbjct: 171 ISLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGT 230

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           ++ Y+++ A+GGAT F  + L + P +G A F+   ++  D    T H   PVL G
Sbjct: 231 LVMYLNEPARGGATTFPDVGLQVVPRRGNAVFFS--YNRPDPATKTLHGGAPVLEG 284


>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 293

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 100/193 (51%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  LY   +  +E D + K+A+ RL+++ V +  +G+  ++  R S   +L + E  +
Sbjct: 37  RPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVADNDSGKSVMSQVRTSSGTFLNKHEDEI 96

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +RV   T L    AE +QV++Y +G  Y+ H+D+         + LG G+RVATV
Sbjct: 97  ISGIEKRVAAWTFLPEENAESIQVLHYEVGQKYDAHFDYFHDKNN---QKLG-GHRVATV 152

Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
           L Y++DV +GG TVF +                    L++ P KG A  + +LH +   D
Sbjct: 153 LMYLTDVKKGGETVFPNAEGRHLQHKDETWSECARSGLAVKPRKGDALLFFSLHINATTD 212

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 213 PSSLHGSCPVIEG 225


>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
           nagariensis]
 gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
           nagariensis]
          Length = 309

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 65/192 (33%), Positives = 95/192 (49%), Gaps = 19/192 (9%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR  L +  + D E + I   A+PR+ +++V +  +G+   +  R S  AWL + E  +I
Sbjct: 61  PRAFLLKGFLSDEECEHIIAKAKPRMVKSSVVDNASGKSVDSEIRTSTGAWLAKGEDEII 120

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATV 210
            RI +RV  +T +     E LQV++Y  G  YEPHYD F  P   NA    G G RV TV
Sbjct: 121 SRIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDP--VNASPEHG-GQRVVTV 177

Query: 211 LFYMSDVAQGGATVFTSLN---------------LSLWPEKGTAAFWHNLHSSGDGDYYT 255
           L Y++ V +GG TV    +               L++ P KG A  +++L   G  D  +
Sbjct: 178 LMYLTTVEEGGETVLPHADQKVSGEGWSECAKRGLAVKPVKGDALMFYSLKPDGSNDPAS 237

Query: 256 RHAACPVLTGSN 267
            H +CP L G  
Sbjct: 238 LHGSCPTLKGDK 249


>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
 gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
          Length = 303

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 59/185 (31%), Positives = 93/185 (50%), Gaps = 11/185 (5%)

Query: 88  AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
           A  QPRI+++ +++   E D +   A+PR+ R+     KTG  EI   R S   + +  +
Sbjct: 112 AMAQPRIVVFGNLLSPEECDALIAAAEPRMARSLTVATKTGGEEINADRTSDGMFFQRGQ 171

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTG 204
            P+I+RI  R+  +        E LQV++Y  G  Y+PHYD+   A PG  +  K    G
Sbjct: 172 SPLIQRIEERIARLLQWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIIKR--GG 229

Query: 205 NRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPV 262
            RV T++ Y++   +GG T F  ++L + P++G A F  +   H S      T H   PV
Sbjct: 230 QRVGTLVMYLNTPDKGGGTTFPDVHLEVAPQRGNAVFFSYERPHPS----TRTLHGGAPV 285

Query: 263 LTGSN 267
           + G  
Sbjct: 286 IAGDK 290


>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
 gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
          Length = 296

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 65/199 (32%), Positives = 100/199 (50%), Gaps = 6/199 (3%)

Query: 75  VPYLRLMPLKEEEAYL-----QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGE 129
           VP   L+PL E +  +     +P  +   D +   E + +  +AQPRL R+TV +  TG 
Sbjct: 80  VPDGPLIPLGERKVRVLSRLQRPAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGR 139

Query: 130 LEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF 189
             +A +R S   + R  E P+I RI  R+  +TG      E LQ+++Y  G    PH D+
Sbjct: 140 NVVAGHRSSHGMFFRLGETPLIVRIEARIAALTGTPVENGEGLQMLHYEEGAESTPHVDY 199

Query: 190 ARPG-EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSS 248
              G EAN      +G R+ T+L Y+ DV  GG TVF  +  S+ P++G A ++   +  
Sbjct: 200 LITGNEANRESIARSGQRMGTLLMYLKDVEGGGETVFPQIGWSVAPQRGHALYFEYGNRF 259

Query: 249 GDGDYYTRHAACPVLTGSN 267
           G  D  + HA+ P+  G  
Sbjct: 260 GLCDPSSLHASTPLRVGDK 278


>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
 gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
          Length = 316

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/202 (30%), Positives = 99/202 (49%), Gaps = 22/202 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P +  +    PR+ LY   + D E D   K+A+ +L ++ V +  +GE   +  R S   
Sbjct: 53  PTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 112

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           +L + +  ++  +  ++   T L     E +Q+++Y  G  YEPH+D+    +AN    L
Sbjct: 113 FLSKRQDDIVNNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHD-QANL--EL 169

Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------TSLNLSLW-----------PEKGTAAFWH 243
           G G+R+ATVL Y+S+V +GG TVF       T L    W           P KG A  + 
Sbjct: 170 G-GHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFF 228

Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
           NLH +   D  + H +CPV+ G
Sbjct: 229 NLHPNATTDSNSLHGSCPVVEG 250


>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
          Length = 297

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/193 (32%), Positives = 97/193 (50%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  LY   + D+E D +  +A+  + ++ V +  +G+   +  R S   +L + E  +
Sbjct: 41  RPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLAKREDEI 100

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I +RV   T L    AE LQV+ Y  G  Y+ H+D+    + N  K LG G RVATV
Sbjct: 101 VSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFH--DRNNLK-LG-GQRVATV 156

Query: 211 LFYMSDVAQGGATVF------------------TSLNLSLWPEKGTAAFWHNLHSSGDGD 252
           L Y++DV +GG TVF                  +   L++ P+KG A  + NLH +   D
Sbjct: 157 LMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNATAD 216

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 217 TGSLHGSCPVIEG 229


>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
 gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 316

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/202 (30%), Positives = 99/202 (49%), Gaps = 22/202 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P +  +    PR+ LY   + D E D   K+A+ +L ++ V +  +GE   +  R S   
Sbjct: 53  PTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 112

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           +L + +  ++  +  ++   T L     E +Q+++Y  G  YEPH+D+    +AN    L
Sbjct: 113 FLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHD-QANL--EL 169

Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------TSLNLSLW-----------PEKGTAAFWH 243
           G G+R+ATVL Y+S+V +GG TVF       T L    W           P KG A  + 
Sbjct: 170 G-GHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFF 228

Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
           NLH +   D  + H +CPV+ G
Sbjct: 229 NLHPNATTDSNSLHGSCPVVEG 250


>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 286

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/230 (26%), Positives = 110/230 (47%), Gaps = 23/230 (10%)

Query: 56  LTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQP 115
           L+ P A       R  H        + L+ E    QPR  LY + +   E + +  +A P
Sbjct: 45  LSTPHANANSSVSRNTHIEAEEDDQVALRMEVISWQPRAFLYHNFLTKEECEYLINIATP 104

Query: 116 RLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVV 175
            ++++TV + ++G+  + + R S  A+L   +  ++  I +R+  +T +     E + V+
Sbjct: 105 HMQKSTVADNQSGQSVVHDVRKSTGAFLDRGQDEIVRNIEKRIADVTFIPIENGEPIYVI 164

Query: 176 NYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF---------- 225
           +Y +G +Y+PHYD+      + F     G R+AT+L Y+S+V +GG T+F          
Sbjct: 165 HYEVGQYYDPHYDYF----IDDFNIENGGQRIATMLMYLSNVEEGGETMFPRAKANFSSV 220

Query: 226 ---------TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
                      + LS+ P+ G A  + ++  +   D  T H+ACPV+ G+
Sbjct: 221 PWWNELSNCGKMGLSIKPKMGDALLFWSMKPNATLDALTLHSACPVIKGN 270


>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
 gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
          Length = 299

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 53/175 (30%), Positives = 93/175 (53%), Gaps = 3/175 (1%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR++++ +++ D E D I   A PR++R+   + ++G   + + R S   + +  E+ +I
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAAGPRMQRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 171

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
            R+ +R+  +        E +QV++Y  G  Y+PHYD+  P E      L   G RV T+
Sbjct: 172 CRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 231

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + Y+++ A+GGAT F  + L + P +G A F+   ++  D    T H   PVL G
Sbjct: 232 VMYLNEPARGGATTFPDVGLQVVPRRGNAVFFS--YNRPDPATKTLHGGAPVLEG 284


>gi|351714551|gb|EHB17470.1| Prolyl 4-hydroxylase subunit alpha-1 [Heterocephalus glaber]
          Length = 388

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 57/147 (38%), Positives = 88/147 (59%), Gaps = 11/147 (7%)

Query: 4   PTHQRAQGNKLYYQEALNK--------SPELKDEPPKVNNVAPTLE-VTEREKYEMLCRG 54
           P HQRA GN  Y++  + K        S +  D+   +      ++ + ER+KYEMLCRG
Sbjct: 241 PEHQRANGNLKYFEYIMAKEKDANKSASDDQSDQKSTLRKKGIAVDYLPERQKYEMLCRG 300

Query: 55  D-LTVPPAIVAQLKCRYVHRNV-PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKM 112
           + + + P    +L CRY   N  P   L P K+E+ + +PRII + D++ D+EI+++K +
Sbjct: 301 EGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDL 360

Query: 113 AQPRLRRATVQNYKTGELEIANYRISK 139
           A+PRL RATV + +TG+L  A YR+SK
Sbjct: 361 AKPRLSRATVHDPETGKLTTAQYRVSK 387


>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 364

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 97/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E D +  +A+P ++++TV +  TG  + +  R S   +LR  +  +
Sbjct: 159 EPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGMFLRRGQDKI 218

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y +G  YEPH+D+      + + +   G R+AT+
Sbjct: 219 IRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFH----DDYNTKNGGQRIATL 274

Query: 211 LFYMSDVAQGGATVF-------------------TSLNLSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV  GG TVF                       LS+ P+ G A  + ++   G  
Sbjct: 275 LMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWSMKPDGSL 334

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 335 DPTSLHGGCPVIKG-NKWSST 354


>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 483

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 63/206 (30%), Positives = 108/206 (52%), Gaps = 25/206 (12%)

Query: 86  EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATV--QNYKTGELEIANYRISKSAWL 143
           E   L+P ++     + D E D I ++A P+++ ++V  ++   G+ + + +R S+SA+L
Sbjct: 262 ETLSLRPLVVSVEGFLSDEECDYIAEIASPQVKYSSVSLKDADKGK-DSSEWRTSQSAFL 320

Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-- 201
              +  V+  I  RV  +T +  +  E +QV+ YG G  Y+ H+D+  P    + KS   
Sbjct: 321 SARDDEVLTEIDHRVASLTRIPRNHQEYVQVLRYGAGEKYDSHHDYFDPSAYRSDKSTLR 380

Query: 202 ----GTGNRVATVLFYMSDVAQGGATVF--------------TSLNLSLWPEKGTAAFWH 243
               G  NR ATV +Y++DV  GG T+F               S+ L + P+KG    ++
Sbjct: 381 LIENGKKNRYATVFWYLTDVHDGGETIFPRYGGAPAPRSHKDCSIGLKVKPQKGKVVIFY 440

Query: 244 NLHSSGDGDYYTRHAACPVLTGSNSL 269
           +L +SG+ D ++ H ACPV  G N+L
Sbjct: 441 SLDASGEMDPFSLHGACPV--GENNL 464


>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
          Length = 308

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 59/193 (30%), Positives = 97/193 (50%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  L++  + D+E + +  +A+ +L ++ V + ++G+  ++  R S   +L + +  V
Sbjct: 51  RPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQDEV 110

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI  R+   T L     E +Q+++Y  G  YEPHYD+       A   LG G+R+ATV
Sbjct: 111 VARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 166

Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
           L Y+SDV +GG T+F      L       W           P KG A  + +LH     D
Sbjct: 167 LMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDATTD 226

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 227 SDSLHGSCPVIEG 239


>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 308

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 57/203 (28%), Positives = 97/203 (47%), Gaps = 21/203 (10%)

Query: 80  LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
           + P    +    PR  LY   + D E + +  +A+  L+R+ V +  +G+ +++  R S 
Sbjct: 46  VYPHHSRQISWHPRAFLYPHFLSDDEANHLVSLARAELKRSAVADETSGKSQLSEVRTSS 105

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
             ++ + + P++  I  ++   T L     E++QV+ Y  G  YEPHYDF      ++  
Sbjct: 106 GTFISKGKDPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKYEPHYDFF----TDSVN 161

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF-----------------TSLNLSLWPEKGTAAFW 242
           ++  G+RVATVL Y++DVA+GG TVF                     +++ P KG A  +
Sbjct: 162 TILGGHRVATVLLYLTDVAEGGETVFPLAKGRKGSHHKGLSECAQKGIAVKPRKGDALLF 221

Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
            NL      D  + H  C V+ G
Sbjct: 222 FNLRPDAATDPTSLHGGCEVIKG 244


>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
 gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
          Length = 291

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 57/201 (28%), Positives = 101/201 (50%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +  +E + +  +A+PR++++TV +  TG+ + +  R S   +L      +
Sbjct: 86  KPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDSSTGKSKDSKVRTSSGTFLPRGRDKI 145

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I +R+   + +     E LQ+++Y +G  YEPH+D+      + + +   G R+ATV
Sbjct: 146 VRDIEKRIADFSFIPVEHGEGLQILHYEVGQRYEPHFDYF----MDEYNTKNGGQRIATV 201

Query: 211 LFYMSDVAQGGATVFTSL-------------------NLSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF S                     LS+ P+ G A  + +++  G  
Sbjct: 202 LMYLSDVEEGGETVFPSAEGNISAVPWWNELSECGKGGLSVKPKMGDALLFWSMNPDGSP 261

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 262 DPSSLHGGCPVIRG-NKWSST 281


>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 303

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 57/205 (27%), Positives = 100/205 (48%), Gaps = 25/205 (12%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K ++    PR  +Y   + D E D +  +A+  L+R++V +  +G+ +++  R S  A
Sbjct: 38  PAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSSGA 97

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + + P++  I  ++   T L     E++QV+ Y  G  Y+ H+D+     A+     
Sbjct: 98  FIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYF----ADKVNIA 153

Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSL---------------------NLSLWPEKGTAA 240
             G+R+ATVL Y+SDV +GG TVF S                       +++ P KG A 
Sbjct: 154 RGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDAL 213

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
            + +LH +   D  + H  CPV+ G
Sbjct: 214 LFFSLHPNAIPDTSSLHGGCPVIEG 238


>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
          Length = 308

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 59/193 (30%), Positives = 97/193 (50%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  L++  + D+E + +  +A+ +L ++ V + ++G+  ++  R S   +L + +  V
Sbjct: 51  RPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQDEV 110

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI  R+   T L     E +Q+++Y  G  YEPHYD+       A   LG G+R+ATV
Sbjct: 111 VARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 166

Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
           L Y+SDV +GG T+F      L       W           P KG A  + +LH     D
Sbjct: 167 LMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDATTD 226

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 227 SDSLHGSCPVIEG 239


>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 332

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/202 (30%), Positives = 99/202 (49%), Gaps = 22/202 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P +  +    PR+ LY   + D E D   K+A+ +L ++ V +  +GE   +  R S   
Sbjct: 69  PTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 128

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           +L + +  ++  +  ++   T L     E +Q+++Y  G  YEPH+D+    +AN    L
Sbjct: 129 FLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHD-QANL--EL 185

Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------TSLNLSLW-----------PEKGTAAFWH 243
           G G+R+ATVL Y+S+V +GG TVF       T L    W           P KG A  + 
Sbjct: 186 G-GHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFF 244

Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
           NLH +   D  + H +CPV+ G
Sbjct: 245 NLHPNATTDSNSLHGSCPVVEG 266


>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 297

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 59/214 (27%), Positives = 102/214 (47%), Gaps = 26/214 (12%)

Query: 74  NVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIA 133
           N    +L P K  +   +PR  +Y   + D E D +  +A+  L+R+ V + ++G+ +++
Sbjct: 26  NDSIFKLNPSKVRQISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVADNESGKSQVS 85

Query: 134 NYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG 193
             R S  A++ + +  +++RI  ++   T L     E++QV+ Y  G  YE H+DF    
Sbjct: 86  EVRTSSGAFISKAKDAIVQRIEEKLATWTFLPIENGEDIQVLRYEEGQKYENHFDFF--- 142

Query: 194 EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL----------------------S 231
            ++       G+R ATVL Y+S+V +GG TVF +  L                      S
Sbjct: 143 -SDKVNIARGGHRYATVLMYLSNVEKGGDTVFPNAELSERQKAAIAANDDLSECAKRGIS 201

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + P KG A  + +L  +   D  + H  CPV+ G
Sbjct: 202 VKPRKGDALLFFSLTPTATPDQLSLHGGCPVIEG 235


>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
          Length = 267

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/204 (30%), Positives = 101/204 (49%), Gaps = 19/204 (9%)

Query: 77  YLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYR 136
           +LRL  +K E     PRII++ + +   E D ++ +A+PRL+ +TV +  TG+   +N R
Sbjct: 53  FLRLGLVKPEVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSNVR 112

Query: 137 ISKSAWLREPEH--PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGE 194
            S   ++   E   PVI+ I +R+   + +     E +QV+ Y    +Y PH+D+     
Sbjct: 113 TSSGMFVSSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYF---- 168

Query: 195 ANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTAAF 241
           ++ F     G RVAT+L Y++D  +GG T F                 L + P KG A  
Sbjct: 169 SDTFNIKRGGQRVATMLMYLTDGVEGGETHFPQAGDGECSCGGKMVKGLCVKPNKGDAVL 228

Query: 242 WHNLHSSGDGDYYTRHAACPVLTG 265
           + ++   G+ D  + H  CPVL G
Sbjct: 229 FWSMGLDGETDSNSIHGGCPVLEG 252


>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
 gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
 gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
 gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
 gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
 gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 267

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 102/208 (49%), Gaps = 19/208 (9%)

Query: 73  RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEI 132
           +   +LRL  +K E     PRII++ + +   E D ++ +A+PRL+ +TV +  TG+   
Sbjct: 49  QEAAFLRLGLVKPEVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVK 108

Query: 133 ANYRISKSAWLREPEH--PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
           +N R S   ++   E   PVI+ I +R+   + +     E +QV+ Y    +Y PH+D+ 
Sbjct: 109 SNVRTSSGMFVSSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYF 168

Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKG 237
               ++ F     G RVAT+L Y++D  +GG T F                 L + P KG
Sbjct: 169 ----SDTFNIKRGGQRVATMLMYLTDGVEGGETHFPQAGDGECSCGGKMVKGLCVKPNKG 224

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            A  + ++   G+ D  + H  CPVL G
Sbjct: 225 DAVLFWSMGLDGETDSNSIHGGCPVLEG 252


>gi|195352178|ref|XP_002042591.1| GM14978 [Drosophila sechellia]
 gi|194124475|gb|EDW46518.1| GM14978 [Drosophila sechellia]
          Length = 467

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 58/184 (31%), Positives = 94/184 (51%), Gaps = 32/184 (17%)

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
           + L CRY      +L+L PLK EE    P I+++ +V+ D EI+ +K             
Sbjct: 296 SNLVCRYNSSTNAFLQLAPLKMEEVSRDPYIVMFHEVVSDKEIEEMK------------- 342

Query: 124 NYKTGEL-EIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGH 182
               GE+ E+ N +          E    +RI++R+  MTG        +Q  N+G+GG+
Sbjct: 343 ----GEITEMENGK----------ESSFSKRINQRISDMTGFKLEEFPAIQSANFGVGGY 388

Query: 183 YEPHYDF--ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAA 240
           ++PHYD+   R  E +   +LG  +R+ +++FY  +V+QGG TVF    + + P+KG A 
Sbjct: 389 FKPHYDYYTDRLKEVDVNNTLG--DRIGSIIFYAGEVSQGGQTVFPDSKVMVEPKKGNAL 446

Query: 241 FWHN 244
            W N
Sbjct: 447 LWFN 450


>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 683

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 59/188 (31%), Positives = 97/188 (51%), Gaps = 17/188 (9%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR  +Y + +   E + +  +A+P + R+ V +  TGE++ ++ R S   +L   +  ++
Sbjct: 119 PRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESSSRTSSGMFLDRGKDKIV 178

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVL 211
           + I RR+  +T +     E L V++YG+G   EPHYD+   G          G RVATVL
Sbjct: 179 QNIERRIADITSVPIENGEGLHVIHYGVGQKCEPHYDYTSDGVVTK----NGGPRVATVL 234

Query: 212 FYMSDVAQGGATV-------FTSLN------LSLWPEKGTAAFWHNLHSSGDGDYYTRHA 258
            Y+SDV +GG TV       FTS++      LS+ P+ G A  + ++   G  D  + H 
Sbjct: 235 MYLSDVEEGGETVFPDAQPNFTSVSKCSGDGLSVKPKMGDALLFWSMKPDGTLDTSSLHG 294

Query: 259 ACPVLTGS 266
             PV+ G+
Sbjct: 295 GSPVIRGN 302



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 42/175 (24%), Positives = 76/175 (43%), Gaps = 32/175 (18%)

Query: 105 EIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGL 164
           E + +  +A+P + R+ V +  TG+   ++ R S   +L   +  +++ I +R+  +T +
Sbjct: 377 ECEHLINLAKPFMTRSLVVDGLTGKGRESSARTSSGRFLERGKDKIVQNIEQRIADITSI 436

Query: 165 TTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATV 224
                + +     G+           + G          G RVATVL Y+SDV +GG TV
Sbjct: 437 PRMARDFMLFTAGGV---------VTKNG----------GPRVATVLMYLSDVEEGGETV 477

Query: 225 FTSLN-------------LSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           F +               LS+ P+ G A  + ++   G  D  + H   PV+ G+
Sbjct: 478 FPNAKPNINSVSKYPEKGLSVKPKMGDALLFRSMKPDGTLDTSSLHGGSPVIRGN 532


>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 315

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 58/201 (28%), Positives = 100/201 (49%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + + ++A+PR+ ++TV + +TG+ + +  R S   +L+     V
Sbjct: 110 EPRAFVYHNFLSKEECEYLIELAKPRMVKSTVVDSETGKSKDSRVRTSSGMFLQRGRDKV 169

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I RR+   T +     E LQV++Y +G  YEPH+D+      + F +   G R+AT+
Sbjct: 170 IRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRMATI 225

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SD+ +GG T+F   N                   L++ P+ G A  + ++      
Sbjct: 226 LMYLSDIEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATL 285

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 286 DPLSLHGGCPVIKG-NKWSST 305


>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 214

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 59/196 (30%), Positives = 96/196 (48%), Gaps = 23/196 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  LY   + + E + + ++A+P L ++TV +  TG+ + +  R S   +L   + PV
Sbjct: 9   EPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSRLRTSSGTFLMRGQDPV 68

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I+RI +R+   T +     E LQV+ Y     YEPHYD+      +A+ +   G R+ATV
Sbjct: 69  IKRIEKRIADFTFIPAEQGEGLQVLQYKESEKYEPHYDYFH----DAYNTKNGGQRIATV 124

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+S+V +GG TVF +                     LS+ P  G A  + ++      
Sbjct: 125 LMYLSNVEEGGETVFPAAQVNKTEVPDWDKLSECAQKGLSVRPRMGDALLFWSMKPDATL 184

Query: 252 DYYTRHAACPVLTGSN 267
           D  + H  CPV+ G+ 
Sbjct: 185 DSTSLHGGCPVIKGTK 200


>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/193 (31%), Positives = 97/193 (50%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  L++  + D+E D +  +A+ +L ++ V + K+G+   +  R S   +L + +  V
Sbjct: 41  RPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGMFLEKKQDEV 100

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI  R+   T L     E +Q+++Y  G  YEPHYD+       A   LG G+R+ATV
Sbjct: 101 VTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 156

Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
           L Y+S+V +GG T+F +    L       W           P KG A  + +LH     D
Sbjct: 157 LMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTD 216

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 217 SDSLHGSCPVIEG 229


>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
 gi|194693016|gb|ACF80592.1| unknown [Zea mays]
 gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
 gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 307

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 97/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E D +  +A+P ++++TV +  TG  + +  R S   +LR  +  +
Sbjct: 102 EPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGMFLRRGQDKI 161

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y +G  YEPH+D+      + + +   G R+AT+
Sbjct: 162 IRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFH----DDYNTKNGGQRIATL 217

Query: 211 LFYMSDVAQGGATVF-------------------TSLNLSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV  GG TVF                       LS+ P+ G A  + ++   G  
Sbjct: 218 LMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWSMKPDGSL 277

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 278 DPTSLHGGCPVIKG-NKWSST 297


>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 306

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/207 (31%), Positives = 101/207 (48%), Gaps = 21/207 (10%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPR-LRRATVQNYKTGELEIANYR 136
           +R  P +      +PR  LY+  + ++E D +  +A+   L+++ V + +TG+  ++  R
Sbjct: 31  VRFDPTRAVHVSWRPRAFLYKGFLTEAECDHLVALAEEGGLQKSMVVDRQTGKSVMSEVR 90

Query: 137 ISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEAN 196
            S   +L + +  V+  I  R+   T L     E +QV+ Y  G  YEPH DF R   A 
Sbjct: 91  TSSGTFLAKKQDQVVATIEARIAAWTLLPQENGESIQVLRYENGQKYEPHVDFIRHA-AK 149

Query: 197 AFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL------------------SLWPEKGT 238
              S G G+RVATVL Y+SDV  GG TVF + +                   ++ P KG 
Sbjct: 150 GHHSRG-GHRVATVLMYLSDVKMGGETVFPNSDAKTLQPKDDTQSECARRGYAVKPVKGD 208

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + +LH +G  D  + H  CPV+ G
Sbjct: 209 AVLFFSLHPNGTTDRDSLHGGCPVIEG 235


>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 296

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 56/176 (31%), Positives = 92/176 (52%), Gaps = 1/176 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P  +L  D +  +E + +  +A+PRL R+TV +  TG   +A +R S   + R  E P+
Sbjct: 101 RPAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNVVAGHRSSDGMFFRLGETPL 160

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVAT 209
           I R+  R+  +TGL     E LQ+++Y  G    PH D+   G     +S+  +G RV T
Sbjct: 161 IARLEARIAELTGLPVENGEGLQLLHYEAGAESTPHVDYLIAGNPANRESIARSGQRVGT 220

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +L Y++DV  GG T+F     S+ P +G A ++   +  G  D  + H + P+  G
Sbjct: 221 LLMYLNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLRAG 276


>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
          Length = 222

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 57/201 (28%), Positives = 97/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  LY + +   E + +  +A+P ++++TV +  TG  + +  R S   +L   +  +
Sbjct: 17  EPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTSSGMFLGRGQDKI 76

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y +G  YEPH+D+      + F +   G R+AT+
Sbjct: 77  IRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFH----DEFNTKNGGQRIATL 132

Query: 211 LFYMSDVAQGGATVF-------------------TSLNLSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG T+F                       L++ P+ G A  + ++   G  
Sbjct: 133 LMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWSMRPDGSL 192

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 193 DATSLHGGCPVIKG-NKWSST 212


>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
          Length = 321

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 61/231 (26%), Positives = 107/231 (46%), Gaps = 31/231 (13%)

Query: 61  AIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRA 120
           A  + L+ R   +  P+  ++         +PR  LY + +   E + +  +A+P ++++
Sbjct: 93  AFESGLEMRGGEKGEPWTEVLSW-------EPRAFLYHNFLSKEECEYLISLAKPHMKKS 145

Query: 121 TVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIG 180
           TV +  TG  + +  R S   +L   +  +I  I +R+   T +     E LQV++Y +G
Sbjct: 146 TVVDASTGGSKDSRVRTSSGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVG 205

Query: 181 GHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF--------------- 225
             YEPH+D+      + F +   G R+AT+L Y+SDV +GG T+F               
Sbjct: 206 QKYEPHFDYFH----DEFNTKNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNE 261

Query: 226 ----TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSLHST 272
                   L++ P+ G A  + ++   G  D  + H  CPV+ G N   ST
Sbjct: 262 LSECAKKGLAVKPKMGDALLFWSMRPDGSLDATSLHGGCPVIKG-NKWSST 311


>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 309

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 97/194 (50%), Gaps = 23/194 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  L++  + D+E + +  +A+ +L ++ V + ++G+  ++  R S   +L + +  V
Sbjct: 51  RPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQDEV 110

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI  R+   T L     E +Q+++Y  G  YEPHYD+       A   LG G+R+ATV
Sbjct: 111 VARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 166

Query: 211 LFYMSDVAQGGATVFTSLNL--------SLW-----------PEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG T+F    +          W           P KG A  + +LH     
Sbjct: 167 LMYLSDVGKGGETIFPEAEVGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDATT 226

Query: 252 DYYTRHAACPVLTG 265
           D  + H +CPV+ G
Sbjct: 227 DSDSLHGSCPVIEG 240


>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
 gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
          Length = 313

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/206 (29%), Positives = 101/206 (49%), Gaps = 22/206 (10%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
           ++  P +  +    PR  LY++ + D E D + ++++ +L ++ V + ++G+   +  R 
Sbjct: 44  VKFDPTRVTQLSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMVADNESGKSIQSEVRT 103

Query: 138 SKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANA 197
           S   +L + +  ++  I  R+   T L     E +QV++Y  G  YEPH+DF       A
Sbjct: 104 SSGMFLNKQQDEIVSGIEARIAAWTFLPVENGESMQVLHYMNGEKYEPHFDFFHD---KA 160

Query: 198 FKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTA 239
            + LG G+RVATVL Y+S+V +GG T+F      L       W           P KG A
Sbjct: 161 NQRLG-GHRVATVLMYLSNVEKGGETIFPHAEGKLSQPKDESWSECAHKGYAVKPRKGDA 219

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTG 265
             + +LH     D  + H +CPV+ G
Sbjct: 220 LLFFSLHLDATTDSKSLHGSCPVIEG 245


>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
          Length = 328

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 59/184 (32%), Positives = 95/184 (51%), Gaps = 16/184 (8%)

Query: 86  EEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE 145
           E    +PR  ++ + M + E D I  +A+P ++R+TV       +E    R S   +L+ 
Sbjct: 33  EPVSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAGGASVE-DQIRTSYGTFLKR 91

Query: 146 PEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGN 205
            + P++  + +R+   T L  S  E++Q++ YGIG  Y  HYD           SL   +
Sbjct: 92  LQDPIVTAVEQRLATWTKLNVSHQEDMQILRYGIGQKYGAHYD-----------SLDNDS 140

Query: 206 -RVATVLFYMSDVAQ--GGATVFTSL-NLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACP 261
            RV TVL Y+SDV    GG T F  +   +L+P+KG A  +++L   G  D Y+ H  CP
Sbjct: 141 PRVCTVLLYLSDVPADGGGETAFPGVRRQALYPKKGDALLFYSLKPDGTSDAYSLHTGCP 200

Query: 262 VLTG 265
           +++G
Sbjct: 201 IISG 204


>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 99/201 (49%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + + K+A+P ++++TV +  TG+ + +  R S   +L   +  +
Sbjct: 83  EPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTGKSKDSRVRTSSGTFLTRGQDKI 142

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T L     E LQ+++Y +G  YEPHYD+      + + +   G R+ATV
Sbjct: 143 IRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYF----LDDYNTKNGGQRMATV 198

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LS+ P+ G A  + ++      
Sbjct: 199 LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKEGLSVKPKMGDALLFWSMKPDASL 258

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 259 DPSSLHGGCPVIKG-NKWSST 278


>gi|194906709|ref|XP_001981416.1| GG11627 [Drosophila erecta]
 gi|190656054|gb|EDV53286.1| GG11627 [Drosophila erecta]
          Length = 462

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/215 (28%), Positives = 97/215 (45%), Gaps = 34/215 (15%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG   +PP+  + L+CRY     P+LRL  LK E+  ++P + L+ D +  +E + + +
Sbjct: 239 CRGK-NLPPS-KSSLRCRYFREGSPFLRLAALKLEQLNIEPFVGLFHDAILQAEQEDLLR 296

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           + + RL             +I + R+         +H  + RI +R+E +TG     +E 
Sbjct: 297 LTESRLEHK----------KIESSRVEAKVDTNASDH--VRRIHQRIEDITGFDLEGSEP 344

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           L V N+GIGG    H D  +P                     ++DV  GG   F  L   
Sbjct: 345 LTVSNHGIGGQEAIHLDCGQPK--------------------LNDVQMGGYASFPDLGFG 384

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             P +G+A  WHN  + G+ D     A CPVL G+
Sbjct: 385 FKPVRGSALVWHNTDNCGNCDIRGLQATCPVLLGN 419


>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
 gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
          Length = 297

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 60/202 (29%), Positives = 99/202 (49%), Gaps = 22/202 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P +  +   +PR  LY   + D+E D +  +A+  + ++ V +  +G+  ++  R S  A
Sbjct: 32  PARVTQLSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVADNDSGKSLMSQVRTSSGA 91

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           +L + E  ++  I +RV   T L    AE +QV+ Y IG  Y+ H+D+    + N  K  
Sbjct: 92  FLAKHEDEIVSAIEKRVAAWTFLPEENAESMQVLRYEIGQKYDAHFDYFH--DKNNVKH- 148

Query: 202 GTGNRVATVLFYMSDVAQGGATVF------------------TSLNLSLWPEKGTAAFWH 243
             G R ATVL Y++DV +GG TVF                  +   L++ P+KG A  + 
Sbjct: 149 -GGQRFATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFF 207

Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
            LH +   D  + H +CPV+ G
Sbjct: 208 GLHLNATTDTSSLHGSCPVIEG 229


>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
          Length = 288

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 99/201 (49%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + + K+A+P ++++TV +  TG+ + +  R S   +L   +  +
Sbjct: 83  EPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTGKSKDSRVRTSSGTFLTRGQDKI 142

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T L     E LQ+++Y +G  YEPHYD+      + + +   G R+ATV
Sbjct: 143 IRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYF----LDDYNTKNGGQRMATV 198

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LS+ P+ G A  + ++      
Sbjct: 199 LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSXCGKEGLSVKPKMGDALLFWSMKPDASL 258

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 259 DPSSLHGGCPVIKG-NKWSST 278


>gi|194745802|ref|XP_001955376.1| GF16267 [Drosophila ananassae]
 gi|190628413|gb|EDV43937.1| GF16267 [Drosophila ananassae]
          Length = 385

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 67/210 (31%), Positives = 97/210 (46%), Gaps = 34/210 (16%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG  TVP      L+CRY     P+L+L PLK E+  L P I ++ DV+   E   +  
Sbjct: 51  CRGRNTVPKKFY--LRCRYFTEGDPFLQLAPLKLEQLNLDPFIGIFHDVISIGEQKNLIN 108

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           + + RLR   +QN +   +E A   ++ S          +ERI RR+E MTGL    +  
Sbjct: 109 LTRNRLR---LQNPQRAVME-AEVELNASK--------EVERIHRRIEDMTGLNLEESPP 156

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           L ++NYGIGG +  H D  +                    F +SDV  GG   F  L   
Sbjct: 157 LTILNYGIGGQHPIHLDCEQ--------------------FMLSDVQMGGYASFPELGFG 196

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACP 261
             P +G+A   HN+ ++ + D  +  A CP
Sbjct: 197 FKPSRGSALVVHNMDNAANCDIRSLQATCP 226


>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 304

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 59/206 (28%), Positives = 102/206 (49%), Gaps = 26/206 (12%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K ++    PR  +Y   + D E D +  +A+  L+R++V +  +G+ +++  R S  A
Sbjct: 38  PAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSSGA 97

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + + P++  I  ++   T L     E++QV+ Y  G  Y+ H+D+     A+     
Sbjct: 98  FIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYF----ADKVNIA 153

Query: 202 GTGNRVATVLFYMSDVAQGGATVF--------------TSLNLS--------LWPEKGTA 239
             G+R+ATVL Y+SDV +GG TVF              T+ +LS        + P KG A
Sbjct: 154 RGGHRMATVLMYLSDVEKGGETVFLLRRSESQRRQASETNEDLSDCAKKGIAVKPRKGDA 213

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTG 265
             + +LH +   D  + H  CPV+ G
Sbjct: 214 LLFFSLHPNAIPDTSSLHGGCPVIEG 239


>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
 gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
          Length = 196

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 84/157 (53%), Gaps = 5/157 (3%)

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           + Q  LR ATV + +TG+      R+S+ AW +  +HP+++ ++  +  +TG+     E 
Sbjct: 31  IGQSLLRPATVTDEQTGQEVAHGERVSEMAWPKRDDHPILQSLAEGIAQLTGIPIDCQEP 90

Query: 172 LQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
           LQ+++Y  GG Y+PHYD FA    A+A      GNR  T++ Y++ V +GG T F  L L
Sbjct: 91  LQILHYRPGGEYKPHYDAFA----ADAPTLRQGGNRQGTLILYLNAVEEGGETAFPELGL 146

Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
            + P  G   F+ NL+  G     + HA  PV  G  
Sbjct: 147 QVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRKGEK 183


>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
          Length = 287

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 61/205 (29%), Positives = 101/205 (49%), Gaps = 17/205 (8%)

Query: 76  PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
           P   L   K E+   +PR  +Y + + D E + +K++A+ RL ++TV + KTG+   +  
Sbjct: 29  PPQELWRGKVEQVSWRPRAFVYHNFLSDEECEHLKELARKRLTKSTVVDNKTGKSMDSTV 88

Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           R S   +L   E  V+  I +R+  +T +     E +Q++ Y  G  YEPH D+    + 
Sbjct: 89  RTSSGTFLARGEDEVVRAIEKRISLVTMIPEENGEAIQILKYVDGQKYEPHTDYFH--DK 146

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF----TSLNLSLWPE-----------KGTAA 240
              ++   G RVAT+L Y+S   +GG TVF      +    W E           KG+A 
Sbjct: 147 YNSRTENGGQRVATILMYLSTPEEGGETVFPYAEKKVEGEGWSECARKGLAVKAVKGSAL 206

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
            +++L  +G+ D  + H +CP L G
Sbjct: 207 LFYSLKPNGEEDQASTHGSCPTLAG 231


>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 297

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 96/192 (50%), Gaps = 19/192 (9%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR  L ++ + D E D I + A+P++ +++V + ++G+   +  R S   W  + E  VI
Sbjct: 49  PRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWFAKGEDSVI 108

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATV 210
            +I +RV  +T +     E LQV++Y  G  YEPHYD F  P   NA    G G RV T+
Sbjct: 109 SKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDP--VNAGPEHG-GQRVVTM 165

Query: 211 LFYMSDVAQGGATVFTSL---------------NLSLWPEKGTAAFWHNLHSSGDGDYYT 255
           L Y++ V +GG TV  +                 L++ P KG A  +++L   G  D  +
Sbjct: 166 LMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGSNDPAS 225

Query: 256 RHAACPVLTGSN 267
            H +CP L G  
Sbjct: 226 LHGSCPTLKGDK 237


>gi|194765140|ref|XP_001964685.1| GF23318 [Drosophila ananassae]
 gi|190614957|gb|EDV30481.1| GF23318 [Drosophila ananassae]
          Length = 412

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 61/206 (29%), Positives = 94/206 (45%), Gaps = 47/206 (22%)

Query: 65  QLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQN 124
           +L C Y     P+LR+ P K E+  L P ++++ DV+   EI  +  +   +L +A   N
Sbjct: 221 RLMCYYNSSTTPFLRIAPFKTEQIGLDPYVVVFHDVLSPREISKLISLTDRKLVQAVTVN 280

Query: 125 YKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYE 184
            K+ +  +   R +K+ W+      + +RI RR+  M+G   + AE  Q           
Sbjct: 281 KKSFKEMV---RTAKAHWVYRGYQELTKRIYRRIHDMSGFELADAENFQ----------- 326

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLN----LSLWPEKGTAA 240
                                        +SDV QGGATVF  ++     +++P  GTAA
Sbjct: 327 -----------------------------LSDVEQGGATVFPGISADSAYTVYPRAGTAA 357

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTGS 266
            W+NLH+ G GD  T H ACPV+ GS
Sbjct: 358 MWYNLHTDGLGDPTTLHVACPVIVGS 383


>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
          Length = 303

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 58/183 (31%), Positives = 92/183 (50%), Gaps = 11/183 (6%)

Query: 88  AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
           A  QPR++++ +++   E D +   A PR+ R+     KTG  EI + R S   + +  +
Sbjct: 112 AIAQPRVVVFGNLLSPEECDALIADAAPRMARSLTVATKTGGEEINDDRTSDGMFFQRGQ 171

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTG 204
            P+I+RI  R+  +        E LQV++Y  G  Y+PHYD+   A PG     K    G
Sbjct: 172 SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTIVKR--GG 229

Query: 205 NRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPV 262
            RV T++ Y++   +GG T F  +++ + P++G A F  +   H S      T H   PV
Sbjct: 230 QRVGTLVMYLNTPEKGGGTTFPDVHVEVAPQRGNAVFFSYERPHPS----TRTLHGGAPV 285

Query: 263 LTG 265
           L G
Sbjct: 286 LAG 288


>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
           nagariensis]
 gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
           nagariensis]
          Length = 329

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 66/211 (31%), Positives = 105/211 (49%), Gaps = 30/211 (14%)

Query: 74  NVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIA 133
           +VP  R++ L       QPR+ LY+ ++   E D + K+AQ RL R+ V +  TGE  ++
Sbjct: 44  DVPDSRMVVLS-----WQPRVFLYKGILTQEECDYLIKIAQGRLERSGVSDATTGEGGVS 98

Query: 134 NYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARP 192
           + R S   +    E+ V++RI  R+   T L     E +QV+ Y     Y+PH+D F+  
Sbjct: 99  DIRTSSGMFYTRGENDVVKRIETRLAMWTMLPVENGEGIQVLRYEKTQKYDPHHDYFSFE 158

Query: 193 G-EANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-----------------NLSLWP 234
           G +AN       GNR+ATVL Y++   +GG TVF  +                  L++ P
Sbjct: 159 GRDANG------GNRMATVLMYLATPEEGGETVFPKIPVPAGQTRANFSECGMKGLAVKP 212

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            KG A  + ++   G  +  + H +CPV+ G
Sbjct: 213 VKGDAVLFWSIRPDGRFEPGSLHGSCPVIRG 243


>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
 gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
          Length = 299

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 51/175 (29%), Positives = 94/175 (53%), Gaps = 3/175 (1%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR++++ +++ + E D I   A+PR++R+   + ++G   + + R S   + +  E+ +I
Sbjct: 112 PRVVVFGNLLSNEECDAIIAAARPRMQRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 171

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
            R+ +R+  +        E +QV++Y  G  Y+PHYD+  P E      L   G RV T+
Sbjct: 172 SRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 231

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + Y+++ A+GGAT F  + L + P +G A F+   ++  +    T H   PVL G
Sbjct: 232 VMYLNEPARGGATTFPDVGLQVVPRRGNAVFFS--YNRPEPATKTLHGGAPVLEG 284


>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 301

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 59/205 (28%), Positives = 102/205 (49%), Gaps = 25/205 (12%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K ++   +PR  +Y+  + D E D +  +A+  L+R+ V +  +GE +++  R S   
Sbjct: 37  PTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGM 96

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + +  ++  I  ++   T L     E++QV+ Y  G  Y+PHYD+     A+     
Sbjct: 97  FISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVNIA 152

Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGTAA 240
             G+RVATVL Y+++V +GG TVF             T  +LS        + P +G A 
Sbjct: 153 RGGHRVATVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDLSECGKKGVAVKPRRGDAL 212

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
            + +LH +   D  + HA CPV+ G
Sbjct: 213 LFFSLHPNAIPDTLSLHAGCPVIEG 237


>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 316

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 61/202 (30%), Positives = 99/202 (49%), Gaps = 22/202 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P +  +    PR  LY+  + D E D   K+A+ +L ++ V +  +GE   +  R S   
Sbjct: 53  PTRVTQLSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 112

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           +L + +  ++  +  ++   T +     E +Q+++Y  G  YEPH+D+    +AN    L
Sbjct: 113 FLSKRQDDIVANVEAKLAAWTFIPEENGESMQILHYENGQKYEPHFDYFHD-QANL--EL 169

Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------TSLNLSLW-----------PEKGTAAFWH 243
           G G+R+ATVL Y+S+V +GG TVF       T L    W           P KG A  + 
Sbjct: 170 G-GHRIATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTECAKQGYAVKPRKGDALLFF 228

Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
           NLH +   D  + H +CPV+ G
Sbjct: 229 NLHPNATTDSNSLHGSCPVVEG 250


>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
 gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
          Length = 300

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 103/207 (49%), Gaps = 25/207 (12%)

Query: 80  LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
           + P K ++   +PR  +Y   + D E D +  +A+  L+R+ V + ++G+ +++  R S 
Sbjct: 34  INPAKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSS 93

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
             ++ + + P++  I  ++   T L     E++QV+ Y  G  Y+PHYD+     ++   
Sbjct: 94  GMFITKAKDPIVAGIEDKIATWTFLPRENGEDIQVLRYEHGQKYDPHYDYF----SDKVN 149

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVFTSL---------------------NLSLWPEKGT 238
               G+RVATVL Y++DV +GG TVF S                       +++ P +G 
Sbjct: 150 IARGGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKPRRGD 209

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + +L+ +   D  + HA CPV+ G
Sbjct: 210 ALLFFSLYPTAVPDTSSIHAGCPVIEG 236


>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 303

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 57/207 (27%), Positives = 101/207 (48%), Gaps = 27/207 (13%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K ++   +PR  +Y+  + D E D +  +A+  L+R+ V +  +GE +++  R S   
Sbjct: 37  PTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGM 96

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + +  ++  I  ++   T L     E++QV+ Y  G  Y+PHYD+     A+     
Sbjct: 97  FISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVNIA 152

Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSLNL-----------------------SLWPEKGT 238
             G+RVATVL Y+++V +GG TVF +  L                       ++ P +G 
Sbjct: 153 RGGHRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDLSECGKKGVAVKPRRGD 212

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + +LH +   D  + HA CPV+ G
Sbjct: 213 ALLFFSLHPNAIPDTLSLHAGCPVIEG 239


>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
          Length = 299

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 103/207 (49%), Gaps = 25/207 (12%)

Query: 80  LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
           + P K ++    PR  +Y+  + D E D +  +A+  L+R+ V +  +G+ ++++ R S 
Sbjct: 32  INPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSS 91

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
             ++ + + P++  I  R+   T L     E++QV+ Y  G  Y+PHYD+     A+   
Sbjct: 92  GMFISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVN 147

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGT 238
            +  G+R+ATVL Y+++V +GG TVF                         +++ P +G 
Sbjct: 148 IVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRGD 207

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + +L ++   D  + HA CPVL G
Sbjct: 208 ALLFFSLDTNAIPDTNSLHAGCPVLEG 234


>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 318

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 60/202 (29%), Positives = 97/202 (48%), Gaps = 22/202 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P +  +   +PR  +YR+ + D E D    +A+ +L ++ V + ++G+   +  R S   
Sbjct: 59  PTRVTQISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESGKSVESEVRTSSGM 118

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           + R+ +  V+  +  R+   T L     E +Q+++Y  G  YEPH+D+         + L
Sbjct: 119 FFRKAQDQVVANVEARIAAWTFLPEENGESIQILHYEHGQKYEPHFDYFHD---KVNQEL 175

Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------TSLNLSLW-----------PEKGTAAFWH 243
           G G+RVATVL Y+SDV +GG TVF       T      W           P KG A  + 
Sbjct: 176 G-GHRVATVLMYLSDVEKGGETVFPNSEAKKTQAKGDDWSDCAKKGYAVKPRKGDALLFF 234

Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
           +LH     D  + H +CPV+ G
Sbjct: 235 SLHPDATTDPLSLHGSCPVIEG 256


>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
 gi|224031897|gb|ACN35024.1| unknown [Zea mays]
 gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
 gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
          Length = 299

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 59/193 (30%), Positives = 97/193 (50%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  L++  + D+E D +  +A+ +L ++ V + ++G+   +  R S   +L   +  V
Sbjct: 42  RPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSEVRTSSGMFLERKQDEV 101

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI  R+   T L     E +Q+++Y  G  YEPHYD+    +  A   LG G+R+ATV
Sbjct: 102 VTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQA---LG-GHRIATV 157

Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
           L Y+S+V +GG T+F +    L       W           P KG A  + +LH     D
Sbjct: 158 LMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDATTD 217

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 218 SDSLHGSCPVIEG 230


>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
 gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
          Length = 294

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 91/175 (52%), Gaps = 3/175 (1%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+++ +++   E D I   A+PR+ R+     ++G  EI + R S   + +  E  ++
Sbjct: 107 PRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQRGETGIV 166

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
            ++  R+  +        E LQV++YG G  Y+PH+D+  PGE      L   G RV T+
Sbjct: 167 SQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRVGTL 226

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + Y+++  +GGAT+F  + L + P +G A F+   +   D    T H   PVL G
Sbjct: 227 VIYLNEPERGGATIFPEVPLQVVPRRGNAVFFS--YERPDPSTRTLHGGAPVLAG 279


>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 299

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 103/207 (49%), Gaps = 25/207 (12%)

Query: 80  LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
           + P K ++    PR  +Y+  + D E D +  +A+  L+R+ V +  +G+ ++++ R S 
Sbjct: 32  INPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSS 91

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
             ++ + + P++  I  R+   T L     E++QV+ Y  G  Y+PHYD+     A+   
Sbjct: 92  GMFISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVN 147

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGT 238
            +  G+R+ATVL Y+++V +GG TVF                         +++ P +G 
Sbjct: 148 IVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRGD 207

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + +L ++   D  + HA CPVL G
Sbjct: 208 ALLFFSLDTNAIPDTNSLHAGCPVLEG 234


>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
 gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
          Length = 294

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 91/175 (52%), Gaps = 3/175 (1%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PRI+++ +++   E D I   A+PR+ R+     ++G  EI + R S   + +  E  ++
Sbjct: 107 PRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQRGETGIV 166

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
            ++  R+  +        E LQV++YG G  Y+PH+D+  PGE      L   G RV T+
Sbjct: 167 SQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRVGTL 226

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + Y+++  +GGAT+F  + L + P +G A F+   +   D    T H   PVL G
Sbjct: 227 VIYLNEPERGGATIFPEVPLQVVPRRGNAVFFS--YERPDPSTRTLHGGAPVLAG 279


>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
          Length = 290

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 56/201 (27%), Positives = 99/201 (49%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P + +++V + +TG+ + +  R S   +L      +
Sbjct: 85  EPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDSETGKSKDSRVRTSSGTFLARGRDKI 144

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I +R+ H + +     E LQV++Y +G  YEPHYD+      + F +   G R+ATV
Sbjct: 145 VRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYF----LDDFNTKNGGQRIATV 200

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y++DV +GG TVF +                     LS+ P++G A  + ++      
Sbjct: 201 LMYLTDVEEGGETVFPAAKGNFSSVPWWNELSECGKKGLSIKPKRGDALLFWSMKPDATL 260

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 261 DPSSLHGGCPVIKG-NKWSST 280


>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
 gi|194694488|gb|ACF81328.1| unknown [Zea mays]
 gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
 gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 59/193 (30%), Positives = 96/193 (49%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  L++  + D+E D +  +A+ +L ++ V + K+G+   +  R S   +L + +  V
Sbjct: 41  RPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGMFLEKKQDEV 100

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI  R+   T L     E +Q+++Y  G  YEPHYD+       A   LG G+R+ATV
Sbjct: 101 VTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 156

Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
           L Y+S+V +GG T+F +    L       W           P KG A  + +LH     D
Sbjct: 157 LMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTD 216

Query: 253 YYTRHAACPVLTG 265
             + H +CP + G
Sbjct: 217 SDSLHGSCPAIEG 229


>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
 gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
          Length = 211

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 53/175 (30%), Positives = 94/175 (53%), Gaps = 10/175 (5%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P I+ + +V+ D E   +   A  RL R+     K  + EI++ R S   +  E E+P+
Sbjct: 29  EPLIVKFLNVLSDEECQNLIDCASSRLERS-----KLAKKEISSIRTSSGMFFEENENPL 83

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+  +  L    AE LQV++Y  G  ++ H+DF  P   ++     + NR++T+
Sbjct: 84  ISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKAHFDFFGPNHPSS-----SNNRISTL 138

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + Y++DV +GG T F +L +   P+KGTA ++   ++    +  T H+  PV+ G
Sbjct: 139 VVYLNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHSGEPVIQG 193


>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
 gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
          Length = 209

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 95/175 (54%), Gaps = 11/175 (6%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P I++  +V+  +E DL+  +A  R++RA + +      +++  R S S +  E E+  
Sbjct: 31  EPLILILDNVLSWAECDLLIDLASARMQRAKIGSSH----DVSEVRTSSSMFFEESENEC 86

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I ++  RV  +  +  S AE LQV+ Y  G  Y PH+D+   G +         NR++T+
Sbjct: 87  IGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFTQGSS-------MNNRISTL 139

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + Y++DV +GG T F SL+ S+ P+KG+A ++   ++    +  T HA  PV  G
Sbjct: 140 VMYLNDVEEGGETYFPSLHFSVTPKKGSAVYFEYFYNDTRLNELTLHAGHPVEAG 194


>gi|195574593|ref|XP_002105269.1| GD21390 [Drosophila simulans]
 gi|194201196|gb|EDX14772.1| GD21390 [Drosophila simulans]
          Length = 478

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 63/216 (29%), Positives = 97/216 (44%), Gaps = 34/216 (15%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG   +P    + L+CRY+    P+LRL P+K E+   +P + L+ D +  +E + +  
Sbjct: 255 CRGKNLLPSK--SYLRCRYLRDGSPFLRLAPVKLEQLNFEPFVGLFHDAISPAEQEDLLH 312

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +   RL       +   E      ++  +A     +H  + R+ +R+E +TG     +E 
Sbjct: 313 LTDSRLE------HTRKESSSVEAKVDTNA----SDH--VRRMHQRIEDITGFEMEESEP 360

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           L V NYGIGG    H D  +P                     +SDV  GG   F  L   
Sbjct: 361 LTVFNYGIGGQELIHLDCEQPE--------------------LSDVQMGGYASFPDLGFG 400

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
             P +G+A  WHN  +SG+ D  +  A CPVL G+ 
Sbjct: 401 FKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQ 436


>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
 gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
 gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
          Length = 233

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 96/192 (50%), Gaps = 19/192 (9%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR  L ++ + D E D I + A+P++ +++V + ++G+   +  R S   W  + E  VI
Sbjct: 29  PRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWFAKGEDSVI 88

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATV 210
            +I +RV  +T +     E LQV++Y  G  YEPHYD F  P   NA    G G RV T+
Sbjct: 89  SKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDP--VNAGPEHG-GQRVVTM 145

Query: 211 LFYMSDVAQGGATVFTSL---------------NLSLWPEKGTAAFWHNLHSSGDGDYYT 255
           L Y++ V +GG TV  +                 L++ P KG A  +++L   G  D  +
Sbjct: 146 LMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGSNDPAS 205

Query: 256 RHAACPVLTGSN 267
            H +CP L G  
Sbjct: 206 LHGSCPTLKGDK 217


>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 287

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 60/201 (29%), Positives = 98/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P + ++TV + +TG+ + +  R S   +LR     +
Sbjct: 82  EPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGRDKI 141

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I+ I +R+   T +     E LQV++Y  G  YEPHYD+      + F +   G R+AT+
Sbjct: 142 IKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYF----VDEFNTKNGGQRMATM 197

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF + N                   LS+ P  G A  + ++      
Sbjct: 198 LMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATL 257

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 258 DPTSLHGGCPVIRG-NKWSST 277


>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
           from Gallus gallus gi|212530 [Arabidopsis thaliana]
 gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
 gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 287

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 60/201 (29%), Positives = 98/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P + ++TV + +TG+ + +  R S   +LR     +
Sbjct: 82  EPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGRDKI 141

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I+ I +R+   T +     E LQV++Y  G  YEPHYD+      + F +   G R+AT+
Sbjct: 142 IKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYF----VDEFNTKNGGQRMATM 197

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF + N                   LS+ P  G A  + ++      
Sbjct: 198 LMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATL 257

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 258 DPTSLHGGCPVIRG-NKWSST 277


>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
          Length = 225

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 96/192 (50%), Gaps = 19/192 (9%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR  L ++ + D E D I + A+P++ +++V + ++G+   +  R S   W  + E  VI
Sbjct: 21  PRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWFAKGEDSVI 80

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATV 210
            +I +RV  +T +     E LQV++Y  G  YEPHYD F  P   NA    G G RV T+
Sbjct: 81  SKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDP--VNAGPEHG-GQRVVTM 137

Query: 211 LFYMSDVAQGGATVFTSL---------------NLSLWPEKGTAAFWHNLHSSGDGDYYT 255
           L Y++ V +GG TV  +                 L++ P KG A  +++L   G  D  +
Sbjct: 138 LMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGSNDPAS 197

Query: 256 RHAACPVLTGSN 267
            H +CP L G  
Sbjct: 198 LHGSCPTLKGDK 209


>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
           Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
           Dicarboxylate
 gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
           Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
           Dicarboxylate
          Length = 224

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 96/192 (50%), Gaps = 19/192 (9%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR  L ++ + D E D I + A+P++ +++V + ++G+   +  R S   W  + E  VI
Sbjct: 20  PRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWFAKGEDSVI 79

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATV 210
            +I +RV  +T +     E LQV++Y  G  YEPHYD F  P   NA    G G RV T+
Sbjct: 80  SKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDP--VNAGPEHG-GQRVVTM 136

Query: 211 LFYMSDVAQGGATVFTSL---------------NLSLWPEKGTAAFWHNLHSSGDGDYYT 255
           L Y++ V +GG TV  +                 L++ P KG A  +++L   G  D  +
Sbjct: 137 LMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGSNDPAS 196

Query: 256 RHAACPVLTGSN 267
            H +CP L G  
Sbjct: 197 LHGSCPTLKGDK 208


>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
 gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
           ferrooxidans ATCC 23270]
          Length = 248

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 54/155 (34%), Positives = 85/155 (54%), Gaps = 5/155 (3%)

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           + Q  LR ATV + +TG+      R+S+ AW +  ++P+++ ++  +  +TG+     E 
Sbjct: 83  IGQSLLRPATVTDEQTGQEVAHGERVSEMAWPKRDDYPILQSLAEGIAQLTGIPIDCQEP 142

Query: 172 LQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
           LQ+++Y  GG Y+PHYD FA    A+A      GNR AT++ Y++ V +GG T F  L L
Sbjct: 143 LQILHYRPGGEYKPHYDAFA----ADAPTLRQGGNRQATLILYLNAVEEGGETAFPELGL 198

Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            + P  G   F+ NL+  G     + HA  PV  G
Sbjct: 199 QVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRKG 233


>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
 gi|255647110|gb|ACU24023.1| unknown [Glycine max]
          Length = 289

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 56/201 (27%), Positives = 98/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P + ++TV + +TG+ + +  R S   +L      +
Sbjct: 84  EPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSRVRTSSGTFLARGRDKI 143

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I +++   T +     E LQV++Y +G  YEPHYD+      + F +   G R+ATV
Sbjct: 144 VRNIEKKISDFTFIPVEHGEGLQVLHYEVGQKYEPHYDYF----LDDFNTKNGGQRIATV 199

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y++DV +GG TVF +                     LS+ P++G A  + ++      
Sbjct: 200 LMYLTDVEEGGETVFPAAKGNFSFVPWWNELFECGKKGLSIKPKRGDALLFWSMKPDASL 259

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 260 DPSSLHGGCPVIKG-NKWSST 279


>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 289

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 56/201 (27%), Positives = 98/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P + ++TV + +TG+ + +  R S   +L      +
Sbjct: 84  EPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSRVRTSSGTFLARGRDKI 143

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I +++   T +     E LQV++Y +G  YEPHYD+      + F +   G R+ATV
Sbjct: 144 VRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYF----LDEFNTKNGGQRIATV 199

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y++DV +GG TVF +                     LS+ P++G A  + ++      
Sbjct: 200 LMYLTDVEEGGETVFPAAKGNFSNVPWYNELSDCGKKGLSIKPKRGDALLFWSMKPDATL 259

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 260 DASSLHGGCPVIKG-NKWSST 279


>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Glycine max]
          Length = 297

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 56/201 (27%), Positives = 101/201 (50%), Gaps = 21/201 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K ++   +PR  +Y   + + E D +  +A+  L+R+ V +  +GE +++  R S   
Sbjct: 37  PSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGM 96

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + + P++  +  ++   T L     E++QV+ Y  G  Y+PHYD+     A+     
Sbjct: 97  FIPKNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVNIA 152

Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSLNL-----------------SLWPEKGTAAFWHN 244
             G+RVATVL Y++DV +GG TVF +  L                 ++ P +G A  + +
Sbjct: 153 RGGHRVATVLMYLTDVTKGGETVFPNAELKSSETKEDLSECAQKGIAVKPRRGDALLFFS 212

Query: 245 LHSSGDGDYYTRHAACPVLTG 265
           L+ +   D  + HA CPV+ G
Sbjct: 213 LYPNAIPDTMSLHAGCPVIEG 233


>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
          Length = 1062

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 58/202 (28%), Positives = 101/202 (50%), Gaps = 22/202 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P +  +   +PR  LY   +   E D +  +A+ R+ ++ V +  +G+  ++  R S   
Sbjct: 34  PARVTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGT 93

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           +L + E  ++  I +RV   T L    AE +Q+++Y +G  Y+ H+D+    + N  K  
Sbjct: 94  FLSKHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFH--DKNNLKR- 150

Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWH 243
             G+RVATVL Y++DV +GG TVF +                    L++ P+KG A  + 
Sbjct: 151 -GGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFF 209

Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
           +LH +   D  + H +CPV+ G
Sbjct: 210 SLHVNATTDPASLHGSCPVIEG 231


>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 211

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 61/196 (31%), Positives = 97/196 (49%), Gaps = 23/196 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  LY   +   E + + ++A+P L ++TV +  TG+ + +  R S   +L   +  +
Sbjct: 8   EPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSRVRTSSGTFLVRGQDHI 67

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I+RI +R+   T +     E LQV+ Y     YEPHYD+      +AF +   G R+ATV
Sbjct: 68  IKRIEKRIADFTFIPVEQGEGLQVLQYRESEKYEPHYDYFH----DAFNTKNGGQRIATV 123

Query: 211 LFYMSDVAQGGATVF--TSLN-----------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF  + +N                 LS+ P  G A  + ++      
Sbjct: 124 LMYLSDVEKGGETVFPASKVNASEVPDWDQRSECAKRGLSVRPRMGDALLFWSMKPDAKL 183

Query: 252 DYYTRHAACPVLTGSN 267
           D  + H ACPV+ G+ 
Sbjct: 184 DPTSLHGACPVIQGTK 199


>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
 gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
          Length = 307

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 58/201 (28%), Positives = 98/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P + ++TV +  TG+ + +  R S   +L+   + V
Sbjct: 102 EPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRNKV 161

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y +G  YEPH+D+      + F +   G R+AT+
Sbjct: 162 IRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRIATL 217

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG T+F   N                   LS+ P+ G A  + ++      
Sbjct: 218 LMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDATL 277

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 278 DPLSLHGGCPVIKG-NKWSST 297


>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
 gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
          Length = 263

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 61/203 (30%), Positives = 100/203 (49%), Gaps = 19/203 (9%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
           LRL  +K E     PRII++ + +   E D +  +A+PRL+ +TV +  TG+   ++ R 
Sbjct: 50  LRLRYVKPEVISWTPRIIIFHNFLSSEECDYLMAIARPRLQMSTVVDVATGKGVKSDVRT 109

Query: 138 SKSAWLREPEH--PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           S   ++   E   PVI+ I +R+   + +     E +QV+ Y    +Y PH+D+     +
Sbjct: 110 SSGMFVNSEERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYF----S 165

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTAAFW 242
           + F     G RVAT+L Y++D  +GG T F                 L + P KG A  +
Sbjct: 166 DTFNLKRGGQRVATMLMYLTDGVEGGETHFLQAGDGECSCGGNVVKGLCVKPNKGDAVLF 225

Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
            ++   G+ D  + H+ CPVL G
Sbjct: 226 WSMGLDGNTDPNSIHSGCPVLKG 248


>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 98/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P + ++TV + +TG+ + +  R S   +LR     +
Sbjct: 82  EPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGRDKI 141

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I+ I +R+   T +     E LQ+++Y  G  YEPHYD+      + F +   G R+AT+
Sbjct: 142 IKTIEKRIADYTFIPADHGEGLQILHYEAGQKYEPHYDYF----VDEFNTKNGGQRMATM 197

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF + N                   LS+ P  G A  + ++      
Sbjct: 198 LMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATL 257

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 258 DPTSLHGGCPVIRG-NKWSST 277


>gi|195069793|ref|XP_001997027.1| GH12976 [Drosophila grimshawi]
 gi|193891496|gb|EDV90362.1| GH12976 [Drosophila grimshawi]
          Length = 83

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Composition-based stats.
 Identities = 43/53 (81%), Positives = 46/53 (86%)

Query: 214 MSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           MSDV QGGATVFTSL  +LWP+KGTAAFW NLH SG+GD  TRHAACPVLTGS
Sbjct: 1   MSDVQQGGATVFTSLRTALWPKKGTAAFWMNLHRSGEGDARTRHAACPVLTGS 53


>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
 gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
          Length = 289

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 52/152 (34%), Positives = 82/152 (53%), Gaps = 1/152 (0%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR+++   ++ D E D + ++++PRLRR+T  + +TG  ++   R S+  +     HPV 
Sbjct: 102 PRVVVLGGLLSDEECDALVELSRPRLRRSTTVDAQTGGSQVHADRTSRGTFFERGAHPVC 161

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-GTGNRVATV 210
             I  R+  +        E LQV++Y  G  + PHYD+  P E  A   L   G RVATV
Sbjct: 162 ATIEARIARLLEWPVENGEGLQVLHYPPGAEFRPHYDYFDPDEPGAEVLLRQGGQRVATV 221

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
           + Y++  A+GGAT F   +L +   KG A F+
Sbjct: 222 VMYLNTPARGGATTFPDAHLEVAAVKGNAVFF 253


>gi|226479086|emb|CAX73038.1| Proline HYdroxylase [Schistosoma japonicum]
          Length = 437

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 60/148 (40%), Positives = 85/148 (57%), Gaps = 8/148 (5%)

Query: 4   PTHQRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIV 63
           PT++RA  N+ YY E L++        P+  + A +    E E YE LCR +   P    
Sbjct: 291 PTNERAINNEAYYVEQLDRGEGRLGPNPR--SQATSKHDQETELYESLCRDENPFPTVPS 348

Query: 64  AQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ 123
             L CRY   +  Y R+ P+KEE  Y  PRI+++ D+++ SEI+ IK +A PRLRRATV+
Sbjct: 349 HYLTCRYYTPHAFY-RIGPVKEETLYPDPRIVMWYDLIFPSEIEKIKDLATPRLRRATVK 407

Query: 124 NYKTGELEIANYRISKS-----AWLREP 146
           N  TG LE+A YR SK+      W++ P
Sbjct: 408 NPITGNLEVAFYRTSKALGFRILWMKSP 435


>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
           expressed [Oryza sativa Japonica Group]
 gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
          Length = 299

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 58/202 (28%), Positives = 101/202 (50%), Gaps = 22/202 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P +  +   +PR  LY   +   E D +  +A+ R+ ++ V +  +G+  ++  R S   
Sbjct: 34  PARVTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGT 93

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           +L + E  ++  I +RV   T L    AE +Q+++Y +G  Y+ H+D+    + N  K  
Sbjct: 94  FLSKHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFH--DKNNLKR- 150

Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWH 243
             G+RVATVL Y++DV +GG TVF +                    L++ P+KG A  + 
Sbjct: 151 -GGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFF 209

Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
           +LH +   D  + H +CPV+ G
Sbjct: 210 SLHVNATTDPASLHGSCPVIEG 231


>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
 gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
          Length = 288

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 98/193 (50%), Gaps = 23/193 (11%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRA-TVQNYKTGELEIANYRISKSAWLREPEHPV 150
           PR  LY+  + D E D + K+A+ +L ++  V +  +GE E +  R S   +L + +  +
Sbjct: 39  PRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDI 98

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  +  ++   T L     E LQ+++Y  G  Y+PH+D+    +A     LG G+R+ATV
Sbjct: 99  VANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKA---LELG-GHRIATV 154

Query: 211 LFYMSDVAQGGATVFTS-------LNLSLW-----------PEKGTAAFWHNLHSSGDGD 252
           L Y+S+V +GG TVF +       L    W           P KG A  + NLH +G  D
Sbjct: 155 LMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTD 214

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 215 PNSLHGSCPVIEG 227


>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
 gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
          Length = 283

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 103/208 (49%), Gaps = 19/208 (9%)

Query: 73  RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEI 132
           ++   LR+  +K E     PRII+  D +   E + +K +A+PRL+ +TV + KTG+   
Sbjct: 65  KDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVK 124

Query: 133 ANYRISKSAWLR--EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
           ++ R S   +L   E  +P+I+ I +R+   + +     E +QV+ Y     Y+PH+D+ 
Sbjct: 125 SDVRTSSGMFLTHVERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYEPKQFYKPHHDYF 184

Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLSLWPEKG 237
               A+ F     G RVAT+L Y++D  +GG T F                 +S+ P KG
Sbjct: 185 ----ADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMKGISVKPTKG 240

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            A  + ++   G  D  + H  C VL+G
Sbjct: 241 DAVLFWSMGLDGQSDPRSIHGGCEVLSG 268


>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
          Length = 299

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 102/207 (49%), Gaps = 25/207 (12%)

Query: 80  LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
           + P K ++    PR  +Y+  + D E D +  +A+  L+R+ V +  +G+ ++++ R S 
Sbjct: 32  INPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSS 91

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
              + + + P++  I  R+   T L     E++QV+ Y  G  Y+PHYD+     A+   
Sbjct: 92  GMLISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVN 147

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGT 238
            +  G+R+ATVL Y+++V +GG TVF                         +++ P +G 
Sbjct: 148 IVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRGD 207

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + +L ++   D  + HA CPVL G
Sbjct: 208 ALLFFSLDTNAIPDTNSLHAGCPVLEG 234


>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
 gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
 gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
 gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
 gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
 gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
          Length = 283

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 103/208 (49%), Gaps = 19/208 (9%)

Query: 73  RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEI 132
           ++   LR+  +K E     PRII+  D +   E + +K +A+PRL+ +TV + KTG+   
Sbjct: 65  KDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVK 124

Query: 133 ANYRISKSAWLR--EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
           ++ R S   +L   E  +P+I+ I +R+   + +     E +QV+ Y     Y+PH+D+ 
Sbjct: 125 SDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDYF 184

Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLSLWPEKG 237
               A+ F     G RVAT+L Y++D  +GG T F                 +S+ P KG
Sbjct: 185 ----ADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMKGISVKPTKG 240

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            A  + ++   G  D  + H  C VL+G
Sbjct: 241 DAVLFWSMGLDGQSDPRSIHGGCEVLSG 268


>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
 gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
          Length = 288

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 94/195 (48%), Gaps = 23/195 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  LY + +   E + +  +A+P + ++TV + KTG  + +  R S   +LR     V
Sbjct: 83  EPRAFLYHNFLSKEECEYLINLAKPHMMKSTVVDSKTGRSKDSRVRTSSGMFLRRGRDRV 142

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   + +     E LQV++Y +G  YE H+D+      + F +   G R AT+
Sbjct: 143 IREIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEAHFDYF----LDEFNTKNGGQRTATL 198

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF + N                   LSL P+ G A  + +       
Sbjct: 199 LMYLSDVEEGGETVFPAANMNISAVPWWNELSECAKQGLSLKPKMGNALLFWSTRPDATL 258

Query: 252 DYYTRHAACPVLTGS 266
           D  + H +CPV+ G+
Sbjct: 259 DPSSLHGSCPVIRGN 273


>gi|195341061|ref|XP_002037130.1| GM12749 [Drosophila sechellia]
 gi|194131246|gb|EDW53289.1| GM12749 [Drosophila sechellia]
          Length = 467

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 64/215 (29%), Positives = 95/215 (44%), Gaps = 34/215 (15%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           CRG   +P    + L+CRY     P+LRL P+K E+   +P + L  D +  +E + +  
Sbjct: 255 CRGKNLLPNK--SSLRCRYFRGGSPFLRLAPVKLEQLNFEPFVGLVHDAISQAEQEDLLH 312

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEE 171
           +   RL       +   E      ++  +A     +H  + RI +R+E +TG     +E 
Sbjct: 313 LTDSRLE------HTRKESSSVEAKVDTNA----SDH--VRRIHQRIEDITGFDMEESEP 360

Query: 172 LQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS 231
           L V NYGIGG    H D  +P                     +SDV  GG   F  L   
Sbjct: 361 LIVSNYGIGGQELIHLDCEQPK--------------------LSDVQMGGYASFPDLGFG 400

Query: 232 LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
             P +G+A  WHN  +SG+ D  +  A CPVL G+
Sbjct: 401 FKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGN 435


>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 253

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 98/193 (50%), Gaps = 23/193 (11%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRA-TVQNYKTGELEIANYRISKSAWLREPEHPV 150
           PR  LY+  + D E D + K+A+ +L ++  V +  +GE E +  R S   +L + +  +
Sbjct: 4   PRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDI 63

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  +  ++   T L     E LQ+++Y  G  Y+PH+D+    +A     LG G+R+ATV
Sbjct: 64  VANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKA---LELG-GHRIATV 119

Query: 211 LFYMSDVAQGGATVFTS-------LNLSLW-----------PEKGTAAFWHNLHSSGDGD 252
           L Y+S+V +GG TVF +       L    W           P KG A  + NLH +G  D
Sbjct: 120 LMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTD 179

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 180 PNSLHGSCPVIEG 192


>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
 gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
 gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
 gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
 gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
 gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
 gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
          Length = 307

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 58/201 (28%), Positives = 97/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P + ++TV +  TG+ + +  R S   +L+     V
Sbjct: 102 EPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKV 161

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y +G  YEPH+D+      + F +   G R+AT+
Sbjct: 162 IRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRIATL 217

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG T+F   N                   LS+ P+ G A  + ++      
Sbjct: 218 LMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDATL 277

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 278 DPLSLHGGCPVIKG-NKWSST 297


>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 294

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 101/217 (46%), Gaps = 25/217 (11%)

Query: 70  YVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGE 129
           +V  +     + P K ++   +PR  +Y   + D E + +  +A+  L+R+ V + ++G 
Sbjct: 18  FVRESSSSAIINPSKAKQISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVADNESGN 77

Query: 130 LEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF 189
            + +  R S   ++ + + P++  I  ++   T L     EE+QV+ Y  G  YEPHYD+
Sbjct: 78  SKTSEVRTSSGMFIPKAKDPIVSGIEEKIATWTFLPKENGEEIQVLRYEEGQKYEPHYDY 137

Query: 190 ARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLS------------------ 231
                 +       G+R+ATVL Y+++V +GG TVF     S                  
Sbjct: 138 F----VDKVNIARGGHRLATVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKK 193

Query: 232 ---LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
              + P KG A  +++LH +   D  + H  CPV+ G
Sbjct: 194 GIPVKPRKGDALLFYSLHPNATPDPLSLHGGCPVIQG 230


>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
 gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
          Length = 307

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 58/201 (28%), Positives = 97/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P + ++TV +  TG+ + +  R S   +L+     V
Sbjct: 102 EPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKV 161

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y +G  YEPH+D+      + F +   G R+AT+
Sbjct: 162 IRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRIATL 217

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG T+F   N                   LS+ P+ G A  + ++      
Sbjct: 218 LMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPGATL 277

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 278 DPLSLHGGCPVIKG-NKWSST 297


>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
 gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
          Length = 220

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 59/179 (32%), Positives = 100/179 (55%), Gaps = 15/179 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGEL-EIANYRISKSAWLREPE 147
           Y +P +++  +V+ DSE D + + ++ RL+R+     K GE   + + R S   +  + E
Sbjct: 38  YEEPLVVVLGNVLSDSECDELIEHSRERLQRS-----KIGEDGSVNSIRTSSGVFCEQTE 92

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF-ARPGEANAFKSLGTGNR 206
              I RI +R+  +  +     + LQV+ Y  G  Y+PHYDF A    A+      T NR
Sbjct: 93  --TITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS------TNNR 144

Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           ++T++ Y++DV QGG TVF  L+LS++P KG A ++   +S+ + + +T HA   V+ G
Sbjct: 145 ISTLVMYLNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYSNQELNDFTLHAGTQVIHG 203


>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
          Length = 313

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 60/187 (32%), Positives = 93/187 (49%), Gaps = 23/187 (12%)

Query: 96  LYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERIS 155
           ++ + + + E D I  +A+P L R+ V +  TG  EI++ R SK  +L       +  I 
Sbjct: 43  IFINFLTEEECDHIVALAKPHLERSGVVDTATGGSEISDIRTSKGMFLERGHDDTVAAIE 102

Query: 156 RRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMS 215
            R+   T L     E LQV+NY  G  Y+ ++     GE+N       GNR ATVL Y++
Sbjct: 103 ERIARWTLLPVGNGEGLQVLNYHPGEKYDDYFFDKVNGESNG------GNRYATVLMYLN 156

Query: 216 DVAQGGATVFTSL-----------------NLSLWPEKGTAAFWHNLHSSGDGDYYTRHA 258
            V +GG TVF ++                 +L+  P KG+A  +H++  SGD +  + H 
Sbjct: 157 TVEEGGETVFPNIPAPGGDNGPTFTECARRHLAAKPTKGSAVLFHSIKPSGDLERRSLHT 216

Query: 259 ACPVLTG 265
           ACPV+ G
Sbjct: 217 ACPVVKG 223


>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 299

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 98/203 (48%), Gaps = 25/203 (12%)

Query: 84  KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
           K ++   +PR  +Y   + D E D +  +A+  L+R+ V +   GE ++++ R S   ++
Sbjct: 37  KVKQVSAKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFI 96

Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
            + + P++  I  ++   T L     E+LQV+ Y  G  Y+ H+D+    + N  +    
Sbjct: 97  SKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEPGQKYDAHFDYFHD-KVNIARG--- 152

Query: 204 GNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGTAAFW 242
           G+R+ATVL Y+S+V +GG TVF                         +++ P+KG A  +
Sbjct: 153 GHRIATVLLYLSNVTKGGETVFPDAQEYSRRSLSENKDDLSDCAKKGIAVKPKKGNALLF 212

Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
            NL      D ++ H  CPV+ G
Sbjct: 213 FNLQQDAIPDPFSLHGGCPVIEG 235


>gi|313215430|emb|CBY42983.1| unnamed protein product [Oikopleura dioica]
          Length = 469

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 62/170 (36%), Positives = 90/170 (52%), Gaps = 10/170 (5%)

Query: 34  NNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPY--LRLMPLKEEEAYLQ 91
           N   P       ++YE LCR      P   + LKC Y     P   L+  P+K EE +  
Sbjct: 256 NLTRPEAHYESMQEYERLCR---EFSPPHKSSLKCFYWTGPSPLSPLQWAPVKTEELHDD 312

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE---- 147
           P ++ + +V+ D E   I+ +A   L RAT+Q+  TG+L  A+YRI K+AWL E E    
Sbjct: 313 PLVVQFYEVISDEEERAIQFLAGEHLNRATIQDPATGKLVNADYRIQKTAWLTEFEKFDV 372

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF-ARPGEAN 196
           +  I + + ++  +TGL    AE +QV NYG+ G YEPH+D  + PG  N
Sbjct: 373 NGTIAKYNEKLTKITGLDADYAELVQVGNYGVAGQYEPHWDHQSYPGAEN 422


>gi|193209070|ref|NP_001123049.1| Protein PHY-4, isoform b [Caenorhabditis elegans]
 gi|172051527|emb|CAQ35068.1| Protein PHY-4, isoform b [Caenorhabditis elegans]
          Length = 282

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 64/216 (29%), Positives = 105/216 (48%), Gaps = 7/216 (3%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C  +L    +   +L C  +H+++   ++  L  E   LQ    ++R +   +    +++
Sbjct: 36  CGKELRGDSSRDGRLVCYRLHKHLLIRKVEILSSEPFILQYHNQVHRRLAKRA----VQE 91

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVE-HMTGLTTSTAE 170
               RL +  +  + T   E +  R +   WL     P   RI   ++ ++  L  STAE
Sbjct: 92  AEALRLEQLKISGFTTTP-EKSQVRAANGTWLIHTGRPSFARIFEGLQANINSLDLSTAE 150

Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
             Q+++Y   G+Y PHYD+  P   N     G GNR+ATVL  +    +GG TVF  LNL
Sbjct: 151 PWQILSYNADGYYAPHYDYLNPA-TNVQLVEGRGNRIATVLVILQIAKKGGTTVFPRLNL 209

Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           ++ P+ G    W N  S+G+ +  T HAACP+  G+
Sbjct: 210 NIRPKAGDVIVWLNTLSTGESNSQTLHAACPIHEGT 245


>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
 gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
          Length = 296

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 55/168 (32%), Positives = 88/168 (52%), Gaps = 1/168 (0%)

Query: 99  DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
           DV    E + +  +A+PRL  +T  +  TG   +   R S   + R  E+  + R+  R+
Sbjct: 106 DVFSAEECEALIALARPRLAPSTSVDPLTGRNRLGAQRSSLGMFFRLRENAFVARLDERL 165

Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDV 217
             +  L     E LQV++Y  G    PH+DF  P  A    SL  +G RV+T++ Y+++V
Sbjct: 166 SELMNLPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLVAYLNEV 225

Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            +GG TVF     S+ P++G A ++   +S G  D+ + HA  PVL+G
Sbjct: 226 EEGGETVFPETGWSVSPQRGGAVYFEYCNSLGQVDHASLHAGAPVLSG 273


>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 326

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 56/198 (28%), Positives = 94/198 (47%), Gaps = 23/198 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  LY + +   E + +  +A+P + ++ V + +TG    +  R S  A+L+     +
Sbjct: 121 EPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSRERTSSGAFLKRGSDRI 180

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           ++ I RR+   T +     E   V++Y +G  YEPHYD+      + F +   G R+AT+
Sbjct: 181 VKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYF----MDTFSTTYAGQRIATM 236

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LS+ P+ G A  + ++      
Sbjct: 237 LMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAILFWSMKPDATL 296

Query: 252 DYYTRHAACPVLTGSNSL 269
           D  + H ACPV+ G   L
Sbjct: 297 DPSSLHGACPVIKGDKWL 314


>gi|193209068|ref|NP_001123048.1| Protein PHY-4, isoform a [Caenorhabditis elegans]
 gi|172051526|emb|CAQ35067.1| Protein PHY-4, isoform a [Caenorhabditis elegans]
          Length = 278

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/216 (29%), Positives = 105/216 (48%), Gaps = 7/216 (3%)

Query: 52  CRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKK 111
           C  +L    +   +L C  +H+++   ++  L  E   LQ    ++R +   +    +++
Sbjct: 36  CGKELRGDSSRDGRLVCYRLHKHLLIRKVEILSSEPFILQYHNQVHRRLAKRA----VQE 91

Query: 112 MAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVE-HMTGLTTSTAE 170
               RL +  +  + T   E +  R +   WL     P   RI   ++ ++  L  STAE
Sbjct: 92  AEALRLEQLKISGFTTTP-EKSQVRAANGTWLIHTGRPSFARIFEGLQANINSLDLSTAE 150

Query: 171 ELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNL 230
             Q+++Y   G+Y PHYD+  P   N     G GNR+ATVL  +    +GG TVF  LNL
Sbjct: 151 PWQILSYNADGYYAPHYDYLNPA-TNVQLVEGRGNRIATVLVILQIAKKGGTTVFPRLNL 209

Query: 231 SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           ++ P+ G    W N  S+G+ +  T HAACP+  G+
Sbjct: 210 NIRPKAGDVIVWLNTLSTGESNSQTLHAACPIHEGT 245


>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
 gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
          Length = 299

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 98/203 (48%), Gaps = 25/203 (12%)

Query: 84  KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
           K ++   +PR  +Y   + D E D +  +A+  L+R+ V +   GE ++++ R S   ++
Sbjct: 37  KVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFI 96

Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
            + + P++  I  ++   T L     E+LQV+ Y  G  Y+ H+D+    + N  +    
Sbjct: 97  SKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHD-KVNIARG--- 152

Query: 204 GNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGTAAFW 242
           G+R+ATVL Y+S+V +GG TVF                         +++ P+KG A  +
Sbjct: 153 GHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLF 212

Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
            NL      D ++ H  CPV+ G
Sbjct: 213 FNLQQDAIPDPFSLHGGCPVIEG 235


>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 266

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 61/203 (30%), Positives = 100/203 (49%), Gaps = 19/203 (9%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
           LRL  +K E     PRII++ + +   E D +K++A+PRL  +TV +  TG+   ++ R 
Sbjct: 53  LRLGYVKPEVISWTPRIIVFHNFLSSEECDFLKEIARPRLEISTVVDVATGKGVKSDVRT 112

Query: 138 SKSAWLREPEH--PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           S   ++   E   PVI+ I +R+   + +     E +QV+ Y    +Y PH+D+     +
Sbjct: 113 SSGMFVNSEERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEPSQYYRPHHDYF----S 168

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTAAFW 242
           + F     G RVAT+L Y++D  +GG T F                 L + P KG A  +
Sbjct: 169 DTFNLKRGGQRVATMLMYLTDGVEGGETHFPQAGDGECSCGGRIVRGLCVKPNKGDAVLF 228

Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
            ++   G+ D  + H+ C VL G
Sbjct: 229 WSMGLDGNTDSNSIHSGCAVLKG 251


>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
          Length = 302

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 55/207 (26%), Positives = 103/207 (49%), Gaps = 27/207 (13%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K ++   +PR  +Y+  + + E D +  +A+  L+R+ V +  +G+ ++++ R S   
Sbjct: 38  PSKVKQVSWKPRAFVYKGFLTELECDHLISLAKSELKRSAVADNLSGDSKLSDVRTSSGM 97

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + + P++  I  ++   T L     E++QV+ Y  G  Y+PHYDF     A+     
Sbjct: 98  FISKNKDPIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDFF----ADKVNIA 153

Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSL-----------------------NLSLWPEKGT 238
             G+RVATVL Y+++V +GG TVF +                         +++ P +G 
Sbjct: 154 RGGHRVATVLMYLTNVTRGGETVFPNAEVEEFPRHRGSETIDDLSECAKKGIAVKPRRGD 213

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + +L+ +   D  + HA CPV+ G
Sbjct: 214 ALLFFSLYPNAVPDTMSLHAGCPVIEG 240


>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
 gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
          Length = 286

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 56/186 (30%), Positives = 91/186 (48%), Gaps = 11/186 (5%)

Query: 87  EAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREP 146
           +A   PR++++  ++ D E + +  +A+PRL R+     KTG  E+   R S   + +  
Sbjct: 94  QAMYNPRVVVFGSLLSDQECEQLIGLAKPRLARSLTVATKTGGEEVNEDRTSSGMFFQRG 153

Query: 147 EHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGT 203
           E+ ++ RI  R+  +        E LQV++Y  G  Y+PHYD+   A PG     K    
Sbjct: 154 ENELVARIEARIARLVNWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILKR--G 211

Query: 204 GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACP 261
           G RV T++ Y+ +  +GG T F  ++L + P++G   F  +   H S      T H   P
Sbjct: 212 GQRVGTLVMYLGEPEKGGGTTFPDVHLEVAPKRGHGVFFSYERPHPS----TRTLHGGAP 267

Query: 262 VLTGSN 267
           VL G  
Sbjct: 268 VLAGEK 273


>gi|194871369|ref|XP_001972835.1| GG15736 [Drosophila erecta]
 gi|190654618|gb|EDV51861.1| GG15736 [Drosophila erecta]
          Length = 476

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 61/201 (30%), Positives = 101/201 (50%), Gaps = 27/201 (13%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
           L CRYV    P+L+L PLK EE  ++  I ++  V+   +ID +K +++P+L+R     +
Sbjct: 294 LVCRYVDW-TPFLKLAPLKMEELSMETHISIFYGVLRQKDIDELKNVSRPKLQRIE---H 349

Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
            +G        +S S+      H V+ +++  +  +TG  +   + L+V+NYGI G+Y P
Sbjct: 350 LSGNCSCKIGNLSSSS------HDVVRKVNELILDITGFPSKGNQMLEVINYGIAGNYNP 403

Query: 186 HYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL 245
             D ARP + N           A  L ++ +  +GG  VF S +L + P KG+   W NL
Sbjct: 404 D-DTARPRKQNK----------ANALIFLDNAERGGEIVFPSRHLKVRPRKGSMLVWMNL 452

Query: 246 HSSGDGDYYTRHAACPVLTGS 266
             S        +  CP+L G+
Sbjct: 453 ERS------VIYHQCPILKGN 467


>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 297

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 98/203 (48%), Gaps = 25/203 (12%)

Query: 84  KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
           K ++   +PR  +Y   + D E D +  +A+  L+R+ V +   GE ++++ R S   ++
Sbjct: 35  KVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFI 94

Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
            + + P++  I  ++   T L     E+LQV+ Y  G  Y+ H+D+    + N  +    
Sbjct: 95  SKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHD-KVNIARG--- 150

Query: 204 GNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGTAAFW 242
           G+R+ATVL Y+S+V +GG TVF                         +++ P+KG A  +
Sbjct: 151 GHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLF 210

Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
            NL      D ++ H  CPV+ G
Sbjct: 211 FNLQQDAIPDPFSLHGGCPVIEG 233


>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
 gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
          Length = 229

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 48/176 (27%), Positives = 89/176 (50%), Gaps = 10/176 (5%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P I+L  +V+ + E D +  +++ R+ R+ + N    +L     R S S +  + E+ V
Sbjct: 43  EPLIVLLGNVLSEEECDQLISLSKDRIERSKISNKSVHDL-----RTSSSMFFDDAENDV 97

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  + +RV  +  +     E +Q++NY IG  Y+ HYD+   G +          R++T+
Sbjct: 98  VSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAHYDYFSSGNSKV-----NNPRISTL 152

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           + Y++DV  GG T F  LN  + P+KG A ++   ++    +  T H   PV+ G 
Sbjct: 153 VMYLNDVEAGGETYFPKLNFYVAPKKGMAVYFEYFYNDTTLNELTLHGGAPVVIGD 208


>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 316

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/216 (29%), Positives = 104/216 (48%), Gaps = 23/216 (10%)

Query: 69  RYVHRNVPY-LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKT 127
           R    NVP  + + P    +   +PR  LY   +   E D +  MA+ +L ++ V + ++
Sbjct: 38  RLKSENVPSSVGVDPSHVTQLSWKPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVADNES 97

Query: 128 GELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHY 187
           G+   +  R S   +L++ +  V+  I  R+   T L     E +Q+++Y  G  YEPH+
Sbjct: 98  GKSIPSEVRTSSGMFLQKAQDDVVAAIEARIAAWTFLPIENGEAMQILHYERGQKYEPHF 157

Query: 188 DFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF----TSLNL------------- 230
           D+         + LG G+R+ATVL Y+S+V +GG TVF      L L             
Sbjct: 158 DYFHD---KVNQQLG-GHRIATVLMYLSNVEEGGETVFPNAEAKLQLANNESLSDCAKGG 213

Query: 231 -SLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            S+ P+KG A  + +LH     D  + H +CPV+ G
Sbjct: 214 YSVKPKKGDALLFFSLHPDASTDSLSLHGSCPVIEG 249


>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 296

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 58/178 (32%), Positives = 90/178 (50%), Gaps = 1/178 (0%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P  +   + +   E + +  +AQPRL R+ V +  TG   IA +R S   + R  E P+
Sbjct: 101 RPAAVHLANFLSADECEQLIALAQPRLDRSAVVDPVTGRDVIATHRSSHGMFFRLGETPL 160

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPG-EANAFKSLGTGNRVAT 209
           I RI  R+  +T       E LQ+++Y  G    PH D+   G EAN      +G R+ T
Sbjct: 161 IARIEARIAELTATPVENGEGLQMLHYEEGAESTPHVDYLMTGNEANRESIARSGQRMGT 220

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +L Y+ DV  GG TVF  +  S+ P++G A ++   +  G  D  + HA+ P+ TG  
Sbjct: 221 LLMYLKDVEGGGETVFPQVGWSIVPQRGHALYFEYGNRYGMCDPSSLHASTPLRTGDK 278


>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
 gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
          Length = 215

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 55/178 (30%), Positives = 99/178 (55%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           Y +P +++  +V+ DSE D + + ++ RL+R+ +   ++    + + R S   +  + E 
Sbjct: 33  YEEPLVVVLGNVLSDSECDELIEHSRERLQRSKIGEDRS----VNSIRTSSGVFCEQTE- 87

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF-ARPGEANAFKSLGTGNRV 207
             I RI +R+  +  +     + LQV+ Y  G  Y+PHYDF A    A+      T NR+
Sbjct: 88  -TITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS------TNNRI 140

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV QGG TVF  L+LS++P KG A ++   + + + + +T HA   V+ G
Sbjct: 141 STLVMYLNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYRNQEVNEFTLHAGAQVIHG 198


>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
           trichocarpa]
 gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
           trichocarpa]
          Length = 308

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 56/202 (27%), Positives = 98/202 (48%), Gaps = 22/202 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P +  +    PR  LY+  + D E D +  +A+ +L ++ V + ++G+   +  R S   
Sbjct: 43  PTRVTQLSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESEVRTSSGM 102

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + +  +++ I  R+   T L     E +Q+++Y  G  YEPH+D+       A + L
Sbjct: 103 FIGKSQDEIVDDIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHD---KANQEL 159

Query: 202 GTGNRVATVLFYMSDVAQGGATVF------------------TSLNLSLWPEKGTAAFWH 243
           G G+RV TVL Y+S+V +GG TVF                       ++ P+KG A  + 
Sbjct: 160 G-GHRVVTVLMYLSNVGKGGETVFPNSEGKTIQPKDDSWSDCAKNGYAVKPQKGDALLFF 218

Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
           +LH     D  + H +CPV+ G
Sbjct: 219 SLHPDATTDTNSLHGSCPVIEG 240


>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
 gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
          Length = 318

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 51/177 (28%), Positives = 87/177 (49%), Gaps = 3/177 (1%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR++++ +++   E + +   A PR+ R+     +TG  E+ + R S   + +  E P++
Sbjct: 131 PRVVVFGNLLSPEECEALIAAAAPRMARSLTVATQTGGEEVNDDRTSHGMFFQRGESPLV 190

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
           +RI  R+  +        E LQV++Y  G  Y+PHYD+  P E      +   G RV T+
Sbjct: 191 QRIEERIASLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTVIQRGGQRVGTL 250

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y++   QGG T F    + + P++G AAF+   +        T H   PVL G  
Sbjct: 251 VMYLNTPEQGGGTTFPDAQIEVAPQRGNAAFFS--YERPTPSTRTLHGGAPVLAGDK 305


>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
          Length = 299

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 98/203 (48%), Gaps = 25/203 (12%)

Query: 84  KEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWL 143
           K ++   +PR  +Y   + D E D +  +A+  L+R+ V +   GE ++++ R S   ++
Sbjct: 37  KVKQVSSKPRAFVYGGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFI 96

Query: 144 REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT 203
            + + P++  I  ++   T L     E+LQV+ Y  G  Y+ H+D+    + N  +    
Sbjct: 97  SKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHD-KVNIARG--- 152

Query: 204 GNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGTAAFW 242
           G+R+ATVL Y+S+V +GG TVF                         +++ P+KG A  +
Sbjct: 153 GHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLF 212

Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
            NL      D ++ H  CPV+ G
Sbjct: 213 FNLQQDAIPDPFSLHGGCPVIEG 235


>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
 gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
          Length = 287

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 100/203 (49%), Gaps = 19/203 (9%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
           LR+  +K E     PRII+  D +   E D ++ +A+PRLR +TV + KTG+   +  R 
Sbjct: 74  LRIGYVKPEIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVKTGKGIESKVRT 133

Query: 138 SKSAWLREPE--HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           S   +L   E  + V++ I +R+   + +     E +QV+ Y    +Y+PH+D+     +
Sbjct: 134 SSGMFLSSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQYYKPHHDYF----S 189

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLSLWPEKGTAAFW 242
           + F     G RVAT+L Y+SD  +GG T F                 LS+ P KG A  +
Sbjct: 190 DTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGSGKCSCGGKVVDGLSVKPIKGNAVLF 249

Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
            ++   G  D  + H  C VL+G
Sbjct: 250 WSMGLDGQSDPSSIHGGCEVLSG 272


>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
           sativus]
          Length = 313

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 56/202 (27%), Positives = 99/202 (49%), Gaps = 22/202 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P +  +   QPR  LY+  + D+E D +  +A+ +L ++ V +  +G+   +  R S   
Sbjct: 50  PTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGM 109

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           +LR+ +  V+  +  R+   T L     E +Q+++Y  G  YEPH+DF         + L
Sbjct: 110 FLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHD---KVNQEL 166

Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSLNL------------------SLWPEKGTAAFWH 243
           G G+R+ATVL Y+S+V +GG T+F +                     ++  +KG A  + 
Sbjct: 167 G-GHRIATVLMYLSNVEKGGETIFPNSEFKESQAKDESWSDCSRKGYAVKAQKGDALLFF 225

Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
           +L+     D  + H +CPV+ G
Sbjct: 226 SLNLDATTDERSLHGSCPVIAG 247


>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
 gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
          Length = 289

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 71/240 (29%), Positives = 105/240 (43%), Gaps = 26/240 (10%)

Query: 40  LEVTEREKYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMP---------LKEEEAYL 90
           LE+  R+  E   R  +  PPA V +          P+L   P         ++   A  
Sbjct: 51  LEIVLRDIVEAGTRQKVLPPPARVPE----------PFLDGAPATLWAHDREVRVVMAMR 100

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
            PR+I++  ++ D+E D I  +A  RL R+   +  TG  E+   R S   +    EHPV
Sbjct: 101 DPRVIVFSGLLSDAECDEIVALAGARLARSHTVDTATGASEVNAARTSDGMFFTRGEHPV 160

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-GTGNRVAT 209
             R   R+  +        E LQV++Y  G  Y+PHYD+  P +      L   G RVAT
Sbjct: 161 CARFEARIAALLNWPVENGEGLQVLHYRPGAEYKPHYDYFDPDQPGTPAVLRRGGQRVAT 220

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPVLTGSN 267
           ++ Y++   +GG T F  + L + P KG A F  +   H S      + H   PVL G  
Sbjct: 221 LVTYLNTPTRGGGTTFPDIGLEVTPLKGHAVFFSYDRPHPS----TRSLHGGAPVLEGDK 276


>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
 gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
          Length = 296

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 55/168 (32%), Positives = 87/168 (51%), Gaps = 1/168 (0%)

Query: 99  DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
           DV    E + +  +A+PRL  +T  +  +G   +   R S   + R  E+  I R+ +RV
Sbjct: 106 DVFDPQECEELIALARPRLAPSTTVDPLSGRDLVGEQRSSLGMFFRLRENAFIARLDQRV 165

Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDV 217
             +  L     E LQV+ Y  G    PH+DF  P  A    SL  +G RV+T++ Y+++V
Sbjct: 166 SELMNLPVENGEGLQVLCYPAGAQSMPHFDFLVPSNAANKASLARSGQRVSTLVSYLNEV 225

Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            +GG T+F     S+ P +G+A ++   +S G  D+ + HA  PVL G
Sbjct: 226 EEGGETIFPECGWSVPPRRGSAVYFEYCNSLGQVDHASLHAGGPVLHG 273


>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
 gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
          Length = 307

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 58/201 (28%), Positives = 97/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P + ++TV +  TG+ + +  R S   +L+     V
Sbjct: 102 EPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKV 161

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y +G  YEPH+D+      + F +   G R+AT+
Sbjct: 162 IRAIEKRIADYTFIPADHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRMATL 217

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG T+F   N                   LS+ P+ G A  + ++      
Sbjct: 218 LMYLSDVEEGGETIFPDANVNASSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDATL 277

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 278 DPLSLHGGCPVIRG-NKWSST 297


>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
 gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
 gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
          Length = 308

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 58/201 (28%), Positives = 97/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P + ++TV +  TG+ + +  R S   +L+     V
Sbjct: 103 EPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKV 162

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y +G  YEPH+D+      + F +   G R+AT+
Sbjct: 163 IRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYF----LDEFNTKNGGQRMATL 218

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG T+F   N                   LS+ P+ G A  + ++      
Sbjct: 219 LMYLSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDATL 278

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 279 DPLSLHGGCPVIRG-NKWSST 298


>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Glycine max]
          Length = 301

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 102/205 (49%), Gaps = 25/205 (12%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K ++   +PR  +Y   + + E D +  +A+  L+R+ V +  +GE +++  R S   
Sbjct: 37  PSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGM 96

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + + P++  +  ++   T L     E++QV+ Y  G  Y+PHYD+     A+     
Sbjct: 97  FIPKNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVNIA 152

Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGTAA 240
             G+RVATVL Y++DV +GG TVF             T  +LS        + P +G A 
Sbjct: 153 RGGHRVATVLMYLTDVTKGGETVFPNAEESPRHRGSETKEDLSECAQKGIAVKPRRGDAL 212

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
            + +L+ +   D  + HA CPV+ G
Sbjct: 213 LFFSLYPNAIPDTMSLHAGCPVIEG 237


>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 295

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 100/211 (47%), Gaps = 29/211 (13%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
           L   P +  +   QPR  LY+  + D+E D +  +A+ +L ++ V +  +G+   +  R 
Sbjct: 25  LIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRT 84

Query: 138 SKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANA 197
           S   +LR+ +  V+  +  R+   T L     E +Q+++Y  G  YEPH+DF        
Sbjct: 85  SSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHD---KV 141

Query: 198 FKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLW-----------------------P 234
            + LG G+R+ATVL Y+S+V +GG T+F   N  +W                        
Sbjct: 142 NQELG-GHRIATVLMYLSNVEKGGETIFP--NSEVWYGSESQAKDESWSDCSRKGYAVKA 198

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +KG A  + +L+     D  + H +CPV+ G
Sbjct: 199 QKGDALLFFSLNLDATTDERSLHGSCPVIAG 229


>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 290

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 55/195 (28%), Positives = 94/195 (48%), Gaps = 23/195 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + + + E + +  +A+P + ++ V + KTG+   +  R S   +L+     +
Sbjct: 86  EPRAFVYHNFLTNEECEHLISLAKPSMVKSKVVDVKTGKSIDSRVRTSSGTFLKRGHDEI 145

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +E I  R+   T +     E LQV++Y +G  YEPH+D+      + F     G R+ATV
Sbjct: 146 VEEIENRISDFTFIPIENGEGLQVLHYEVGQKYEPHHDYF----FDEFNVRKGGQRIATV 201

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LS+ P+K  A  + ++      
Sbjct: 202 LMYLSDVDEGGETVFPAAKGNISDVPWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASL 261

Query: 252 DYYTRHAACPVLTGS 266
           D  + H  CPV+ G+
Sbjct: 262 DPSSLHGGCPVIKGN 276


>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
          Length = 210

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 50/178 (28%), Positives = 91/178 (51%), Gaps = 11/178 (6%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P + +  +V+ D E D +  +++ R+ R+ +   +  ++     R S S +L E   
Sbjct: 30  FHEPFVAVLGNVLSDEECDELISLSKDRMNRSKIAGNQENDI-----RTSTSVFLPEDAS 84

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
            V++R+ +R+  +  +     E LQ++NY IG  Y+ H+DF  P      K L    R++
Sbjct: 85  EVVQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSP------KKLIENPRIS 138

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
           T++ Y++DV +GG T F +L LS+ P KG A ++   +     +  T H   PV  G 
Sbjct: 139 TLVLYLNDVEEGGDTYFPNLKLSVSPHKGMAVYFEYFYDDPMLNELTLHGGAPVTIGD 196


>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
 gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
          Length = 219

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 53/176 (30%), Positives = 95/176 (53%), Gaps = 13/176 (7%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ-NYKTGELEIANYRISKSAWLREPEHP 149
           +P I++  +V+ D E + + +M++ +++R+ +  + KT ++     R S  A+L E E  
Sbjct: 41  EPLIVVLANVLSDEECETLIEMSKNKMKRSKIGISRKTNDI-----RTSSGAFLEESE-- 93

Query: 150 VIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVAT 209
           +  RI RR+  +  +     E LQ++ Y +G  Y+ HYDF     A A     + NR++T
Sbjct: 94  ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFVENSAAA-----SNNRMST 148

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           ++ Y++ V +GG T F  LNLS+ P+KG A ++   +     +  T H   PV+ G
Sbjct: 149 LVMYLNHVEEGGETFFPKLNLSVSPKKGMAVYFEYFYQDESINKLTLHGGAPVIKG 204


>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
 gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
          Length = 219

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 53/176 (30%), Positives = 95/176 (53%), Gaps = 13/176 (7%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ-NYKTGELEIANYRISKSAWLREPEHP 149
           +P I++  +V+ D E + + +M++ +++R+ +  + KT ++     R S  A+L E E  
Sbjct: 41  EPLIVVLANVLSDEECETLIEMSKNKMKRSKIGVSRKTNDI-----RTSSGAFLEESE-- 93

Query: 150 VIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVAT 209
           +  RI RR+  +  +     E LQ++ Y +G  Y+ HYDF     A A     + NR++T
Sbjct: 94  ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFVENSAAA-----SNNRMST 148

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           ++ Y++ V +GG T F  LNLS+ P+KG A ++   +     +  T H   PV+ G
Sbjct: 149 LVMYLNHVEEGGETFFPKLNLSVSPKKGMAVYFEYFYQDESINKLTLHGGAPVIKG 204


>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
 gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
          Length = 295

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 57/207 (27%), Positives = 96/207 (46%), Gaps = 34/207 (16%)

Query: 76  PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
           P   + P    +   +PR+ LY+  + D E + +  +A+  L+R+ V +  +G+  ++  
Sbjct: 42  PAAVVYPHHSRQISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLS-- 99

Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
                      E P++E I  ++   T L     E++QV+ Y  G  YEPHYD+      
Sbjct: 100 -----------EDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYF----T 144

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF-----------------TSLNLSLWPEKGT 238
           +   ++  G+R ATVL Y++DV +GG TVF                     +++ P KG 
Sbjct: 145 DNVNTVRGGHRYATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGD 204

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + NL+  G  D  + H  CPV+ G
Sbjct: 205 ALLFFNLNPDGTTDSVSLHGGCPVIKG 231


>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii
          Length = 233

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 94/191 (49%), Gaps = 19/191 (9%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR  L ++ + D E D I + A+P+  +++V + ++G+   +  R S   W  + E  VI
Sbjct: 29  PRAFLLKNFLSDEECDYIVEKARPKXVKSSVVDNESGKSVDSEIRTSTGTWFAKGEDSVI 88

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVATV 210
            +I +RV  +T +     E LQV++Y  G  YEPHYD F  P   NA    G G RV T 
Sbjct: 89  SKIEKRVAQVTXIPLENHEGLQVLHYHDGQKYEPHYDYFHDP--VNAGPEHG-GQRVVTX 145

Query: 211 LFYMSDVAQGGATVFTSL---------------NLSLWPEKGTAAFWHNLHSSGDGDYYT 255
           L Y++ V +GG TV  +                 L++ P KG A  +++L   G  D  +
Sbjct: 146 LXYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALXFYSLKPDGSNDPAS 205

Query: 256 RHAACPVLTGS 266
            H +CP L G 
Sbjct: 206 LHGSCPTLKGD 216


>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
 gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
          Length = 303

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 58/193 (30%), Positives = 96/193 (49%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  L++  + D+E D +  +A+ +L ++ V + ++G+   +  R S   +L + +  V
Sbjct: 46  RPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSVQSEVRTSSGMFLEKKQDEV 105

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I  R+   T L     E +Q+++Y  G  YEPHYD+       A   LG G+R+ATV
Sbjct: 106 VRGIEERIAAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 161

Query: 211 LFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWHNLHSSGDGD 252
           L Y+S+V +GG T+F +    L       W           P KG A  + +LH     D
Sbjct: 162 LMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDATTD 221

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 222 SESLHGSCPVIEG 234


>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
 gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
          Length = 296

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 55/175 (31%), Positives = 88/175 (50%), Gaps = 3/175 (1%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR+++  +++   E D I + A+P+L R+      TG  E+   R S   +    + P +
Sbjct: 109 PRVVVLGNLLSAEECDAIIESAKPKLARSLTVQTATGGEELNADRTSSGMFFTRGQTPEV 168

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
             + RR+  + G      E LQV++Y  G  Y+PHYD+  P EA     L   G RVAT+
Sbjct: 169 TAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEAGTPTILKRGGQRVATL 228

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + Y+++ A+GG T F  + L + P KG+A F+   +        + H   PVL G
Sbjct: 229 VMYLNEPARGGGTTFPDVGLEVAPVKGSAVFFS--YDRPHPTTRSLHGGAPVLEG 281


>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 298

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 56/193 (29%), Positives = 98/193 (50%), Gaps = 22/193 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  L++  + + E D + ++A+ +L ++ V + ++G+   +  R S   +L + +  V
Sbjct: 41  RPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSEVRTSSGMFLEKRQDEV 100

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + RI  R+   T L +   E +Q+++Y  G  YEPHYD+       A   LG G+R+ATV
Sbjct: 101 VARIEERIAAWTFLPSENGESIQILHYKNGEKYEPHYDYFHDKNNQA---LG-GHRIATV 156

Query: 211 LFYMSDVAQGGATVFTSL------------------NLSLWPEKGTAAFWHNLHSSGDGD 252
           L Y+S+V +GG T+F +                     ++ P KG A  + +LH     D
Sbjct: 157 LMYLSNVEKGGETIFPNAEGKLTQHKDETASECAKNGYAVKPMKGDALLFFSLHPDATTD 216

Query: 253 YYTRHAACPVLTG 265
             + H +CPV+ G
Sbjct: 217 PDSLHGSCPVIEG 229


>gi|195352174|ref|XP_002042589.1| GM14934 [Drosophila sechellia]
 gi|194124473|gb|EDW46516.1| GM14934 [Drosophila sechellia]
          Length = 438

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 60/204 (29%), Positives = 103/204 (50%), Gaps = 33/204 (16%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
           L CRYV     +L+L PLK EE  ++P I ++   +   +I+++K  ++P+L+R     +
Sbjct: 256 LVCRYVDW-TQFLKLAPLKMEELSMKPHISIFYGFLGQKDIEVLKNASRPKLQRVK---H 311

Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
            +G        +S S+      H V+ +++  +  +TG  +   + L+V+NYGI G+Y P
Sbjct: 312 LSGNCSCKIGNLSSSS------HDVVRKVNELILDITGFPSKGNQMLEVINYGIAGNYNP 365

Query: 186 HYDFARP---GEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
             D A+P    +ANAF              ++ +  +GG  VF S +L + P KG+  FW
Sbjct: 366 E-DTAKPKIHNKANAF-------------IFLENAGKGGEIVFPSRHLKVRPRKGSMLFW 411

Query: 243 HNLHSSGDGDYYTRHAACPVLTGS 266
            NL +S        +  CP+L G+
Sbjct: 412 ENLKNS------VIYHQCPILKGN 429


>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
          Length = 310

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 57/201 (28%), Positives = 97/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E D +  +A+P + ++TV +  TG+ + +  R S   +L+     V
Sbjct: 105 EPRAFVYHNFLSKEECDYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKV 164

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y +G  YEPH+D+      + + +   G R+AT+
Sbjct: 165 IRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYF----LDEYNTKNGGQRMATL 220

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG T+F   N                   L++ P+ G A  + ++      
Sbjct: 221 LMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATL 280

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 281 DPLSLHGGCPVIKG-NKWSST 300


>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 278

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 52/194 (26%), Positives = 95/194 (48%), Gaps = 23/194 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  LY + +   E + +   A+P +++++V + +TG+ + ++ R S   +L      +
Sbjct: 73  EPRAFLYHNFLTKKECEHLINTAKPSMQKSSVVDNETGKSKDSSVRTSSGTFLDRGGDEI 132

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I +R+   T +     E   V+ Y +G  Y+PH D+     A+ + ++  G R+AT+
Sbjct: 133 VRNIEKRIADFTFIPVENGESFNVLRYEVGQKYDPHLDYF----ADDYNTVNGGQRIATM 188

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LS+ P+ G A  + ++   G  
Sbjct: 189 LMYLSDVEEGGETVFPAAKGNISSVPWWNELSDCGKKGLSIKPKMGDALLFWSMKPDGTL 248

Query: 252 DYYTRHAACPVLTG 265
           D  + H ACPV+ G
Sbjct: 249 DPSSLHGACPVIKG 262


>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
 gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 98/201 (48%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P ++++TV + +TG  + +  R S   +LR     +
Sbjct: 83  EPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETGRSKDSRVRTSSGMFLRRGRDKI 142

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y +G  Y+ HYD+      + F +   G R+AT+
Sbjct: 143 IRDIEKRIADFTFIPVEHGEGLQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRIATL 198

Query: 211 LFYMSDVAQGGATVF--TSLN-----------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF  T  N                 LS+ P+ G A  + ++      
Sbjct: 199 LMYLSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSVKPKMGDALLFWSMRPDATL 258

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 259 DPSSLHGGCPVIKG-NKWSST 278


>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
 gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
          Length = 287

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 57/181 (31%), Positives = 87/181 (48%), Gaps = 11/181 (6%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR++++   +   E D +  +AQPRL R+   +  TG  E+   R S+  +    E  +I
Sbjct: 100 PRVVVFGGFLSHDECDALVALAQPRLARSETVDNDTGGSEVNEARTSQGMFFMRGEGELI 159

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
            RI  R+  +        E +QV++Y  G  Y+PHYD+   A+PG     K    G RV 
Sbjct: 160 SRIEARIAALLDWPLENGEGVQVLHYRPGAEYKPHYDYFDPAQPGTPTILKR--GGQRVG 217

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAF--WHNLHSSGDGDYYTRHAACPVLTGS 266
           T++ Y++   +GG T F  +NL + P KG A F  +   H S      + H   PVL G 
Sbjct: 218 TLVMYLNTPERGGGTTFPDVNLEVAPIKGNAVFFSYERAHPS----TRSLHGGAPVLAGE 273

Query: 267 N 267
            
Sbjct: 274 K 274


>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
          Length = 321

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 65/213 (30%), Positives = 97/213 (45%), Gaps = 44/213 (20%)

Query: 91  QPRIILYRDVMYDSEID-LIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHP 149
           +PR  LY   + D+E D LI    Q ++ ++TV + ++GE   +  R S   +L + +  
Sbjct: 48  RPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESVTSKVRTSSGMFLDKKQDE 107

Query: 150 VIERISRRVEHMTGLTTS-----------------TAEELQVVNYGIGGHYEPHYDF--A 190
           V+ RI  R+   T L T                    E +Q++ YG G  YEPH+D+   
Sbjct: 108 VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPHFDYISG 167

Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSL-------W---------- 233
           R G      S   G+RVATVL Y+S+V  GG T+F      L       W          
Sbjct: 168 RQG------STREGDRVATVLMYLSNVKMGGETIFPDCEARLSQPKDETWSDCAEQGFAV 221

Query: 234 -PEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            P KG+A  + +LH +   D  + H +CPV+ G
Sbjct: 222 KPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEG 254


>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 298

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 54/207 (26%), Positives = 97/207 (46%), Gaps = 25/207 (12%)

Query: 80  LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
           + P K ++   +PR  +Y   + + E D +  +A+  L+R+ V +  +GE + +  R S 
Sbjct: 32  INPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSS 91

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
             ++ + + P++  I  ++   T L     E++QV+ Y  G  Y+ H+D+      +   
Sbjct: 92  GTFIPKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFH----DKVN 147

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGT 238
            +  G+R+ATVL Y+S+V +GG TVF                         +++ P KG 
Sbjct: 148 IVRGGHRIATVLMYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPRKGD 207

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + NLH     D  + H  CPV+ G
Sbjct: 208 ALLFFNLHPDAIPDPLSLHGGCPVIEG 234


>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 300

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 64/240 (26%), Positives = 111/240 (46%), Gaps = 27/240 (11%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
           K++ L    L +  + + +  C Y         + P K ++   +PR  +Y   + D E 
Sbjct: 3   KFDNLLFIFLILTSSFIRESTCSYA--GSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
           D +  +A+  L+R+ V +  +G+ +++  R S   ++ + + P++  I  ++   T L  
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF- 225
              E++QV+ Y  G  YE HYD+      +       G+R+ATVL Y+S+V QGG TVF 
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYF----VDKVNIAWGGHRLATVLMYLSNVTQGGETVFP 176

Query: 226 ------------TSLNLS--------LWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
                       T  +LS        + P+KG A  + +L  +   D  + H  CPVL G
Sbjct: 177 LAEKPSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEG 236


>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
          Length = 839

 Score = 94.0 bits (232), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 59/205 (28%), Positives = 102/205 (49%), Gaps = 25/205 (12%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K ++   +PR  +Y   + + E D +  +A+  L+R+ V +  +GE +++  R S   
Sbjct: 575 PSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGM 634

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + +  ++  I  ++   T L     E++QV+ Y  G  Y+PHYD+     A+     
Sbjct: 635 FIPKNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVNIA 690

Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGTAA 240
             G+RVATVL Y++DV +GG TVF             T+ NLS        + P +G A 
Sbjct: 691 RGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAVKPRRGDAL 750

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
            + +L+ +   D  + HA CPV+ G
Sbjct: 751 LFFSLYPNAIPDTLSLHAGCPVIEG 775


>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 290

 Score = 94.0 bits (232), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 58/201 (28%), Positives = 95/201 (47%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + + + E + +  +A+P + ++ V + KTG+   +  R S   +L      +
Sbjct: 86  EPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRVRTSSGTFLNRGHDEI 145

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +E I  R+   T +     E LQV++Y +G  YEPH+D+      + F     G R+ATV
Sbjct: 146 VEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYF----FDEFNVRKGGQRIATV 201

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LS+ P+K  A  + ++      
Sbjct: 202 LMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASL 261

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  CPV+ G N   ST
Sbjct: 262 DPSSLHGGCPVIKG-NKWSST 281


>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
          Length = 301

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 59/205 (28%), Positives = 102/205 (49%), Gaps = 25/205 (12%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K ++   +PR  +Y   + + E D +  +A+  L+R+ V +  +GE +++  R S   
Sbjct: 37  PSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGM 96

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + +  ++  I  ++   T L     E++QV+ Y  G  Y+PHYD+     A+     
Sbjct: 97  FIPKNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYF----ADKVNIA 152

Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGTAA 240
             G+RVATVL Y++DV +GG TVF             T+ NLS        + P +G A 
Sbjct: 153 RGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAVKPRRGDAL 212

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
            + +L+ +   D  + HA CPV+ G
Sbjct: 213 LFFSLYPNAIPDTLSLHAGCPVIEG 237


>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 263

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 60/203 (29%), Positives = 99/203 (48%), Gaps = 19/203 (9%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
           LRL  +K E     PRII++ + +   E D +  +A+PRL+ +TV +  TG+   ++ R 
Sbjct: 50  LRLGYVKPEVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRT 109

Query: 138 SKSAWLREPEH--PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           S   ++   E   PV++ I +R+   + +     E +QV+ Y    +Y PH+D+     +
Sbjct: 110 SSGMFVNSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYF----S 165

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTAAFW 242
           + F     G RVAT+L Y++D   GG T F                 L + P KG A  +
Sbjct: 166 DTFNLKRGGQRVATMLMYLTDGVVGGETHFPQAGDGECSCGGNVVKGLCVKPNKGDAVLF 225

Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
            ++   G+ D  + H+ CPVL G
Sbjct: 226 WSMGLDGNTDPNSIHSGCPVLKG 248


>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
 gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
          Length = 216

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 103/196 (52%), Gaps = 16/196 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+T+ + +    ++ + R S  A+L E E 
Sbjct: 36  FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR----DVNDIRTSSGAFLEENE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  +  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203

Query: 268 SLHSTCPCGLRRGLQR 283
            + +     +RRG  R
Sbjct: 204 WIATQW---VRRGTYR 216


>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 311

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 98/202 (48%), Gaps = 22/202 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P +  +    PR  LY+  +   E D +  +A+ +L ++ V + ++G+   +  R S   
Sbjct: 46  PTRVTQLSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESEVRTSSGM 105

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + +  ++  I  R+   T L     E +Q+++Y  G  YEPH+D+       A + L
Sbjct: 106 FIAKAQDEIVADIEARIAAWTFLPEENGESMQILHYEHGQKYEPHFDYFHD---KANQEL 162

Query: 202 GTGNRVATVLFYMSDVAQGGATVFTSLNLSL-------W-----------PEKGTAAFWH 243
           G G+RVATVL Y+S+V +GG TVF +    L       W           PEKG A  + 
Sbjct: 163 G-GHRVATVLMYLSNVEKGGETVFPNAEGKLSQPKEDSWSDCAKGGYAVKPEKGDALLFF 221

Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
           +LH     D  + H +CPV+ G
Sbjct: 222 SLHPDATTDSDSLHGSCPVIEG 243


>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
 gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
          Length = 216

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 103/196 (52%), Gaps = 16/196 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+T+ + +    ++ + R S  A+L E E 
Sbjct: 36  FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR----DVNDIRTSSGAFLEENE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  +  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203

Query: 268 SLHSTCPCGLRRGLQR 283
            + +     +RRG  R
Sbjct: 204 WIATQW---VRRGTYR 216


>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
          Length = 297

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 103/207 (49%), Gaps = 25/207 (12%)

Query: 80  LMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISK 139
           + P K ++   +PR  +Y   +   E D +  +A+  L+R+ V +   G+ +++  R S 
Sbjct: 31  INPSKVKQVSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVADNLPGDSKLSEVRTSS 90

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
             ++ + + P++  I  ++   T L     E++QV+ Y  G  Y+PHYD+      +   
Sbjct: 91  GMFISKKKDPIVAGIEDKISAWTFLPKENGEDMQVLRYEHGQKYDPHYDYF----TDKVN 146

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGT 238
            +  G+R+ATVL Y+++V +GG TVF             T+ +LS        + P +G 
Sbjct: 147 IVRGGHRMATVLLYLTNVTRGGETVFPVAEEPPRRRGLETNSDLSECAKKGIAVKPRRGD 206

Query: 239 AAFWHNLHSSGDGDYYTRHAACPVLTG 265
           A  + +LH++   D  + HA CPV+ G
Sbjct: 207 ALLFFSLHTTAIPDTDSLHAGCPVIEG 233


>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 313

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 101/211 (47%), Gaps = 25/211 (11%)

Query: 76  PYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY 135
           P   + P    +   +PR+ LY+  + D E + +  +A+  L+R+ V +  +G+  ++  
Sbjct: 43  PASVVYPHHSRQISWKPRVFLYQHFLSDDEANHLLSLARAELKRSAVADNTSGKSTLSEV 102

Query: 136 RISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           R S   ++ + + P++  I  ++   T L     E++QV+ Y  G   EP +DF      
Sbjct: 103 RTSYGTFISKGKDPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKDEPQFDFF----T 158

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF---------------TSLN------LSLWP 234
           +   ++  G+RVATVL Y++DVA+GG TVF               T+L+      +++ P
Sbjct: 159 DTVNTVRGGHRVATVLLYLTDVAEGGETVFPLAKDFTDTGLHDKDTTLSECAQKGIAVKP 218

Query: 235 EKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            KG A  + NL      D  + H  C V+ G
Sbjct: 219 RKGDALLFFNLRPDAATDPLSLHGGCTVIKG 249


>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
 gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
          Length = 269

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 59/193 (30%), Positives = 92/193 (47%), Gaps = 24/193 (12%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
             RI L+R  +   E D I+  A+ RL R+ V +  +G   +++ R S   +    E  +
Sbjct: 42  DARIYLWRGFLTPEECDYIRMKAEKRLERSGVVDTASGSSVVSDIRTSDGMFFERGEDAI 101

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPH--YDFARPGEANAFKSLGTGNRVA 208
           +E + +R+   T       E LQV+ Y     Y+ H  Y F + G AN       GNR A
Sbjct: 102 LEAVEQRLADWTMTPIWAGEALQVLRYRKDQKYDSHVNYFFHKEGSANG------GNRWA 155

Query: 209 TVLFYMSDVAQGGATVFTSL----------------NLSLWPEKGTAAFWHNLHSSGDGD 252
           TVL Y++D  +GG TVF  +                NL++ P KG A  +H++ ++G  +
Sbjct: 156 TVLTYLTDTEEGGETVFPKIPAPGGVNVGFSECAKYNLAVKPRKGDAILFHSMKTNGQLE 215

Query: 253 YYTRHAACPVLTG 265
             + H ACPV+ G
Sbjct: 216 ERSLHGACPVIKG 228


>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 52/196 (26%), Positives = 96/196 (48%), Gaps = 23/196 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P ++++TV + +TG+ + +  R S   +L       
Sbjct: 82  EPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSRVRTSSGTFLPRGRDKT 141

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I +R+   + +     E LQV++Y +G  YEPH+D+      + + +   G R+ATV
Sbjct: 142 VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYF----LDEYNTKNGGQRIATV 197

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LS+ P++G A  + ++      
Sbjct: 198 LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASL 257

Query: 252 DYYTRHAACPVLTGSN 267
           D  + H  CPV+ G+ 
Sbjct: 258 DPSSLHGGCPVIKGNK 273


>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 52/196 (26%), Positives = 96/196 (48%), Gaps = 23/196 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P ++++TV + +TG+ + +  R S   +L       
Sbjct: 82  EPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSRVRTSSGTFLPRGRDKT 141

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I +R+   + +     E LQV++Y +G  YEPH+D+      + + +   G R+ATV
Sbjct: 142 VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYF----LDEYNTKNGGQRIATV 197

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LS+ P++G A  + ++      
Sbjct: 198 LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMKPDASL 257

Query: 252 DYYTRHAACPVLTGSN 267
           D  + H  CPV+ G+ 
Sbjct: 258 DPSSLHGGCPVIKGNK 273


>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Ectocarpus siliculosus]
          Length = 404

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 100/194 (51%), Gaps = 22/194 (11%)

Query: 90  LQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQ--NYKTGELEIANYRISKSAWLREPE 147
           ++P +   R+ + D E   I++ A P ++ + V   ++  G+ +  N+R S + ++    
Sbjct: 198 MEPLVFEARNFLLDEECKHIREKADPHMKPSPVSLMDHDKGKPD-TNWRTSTTYFMPSTR 256

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTG--N 205
            P+++ I RRVE  T +  S  E++QV+ Y  G  Y  H+DF    +    +++  G  N
Sbjct: 257 DPLLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQRYTAHHDFL---DERTMRNMDGGRKN 313

Query: 206 RVATVLFYMSDVAQGGATVF--------------TSLNLSLWPEKGTAAFWHNLHSSGDG 251
           R+ TV +Y+SDV +GG T+F               +  L + P +G  A +++L   G  
Sbjct: 314 RMITVFWYLSDVEEGGETIFPRYGGRTGRVDFSDCTTGLKVKPVEGKVAMFYSLKPDGQF 373

Query: 252 DYYTRHAACPVLTG 265
           D ++ H ACPV+TG
Sbjct: 374 DDFSLHGACPVITG 387


>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
 gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
          Length = 275

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/194 (27%), Positives = 92/194 (47%), Gaps = 23/194 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  LY + +   E + +  +A+P + ++ V + KTG+   ++ R S   +L      +
Sbjct: 72  EPRAFLYHNFLTKEECEHLINIAKPSMHKSEVIDEKTGKSLNSSIRTSSGTFLDREGDEI 131

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I +R+   T +     E   V++Y +G  YEPHYD+      + F +   G R+AT+
Sbjct: 132 VSNIEKRIADFTFIPVEHGESFNVLHYEVGQKYEPHYDYF----LDTFSTRHAGQRIATM 187

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LS+ P+ G A  + ++      
Sbjct: 188 LMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAILFWSMKPDATL 247

Query: 252 DYYTRHAACPVLTG 265
           D  + H ACPV+ G
Sbjct: 248 DPSSLHGACPVIKG 261


>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
 gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 298

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 53/205 (25%), Positives = 96/205 (46%), Gaps = 25/205 (12%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K ++   +PR  +Y   + + E D +  +A+  L+R+ V +  +GE + +  R S   
Sbjct: 34  PSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGT 93

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + + P++  I  ++   T L     E++QV+ Y  G  Y+ H+D+      +    +
Sbjct: 94  FISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFH----DKVNIV 149

Query: 202 GTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGTAA 240
             G+R+AT+L Y+S+V +GG TVF                         +++ P KG A 
Sbjct: 150 RGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDAL 209

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
            + NLH     D  + H  CPV+ G
Sbjct: 210 LFFNLHPDAIPDPLSLHGGCPVIEG 234


>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 287

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/195 (27%), Positives = 93/195 (47%), Gaps = 23/195 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P ++++TV + +TG  + +  R S   +L       
Sbjct: 82  EPRAFVYHNFLTKEECEYLINLAKPNMQKSTVVDSETGRSKDSRVRTSSGTFLSRGRDKK 141

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   + +     E LQV++Y +G  YEPH+D+      + F +   G RVAT+
Sbjct: 142 IRDIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFN----DEFNTKNGGQRVATL 197

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LS+ P  G A  + ++      
Sbjct: 198 LMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSVKPNMGDALLFWSMKPDATL 257

Query: 252 DYYTRHAACPVLTGS 266
           D  + H  CPV+ G+
Sbjct: 258 DPSSLHGGCPVINGN 272


>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
 gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
          Length = 232

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/193 (29%), Positives = 102/193 (52%), Gaps = 16/193 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R SK A+L + E 
Sbjct: 52  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 106

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            + E+I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 159

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G  
Sbjct: 160 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 219

Query: 268 SLHSTCPCGLRRG 280
            + +     +RRG
Sbjct: 220 WIATQW---VRRG 229


>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
 gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
          Length = 298

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 53/205 (25%), Positives = 96/205 (46%), Gaps = 25/205 (12%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P K ++   +PR  +Y   + + E D +  +A+  L+R+ V +  +GE + +  R S   
Sbjct: 34  PSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGT 93

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + + P++  I  ++   T L     E++QV+ Y  G  Y+ H+D+      +    +
Sbjct: 94  FISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFH----DKVNIV 149

Query: 202 GTGNRVATVLFYMSDVAQGGATVF---------------------TSLNLSLWPEKGTAA 240
             G+R+AT+L Y+S+V +GG TVF                         +++ P KG A 
Sbjct: 150 RGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENEEDLSDCAKRGIAVKPRKGDAL 209

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
            + NLH     D  + H  CPV+ G
Sbjct: 210 LFFNLHPDAIPDPLSLHGGCPVIEG 234


>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
 gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
          Length = 216

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/193 (29%), Positives = 102/193 (52%), Gaps = 16/193 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R SK A+L + E 
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            + E+I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203

Query: 268 SLHSTCPCGLRRG 280
            + +     +RRG
Sbjct: 204 WIATQW---VRRG 213


>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
 gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
          Length = 215

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/178 (31%), Positives = 98/178 (55%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           Y +P I++  +V+ + E D + + ++ RL+R+     K GE    N +I  S+ +   E+
Sbjct: 33  YEEPLIVILGNVLSNEECDELIEHSKERLQRS-----KIGEERSVN-QIRTSSGVFCEEN 86

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF-ARPGEANAFKSLGTGNRV 207
             + +I +R+  +  +     + LQV+ Y  G  Y+PH+DF A    A+A       NR+
Sbjct: 87  ETVAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFADTSRASA------NNRI 140

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS++P KG A ++   +S+ + +  T HA  PV  G
Sbjct: 141 STLVMYLNDVEEGGETTFPMLNLSVFPSKGMAVYFEYFYSNHELNERTLHAGAPVRKG 198


>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
 gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
          Length = 232

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 101/192 (52%), Gaps = 14/192 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R SK A+L + E 
Sbjct: 52  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 106

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
            + E+I +R+  +  +  S  E L ++NY +   Y+ HYD+      +A       NR++
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-----ANNRIS 160

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNS 268
           T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G   
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220

Query: 269 LHSTCPCGLRRG 280
           + +     +RRG
Sbjct: 221 IATQW---VRRG 229


>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
 gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
          Length = 216

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ ++ R+ V + +    ++ + R S  A+L E E 
Sbjct: 36  FEEPLIVVLANVLSDEECDELIELSKSKMERSKVGSSR----DVNDIRTSSGAFLEENE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +T +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201


>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
 gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
          Length = 253

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/198 (29%), Positives = 98/198 (49%), Gaps = 25/198 (12%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR     + M   E D I ++A+PR+RR+TV +  TG+ ++   R S+  +L      ++
Sbjct: 5   PRAFHLHNFMSHEECDRILEIARPRVRRSTVIDSVTGQSKVDPIRTSEQTFLNRGTWDIV 64

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGT--GNRVAT 209
            ++  R+  +T L     E++Q++ YG+G  Y+ H+D      A+  K L    G+RVAT
Sbjct: 65  TKVEERLAVVTQLPAYHGEDMQILKYGLGQKYDAHHDVGELTSASG-KQLAAEGGHRVAT 123

Query: 210 VLFYMSDVAQGGATVF----------------------TSLNLSLWPEKGTAAFWHNLHS 247
           VL Y+SDV +GG T F                         N+++ P KG    + ++++
Sbjct: 124 VLLYLSDVEEGGETAFPDSEWMTPELRKWAEGQKWSDCAEGNVAVKPRKGDGLLFWSVNN 183

Query: 248 SGDGDYYTRHAACPVLTG 265
               D ++ HA CPV+ G
Sbjct: 184 ENAIDPHSMHAGCPVIRG 201


>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
           Group]
          Length = 343

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 55/199 (27%), Positives = 95/199 (47%), Gaps = 26/199 (13%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  LY + +   E + +  +A+P ++++TV +  TG  + +  R S   +L   +  +
Sbjct: 116 EPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTSSGMFLGRGQDKI 175

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y +G  YEPH+D+      + F +   G R+AT+
Sbjct: 176 IRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFH----DEFNTKNGGQRIATL 231

Query: 211 LFYMSDVAQGGATVF-------------------TSLNLSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG T+F                       L++ P+ G A  + ++   G  
Sbjct: 232 LMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWSMRPDGSL 291

Query: 252 DYYTRHAACPV---LTGSN 267
           D  + H   P+   LT SN
Sbjct: 292 DATSLHGEIPILWLLTNSN 310


>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 281

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 98/203 (48%), Gaps = 19/203 (9%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
           LRL  +K E     PRIIL  + +   E D ++ +A PRL+ +TV +  TG+   ++ R 
Sbjct: 68  LRLGYVKPEVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDVRT 127

Query: 138 SKSAWL--REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           S   +L   E ++P+I  I +R+   + +     E +QV+ Y    +Y PH+D+     +
Sbjct: 128 SSGMFLSHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYF----S 183

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTAAFW 242
           + F     G R+AT+L Y+ D  +GG T F S               L + P KG A  +
Sbjct: 184 DTFNLKRGGQRIATMLMYLGDNVEGGETHFPSAGSDECSCGGKLTKGLCVKPVKGNAVLF 243

Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
            ++   G  D  + H  CPVL G
Sbjct: 244 WSMGLDGQSDPDSVHGGCPVLAG 266


>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
 gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
          Length = 216

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 102/193 (52%), Gaps = 16/193 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +++R+ V + +    ++ + R S  A+L E E 
Sbjct: 36  FEEPLIVVLANVLSDEECDELIELSKSKMKRSKVGSSR----DVNDIRTSSGAFLEENE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +T +  +  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203

Query: 268 SLHSTCPCGLRRG 280
            + +     +RRG
Sbjct: 204 WIATQW---VRRG 213


>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
 gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
          Length = 216

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 102/193 (52%), Gaps = 16/193 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+T+ + +    ++ + R S  A+L E E 
Sbjct: 36  FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR----DVNDIRTSSGAFLEENE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  +  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203

Query: 268 SLHSTCPCGLRRG 280
            + +     +RRG
Sbjct: 204 WIATQW---VRRG 213


>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
 gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 244

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 61/191 (31%), Positives = 94/191 (49%), Gaps = 25/191 (13%)

Query: 93  RIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIE 152
           RI L    + D E D I ++++ RL R+ V     G  E +  R S   +L   E PV++
Sbjct: 1   RIFLIEHFLTDEEADHIVQVSERRLERSGVVATNGGSEE-SQIRTSFGVFLERGEDPVVK 59

Query: 153 RISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEANAFKSLGTGNRVATV 210
            +  R+  +T +     E LQV+ Y     Y+ H+D  F + G AN       GNR ATV
Sbjct: 60  GVEERISALTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGIANG------GNRYATV 113

Query: 211 LFYMSDVAQGGATVFTSL----------------NLSLWPEKGTAAFWHNLHSSGDGDYY 254
           L Y+ D  +GG TVF ++                +L+  P+KGTA  +H++  +G+ +  
Sbjct: 114 LMYLVDTEEGGETVFPNIAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERK 173

Query: 255 TRHAACPVLTG 265
           + H ACPV+ G
Sbjct: 174 SLHTACPVIKG 184


>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 289

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 55/195 (28%), Positives = 93/195 (47%), Gaps = 23/195 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P + ++TV + KTG  + +  R S   +LR     +
Sbjct: 84  EPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSKTGRSKDSRVRTSSGMFLRRGRDKI 143

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   + +     E LQV++Y +G  YE HYD+      + F +   G R AT+
Sbjct: 144 IRNIEKRIADFSFIPIEHGEGLQVLHYEVGQKYEAHYDYF----LDEFNTKNGGQRTATL 199

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LS+ P+ G A  + +       
Sbjct: 200 LMYLSDVEEGGETVFPAAKANISNVPSWNELSECARQGLSVKPKMGNALLFWSTRPDATL 259

Query: 252 DYYTRHAACPVLTGS 266
           D  + H +CPV+ G+
Sbjct: 260 DPASLHGSCPVIRGN 274


>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 291

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/190 (28%), Positives = 95/190 (50%), Gaps = 18/190 (9%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E + +  +A+P ++R+ V +  TG+  + + R S   +L   +  +
Sbjct: 91  EPRASMYHNFLSKEECEHLINLAKPFMQRSLVVDGVTGQGILNSVRTSSGTFLERGKDKI 150

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           ++ + RR+  +T +     E LQ+++Y +G  +EPHYD+      N   +   G RVATV
Sbjct: 151 VQNVERRIADITSIPIENGEGLQIIHYEVGQKFEPHYDY----NFNWRITNNGGPRVATV 206

Query: 211 LFYMSDVAQGGATVFTSLN--------------LSLWPEKGTAAFWHNLHSSGDGDYYTR 256
           L Y+SDV +GG TVF +                L + P+ G A  + ++   G  D  + 
Sbjct: 207 LMYLSDVEEGGETVFPNAKPNFNSVSKYHPGKGLVVKPKMGDALLFWSVKPDGSLDTASL 266

Query: 257 HAACPVLTGS 266
           H   PV+ GS
Sbjct: 267 HGGSPVIRGS 276


>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
 gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
          Length = 216

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 102/193 (52%), Gaps = 16/193 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +++R+ V + +    ++ + R S  A+L E E 
Sbjct: 36  FEEPLIVVLANVLSDEECDELIELSKNKMKRSKVGSSR----DVNDIRTSSGAFLEENE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +T +  +  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203

Query: 268 SLHSTCPCGLRRG 280
            + +     +RRG
Sbjct: 204 WIATQW---VRRG 213


>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
          Length = 303

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 96/205 (46%), Gaps = 34/205 (16%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANY-----------RISK 139
           +PR ILY + +   E + +  +A+P + ++TV +  TG+ + + +           R S 
Sbjct: 87  EPRAILYHNFLNKEECEYLINLAKPHMAKSTVVDSATGKSKDSRFVHRWKSNDSRVRTSS 146

Query: 140 SAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFK 199
             +L   +   I  I +R+   T +     E LQV++Y +G  YEPH+D+      + F 
Sbjct: 147 GMFLNRGQDKTIRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYEPHFDYF----LDEFN 202

Query: 200 SLGTGNRVATVLFYMSDVAQGGATVF-----TSLNLSLW--------------PEKGTAA 240
           +   G R+ATVL Y+SDV +GG TVF      S ++  W              P  G A 
Sbjct: 203 TKNGGQRIATVLMYLSDVEKGGETVFPASKVNSSSVPWWDELSECAKAGISVRPRMGDAL 262

Query: 241 FWHNLHSSGDGDYYTRHAACPVLTG 265
            + ++    + D  + HA CPV+ G
Sbjct: 263 LFWSMRPDAELDPSSLHAGCPVIQG 287


>gi|198466397|ref|XP_002135180.1| GA23908 [Drosophila pseudoobscura pseudoobscura]
 gi|198150581|gb|EDY73807.1| GA23908 [Drosophila pseudoobscura pseudoobscura]
          Length = 403

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 93/199 (46%), Gaps = 27/199 (13%)

Query: 68  CRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKT 127
           C Y      +LRL PLK E   L P I +Y DV+Y+ EI  +  +A   L+    +  K 
Sbjct: 215 CHYESTRTAFLRLAPLKVEMLSLDPYIAIYHDVIYEREIARVMTLALSSLK-GPGRYSKR 273

Query: 128 GELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHY 187
            E  I      KS  + E E+    ++++R   MTG      ++ ++ N GIGG+   H 
Sbjct: 274 REHNI------KSVTVYEEENS---QLNQRTRDMTGEQVKEDKDFRIYNSGIGGYIRYHM 324

Query: 188 DFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHS 247
           D     E                   +++V  GGA  F  L  ++WP KG+A  WHNL++
Sbjct: 325 DNLAKEEQQ-----------------LNEVPHGGAISFPQLEFTVWPRKGSALVWHNLNN 367

Query: 248 SGDGDYYTRHAACPVLTGS 266
           + + DY   H +CPV+ GS
Sbjct: 368 NLELDYRVAHISCPVIVGS 386


>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
 gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
          Length = 283

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 50/156 (32%), Positives = 85/156 (54%), Gaps = 5/156 (3%)

Query: 90  LQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHP 149
           L PR+I++ +++   E D +  +A+ +++R+ V +  TG+ +    R S+  +     +P
Sbjct: 94  LHPRVIVFGNLLAAEECDALIALARRQIKRSPVFDPDTGQDQQHQARTSEGMFFGRGANP 153

Query: 150 VIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNR 206
           +  R+  R+  +        E LQV+ YG G  YEPHYD+   ARPG   A +    G R
Sbjct: 154 LCARVEARIAALLNWPLENGEGLQVLRYGPGAQYEPHYDYFDPARPGAEVALRR--GGQR 211

Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
           VA+++ Y++   QGGAT F   +L + P KG A ++
Sbjct: 212 VASLVIYLNTPTQGGATTFPDAHLEVAPIKGNAVYF 247


>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 266

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/203 (29%), Positives = 100/203 (49%), Gaps = 19/203 (9%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
           LRL  +K E     PRII++ + +   E D ++++A+PRL  +TV +  TG+   ++ R 
Sbjct: 53  LRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSDVRT 112

Query: 138 SKSAWLREPEH--PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           S   ++   E   PVI+ I +R+   + +     E +QV+ Y    +Y PH+D+     +
Sbjct: 113 SSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYF----S 168

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTAAFW 242
           + F     G RVAT+L Y++D  +GG T F                 L + P KG A  +
Sbjct: 169 DTFNLKRGGQRVATMLMYLTDGVEGGETHFPQAGDGECICGGRLVRGLCVKPNKGDAVLF 228

Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
            ++   G+ D  + H+ C V+ G
Sbjct: 229 WSMGLDGNTDSNSLHSGCAVVKG 251


>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
          Length = 287

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 60/208 (28%), Positives = 101/208 (48%), Gaps = 19/208 (9%)

Query: 73  RNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEI 132
           ++   LR+  +K E     PRIIL    +   E D ++ MA+P L+ +TV + +TG+   
Sbjct: 69  KDADILRIGYVKPEILNWSPRIILLHSFLSSEECDYLRAMAEPLLQISTVVDAQTGKGIQ 128

Query: 133 ANYRISKSAWLR--EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFA 190
           ++ R S   +L   +  +P++  I +R+   + +     E +QV+ Y     Y+PH+D+ 
Sbjct: 129 SDVRTSSGMFLSPDDSTYPIVRAIEKRISVYSQVPVENGELIQVLRYKKSQFYKPHHDYF 188

Query: 191 RPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLSLWPEKG 237
               +++F     G RVAT+L Y+SD  +GG T F             +   LS+ P KG
Sbjct: 189 ----SDSFNLKRGGQRVATMLIYLSDNVEGGETYFPMAGSGFCRCGGKSVRGLSVAPVKG 244

Query: 238 TAAFWHNLHSSGDGDYYTRHAACPVLTG 265
            A  + ++   G  D  + H  C VL G
Sbjct: 245 NAVLFWSMGLDGQSDPNSIHGGCEVLAG 272


>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 294

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/190 (32%), Positives = 90/190 (47%), Gaps = 18/190 (9%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           P   +YR  + ++E + I+++A   L+ +TV +  TG    +  R S   +L   E  VI
Sbjct: 33  PHAEVYRGFLTEAECEHIERLATAELKPSTVVDASTGGDASSEIRTSSGMFLGRAEDDVI 92

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVL 211
           E I  R+   T +  S  E  QV+ Y     Y  HYD+    + N  +  G G R+ TVL
Sbjct: 93  EAIEARIAAWTHVPESHGEGFQVLRYEKHQEYRAHYDYFHD-KFNVKREKG-GQRMGTVL 150

Query: 212 FYMSDVAQGGATVFTSL----------------NLSLWPEKGTAAFWHNLHSSGDGDYYT 255
            Y+SDV +GG TVF                    L++ P KG A F+ +L   G  D ++
Sbjct: 151 MYLSDVEEGGETVFPKFEDGTPAGSEASECARNKLAVRPRKGDALFFRSLRHDGVPDTFS 210

Query: 256 RHAACPVLTG 265
            HA CPV+ G
Sbjct: 211 EHAGCPVIRG 220


>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
          Length = 290

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/203 (30%), Positives = 99/203 (48%), Gaps = 19/203 (9%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
           LRL  +K E     PRII+  + +   E D +K +A  RL  +TV + KTG+   +++R 
Sbjct: 75  LRLGYVKPEVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRT 134

Query: 138 SKSAWL--REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           S   +L   E   P+++ I +R+   + +     E +QV+ Y     Y+PH+D+     +
Sbjct: 135 SSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYF----S 190

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLSLWPEKGTAAFW 242
           + F     G R+AT+L Y+S+  +GG T F             T   LS+ P KG A  +
Sbjct: 191 DTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKGDAVLF 250

Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
            ++   G  D  + H  C VL+G
Sbjct: 251 WSMGLDGQSDPKSIHGGCEVLSG 273


>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
 gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
          Length = 248

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+ V + +    ++ + R S  A+L + E 
Sbjct: 68  FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 122

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233


>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 273

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 57/191 (29%), Positives = 93/191 (48%), Gaps = 24/191 (12%)

Query: 93  RIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIE 152
           RI L++  +   E D I+  A+ RL R+ V +  +G   +++ R S   +    E  +IE
Sbjct: 44  RIYLWKGFLTPEECDYIRMKAEKRLERSGVVDTGSGGSVVSDIRTSDGMFFERGEDAIIE 103

Query: 153 RISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEANAFKSLGTGNRVATV 210
            + +R+   T       E LQV+ Y     Y+ H+D  F + G +N       GNR ATV
Sbjct: 104 AVEQRLADWTMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNG------GNRWATV 157

Query: 211 LFYMSDVAQGGATVFTSL----------------NLSLWPEKGTAAFWHNLHSSGDGDYY 254
           L Y+++  +GG TVF  +                NL++ P KG A  +H++  +G+ +  
Sbjct: 158 LLYLTETEEGGETVFPKIPAPNGINVGFSECAKYNLAVKPHKGDALLFHSMKPTGELEER 217

Query: 255 TRHAACPVLTG 265
           + H ACPV+ G
Sbjct: 218 SMHGACPVIRG 228


>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
 gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
          Length = 248

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+ V + +    ++ + R S  A+L + E 
Sbjct: 68  FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 122

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233


>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
 gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
          Length = 248

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+ V + +    ++ + R S  A+L + E 
Sbjct: 68  FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 122

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233


>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
 gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
          Length = 433

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 60/197 (30%), Positives = 94/197 (47%), Gaps = 31/197 (15%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLRE----PE 147
           PR  ++   + + E DL+ + A+P + ++ V +   G    +N R S  +++        
Sbjct: 166 PRAFMHIGFLSERECDLLVEYARPNMYKSGVVDASNGGSSFSNIRTSTGSFVPTVFPLGM 225

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEANAFKSLGTGN 205
           + V+ RI RR+   T +  +  E +QV+ Y IG  Y+ H+D  F   G  N        N
Sbjct: 226 NDVVRRIERRIAAWTQIPAAHGEPIQVLRYQIGQEYQSHFDYFFHEGGMKN--------N 277

Query: 206 RVATVLFYMSDVAQGGATVFTSL-----------------NLSLWPEKGTAAFWHNLHSS 248
           R+ATVL Y+SDV  GG TVF S                   +++ P+KG A  + N+   
Sbjct: 278 RIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHACAKNGITVIPKKGDAILFWNMKVG 337

Query: 249 GDGDYYTRHAACPVLTG 265
           GD D  + HA CPV+ G
Sbjct: 338 GDLDGGSTHAGCPVVLG 354


>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
 gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
          Length = 216

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 101/193 (52%), Gaps = 16/193 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ ++ R+ V + +    ++ + R S  A+L E E 
Sbjct: 36  FEEPLIVVLANVLSDEECDELIELSKSKMERSKVGSSR----DVNDIRTSSGAFLEENE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +T +  +  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203

Query: 268 SLHSTCPCGLRRG 280
            + +     +RRG
Sbjct: 204 WIATQW---VRRG 213


>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
 gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
          Length = 216

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 101/196 (51%), Gaps = 16/196 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R S  A+L + E 
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSSGAFLDDNE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   H     +  T H   PV  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFHQDQSLNELTLHGGAPVTKGEK 203

Query: 268 SLHSTCPCGLRRGLQR 283
            + +     +RRG  R
Sbjct: 204 WIATQW---VRRGTYR 216


>gi|308467521|ref|XP_003096008.1| CRE-PHY-4 protein [Caenorhabditis remanei]
 gi|308244157|gb|EFO88109.1| CRE-PHY-4 protein [Caenorhabditis remanei]
          Length = 198

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 51/142 (35%), Positives = 73/142 (51%), Gaps = 2/142 (1%)

Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVE-HMTGLTTSTAEELQVVNYGIGGHYE 184
           KT   E +  R +   WL     P   ++ R ++  +  L  STAE  Q+++Y   G+Y 
Sbjct: 25  KTETPEKSEIRAANGTWLIHENRPNFAKMFRNLQTDIAALDLSTAEPWQILSYNSDGYYA 84

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
            HYDF  P + N       GNR+ATVL  +    +GG TVF  +NL++ P+ G    W N
Sbjct: 85  HHYDFLNP-DTNKQLVEARGNRIATVLVILQIAKKGGTTVFPKINLNIRPKAGDVVVWLN 143

Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
              SG+ D  T HAACP+  G+
Sbjct: 144 TLPSGESDSQTLHAACPIKEGT 165


>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
 gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
          Length = 248

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+ V + +    ++ + R S  A+L + E 
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 122

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233


>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
 gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
          Length = 289

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 94/201 (46%), Gaps = 24/201 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E   + ++A+P + ++TV + KTG+   +  R S   +L       
Sbjct: 84  EPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARGRDKT 143

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y IG  YEPHYD+      + + +   G R+ATV
Sbjct: 144 IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYF----MDEYNTRNGGQRIATV 199

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LS+ P+ G A  + ++      
Sbjct: 200 LMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMTPDATL 259

Query: 252 DYYTRHAACPVLTGSNSLHST 272
           D  + H  C V+ G N   ST
Sbjct: 260 DPSSLHGGCAVIKG-NKWSST 279


>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
           vinifera]
          Length = 296

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 55/194 (28%), Positives = 95/194 (48%), Gaps = 23/194 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y   + + E D +  +A+  L+R+ V +  +G+  ++  R S   ++ + + P+
Sbjct: 43  KPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGKDPI 102

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I  ++   T L     E++QV+ Y  G  Y+ HYD+      +       G+R+ATV
Sbjct: 103 VAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHYDYF----VDKVNIARGGHRIATV 158

Query: 211 LFYMSDVAQGGATVF-----------TSLNLS--------LWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF           T+ +LS        + P KG A  + +LH +   
Sbjct: 159 LMYLSDVVKGGETVFPMAEVSSSTLPTNDDLSECARKGIAVKPRKGDALLFFSLHPTAIP 218

Query: 252 DYYTRHAACPVLTG 265
           D  + H  CPV+ G
Sbjct: 219 DPMSLHGGCPVIEG 232


>gi|268562483|ref|XP_002638619.1| Hypothetical protein CBG05671 [Caenorhabditis briggsae]
          Length = 520

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 51/142 (35%), Positives = 74/142 (52%), Gaps = 2/142 (1%)

Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVE-HMTGLTTSTAEELQVVNYGIGGHYE 184
           KT   E +  R +   WL   + P   +I   ++ ++  L  STAE  Q+++Y   G+Y 
Sbjct: 92  KTETPEKSQVRAANGTWLIHTKRPNFAKIFWNLQVNIRALDLSTAEPWQILSYNSEGYYA 151

Query: 185 PHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN 244
           PHYDF  P E N       GNR+ATVL  +    +GG TVF  +N+++ P+ G    W N
Sbjct: 152 PHYDFLNP-ETNKVLVESRGNRIATVLVILQIAKKGGTTVFPKININIRPKIGDVVVWLN 210

Query: 245 LHSSGDGDYYTRHAACPVLTGS 266
               G+ D  T HAACP+  G+
Sbjct: 211 TVPDGESDSQTLHAACPIKEGT 232



 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 53/198 (26%), Positives = 88/198 (44%), Gaps = 5/198 (2%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEI-DLIKKMAQPRLRRATVQNYKTGELEIANYR 136
           +    +K E     P +++YRD+    ++ D I+ M         V N   G    + YR
Sbjct: 294 ISFQAVKVEVISWSPGLVIYRDMFTKKQVLDYIEIMKHQDFEEQQVVN-DDGTEYYSKYR 352

Query: 137 ISKSAWLREPEHPVIERISRRVEHMT-GLTTSTAEELQVVNYGIGGHYEPHYDFAR-PGE 194
            +    +  P+ P    I + V+ +   L   ++E++  ++Y  GGHY  H+DF   P E
Sbjct: 353 KANGTQIIAPDFPAALSIWKTVKILIPTLNIESSEDIVALSYIRGGHYAAHHDFLEYPSE 412

Query: 195 ANAFKSLGT-GNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDY 253
                 +   GNR  T++        GGAT+F SLN ++ P  G A FW N   +   + 
Sbjct: 413 KEWDGWMKDYGNRFGTLIMAFETAELGGATIFPSLNAAIRPNTGDAFFWFNAMGNTKQED 472

Query: 254 YTRHAACPVLTGSNSLHS 271
            + H  CP+  G  S+ +
Sbjct: 473 LSDHGGCPIYEGKKSIST 490


>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
 gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
          Length = 248

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 49/177 (27%), Positives = 94/177 (53%), Gaps = 11/177 (6%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+ V + +    ++ + R S  A+L + E 
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 122

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD+      +A       NR++
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-----VNNRIS 176

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233


>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
 gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
          Length = 211

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 49/175 (28%), Positives = 91/175 (52%), Gaps = 10/175 (5%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P I++  +V+ D E D + ++A  +++R+ +   +    E    R S S ++ + E+ +
Sbjct: 32  EPLIVVLGNVLSDEECDELIQLAGDKVKRSKIGTTR----EENELRTSSSMFIEDDENLI 87

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           + R+ +R+  +  +     E LQ++ Y  G  Y+ H+DF          S  T NR++T+
Sbjct: 88  VTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSS------DSKITNNRISTL 141

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           + Y++DV QGG T F  L  S+ P KG A ++   +S    + +T H   PV+ G
Sbjct: 142 VMYLNDVEQGGETFFPHLKFSVSPRKGMAVYFEYFYSDQTLNDFTLHGGAPVVEG 196


>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
 gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
          Length = 216

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 103/196 (52%), Gaps = 16/196 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E + + +M++ +++R+T+ + +    ++ + R S  A+L E E 
Sbjct: 36  FEEPLIVVLGNVISDEECNELIEMSKNKIKRSTIGSAR----DVNDIRTSSGAFLEENE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  +  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203

Query: 268 SLHSTCPCGLRRGLQR 283
            + +     +RRG  R
Sbjct: 204 WIATQW---VRRGTYR 216


>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
 gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
          Length = 248

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+ V + +    ++ + R S  A+L + E 
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSAR----DVNDIRTSSGAFLEDNE- 122

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233


>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
 gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
          Length = 216

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 53/178 (29%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R SK A+L + E 
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSR----DVNDIRTSKGAFLDDNEL 91

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            V  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 92  TV--KIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201


>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 287

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 60/203 (29%), Positives = 101/203 (49%), Gaps = 19/203 (9%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRI 137
           LRL  +K E     PRIIL  + +   E D ++ +A PRL  + V + KTG+   ++ R 
Sbjct: 74  LRLGYVKPEVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSDVRT 133

Query: 138 SKSAWL--REPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEA 195
           S   +L  +E ++P+++ I +R+   + +     E +QV+ Y    +Y+PH+D+     +
Sbjct: 134 SSGMFLNPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYF----S 189

Query: 196 NAFKSLGTGNRVATVLFYMSDVAQGGATVF-------------TSLNLSLWPEKGTAAFW 242
           + F     G R+AT+L Y+SD  +GG T F                 LS+ P KG A  +
Sbjct: 190 DTFNLKRGGQRIATMLMYLSDNIEGGETYFPLAGSGECSCGGKLVKGLSVKPIKGNAVLF 249

Query: 243 HNLHSSGDGDYYTRHAACPVLTG 265
            ++   G  D  + H  C V++G
Sbjct: 250 WSMGLDGQSDPNSVHGGCEVISG 272


>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
 gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
          Length = 248

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+ V + +    ++ + R S  A+L + E 
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDSEL 123

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 124 TL--KIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233


>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
          Length = 316

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 53/202 (26%), Positives = 97/202 (48%), Gaps = 22/202 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P +  +   +PR  LY+  + + E D +  +A+ +L ++ V + ++G+  ++  R S   
Sbjct: 51  PTRVTQLSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVADNESGKSIMSEVRTSSGM 110

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           +L + +  ++  I  R+   T L     E +Q+++Y  G  YEPH+D+      +    L
Sbjct: 111 FLLKAQDEIVADIEARIAAWTFLPVENGESIQILHYENGEKYEPHFDYFH----DKVNQL 166

Query: 202 GTGNRVATVLFYMSDVAQGGATVF------------------TSLNLSLWPEKGTAAFWH 243
             G+R+ATVL Y++ V +GG TVF                       ++ P+KG A  + 
Sbjct: 167 LGGHRIATVLMYLATVEEGGETVFPNSEGRFSQPKDDSWSDCAKKGYAVNPKKGDALLFF 226

Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
           +LH     D  + H +CPV+ G
Sbjct: 227 SLHPDATTDPSSLHGSCPVIAG 248


>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus thuringiensis HD-771]
 gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
 gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-771]
 gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
          Length = 216

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+ V + +    ++ + R S  A+L + E 
Sbjct: 36  FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 144 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201


>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
 gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
          Length = 216

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++  ++R+ V + +    ++ + R S  A+L E E 
Sbjct: 36  FEEPLIVVLANVLSDEECDKLIELSKNNMKRSKVGSSR----DVNDIRTSSGAFLEENE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +T +  +  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201


>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
 gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
          Length = 269

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 65/206 (31%), Positives = 99/206 (48%), Gaps = 22/206 (10%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE---IAN 134
           LR+  +K E     PRIIL    +   E D +  +A PRL ++TV +  TG+      + 
Sbjct: 53  LRIGLVKPEVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIESK 112

Query: 135 YRISKSAWLR--EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARP 192
            R S   +L   +  +P+I+ I RR+   + +     E LQV+ Y    +Y+PH+D+   
Sbjct: 113 VRTSTGMFLSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYF-- 170

Query: 193 GEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTA 239
             ++ F     G RVATVL Y+SDV +GG T+F S+              L + P KG A
Sbjct: 171 --SDQFNLKRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDA 228

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTG 265
             + +    G+ D  + H  C VL G
Sbjct: 229 ILFWSAALDGNVDSNSLHGGCSVLRG 254


>gi|195159162|ref|XP_002020451.1| GL14001 [Drosophila persimilis]
 gi|194117220|gb|EDW39263.1| GL14001 [Drosophila persimilis]
          Length = 452

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 27/179 (15%)

Query: 47  KYEMLCRGDLTVPPAIVAQLKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEI 106
           ++  +CR      P+   +L CRY     P+LRL PL+ EE  L P I +Y +V+ D+EI
Sbjct: 281 EFIQICRSSHQNKPS---RLHCRYNATTTPFLRLAPLRMEELSLDPYIAVYHNVLSDAEI 337

Query: 107 DLIKKMAQPRLRR----------ATVQNYKTGEL--EIANYRISKSAWLREPEHPVIERI 154
             ++++ +P L+R           T    +TG     I NY     A       PVIER+
Sbjct: 338 AEVERVIEPLLKRIGRYDEMPNSMTPSKRRTGFTGPHIDNYMHVSGA-------PVIERV 390

Query: 155 SRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFY 213
            R +  MTGL  +    L ++ YG+GGH + HYDF     A+   +   G+R+ATVLFY
Sbjct: 391 HRHIRDMTGLFMNV--HLMMIKYGLGGHCDQHYDFLN---ASYPSTHAMGDRMATVLFY 444


>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 255

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 58/193 (30%), Positives = 91/193 (47%), Gaps = 27/193 (13%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR  +Y   + D E D I  +++  L ++ V + KTG    ++ R S   ++     P I
Sbjct: 1   PRAFVYEGFLTDEECDHILALSKGHLHKSGVVDAKTGGSTTSDIRTSTGTFISRAHDPTI 60

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD--FARPGEANAFKSLGTGNRVAT 209
             I  R+E  + +     E LQV+ Y  G  Y+ H+D  F + G+ N        NR+AT
Sbjct: 61  TAIEERIELWSQIPVDHGEALQVLRYENGQEYKAHFDYFFHKGGKRN--------NRIAT 112

Query: 210 VLFYMSDVAQGGATVFTSLNL-----------------SLWPEKGTAAFWHNLHSSGDGD 252
           VL Y+SDV +GG TVF + ++                 S+   KG A  + ++   G+ D
Sbjct: 113 VLLYLSDVEEGGETVFPNTDVPTDRDRSQYSECGNGGKSVKARKGDALLFWSMKPGGELD 172

Query: 253 YYTRHAACPVLTG 265
             + HA CPV+ G
Sbjct: 173 PGSSHAGCPVIKG 185


>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
           vinifera]
 gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
          Length = 298

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 95/196 (48%), Gaps = 25/196 (12%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y   + + E D +  +A+  L+R+ V +  +G+  ++  R S   ++ + + P+
Sbjct: 43  KPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGKDPI 102

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I  ++   T L     E++QV+ Y  G  Y+ HYD+      +       G+R+ATV
Sbjct: 103 VAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHYDYF----VDKVNIARGGHRIATV 158

Query: 211 LFYMSDVAQGGATVF-------------TSLNLS--------LWPEKGTAAFWHNLHSSG 249
           L Y+SDV +GG TVF             T+ +LS        + P KG A  + +LH + 
Sbjct: 159 LMYLSDVVKGGETVFPMAEEPSRRKPLPTNDDLSECARKGIAVKPRKGDALLFFSLHPTA 218

Query: 250 DGDYYTRHAACPVLTG 265
             D  + H  CPV+ G
Sbjct: 219 IPDPMSLHGGCPVIEG 234


>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
 gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
          Length = 248

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+ V + +    ++ + R S  A+L + E 
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDSEL 123

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 124 TL--KIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233


>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
           konkukian str. 97-27]
 gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
           konkukian str. 97-27]
          Length = 232

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 50/177 (28%), Positives = 94/177 (53%), Gaps = 11/177 (6%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R S  A+L + E 
Sbjct: 52  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSSGAFLDDNE- 106

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
            + E+I +R+  +  +  S  E L ++NY +   Y+ HYD+      +A       NR++
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-----ANNRIS 160

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 217


>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
 gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
          Length = 232

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R S  A+L + E 
Sbjct: 52  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSSGAFLDDNE- 106

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            + E+I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 159

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 160 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 217


>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           G9842]
 gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           G9842]
          Length = 216

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 49/177 (27%), Positives = 94/177 (53%), Gaps = 11/177 (6%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+ V + +    ++ + R S  A+L + E 
Sbjct: 36  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDNE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD+      +A       NR++
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-----VNNRIS 144

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201


>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
 gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
 gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 272

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 73/271 (26%), Positives = 123/271 (45%), Gaps = 42/271 (15%)

Query: 7   QRAQGNKLYYQEALNKSPELKDEPPKVNNVAPTLEVTEREKYEMLCRGDLTVPPAIVAQL 66
           QR QG K+Y    L +     DE      V+  +++ E+ K  +L    L     ++  L
Sbjct: 15  QRLQGLKIYETSDLIQHINTFDELVG-EQVSVDVKIEEKTKDMIL----LCSLSPLLTTL 69

Query: 67  KCRYVH----RNVPYLRLMPLKEEEAYLQPRIILYRDVM--------YDSEIDLIKKMAQ 114
            C  V        P  R + +  +E    PR  +Y + +         + E D +  +A+
Sbjct: 70  TCSMVKVAASLRFPNERWLEVITKE----PRAFVYHNFLALFFKICKTNEECDHLISLAK 125

Query: 115 PRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQV 174
           P + R+ V+N  TG  E ++ R S   ++R     +++ I +R+   T +     E LQV
Sbjct: 126 PSMARSKVRNALTGLGEESSSRTSSGTFIRSGHDKIVKEIEKRISEFTFIPQENGETLQV 185

Query: 175 VNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVF-------TS 227
           +NY +G  +EPH+D         F+      R+ATVL Y+SDV +GG TVF       + 
Sbjct: 186 INYEVGQKFEPHFD--------GFQ------RIATVLMYLSDVDKGGETVFPEAKGIKSK 231

Query: 228 LNLSLWPEKGTAAFWHNLHSSGDGDYYTRHA 258
             +S+ P+KG A  + ++   G  D  ++H 
Sbjct: 232 KGVSVRPKKGDALLFWSMRPDGSRDPSSKHG 262


>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
 gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
          Length = 216

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+ V + +    ++ + R S  A+L + E 
Sbjct: 36  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSAR----DVNDIRTSSGAFLEDNE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 144 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201


>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 296

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 54/195 (27%), Positives = 92/195 (47%), Gaps = 23/195 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PRI LY + +   E + +  +A+P +R++TV   +TG    +  R S   +L      +
Sbjct: 91  EPRIFLYHNFLTKEECEHLINIAKPNMRKSTVIESETGMSIESRVRTSSGTFLARGRDKI 150

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +  I  R+   T +     EELQV++Y +G  Y PH+D+      +   +   G+R+AT+
Sbjct: 151 VRNIENRIADFTFIPVDNGEELQVLHYQVGEKYVPHHDYF----MDDINTANGGDRIATM 206

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF                       LS+ P+   A  + ++      
Sbjct: 207 LMYLSDVEEGGETVFPDAKGNFSSMPGWNELSVCGKKGLSIKPKMRNALLFWSIKPDATY 266

Query: 252 DYYTRHAACPVLTGS 266
           D  + H +CPV+ G+
Sbjct: 267 DPLSLHGSCPVIKGN 281


>gi|195166675|ref|XP_002024160.1| GL22879 [Drosophila persimilis]
 gi|194107515|gb|EDW29558.1| GL22879 [Drosophila persimilis]
          Length = 484

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 61/199 (30%), Positives = 93/199 (46%), Gaps = 27/199 (13%)

Query: 68  CRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKT 127
           C Y      ++RL PLK E   L P I +Y DV+Y+ EI  +  +A   L+    +  K 
Sbjct: 296 CHYESTRTAFVRLAPLKVEMLSLDPYIAIYHDVIYEREIARVMTLALSSLK-GPGRYSKR 354

Query: 128 GELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHY 187
            E  I      KS  + E E+    ++++R   MTG      ++ ++ N GIGG+   H 
Sbjct: 355 REHNI------KSVTVYEEENS---QLNQRTRDMTGEQVKEDKDFRIYNSGIGGYIRYHM 405

Query: 188 DFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHS 247
           D     E                   +++V  GGA  F  L  ++WP KG+A  WHNL++
Sbjct: 406 DNLAKEEQQ-----------------LNEVPHGGAISFPQLEFTVWPRKGSALVWHNLNN 448

Query: 248 SGDGDYYTRHAACPVLTGS 266
           + + DY   H +CPV+ GS
Sbjct: 449 NLELDYRVAHISCPVIVGS 467


>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
 gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
          Length = 289

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 48/154 (31%), Positives = 81/154 (52%), Gaps = 5/154 (3%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR++L+ +++   E   I   AQPR+ R+      TG  E+   R S   + +  E PV+
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARSLTVQTTTGGEEVNADRTSDGMFFQRGETPVV 161

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
           +R+  R+  +        E LQV++Y  G  Y+PHYD+    +PG +   +    G RVA
Sbjct: 162 QRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRR--GGQRVA 219

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
           T++ Y+++  +GG T F  + L + P +G A F+
Sbjct: 220 TLVIYLNNPRKGGGTTFPDVPLEVAPRQGNAVFF 253


>gi|195494572|ref|XP_002094895.1| GE22068 [Drosophila yakuba]
 gi|194180996|gb|EDW94607.1| GE22068 [Drosophila yakuba]
          Length = 438

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 98/201 (48%), Gaps = 27/201 (13%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
           L C YV    P+L+L PLK EE  ++P I ++   +   +I+++K +++P+L+R      
Sbjct: 256 LVCHYVDW-TPFLKLAPLKMEELSMKPHISIFYGFLGPKDIEVLKNVSRPKLQR------ 308

Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
              E   AN    K   L    H V+ +++  +  +TG  +   E ++V+NYGI G+Y P
Sbjct: 309 --NEHLSANCS-CKIGNLFSSSHDVVRKVNELILDITGFPSKGNEMVEVINYGIAGNYNP 365

Query: 186 HYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNL 245
             D A+P + N           A    ++ +  +GG  VF S +L + P KG+   W NL
Sbjct: 366 D-DTAQPRKHNK----------ANAFIFLGNAGKGGEIVFPSRDLKIRPRKGSMIVWENL 414

Query: 246 HSSGDGDYYTRHAACPVLTGS 266
             S        +  CP+L G+
Sbjct: 415 KKS------VIYHQCPILKGN 429


>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
 gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
          Length = 216

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+ V + +    ++ + R S  A+L + E 
Sbjct: 36  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR----DVNDIRTSSGAFLEDSEL 91

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 92  TL--KIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 144 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201


>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
 gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
          Length = 282

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 54/162 (33%), Positives = 88/162 (54%), Gaps = 7/162 (4%)

Query: 107 DLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTT 166
           DLI+ +A+PRL+RA   +   G+ +I   R S+  + R  E P++  I +R+  + G+  
Sbjct: 109 DLIE-LARPRLQRALTVD-SDGKQQIDQRRTSEGMFFRAGETPLVAAIEQRLAQLLGVPA 166

Query: 167 STAEELQVVNYGIGGHYEPHYDFARPGEANAFK-SLGTGNRVATVLFYMSDVAQGGATVF 225
           S  E LQ+++YG G  YEPHYD+  P      K +   G R+A+V+ Y++   +GG T F
Sbjct: 167 SHGEGLQILHYGPGQEYEPHYDWFDPALPGYDKLTARAGQRIASVVMYLNTPERGGGTAF 226

Query: 226 TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
             + L++   +G A ++    +   GD  + HA  PVL G  
Sbjct: 227 PEIGLTVTARRGAAVYF----AYEGGDQSSLHAGLPVLQGEK 264


>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
 gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
          Length = 289

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 48/154 (31%), Positives = 81/154 (52%), Gaps = 5/154 (3%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR++L+ +++   E   I   AQPR+ R+      TG  E+   R S   + +  E PV+
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARSLTVQTTTGGEEVNADRTSDGMFFQRGETPVV 161

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF---ARPGEANAFKSLGTGNRVA 208
           +R+  R+  +        E LQV++Y  G  Y+PHYD+    +PG +   +    G RVA
Sbjct: 162 QRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRR--GGQRVA 219

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
           T++ Y+++  +GG T F  + L + P +G A F+
Sbjct: 220 TLVIYLNNPLKGGGTTFPDVPLEVAPRQGNAVFF 253


>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
 gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
          Length = 201

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 57/185 (30%), Positives = 87/185 (47%), Gaps = 22/185 (11%)

Query: 103 DSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRVEHMT 162
           D E D +  +A PRLRR++V + KTG  + +  R S  A+LR     ++  I  R+  +T
Sbjct: 9   DDECDHLIGLALPRLRRSSVIDEKTGLGKDSRNRTSWGAFLRRDHDNIVSGIEDRISSIT 68

Query: 163 GLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATVLFYMSDVAQGGA 222
            +     E LQVV Y  G  +EPH D+ +  E N       G+R+ T+L Y+++V  GG 
Sbjct: 69  FIPKEYGESLQVVRYKTGQKFEPHQDYYKLTENNN----NGGHRIGTLLLYLTNVENGGE 124

Query: 223 TVF------------------TSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLT 264
           TVF                  T   + + P +G    +     SG+ D ++ H  CPV+ 
Sbjct: 125 TVFPRALANVINDYSTNTSECTKKGIVIRPRRGDGLLFWITRPSGEIDPFSFHGGCPVVK 184

Query: 265 GSNSL 269
           G   L
Sbjct: 185 GEKWL 189


>gi|195591294|ref|XP_002085377.1| GD12338 [Drosophila simulans]
 gi|194197386|gb|EDX10962.1| GD12338 [Drosophila simulans]
          Length = 438

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 59/204 (28%), Positives = 101/204 (49%), Gaps = 33/204 (16%)

Query: 66  LKCRYVHRNVPYLRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNY 125
           L CRY     P+L+L PLK EE  ++P I ++   +   +I+++K + +P+L+R     +
Sbjct: 256 LVCRYADW-TPFLKLAPLKMEELSMKPHISIFYVFLGQKDIEVLKNVFRPKLQRIE---H 311

Query: 126 KTGELEIANYRISKSAWLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEP 185
            +G        +S S+      H V+ +++  +  +TG  +   + L+V+NYGI G+Y P
Sbjct: 312 LSGNCSCKIGNLSSSS------HDVVRKVNELILDITGFPSKGNQMLEVINYGIAGNYNP 365

Query: 186 HYDFARP---GEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFW 242
             D A+P    +ANAF              ++    +GG  VF S +L + P KG+   W
Sbjct: 366 E-DTAKPKIHNKANAF-------------IFLESAGKGGEIVFPSRHLKVRPRKGSMLVW 411

Query: 243 HNLHSSGDGDYYTRHAACPVLTGS 266
            NL +S        +  CP+L G+
Sbjct: 412 ENLKNS------VIYHQCPILKGN 429


>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
 gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
          Length = 232

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 50/177 (28%), Positives = 94/177 (53%), Gaps = 11/177 (6%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R SK A+L + E 
Sbjct: 52  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 106

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD+      +A       NR++
Sbjct: 107 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-----ANNRIS 160

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 217


>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
 gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
          Length = 216

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 101/193 (52%), Gaps = 16/193 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ ++ R+ V + +    ++ + R S  A+L E E 
Sbjct: 36  FEEPLIVVLANVLSDEECDELIELSKSKMERSKVGSSR----DVNDIRTSSGAFLEENE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +T +  +  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEK 203

Query: 268 SLHSTCPCGLRRG 280
            + +     +RRG
Sbjct: 204 WIATQW---VRRG 213


>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
 gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
          Length = 216

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 53/193 (27%), Positives = 102/193 (52%), Gaps = 16/193 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E + + ++++ +++R+ V + +    ++ + R S  A+L E E 
Sbjct: 36  FEEPLIVVLANVLSDEECEELIELSKNKMKRSKVGSSR----DVNDIRTSSGAFLEENE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +T +  +  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203

Query: 268 SLHSTCPCGLRRG 280
            + +     +RRG
Sbjct: 204 WIATQW---VRRG 213


>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
 gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
          Length = 279

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 52/177 (29%), Positives = 86/177 (48%), Gaps = 3/177 (1%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR++++ +++   E + +   A+ RL R+     +TG   +   R S+  +    E+ ++
Sbjct: 92  PRVVVFGNLVSPEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSEGMFFERGENDIV 151

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATV 210
            R+ +R+  +        E LQ++ Y  G  Y PHYD+  PGE      L   G RVAT+
Sbjct: 152 ARLEQRIAALLRWPVEFGEGLQILRYAPGAQYRPHYDYFDPGEPGTPTILKRGGQRVATL 211

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           + Y+ +  QGGAT F  + L + P +GT  F+   +   D    T H   PVL G  
Sbjct: 212 VMYLQEPGQGGATTFPDVGLEVAPVRGTGVFFS--YEEPDPATRTLHGGAPVLAGEK 266


>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
 gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
          Length = 254

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 51/192 (26%), Positives = 98/192 (51%), Gaps = 14/192 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ ++ R+ + + +     + + R S  A+L E E 
Sbjct: 74  FEEPLIVVLANVLSDEECDELIELSKNKMERSKIGSSRN----VNDIRTSSGAFLEENE- 128

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
               +I +R+  +T +  +  E L ++NY +   Y+ HYD+      +A       NR++
Sbjct: 129 -FTSKIEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYFAEHSRSA-----ANNRIS 182

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNS 268
           T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G   
Sbjct: 183 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 242

Query: 269 LHSTCPCGLRRG 280
           + +     +RRG
Sbjct: 243 IATQW---MRRG 251


>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
 gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
          Length = 264

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 65/206 (31%), Positives = 98/206 (47%), Gaps = 22/206 (10%)

Query: 78  LRLMPLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELE---IAN 134
           LR+  +K E     PRI L    +   E D +  +A PRL ++TV +  TG+      + 
Sbjct: 52  LRIGLVKPEVLNWSPRITLLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIESK 111

Query: 135 YRISKSAWLR--EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARP 192
            R S   +L   +  +P+IE I RR+   + +     E LQV+ Y    +Y+PH+D+   
Sbjct: 112 VRTSTGMFLSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYF-- 169

Query: 193 GEANAFKSLGTGNRVATVLFYMSDVAQGGATVFTSL-------------NLSLWPEKGTA 239
             ++ F     G RVATVL Y+SDV +GG T+F S+              L + P KG A
Sbjct: 170 --SDQFNLKRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDA 227

Query: 240 AFWHNLHSSGDGDYYTRHAACPVLTG 265
             + +    G+ D  + H  C VL G
Sbjct: 228 ILFWSAALDGNVDSNSLHGGCSVLRG 253


>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
 gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
          Length = 285

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 58/181 (32%), Positives = 89/181 (49%), Gaps = 7/181 (3%)

Query: 88  AYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPE 147
           + L PR+++  D + D+E D +  +AQPRL R+   +   G   +   R S S  L+  +
Sbjct: 92  SLLLPRVVVLGDFLSDAECDALIALAQPRLARSRTVDNDNGAQIVHAARTSDSMCLQLGQ 151

Query: 148 HPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-GTGNR 206
             + +RI  R+  +        E LQV+ Y  G  Y+PHYD+  P  A     L   G R
Sbjct: 152 DALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYQPHYDYFDPTAAGTPVLLQAGGQR 211

Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTR--HAACPVLT 264
           +A+++ Y++   +GGAT F  ++L +   KG A F+    S       TR  HA  PVL 
Sbjct: 212 LASLVMYLNTPERGGATRFPDVHLDVAAVKGNAVFF----SYDRPHPMTRSLHAGAPVLA 267

Query: 265 G 265
           G
Sbjct: 268 G 268


>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
 gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
          Length = 216

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 102/196 (52%), Gaps = 16/196 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E   + +M++ +++R+T+ + +    ++ + R S  A+L E E 
Sbjct: 36  FEEPLIVVLGNVISDEECGELIEMSKNKIKRSTIGSSR----DVNDIRTSSGAFLEENE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  +  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203

Query: 268 SLHSTCPCGLRRGLQR 283
            + +     +RRG  R
Sbjct: 204 WIATQW---VRRGTYR 216


>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
 gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
          Length = 239

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 61/202 (30%), Positives = 95/202 (47%), Gaps = 22/202 (10%)

Query: 82  PLKEEEAYLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSA 141
           P +  +   QPR  +Y+  + D E D +  +A+ +L ++ V N +TGE   +  R S   
Sbjct: 15  PTRAAQLSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSMVANDETGESMESQERTSSGM 74

Query: 142 WLREPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL 201
           ++ + E  ++  I  R+   T L     E +Q++ Y  G  YE H D+    +AN  +  
Sbjct: 75  FIFKTEDEIVNGIEARIAAWTFLPEENGEPIQILRYEHGQKYEAHIDYF-VDKANQEEG- 132

Query: 202 GTGNRVATVLFYMSDVAQGGATVF-------TSLNLSLW-----------PEKGTAAFWH 243
             G+R ATVL Y+SDV +GG TVF       +      W           P KG A  + 
Sbjct: 133 --GHRAATVLMYLSDVKKGGETVFPTSEAEGSQAKDDSWSDCAKKGYAVKPNKGDALLFF 190

Query: 244 NLHSSGDGDYYTRHAACPVLTG 265
           +LH     D  + HA+CPV+ G
Sbjct: 191 SLHPDATPDPGSLHASCPVIEG 212


>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
          Length = 239

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 45/135 (33%), Positives = 78/135 (57%), Gaps = 4/135 (2%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR+ LY+  + D E + +  +A+  L+R+ V +  +G+  ++  R S   +LR+ + P+
Sbjct: 57  KPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEVRTSSGTFLRKGQDPI 116

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           +E I  ++   T L     E++QV+ Y  G  YEPHYD+      +   ++  G+R ATV
Sbjct: 117 VEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYF----TDNVNTVRGGHRYATV 172

Query: 211 LFYMSDVAQGGATVF 225
           L Y++DV +GG TVF
Sbjct: 173 LLYLTDVPEGGETVF 187


>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
 gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
          Length = 216

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 102/196 (52%), Gaps = 16/196 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ +++R+T+ + +    ++ + R S  A+L E E 
Sbjct: 36  FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR----DVNDIRTSSGAFLEENE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  +  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H    V  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGASVTKGEK 203

Query: 268 SLHSTCPCGLRRGLQR 283
            + +     +RRG  R
Sbjct: 204 WIATQW---VRRGTYR 216


>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 266

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 53/163 (32%), Positives = 83/163 (50%), Gaps = 12/163 (7%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E   + ++A+P + ++TV + KTG+   +  R S   +L       
Sbjct: 83  EPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARGRDKT 142

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y IG  YEPHYD+      + + +   G R+ATV
Sbjct: 143 IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYF----MDEYNTRNGGQRIATV 198

Query: 211 LFYMSDVAQGGATVFTSL--NLSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +   N S  P      +W+ L   G G
Sbjct: 199 LMYLSDVEEGGETVFPAAKGNYSAVP------WWNELSECGKG 235


>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
          Length = 322

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 55/187 (29%), Positives = 88/187 (47%), Gaps = 7/187 (3%)

Query: 86  EEAYLQPRIILYRDVMYDSEIDLIKKMA-QPRLRRATVQNYKTGELEIANYRISKSAWLR 144
           E   + PRI +  +++ + E D +  +A Q  L  + +  Y T +L  +  R +K AWL 
Sbjct: 76  ETVSVDPRIFIVHNLLTEEECDHLVSLALQKGLSASLITPYGTNKLVESTTRTNKQAWLD 135

Query: 145 EPEHPVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTG 204
             +  V++R+  ++  +T  T    E LQV++Y     +  H+D+  P           G
Sbjct: 136 FQQDDVVKRVEDKIAKLTKTTPEQGENLQVLHYAKSQQFTEHHDYFDPATDPPENYEKGG 195

Query: 205 NRVATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDG------DYYTRHA 258
           NR+ TV+ Y+    +GG T F + NL L   KG A  ++NL    DG      D  T HA
Sbjct: 196 NRLITVIVYLQAAEEGGETHFGAANLKLTAAKGDAVMFYNLKHGCDGIDPTCVDKQTLHA 255

Query: 259 ACPVLTG 265
             P + G
Sbjct: 256 GLPPIKG 262


>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 267

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 53/163 (32%), Positives = 83/163 (50%), Gaps = 12/163 (7%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E   + ++A+P + ++TV + KTG+   +  R S   +L       
Sbjct: 84  EPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARGRDKT 143

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQV++Y IG  YEPHYD+      + + +   G R+ATV
Sbjct: 144 IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYF----MDEYNTRNGGQRIATV 199

Query: 211 LFYMSDVAQGGATVFTSL--NLSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +   N S  P      +W+ L   G G
Sbjct: 200 LMYLSDVEEGGETVFPAAKGNYSAVP------WWNELSECGKG 236


>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
 gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
          Length = 229

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R SK A+L + E 
Sbjct: 49  FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 103

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 104 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 156

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 157 STLVMYLNDVEEGGETFFPKLNLSVNPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 214


>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
 gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
          Length = 248

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 50/178 (28%), Positives = 94/178 (52%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ ++ R+ + + +    ++ + R S  A+L + E 
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR----DVNDIRTSSGAFLEDNE- 122

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233


>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
 gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
          Length = 205

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 87/180 (48%), Gaps = 10/180 (5%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
            P + +  + + D E +   +M + ++ RA V      E E    R +   WL      V
Sbjct: 16  DPIVYVVNNFLSDDECEAFVEMGKGKMERAKV--ISDDESEFHASRTNDFCWLEHSASDV 73

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDF----ARPGEANAFKSLGTGNR 206
           I  +S+R   +  +  + AE+ Q+V YG G  Y+PH+D      + G+ N F     G R
Sbjct: 74  IHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAFDKTTKEGQNNWFPG---GQR 130

Query: 207 VATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHN-LHSSGDGDYYTRHAACPVLTG 265
           + T L Y++DV +GGAT F  +N+S+ P KG    +HN +  + + +    H   PV+ G
Sbjct: 131 MVTALAYLNDVEEGGATDFPKINVSVKPNKGDVVVFHNCIEGTTEINPQALHGGSPVVAG 190


>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
 gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
          Length = 216

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R SK A+L + E 
Sbjct: 36  FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 144 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201


>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
 gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
          Length = 296

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 51/169 (30%), Positives = 91/169 (53%), Gaps = 1/169 (0%)

Query: 99  DVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIERISRRV 158
           +V+   E   + +MA+PRL  +T+ +  +G   +++ R S   + R  E+ ++ R+ RR+
Sbjct: 107 NVVDAHECKALIEMAKPRLAPSTLVDPMSGRDVVSDKRASWGMFFRLCENDLVARLDRRL 166

Query: 159 EHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLG-TGNRVATVLFYMSDV 217
             +  L     E L ++ Y  G   EPH+D+  P  A   +S+  +G RV+T++ Y++D 
Sbjct: 167 SALMNLPLENGEGLHLLYYPTGAGSEPHHDYLAPTNAANRESIARSGQRVSTLVTYLNDA 226

Query: 218 AQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGS 266
            +GG TVF  L L++ P +G A ++     +G  D  + HA+ PV  G 
Sbjct: 227 PEGGQTVFPQLGLAVSPIRGNACYFEYCDGNGRVDARSLHASAPVTRGD 275


>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
 gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
          Length = 216

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R SK A+L + E 
Sbjct: 36  FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSR----DVNDIRTSKGAFLDDNE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 144 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201


>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
 gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
          Length = 216

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 94/178 (52%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R S  A+L + E 
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSR----DVNDIRTSSGAFLEDNEL 91

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            V  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 92  TV--KIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201


>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 286

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 58/177 (32%), Positives = 87/177 (49%), Gaps = 7/177 (3%)

Query: 92  PRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVI 151
           PR+++    + D+E D +  +AQPRL R+   +   G   +   R S S  L+  +  + 
Sbjct: 96  PRVMVLGGFLSDAECDAMIALAQPRLARSRTVDNANGAHVVHAARTSDSMCLQLGQDALC 155

Query: 152 ERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSL-GTGNRVATV 210
           +RI  R+  +        E LQV+ YG G  Y+PHYD+  P  A     L   G RVA++
Sbjct: 156 QRIEARIARLLDWPVENGEGLQVLRYGTGAEYQPHYDYFDPDAAGTPVLLQAGGQRVASL 215

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTR--HAACPVLTG 265
           + Y++   +GGAT F  ++L +   KG A F+    S       TR  HA  PVL G
Sbjct: 216 VMYLNTPDRGGATRFPDVHLDIAAIKGNAVFF----SYDRPHPMTRSLHAGAPVLAG 268


>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
 gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
          Length = 219

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 48/178 (26%), Positives = 96/178 (53%), Gaps = 11/178 (6%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P +++  +V+ + E D + ++++ +++R+ +      E E+ + R S   +  E E+
Sbjct: 36  FEEPLVLVLGNVLSNEECDELIQLSKDKMQRSKI----GAEREVNSIRTSSGMFFEESEN 91

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            ++ +I RR+  + G +   AE LQ++ Y     Y+ H+D F    +A+        NR+
Sbjct: 92  ELVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASK------NNRI 145

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  L LS+ P KG A ++   +S  + +  T H   PV+ G
Sbjct: 146 STLVMYLNDVEEGGETYFPKLGLSISPTKGMAVYFEYFYSDAELNDRTLHGGAPVIKG 203


>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
 gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
          Length = 220

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 91/176 (51%), Gaps = 13/176 (7%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +P I++  +V+ D E + + ++++  ++R+ +   +    E+ N R S   +L E E   
Sbjct: 42  EPLIVVLENVLSDEECESLIELSKDSMKRSKIGASR----EVDNIRTSSGTFLEENETVA 97

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRVAT 209
           I  I +RV  +  +     E L ++ Y  G  Y+ HYD FA    A         NR++T
Sbjct: 98  I--IEKRVSSIMNIPVEHGEGLHILKYTPGQEYKAHYDYFAEHSRA------AENNRIST 149

Query: 210 VLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           ++ Y++DV +GG T F  LNLS+ P+KG+A ++   ++    +  T H   PV+ G
Sbjct: 150 LVMYLNDVEEGGETFFPKLNLSIAPKKGSAVYFEYFYNDKSLNELTLHGGAPVIKG 205


>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
 gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
          Length = 216

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 101/196 (51%), Gaps = 16/196 (8%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R S  A+L + E 
Sbjct: 36  FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSR----DVNDIRTSSGAFLDDDE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSN 267
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G  
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 203

Query: 268 SLHSTCPCGLRRGLQR 283
            + +     +RRG  R
Sbjct: 204 WIATQW---VRRGTYR 216


>gi|407698902|ref|YP_006823689.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
           'Black Sea 11']
 gi|407248049|gb|AFT77234.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
           'Black Sea 11']
          Length = 263

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 53/179 (29%), Positives = 89/179 (49%), Gaps = 7/179 (3%)

Query: 93  RIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPVIE 152
           ++  Y D +   E D I  + + +L  + +     G     + R S +  L    + +++
Sbjct: 81  QLFAYDDFLSSQECDDIVALTKDKLAPSKL----AGAASADDIRTSSTCELAFLGNKLVK 136

Query: 153 RISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKS--LGTGNRVATV 210
            +  R+     L     E +Q  +Y +G +Y+PHYDF  PG    +K+  L  G R  T 
Sbjct: 137 DVDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGSPQ-YKTHCLSRGQRTWTC 195

Query: 211 LFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNSL 269
           + Y++D   GG T FT L++++ P+KG A FW+NL  SGD +  + H A PV  G  ++
Sbjct: 196 MIYLNDECDGGHTRFTKLDIAVRPKKGMALFWNNLLPSGDPNLNSIHFAEPVTRGHKTV 254


>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
 gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
 gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
 gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
          Length = 248

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/178 (28%), Positives = 93/178 (52%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + +M++ ++ R+ + + +    ++ + R S  A+L + E 
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR----DVNDIRTSSGAFLEDNE- 122

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
               +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 123 -FTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 175

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 176 STLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 233


>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
 gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
          Length = 216

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/178 (28%), Positives = 95/178 (53%), Gaps = 13/178 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +++R+ V + +    ++ + R S  A+L + E 
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKSKMKRSKVGSSR----DVNDIRTSSGAFLDDNE- 90

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYD-FARPGEANAFKSLGTGNRV 207
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD FA    + A       NR+
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAEHSRSAA------NNRI 143

Query: 208 ATVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTG 265
           +T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G
Sbjct: 144 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKG 201


>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 93/196 (47%), Gaps = 23/196 (11%)

Query: 91  QPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEHPV 150
           +PR  +Y + +   E   +  +A+P + ++TV + KTGE   +  R S   +L   +  +
Sbjct: 80  EPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKI 139

Query: 151 IERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVATV 210
           I  I +R+   T +     E LQ+++Y +G  Y+ HYD+      + +     G R+AT+
Sbjct: 140 IRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYF----VDEYNIKKGGQRMATL 195

Query: 211 LFYMSDVAQGGATVFTSLN-------------------LSLWPEKGTAAFWHNLHSSGDG 251
           L Y+SDV +GG TVF +                     LS+ P+ G A  + ++      
Sbjct: 196 LMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATL 255

Query: 252 DYYTRHAACPVLTGSN 267
           D  + H ACPV+ G+ 
Sbjct: 256 DPTSLHGACPVIRGNK 271


>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
 gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
          Length = 232

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 100/195 (51%), Gaps = 14/195 (7%)

Query: 89  YLQPRIILYRDVMYDSEIDLIKKMAQPRLRRATVQNYKTGELEIANYRISKSAWLREPEH 148
           + +P I++  +V+ D E D + ++++ +L R+ V + +    ++ + R S  A+L + E 
Sbjct: 52  FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSR----DVNDIRTSSGAFLDDNE- 106

Query: 149 PVIERISRRVEHMTGLTTSTAEELQVVNYGIGGHYEPHYDFARPGEANAFKSLGTGNRVA 208
            +  +I +R+  +  +  S  E L ++NY +   Y+ HYD+      +A       NR++
Sbjct: 107 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-----ANNRIS 160

Query: 209 TVLFYMSDVAQGGATVFTSLNLSLWPEKGTAAFWHNLHSSGDGDYYTRHAACPVLTGSNS 268
           T++ Y++DV +GG T F  LNLS+ P KG A ++   +     +  T H   PV  G   
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220

Query: 269 LHSTCPCGLRRGLQR 283
           + +     +RRG  R
Sbjct: 221 IATQW---VRRGTYR 232


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.136    0.411 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,179,375,595
Number of Sequences: 23463169
Number of extensions: 217742668
Number of successful extensions: 465570
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1424
Number of HSP's successfully gapped in prelim test: 422
Number of HSP's that attempted gapping in prelim test: 461153
Number of HSP's gapped (non-prelim): 2037
length of query: 312
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 170
effective length of database: 9,027,425,369
effective search space: 1534662312730
effective search space used: 1534662312730
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)