BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 026697
(235 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q20065|P4HA2_CAEEL Prolyl 4-hydroxylase subunit alpha-2 OS=Caenorhabditis elegans
GN=phy-2 PE=1 SV=1
Length = 539
Score = 130 bits (327), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 109/209 (52%), Gaps = 23/209 (11%)
Query: 24 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 83
VE++ ++P A ++ N + E E + LA+P ++++TV +S TG+ + + R S +L
Sbjct: 318 VEILRFDPLAVLFKNVIHDSEIEVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLK 377
Query: 84 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 139
D +I + +RI DFT E LQV +Y G Y+PHFD+ E F T N G
Sbjct: 378 GDLDPVIDRVNRRIEDFTNLNQATSEELQVANYGLGGHYDPHFDFARKEEKNAFKTLNTG 437
Query: 140 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 199
R+ATVL Y+S E GG TVF N L G ++ P DAL ++++
Sbjct: 438 NRIATVLFYMSQPERGGATVF-------------NHL------GTAVFPSKNDALFWYNL 478
Query: 200 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 228
+ D D + H CPV+ G KW S KWI
Sbjct: 479 RRDGEGDLRTRHAACPVLLGVKWVSNKWI 507
>sp|P54001|P4HA1_RAT Prolyl 4-hydroxylase subunit alpha-1 OS=Rattus norvegicus GN=P4ha1
PE=2 SV=2
Length = 534
Score = 130 bits (326), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 104/203 (51%), Gaps = 23/203 (11%)
Query: 30 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 89
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 393
Query: 90 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 145
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453
Query: 146 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 205
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 206 DPSSLHGGCPVIKGNKWSSTKWI 228
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWL 517
>sp|P16924|P4HA1_CHICK Prolyl 4-hydroxylase subunit alpha-1 OS=Gallus gallus GN=P4HA1 PE=1
SV=1
Length = 516
Score = 126 bits (316), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 30 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 89
+PR + + +S EE E + LA P + ++TV D +TGK + R S +L+ +
Sbjct: 316 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 375
Query: 90 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 145
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 376 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 435
Query: 146 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 205
L Y+SDV GG TVFP + G S+ PK G A+ ++++ P
Sbjct: 436 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFPSGEG 476
Query: 206 DPSSLHGGCPVIKGNKWSSTKWI 228
D S+ H CPV+ GNKW S KW+
Sbjct: 477 DYSTRHAACPVLVGNKWVSNKWL 499
>sp|Q60715|P4HA1_MOUSE Prolyl 4-hydroxylase subunit alpha-1 OS=Mus musculus GN=P4ha1 PE=2
SV=2
Length = 534
Score = 126 bits (316), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 30 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 89
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPV 393
Query: 90 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 145
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453
Query: 146 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 205
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 206 DPSSLHGGCPVIKGNKWSSTKWI 228
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWL 517
>sp|Q5RAG8|P4HA1_PONAB Prolyl 4-hydroxylase subunit alpha-1 OS=Pongo abelii GN=P4HA1 PE=2
SV=1
Length = 534
Score = 124 bits (310), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 30 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 89
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 90 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 145
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 146 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 205
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 206 DPSSLHGGCPVIKGNKWSSTKWI 228
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWL 517
>sp|P13674|P4HA1_HUMAN Prolyl 4-hydroxylase subunit alpha-1 OS=Homo sapiens GN=P4HA1 PE=1
SV=2
Length = 534
Score = 124 bits (310), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 30 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 89
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 90 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 145
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 146 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 205
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 206 DPSSLHGGCPVIKGNKWSSTKWI 228
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWL 517
>sp|Q1RMU3|P4HA1_BOVIN Prolyl 4-hydroxylase subunit alpha-1 OS=Bos taurus GN=P4HA1 PE=1
SV=1
Length = 534
Score = 123 bits (308), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 30 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 89
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 90 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 145
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 146 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 205
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVLAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 206 DPSSLHGGCPVIKGNKWSSTKWI 228
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWL 517
>sp|Q5ZLK5|P4HA2_CHICK Prolyl 4-hydroxylase subunit alpha-2 OS=Gallus gallus GN=P4HA2 PE=2
SV=1
Length = 534
Score = 118 bits (295), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 70/207 (33%), Positives = 105/207 (50%), Gaps = 23/207 (11%)
Query: 31 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 90
P Y++ +S EE E + LA P + ++TV D TG + R S ++L D ++
Sbjct: 337 PHIVRYYDVMSDEEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 396
Query: 91 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 148
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K+ G R+AT L Y
Sbjct: 397 AKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSTLKSEGNRLATFLNY 456
Query: 149 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 208
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 457 MSDVEAGGATVFPDF-------------------GAAIWPKKGTAVFWYNLFRSGEGDYR 497
Query: 209 SLHGGCPVIKGNKWSSTKWI--RVNEY 233
+ H CPV+ G KW S KW R NE+
Sbjct: 498 TRHAACPVLVGCKWVSNKWFHERGNEF 524
>sp|Q10576|P4HA1_CAEEL Prolyl 4-hydroxylase subunit alpha-1 OS=Caenorhabditis elegans
GN=dpy-18 PE=1 SV=2
Length = 559
Score = 115 bits (288), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 69/217 (31%), Positives = 105/217 (48%), Gaps = 25/217 (11%)
Query: 24 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 83
VE+ + P A ++ + +S +E + LA P + ++TV DS TGK + R S +L
Sbjct: 321 VEIKRFNPLAVLFKDVISDDEVAAIQELAKPKLARATVHDSVTGKLVTATYRISKSAWLK 380
Query: 84 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 139
++ + KRI T +E E LQ+ +Y G Y+PHFD+ E F + G
Sbjct: 381 EWEGDVVETVNKRIGYMTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSFESLGTG 440
Query: 140 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 199
R+ATVL Y+S GG TVF A+ +I P DAL ++++
Sbjct: 441 NRIATVLFYMSQPSHGGGTVFTEAKS-------------------TILPTKNDALFWYNL 481
Query: 200 KPDASLDPSSLHGGCPVIKGNKWSSTKWI--RVNEYK 234
+P + H CPV+ G KW S KWI + NE++
Sbjct: 482 YKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFR 518
>sp|O15460|P4HA2_HUMAN Prolyl 4-hydroxylase subunit alpha-2 OS=Homo sapiens GN=P4HA2 PE=1
SV=1
Length = 535
Score = 115 bits (287), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 98/201 (48%), Gaps = 23/201 (11%)
Query: 31 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 90
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 91 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 146
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 455
Query: 147 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 206
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 207 PSSLHGGCPVIKGNKWSSTKW 227
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKW 517
>sp|Q60716|P4HA2_MOUSE Prolyl 4-hydroxylase subunit alpha-2 OS=Mus musculus GN=P4ha2 PE=2
SV=1
Length = 537
Score = 115 bits (287), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 98/201 (48%), Gaps = 23/201 (11%)
Query: 31 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 90
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397
Query: 91 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 146
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 398 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDDEDAFKRLGTGNRVATFL 457
Query: 147 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 206
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 458 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 498
Query: 207 PSSLHGGCPVIKGNKWSSTKW 227
+ H CPV+ G KW S KW
Sbjct: 499 YRTRHAACPVLVGCKWVSNKW 519
>sp|Q6W3E9|P4HA3_RAT Prolyl 4-hydroxylase subunit alpha-3 OS=Rattus norvegicus GN=P4ha3
PE=2 SV=1
Length = 544
Score = 108 bits (269), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 70/210 (33%), Positives = 101/210 (48%), Gaps = 28/210 (13%)
Query: 25 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 84
EVI P +YH+F+S EE + + LA P +++S V + K R S +L
Sbjct: 340 EVIHLRPLVALYHDFVSDEEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 397
Query: 85 GRDKIIRDIEKRIADFTFFPLEN--GEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 139
D ++ +++RIA T ++ E LQV++Y G YEPHFD+ G
Sbjct: 398 TVDPVLVTLDRRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYKMKSG 457
Query: 140 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW-S 198
R AT+++YLS VE GG T F GN S P + +A LFW +
Sbjct: 458 NRAATLMIYLSSVEAGGATAF--IYGNFSV------------------PVVKNAALFWWN 497
Query: 199 MKPDASLDPSSLHGGCPVIKGNKWSSTKWI 228
+ D +LH GCPV+ G+KW + KWI
Sbjct: 498 LHRSGEGDDDTLHAGCPVLVGDKWVANKWI 527
>sp|Q75UG4|P4HA3_BOVIN Prolyl 4-hydroxylase subunit alpha-3 OS=Bos taurus GN=P4HA3 PE=2
SV=1
Length = 544
Score = 106 bits (265), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 70/210 (33%), Positives = 100/210 (47%), Gaps = 28/210 (13%)
Query: 25 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 84
EVI EP +YH+F+S E + + LA P +++S V + K R S +L
Sbjct: 340 EVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 397
Query: 85 GRDKIIRDIEKRIADFTFFPLEN--GEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 139
D ++ ++ RIA T ++ E LQV++Y G YEPHFD+ N G
Sbjct: 398 TVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSG 457
Query: 140 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW-S 198
R+AT ++YLS VE GG T F GN S P + +A LFW +
Sbjct: 458 NRVATFMIYLSSVEAGGATAF--IYGNFSV------------------PVVKNAALFWWN 497
Query: 199 MKPDASLDPSSLHGGCPVIKGNKWSSTKWI 228
+ D +LH CPV+ G+KW + KWI
Sbjct: 498 LHRSGEGDGDTLHAACPVLVGDKWVANKWI 527
>sp|Q6W3F0|P4HA3_MOUSE Prolyl 4-hydroxylase subunit alpha-3 OS=Mus musculus GN=P4ha3 PE=2
SV=1
Length = 542
Score = 104 bits (260), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 100/210 (47%), Gaps = 28/210 (13%)
Query: 25 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 84
EV+ P +YH+F+S EE + + LA P +++S V + K R S +L
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 395
Query: 85 GRDKIIRDIEKRIADFTFFPLEN--GEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 139
D ++ ++ RIA T ++ E LQV++Y G YEPHFD+ G
Sbjct: 396 TVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 455
Query: 140 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW-S 198
R+AT ++YLS VE GG T F GN S P + +A LFW +
Sbjct: 456 NRVATFMIYLSSVEAGGATAF--IYGNFSV------------------PVVKNAALFWWN 495
Query: 199 MKPDASLDPSSLHGGCPVIKGNKWSSTKWI 228
+ D +LH GCPV+ G+KW + KWI
Sbjct: 496 LHRSGEGDGDTLHAGCPVLVGDKWVANKWI 525
>sp|Q7Z4N8|P4HA3_HUMAN Prolyl 4-hydroxylase subunit alpha-3 OS=Homo sapiens GN=P4HA3 PE=1
SV=1
Length = 544
Score = 103 bits (256), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 95/209 (45%), Gaps = 26/209 (12%)
Query: 25 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 84
EVI EP +YH+F+S E + + LA P +++S V + K R S +L
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 397
Query: 85 GRDKIIRDIEKRIADFTFFPLEN--GEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 139
D + + RIA T + E LQV++Y G YEPHFD+ G
Sbjct: 398 TVDPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 457
Query: 140 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 199
R+AT ++YLS VE GG T F A LS+ AL +W++
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYAN-------------------LSVPVVRNAALFWWNL 498
Query: 200 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 228
D +LH GCPV+ G+KW + KWI
Sbjct: 499 HRSGEGDSDTLHAGCPVLVGDKWVANKWI 527
>sp|Q5UP57|P4H_MIMIV Putative prolyl 4-hydroxylase OS=Acanthamoeba polyphaga mimivirus
GN=MIMI_L593 PE=1 SV=1
Length = 242
Score = 90.5 bits (223), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 93/205 (45%), Gaps = 33/205 (16%)
Query: 32 RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 91
+ FV +N ++ +C+ ++ A + DS D +R S ++++ + +++
Sbjct: 59 KPFVLNNLINPTKCQEIMQFAN-----GKLFDSQVLSGTDKNIRNSQQMWISKN-NPMVK 112
Query: 92 DIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD------EFNTKNGGQRMATV 145
I + I P +N E LQV+ Y Q Y H D D EF + GGQR+ TV
Sbjct: 113 PIFENICRQFNVPFDNAEDLQVVRYLPNQYYNEHHDSCCDSSKQCSEF-IERGGQRILTV 171
Query: 146 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS- 204
L+YL++ G T FPN KPK GDAL+F+ + +++
Sbjct: 172 LIYLNNEFSDGHTYFPNLNQ-------------------KFKPKTGDALVFYPLANNSNK 212
Query: 205 LDPSSLHGGCPVIKGNKWSSTKWIR 229
P SLH G PV G KW + W R
Sbjct: 213 CHPYSLHAGMPVTSGEKWIANLWFR 237
>sp|Q9NXG6|P4HTM_HUMAN Transmembrane prolyl 4-hydroxylase OS=Homo sapiens GN=P4HTM PE=1
SV=2
Length = 502
Score = 80.5 bits (197), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 61/190 (32%), Positives = 82/190 (43%), Gaps = 32/190 (16%)
Query: 74 VRTSSGTFLARGR--DKIIRDIEKRIADFTFFP---LENGEGLQVLHYEAGQKYEPHFD- 127
VR S T+L +G I+R I +R+ T +E E LQV+ Y G Y H D
Sbjct: 272 VRNSHHTWLYQGEGAHHIMRAIRQRVLRLTRLSPEIVELSEPLQVVRYGEGGHYHAHVDS 331
Query: 128 -------------YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNI------- 167
+E R TVL YL++V GGETVFP A
Sbjct: 332 GPVYPETICSHTKLVANESVPFETSCRYMTVLFYLNNVTGGGETVFPVADNRTYDEMSLI 391
Query: 168 -SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS-----LDPSSLHGGCPVIKGNK 221
V + C K L +KP+ G A+ +++ PD +D SLHGGC V +G K
Sbjct: 392 QDDVDLRDTRRHCDKGNLRVKPQQGTAVFWYNYLPDGQGWVGDVDDYSLHGGCLVTRGTK 451
Query: 222 WSSTKWIRVN 231
W + WI V+
Sbjct: 452 WIANNWINVD 461
>sp|Q8BG58|P4HTM_MOUSE Transmembrane prolyl 4-hydroxylase OS=Mus musculus GN=P4htm PE=2
SV=1
Length = 503
Score = 80.1 bits (196), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 60/190 (31%), Positives = 82/190 (43%), Gaps = 32/190 (16%)
Query: 74 VRTSSGTFLARGR--DKIIRDIEKRIADFTFFP---LENGEGLQVLHYEAGQKYEPHFD- 127
VR S T+L +G ++R I +R+ T +E E LQV+ Y G Y H D
Sbjct: 273 VRNSHHTWLHQGEGAHHVMRAIRQRVLRLTRLSPEIVEFSEPLQVVRYGEGGHYHAHVDS 332
Query: 128 -------------YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNI------- 167
+E R TVL YL++V GGETVFP A
Sbjct: 333 GPVYPETICSHTKLVANESVPFETSCRYMTVLFYLNNVTGGGETVFPVADNRTYDEMSLI 392
Query: 168 -SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS-----LDPSSLHGGCPVIKGNK 221
V + C K L +KP+ G A+ +++ PD +D SLHGGC V +G K
Sbjct: 393 QDDVDLRDTRRHCDKGNLRVKPQQGTAVFWYNYLPDGQGWVGEVDDYSLHGGCLVTRGTK 452
Query: 222 WSSTKWIRVN 231
W + WI V+
Sbjct: 453 WIANNWINVD 462
>sp|Q0AP20|Y1675_MARMM PKHD-type hydroxylase Mmar10_1675 OS=Maricaulis maris (strain
MCS10) GN=Mmar10_1675 PE=3 SV=1
Length = 219
Score = 35.0 bits (79), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 30/120 (25%), Positives = 47/120 (39%), Gaps = 35/120 (29%)
Query: 113 VLHYEAGQKYEPHFDYFMDEFNTKNGGQRM-ATVLMYLSDVE--EGGETVFPNAQGNISA 169
V Y G Y PH D + GG+R + ++LSD + +GGE V G
Sbjct: 83 VSRYRDGMAYGPHID------DALMGGRRADLSFTLFLSDPDSYDGGELVMDGPDGETE- 135
Query: 170 VPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 229
IK GDA+++ + S++H PV +G + + W+R
Sbjct: 136 ----------------IKLAAGDAVVYAT---------SAIHQVAPVTRGERVAVVGWVR 170
>sp|O19023|CEL3B_MACMU Chymotrypsin-like elastase family member 3B (Fragment) OS=Macaca
mulatta GN=CELA3B PE=2 SV=1
Length = 257
Score = 32.3 bits (72), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 30/62 (48%), Gaps = 5/62 (8%)
Query: 152 VEEGGETVFPNAQGNISAVPWWNELS-ECGK----TGLSIKPKMGDALLFWSMKPDASLD 206
V+EG E V P G++ P WN L CG LS ++GDA+ S+ P +
Sbjct: 79 VKEGPEQVIPINSGDLFVHPLWNRLCVACGNDIALIKLSRSAQLGDAVQLASLPPAGDIL 138
Query: 207 PS 208
P+
Sbjct: 139 PN 140
>sp|B2A2Y9|RECF_NATTJ DNA replication and repair protein RecF OS=Natranaerobius
thermophilus (strain ATCC BAA-1301 / DSM 18059 /
JW/NM-WN-LF) GN=recF PE=3 SV=1
Length = 386
Score = 31.6 bits (70), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 19/66 (28%), Positives = 30/66 (45%), Gaps = 9/66 (13%)
Query: 15 GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRV 74
G RA++ E+I WE F L KE+ +Y + + T + GK+K+ +V
Sbjct: 49 GKSHRAQKEKELIRWETSGFYLKGELEKEQAQYTLEIITNYQ---------NGKNKNLKV 99
Query: 75 RTSSGT 80
S T
Sbjct: 100 NNLSQT 105
>sp|Q92JE7|AAT_RICCN Aspartate aminotransferase OS=Rickettsia conorii (strain ATCC
VR-613 / Malish 7) GN=aatA PE=3 SV=1
Length = 401
Score = 31.6 bits (70), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 19/74 (25%), Positives = 35/74 (47%), Gaps = 11/74 (14%)
Query: 102 FFPLENGEGLQVLHYEAGQKY--EPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETV 159
F N EG+ +L K+ E + DY +DE GG+++ L +++ +++G E +
Sbjct: 61 FTKYTNVEGMPLLKQAIKDKFKRENNIDYELDEIIVSTGGKQVIYNL-FMASLDQGDEVI 119
Query: 160 FPNAQGNISAVPWW 173
P P+W
Sbjct: 120 IP--------APYW 125
>sp|Q973C8|SYT_SULTO Threonine--tRNA ligase OS=Sulfolobus tokodaii (strain DSM 16993 /
JCM 10545 / NBRC 100140 / 7) GN=thrS PE=3 SV=1
Length = 540
Score = 31.6 bits (70), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 30/60 (50%), Gaps = 2/60 (3%)
Query: 91 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLS 150
R I +R+ F+F P E GL + HY+ GQ FM+E N G Q + T +Y S
Sbjct: 136 RIIGERLDLFSF-PDETAPGLALFHYK-GQIIRKELMKFMEEINESMGYQEVFTAEIYRS 193
>sp|Q9CQ52|CEL3B_MOUSE Chymotrypsin-like elastase family member 3B OS=Mus musculus
GN=Cela3b PE=2 SV=1
Length = 269
Score = 30.8 bits (68), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 5/62 (8%)
Query: 152 VEEGGETVFPNAQGNISAVPWWNELS-ECGK----TGLSIKPKMGDALLFWSMKPDASLD 206
VEEG E V P G++ P WN + CG LS ++GDA+ + P +
Sbjct: 91 VEEGQEQVIPINAGDLFVHPKWNSMCVSCGNDIALVKLSRSAQLGDAVQLACLPPAGEIL 150
Query: 207 PS 208
P+
Sbjct: 151 PN 152
>sp|A7H409|SECA_CAMJD Protein translocase subunit SecA OS=Campylobacter jejuni subsp.
doylei (strain ATCC BAA-1458 / RM4099 / 269.97) GN=secA
PE=3 SV=1
Length = 862
Score = 30.4 bits (67), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 58/131 (44%), Gaps = 12/131 (9%)
Query: 32 RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 91
R+ V+HN L+KE+ + + A H +++ ++ D GK + T+ RG D I
Sbjct: 455 RSEVFHNMLAKEKIPHHVLNAKNHEQEALII-QDAGKKGAVTIATNMA---GRGVDIKID 510
Query: 92 DIEKRIADFTFFPLENGEGLQV---LHYEAGQKYEPHFDYFM----DEFNTKNGGQRMAT 144
D + + E E ++ L AG++ +P F D GG R+ +
Sbjct: 511 DEIRALGGLYIIGTERHESRRIDNQLRGRAGRQGDPGISRFYLSLEDNLLRIFGGDRIKS 570
Query: 145 VLMYLSDVEEG 155
++ L +EEG
Sbjct: 571 IMDRLG-IEEG 580
>sp|Q60MF5|RN207_CAEBR Probable RING finger protein 207 homolog OS=Caenorhabditis briggsae
GN=CBG23170 PE=4 SV=3
Length = 836
Score = 30.4 bits (67), Expect = 9.9, Method: Composition-based stats.
Identities = 28/103 (27%), Positives = 44/103 (42%), Gaps = 4/103 (3%)
Query: 11 CRSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK 70
CR+ + R ++IS E R+ VY + L K+ E I L +RK + G+
Sbjct: 97 CRNVTHQARMFSSHKIISSEERSKVYSSSLCKDHNEPYI-LYCSDVRKLVCIQCFNGRPL 155
Query: 71 DSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQV 113
+ R S + +G + IE+ A F+ E E L V
Sbjct: 156 EER---HSFISIEQGHRMCLEKIEQSAAKLRFYQSERQEELNV 195
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.135 0.426
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 97,699,486
Number of Sequences: 539616
Number of extensions: 4206268
Number of successful extensions: 8370
Number of sequences better than 100.0: 34
Number of HSP's better than 100.0 without gapping: 19
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 8305
Number of HSP's gapped (non-prelim): 36
length of query: 235
length of database: 191,569,459
effective HSP length: 114
effective length of query: 121
effective length of database: 130,053,235
effective search space: 15736441435
effective search space used: 15736441435
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (27.3 bits)