BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 030651
(174 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q20065|P4HA2_CAEEL Prolyl 4-hydroxylase subunit alpha-2 OS=Caenorhabditis elegans
GN=phy-2 PE=1 SV=1
Length = 539
Score = 55.1 bits (131), Expect = 2e-07, Method: Composition-based stats.
Identities = 25/78 (32%), Positives = 46/78 (58%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE++ ++P A ++ N + E E + LA+P ++++TV +S TG+ + + R S +L
Sbjct: 318 VEILRFDPLAVLFKNVIHDSEIEVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLK 377
Query: 137 RGRDKIIRDIEKRIADFT 154
D +I + +RI DFT
Sbjct: 378 GDLDPVIDRVNRRIEDFT 395
>sp|P54001|P4HA1_RAT Prolyl 4-hydroxylase subunit alpha-1 OS=Rattus norvegicus GN=P4ha1
PE=2 SV=2
Length = 534
Score = 50.8 bits (120), Expect = 5e-06, Method: Composition-based stats.
Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 1/74 (1%)
Query: 82 WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
W+ PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ D
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYED 391
Query: 141 KIIRDIEKRIADFT 154
++ I RI D T
Sbjct: 392 PVVSRINMRIQDLT 405
>sp|Q60715|P4HA1_MOUSE Prolyl 4-hydroxylase subunit alpha-1 OS=Mus musculus GN=P4ha1 PE=2
SV=2
Length = 534
Score = 46.2 bits (108), Expect = 1e-04, Method: Composition-based stats.
Identities = 22/72 (30%), Positives = 38/72 (52%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPV 393
Query: 143 IRDIEKRIADFT 154
+ I RI D T
Sbjct: 394 VSRINMRIQDLT 405
>sp|Q1RMU3|P4HA1_BOVIN Prolyl 4-hydroxylase subunit alpha-1 OS=Bos taurus GN=P4HA1 PE=1
SV=1
Length = 534
Score = 44.3 bits (103), Expect = 4e-04, Method: Composition-based stats.
Identities = 21/72 (29%), Positives = 38/72 (52%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFT 154
+ I RI D T
Sbjct: 394 VSRINMRIQDLT 405
>sp|Q5RAG8|P4HA1_PONAB Prolyl 4-hydroxylase subunit alpha-1 OS=Pongo abelii GN=P4HA1 PE=2
SV=1
Length = 534
Score = 44.3 bits (103), Expect = 4e-04, Method: Composition-based stats.
Identities = 21/72 (29%), Positives = 38/72 (52%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFT 154
+ I RI D T
Sbjct: 394 VSRINMRIQDLT 405
>sp|P13674|P4HA1_HUMAN Prolyl 4-hydroxylase subunit alpha-1 OS=Homo sapiens GN=P4HA1 PE=1
SV=2
Length = 534
Score = 44.3 bits (103), Expect = 4e-04, Method: Composition-based stats.
Identities = 21/72 (29%), Positives = 38/72 (52%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFT 154
+ I RI D T
Sbjct: 394 VSRINMRIQDLT 405
>sp|Q6W3E9|P4HA3_RAT Prolyl 4-hydroxylase subunit alpha-3 OS=Rattus norvegicus GN=P4ha3
PE=2 SV=1
Length = 544
Score = 42.7 bits (99), Expect = 0.001, Method: Composition-based stats.
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 2/77 (2%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI P +YH+F+S EE + + LA P +++S V + K R S +L
Sbjct: 340 EVIHLRPLVALYHDFVSDEEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 397
Query: 138 GRDKIIRDIEKRIADFT 154
D ++ +++RIA T
Sbjct: 398 TVDPVLVTLDRRIAALT 414
>sp|Q10576|P4HA1_CAEEL Prolyl 4-hydroxylase subunit alpha-1 OS=Caenorhabditis elegans
GN=dpy-18 PE=1 SV=2
Length = 559
Score = 42.7 bits (99), Expect = 0.001, Method: Composition-based stats.
Identities = 23/78 (29%), Positives = 39/78 (50%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE+ + P A ++ + +S +E + LA P + ++TV DS TGK + R S +L
Sbjct: 321 VEIKRFNPLAVLFKDVISDDEVAAIQELAKPKLARATVHDSVTGKLVTATYRISKSAWLK 380
Query: 137 RGRDKIIRDIEKRIADFT 154
++ + KRI T
Sbjct: 381 EWEGDVVETVNKRIGYMT 398
>sp|P16924|P4HA1_CHICK Prolyl 4-hydroxylase subunit alpha-1 OS=Gallus gallus GN=P4HA1 PE=1
SV=1
Length = 516
Score = 42.4 bits (98), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 1/75 (1%)
Query: 81 SWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
W+ PR + + +S EE E + LA P + ++TV D +TGK + R S +L+
Sbjct: 313 EWDKPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYE 372
Query: 140 DKIIRDIEKRIADFT 154
++ I RI D T
Sbjct: 373 SPVVSRINTRIQDLT 387
>sp|Q75UG4|P4HA3_BOVIN Prolyl 4-hydroxylase subunit alpha-3 OS=Bos taurus GN=P4HA3 PE=2
SV=1
Length = 544
Score = 42.0 bits (97), Expect = 0.002, Method: Composition-based stats.
Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI EP +YH+F+S E + + LA P +++S V + K R S +L
Sbjct: 340 EVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 397
Query: 138 GRDKIIRDIEKRIADFT 154
D ++ ++ RIA T
Sbjct: 398 TVDPVLVTLDHRIAALT 414
>sp|Q6W3F0|P4HA3_MOUSE Prolyl 4-hydroxylase subunit alpha-3 OS=Mus musculus GN=P4ha3 PE=2
SV=1
Length = 542
Score = 40.0 bits (92), Expect = 0.008, Method: Composition-based stats.
Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+ P +YH+F+S EE + + LA P +++S V + K R S +L
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 395
Query: 138 GRDKIIRDIEKRIADFT 154
D ++ ++ RIA T
Sbjct: 396 TVDPMLVTLDHRIAALT 412
>sp|Q7Z4N8|P4HA3_HUMAN Prolyl 4-hydroxylase subunit alpha-3 OS=Homo sapiens GN=P4HA3 PE=1
SV=1
Length = 544
Score = 38.1 bits (87), Expect = 0.026, Method: Composition-based stats.
Identities = 24/77 (31%), Positives = 36/77 (46%), Gaps = 2/77 (2%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI EP +YH+F+S E + + LA P +++S V + K R S +L
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 397
Query: 138 GRDKIIRDIEKRIADFT 154
D + + RIA T
Sbjct: 398 TVDPKLVTLNHRIAALT 414
>sp|Q0WT31|PMTP_ARATH Probable methyltransferase PMT25 OS=Arabidopsis thaliana
GN=At2g34300 PE=1 SV=2
Length = 770
Score = 33.5 bits (75), Expect = 0.73, Method: Composition-based stats.
Identities = 18/70 (25%), Positives = 34/70 (48%), Gaps = 1/70 (1%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAF-GILSMPSSSGDSRKANDLSSIV 59
MA +YSR +KSSS L +T++++ + ++ F S P+ S + +
Sbjct: 1 MAMGKYSRVDGKKSSSYGLTITIVLLLSLCLVGTWMFMSSWSAPADSAGYSSTDTAKDVS 60
Query: 60 RKSMERSEGD 69
+ + + EGD
Sbjct: 61 KNDLRKEEGD 70
>sp|B2A2Y9|RECF_NATTJ DNA replication and repair protein RecF OS=Natranaerobius
thermophilus (strain ATCC BAA-1301 / DSM 18059 /
JW/NM-WN-LF) GN=recF PE=3 SV=1
Length = 386
Score = 33.1 bits (74), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 19/66 (28%), Positives = 30/66 (45%), Gaps = 9/66 (13%)
Query: 68 GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRV 127
G RA++ E+I WE F L KE+ +Y + + T + GK+K+ +V
Sbjct: 49 GKSHRAQKEKELIRWETSGFYLKGELEKEQAQYTLEIITNYQ---------NGKNKNLKV 99
Query: 128 RTSSGT 133
S T
Sbjct: 100 NNLSQT 105
>sp|Q8L7V3|PMTQ_ARATH Probable methyltransferase PMT26 OS=Arabidopsis thaliana
GN=At5g64030 PE=1 SV=1
Length = 829
Score = 31.6 bits (70), Expect = 2.4, Method: Composition-based stats.
Identities = 19/57 (33%), Positives = 32/57 (56%), Gaps = 5/57 (8%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSS-GDSRKANDLS 56
MA+PRY+R R+ SS+ +++F L+ GI M SSS G ++ +++S
Sbjct: 1 MAQPRYTRIDNRRPSSNYCSTVTVVVFVALCLV----GIWMMTSSSVGPAQNVDEVS 53
>sp|P20659|TRX_DROME Histone-lysine N-methyltransferase trithorax OS=Drosophila
melanogaster GN=trx PE=1 SV=4
Length = 3726
Score = 30.8 bits (68), Expect = 5.0, Method: Composition-based stats.
Identities = 24/87 (27%), Positives = 41/87 (47%), Gaps = 4/87 (4%)
Query: 58 IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYH--NFLSK--EECEYLINLATPHMRKST 113
I +K R+E + +AE+W + + E +A +Y LSK + C L +R +
Sbjct: 1475 ICKKCARRNESSKIKAEEWRQAVMEEFKASLYSVLKLLSKSRQACALLKLSPRKKLRCTC 1534
Query: 114 VVDSDTGKSKDSRVRTSSGTFLARGRD 140
S+ GK + ++ SSG+ G D
Sbjct: 1535 GASSNQGKLQPKALQFSSGSDNGLGSD 1561
>sp|Q8XX80|SYV_RALSO Valine--tRNA ligase OS=Ralstonia solanacearum (strain GMI1000)
GN=valS PE=3 SV=1
Length = 968
Score = 30.4 bits (67), Expect = 5.5, Method: Composition-based stats.
Identities = 15/52 (28%), Positives = 27/52 (51%)
Query: 102 INLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADF 153
I++A H R S +D G+ + + +F+AR ++ +KR+ADF
Sbjct: 898 IDVAAEHARLSKEIDRLRGEIVKCEGKLGNESFVARAPAAVVEQEQKRLADF 949
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.323 0.136 0.389
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 58,955,881
Number of Sequences: 539616
Number of extensions: 2184066
Number of successful extensions: 5758
Number of sequences better than 100.0: 22
Number of HSP's better than 100.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 5745
Number of HSP's gapped (non-prelim): 22
length of query: 174
length of database: 191,569,459
effective HSP length: 109
effective length of query: 65
effective length of database: 132,751,315
effective search space: 8628835475
effective search space used: 8628835475
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 57 (26.6 bits)