BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 020122
(331 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q3ED68|Y1295_ARATH Uncharacterized PKHD-type hydroxylase At1g22950 OS=Arabidopsis
thaliana GN=At1g22950 PE=2 SV=2
Length = 397
Score = 423 bits (1088), Expect = e-118, Method: Compositional matrix adjust.
Identities = 195/330 (59%), Positives = 258/330 (78%), Gaps = 1/330 (0%)
Query: 1 MALEASIDRRNHAQSATTNGAVVLPPASYRLRLNPSSEHKPDSYDDLHQLEFTPLLFSSL 60
MAL++S + Q + A +LR P+ EH+P++Y+DL L+++P LF+SL
Sbjct: 10 MALDSSGKQPEQQQQQQPRASSGNGEARLKLRRTPNEEHEPENYEDL-PLDYSPSLFTSL 68
Query: 61 ERYLPPTMLSMSRDVKFQYMRDILMKYSRDGERTRVQRHKEYRQRIISNYQPLHRELFTM 120
ERYLP +L+ +R K +MRD+L++YS D ER RV RHKEYR +I+S+YQ LH E++T+
Sbjct: 69 ERYLPEQLLNSTRIDKASFMRDLLLRYSPDTERVRVLRHKEYRDKIMSSYQRLHGEIYTL 128
Query: 121 HAPSVLVPAFVKAVRDNTEASFRSIMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWVHD 180
S P+F+ A +E +FRS M E PGI+TFEM +P+FCEMLL+EVE+ E+WV+D
Sbjct: 129 DPSSFFAPSFLGAFSRKSEPNFRSSMVESYPGIFTFEMFKPQFCEMLLAEVEHMEKWVYD 188
Query: 181 TRFRIMRPNTMNKFGAVLDDFGLETMLDKLMNDFIRPISKVFFPEVGGSTLDSHHGFVVE 240
+R IMRPNTMN FG VLDDFG ++ML KL++DFI PI++V FPEV G++LDSHHG++VE
Sbjct: 189 SRSTIMRPNTMNNFGVVLDDFGFDSMLQKLVDDFISPIAQVLFPEVCGTSLDSHHGYIVE 248
Query: 241 YGMDRDVELGFHVDDSEVTLNVCLGREFSGGELFFRGVRCDKHVNTETQSEEILDYSHVP 300
YG DRDV+LGFHVDDSEV+LNVCLG++FSGGEL+FRGVRCDKHVN+++ +E+ DYSHVP
Sbjct: 249 YGKDRDVDLGFHVDDSEVSLNVCLGKQFSGGELYFRGVRCDKHVNSDSTEKEVYDYSHVP 308
Query: 301 GYAVLHRGRHRHGARATTSGSRVNLLVWCR 330
G+A+LHRGRHRHGARATTSG R NL++WCR
Sbjct: 309 GHAILHRGRHRHGARATTSGHRANLILWCR 338
>sp|Q28C22|OGFD2_XENTR 2-oxoglutarate and iron-dependent oxygenase domain-containing
protein 2 OS=Xenopus tropicalis GN=ogfod2 PE=2 SV=1
Length = 349
Score = 159 bits (403), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 94/262 (35%), Positives = 147/262 (56%), Gaps = 19/262 (7%)
Query: 78 QYMRDILMKYSRDGERTR-VQRHKEYRQRIIS-NYQPLHRELFTMHAPSVLVPAFVKAVR 135
+ R++L ++ ER R + +R+R IS +Y+PL+ E++ + S L F+ AV+
Sbjct: 53 EQFRNVLETIKKEVERRRKLGEESLHRRREISLHYKPLYPEVYVLQE-SFLAAEFLTAVK 111
Query: 136 ------DNTEASFRSIMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWVHDTRFRIMRPN 189
N E + + IY + P FC L+ E+ENFER + RPN
Sbjct: 112 YSKSPQANVEGLLHHLHSITDKRIYRLPVFIPEFCAKLVEELENFER----SDLPKGRPN 167
Query: 190 TMNKFGAVLDDFG-LETMLDKLMNDFIRPISKVFFPEVGGSTLDSHHGFVVEYGMDRDVE 248
TMN +G +L++ G ++ + L +I P++ + FP+ GG LDSH FVV+Y + D++
Sbjct: 168 TMNNYGILLNELGFVDALTAPLCEKYIEPLTSLLFPDWGGGCLDSHRAFVVKYALQEDLD 227
Query: 249 LGFHVDDSEVTLNVCLGREFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAVLHRG 308
L H D++EVTLNV LG+EF+ G L+F ++ + VN T +E H+ G +LHRG
Sbjct: 228 LSCHYDNAEVTLNVSLGKEFTDGNLYFSDMK-EVPVNERTYAE----VEHITGQGILHRG 282
Query: 309 RHRHGARATTSGSRVNLLVWCR 330
+H HGA +SG R NL++W R
Sbjct: 283 QHVHGALPISSGERWNLILWMR 304
>sp|A3KGZ2|OGFD2_DANRE 2-oxoglutarate and iron-dependent oxygenase domain-containing
protein 2 OS=Danio rerio GN=ogfod2 PE=2 SV=1
Length = 345
Score = 154 bits (389), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 89/259 (34%), Positives = 143/259 (55%), Gaps = 19/259 (7%)
Query: 81 RDILMKYSRDGERTRVQRHKEY-RQRIISN-YQPLHRELFTMHAPSVLVPAFVKAVR--- 135
RD++ K + ER + + K R +I Y PLH+ ++ + S L P ++ V+
Sbjct: 52 RDVIGKIQAEIERRQNHKLKSTERAAVIKEIYTPLHQHVYHLQ-ESFLAPELLEMVKYCA 110
Query: 136 ---DNTEASFRSIMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWVHDTRFRIMRPNTMN 192
N + + I E ++ F++ + FC+ LL E+E+FE+ + RPNTMN
Sbjct: 111 SSEANVQGLLKLIQTEAASRVFRFQVFRKEFCKDLLEELEHFEQ----SDAPKGRPNTMN 166
Query: 193 KFGAVLDDFGL-ETMLDKLMNDFIRPISKVFFPEVGGSTLDSHHGFVVEYGMDRDVELGF 251
+G VL++ G E + L ++RP++ + + + GG+ LDSH FVV+Y M D+ L +
Sbjct: 167 NYGIVLNELGFDEGFITPLREVYLRPLTALLYSDCGGNCLDSHKAFVVKYDMHEDLNLSY 226
Query: 252 HVDDSEVTLNVCLGREFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAVLHRGRHR 311
H D+SEVTLNV LG++F+ G LFF +R E ++ H +LHRG+H
Sbjct: 227 HYDNSEVTLNVSLGKDFTEGNLFFGDMR-----QVPLSETECVEVEHRVTEGLLHRGQHM 281
Query: 312 HGARATTSGSRVNLLVWCR 330
HGA + +SG+R NL++W R
Sbjct: 282 HGALSISSGTRWNLIIWMR 300
>sp|Q9CQ04|OGFD2_MOUSE 2-oxoglutarate and iron-dependent oxygenase domain-containing
protein 2 OS=Mus musculus GN=Ogfod2 PE=2 SV=1
Length = 349
Score = 150 bits (380), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 87/249 (34%), Positives = 136/249 (54%), Gaps = 18/249 (7%)
Query: 91 GERTRVQRHKEYRQRII-SNYQPLHRELFTMHAPSVLVPAFVKAVRDNTE--ASFRSIMA 147
G R R+ + R+ +I S+Y P E+++ + L P F+ A +T A ++
Sbjct: 68 GRRRRLGQESAVRKALIASSYHPARPEVYSSLQDAALAPEFMAAAEYSTSPGADLEGLLQ 127
Query: 148 --EPIPG---IYTFEMLQPRFCEMLLSEVENFERWVHDTRFRIMRPNTMNKFGAVLDDFG 202
E + IY + +FC+ LL E+E+FE+ + RPNTMN G ++ + G
Sbjct: 128 RLETVSEEKRIYRVPVFSAKFCQTLLEELEHFEQ----SDMPKGRPNTMNNHGVLMYELG 183
Query: 203 LET-MLDKLMNDFIRPISKVFFPEVGGSTLDSHHGFVVEYGMDRDVELGFHVDDSEVTLN 261
L+ ++ L F+ P+ + +P+ GG LDSH FVV+Y + +D++LG H D++E+TLN
Sbjct: 184 LDDPLVTPLRERFLLPLMALLYPDYGGGYLDSHRAFVVKYALGQDLDLGCHYDNAELTLN 243
Query: 262 VCLGREFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAVLHRGRHRHGARATTSGS 321
V LG++F+GG L+F G+ +E L+ HV G +LHRG HGAR G
Sbjct: 244 VALGKDFTGGALYFGGL-----FQAPAALKETLEVEHVVGSGILHRGGQLHGARPLCKGE 298
Query: 322 RVNLLVWCR 330
R NL+VW R
Sbjct: 299 RWNLVVWLR 307
>sp|Q6N063|OGFD2_HUMAN 2-oxoglutarate and iron-dependent oxygenase domain-containing
protein 2 OS=Homo sapiens GN=OGFOD2 PE=2 SV=2
Length = 350
Score = 150 bits (378), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 89/247 (36%), Positives = 134/247 (54%), Gaps = 18/247 (7%)
Query: 93 RTRVQRHKEYRQRII-SNYQPLHRELFTMHAPSVLVPAFVKAVRDNT--EASFRSIMA-- 147
R R+ + R+ +I S+Y P E++ + L P F+ + +A + ++
Sbjct: 71 RQRLGQESAARKALIASSYHPARPEVYDSLQDAALAPEFLAVTEYSVSPDADLKGLLQRL 130
Query: 148 EPIPG---IYTFEMLQPRFCEMLLSEVENFERWVHDTRFRIMRPNTMNKFGAVLDDFGL- 203
E + IY + FC+ LL E+E+FE+ + RPNTMN +G +L + GL
Sbjct: 131 ETVSEEKRIYRVPVFTAPFCQALLEELEHFEQ----SDMPKGRPNTMNNYGVLLHELGLD 186
Query: 204 ETMLDKLMNDFIRPISKVFFPEVGGSTLDSHHGFVVEYGMDRDVELGFHVDDSEVTLNVC 263
E ++ L F++P+ + +P+ GG LDSH FVV+Y +D+ELG H D++E+TLNV
Sbjct: 187 EPLMTPLRERFLQPLMALLYPDCGGGRLDSHRAFVVKYAPGQDLELGCHYDNAELTLNVA 246
Query: 264 LGREFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAVLHRGRHRHGARATTSGSRV 323
LG+ F+GG L+F G+ T E L+ HV G VLHRG HGAR +G R
Sbjct: 247 LGKVFTGGALYFGGL-----FQAPTALTEPLEVEHVVGQGVLHRGGQLHGARPLGTGERW 301
Query: 324 NLLVWCR 330
NL+VW R
Sbjct: 302 NLVVWLR 308
>sp|Q20679|PLOD_CAEEL Procollagen-lysine,2-oxoglutarate 5-dioxygenase OS=Caenorhabditis
elegans GN=let-268 PE=1 SV=1
Length = 730
Score = 67.0 bits (162), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 88/187 (47%), Gaps = 23/187 (12%)
Query: 145 IMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWV----HDTRFRIMRPNTMNKFGAVLDD 200
++ + P +Y F ++ RFCE L+ E+E F RW +D R N + ++
Sbjct: 549 VVDQACPDVYDFPLMSERFCEELIEEMEGFGRWSDGSNNDKRLAGGYENVPTR-DIHMNQ 607
Query: 201 FGLETMLDKLMNDFIRPISKVFFPEVGGSTLDSHHGFVVEYGMDRDVELGFHVDDSEVTL 260
G E M+ ++RP+ + F ++S+ FVV Y + L H D S ++
Sbjct: 608 VGFERQWLYFMDTYVRPVQEKTFIGYYHQPVESNMMFVVRYKPEEQPSLRPHHDASTFSI 667
Query: 261 NVCL---GREFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAVLHRGR--HRHGAR 315
++ L GR++ GG GVR ++ N ++E+ GYA++ GR H H
Sbjct: 668 DIALNKKGRDYEGG-----GVRYIRY-NCTVPADEV-------GYAMMFPGRLTHLHEGL 714
Query: 316 ATTSGSR 322
ATT G+R
Sbjct: 715 ATTKGTR 721
>sp|Q9R0E1|PLOD3_MOUSE Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 OS=Mus musculus
GN=Plod3 PE=1 SV=1
Length = 741
Score = 61.6 bits (148), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 85/188 (45%), Gaps = 24/188 (12%)
Query: 144 SIMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWV----HDTRFRIMRPNTMNKFGAVLD 199
++ +P P +Y F +L + C+ L+ E+E++ +W D+R N + +
Sbjct: 560 GLVEQPCPDVYWFPLLTEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYEN-VPTVDIHMK 618
Query: 200 DFGLETMLDKLMNDFIRPISKVFFPEVGGSTLDSHHGFVVEYGMDRDVELGFHVDDSEVT 259
G E +L+ ++ P+++ FP T + FVV Y D L H D S T
Sbjct: 619 QVGYEDQWLQLLRTYVGPMTEYLFPGYHTKT-RAVMNFVVRYRPDEQPSLRPHHDSSTFT 677
Query: 260 LNVCL---GREFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAVLHRGR--HRHGA 314
LNV L G ++ GG F +R D +++ + G+A+LH GR H H
Sbjct: 678 LNVALNHKGVDYEGGGCRF--LRYDCRISSPRK-----------GWALLHPGRLTHYHEG 724
Query: 315 RATTSGSR 322
TT G+R
Sbjct: 725 LPTTRGTR 732
>sp|Q5U367|PLOD3_RAT Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 OS=Rattus
norvegicus GN=Plod3 PE=2 SV=1
Length = 741
Score = 60.8 bits (146), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 85/188 (45%), Gaps = 24/188 (12%)
Query: 144 SIMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWV----HDTRFRIMRPNTMNKFGAVLD 199
++ +P P +Y F +L + C+ L+ E+E++ +W D+R N + +
Sbjct: 560 GLVEQPCPDVYWFPLLTEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYEN-VPTVDIHMK 618
Query: 200 DFGLETMLDKLMNDFIRPISKVFFPEVGGSTLDSHHGFVVEYGMDRDVELGFHVDDSEVT 259
G E +L+ ++ P+++ FP T + FVV Y D L H D S T
Sbjct: 619 QVGYEDQWLQLLRTYVGPMTEHLFPGYHTKT-RAVMNFVVRYRPDEQPSLRPHHDSSTFT 677
Query: 260 LNVCL---GREFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAVLHRGR--HRHGA 314
LNV L G ++ GG F +R D V++ + G+A+LH GR H H
Sbjct: 678 LNVALNHKGVDYEGGGCRF--LRYDCRVSSPRK-----------GWALLHPGRLTHYHEG 724
Query: 315 RATTSGSR 322
TT G+R
Sbjct: 725 LPTTRGTR 732
>sp|Q5R6K5|PLOD3_PONAB Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 OS=Pongo abelii
GN=PLOD3 PE=2 SV=1
Length = 738
Score = 60.5 bits (145), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 86/188 (45%), Gaps = 24/188 (12%)
Query: 144 SIMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWV----HDTRFRIMRPNTMNKFGAVLD 199
I+ +P P +Y F +L + C+ L++E+E++ +W D+R N + +
Sbjct: 557 GIVEQPCPDVYWFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYEN-VPTVDIHMK 615
Query: 200 DFGLETMLDKLMNDFIRPISKVFFPEVGGSTLDSHHGFVVEYGMDRDVELGFHVDDSEVT 259
G E +L+ ++ P+++ FP + + FVV Y D L H D S T
Sbjct: 616 QVGYEDQWLQLLRTYVGPMTESLFPGY-HTKARAVMNFVVRYRPDEQPSLRPHHDSSTFT 674
Query: 260 LNVCL---GREFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAVLHRGR--HRHGA 314
LNV L G ++ GG F +R D +++ + G+A+LH GR H H
Sbjct: 675 LNVALNHKGLDYEGGGCRF--LRYDCVISSPRK-----------GWALLHPGRLTHYHEG 721
Query: 315 RATTSGSR 322
TT G+R
Sbjct: 722 LPTTWGTR 729
>sp|O60568|PLOD3_HUMAN Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 OS=Homo sapiens
GN=PLOD3 PE=1 SV=1
Length = 738
Score = 60.5 bits (145), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 86/188 (45%), Gaps = 24/188 (12%)
Query: 144 SIMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWV----HDTRFRIMRPNTMNKFGAVLD 199
I+ +P P +Y F +L + C+ L++E+E++ +W D+R N + +
Sbjct: 557 GIVEQPCPDVYWFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYEN-VPTVDIHMK 615
Query: 200 DFGLETMLDKLMNDFIRPISKVFFPEVGGSTLDSHHGFVVEYGMDRDVELGFHVDDSEVT 259
G E +L+ ++ P+++ FP + + FVV Y D L H D S T
Sbjct: 616 QVGYEDQWLQLLRTYVGPMTESLFPGY-HTKARAVMNFVVRYRPDEQPSLRPHHDSSTFT 674
Query: 260 LNVCL---GREFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAVLHRGR--HRHGA 314
LNV L G ++ GG F +R D +++ + G+A+LH GR H H
Sbjct: 675 LNVALNHKGLDYEGGGCRF--LRYDCVISSPRK-----------GWALLHPGRLTHYHEG 721
Query: 315 RATTSGSR 322
TT G+R
Sbjct: 722 LPTTWGTR 729
>sp|P24802|PLOD1_CHICK Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 OS=Gallus gallus
GN=PLOD1 PE=1 SV=1
Length = 730
Score = 60.5 bits (145), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/201 (25%), Positives = 84/201 (41%), Gaps = 25/201 (12%)
Query: 132 KAVRDNTEASFRSIMAE-PIPGIYTFEMLQPRFCEMLLSEVENFERWV----HDTRFRIM 186
K + +N A+ + + E P P +Y F + C+ L+ E+E++ +W D+R +
Sbjct: 536 KYIHENYTAALKGKLVEMPCPDVYWFPIFTDTACDELVEEMEHYGKWSTGDNTDSRIQGG 595
Query: 187 RPNTMNKFGAVLDDFGLETMLDKLMNDFIRPISKVFFPEVGGSTLDSHHGFVVEYGMDRD 246
N + ++ G E K + D+I PI++ +P T FVV Y D
Sbjct: 596 YEN-VPTIDIHMNQIGFEREWYKFLLDYIAPITEKLYPGYYTKT-QFELAFVVRYKPDEQ 653
Query: 247 VELGFHVDDSEVTLNVCLGR---EFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYA 303
L H D S T+N+ L R ++ GG F C + G+
Sbjct: 654 PSLMPHHDASTFTINIALNRVGIDYEGGGCRFLRYNCSIRAPRK-------------GWT 700
Query: 304 VLHRGR--HRHGARATTSGSR 322
++H GR H H TT G+R
Sbjct: 701 LMHPGRLTHYHEGLPTTKGTR 721
>sp|Q9R0B9|PLOD2_MOUSE Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 OS=Mus musculus
GN=Plod2 PE=2 SV=2
Length = 737
Score = 59.3 bits (142), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 54/200 (27%), Positives = 88/200 (44%), Gaps = 30/200 (15%)
Query: 135 RDNTEASFRSIMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWV----HDTRFRIMRPNT 190
RD ++ +I+ +P P ++ F + R C+ L+ E+E++ +W HD+R N
Sbjct: 547 RDYSKIFTENIVEQPCPDVFWFPIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENV 606
Query: 191 MNKFGAVLDDFGLETMLDKLMNDFIRPIS-KVF--FPEVGGSTLDSHHGFVVEYGMDRDV 247
+ GLE + + +FI P++ KVF + G + L+ FVV+Y +R
Sbjct: 607 PTD-DIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGFALLN----FVVKYSPERQR 661
Query: 248 ELGFHVDDSEVTLNVCL---GREFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAV 304
L H D S T+N+ L G +F GG F C S G++
Sbjct: 662 SLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIE-------------SPRKGWSF 708
Query: 305 LHRGR--HRHGARATTSGSR 322
+H GR H H +G+R
Sbjct: 709 MHPGRLTHLHEGLPVKNGTR 728
>sp|Q811A3|PLOD2_RAT Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 OS=Rattus
norvegicus GN=Plod2 PE=2 SV=1
Length = 737
Score = 56.2 bits (134), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 87/200 (43%), Gaps = 30/200 (15%)
Query: 135 RDNTEASFRSIMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWV----HDTRFRIMRPNT 190
RD ++ +I+ +P P ++ F + R C+ L+ E+E++ +W HD+R N
Sbjct: 547 RDYSKIFTENIVEQPCPDVFWFPIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENV 606
Query: 191 MNKFGAVLDDFGLETMLDKLMNDFIRPIS-KVF--FPEVGGSTLDSHHGFVVEYGMDRDV 247
+ LE + + +FI P++ KVF + G + L+ FVV+Y +R
Sbjct: 607 PTD-DIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGFALLN----FVVKYSPERQR 661
Query: 248 ELGFHVDDSEVTLNVCL---GREFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAV 304
L H D S T+N+ L G +F GG F C S G++
Sbjct: 662 SLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIE-------------SPRKGWSF 708
Query: 305 LHRGR--HRHGARATTSGSR 322
+H GR H H +G+R
Sbjct: 709 MHPGRLTHLHEGLPVKNGTR 728
>sp|O00469|PLOD2_HUMAN Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 OS=Homo sapiens
GN=PLOD2 PE=1 SV=2
Length = 737
Score = 55.1 bits (131), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 52/200 (26%), Positives = 87/200 (43%), Gaps = 30/200 (15%)
Query: 135 RDNTEASFRSIMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWV----HDTRFRIMRPNT 190
RD ++ +I+ +P P ++ F + + C+ L+ E+E++ +W HD+R N
Sbjct: 547 RDYSKIFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENV 606
Query: 191 MNKFGAVLDDFGLETMLDKLMNDFIRPIS-KVF--FPEVGGSTLDSHHGFVVEYGMDRDV 247
+ LE + + +FI P++ KVF + G + L+ FVV+Y +R
Sbjct: 607 PTD-DIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGFALLN----FVVKYSPERQR 661
Query: 248 ELGFHVDDSEVTLNVCL---GREFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAV 304
L H D S T+N+ L G +F GG F C S G++
Sbjct: 662 SLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIE-------------SPRKGWSF 708
Query: 305 LHRGR--HRHGARATTSGSR 322
+H GR H H +G+R
Sbjct: 709 MHPGRLTHLHEGLPVKNGTR 728
>sp|Q9R0E2|PLOD1_MOUSE Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 OS=Mus musculus
GN=Plod1 PE=1 SV=1
Length = 728
Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/229 (24%), Positives = 95/229 (41%), Gaps = 31/229 (13%)
Query: 107 ISNYQP--LHRELFTMHA-PSVLVPAFVKAVRDNTEASFRSIMAEPIPGIYTFEMLQPRF 163
+ NYQ LH +L+ + + P ++ + T+A ++ P P +Y F +
Sbjct: 509 LDNYQTTHLHNDLWEVFSNPEDWKEKYIH--ENYTKALAGKLVETPCPDVYWFPIFTEAA 566
Query: 164 CEMLLSEVENFERWV----HDTRFRIMRPNTMNKFGAVLDDFGLETMLDKLMNDFIRPIS 219
C+ L+ E+E++ +W D R + N + ++ E K + ++I P++
Sbjct: 567 CDELVEEMEHYGQWSLGDNKDNRIQGGYEN-VPTIDIHMNQITFEREWHKFLVEYIAPMT 625
Query: 220 KVFFPEVGGSTLDSHH-GFVVEYGMDRDVELGFHVDDSEVTLNVCL---GREFSGGELFF 275
+ +P G T FVV Y D L H D S T+N+ L G ++ GG F
Sbjct: 626 EKLYP--GYYTRAQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRVGEDYEGGGCRF 683
Query: 276 RGVRCDKHVNTETQSEEILDYSHVPGYAVLHRGR--HRHGARATTSGSR 322
C + G+A+LH GR H H TT G+R
Sbjct: 684 LRYNCSVRAPRK-------------GWALLHPGRLTHYHEGLPTTKGTR 719
>sp|Q5UQC3|PLOD_MIMIV Procollagen lysyl hydroxylase and glycosyltransferase
OS=Acanthamoeba polyphaga mimivirus GN=MIMI_L230 PE=1
SV=1
Length = 895
Score = 53.5 bits (127), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/210 (24%), Positives = 91/210 (43%), Gaps = 28/210 (13%)
Query: 126 LVPAFVKAVRDNTEASFRSIMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWVH--DTRF 183
L P F+ +++ + + I + +Y+F + P FC+ ++ ++ W D+ F
Sbjct: 701 LHPEFLSHLQNFKDFDYTEICND----VYSFPLFTPAFCKEVIEVMDKANLWSKGGDSYF 756
Query: 184 --RIMRPNTMNKFGAVLDDFGLETMLDKLMNDFIRPISKVFFPEVGGSTLDSHHGFVVEY 241
RI + L + GL+ ++ +++ P + + T D + FVV+Y
Sbjct: 757 DPRIGGVESYPTQDTQLYEVGLDKQWHYVVFNYVAPFVRHLYNNY--KTKDINLAFVVKY 814
Query: 242 GMDRDVELGFHVDDSEVTLNVCL---GREFSGGELFFRGVRCDKHVNTETQSEEILDYSH 298
M+R EL H D S TLN+ L G+E++ G G +H + +
Sbjct: 815 DMERQSELAPHHDSSTYTLNIALNEYGKEYTAG-----GCEFIRH--------KFIWQGQ 861
Query: 299 VPGYAVLHRGR--HRHGARATTSGSRVNLL 326
GYA +H G+ H A TSG R L+
Sbjct: 862 KVGYATIHAGKLLAYHRALPITSGKRYILV 891
>sp|Q63321|PLOD1_RAT Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 OS=Rattus
norvegicus GN=Plod1 PE=2 SV=1
Length = 728
Score = 53.5 bits (127), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/230 (23%), Positives = 95/230 (41%), Gaps = 33/230 (14%)
Query: 107 ISNYQP--LHRELFTMHA-PSVLVPAFVKAVRDNTEASFRSIMAEPIPGIYTFEMLQPRF 163
+ NYQ LH +L+ + + P ++ + T+A ++ P P +Y F +
Sbjct: 509 LDNYQTTHLHNDLWEVFSNPQDWKEKYIH--ENYTKALAGKLVETPCPDVYWFPIFTEVA 566
Query: 164 CEMLLSEVENFERWV----HDTRFRIMRPNTMNKFGAVLDDFGLETMLDKLMNDFIRPIS 219
C+ L+ E+E++ +W D R + N + ++ E K + ++I P++
Sbjct: 567 CDELVEEMEHYGQWSLGDNKDNRIQGGYEN-VPTIDIHMNQITFEREWHKFLVEYIAPLT 625
Query: 220 KVFFPEVGGSTLDSHH--GFVVEYGMDRDVELGFHVDDSEVTLNVCL---GREFSGGELF 274
+ +P G + FVV Y D L H D S T+N+ L G ++ GG
Sbjct: 626 EKLYP---GYYTKAQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGEDYEGGGCR 682
Query: 275 FRGVRCDKHVNTETQSEEILDYSHVPGYAVLHRGR--HRHGARATTSGSR 322
F C + G+A++H GR H H TT G+R
Sbjct: 683 FLRYNCSVRAPRK-------------GWALMHPGRLTHYHEGLPTTKGTR 719
>sp|Q5R9N3|PLOD1_PONAB Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 OS=Pongo abelii
GN=PLOD1 PE=2 SV=1
Length = 727
Score = 53.1 bits (126), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 82/198 (41%), Gaps = 26/198 (13%)
Query: 135 RDNTEASFRSIMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWV----HDTRFRIMRPNT 190
++ T+A ++ P P +Y F + C+ L+ E+E+F +W D R + N
Sbjct: 537 QNYTKALAGKLVETPCPDVYWFPIFTEVACDELVEEMEHFGQWSLGDNKDNRIQGGYEN- 595
Query: 191 MNKFGAVLDDFGLETMLDKLMNDFIRPISKVFFPEVGGSTLDSHH-GFVVEYGMDRDVEL 249
+ ++ G E K + ++I P+++ +P G T FVV Y D L
Sbjct: 596 VPTIDIHMNQIGFEREWHKFLLEYIAPMTEKLYP--GYYTRAQFDLAFVVRYKPDEQPSL 653
Query: 250 GFHVDDSEVTLNVCLGR---EFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAVLH 306
H D S T+N+ L R ++ GG F C + G+ ++H
Sbjct: 654 MPHHDASTFTINIALNRVGVDYEGGGCRFLRYNCSIRAPRK-------------GWTLMH 700
Query: 307 RGR--HRHGARATTSGSR 322
GR H H TT G+R
Sbjct: 701 PGRLTHYHEGLPTTRGTR 718
>sp|Q02809|PLOD1_HUMAN Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 OS=Homo sapiens
GN=PLOD1 PE=1 SV=2
Length = 727
Score = 52.8 bits (125), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 82/198 (41%), Gaps = 26/198 (13%)
Query: 135 RDNTEASFRSIMAEPIPGIYTFEMLQPRFCEMLLSEVENFERWV----HDTRFRIMRPNT 190
++ T+A ++ P P +Y F + C+ L+ E+E+F +W D R + N
Sbjct: 537 QNYTKALAGKLVETPCPDVYWFPIFTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYEN- 595
Query: 191 MNKFGAVLDDFGLETMLDKLMNDFIRPISKVFFPEVGGSTLDSHH-GFVVEYGMDRDVEL 249
+ ++ G E K + ++I P+++ +P G T FVV Y D L
Sbjct: 596 VPTIDIHMNQIGFEREWHKFLLEYIAPMTEKLYP--GYYTRAQFDLAFVVRYKPDEQPSL 653
Query: 250 GFHVDDSEVTLNVCLGR---EFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGYAVLH 306
H D S T+N+ L R ++ GG F C + G+ ++H
Sbjct: 654 MPHHDASTFTINIALNRVGVDYEGGGCRFLRYNCSIRAPRK-------------GWTLMH 700
Query: 307 RGR--HRHGARATTSGSR 322
GR H H TT G+R
Sbjct: 701 PGRLTHYHEGLPTTRGTR 718
>sp|O77588|PLOD1_BOVIN Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 OS=Bos taurus
GN=PLOD1 PE=2 SV=2
Length = 726
Score = 50.8 bits (120), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 81/202 (40%), Gaps = 27/202 (13%)
Query: 132 KAVRDNTEASFRSIMAE-PIPGIYTFEMLQPRFCEMLLSEVENFERWV----HDTRFRIM 186
K + +N + M E P P +Y F + C+ L+ E+E++ +W D R +
Sbjct: 532 KYIHENYTKALAGKMVEMPCPDVYWFPIFTETACDELVEEMEHYGQWSLGDNKDNRIQGG 591
Query: 187 RPNTMNKFGAVLDDFGLETMLDKLMNDFIRPISKVFFPEVGGSTLDSHH-GFVVEYGMDR 245
N + ++ E K + ++I P+++ +P G T FVV Y D
Sbjct: 592 YEN-VPTIDIHMNQINFEREWHKFLVEYIAPMTEKLYP--GYYTRAQFDLAFVVRYKPDE 648
Query: 246 DVELGFHVDDSEVTLNVCLGR---EFSGGELFFRGVRCDKHVNTETQSEEILDYSHVPGY 302
L H D S T+N+ L R ++ GG F C + G+
Sbjct: 649 QPSLVPHHDASTFTINIALNRVGVDYEGGGCRFLRYNCSIRAPRK-------------GW 695
Query: 303 AVLHRGR--HRHGARATTSGSR 322
++H GR H H TT G+R
Sbjct: 696 TLMHPGRLTHYHEGLPTTKGTR 717
>sp|A3LNC4|U507_PICST UPF0507 protein PICST_55861 OS=Scheffersomyces stipitis (strain
ATCC 58785 / CBS 6054 / NBRC 10063 / NRRL Y-11545)
GN=PICST_55861 PE=3 SV=2
Length = 1194
Score = 33.9 bits (76), Expect = 1.7, Method: Composition-based stats.
Identities = 30/119 (25%), Positives = 55/119 (46%), Gaps = 13/119 (10%)
Query: 109 NYQPLHRELFTMHAPSVLVPAF------VKAVRDNTEASFRSIMAEPIPGIYTFEMLQPR 162
+Y P + ++ L+ +F + +D+ +S RS+ + + TFE L
Sbjct: 151 DYFPKGSKFMILYVEDCLIGSFDPHQHLSRIPQDDETSSQRSVSQQEVQDSITFEKLLRS 210
Query: 163 FCEMLLSEVENFERWVH--DTRFRIMRPNTMNKFGAVLDDFGLETMLD---KLMNDFIR 216
F + + E F R H + +FR++R NT K + +F +MLD K++ D I+
Sbjct: 211 FPLLSKAVSERFYRLFHHNNHQFRVLRINTRKKLEHIRIEF--HSMLDEAYKIIQDSIK 267
>sp|O14207|MMS22_SCHPO Protein mms22 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=mus7 PE=3 SV=1
Length = 1888
Score = 33.9 bits (76), Expect = 1.9, Method: Composition-based stats.
Identities = 30/97 (30%), Positives = 45/97 (46%), Gaps = 12/97 (12%)
Query: 28 SYRLRLNPSSEHKPDSYDDLHQLEFTPLLFSSLERYLPPTMLSMSRDVKFQ-YMRDILMK 86
S+R + SSE D+ D P SL+R PP + + D F Y++ IL+
Sbjct: 1001 SHRKFNDLSSEISEDTPTDF------PDFVKSLDR--PPNLHVTALDTCFVIYLKVILIS 1052
Query: 87 YSRDGERTRVQRHKEYRQRIISNYQPLHRELFTMHAP 123
SR +V + +RI+S QPLH +T +P
Sbjct: 1053 ISR---LRQVDENTNSIKRIVSRLQPLHSRQYTRESP 1086
>sp|A3QCQ6|CHEZ_SHELP Protein phosphatase CheZ OS=Shewanella loihica (strain ATCC
BAA-1088 / PV-4) GN=cheZ PE=3 SV=1
Length = 245
Score = 32.0 bits (71), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 2/50 (4%)
Query: 202 GLETMLDKLMNDFIRPISKVFFPEVGGSTLDSHHGFVVEYGMD-RDVELG 250
G + M D+L+ D PI K F EVG T H +V++ +D R VEL
Sbjct: 24 GQQEMADELIRDIASPIQKELFDEVGRLTRQLHSA-IVDFQVDGRLVELA 72
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.323 0.138 0.416
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 123,243,004
Number of Sequences: 539616
Number of extensions: 5084498
Number of successful extensions: 11379
Number of sequences better than 100.0: 24
Number of HSP's better than 100.0 without gapping: 14
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 11335
Number of HSP's gapped (non-prelim): 24
length of query: 331
length of database: 191,569,459
effective HSP length: 118
effective length of query: 213
effective length of database: 127,894,771
effective search space: 27241586223
effective search space used: 27241586223
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 61 (28.1 bits)