BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy14856
(734 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|328713170|ref|XP_003245008.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
isoform 2 [Acyrthosiphon pisum]
Length = 734
Score = 794 bits (2051), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/732 (52%), Positives = 505/732 (68%), Gaps = 14/732 (1%)
Query: 11 ILSCVVFFISVHCNKVKNIDEDK---FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLH 67
I C +FFI + K++ LV+TVAS + DG+KRFI SA +N L+ K LG+
Sbjct: 9 IAICGLFFILDSASTKKDVSAKSDLNLLVLTVASEKNDGFKRFIDSANLNGLKTKVLGVD 68
Query: 68 QPWLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFD 127
+PW GG+M+S+GGGYK+NL L+ D++ +L+TD+YDV++ + IL F FD
Sbjct: 69 KPWQGGNMNSVGGGYKLNLYLEALEPYKNNDNLAVLLTDAYDVVLLANSSTILNAFTEFD 128
Query: 128 ANIVFGAERLCWPDTSLYDKYPAVG-SGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQ 186
++IV E CWPD L DKYP V +GYR++NSGG IGYA + +L+S + IKN DDQ
Sbjct: 129 SSIVISTENSCWPDRKLADKYPTVDLNGYRFINSGGIIGYASQLYKLLSEKPIKNLGDDQ 188
Query: 187 LYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVI 246
L+ L+LD LR K I LD A LFQN+Y + +DIKL +V L N +NT P +
Sbjct: 189 LHLTNLYLDTDLREKLNIKLDNYAKLFQNVYLAEDDIKLKLVNKSYV-LENINFNTQPAV 247
Query: 247 IHGNGKSKIELNSFGNYLAKSWK-TSGCTRC---NLIKHLDSLKPDQFPSVLISVFIDKP 302
IHGNG SKI NS+ NY+ W SGC C NL L +LK + +P VL+S+ +DKP
Sbjct: 248 IHGNGLSKITFNSYTNYIPNKWSPESGCKTCYDNNL--DLSTLKEENYPKVLLSIIVDKP 305
Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
T F +EFL+KI N++YP ++ + + +YH D +I + N ++ H +
Sbjct: 306 TPFFDEFLDKIENIDYPKSRLCLSITTLVDYHKEHVDKFISKIGDKY-NASFVFHKTAEE 364
Query: 363 SKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNF 422
S AR+ + K DF FY+++++HLDNP LK L+ RN+ +IAP+L RPFKAWSNF
Sbjct: 365 SIHARHFSFSLCTSKLCDFLFYIENEAHLDNPQTLKILIQRNKKIIAPMLTRPFKAWSNF 424
Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM 482
WGAL+ +GFYARSFDYM+I+N ++ GIWNVPYI++CYLMK ++++ + Y +++
Sbjct: 425 WGALSKEGFYARSFDYMDIVNYNK--TGIWNVPYISSCYLMKGTILENKYTRPSYKEDNL 482
Query: 483 DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHP 542
DYDMAF +LR KG+ + ID+ YGHL+DSE+FD NPEVY++ N DW+ RYIHP
Sbjct: 483 DYDMAFSKSLREKGVFMYIDNQYTYGHLIDSESFDITLKNPEVYQIFENRYDWEQRYIHP 542
Query: 543 EYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAV 602
EY ++ PD +PCPDVFWFPI+TE+FC EF++IME +GQWSDGTNND RL TGYEAV
Sbjct: 543 EYMENFNPDKKPAEPCPDVFWFPILTEQFCQEFIEIMENFGQWSDGTNNDTRLRTGYEAV 602
Query: 603 PTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRP 662
PTRDIHM QVGL W EFLR YV P+Q++ FIGY H+P R+ M+FVV+Y P Q SLRP
Sbjct: 603 PTRDIHMNQVGLEKHWLEFLRSYVQPIQKKAFIGYTHDPPRSLMNFVVKYNPLGQASLRP 662
Query: 663 HHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
HHDSSTYTINIALN G DY+GGGC F+RY C VT ++GWMLMHPGRLTHYHEGL+VT
Sbjct: 663 HHDSSTYTINIALNSPGKDYQGGGCHFLRYKCKVTDLKVGWMLMHPGRLTHYHEGLEVTN 722
Query: 723 GTRYIMISFVDP 734
GTRYIMISFVDP
Sbjct: 723 GTRYIMISFVDP 734
>gi|350421678|ref|XP_003492921.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
isoform 1 [Bombus impatiens]
gi|350421681|ref|XP_003492922.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
isoform 2 [Bombus impatiens]
Length = 736
Score = 781 bits (2017), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/734 (51%), Positives = 504/734 (68%), Gaps = 9/734 (1%)
Query: 7 LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTL 64
+ C + + V + + D+D LV T+ASNETDGYKR+++S V + ++ L
Sbjct: 6 IGCCLFWSLFLTYHVFSETLPSTDKDDVLVFTIASNETDGYKRYLRSVNVYGFRDNLRVL 65
Query: 65 GLHQPWLGGD--MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILER 122
GL +PWLGGD +S GGGYKVNLLK L+ D I++ TDSYDVI + +I+ +
Sbjct: 66 GLGEPWLGGDNIKTSAGGGYKVNLLKKALENYGDDDQKIVIFTDSYDVIYLSDLTEIINK 125
Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNE 182
F + DA ++F AE CWPD SL KYP+ G R+LNSGGF+GYA D+ ++++ IKN+
Sbjct: 126 FKSMDARVLFSAEGSCWPDKSLASKYPSAALGKRFLNSGGFVGYASDVYAILTHAPIKNK 185
Query: 183 EDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNT 242
+DDQL+Y L +LDE LR +HKI LD + +FQNLYG++ D++L F+ + L NT Y+T
Sbjct: 186 DDDQLFYTLAYLDEELRERHKIKLDHKSEIFQNLYGAVADVELKFEGGK-ASLLNTVYST 244
Query: 243 NPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFID 300
P+I+HGNG SK+ LNS GNYLA +W GC C LD P+ +P +LI++FI+
Sbjct: 245 EPLILHGNGYSKLSLNSLGNYLAHAWSPEEGCVMCWEETIELDRTTPESYPIILIAIFIE 304
Query: 301 KPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNST 360
+PT FL EFL+ I YP K+ + ++NN EYH + D+++ + + K I+ N
Sbjct: 305 RPTPFLTEFLSAIYQQAYPKSKLHLLIHNNVEYHQDVVDNFMKKVGREYNSSKQISVNDA 364
Query: 361 VNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
+N +ARNLA++ L K YF +DS SHLDN LK L+ + +IAPLLVRP+K WS
Sbjct: 365 MNEVDARNLAMDYCLLKECSGYFSIDSVSHLDNEHTLKLLIEQQRDIIAPLLVRPYKMWS 424
Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN 480
NFWGA+ DGFYARSFDY+ I+N ++ +G+WNVP+I+NCYL+ ++I + Y+
Sbjct: 425 NFWGAIMDDGFYARSFDYIEIVNNER--RGLWNVPFISNCYLINATLISNKETRPSYSEG 482
Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
+D +MAF R + I + + + ++GHLVD +N+D T+P+ Y+++ N LDW+ YI
Sbjct: 483 DLDTEMAFAYANRERNIFMYVSNRVDFGHLVDPDNYDVTVTHPDFYQILNNKLDWEKTYI 542
Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYE 600
H Y ++ P+ Q CPDV+ FPIV E+F E + IME +G+WSDG+N+D RL GYE
Sbjct: 543 HENYSENFNPNKTPVQVCPDVYRFPIVNERFTKELIDIMETFGKWSDGSNHDPRLTGGYE 602
Query: 601 AVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSL 660
VPTRDIHM QV W FL++YV PLQE F GY+H+P RA M+FVVRYRPDEQPSL
Sbjct: 603 NVPTRDIHMNQVKYEPQWLYFLKEYVRPLQELVFAGYYHDPPRALMNFVVRYRPDEQPSL 662
Query: 661 RPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQV 720
+PHHDSSTYTINIALN+ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+V
Sbjct: 663 KPHHDSSTYTINIALNRAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRV 722
Query: 721 TQGTRYIMISFVDP 734
T GTRYIMISFVDP
Sbjct: 723 TSGTRYIMISFVDP 736
>gi|340726794|ref|XP_003401738.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
isoform 1 [Bombus terrestris]
Length = 736
Score = 776 bits (2005), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/735 (50%), Positives = 503/735 (68%), Gaps = 9/735 (1%)
Query: 6 HLNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKT 63
++ C + + V + + D+D LV T+ASNETDGYKR+++S V ++
Sbjct: 5 NIGCCLFWSLFLTYHVFSETLPSTDKDDVLVFTIASNETDGYKRYLRSVNVYGFHDNLRV 64
Query: 64 LGLHQPWLGGD--MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILE 121
LGL +PWLGGD +S GGGYKVNLLK L+ D I++ TDSYDVI + +I+
Sbjct: 65 LGLGEPWLGGDNIKTSAGGGYKVNLLKKALENYGDDDQKIVIFTDSYDVIYLSDLTEIIN 124
Query: 122 RFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKN 181
+F + DA ++F AE CWPD SL KYP G R+LNSGGF+GYA D+ ++++ IKN
Sbjct: 125 KFKSMDARVLFSAEGSCWPDKSLASKYPPATLGKRFLNSGGFVGYASDVYAILTHAPIKN 184
Query: 182 EEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYN 241
++DDQL+Y L +LDE LR +HKI LD + +FQNLYG++ D++L F+ + L NT YN
Sbjct: 185 KDDDQLFYTLAYLDEELRERHKIKLDHKSEIFQNLYGAVADVELKFEGGK-ASLLNTVYN 243
Query: 242 TNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFI 299
T P+I+HGNG SK+ LNS GNYLA++W GC C LD + +P +LI++FI
Sbjct: 244 TEPLILHGNGYSKLSLNSLGNYLARAWSPEEGCVMCWEETIELDRIISQSYPIILIAIFI 303
Query: 300 DKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNS 359
++PT FL EFL+ I YP K+ + ++NN EYH + D+++ + + + K I+ N
Sbjct: 304 ERPTPFLSEFLSAIYQQAYPKSKLHLLIHNNVEYHQDVLDNFMKKVEKEYNSSKQISVND 363
Query: 360 TVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAW 419
++ +ARNLA++ L K YF +DS SHLDN LK LV + +IAPLLVRP+K W
Sbjct: 364 AMSEVDARNLAMDYCLLKECSGYFSIDSVSHLDNEHTLKLLVEQQRDIIAPLLVRPYKMW 423
Query: 420 SNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTL 479
SNFWGA+ DGFYARSFDY+ I+ ++ +G+WNVP+I+NCYL+ ++I + Y+
Sbjct: 424 SNFWGAIMDDGFYARSFDYIEIVKNER--RGLWNVPFISNCYLINATLISNKETRPSYSE 481
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRY 539
+D +MAF R + I + + + ++GHLVD +N+D T+P+ Y+++ N LDW+ Y
Sbjct: 482 GDLDTEMAFAYANRERNIFMYVSNRVDFGHLVDPDNYDVTVTHPDFYQILNNKLDWEKTY 541
Query: 540 IHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGY 599
IH Y ++ P+ Q CPDV+ FPIV E+F E + IME +G+WSDG+N+D RL GY
Sbjct: 542 IHENYSENFNPNKTPVQVCPDVYRFPIVNERFTKELIDIMETFGKWSDGSNHDPRLTGGY 601
Query: 600 EAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPS 659
E VPTRDIHM QV W FL++YV PLQE F GY+H+P RA M+FVVRYRPDEQPS
Sbjct: 602 ENVPTRDIHMNQVKYEPQWLYFLKEYVRPLQELVFAGYYHDPPRALMNFVVRYRPDEQPS 661
Query: 660 LRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQ 719
L+PHHDSSTYTINIALN+ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+
Sbjct: 662 LKPHHDSSTYTINIALNRAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLR 721
Query: 720 VTQGTRYIMISFVDP 734
VT GTRYIMISFVDP
Sbjct: 722 VTSGTRYIMISFVDP 736
>gi|157117949|ref|XP_001653115.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase [Aedes aegypti]
gi|108875910|gb|EAT40135.1| AAEL008099-PA [Aedes aegypti]
Length = 707
Score = 775 bits (2001), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/709 (52%), Positives = 499/709 (70%), Gaps = 11/709 (1%)
Query: 28 NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLL 87
NI + LV TVASN T+GY R+I+SA+ ++V TLGL +PWLGGDM+ LGGGYK+NLL
Sbjct: 8 NISQKPPLVFTVASNATEGYLRYIRSAKYYGIEVSTLGLGKPWLGGDMTRLGGGYKINLL 67
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
++ L DD I+L TDSYDV+ + I+E+F TFDA+I+FG+E CWP+ L K
Sbjct: 68 RDALKPYKADDDRIVLFTDSYDVLFLASMEKIIEKFRTFDASILFGSEGFCWPEEDLKSK 127
Query: 148 YPAV-GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
YP + G G R+LNSG F+GYA + ++ +K+ +DDQLYY +LDE R + KI L
Sbjct: 128 YPVLEGRGTRFLNSGLFMGYASKVYRMLKT-PVKDTDDDQLYYTKAYLDEKQRNELKIKL 186
Query: 207 DTLANLFQNLYGSLEDIKLNFDLD-EFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLA 265
D A LFQNL G E + L D + + L NT+Y+T P I+HGNG SK+ LN + NYLA
Sbjct: 187 DHTAVLFQNLNGVEEQVVLALDENGKEAFLKNTEYSTVPYIVHGNGPSKLVLNGYANYLA 246
Query: 266 KSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
++ C N + L L + P+V++++FI+K T F+EE+ IA +NYP+KK+ +
Sbjct: 247 GAFVDGECKTIN--EDLIQLDEENLPTVMLALFIEKATPFIEEWFEGIAKINYPSKKMDL 304
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
F++NN +YH P DD+I + + +++ + + + R+LAV+ L K D+ F V
Sbjct: 305 FIHNNVDYHKPTIDDFIEKYSSSYRSFRMVDYTDDYEELAGRSLAVDQCLKKQCDYLFVV 364
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D H+D+ D+++ L+ +N+S+I+P+L RP K WSNFWGAL++ GFYARS DYM+I+
Sbjct: 365 DADGHIDDSDIIRKLIVQNKSIISPMLNRPEKVWSNFWGALSSQGFYARSSDYMDIVGRK 424
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
G+ WNVPYI+ YL+K SV+ + Y L D DMA C ++R KGI + + + +
Sbjct: 425 ILGQ--WNVPYISTIYLVKASVLPLVS----YELQGTDPDMALCWHMRAKGIFMHVINAE 478
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
+YGHL+DS+ +D KT+P+ Y+L N DW+ +YI PEY K L D V QPCPDV+WF
Sbjct: 479 QYGHLIDSDYYDTTKTHPDFYQLFNNKHDWEQKYISPEYYKQLEKDYVQIQPCPDVYWFA 538
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I +E FC +I+EA+G+WSDGT+ DKRL+ GYEAVPTRDIHM QVGL VW +FL+ Y
Sbjct: 539 IASELFCDHLKEIVEAFGKWSDGTHTDKRLQGGYEAVPTRDIHMNQVGLEQVWLKFLQLY 598
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
V PLQE+ FIGY+H+P R+ M+FVVRYRPDEQPSLRPHHDSSTYTINIALN+ G+DYEGG
Sbjct: 599 VKPLQEKVFIGYYHDPPRSLMNFVVRYRPDEQPSLRPHHDSSTYTINIALNRAGIDYEGG 658
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GC F+RYNC+VT TR GWMLMHPGRLTH+HEGL+ GTRYIMISFVDP
Sbjct: 659 GCHFLRYNCSVTDTRKGWMLMHPGRLTHFHEGLRTNSGTRYIMISFVDP 707
>gi|328784759|ref|XP_003250492.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
[Apis mellifera]
Length = 785
Score = 773 bits (1995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/783 (49%), Positives = 515/783 (65%), Gaps = 58/783 (7%)
Query: 7 LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTL 64
+ C + + V + +ID+D LV TVA+ ETDGYKR+++S +V + ++ L
Sbjct: 6 VGCYLFWSLFLAYHVVSDTPPSIDKDDVLVFTVATKETDGYKRYLRSIDVYGFRDNLRVL 65
Query: 65 GLHQPWLGGD--MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILER 122
G+ PWLGGD +S+GGGYKVNLLK L+E DD II+ TDSYDVI + +I+++
Sbjct: 66 GMGTPWLGGDHVKTSVGGGYKVNLLKKALEEYQNDDDRIIIFTDSYDVIFLSDLTEIIDK 125
Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNE 182
F +A ++F AE CWPD SL KYP+V G R+LNSGGFIGYA DI +++ IKN+
Sbjct: 126 FKNTNARVLFSAEGACWPDRSLASKYPSVTRGKRFLNSGGFIGYASDIYAILTYAPIKNK 185
Query: 183 EDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNT 242
+DDQL+Y L +LDE LR HKI LD + +FQNLY ++ D+KL F+ + L NT YNT
Sbjct: 186 DDDQLFYTLAYLDEKLREHHKIKLDHKSVIFQNLYLAVGDVKLKFENGK-ASLLNTVYNT 244
Query: 243 NPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFID 300
P+I+HGNG SK LNS GNYLA++W GC C L+ P+ +P +LI+VFI+
Sbjct: 245 EPLILHGNGYSKESLNSLGNYLARAWSPEEGCIMCWEGTIELNKTIPESYPIILIAVFIE 304
Query: 301 KPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNST 360
+PT FL EFL I +YP K+ +FV+NN EYH + + ++ N + K ++ N
Sbjct: 305 RPTPFLNEFLATIYQQDYPKSKLHLFVHNNVEYHQDVINSFMKNVGYEYNTSKLVSVNDA 364
Query: 361 VNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
+N +ARNLA++ L K YF +DS SHLDN LK LV + +IAPLLVRP+K WS
Sbjct: 365 MNEVDARNLAMDYCLLKECSGYFSIDSISHLDNKYTLKLLVEQQREIIAPLLVRPYKMWS 424
Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN 480
NFWGA+ DGFYARSFDYM+I+ ++ +G+WNVP+I+NCYL+ +++I+ + Y+
Sbjct: 425 NFWGAIMDDGFYARSFDYMDIVKNER--RGLWNVPFISNCYLINSTLIRNKETRPSYSEG 482
Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
+D DMAF R + I + + + ++GHLV+ +++D T+P++Y++I N LDW+ RYI
Sbjct: 483 DLDTDMAFAYANRERSIFMYVSNRLDFGHLVNPDSYDITLTHPDLYQIIDNKLDWERRYI 542
Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL----- 595
H Y ++ + QPCPDV+WFPIV E+F E + +ME +G+WSDG+N+D RL
Sbjct: 543 HENYSENFNSNQTPLQPCPDVYWFPIVNERFTKELIDVMENFGKWSDGSNHDPRLTGGYE 602
Query: 596 --------------------------------------------ETGYEAVPTRDIHMKQ 611
E+GYEAVPTRDIHMKQ
Sbjct: 603 NVPTRDIHMNQVKNEPQWLYFLKEYVRPLQELVFTGYYHDDPRIESGYEAVPTRDIHMKQ 662
Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
+GL W FL +YV PLQE FIGY+ P RA M+FVVRYRPDEQPSL+PHHDSSTYTI
Sbjct: 663 IGLHESWLNFLDQYVSPLQEHVFIGYNTSPPRALMNFVVRYRPDEQPSLKPHHDSSTYTI 722
Query: 672 NIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISF 731
NIALN+VGVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT GTRYIMISF
Sbjct: 723 NIALNRVGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRVTSGTRYIMISF 782
Query: 732 VDP 734
VDP
Sbjct: 783 VDP 785
>gi|328713172|ref|XP_001943472.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
isoform 1 [Acyrthosiphon pisum]
Length = 784
Score = 771 bits (1990), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/782 (49%), Positives = 505/782 (64%), Gaps = 64/782 (8%)
Query: 11 ILSCVVFFISVHCNKVKNIDEDK---FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLH 67
I C +FFI + K++ LV+TVAS + DG+KRFI SA +N L+ K LG+
Sbjct: 9 IAICGLFFILDSASTKKDVSAKSDLNLLVLTVASEKNDGFKRFIDSANLNGLKTKVLGVD 68
Query: 68 QPWLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFD 127
+PW GG+M+S+GGGYK+NL L+ D++ +L+TD+YDV++ + IL F FD
Sbjct: 69 KPWQGGNMNSVGGGYKLNLYLEALEPYKNNDNLAVLLTDAYDVVLLANSSTILNAFTEFD 128
Query: 128 ANIVFGAERLCWPDTSLYDKYPAVG-SGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQ 186
++IV E CWPD L DKYP V +GYR++NSGG IGYA + +L+S + IKN DDQ
Sbjct: 129 SSIVISTENSCWPDRKLADKYPTVDLNGYRFINSGGIIGYASQLYKLLSEKPIKNLGDDQ 188
Query: 187 LYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVI 246
L+ L+LD LR K I LD A LFQN+Y + +DIKL +V L N +NT P +
Sbjct: 189 LHLTNLYLDTDLREKLNIKLDNYAKLFQNVYLAEDDIKLKLVNKSYV-LENINFNTQPAV 247
Query: 247 IHGNGKSKIELNSFGNYLAKSWK-TSGCTRC---NLIKHLDSLKPDQFPSVLISVFIDKP 302
IHGNG SKI NS+ NY+ W SGC C NL L +LK + +P VL+S+ +DKP
Sbjct: 248 IHGNGLSKITFNSYTNYIPNKWSPESGCKTCYDNNL--DLSTLKEENYPKVLLSIIVDKP 305
Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
T F +EFL+KI N++YP ++ + + +YH D +I + N ++ H +
Sbjct: 306 TPFFDEFLDKIENIDYPKSRLCLSITTLVDYHKEHVDKFISKIGDKY-NASFVFHKTAEE 364
Query: 363 SKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNF 422
S AR+ + K DF FY+++++HLDNP LK L+ RN+ +IAP+L RPFKAWSNF
Sbjct: 365 SIHARHFSFSLCTSKLCDFLFYIENEAHLDNPQTLKILIQRNKKIIAPMLTRPFKAWSNF 424
Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM 482
WGAL+ +GFYARSFDYM+I+N ++ GIWNVPYI++CYLMK ++++ + Y +++
Sbjct: 425 WGALSKEGFYARSFDYMDIVNYNK--TGIWNVPYISSCYLMKGTILENKYTRPSYKEDNL 482
Query: 483 DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHP 542
DYDMAF +LR KG+ + ID+ YGHL+DSE+FD NPEVY++ N DW+ RYIHP
Sbjct: 483 DYDMAFSKSLREKGVFMYIDNQYTYGHLIDSESFDITLKNPEVYQIFENRYDWEQRYIHP 542
Query: 543 EYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAV 602
EY ++ PD +PCPDVFWFPI+TE+FC EF++IME +GQWSDGTNND RL TGYEAV
Sbjct: 543 EYMENFNPDKKPAEPCPDVFWFPILTEQFCQEFIEIMENFGQWSDGTNNDTRLRTGYEAV 602
Query: 603 PTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHE---------------------- 640
PTRDIHM QVGL W EFLR YV P+Q++ FIGY H+
Sbjct: 603 PTRDIHMNQVGLEKHWLEFLRSYVQPIQKKAFIGYTHDDPRLDNGYEAVPTRDIHMKQVG 662
Query: 641 ----------------------------PVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
P R+ M+FVV+Y P Q SLRPHHDSSTYTIN
Sbjct: 663 LQNVWLEFLRLFVSRLQEHVYLGYYSDGPPRSLMNFVVKYNPLGQASLRPHHDSSTYTIN 722
Query: 673 IALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
IALN G DY+GGGC F+RY C VT ++GWMLMHPGRLTHYHEGL+VT GTRYIMISFV
Sbjct: 723 IALNSPGKDYQGGGCHFLRYKCKVTDLKVGWMLMHPGRLTHYHEGLEVTNGTRYIMISFV 782
Query: 733 DP 734
DP
Sbjct: 783 DP 784
>gi|383851266|ref|XP_003701155.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
[Megachile rotundata]
Length = 784
Score = 770 bits (1988), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/782 (48%), Positives = 506/782 (64%), Gaps = 57/782 (7%)
Query: 7 LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTL 64
L C +L + V + D LV TVA+NETDGYKR+++S +V + ++ L
Sbjct: 6 LGCCLLWSLFLTYHVVSETPPSTDTKDVLVFTVATNETDGYKRYVRSVDVYGFRDNLRVL 65
Query: 65 GLHQPWLGGDM-SSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
G PWLGG + +S GGGYKVNLLK L++ ++ I++ TDSYDVI G+ +I+E+F
Sbjct: 66 GTGSPWLGGKVRTSAGGGYKVNLLKQALEKYKNDEERIVMFTDSYDVIFLSGLTEIIEKF 125
Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
+A I+F AE CWPD SL KYP G R+LNSGGFIGYA DI +++ IKNE
Sbjct: 126 KNTNARILFSAEGSCWPDKSLASKYPPATGGKRFLNSGGFIGYASDIYAILTYAPIKNEN 185
Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
DDQL+Y + +LDE LR +HKI LD + +FQNLYG++ D++L F+ + L NT YNT
Sbjct: 186 DDQLHYTIAYLDEKLREQHKIKLDHKSEIFQNLYGAVADVELKFEGGK-ASLLNTVYNTE 244
Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWKTS-GCTRC-NLIKHLDSLKPDQFPSVLISVFIDK 301
P+I+HGNG SK+ LNS GNYLA +W GC C LD P+ +P +LI++FI++
Sbjct: 245 PLILHGNGYSKLSLNSLGNYLANAWSPEEGCVMCWEGTTELDKTLPETYPVILIAIFIER 304
Query: 302 PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTV 361
PT FLEEFL I YP K+ +F++N EYH + +D+I F +++ K + ++
Sbjct: 305 PTPFLEEFLLTIYEQAYPKSKLDLFIHNTVEYHQDVVNDFIKKFGKEYRSNKQVLPKDSI 364
Query: 362 NSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
N +ARNLA++ L K YF VDS +HLDN LK LV + ++APLLVRP+K WSN
Sbjct: 365 NEADARNLAMDYCLLKKCSGYFSVDSIAHLDNEYTLKLLVEQQRGIVAPLLVRPYKMWSN 424
Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS 481
FWGA+ DGFYARSFDYM I+ ++ +G+WNVP+I+ CYL+ ++I + Y
Sbjct: 425 FWGAIMDDGFYARSFDYMEIVKNER--RGLWNVPFISTCYLINATLISNKETRPSYVEGD 482
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIH 541
+D DMAF R + I + + + ++GHLV+ +++D T+P++Y+++ N LDW+ +YIH
Sbjct: 483 LDTDMAFAYANRERSIFMYVSNRVDFGHLVNPDSYDIALTHPDLYQILDNKLDWEKKYIH 542
Query: 542 PEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL------ 595
Y ++ P+ QPCPDV+WFPIV EKF + IMEA+G+WSDG+NND RL
Sbjct: 543 VNYSENFNPERTPIQPCPDVYWFPIVNEKFTKSLIDIMEAFGKWSDGSNNDPRLTGGYEN 602
Query: 596 -------------------------------------------ETGYEAVPTRDIHMKQV 612
+ GYEAVPTRDIHMKQV
Sbjct: 603 VPTRDIHMNQVNFEPQWLYFLKEYVRPLQEHVFIGYYHDDPRIDGGYEAVPTRDIHMKQV 662
Query: 613 GLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
GL W FL +YV PLQE FIGY+ P R+ M+FVVRYRPDEQPSL+PHHDSSTYT+N
Sbjct: 663 GLHETWLNFLYEYVSPLQEHVFIGYYTSPPRSLMNFVVRYRPDEQPSLKPHHDSSTYTVN 722
Query: 673 IALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
IALN+ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTH+HEGL+VT GTRYIMISFV
Sbjct: 723 IALNKRGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHFHEGLRVTNGTRYIMISFV 782
Query: 733 DP 734
DP
Sbjct: 783 DP 784
>gi|322786337|gb|EFZ12885.1| hypothetical protein SINV_01019 [Solenopsis invicta]
Length = 742
Score = 769 bits (1986), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/739 (51%), Positives = 496/739 (67%), Gaps = 42/739 (5%)
Query: 35 LVITVASNETDGYKRFIQSAEVNKLQVK--TLGLHQPWLGGDMSS-LGGGYKVNLLKNEL 91
LV TVASNETDG++R+++S +V + K LGL +PW GG++ GGGYK+NLL+ L
Sbjct: 7 LVFTVASNETDGFRRYLRSTDVYGFRDKLNILGLGEPWKGGNVVKYAGGGYKINLLRKAL 66
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+ + IIL TDSYDVI G ++ I+ERF +A ++F AE CWPD SL +YP V
Sbjct: 67 KDHQNDETKIILFTDSYDVIFLGDLSSIVERFLATNARVLFSAEAYCWPDKSLAAQYPPV 126
Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
G RYLNSG FIGYA D+ +++ IKNE+DDQL+Y ++L+E LR +HKI LD +
Sbjct: 127 SRGKRYLNSGSFIGYASDVYKILDTAPIKNEDDDQLFYTTVYLNEELRIRHKIKLDHKSE 186
Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT- 270
+FQNL+G++ D++L F +E +L N YNT P+++HGNG SK+ LNS GNYLA++W
Sbjct: 187 IFQNLFGAVADVELRFKGEE-AYLQNIVYNTVPLVLHGNGYSKLVLNSLGNYLARAWTPD 245
Query: 271 SGCTRC-NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
GC C + LD KP+ +P +LI+VFI++PT FLEEF I + YP K+ +FV+N
Sbjct: 246 EGCLACWDRTIELDKTKPETYPVILIAVFIERPTPFLEEFFRDIYHQFYPKTKLHLFVHN 305
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
N YH + D+ + + K I + +V+ +AR LA+E+ L K Y +DS +
Sbjct: 306 NVPYHEDVVGDFFEKVGQEYLSAKQILPSDSVSEVDARRLAMEHCLLKECSGYLSIDSVA 365
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
HL N LK LV + +IAPLL+RPFKAWSNFWGA+ DGFYARSFDYM II ++ +
Sbjct: 366 HLTNEFTLKLLVEQQRGIIAPLLIRPFKAWSNFWGAITDDGFYARSFDYMEIIKNER--R 423
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
G+WNVP+++NCYL+ ++I + + Y +D +MAF R +G+ + + + E+GH
Sbjct: 424 GLWNVPFVSNCYLINATIIASKVTRPTYEHGDLDTEMAFAHGNRQRGLFMYVSNRLEFGH 483
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
LVD + ++ Q T P++Y++I N LDW+ RYIHP Y ++ PD QPCPDV+WFPIV
Sbjct: 484 LVDPDTYNIQLTYPDMYQIIDNKLDWERRYIHPNYSENFNPDKKPIQPCPDVYWFPIVNL 543
Query: 570 KFCHEFVQIMEAYGQWSDGTNN----------------------------------DKRL 595
+F E V I+E YGQWSDGTN D RL
Sbjct: 544 RFTKELVGIVETYGQWSDGTNQDPRLSGGYENVPTRDIHMNQVQYEQQWLYFLKEFDSRL 603
Query: 596 ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPD 655
+TGYEAVPTRDIHM QVGL W +FL+ YV PLQE F GY+ P R+ M+FVVRYRPD
Sbjct: 604 DTGYEAVPTRDIHMTQVGLHDAWLKFLKDYVNPLQEHVFTGYNDYPPRSLMNFVVRYRPD 663
Query: 656 EQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYH 715
EQPSLRPHHDSSTYTINIALNQ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYH
Sbjct: 664 EQPSLRPHHDSSTYTINIALNQAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYH 723
Query: 716 EGLQVTQGTRYIMISFVDP 734
EGL+VT GTRYIMISFVDP
Sbjct: 724 EGLRVTAGTRYIMISFVDP 742
>gi|380020387|ref|XP_003694068.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
[Apis florea]
Length = 785
Score = 769 bits (1985), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/775 (49%), Positives = 510/775 (65%), Gaps = 60/775 (7%)
Query: 17 FFISVHC--NKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTLGLHQPWLG 72
F++ H + +ID+D L+ TVA+ ETDGYKR+++S +V + ++ LG+ PWLG
Sbjct: 14 LFLAYHVVSETLPSIDKDDVLIFTVATKETDGYKRYLRSIDVYGFRDNLRVLGMGTPWLG 73
Query: 73 GD--MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANI 130
GD +S+GGGYKVNLLK L+E D+ II+ TDSYDVI + +I+++F +A +
Sbjct: 74 GDHVKTSVGGGYKVNLLKKALEEYQNDDERIIIFTDSYDVIFLSDLTEIIDKFKNMNARV 133
Query: 131 VFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYA 190
+F AE CWPD SL KYP V G R+LNSGGF+GYA DI +++ IKN++DDQL+Y
Sbjct: 134 LFSAEGACWPDRSLASKYPPVTRGKRFLNSGGFMGYASDIYAILTYAPIKNKDDDQLFYT 193
Query: 191 LLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGN 250
L +LDE LR HKI LD + +FQNLY ++ D+KL F+ + L NT YNT P+I+HGN
Sbjct: 194 LAYLDEKLREHHKIKLDHKSVIFQNLYLAVGDVKLKFEGGK-ASLLNTVYNTEPLILHGN 252
Query: 251 GKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEE 308
G SK LNS GNYLA +W GC C L+ P +P +LI++FI++PT FL E
Sbjct: 253 GYSKESLNSLGNYLANAWSPEEGCIMCWEGTIELNKTIPKSYPIILIAIFIERPTPFLNE 312
Query: 309 FLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARN 368
FL I +YP K+ +FV+NN EYH + + ++ NF + K ++ N +N +ARN
Sbjct: 313 FLTTIYQQDYPKSKLHLFVHNNVEYHQDVVNSFMKNFGYEYNTSKLVSVNDAMNEVDARN 372
Query: 369 LAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNA 428
LA++ L K YF +DS SHLDN LK LV + +IAPLLVRP+K WSNFWGA+
Sbjct: 373 LAMDYCLLKECSGYFSIDSVSHLDNKYTLKLLVEQQREIIAPLLVRPYKMWSNFWGAIMD 432
Query: 429 DGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAF 488
DGFYARSFDYM+I+ ++ +G+WNVP+I+NCYL+ +++I + Y+ +D DMAF
Sbjct: 433 DGFYARSFDYMDIVKNER--RGLWNVPFISNCYLINSTLISNKETRPSYSEGDLDTDMAF 490
Query: 489 CTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSL 548
R + I + + + ++GHLV+ +++D T+P++Y++I N LDW+ RYIH Y ++
Sbjct: 491 AYANRERSIFMYVSNRLDFGHLVNPDSYDITMTHPDLYQIIDNKLDWERRYIHENYSENF 550
Query: 549 LPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL------------- 595
+ QPCPDV+WFPIV E+F E + +ME +G+WSDG+N+D RL
Sbjct: 551 NSNQTPLQPCPDVYWFPIVNERFTKELIDVMENFGKWSDGSNHDPRLTGGYENVPTRDIH 610
Query: 596 ------------------------------------ETGYEAVPTRDIHMKQVGLAGVWA 619
E GYEAVPTRDIHMKQVGL W
Sbjct: 611 MNQIKNEPQWLYFLKEYVRPLQELVFTGYYHDDPRIEGGYEAVPTRDIHMKQVGLHESWL 670
Query: 620 EFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
FL +YV PLQE+ FIGY P RA M+FVVRYRPDEQPSL+PHHDSSTYTINIALN+VG
Sbjct: 671 NFLDQYVSPLQEQVFIGYSTSPPRALMNFVVRYRPDEQPSLKPHHDSSTYTINIALNRVG 730
Query: 680 VDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
VDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT GTRYIMISFVDP
Sbjct: 731 VDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRVTSGTRYIMISFVDP 785
>gi|307183477|gb|EFN70276.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Camponotus
floridanus]
Length = 787
Score = 768 bits (1983), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/782 (47%), Positives = 511/782 (65%), Gaps = 57/782 (7%)
Query: 7 LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKL--QVKTL 64
+ C + C VF ++ D + LV TVASNETDG++R+++S EV K +++ L
Sbjct: 9 IGCYLAWCCVFLTYHVVSEAPAADANDVLVFTVASNETDGFQRYLRSVEVYKFRDKLRIL 68
Query: 65 GLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
GL +PW GG+ M+ GGGYK+NLLK L++ + I+L TDSYDVI GG++ I+ERF
Sbjct: 69 GLGEPWRGGNVMTYAGGGYKINLLKKALEDYQNDEKKIVLFTDSYDVIFLGGLSAIVERF 128
Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
DA ++F AE CWPD SL YP V G RYLNSGGFIGYA D+ E++ IK+E+
Sbjct: 129 LDTDARVLFSAEVYCWPDRSLAIHYPTVSGGKRYLNSGGFIGYASDVYEILDKADIKDED 188
Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
DDQL+Y ++L + LRT+HKI LD + +FQNL+G++ D++L F +E ++ N YNT
Sbjct: 189 DDQLFYTTVYLQDELRTRHKIKLDHKSEIFQNLFGAVADVELRFKGEE-AYVQNIVYNTV 247
Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWKTS-GCTRC-NLIKHLDSLKPDQFPSVLISVFIDK 301
P+I+HGNG SK+ LNS GNYLA++W + GC C + LD KP+ +P +LI++FI++
Sbjct: 248 PLILHGNGFSKLVLNSLGNYLARAWTANEGCLACWDRTIELDKTKPETYPIILIAIFIEQ 307
Query: 302 PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTV 361
PT FLEEF I YP ++ +F++NN YH + ++ + + K I + +
Sbjct: 308 PTPFLEEFFQAIHRQAYPKSRLHLFIHNNVPYHESVIYNFFEKTSREYLSGKQILPSDEI 367
Query: 362 NSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
+ +AR LA+E+ L K Y VD+ +HLDN LK LV + ++APLL+RP+KAWSN
Sbjct: 368 SEVDARKLALEHCLLKECSGYLSVDAVAHLDNEHTLKLLVEQQRGIVAPLLIRPYKAWSN 427
Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS 481
FWGA+ DGFYARSFDYM II ++ +G+WNVP+++NCYL+ ++I + Y
Sbjct: 428 FWGAITDDGFYARSFDYMEIIKNER--RGLWNVPFVSNCYLINATIIANKATRPSYEDAE 485
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIH 541
+D +MAF R +G+ + +++ ++GHLV+ +++D + T P++Y+++ N LDW+ RYIH
Sbjct: 486 LDTEMAFARTNRQRGLFMYLNNRLDFGHLVNPDSYDIRLTYPDMYQIMDNKLDWEKRYIH 545
Query: 542 PEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN----------- 590
P Y ++ PD QPCPDV+WFPI T +F E + I+E +GQWSDG+N
Sbjct: 546 PNYSENFNPDKKPIQPCPDVYWFPIATLRFTSELIGIVETFGQWSDGSNHDPRLTGGYEN 605
Query: 591 --------------------------------------NDKRLETGYEAVPTRDIHMKQV 612
+D RLE+GYEAVPTRDIHM QV
Sbjct: 606 VPTRDIHMNQIQYEQQWLYFLKEYVRPLQERVFTGYYHDDSRLESGYEAVPTRDIHMNQV 665
Query: 613 GLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
GL W +FL+ Y+ PLQ+ F GY P R+ M+FVVRYRPDEQP LRPHHDSSTYTIN
Sbjct: 666 GLEDAWLKFLKDYISPLQQHVFTGYEDYPPRSLMNFVVRYRPDEQPFLRPHHDSSTYTIN 725
Query: 673 IALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
IALNQ GVDYEGGGC+FIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT GTRYIMISFV
Sbjct: 726 IALNQAGVDYEGGGCKFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRVTAGTRYIMISFV 785
Query: 733 DP 734
DP
Sbjct: 786 DP 787
>gi|332027746|gb|EGI67813.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Acromyrmex
echinatior]
Length = 786
Score = 768 bits (1982), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/782 (48%), Positives = 506/782 (64%), Gaps = 58/782 (7%)
Query: 7 LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVK--TL 64
+ C ++ C +F ++ H D + LV TVASNETDG+KR+++S E++ K L
Sbjct: 9 IGCWLMWCYIF-LTYHVVSETPADVNDVLVFTVASNETDGFKRYLRSTEIHGFHDKLNVL 67
Query: 65 GLHQPWLGGDMSS-LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
GL +PW GG++ GGGYK+NLLK L++ + IIL TDSYDVI G ++ I+ERF
Sbjct: 68 GLGEPWKGGNVVRYAGGGYKINLLKKALEDYQNDEKKIILFTDSYDVIFLGDLSIIVERF 127
Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
DA ++F AE CWPD SL +YP V G RYLNSGGFIGYA D+ +++ IK+E+
Sbjct: 128 LDTDARVLFSAEAYCWPDKSLATQYPPVSRGKRYLNSGGFIGYASDVYKILETAVIKDED 187
Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
DDQL+Y ++L + LR ++KI LD + +FQNLYG++ D++L F +E +L N YNT
Sbjct: 188 DDQLFYTTVYLQDELRLRYKIKLDHKSEIFQNLYGAVADVELRFKGEE-AYLQNIVYNTV 246
Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWKT-SGCTRC-NLIKHLDSLKPDQFPSVLISVFIDK 301
P+++HGNG SK+ LNS GNYLA++W GC C + LD +K +P +LI++FI++
Sbjct: 247 PLVLHGNGPSKLVLNSLGNYLARAWTPDEGCLACWDQTIELDKIKSKTYPVILIAIFIER 306
Query: 302 PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTV 361
PT FLEEF I YP K+ +F++NN YH + DD+ + + K I + V
Sbjct: 307 PTPFLEEFFRAIYRQYYPKSKLHLFIHNNVPYHEDVVDDFFEKIGQEYLSAKRILPSDDV 366
Query: 362 NSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
+ +AR LA+E+ L K Y +D+ +HLDN LK LV + ++APLL+RPFKAWSN
Sbjct: 367 SEVDARKLAMEHCLLKECSGYLSIDAVAHLDNEHTLKLLVEQQRGIVAPLLIRPFKAWSN 426
Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS 481
FWGA+ DGFYARSFDYM II ++ +G+WNVP+++NCYL+ ++I + Y
Sbjct: 427 FWGAITDDGFYARSFDYMEIIKNER--RGLWNVPFVSNCYLINATIIANKATRPTYEAGD 484
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIH 541
+D +MAF R +G+ + +++ E+GHLVD + +D + T P++Y++I N LDW+ RYIH
Sbjct: 485 LDTEMAFAHGNRQRGLFMYVNNRLEFGHLVDPDTYDIRLTYPDIYQIIENKLDWEKRYIH 544
Query: 542 PEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN----------- 590
Y ++ PD QPCPDV+WFPIV +F E V I+E +GQWSDGTN
Sbjct: 545 SNYSENFNPDNKPIQPCPDVYWFPIVNLRFTKELVGIVETFGQWSDGTNHDPRLSGGYEN 604
Query: 591 --------------------------------------NDKRLETGYEAVPTRDIHMKQV 612
+D RLE+GYEAVPTRDIHM QV
Sbjct: 605 VPTRDIHMNQVQYDQQWLYFLKEYVRPLQEFIFTGYFHDDPRLESGYEAVPTRDIHMNQV 664
Query: 613 GLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
GL W +FL+ YV PLQE F GY+ P R+ M+FVVRYRPDEQ SLRPHHDSSTYTIN
Sbjct: 665 GLQDAWLKFLKDYVNPLQEHVFTGYNDYPPRSLMNFVVRYRPDEQSSLRPHHDSSTYTIN 724
Query: 673 IALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
IALNQ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISFV
Sbjct: 725 IALNQAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLLVTAGTRYIMISFV 784
Query: 733 DP 734
DP
Sbjct: 785 DP 786
>gi|350421684|ref|XP_003492923.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
isoform 3 [Bombus impatiens]
Length = 785
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/783 (48%), Positives = 507/783 (64%), Gaps = 58/783 (7%)
Query: 7 LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTL 64
+ C + + V + + D+D LV T+ASNETDGYKR+++S V + ++ L
Sbjct: 6 IGCCLFWSLFLTYHVFSETLPSTDKDDVLVFTIASNETDGYKRYLRSVNVYGFRDNLRVL 65
Query: 65 GLHQPWLGGD--MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILER 122
GL +PWLGGD +S GGGYKVNLLK L+ D I++ TDSYDVI + +I+ +
Sbjct: 66 GLGEPWLGGDNIKTSAGGGYKVNLLKKALENYGDDDQKIVIFTDSYDVIYLSDLTEIINK 125
Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNE 182
F + DA ++F AE CWPD SL KYP+ G R+LNSGGF+GYA D+ ++++ IKN+
Sbjct: 126 FKSMDARVLFSAEGSCWPDKSLASKYPSAALGKRFLNSGGFVGYASDVYAILTHAPIKNK 185
Query: 183 EDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNT 242
+DDQL+Y L +LDE LR +HKI LD + +FQNLYG++ D++L F+ + L NT Y+T
Sbjct: 186 DDDQLFYTLAYLDEELRERHKIKLDHKSEIFQNLYGAVADVELKFEGGK-ASLLNTVYST 244
Query: 243 NPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFID 300
P+I+HGNG SK+ LNS GNYLA +W GC C LD P+ +P +LI++FI+
Sbjct: 245 EPLILHGNGYSKLSLNSLGNYLAHAWSPEEGCVMCWEETIELDRTTPESYPIILIAIFIE 304
Query: 301 KPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNST 360
+PT FL EFL+ I YP K+ + ++NN EYH + D+++ + + K I+ N
Sbjct: 305 RPTPFLTEFLSAIYQQAYPKSKLHLLIHNNVEYHQDVVDNFMKKVGREYNSSKQISVNDA 364
Query: 361 VNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
+N +ARNLA++ L K YF +DS SHLDN LK L+ + +IAPLLVRP+K WS
Sbjct: 365 MNEVDARNLAMDYCLLKECSGYFSIDSVSHLDNEHTLKLLIEQQRDIIAPLLVRPYKMWS 424
Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN 480
NFWGA+ DGFYARSFDY+ I+N ++ +G+WNVP+I+NCYL+ ++I + Y+
Sbjct: 425 NFWGAIMDDGFYARSFDYIEIVNNER--RGLWNVPFISNCYLINATLISNKETRPSYSEG 482
Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
+D +MAF R + I + + + ++GHLVD +N+D T+P+ Y+++ N LDW+ YI
Sbjct: 483 DLDTEMAFAYANRERNIFMYVSNRVDFGHLVDPDNYDVTVTHPDFYQILNNKLDWEKTYI 542
Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL----- 595
H Y ++ P+ Q CPDV+ FPIV E+F E + IME +G+WSDG+N+D RL
Sbjct: 543 HENYSENFNPNKTPVQVCPDVYRFPIVNERFTKELIDIMETFGKWSDGSNHDPRLTGGYE 602
Query: 596 --------------------------------------------ETGYEAVPTRDIHMKQ 611
E GYEAVPTRDIHMKQ
Sbjct: 603 NVPTRDIHMNQVKYEPQWLYFLKEYVRPLQELVFAGYYHDDPRIEGGYEAVPTRDIHMKQ 662
Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
+GL W FL +YV PLQE FIGY+ P RA M+FVVRYRPDEQPSL+PHHDSSTYTI
Sbjct: 663 IGLHESWLNFLYEYVSPLQEHVFIGYNTNPPRALMNFVVRYRPDEQPSLKPHHDSSTYTI 722
Query: 672 NIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISF 731
NIALN+ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT GTRYIMISF
Sbjct: 723 NIALNRAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRVTSGTRYIMISF 782
Query: 732 VDP 734
VDP
Sbjct: 783 VDP 785
>gi|170052410|ref|XP_001862209.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Culex
quinquefasciatus]
gi|167873364|gb|EDS36747.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Culex
quinquefasciatus]
Length = 723
Score = 763 bits (1969), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/729 (50%), Positives = 505/729 (69%), Gaps = 22/729 (3%)
Query: 10 LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
L+L+CV S C + + LV TVASNET+ Y R+I+SA+ ++V TLGL +P
Sbjct: 13 LLLACV----SHLCVGEEKLPGKAPLVFTVASNETEAYLRYIRSAKRYGIEVTTLGLGKP 68
Query: 70 WLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDAN 129
W GGDM LGGGYK+NLL++ L DD I+L TDSYDV+ + I+E+F TF+A+
Sbjct: 69 WQGGDMKKLGGGYKINLLRSALKPYKSDDDRIVLFTDSYDVLFLASLEKIVEKFETFEAS 128
Query: 130 IVFGAERLCWPDTSLYDKYPAV-GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
I+FG+E CWPD L +KYP + G G R+LNSG F+GYA + +++ N +K+ +DDQLY
Sbjct: 129 ILFGSEGFCWPDPELKNKYPVLEGRGTRFLNSGLFMGYASKVYQMLKN-PVKDTDDDQLY 187
Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLD-EFVHLTNTKYNTNPVII 247
Y +++D+ LR + + LD A LFQN+ G E I L D D + L NT+Y+TNP+I+
Sbjct: 188 YTKIYIDQQLREELNMKLDHTAALFQNMNGVEEQITLALDPDSKEAFLKNTEYSTNPLIV 247
Query: 248 HGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLE 307
HGNG SKI LN + NYLA ++ C ++L L + P V++++F++K T F+E
Sbjct: 248 HGNGPSKITLNGYANYLAGAFVDGECQTVK--ENLIELDEENLPKVMVALFVEKATPFIE 305
Query: 308 EFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEAR 367
E+ IA LNYP +K+ +F++NN ++H P D +I + +++ + + ++ R
Sbjct: 306 EWFENIAKLNYPKQKMDVFIHNNVDHHKPTIDQFIKQYTEEYRSFRMVDYSEDFEELAGR 365
Query: 368 NLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN 427
+LAV L K D+ F VD+D H+D+PD L+ L+ N +I+P+L RP K WSNFWGAL+
Sbjct: 366 SLAVNQCLKKKCDYLFVVDADGHIDDPDTLRRLITLNRDIISPVLTRPEKVWSNFWGALS 425
Query: 428 ADGFYARSFDYMNIINGDQGGK--GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD 485
+ GFYARS DYM+I+ G K G+WNVP+I+ YL+K+S ++ I D
Sbjct: 426 SQGFYARSSDYMDIV----GRKILGLWNVPFISTVYLVKSSSSVTSSPTPIP-------D 474
Query: 486 MAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQ 545
MA C ++R KGI + + +T+++GHL+DS+ +D +T+P+ Y+L N DW+ +YI EY
Sbjct: 475 MALCWHMRAKGIFMHVVNTEQFGHLIDSDYYDANRTHPDFYQLFNNKYDWERKYISAEYH 534
Query: 546 KSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTR 605
K L D V QPCPDV+WF I TEKFC +I+EA+G+WSDGT++DKRL+ GYEAVPTR
Sbjct: 535 KQLEKDFVPVQPCPDVYWFSIGTEKFCDHLREIVEAFGKWSDGTHSDKRLQGGYEAVPTR 594
Query: 606 DIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHD 665
DIHM QVGL VW +FL+ YV PLQE+ FIGY H+P R+ M+FVVRYRPDEQPSLRPHHD
Sbjct: 595 DIHMNQVGLEQVWLKFLQLYVKPLQEKVFIGYFHDPPRSLMNFVVRYRPDEQPSLRPHHD 654
Query: 666 SSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
SSTYTIN+ALN GVDYEGGGC+F+RYNC+VT TR GWMLMHPGRLTH+HEGL T+GTR
Sbjct: 655 SSTYTINVALNTAGVDYEGGGCKFLRYNCSVTDTRKGWMLMHPGRLTHFHEGLLTTKGTR 714
Query: 726 YIMISFVDP 734
YIMISFVDP
Sbjct: 715 YIMISFVDP 723
>gi|340726796|ref|XP_003401739.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
isoform 2 [Bombus terrestris]
Length = 785
Score = 762 bits (1967), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/783 (48%), Positives = 505/783 (64%), Gaps = 58/783 (7%)
Query: 7 LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTL 64
+ C + + V + + D+D LV T+ASNETDGYKR+++S V ++ L
Sbjct: 6 IGCCLFWSLFLTYHVFSETLPSTDKDDVLVFTIASNETDGYKRYLRSVNVYGFHDNLRVL 65
Query: 65 GLHQPWLGGD--MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILER 122
GL +PWLGGD +S GGGYKVNLLK L+ D I++ TDSYDVI + +I+ +
Sbjct: 66 GLGEPWLGGDNIKTSAGGGYKVNLLKKALENYGDDDQKIVIFTDSYDVIYLSDLTEIINK 125
Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNE 182
F + DA ++F AE CWPD SL KYP G R+LNSGGF+GYA D+ ++++ IKN+
Sbjct: 126 FKSMDARVLFSAEGSCWPDKSLASKYPPATLGKRFLNSGGFVGYASDVYAILTHAPIKNK 185
Query: 183 EDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNT 242
+DDQL+Y L +LDE LR +HKI LD + +FQNLYG++ D++L F+ + L NT YNT
Sbjct: 186 DDDQLFYTLAYLDEELRERHKIKLDHKSEIFQNLYGAVADVELKFEGGK-ASLLNTVYNT 244
Query: 243 NPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFID 300
P+I+HGNG SK+ LNS GNYLA++W GC C LD + +P +LI++FI+
Sbjct: 245 EPLILHGNGYSKLSLNSLGNYLARAWSPEEGCVMCWEETIELDRIISQSYPIILIAIFIE 304
Query: 301 KPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNST 360
+PT FL EFL+ I YP K+ + ++NN EYH + D+++ + + + K I+ N
Sbjct: 305 RPTPFLSEFLSAIYQQAYPKSKLHLLIHNNVEYHQDVLDNFMKKVEKEYNSSKQISVNDA 364
Query: 361 VNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
++ +ARNLA++ L K YF +DS SHLDN LK LV + +IAPLLVRP+K WS
Sbjct: 365 MSEVDARNLAMDYCLLKECSGYFSIDSVSHLDNEHTLKLLVEQQRDIIAPLLVRPYKMWS 424
Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN 480
NFWGA+ DGFYARSFDY+ I+ ++ +G+WNVP+I+NCYL+ ++I + Y+
Sbjct: 425 NFWGAIMDDGFYARSFDYIEIVKNER--RGLWNVPFISNCYLINATLISNKETRPSYSEG 482
Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
+D +MAF R + I + + + ++GHLVD +N+D T+P+ Y+++ N LDW+ YI
Sbjct: 483 DLDTEMAFAYANRERNIFMYVSNRVDFGHLVDPDNYDVTVTHPDFYQILNNKLDWEKTYI 542
Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL----- 595
H Y ++ P+ Q CPDV+ FPIV E+F E + IME +G+WSDG+N+D RL
Sbjct: 543 HENYSENFNPNKTPVQVCPDVYRFPIVNERFTKELIDIMETFGKWSDGSNHDPRLTGGYE 602
Query: 596 --------------------------------------------ETGYEAVPTRDIHMKQ 611
E GYEAVPTRDIHMKQ
Sbjct: 603 NVPTRDIHMNQVKYEPQWLYFLKEYVRPLQELVFAGYYHDDPRIEGGYEAVPTRDIHMKQ 662
Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
+GL W FL +YV PLQE FIGY+ P RA M+FVVRYRPDEQPSL+PHHDSSTYTI
Sbjct: 663 IGLHESWLNFLYEYVSPLQEHVFIGYNTNPPRALMNFVVRYRPDEQPSLKPHHDSSTYTI 722
Query: 672 NIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISF 731
NIALN+ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT GTRYIMISF
Sbjct: 723 NIALNRAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRVTSGTRYIMISF 782
Query: 732 VDP 734
VDP
Sbjct: 783 VDP 785
>gi|345484574|ref|XP_001601697.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
[Nasonia vitripennis]
Length = 775
Score = 760 bits (1963), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/777 (49%), Positives = 513/777 (66%), Gaps = 61/777 (7%)
Query: 13 SCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKL--QVKTLGLHQPW 70
+CV+ + + C V + D LV TVA+NET+G++R+++S EVN V+ LGL Q W
Sbjct: 5 TCVLLAV-LAC--VAAEETDDALVFTVATNETEGFRRYLRSTEVNGFGDNVRVLGLGQAW 61
Query: 71 LGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFD-A 128
GG++ GGG KVNLLK ++E+ D I+L TDSYDVI + I +F +D A
Sbjct: 62 RGGEIKLYAGGGQKVNLLKEAIEEIKDDPDQIVLFTDSYDVIFLSSLEKISRKFKEWDDA 121
Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
++F AE CWP SL +YP V G R+LNSGGFIGYA DI ++++ IK+++DDQL+
Sbjct: 122 RVIFSAEEYCWPLKSLASEYPQVKRGKRFLNSGGFIGYAPDIYAILTSAEIKDDDDDQLF 181
Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIH 248
Y ++L+ LR KHKI LD + +FQNL G++ DI+L F +E ++ NT YNT P+IIH
Sbjct: 182 YTKVYLNSELREKHKIKLDHKSEIFQNLNGAIHDIELRFKGNE-AYVQNTAYNTVPLIIH 240
Query: 249 GNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFIDKPTAFL 306
GNG SK+ LNS GNY+A++W GC C + LD + +P +LI++FI+KPT FL
Sbjct: 241 GNGFSKLLLNSLGNYVAQAWSPEEGCLSCWDRTIELDVKNAEAYPKILIAIFIEKPTPFL 300
Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
EEFLNKI + YP +K+ F+ NN YH L D+++ +++VK I + A
Sbjct: 301 EEFLNKIKDQRYPKEKLHFFIRNNVPYHEKLIDEFVEKHGDEYQSVKQIKPEDEIAEAAA 360
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
RNLA+ + L YF +DS+SHLDN + L+ LV + ++APLLVRPFKAWSNFWGA+
Sbjct: 361 RNLAMNHCLSVKCSGYFSIDSESHLDNVNTLELLVEQQRGIVAPLLVRPFKAWSNFWGAI 420
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDM 486
DGFYARS DYM+II+ ++ +G+WNVP++++CYL+ ++++ + Y +D +M
Sbjct: 421 TDDGFYARSSDYMDIIHHER--RGLWNVPFVSSCYLINATLLENEATRPSYAEADLDAEM 478
Query: 487 AFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
AF R + I + +++ ++GHLV+ E F+ TNP++Y++ N LDW+ RYIH Y
Sbjct: 479 AFAYANRRRDIFMYVNNRLDFGHLVNPETFNISLTNPDMYQMFDNKLDWEKRYIHVNYSD 538
Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN---------------- 590
+ LP+ QPCPDV+WFPIVTE+F +FV+IMEAYG+WSDG+N
Sbjct: 539 NFLPENKPVQPCPDVYWFPIVTERFNKDFVEIMEAYGKWSDGSNYDPRLSNGYENVPTRD 598
Query: 591 ---------------------------------NDKRLETGYEAVPTRDIHMKQVGLAGV 617
+D RLE GYEAVPTRDIHM QVGL
Sbjct: 599 IHMNQVGLESQWLFFLRNYVKPLQELVFLGYFHDDPRLENGYEAVPTRDIHMTQVGLDES 658
Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
W EFLR YV PLQ+ F GY+ P R+ M+FVVRYRPDEQPSL+PHHDSSTYTINIALN+
Sbjct: 659 WLEFLRVYVNPLQQAVFTGYYDYPPRSLMNFVVRYRPDEQPSLKPHHDSSTYTINIALNK 718
Query: 678 VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
VGVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT+GTRYIMISFVDP
Sbjct: 719 VGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLKVTKGTRYIMISFVDP 775
>gi|321459829|gb|EFX70878.1| hypothetical protein DAPPUDRAFT_217067 [Daphnia pulex]
Length = 737
Score = 756 bits (1951), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/705 (50%), Positives = 486/705 (68%), Gaps = 8/705 (1%)
Query: 33 KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNEL 91
KFL++TVA+ ET GYKR+ +S +N L VK LGL + W GGDM+ S+GGG KV +L+ E+
Sbjct: 38 KFLILTVATEETSGYKRYQRSVRINGLPVKVLGLGEEWKGGDMANSVGGGQKVLMLRKEV 97
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+ + II+ TDSYDV+ + I+E+F F+A ++F AE CWPD +L KYP V
Sbjct: 98 ELHKDDPEKIIMFTDSYDVLFNANEEKIVEQFLQFNARVLFSAEGFCWPDPTLASKYPEV 157
Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
G R+LNSG F+GYA ++ +++++ I N++DDQL+Y +FLDE R + I LD +
Sbjct: 158 ERGKRFLNSGLFMGYAPELHQILNSGEIANDDDDQLFYTKVFLDEKKRQELNIKLDHRSE 217
Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTS 271
+FQNL G++ D++L F HL NT YNT P++IH NG +K+ LN+ GNYL KSW +
Sbjct: 218 IFQNLNGAVSDVELRFIES---HLQNTVYNTVPLVIHANGPTKLFLNTLGNYLPKSWNSE 274
Query: 272 -GCTRC-NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
GC C + L+ KP FP V++ +FI+ PT F EEFL+K L+YP KI ++++N
Sbjct: 275 EGCLNCWEDMNSLEKKKPKDFPKVVVGMFIENPTPFFEEFLHKFLALSYPKDKIHLYIHN 334
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
YH ++ + + +VK + H V ARN +E L K ++YF VD+ +
Sbjct: 335 GVSYHGKQITGFVESHGAEYASVKLVNHEENVKEWHARNTGIEECLKKKCEYYFNVDALA 394
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
H+DNP LK L+ +N ++AP+++RP++AWSNFWG+L DGFYARS DYM I+ G++ +
Sbjct: 395 HIDNPHTLKLLIEQNRPVVAPMMIRPYQAWSNFWGSLTTDGFYARSIDYMEIVKGER--R 452
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
G+WNVP++T+ YL++ +I K Y N +D DMAFCTN+RN ++L + + ++GH
Sbjct: 453 GLWNVPFVTSVYLVRGDIIHNPKTKPSYIHNLLDADMAFCTNMRNNDVYLFVTNRLDWGH 512
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
L+ +NF+ N E+YE+ N DW+ RY+H Y ++L + + PCPDV+WFP+ TE
Sbjct: 513 LITVDNFETTHLNNELYEIQNNRWDWEKRYLHVNYSQNLNMELNVSMPCPDVYWFPMTTE 572
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
+F E V ME +GQWSDGTN D RLE GYE VPTRDIHM+Q+G+ W FLR YV PL
Sbjct: 573 RFADELVGEMENFGQWSDGTNTDPRLEGGYENVPTRDIHMRQIGMDRHWLAFLRDYVRPL 632
Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
QER F+GY H P R+ M+FVVRYRPDEQP L+PHHDSSTYTIN+ALN+ +D+EGGGCRF
Sbjct: 633 QERVFVGYQHYPPRSVMNFVVRYRPDEQPFLKPHHDSSTYTINLALNRPQIDFEGGGCRF 692
Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+RYNC+V TR GWMLMHPGRLTHYHEGL T+GTRYIMISFVDP
Sbjct: 693 VRYNCSVLDTRKGWMLMHPGRLTHYHEGLYTTKGTRYIMISFVDP 737
>gi|307195418|gb|EFN77304.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Harpegnathos
saltator]
Length = 793
Score = 754 bits (1946), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/789 (47%), Positives = 516/789 (65%), Gaps = 65/789 (8%)
Query: 7 LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTL 64
+ C ++ C VF ++ H D + LV TVAS+ETDG++R+++SAE+ + +K L
Sbjct: 9 IGCWLVWCYVF-LTYHVVSEAPADTNDVLVATVASDETDGFRRYLRSAEIYGFRDNLKIL 67
Query: 65 GLHQPWLGGDMSS-LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
GL + W GG++SS GGGYKVNLL+ L++ ++ I+L TDSYDVI GG++ I+ERF
Sbjct: 68 GLGESWKGGNVSSGPGGGYKVNLLRKALEDYRDDENKIVLFTDSYDVIFLGGLSAIVERF 127
Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
A I+F AE CWPD SL +YPAV G RYLNSG FIGYA D+ ++ SI++E+
Sbjct: 128 LDTGARILFSAEGYCWPDKSLASQYPAVSRGKRYLNSGSFIGYATDLLAILDTVSIEDED 187
Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
DDQL Y ++L++ LR +H+I LD +++FQNL+G++ D++L F +E +L N YNT
Sbjct: 188 DDQLLYTNVYLNDELRARHRIKLDHKSDIFQNLFGAVADVELRFKGEE-AYLQNIVYNTV 246
Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWKT-SGCTRC-NLIKHLDSLKPDQFPSV-LISVFID 300
P+++HGNG SK+ LNS GNY+A++W GC C + LD KP+ +P++ LI++FI+
Sbjct: 247 PLVLHGNGHSKLVLNSLGNYVARAWTPDEGCLACWDQTVELDKTKPEMYPAIILIALFIE 306
Query: 301 KPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNST 360
+PT FLEEF I +YP K+ +F++N +H + D+ K + +V YI+
Sbjct: 307 RPTPFLEEFFEAIYRQSYPKSKLHLFIHNAVSHHDGVVTDFYERAKREYVDVNYISVKQG 366
Query: 361 VNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
VN AR LA+++ YF VD+ +HLDN LK LV + ++APLL+RP+KAWS
Sbjct: 367 VNEVHARKLAMKHCAFNKCSGYFSVDAVAHLDNEHTLKLLVEQQRRIVAPLLIRPYKAWS 426
Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIY--- 477
NFWGA+ DGFYARSFDYM II ++ +G+WNVP+++NCYL+ +++ + + Y
Sbjct: 427 NFWGAITDDGFYARSFDYMEIIKNER--RGLWNVPFVSNCYLINATILNDESTRPFYGNP 484
Query: 478 ---TLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLD 534
+ MD +MAF R+ G+ + + + ++GHLV+ + +D + T PE+Y+++ N LD
Sbjct: 485 DGNSDADMDSEMAFAQRNRHAGVFMYVSNRLDFGHLVNPDTYDIKLTYPEMYQIMDNKLD 544
Query: 535 WDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN---- 590
WD RYIH +Y +S PD QPCPDV+WFPIVT +F +E + I+EA+GQWSDG+N
Sbjct: 545 WDRRYIHAKYSESFNPDNKPIQPCPDVYWFPIVTRRFTNELIGIVEAFGQWSDGSNHDPR 604
Query: 591 ---------------------------------------------NDKRLETGYEAVPTR 605
+D RLETGYEAVPTR
Sbjct: 605 LSGGYENVPTRDIHMNQVQYEQQWLYFLKEYVRPLQELVFTGYYHDDPRLETGYEAVPTR 664
Query: 606 DIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHD 665
DIHM QV L W +FL+ YV PLQ+ F GY P R+ M+FVV+YRPDEQP LRPHHD
Sbjct: 665 DIHMNQVDLQDAWLKFLKDYVSPLQQLVFTGYDDYPPRSLMNFVVKYRPDEQPYLRPHHD 724
Query: 666 SSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
SSTYTINIALNQ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT GTR
Sbjct: 725 SSTYTINIALNQAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRVTAGTR 784
Query: 726 YIMISFVDP 734
YIMISFVDP
Sbjct: 785 YIMISFVDP 793
>gi|347966056|ref|XP_321614.4| AGAP001507-PA [Anopheles gambiae str. PEST]
gi|333470231|gb|EAA00870.4| AGAP001507-PA [Anopheles gambiae str. PEST]
Length = 727
Score = 752 bits (1942), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/702 (51%), Positives = 490/702 (69%), Gaps = 11/702 (1%)
Query: 35 LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDEM 94
L+ TVASN T+GY R+++SA+ L V TLG+ +PWLGG+M S+GGGYK+NLL+ L
Sbjct: 35 LIFTVASNATEGYVRYLRSAKHYDLTVTTLGMGKPWLGGNMKSVGGGYKINLLREALKPY 94
Query: 95 DITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV-GS 153
D ++L TDSYDV+ I E+F +F+A+I+FGAE CWPD SL YP + G
Sbjct: 95 RADKDRLVLFTDSYDVLFLAPWAKIQEKFASFEASILFGAEGFCWPDESLKSAYPPLEGR 154
Query: 154 GYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLF 213
G RYLNSG F+GYA + +L+ +K+ EDDQLYY +LDE LR + I LD +A LF
Sbjct: 155 GMRYLNSGLFMGYADKLYKLLKT-PVKDAEDDQLYYTKAYLDEELRQELNIKLDHMATLF 213
Query: 214 QNLYGSLEDIKLNFDLDEF-VHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSG 272
QNL G E + L+ + E L N++YNT P I+HGNG SK+ LNS+ NYLA ++
Sbjct: 214 QNLNGVEEQVVLSLEPSEKEATLANSEYNTKPAIVHGNGPSKLTLNSYANYLAGAFVDGE 273
Query: 273 CTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQE 332
C + +L + P V +++F++KPT FLEE+ IA LNYPA ++ + V++N
Sbjct: 274 CQTVKEGRL--TLSGGELPLVTMALFVEKPTPFLEEWFGTIAKLNYPADRLDVLVHSNVA 331
Query: 333 YHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLD 392
YHA ++ + ++++K I H+ ARN A ++ +G D+ F VDS+ HLD
Sbjct: 332 YHAGTVKAFLDAQEGRYRSLKVIEHDGDFTETAARNFATKHCELRGCDYLFVVDSEGHLD 391
Query: 393 NPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIW 452
+P+VL+ L+ N ++IAP+L RP K WSNFWGAL+ GFYARS DYM+I+ + G+W
Sbjct: 392 DPNVLRALIEANRNVIAPVLTRPEKVWSNFWGALSGQGFYARSNDYMDIVG--RKLLGLW 449
Query: 453 NVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVD 512
NVP+++ YL+K +V+ + Y L D DMA C + R+KGI + + + ++YGHL+D
Sbjct: 450 NVPFVSIVYLVKRAVLPEVS----YELQETDPDMALCWHFRSKGIFMHVINVEQYGHLID 505
Query: 513 SENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFC 572
+E FD +T+P+ Y+L N DW+ RY+ P Y++ L D V QPCPDV+WF I +++FC
Sbjct: 506 TEYFDMTRTHPDFYQLFNNRHDWEQRYLAPGYKQQLEADFVPQQPCPDVYWFAIGSDRFC 565
Query: 573 HEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQER 632
+ +I+EA+G+WSDG+++DKRL+ GYEAVPTRDIHM QVGL +W +FL+ YV PLQE+
Sbjct: 566 DDLREIVEAFGEWSDGSHSDKRLQGGYEAVPTRDIHMNQVGLEQLWLKFLQLYVRPLQEK 625
Query: 633 EFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRY 692
FIGY H+P R+ M+FVVRYRPDEQPSLRPHHDSSTYTINIALN GVDYEGGGCRF+RY
Sbjct: 626 VFIGYFHDPPRSLMNFVVRYRPDEQPSLRPHHDSSTYTINIALNTAGVDYEGGGCRFLRY 685
Query: 693 NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
NC+VT TR GWML+HPGRLTH+HEGL T+GTRYIMISFVDP
Sbjct: 686 NCSVTDTRKGWMLLHPGRLTHFHEGLLTTKGTRYIMISFVDP 727
>gi|427783339|gb|JAA57121.1| Putative procollagen-lysine 2-oxoglutarate 5-dioxygenase
[Rhipicephalus pulchellus]
Length = 772
Score = 749 bits (1935), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/704 (50%), Positives = 487/704 (69%), Gaps = 6/704 (0%)
Query: 33 KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNEL 91
+ ++ TVAS+ETDG+KRF +SA+V L+ K LG+H+ WLGGDM+ +GGGYKV LLK L
Sbjct: 73 RLVIFTVASDETDGFKRFARSAKVYGLEPKILGMHEEWLGGDMAKGMGGGYKVRLLKKAL 132
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
++ +I+ DSYDV+ G ++IL +F F++N+VF AE CWPD SL + YP
Sbjct: 133 EDYKNDAATLIMFVDSYDVVFTAGEDEILRKFYKFNSNVVFSAEGFCWPDRSLAEAYPK- 191
Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
+G R+LNSGGFIGYA + ++S+ ++++ DDQL+Y ++L+E LR K I LD A
Sbjct: 192 ANGERFLNSGGFIGYAPQLYSIVSSSDLEDDADDQLFYTKIYLNEDLRRKWGIRLDHKAE 251
Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
+FQNL G++ D++L LD +L N+ Y T P++IHGNG SK+ LN+ GNYLAKSW
Sbjct: 252 IFQNLNGAVGDVEL-LGLDSEPYLHNSAYGTTPLVIHGNGPSKVILNNLGNYLAKSWNDM 310
Query: 271 SGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN 330
+GC C L + P VLI +F++ PT FL+E L K+ NLNYP +KI +FV+N
Sbjct: 311 AGCRVCYDTFSLSDKLDSELPKVLIGIFVEHPTPFLKEALQKVYNLNYPKEKIHLFVHNA 370
Query: 331 QEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
+E+H ++ + + +VKY+ + ARNLA+E L D+ F+VDS++H
Sbjct: 371 EEFHDAEVTKFVEEYGPAYHSVKYLDVSEAKKEWHARNLALEQCLKINCDYAFFVDSEAH 430
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
LDNPD L+ L+ N +++APLL R WSNFWG+L+ADG+YARS DY++++ ++ KG
Sbjct: 431 LDNPDTLRLLIETNRTIVAPLLSRHKSLWSNFWGSLSADGYYARSHDYVSLVKRER--KG 488
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
IWNVP++ YL+ S++K+ + +D DMAFC N+R++GI + + + YGHL
Sbjct: 489 IWNVPFVNGAYLINGSLVKSREKFPSFINGLLDPDMAFCKNMRDRGIFMFMTNMDNYGHL 548
Query: 511 VDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEK 570
+++E FD + NP+ YE+ N DW+ RY+H Y K L P + PCPDV+WFP+V+E
Sbjct: 549 INAETFDTRHKNPDFYEIYSNQKDWERRYLHENYTKVLDPSYKVDMPCPDVYWFPVVSET 608
Query: 571 FCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQ 630
FC +QIME +G+WS GTN D+RL GYE VPTRDIHM QVGL W FLR+Y+ P+Q
Sbjct: 609 FCEHLIQIMENFGKWSSGTNEDERLAGGYENVPTRDIHMNQVGLEQHWLYFLREYIRPVQ 668
Query: 631 EREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI 690
E+ F+GY H+P +A M+FVVRY P+EQ LRPHHDSSTYTINIALN+ +DYEGGGC F+
Sbjct: 669 EKVFLGYFHDPPKAIMNFVVRYHPEEQYFLRPHHDSSTYTINIALNRPHIDYEGGGCHFL 728
Query: 691 RYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RYNC+V + GW LMHPGRLTHYHEGL VT+GTRYIM+SFVDP
Sbjct: 729 RYNCSVVDLKRGWSLMHPGRLTHYHEGLPVTKGTRYIMVSFVDP 772
>gi|242016159|ref|XP_002428703.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor,
putative [Pediculus humanus corporis]
gi|212513374|gb|EEB15965.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor,
putative [Pediculus humanus corporis]
Length = 661
Score = 724 bits (1870), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/665 (51%), Positives = 472/665 (70%), Gaps = 9/665 (1%)
Query: 75 MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGA 134
M S GGG K+NL + E+++ + II+ TDSYDVI G+NDILE+F+ +VFGA
Sbjct: 1 MKSTGGGQKINLFREEVEKYKNDHEKIIIFTDSYDVIFLAGLNDILEQFDKIGGRVVFGA 60
Query: 135 ERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFL 194
E CWPD +L +YP G +YLNSGG IGYA ++ E++++RSI +++DDQL+Y +L
Sbjct: 61 EPFCWPDKNLASQYPIQSRGKQYLNSGGIIGYAPELYEILTHRSIDDDDDDQLFYTQAYL 120
Query: 195 DETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKS- 253
+ETLR KI LD + +F NL+G+++++ L F E +L N + ++P+I+HGNG +
Sbjct: 121 NETLRNNLKIKLDHKSQIFHNLHGAMDELSLKFKNHE-PYLENEQMKSHPLILHGNGPTV 179
Query: 254 -KIELNSFGNYLAKSWKTS-GCTRC--NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEF 309
K+ LN+ GNYL W T GC C N+I L P V +++F+ KPT FLE+F
Sbjct: 180 VKVGLNNLGNYLPNCWNTRDGCVSCKENVIT-LSDEDTSNHPRVFVALFVSKPTPFLEDF 238
Query: 310 LNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNL 369
L K+ +L YP KI++FVYN ++H D ++ F+ +K+VK I + + A+ L
Sbjct: 239 LQKVGDLKYPKNKINLFVYNFIKHHERDVDKFVGKFREKYKSVKEIKADDEIAESHAKTL 298
Query: 370 AVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNAD 429
A+E+ DFYF +DS++HLDNP LK LV +N +++AP+LVRPFKAWSNFWG + D
Sbjct: 299 AIEHFKTSKADFYFNLDSEAHLDNPYTLKLLVEQNRTIVAPMLVRPFKAWSNFWGGIAED 358
Query: 430 GFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFC 489
GFYARSFDYM+++N ++ +G+WNVPYI+ CYL+ +VI+ K Y ++D DMAFC
Sbjct: 359 GFYARSFDYMDLVNNEK--RGLWNVPYISGCYLINGTVIRNDETKPSYVEGALDPDMAFC 416
Query: 490 TNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLL 549
++R KG+ + + + ++GHL++ + +D +T+P+ Y++ N DW+ RY+H Y ++L
Sbjct: 417 HHMREKGVFMYVSNRVDFGHLINPDTYDVTRTHPDFYQIFDNKWDWEQRYLHENYSENLN 476
Query: 550 PDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHM 609
P+T PCPDV+WFPI + +FC E ++I E YG+WSDG+N D RL+ GYE VPTRDIHM
Sbjct: 477 PETKPLMPCPDVYWFPIASPRFCQELIEICETYGKWSDGSNKDLRLDGGYENVPTRDIHM 536
Query: 610 KQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY 669
KQ+GL W FL++YV PLQE FIGY+H P RA M+FVVRY+PDEQPSLRPHHDSSTY
Sbjct: 537 KQIGLEYHWLYFLKEYVRPLQENVFIGYYHNPPRAIMNFVVRYKPDEQPSLRPHHDSSTY 596
Query: 670 TINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
TIN+ALN VDYEGGGCRF+RYNC+VT TR+GW+LMHPGRLTHYHEGL VT+GTRYIM+
Sbjct: 597 TINLALNTPKVDYEGGGCRFLRYNCSVTDTRLGWLLMHPGRLTHYHEGLLVTKGTRYIMV 656
Query: 730 SFVDP 734
SFVDP
Sbjct: 657 SFVDP 661
>gi|91083241|ref|XP_973819.1| PREDICTED: similar to AGAP001507-PA [Tribolium castaneum]
Length = 751
Score = 723 bits (1867), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/713 (48%), Positives = 483/713 (67%), Gaps = 8/713 (1%)
Query: 27 KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD--MSSLGGGYKV 84
K+ + LV TVAS TDG++R++ SA + LG Q W GG + GGG+K+
Sbjct: 42 KSTTDADILVFTVASEPTDGFQRYLSSAHHYHIAPTVLGFGQEWKGGSDIKNRPGGGWKI 101
Query: 85 NLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSL 144
NLLK L+ IIL TD YDVI ++ IL +F A ++FGAE CWPD L
Sbjct: 102 NLLKTALEPHKDDPTKIILFTDGYDVIFTDTLDAILRKFKETKARVLFGAESSCWPDVQL 161
Query: 145 YDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKI 204
KYP V G R+LNSG ++GYA D+ ++++ I++ +DDQL++ +LDE LR K
Sbjct: 162 APKYPQVTEGKRFLNSGLYMGYAPDLWQVLTFDVIEDTDDDQLFFTKAYLDEDLRKKVGF 221
Query: 205 VLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYL 264
LD + +FQNL G+L ++K +E+ + N Y+T P+I+HGNG SK+ LN GNYL
Sbjct: 222 KLDHKSEIFQNLNGALFEVKAKEGPEEY-KIQNVLYHTVPLILHGNGPSKLSLNYLGNYL 280
Query: 265 AKSWKT-SGCTRCNLIK-HLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
A SW + GC RC + L + + ++ VL+++F++ T FLEE L+K+ + YP +
Sbjct: 281 ANSWNSVEGCVRCKEGQFDLKNKRANEMSLVLLAIFVEFNTPFLEEMLSKVYSQEYPKHR 340
Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFY 382
I +F++N ++H+ D+I + +++VK I + AR+L++ L K D Y
Sbjct: 341 IDLFIHNAMKFHSKHITDFIEKHGSEYRSVKDIKPDDGTTEWAARDLSLAQCLSKNCDIY 400
Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
F VDS +HLDNP L+ L+ +N +++APLL RP KAWSNFWG L +GFYARS DYM+I+
Sbjct: 401 FSVDSVAHLDNPHTLRLLIEQNRTVVAPLLPRPGKAWSNFWGDLTKEGFYARSNDYMDIV 460
Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVIKATN-IKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
+ D+ +G+WNVP+I NCY + +++K + K + ++ D DMAFC NLR+ + + +
Sbjct: 461 HNDK--RGLWNVPFIANCYAINATLLKKFDETKLNFDRDNWDADMAFCANLRDLDVFMYV 518
Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDV 561
+ ++GHLV+ E FD + PE+Y++ N DW+ R+IHPEY ++ P+ + QPCPDV
Sbjct: 519 SNRVDFGHLVNPETFDITRVEPEMYQIFDNEQDWEARFIHPEYPENFNPEKTSLQPCPDV 578
Query: 562 FWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEF 621
+WFPIV+ +FC + +ME +G+WSDG+N D RLE GYEAVPTRDIHM QVG W EF
Sbjct: 579 YWFPIVSPRFCTSLINMMENFGKWSDGSNKDPRLEGGYEAVPTRDIHMNQVGWEKHWLEF 638
Query: 622 LRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVD 681
LRKYV PLQE F+GY H+P R+ M+FVVRY+PDEQPSLRPHHDSSTYTINIALNQ GVD
Sbjct: 639 LRKYVRPLQEHVFLGYFHDPPRSLMNFVVRYKPDEQPSLRPHHDSSTYTINIALNQRGVD 698
Query: 682 YEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
YEGGGCRFIRYNC+V T++GW+L+HPGRLTHYHEGL+VT+G RYIMI+FVDP
Sbjct: 699 YEGGGCRFIRYNCSVVDTKLGWLLIHPGRLTHYHEGLKVTKGIRYIMIAFVDP 751
>gi|195441570|ref|XP_002068579.1| GK20548 [Drosophila willistoni]
gi|194164664|gb|EDW79565.1| GK20548 [Drosophila willistoni]
Length = 699
Score = 723 bits (1867), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/701 (50%), Positives = 477/701 (68%), Gaps = 11/701 (1%)
Query: 36 VITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDEMD 95
V TVAS TDGY R+I+SA V ++V TLG+ W GGDM GGGYK+NLL+ +
Sbjct: 8 VFTVASEPTDGYMRYIRSARVYDIEVTTLGMGDEWKGGDMQRAGGGYKLNLLREAIAPHK 67
Query: 96 ITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV-GSG 154
D IIL TDSYDVII V +I+E+F +A I+F AE+ CWPD +L D+YP V G
Sbjct: 68 EAQDKIILFTDSYDVIITANVEEIVEKFKESEAKILFSAEKFCWPDKTLADQYPEVEGKA 127
Query: 155 YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQ 214
RYLNSG FIGYA + EL+ + I + +DDQLY+ +FLDET R K I LDT + LFQ
Sbjct: 128 SRYLNSGAFIGYAPQVYELLEDTPIDDTDDDQLYFTKIFLDETKRGKLGIELDTQSRLFQ 187
Query: 215 NLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGC 273
NL+G+ D+KL DLD L N + T P IIHGNG SK+ELN++GNYLAK++ + C
Sbjct: 188 NLHGAKNDVKLKVDLDSNQGILQNIDFMTTPAIIHGNGLSKVELNAYGNYLAKTF-SGIC 246
Query: 274 TRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEY 333
T C +++ L ++ P + +SV + P F ++FL I LNYP K I +F+Y+N E
Sbjct: 247 TFC--LENPLELNENELPIISLSVIVPHPVPFFDQFLKGIETLNYPKKSIHLFIYSNVEL 304
Query: 334 HAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDN 393
H +++ K + + KY+ ++ + AR LA+E + D+ F VD +SH+D+
Sbjct: 305 HDAAVKSFVNQNKDSYASAKYVLSTDELDERRARQLALEQAKRHHSDYIFNVDGESHIDD 364
Query: 394 PDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWN 453
+VL+ L+ N+ +APL + + WSNFWGAL+ G+YARS DY++I+ D G++N
Sbjct: 365 AEVLRELLRLNKQFVAPLFAKYHELWSNFWGALSDSGYYARSHDYVDIVKRDL--IGMFN 422
Query: 454 VPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDS 513
VP++T+ YL+K S N + D DMA +LRN GI + I + + +GHL+++
Sbjct: 423 VPHVTSIYLIKHSAFDVIN----FNHKEYDPDMALSESLRNAGIFMYISNQRYFGHLINT 478
Query: 514 ENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCH 573
+NF+ P+ + L N DW +YIHP Y L T+ QPCPDVFWF IVT+ FC
Sbjct: 479 DNFNSTLVRPDFHTLFTNRYDWTEKYIHPNYSLQLNESTIIPQPCPDVFWFQIVTDDFCD 538
Query: 574 EFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQERE 633
+ V IME++G WSDG+N+DKRLE GYEAVPTRDIHMKQVGL ++ +FL+ +V PLQE+
Sbjct: 539 DLVAIMESHGGWSDGSNSDKRLEGGYEAVPTRDIHMKQVGLESLYLKFLQLFVRPLQEKV 598
Query: 634 FIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYN 693
F+GY+H P R+ M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DYEGGGCRF+RYN
Sbjct: 599 FLGYYHNPPRSLMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNKAGIDYEGGGCRFLRYN 658
Query: 694 CNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
C+VT T+ GWMLMHPGRLTH+HEGL VT GTRYIMISF+DP
Sbjct: 659 CSVTDTKKGWMLMHPGRLTHFHEGLLVTNGTRYIMISFIDP 699
>gi|405960464|gb|EKC26389.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Crassostrea
gigas]
Length = 730
Score = 723 bits (1867), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/735 (47%), Positives = 492/735 (66%), Gaps = 16/735 (2%)
Query: 4 NLHLNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKT 63
N ++ LI +C ++ V ++++++ +IT+ ++ TDG +R+++S L +
Sbjct: 8 NFFIDVLIFTCGIY-------SVASLEDNELKLITIGTDVTDGLRRYLRSTNKYDLDAEV 60
Query: 64 LGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILER 122
G+ W GGD++ S GGG+KVN+LK EL++ +++I++ TDSYDV++ G DILE+
Sbjct: 61 FGIGMDWKGGDVANSAGGGHKVNILKKELEKYKDQENLILMFTDSYDVVLTAGKQDILEK 120
Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSG-YRYLNSGGFIGYAKDIKELISNRSIKN 181
F F+A +VF AE CWPD SL YP V S R+LNSGG++GYAKD+ E+I++RSIK+
Sbjct: 121 FKKFNARVVFSAEGFCWPDPSLAASYPEVKSKEKRFLNSGGYVGYAKDLYEIITHRSIKD 180
Query: 182 EEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYN 241
+DDQLY+ +FLDETLR K + LD + LFQN++G+ D+ + F D + N
Sbjct: 181 TDDDQLYFTNIFLDETLRKKWNMKLDVKSELFQNMHGAQGDVTIKFKSDH-SYAYNVITG 239
Query: 242 TNPVIIHGNGKSKIELNSFGNYLAKSWKT-SGCTRCNL-IKHLDSLKPDQFPSVLISVFI 299
T PV++HGNG K E N F NYLA W T +GC C + LK D+FP+VL+S+F
Sbjct: 240 TTPVVVHGNGPIKPEFNRFANYLADGWTTQNGCQACKEETISIRELKDDEFPTVLVSLFF 299
Query: 300 DKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNS 359
++PT F E+FL +IANL YP +I +F++N E+H ++ + M+++ + +
Sbjct: 300 EQPTPFAEDFLERIANLKYPKSRIDLFIHNKVEFHNKDIASFLEKYNDMYRSATILMPSD 359
Query: 360 TVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAW 419
+ ARN AVE K + F VD + + +P+ L L+ +N +++AP+L RP+K W
Sbjct: 360 GIYEAAARNWAVEVCKQKNDQYLFSVDVYAQITDPETLIDLIEQNRTVLAPILSRPYKLW 419
Query: 420 SNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTL 479
SNFWGA+N DG+YARS DY++I+ ++ G+WNVPYIT YL+ S+++ ++ IY+
Sbjct: 420 SNFWGAVNKDGWYARSEDYIDIV--EKKKIGLWNVPYITGAYLIHGSLME--ELRDIYSA 475
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRY 539
+++ DMAFC LR +GI + + + GHLVD +N D + ++Y++++NP DW L+Y
Sbjct: 476 ENVEPDMAFCGGLRKRGIFMYATNRKILGHLVDYDNMDTSHLHNDLYQIVQNPYDWKLKY 535
Query: 540 IHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGY 599
IH Y +SL + QPCPDVFWFPIV+ KFC V+ ME QWS G + D RL GY
Sbjct: 536 IHENYSQSLELNRTLVQPCPDVFWFPIVSTKFCDSLVEEMEHLNQWSGGRHEDPRLAGGY 595
Query: 600 EAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPS 659
E VPT D HM+Q+G+ W FL+ YV PLQER F GYH +P RA M+FVVRYRP+EQ
Sbjct: 596 ENVPTVDTHMRQIGMEEHWLHFLKVYVSPLQERAFEGYHSDPPRAIMNFVVRYRPNEQDR 655
Query: 660 LRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQ 719
LRPHHDSST+TINIALN D+EGGGCRF+RYNC+VTATR GWMLMHPGRLTH+HEGL
Sbjct: 656 LRPHHDSSTFTINIALNTPMKDFEGGGCRFLRYNCSVTATRKGWMLMHPGRLTHFHEGLV 715
Query: 720 VTQGTRYIMISFVDP 734
T+GTRYIMISFVDP
Sbjct: 716 TTKGTRYIMISFVDP 730
>gi|195379566|ref|XP_002048549.1| GJ11296 [Drosophila virilis]
gi|194155707|gb|EDW70891.1| GJ11296 [Drosophila virilis]
Length = 741
Score = 721 bits (1861), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/710 (48%), Positives = 486/710 (68%), Gaps = 13/710 (1%)
Query: 27 KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNL 86
+N++E K V TVA+ TDGY+R+++SA V ++V TLG+ + W GGDM S GGG+K+NL
Sbjct: 43 QNLNE-KIKVFTVATEPTDGYRRYVRSANVYDIEVTTLGMGEEWQGGDMKSAGGGFKINL 101
Query: 87 LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
L+ ++++ +D IIL TDSYDVI +++ILE+F A ++F AE+ CWPD SL D
Sbjct: 102 LRKAIEDLKDEEDTIILFTDSYDVIFTAALDEILEKFKESGAKLLFSAEKYCWPDKSLAD 161
Query: 147 KYPAV-GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIV 205
+YP V G R+LNSG FIGYA + L+ +I+N DDQLY+ +FLDE R K +
Sbjct: 162 QYPEVEGKASRFLNSGAFIGYAPQVYALLE-EAIENTGDDQLYFTKVFLDEAKRAKLGMK 220
Query: 206 LDTLANLFQNLYGSLEDIKLNFDLDEFV-HLTNTKYNTNPVIIHGNGKSKIELNSFGNYL 264
LDT + LFQNL+G+ D+KL DLD L N + T P+IIHGNG SK++LN++GNYL
Sbjct: 221 LDTQSRLFQNLHGAKNDVKLKVDLDSNQGTLQNIDFMTTPLIIHGNGLSKVDLNAYGNYL 280
Query: 265 AKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKIS 324
AK++ + CT C +++ L P + ++V + + F + FL I LNYP + +
Sbjct: 281 AKTF-SGVCTFC--LEYPLELDEQNLPIITLAVMVPQAVPFFDMFLASIEKLNYPKESLH 337
Query: 325 MFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFY 384
+F+Y+N H + Y +N + + K++ ++ ++ R LA++ + + D+ F+
Sbjct: 338 LFMYSNVALHDDAVESYANNQGKNYASAKFVLSVDELDERQGRQLALDKAKLQHSDYIFF 397
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
VD+D+H+D+ +VL+ L+ N+ +AP+ + + WSNFWGAL+ +G+YARS DY++I+
Sbjct: 398 VDADAHIDDSEVLRELLRMNKQFVAPVFSKYHELWSNFWGALSENGYYARSHDYVDIVKR 457
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
D G++NVP++T YL+K S A + N D DMA C +LRN GI + + +
Sbjct: 458 DL--IGMFNVPHVTTIYLIKHSAFDAIKFEH----NDFDPDMAMCESLRNAGIFMYVSNQ 511
Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
+ +GHL++++NF+ P+ Y L N DW L+YIH Y L V QPCPDVFWF
Sbjct: 512 RYHGHLINADNFNTTVVRPDFYTLFSNQYDWTLKYIHQNYSTQLNESMVIPQPCPDVFWF 571
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
IV++ FC + V IMEAYG+WSDG+NND RLE GYEAVPTRDIHM+QVGL ++ +FL+
Sbjct: 572 QIVSDAFCDDLVAIMEAYGKWSDGSNNDNRLEGGYEAVPTRDIHMRQVGLDTLYLKFLQI 631
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
+V PLQER F GY+H P R+ M+F+VRYRPDEQP LRPHHDSSTYTINIA+N VG+DYEG
Sbjct: 632 FVRPLQERVFTGYYHNPPRSLMNFMVRYRPDEQPFLRPHHDSSTYTINIAMNSVGIDYEG 691
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGC F+RYNC+VT T+ GWMLMHPGRLTH+HEGL VT+GTRYIMISF+DP
Sbjct: 692 GGCHFLRYNCSVTETKKGWMLMHPGRLTHFHEGLLVTKGTRYIMISFIDP 741
>gi|270006955|gb|EFA03403.1| hypothetical protein TcasGA2_TC013390 [Tribolium castaneum]
Length = 756
Score = 721 bits (1860), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/718 (48%), Positives = 485/718 (67%), Gaps = 13/718 (1%)
Query: 27 KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD--MSSLGGGYKV 84
K+ + LV TVAS TDG++R++ SA + LG Q W GG + GGG+K+
Sbjct: 42 KSTTDADILVFTVASEPTDGFQRYLSSAHHYHIAPTVLGFGQEWKGGSDIKNRPGGGWKI 101
Query: 85 NLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSL 144
NLLK L+ IIL TD YDVI ++ IL +F A ++FGAE CWPD L
Sbjct: 102 NLLKTALEPHKDDPTKIILFTDGYDVIFTDTLDAILRKFKETKARVLFGAESSCWPDVQL 161
Query: 145 YDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKI 204
KYP V G R+LNSG ++GYA D+ ++++ I++ +DDQL++ +LDE LR K
Sbjct: 162 APKYPQVTEGKRFLNSGLYMGYAPDLWQVLTFDVIEDTDDDQLFFTKAYLDEDLRKKVGF 221
Query: 205 VLDTLANLFQNLYGSLEDIKLNFDLD-----EFVHLTNTKYNTNPVIIHGNGKSKIELNS 259
LD + +FQNL G++ +++L F++ E + N Y+T P+I+HGNG SK+ LN
Sbjct: 222 KLDHKSEIFQNLNGAVSEVEL-FEVKAKEGPEEYKIQNVLYHTVPLILHGNGPSKLSLNY 280
Query: 260 FGNYLAKSWKT-SGCTRCNLIK-HLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLN 317
GNYLA SW + GC RC + L + + ++ VL+++F++ T FLEE L+K+ +
Sbjct: 281 LGNYLANSWNSVEGCVRCKEGQFDLKNKRANEMSLVLLAIFVEFNTPFLEEMLSKVYSQE 340
Query: 318 YPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK 377
YP +I +F++N ++H+ D+I + +++VK I + AR+L++ L K
Sbjct: 341 YPKHRIDLFIHNAMKFHSKHITDFIEKHGSEYRSVKDIKPDDGTTEWAARDLSLAQCLSK 400
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
D YF VDS +HLDNP L+ L+ +N +++APLL RP KAWSNFWG L +GFYARS D
Sbjct: 401 NCDIYFSVDSVAHLDNPHTLRLLIEQNRTVVAPLLPRPGKAWSNFWGDLTKEGFYARSND 460
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATN-IKTIYTLNSMDYDMAFCTNLRNKG 496
YM+I++ D+ +G+WNVP+I NCY + +++K + K + ++ D DMAFC NLR+
Sbjct: 461 YMDIVHNDK--RGLWNVPFIANCYAINATLLKKFDETKLNFDRDNWDADMAFCANLRDLD 518
Query: 497 IHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQ 556
+ + + + ++GHLV+ E FD + PE+Y++ N DW+ R+IHPEY ++ P+ + Q
Sbjct: 519 VFMYVSNRVDFGHLVNPETFDITRVEPEMYQIFDNEQDWEARFIHPEYPENFNPEKTSLQ 578
Query: 557 PCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAG 616
PCPDV+WFPIV+ +FC + +ME +G+WSDG+N D RLE GYEAVPTRDIHM QVG
Sbjct: 579 PCPDVYWFPIVSPRFCTSLINMMENFGKWSDGSNKDPRLEGGYEAVPTRDIHMNQVGWEK 638
Query: 617 VWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALN 676
W EFLRKYV PLQE F+GY H+P R+ M+FVVRY+PDEQPSLRPHHDSSTYTINIALN
Sbjct: 639 HWLEFLRKYVRPLQEHVFLGYFHDPPRSLMNFVVRYKPDEQPSLRPHHDSSTYTINIALN 698
Query: 677 QVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
Q GVDYEGGGCRFIRYNC+V T++GW+L+HPGRLTHYHEGL+VT+G RYIMI+FVDP
Sbjct: 699 QRGVDYEGGGCRFIRYNCSVVDTKLGWLLIHPGRLTHYHEGLKVTKGIRYIMIAFVDP 756
>gi|195128689|ref|XP_002008794.1| GI11618 [Drosophila mojavensis]
gi|193920403|gb|EDW19270.1| GI11618 [Drosophila mojavensis]
Length = 744
Score = 720 bits (1859), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/705 (50%), Positives = 484/705 (68%), Gaps = 12/705 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
DK V TVA+ +TDGY+R+I+SA+V ++V TLGL + W GGDM LGGGYK+NLL+ +
Sbjct: 50 DKIKVFTVATEQTDGYRRYIRSAQVYDIEVTTLGLGEEWQGGDMKGLGGGYKINLLRKAV 109
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+E+ +D IIL TDSYDV+ + +ILE+F A ++F AE+ CWPD SL D YPAV
Sbjct: 110 EELKDAEDTIILFTDSYDVVFTAPLTEILEKFKESGAKVLFSAEKYCWPDKSLADSYPAV 169
Query: 152 GSG-YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
G+ RYLNSG FIGYA + EL+ I++ DDQLYY +FLDE R K I LDT +
Sbjct: 170 GAKESRYLNSGAFIGYAPQVVELL-KEEIEDTGDDQLYYTKIFLDEAKRAKLNIKLDTQS 228
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
LFQ+L G+ D+KL DLD L N + T P IIHGNG SKI LN++ NYLAK++
Sbjct: 229 RLFQSLNGAQNDVKLEVDLDSNQGVLQNIDFLTTPAIIHGNGPSKINLNAYANYLAKTF- 287
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+ CT C ++ L + P + ++V +++P FL+ FL I LNYP K + +F+Y+
Sbjct: 288 SGVCTFCQ--EYPLELNEQELPIITLAVMVNQPVPFLDMFLAGIEKLNYPKKSMHLFMYS 345
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
N E H L Y+ + +VKYI + + R LA++ + K D+ FYVD D+
Sbjct: 346 NAELHDELVQSYVTKHGKSYASVKYILSTDGLTESQGRQLALDKAKQKHSDYIFYVDGDA 405
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
H+++ +VL+ L+ N+ +AP+ + + WSNFWGAL+ G+YARS DY++I+ +
Sbjct: 406 HIEDSEVLRELLRMNKQFVAPVFSKYHELWSNFWGALSETGYYARSHDYVDIVK--RNLI 463
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
G++NVP++T YL+K S A + +D DMA +LR+ G+ + + + + +GH
Sbjct: 464 GMFNVPHVTTIYLIKKSAFDAVKFEH----KELDPDMAMSDSLRDAGVFMYVSNERYFGH 519
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
L++++NF+ P+ Y L N DW L+YIHP Y L V QPCPDV+WF IVT+
Sbjct: 520 LINADNFNTTVARPDFYTLFSNRYDWTLKYIHPNYSTQLNESVVIPQPCPDVYWFQIVTD 579
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
FC + V IMEAYG+WSDG+N+D RLE GYEAVPTRDIHM+QVGL ++ +FL+ +V PL
Sbjct: 580 AFCDDLVAIMEAYGKWSDGSNSDTRLEGGYEAVPTRDIHMRQVGLDALYLKFLQMFVRPL 639
Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
QER F+GY H+P R+ M+F+VRY+PDEQPSLRPHHDSSTYTINIA+N+VG+DYEGGGCRF
Sbjct: 640 QERVFMGYFHDPPRSLMNFMVRYKPDEQPSLRPHHDSSTYTINIAMNRVGIDYEGGGCRF 699
Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+RYNC+VT T+ GWMLMHPGRLTH+HEGL VT+GTRYIMISF+DP
Sbjct: 700 LRYNCSVTETKKGWMLMHPGRLTHFHEGLLVTKGTRYIMISFIDP 744
>gi|391344649|ref|XP_003746608.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
[Metaseiulus occidentalis]
Length = 756
Score = 719 bits (1856), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/710 (49%), Positives = 480/710 (67%), Gaps = 14/710 (1%)
Query: 28 NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNL 86
++D+ + LVITVA++ T+GY+RF+ SAE L V+TLGL + W GGD+ + GGG+KVNL
Sbjct: 58 SLDKFELLVITVATDRTEGYERFLASAEREDLTVETLGLDEEWRGGDVVHTTGGGHKVNL 117
Query: 87 LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
L+ LD+ D++I+ DSYDVI G DILE+F DA+ VF AE CWPD SL +
Sbjct: 118 LRKALDKYKDRSDLLIMFVDSYDVIFTGNKQDILEKFFALDADAVFSAEGFCWPDASLEN 177
Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
KYP G +YLNSGGF+G+A I ++ ++ +I++E+DDQL+Y ++LD +LR +I L
Sbjct: 178 KYPE-SDGKKYLNSGGFVGFAPAIHKIATHVAIQDEDDDQLFYTKIYLDPSLRESLRIRL 236
Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
D + +FQNL G++ D+ ++ D + + NT Y T P++IHGNG SK+ LNS NYLA
Sbjct: 237 DNKSTIFQNLNGAVGDVSIS--EDAYPKVKNTAYGTEPIVIHGNGPSKVALNSLANYLAG 294
Query: 267 SWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
+WK GC C + +L D P V + +FI++ T F +EFL+ L+YP +KIS+
Sbjct: 295 AWKNGEGCLVC---EDRITLGTDTMPQVTVGIFIEEATPFFDEFLDHFIELDYPKEKISL 351
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
F++ +YH ++ N + ++ + + + K AR A+E L DFY +
Sbjct: 352 FIHRGVDYHNERLRQFVENGAASYAKLEMTSTDDLLEWK-ARERALEVCLLDACDFYLNL 410
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
DS HL N VL++L+ ++ + IAPL++R +AWSNFWGAL ++GFYARS DYM I+ G+
Sbjct: 411 DSRVHLTNRKVLQHLIAKDRNFIAPLVMRTGQAWSNFWGALTSEGFYARSHDYMEIVKGE 470
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ KGIWNVPYI YL+K SV + Y ++D DMA C NLR +GI + +D+ +
Sbjct: 471 K--KGIWNVPYIGEVYLIKASVFSKKPLS--YVNGALDPDMALCKNLRERGIFMYVDNME 526
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDT-VNNQPCPDVFWF 564
++G L++SE+FD K +P+ YE+ N W LRYIH EY+ T V QPC DVFWF
Sbjct: 527 DFGFLINSEHFDTSKKHPDFYEIYNNQFAWALRYIHKEYKDIFSNHTGVLRQPCHDVFWF 586
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
P+ + FC ++IME +G WSDGTN+D RL GYE VPTRDIHMKQVGL W FLR+
Sbjct: 587 PLASPTFCTHLIEIMENHGGWSDGTNSDPRLAGGYENVPTRDIHMKQVGLEPQWLFFLRE 646
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
YV P+QE F GY+H+P +A M+FVVRYRPDEQPSL+PHHD+STYT+N+ALNQ G D+ G
Sbjct: 647 YVRPVQEHVFTGYYHDPPKAIMNFVVRYRPDEQPSLKPHHDASTYTLNLALNQAGKDFTG 706
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GG FIR NC+VT++ GW L+HPGRLTHYHEGL T GTRYIM+SFVDP
Sbjct: 707 GGSHFIRQNCSVTSSPSGWGLLHPGRLTHYHEGLTTTSGTRYIMVSFVDP 756
>gi|195166461|ref|XP_002024053.1| GL22837 [Drosophila persimilis]
gi|194107408|gb|EDW29451.1| GL22837 [Drosophila persimilis]
Length = 1367
Score = 718 bits (1854), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/705 (48%), Positives = 471/705 (66%), Gaps = 12/705 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
DK V TVA+ TDGY R+I+SA + ++V TLGL + W GGDM GGG+KVNLL+ +
Sbjct: 673 DKVEVFTVATEPTDGYARYIRSARIYDVKVTTLGLGEHWKGGDMQHPGGGFKVNLLRKAV 732
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+ D I+L TDSYDVII + +I+E F A ++F AE+ CWPD+SL D YP V
Sbjct: 733 APLKDEQDTIVLFTDSYDVIITAKLEEIVELFKESKAKLLFSAEKFCWPDSSLTDAYPEV 792
Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
G+ R+LNSG FIGYA + L+ +I + +DDQLYY +FLDE R K + LDT +
Sbjct: 793 EGNASRFLNSGAFIGYAPQVNALL-EEAIDDMDDDQLYYTKVFLDEARRAKLGMKLDTQS 851
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
LFQNL+G+ D+KL D++ L N + T P I+HGNG SK++LN++GNYLAK++
Sbjct: 852 RLFQNLHGAKNDVKLKVDIESNQGILQNVNFLTTPAIVHGNGLSKVDLNAYGNYLAKTF- 910
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
CT C ++L L P + +SV + F ++FL I +NYP + + + +Y+
Sbjct: 911 NGICTVCQ--EYLLELDEQHLPVISLSVIVPMAVPFFDQFLEGIEKINYPKQNLHLLIYS 968
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
N E H +++ + + KY ++ ++ R LA + + + D+ F++D D+
Sbjct: 969 NVELHDADIKSFVNKHGEKYASAKYTLSTDNLDERQGRQLAFDQAKLRKSDYIFFIDGDA 1028
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
H+D+ +VL+ L+ N+ +APL + + WSNFWGAL+ GFYARS DY++I+ D
Sbjct: 1029 HIDDGEVLRELLKLNKQFVAPLFAKYHELWSNFWGALSEGGFYARSHDYVDIVKRDL--I 1086
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
GI+NVP++T+ YL+++S + + + D DMA C +LR G+ + I + + +GH
Sbjct: 1087 GIFNVPHVTSIYLVRSSAFDVLSFQH----SEYDADMAMCESLRKAGVFMFISNQRYFGH 1142
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
LV+++NFD + P+ Y L N DW +YIHP Y + L TV QPCPDV+W IVT+
Sbjct: 1143 LVNADNFDTKVARPDFYTLFSNRYDWTEKYIHPNYSEQLNASTVIEQPCPDVYWMAIVTD 1202
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
FC + V IME +G WSDG+NND RLE GYEAVPTRDIHMKQVGL ++ +FL +V PL
Sbjct: 1203 AFCDDLVAIMENHGTWSDGSNNDNRLEGGYEAVPTRDIHMKQVGLEVLYLKFLELFVRPL 1262
Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
QER F GY+H P RA M+F+VRYRPDEQPSLRPHHD+STYTINIA+NQV DYEGGGCRF
Sbjct: 1263 QERVFTGYYHNPPRALMNFMVRYRPDEQPSLRPHHDASTYTINIAMNQVDTDYEGGGCRF 1322
Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+RYNC+VT T+ GWMLMHPGRLTHYHEGL VT+GTRYIMISF+DP
Sbjct: 1323 LRYNCSVTETKKGWMLMHPGRLTHYHEGLLVTKGTRYIMISFIDP 1367
>gi|195018422|ref|XP_001984779.1| GH16659 [Drosophila grimshawi]
gi|193898261|gb|EDV97127.1| GH16659 [Drosophila grimshawi]
Length = 696
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/705 (49%), Positives = 481/705 (68%), Gaps = 12/705 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
DK V TVAS TDGY+R+I+SA+V ++V TLG+ + W GGDM S GGG+K+NLL+ +
Sbjct: 2 DKIKVFTVASEPTDGYRRYIRSAKVYDIEVTTLGMGEEWKGGDMKSAGGGFKINLLRKAI 61
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+ + +D IIL TDSYDVII + +IL++F DA ++F AE+ CWPD SL ++YP V
Sbjct: 62 EPLKDAEDTIILFTDSYDVIITSTLEEILQKFKESDAKLLFSAEKYCWPDKSLANQYPEV 121
Query: 152 GSG-YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
G RYLNSG FIGYA + L+ I++ DDQLYY +FLDET R K + LDT +
Sbjct: 122 GGKESRYLNSGAFIGYAPQVNALLEEL-IEDTGDDQLYYTKVFLDETKRAKLGMKLDTQS 180
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
LFQNL+G+ +D+KL DLD L N + T P IIHGNG SK++LN++GNYLAK++
Sbjct: 181 KLFQNLHGAKDDVKLRVDLDSNQGILENVNFLTKPNIIHGNGLSKVDLNAYGNYLAKTF- 239
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+ CT C +++L L P + ++V + +P F + FL I LNYP K + +F+Y+
Sbjct: 240 SGICTVC--MEYLLDLDEQNLPIITLAVMVPQPVPFFDLFLAGIEKLNYPKKNLHLFIYS 297
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
H L Y++ + +VK++ ++ ++ R LA++ + + D+ FYVD D+
Sbjct: 298 GAALHDDLITSYVNKQGKSYASVKFVLSTDQLDERQGRQLALDKAKLQRSDYIFYVDGDA 357
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
H+D+ ++L+ L+ N+ AP+ + + WSNFWGAL+ +G+YARS DY++I+ D
Sbjct: 358 HIDDRELLRALLRLNKQFAAPVFSKYHELWSNFWGALSENGYYARSHDYVDIVKRDL--I 415
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
GI+NVP++T YL+K S A + N D DMA +LR+ GI + + + + GH
Sbjct: 416 GIFNVPHVTTIYLIKRSAFDAIK----FDHNEFDPDMALSKSLRDAGIFMYVSNQRYLGH 471
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
LV++ENF+ P+ + L N DW ++YI P Y L V QPCPDV+W IVT+
Sbjct: 472 LVNAENFNSTVVRPDFHTLFSNRYDWTIKYIQPNYSAQLNESMVIPQPCPDVYWLHIVTD 531
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
FC + V IMEA+G+WS+G N DKRLE GYEAVPTRDIHM+QVGL V+ +FL+ +V PL
Sbjct: 532 AFCDDLVAIMEAFGKWSEGKNQDKRLEGGYEAVPTRDIHMRQVGLDQVYLKFLQMFVRPL 591
Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
QER F GY+H P R+ M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N VG+DYEGGGCRF
Sbjct: 592 QERIFTGYYHNPPRSLMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNNVGIDYEGGGCRF 651
Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+RYNC+VT T+ GWMLMHPGRLTH+HEGL VT+GTRYIMISF+DP
Sbjct: 652 LRYNCSVTETKKGWMLMHPGRLTHFHEGLLVTRGTRYIMISFIDP 696
>gi|198466220|ref|XP_001353930.2| GA19434, partial [Drosophila pseudoobscura pseudoobscura]
gi|198150500|gb|EAL29666.2| GA19434, partial [Drosophila pseudoobscura pseudoobscura]
Length = 698
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/705 (49%), Positives = 472/705 (66%), Gaps = 12/705 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
DK V TVA+ TDGY R+I+SA + ++V TLGL + W GGDM GGG+KVNLL+ +
Sbjct: 4 DKVEVFTVATEPTDGYARYIRSARIYDVKVTTLGLGEHWKGGDMQHPGGGFKVNLLRKAV 63
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+ D I+L TDSYDVII + +I+E F A ++F AE+ CWPD+SL D YP V
Sbjct: 64 APLKDEQDTIVLFTDSYDVIITAKLEEIVELFKESKAKLLFSAEKFCWPDSSLTDAYPEV 123
Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
G+ R+LNSG FIGYA + L+ +I + +DDQLYY +FLDE R K + LDT +
Sbjct: 124 EGNASRFLNSGAFIGYAPQVNALLE-EAIDDMDDDQLYYTKVFLDEARRAKLGMKLDTQS 182
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
LFQNL+G+ D+KL D++ L N + T P I+HGNG SK++LN++GNYLAK++
Sbjct: 183 RLFQNLHGAKNDVKLKVDIESNQGILQNVNFLTTPAIVHGNGLSKVDLNAYGNYLAKTF- 241
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
CT C ++L L P + +SV + F ++FL I +NYP + + + +Y+
Sbjct: 242 NGICTVCQ--EYLLELDEQHLPVISLSVIVPMAVPFFDQFLEGIEKINYPKQNLHLLIYS 299
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
N E H +++ + + KY ++ ++ R LA + + + D+ F++D D+
Sbjct: 300 NVELHDADIKSFVNKHGEKYASAKYTLSTDNLDERQGRQLAFDQAKLRKSDYIFFIDGDA 359
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
H+D+ +VL+ L+ N+ +APL + + WSNFWGAL+ GFYARS DY++I+ D
Sbjct: 360 HIDDGEVLRELLKLNKQFVAPLFAKYHELWSNFWGALSEGGFYARSHDYVDIVKRDL--I 417
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
GI+NVP++T+ YL+++SV + + + D DMA C +LR G+ + I + + +GH
Sbjct: 418 GIFNVPHVTSIYLVRSSVFDVLSFQH----SEYDADMAMCESLRKAGVFMFISNQRYFGH 473
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
LV+++NFD + P+ Y L N DW +YIHP Y + L TV QPCPDV+W IVT+
Sbjct: 474 LVNADNFDTKVARPDFYTLFSNRYDWTEKYIHPNYSEQLNASTVIEQPCPDVYWMAIVTD 533
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
FC + V IME +G WSDG+NND RLE GYEAVPTRDIHMKQVGL ++ +FL +V PL
Sbjct: 534 AFCDDLVAIMENHGTWSDGSNNDNRLEGGYEAVPTRDIHMKQVGLEVLYLKFLELFVRPL 593
Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
QER F GY+H P RA M+F+VRYRPDEQPSLRPHHD+STYTINIA+NQV DYEGGGCRF
Sbjct: 594 QERVFTGYYHNPPRALMNFMVRYRPDEQPSLRPHHDASTYTINIAMNQVDTDYEGGGCRF 653
Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+RYNC+VT T+ GWMLMHPGRLTHYHEGL VT+GTRYIMISF+DP
Sbjct: 654 LRYNCSVTETKKGWMLMHPGRLTHYHEGLLVTKGTRYIMISFIDP 698
>gi|128485638|ref|NP_835202.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Rattus
norvegicus]
gi|81883555|sp|Q5U367.1|PLOD3_RAT RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3;
AltName: Full=Lysyl hydroxylase 3; Short=LH3; Flags:
Precursor
gi|55250563|gb|AAH85683.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Rattus
norvegicus]
gi|149062975|gb|EDM13298.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Rattus
norvegicus]
Length = 741
Score = 712 bits (1837), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/711 (47%), Positives = 484/711 (68%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ DK LVITVA+ ET+GY+RF+QSAE V+TLGL Q W GGD++ ++GGG KV L
Sbjct: 37 VNPDKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGQEWRGGDVARTVGGGQKVRWL 96
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ ++L++F ++++F AE CWPD L ++
Sbjct: 97 KKEMEKYASQEDMIIMFVDSYDVILASSPTELLKKFVQSGSHLLFSAESFCWPDWGLAEQ 156
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG G R+LNSGGFIG+A I ++ K+++DDQL+Y L+LD LR K K+ LD
Sbjct: 157 YPEVGVGKRFLNSGGFIGFAPTIHRIVRQWKYKDDDDDQLFYTQLYLDPGLREKLKLSLD 216
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 217 HKSRIFQNLNGALDEVVLKFDQNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 275
Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CNL + L +P P VL++VF+++PT FL FL ++ L+YP +IS+
Sbjct: 276 WTPQGGCGFCNLNRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRISL 333
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFY 384
F++NN+ YH P D + F VK + ++S EAR++A+++ +FYF
Sbjct: 334 FLHNNEVYHEPHIADAWPQLQDHFSAVKLVGPEEALSSGEARDMAMDSCRQNPECEFYFS 393
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L NP+ L+ L+ +N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 394 LDADAVLTNPETLRILIEQNRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 453
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ K +++ + D DMAFC ++R+KGI L + +
Sbjct: 454 KR--VGLWNVPYISQAYVIRGETLRTELPEKEVFSSSDTDPDMAFCRSVRDKGIFLHLSN 511
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + ++D +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 512 QHEFGRLLSTSHYDTDHLHPDLWQIFDNPVDWREQYIHENYSRALDGEGLVEQPCPDVYW 571
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP++TE+ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 572 FPLLTEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 631
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN GVDYE
Sbjct: 632 TYVGPMTEHLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGVDYE 690
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 691 GGGCRFLRYDCRVSSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741
>gi|28400779|emb|CAD23628.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Rattus
norvegicus]
Length = 741
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/711 (47%), Positives = 484/711 (68%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ DK LVITVA+ ET+GY+RF+QSAE V+TLGL Q W GGD++ ++GGG KV L
Sbjct: 37 VNPDKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGQEWRGGDVARTVGGGQKVRWL 96
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ ++L++F ++++F AE CWPD L ++
Sbjct: 97 KKEMEKYASQEDMIIMFVDSYDVILASSPTELLKKFVQSGSHLLFSAESFCWPDWGLAEQ 156
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG G R+LNSGGFIG+A I ++ K+++DDQL+Y L+LD LR K K+ LD
Sbjct: 157 YPEVGVGKRFLNSGGFIGFAPTIHRIVRQWKYKDDDDDQLFYTQLYLDPGLREKLKLSLD 216
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 217 HKSRIFQNLNGALDEVVLKFDQNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 275
Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CNL + L +P P VL++VF+++PT FL FL ++ L+YP +IS+
Sbjct: 276 WTPQGGCGFCNLNRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRISL 333
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFY 384
F++NN+ YH P D + F VK + ++S EAR++A+++ +FYF
Sbjct: 334 FLHNNEVYHEPHIADAWPQLQDHFSAVKLVGPEEALSSGEARDMAMDSCRQNPECEFYFS 393
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L NP+ L+ L+ +N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 394 LDADAVLTNPETLRILIEQNRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 453
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ K +++ + D DMAFC ++R+KGI L + +
Sbjct: 454 KR--VGLWNVPYISQAYVIRGETLRTELPEKEVFSSSDTDPDMAFCRSVRDKGIFLHLSN 511
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + ++D +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 512 QHEFGRLLSTSHYDTDHLHPDLWQIFDNPVDWREQYIHENYSRALDGEGLVEQPCPDVYW 571
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP++TE+ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 572 FPLLTEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 631
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN GVDYE
Sbjct: 632 TYVGPMTEHLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGVDYE 690
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 691 GGGCRFLRYDCRVSSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741
>gi|195326740|ref|XP_002030083.1| GM24765 [Drosophila sechellia]
gi|194119026|gb|EDW41069.1| GM24765 [Drosophila sechellia]
Length = 721
Score = 705 bits (1820), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/705 (48%), Positives = 473/705 (67%), Gaps = 12/705 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
DK V TVA+ TDGY R+I+SA V ++V TLGL + W GGDM GGG+K+NLL+ +
Sbjct: 27 DKIKVFTVATEPTDGYTRYIRSARVYDIEVTTLGLGEEWKGGDMQKPGGGFKLNLLREAI 86
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
D IIL TDSYDVII +++I E+F A I+F AE+ CWPD SL + YP V
Sbjct: 87 APYKNEPDTIILFTDSYDVIITTTLDEIFEKFKEAGARILFSAEKYCWPDKSLANDYPEV 146
Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
G R+LNSG FIGYA + L+ + I++ DDQLY+ +FLDET RTK + LD +
Sbjct: 147 EGKASRFLNSGAFIGYAPQVFALLED-PIEDTADDQLYFTKIFLDETKRTKLGLKLDVQS 205
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
LFQNL+G+ D+KL DL+ L N + T P IIHGNG SK++LN++GNYLA+++
Sbjct: 206 RLFQNLHGAKNDVKLKVDLESNQGVLQNVDFMTTPAIIHGNGLSKVDLNAYGNYLARTF- 264
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
C C ++L L+ P + +++ + +P F ++FL I +LNYP KK+ + +Y+
Sbjct: 265 NGVCLLCQ--ENLLDLEETNLPVISLALMVTQPVPFFDQFLEGIESLNYPKKKLHLLIYS 322
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
N +H +++ + + K+ ++ ++ R LA++ + D+ F+VD+D+
Sbjct: 323 NIAFHDDDIKSFVNKYGKEYATAKFALSTDELDERQGRQLALDKARLHQSDYIFFVDADA 382
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
H+D+ +VL+ L+ N+ +AP+ + + WSNFWGAL+ G+YARS DY++I+ +
Sbjct: 383 HIDDSEVLRELLRLNKQFVAPIFSKHKELWSNFWGALSEGGYYARSHDYVDIVKREL--I 440
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
G++NVP++T+ YL+K + A + K D DMA C +LRN GI + + + +GH
Sbjct: 441 GMFNVPHVTSIYLVKKTAFDAISFKH----KEFDPDMAMCESLRNAGIFMYASNLRIFGH 496
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
LV++++F+ T P+ Y L N +DW +YIHP Y L QPCPDV+WF IV++
Sbjct: 497 LVNADDFNTTVTRPDFYTLFSNEVDWTEKYIHPNYSLQLNESNKIQQPCPDVYWFQIVSD 556
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
FC + V IMEA+ WSDG+N+D RLE GYEAVPTRDIHMKQVGL ++ +FL+ +V PL
Sbjct: 557 AFCDDLVAIMEAHNGWSDGSNSDNRLEGGYEAVPTRDIHMKQVGLERLYLKFLQLFVRPL 616
Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
QER F GY H P RA M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DY+GGGCRF
Sbjct: 617 QERAFTGYFHNPPRALMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRF 676
Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
IRYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISF+DP
Sbjct: 677 IRYNCSVTDTKKGWMLMHPGRLTHYHEGLLVTNGTRYIMISFIDP 721
>gi|195589463|ref|XP_002084471.1| GD12814 [Drosophila simulans]
gi|194196480|gb|EDX10056.1| GD12814 [Drosophila simulans]
Length = 721
Score = 705 bits (1820), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/705 (48%), Positives = 474/705 (67%), Gaps = 12/705 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
DK V TVA+ TDGY R+I+SA V ++V TLGL + W GGDM GGG+K+NLL+ +
Sbjct: 27 DKIKVFTVATEPTDGYTRYIRSARVYDIEVTTLGLGEEWKGGDMQKPGGGFKLNLLREAI 86
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+ IIL TDSYDVII +++I E+F A I+F AE+ CWPD SL + YP V
Sbjct: 87 APYKNEPETIILFTDSYDVIITTTLDEIFEKFKEAGAKILFSAEKYCWPDKSLANDYPEV 146
Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
G R+LNSG FIGYA + L+ + I++ DDQLY+ +FLDET RTK + LD +
Sbjct: 147 EGKASRFLNSGAFIGYAPQVFALLED-PIEDTADDQLYFTKIFLDETKRTKLGLKLDVQS 205
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
LFQNL+G+ D+KL DL+ L N + T P IIHGNG SK++LN++GNYLA+++
Sbjct: 206 RLFQNLHGAKNDVKLKVDLESNQGVLQNVDFMTTPAIIHGNGLSKVDLNAYGNYLARTF- 264
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+ C C ++L L+ P + +++ + +P F ++FL I +LNYP KK+ + +Y+
Sbjct: 265 SGVCLLCQ--ENLLDLEETNLPVISLALMVTQPVPFFDQFLEGIESLNYPKKKLHLLIYS 322
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
N +H +++ + + K+ ++ ++ R LA++ + D+ F+VD+D+
Sbjct: 323 NVAFHDDDIKSFVNKYDKEYATAKFALSTDELDERQGRQLALDKARLHQSDYIFFVDADA 382
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
H+D+ +VL+ L+ N+ +AP+ + + WSNFWGAL+ G+YARS DY++I+ +
Sbjct: 383 HIDDSEVLRELLRLNKQFVAPIFSKHKELWSNFWGALSEGGYYARSHDYVDIVKREL--I 440
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
G++NVP++T+ YL+K + A + K D DMA C +LRN GI + + + +GH
Sbjct: 441 GMFNVPHVTSIYLVKKTAFDAISFKH----KEFDPDMAMCESLRNAGIFMYASNLRIFGH 496
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
LV++++F+ T P+ Y L N +DW +YIHP Y L QPCPDV+WF IV++
Sbjct: 497 LVNADDFNTTVTRPDFYTLFSNEIDWTEKYIHPNYSLQLNESNKIQQPCPDVYWFQIVSD 556
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
FC + V IMEA+ WSDG+N+D RLE GYEAVPTRDIHMKQVGL ++ +FL+ +V PL
Sbjct: 557 AFCDDLVAIMEAHNGWSDGSNSDNRLEGGYEAVPTRDIHMKQVGLERLYLKFLQLFVRPL 616
Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
QER F GY H P RA M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DY+GGGCRF
Sbjct: 617 QERAFTGYFHNPPRALMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRF 676
Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
IRYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISF+DP
Sbjct: 677 IRYNCSVTDTKKGWMLMHPGRLTHYHEGLLVTNGTRYIMISFIDP 721
>gi|442759307|gb|JAA71812.1| Putative procollagen-lysine2-oxoglutarate 5-dioxygenase [Ixodes
ricinus]
Length = 667
Score = 704 bits (1818), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/673 (50%), Positives = 458/673 (68%), Gaps = 10/673 (1%)
Query: 66 LHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDD--MIILVTDSYDVIIDGGVNDILER 122
+++ WLGGDM+ +GGGYKV LL+ +D DD +I++ DSYDV+ G +IL++
Sbjct: 1 MNEEWLGGDMARGMGGGYKVRLLRKA--AVDYKDDTSVILMFVDSYDVLFAAGAKEILKK 58
Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNE 182
F F+ N++F AE CWPD SL YP G R+LNSGG IGYA I E++++ +++E
Sbjct: 59 FYKFNTNVLFSAEGFCWPDQSLASSYPT-AKGNRFLNSGGIIGYAXXIYEIVTSAELEDE 117
Query: 183 EDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNT 242
DDQL+Y ++L+E LR K I LD A +FQNL G++ D++L LD +L N+ + T
Sbjct: 118 ADDQLFYTKIYLNEDLRKKWGIKLDHRAEIFQNLNGAVGDVEL-LGLDSEPYLHNSAFGT 176
Query: 243 NPVIIHGNGKSKIELNSFGNYLAKSWKT-SGCTRCNLIKHLDSLKPDQFPSVLISVFIDK 301
P++IHGNG SK+ LNSFGNYLAKSW + +GC C +P + P VLI +FI+
Sbjct: 177 VPLVIHGNGPSKVVLNSFGNYLAKSWNSLAGCRVCYDAFSPADKEPSELPRVLIGIFIEH 236
Query: 302 PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTV 361
PT FL E L+K+ NLNYP ++I +FV+N E+H D ++ + +++VK++ +
Sbjct: 237 PTPFLWEALSKVYNLNYPRERIDLFVHNAVEFHEEEVDKFVEQYGQSYRSVKHMRNEDGR 296
Query: 362 NSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
ARNLA+E + D+YF VDSD+HLDN D L+ L+ N +++APLL R WSN
Sbjct: 297 KEWHARNLALEECMKIKCDYYFSVDSDAHLDNGDTLRALIEMNRTVVAPLLSRHKNLWSN 356
Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS 481
FWGAL+ DG+YARS DY+ ++ G++ KG+WNVP+I YL+ +++ + +
Sbjct: 357 FWGALSTDGYYARSHDYVQLVKGER--KGLWNVPFINTVYLINGTLLHSKEKFPSFISGL 414
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIH 541
+D DMAFC N+R KGI + + + YGHLV+ E FD + NP+ YE+ N +DW+ RYIH
Sbjct: 415 LDPDMAFCKNMREKGIFMYVTNMDTYGHLVNPETFDLKLKNPDFYEIYSNQMDWERRYIH 474
Query: 542 PEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEA 601
Y K L PD + PCPDV+WFP+VT+ FC ++IME +GQWS G N D+RL GYE
Sbjct: 475 ENYSKVLEPDFKVDMPCPDVYWFPVVTDIFCRHMIEIMENFGQWSSGKNEDERLAGGYEN 534
Query: 602 VPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLR 661
VPTRDIHM QV W FLR+Y+ P+QE+ F+GY H+P RA M+FVVRY PDEQ LR
Sbjct: 535 VPTRDIHMNQVNFEQHWLFFLREYIKPVQEKVFLGYFHDPPRAIMNFVVRYHPDEQYFLR 594
Query: 662 PHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVT 721
PHHDSSTYTINIALN+ +DYEGGGC F+RYNC+V + GW LMHPGRLTHYHEGL VT
Sbjct: 595 PHHDSSTYTINIALNRPKIDYEGGGCNFLRYNCSVVDLKQGWSLMHPGRLTHYHEGLPVT 654
Query: 722 QGTRYIMISFVDP 734
+GTRYIM+SFVDP
Sbjct: 655 KGTRYIMVSFVDP 667
>gi|260786918|ref|XP_002588503.1| hypothetical protein BRAFLDRAFT_280606 [Branchiostoma floridae]
gi|229273666|gb|EEN44514.1| hypothetical protein BRAFLDRAFT_280606 [Branchiostoma floridae]
Length = 679
Score = 704 bits (1816), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/685 (49%), Positives = 465/685 (67%), Gaps = 7/685 (1%)
Query: 51 IQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
++SA+ +QV+ LG+HQ WLGGD+ +++GGG KV LLK L + D++I+ +DSYD
Sbjct: 1 MRSADKYNIQVQVLGMHQEWLGGDVQNNIGGGQKVLLLKEALKKYKDDKDLVIMFSDSYD 60
Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
VII +IL +F+ F+A +VFGAE CWPD +L D YP V G YLNSGGFIGYA +
Sbjct: 61 VIITAEKEEILRKFDDFNARVVFGAEGFCWPDRTLADLYPEVRLGKPYLNSGGFIGYASE 120
Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
+ +++S+ SI+N+ DDQLYY +FL+ LR K K+ LD + +FQN+ G+ D+ L FD
Sbjct: 121 LYQIVSHTSIQNQHDDQLYYTRIFLNPELREKFKMKLDHTSEIFQNMNGAGADLTLKFD- 179
Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQ 289
D+ L N YNT P IIHGNG K+ LN GNY+A SW C C ++ SLK D
Sbjct: 180 DDKTRLRNRVYNTEPCIIHGNGPQKLVLNHIGNYVADSWSFDECHSCK--ENTFSLKTDD 237
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMF 349
+P V+I +FI++PT F+ EFLNKI NL+YP KI +F++N++E+HA +++ N+ +
Sbjct: 238 YPVVVIGLFIEQPTPFVPEFLNKIYNLDYPKNKIVLFIHNHEEHHAGDVQEFVKNYGGDY 297
Query: 350 KNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIA 409
K V+ + + +N AR + + D+Y VD+D + NP L+ L+ +N S+IA
Sbjct: 298 KAVREVTPSMNMNQWYARKQGLSECIGVKCDYYLSVDADVQITNPKTLQILIQQNRSVIA 357
Query: 410 PLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK 469
P+ + K WSNFWGA+ DGFYARS DY++I+ G + KG+WNVPYI N YL+ S+++
Sbjct: 358 PMATKYGKLWSNFWGAIGDDGFYARSDDYIDIVQGTK--KGVWNVPYINNVYLIHGSLLQ 415
Query: 470 ATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELI 529
+ + +D DMAFC +LR KGI + + + +G L + ++ + +P+++++
Sbjct: 416 QPKTMPNFIVGQLDADMAFCASLREKGIFMYVTNMDTFGRLTTTTSYSTEHLHPDMWQMY 475
Query: 530 RNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGT 589
N DW+ +YIH ++ K L P T PCPDV+WFPIVTE FC V+ ME YG+WS G
Sbjct: 476 DNRPDWEEKYIHADFYKMLDPKTEVEMPCPDVYWFPIVTETFCKHLVEEMENYGEWSAGK 535
Query: 590 NNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFV 649
N D RL +GYE VPT DIHM Q+G W FL+++V LQE+ + GY+ E +A M+FV
Sbjct: 536 NEDLRLSSGYENVPTVDIHMNQIGFEREWLHFLKEFVTKLQEKVYPGYYSE-AQAIMNFV 594
Query: 650 VRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPG 709
VRY P EQP LRPHHDSST+TIN+ALN+ GVD+EGGGCRF+RYNC+VT T+MGW+LMHPG
Sbjct: 595 VRYHPQEQPFLRPHHDSSTFTINLALNKAGVDFEGGGCRFLRYNCSVTNTKMGWLLMHPG 654
Query: 710 RLTHYHEGLQVTQGTRYIMISFVDP 734
RLTHYHEGL T GTRYIMISFVDP
Sbjct: 655 RLTHYHEGLPTTNGTRYIMISFVDP 679
>gi|6755110|ref|NP_036092.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Mus
musculus]
gi|25008937|sp|Q9R0E1.1|PLOD3_MOUSE RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3;
AltName: Full=Lysyl hydroxylase 3; Short=LH3; Flags:
Precursor
gi|5880317|gb|AAD54618.1|AF046783_1 lysyl hydroxylase 3 [Mus musculus]
gi|15145782|gb|AAK00576.1| lysyl hydroxylase 3 [Mus musculus]
gi|26329015|dbj|BAC28246.1| unnamed protein product [Mus musculus]
gi|26354078|dbj|BAC40669.1| unnamed protein product [Mus musculus]
gi|28175483|gb|AAH43047.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Mus musculus]
gi|32493408|gb|AAH54734.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Mus musculus]
gi|148687344|gb|EDL19291.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Mus musculus]
Length = 741
Score = 704 bits (1816), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/711 (46%), Positives = 484/711 (68%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ DK LVITVA+ ET+GY+RF+QSAE V+TLGL Q W GGD++ ++GGG KV L
Sbjct: 37 VNPDKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGQEWRGGDVARTVGGGQKVRWL 96
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ DMII+ DSYDVI+ ++L++F ++++F AE CWP+ L ++
Sbjct: 97 KKEMEKYADQKDMIIMFVDSYDVILASSPTELLKKFVQSGSHLLFSAESFCWPEWGLAEQ 156
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG G R+LNSGGFIG+A I +++ + K+++DDQL+Y L+LD LR K K+ LD
Sbjct: 157 YPEVGMGKRFLNSGGFIGFAPTIHQIVRQWNYKDDDDDQLFYTQLYLDPGLREKLKLSLD 216
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 217 HKSRIFQNLNGALDEVILKFDQNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 275
Query: 268 W-KTSGCTRCN-LIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN ++ L +P P VL++VF+++PT FL FL ++ L+YP +IS+
Sbjct: 276 WTPQGGCGFCNQTLRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRISL 333
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFY 384
F++N++ YH P D + F VK + +++ EAR++A+++ +FYF
Sbjct: 334 FLHNSEVYHEPHIADAWPQLQDHFSAVKLVGPEEALSAGEARDMAMDSCRQNPECEFYFS 393
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L NP+ L+ L+ +N +IAP+L R K WSNFWGAL+ + +YARS DY+ ++
Sbjct: 394 LDADAVLTNPETLRVLIEQNRKVIAPMLSRHGKLWSNFWGALSPNEYYARSEDYVELVQR 453
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ K +++ + D DMAFC ++R+KGI L + +
Sbjct: 454 KR--VGVWNVPYISQAYVIRGETLRTELPQKEVFSSSDTDPDMAFCKSVRDKGIFLHLSN 511
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 512 QHEFGRLLATSRYDTDHLHPDLWQIFDNPVDWREQYIHENYSRALDGEGLVEQPCPDVYW 571
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP++TE+ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 572 FPLLTEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 631
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN GVDYE
Sbjct: 632 TYVGPMTEYLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGVDYE 690
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 691 GGGCRFLRYDCRISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741
>gi|12850403|dbj|BAB28704.1| unnamed protein product [Mus musculus]
Length = 741
Score = 704 bits (1816), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/711 (46%), Positives = 484/711 (68%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ DK LVITVA+ ET+GY+RF+QSAE V+TLGL Q W GGD++ ++GGG KV L
Sbjct: 37 VNPDKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGQEWRGGDVARTVGGGQKVRWL 96
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ DMII+ DSYDVI+ ++L++F ++++F AE CWP+ L ++
Sbjct: 97 KKEMEKYADQKDMIIMFVDSYDVILASSPTELLKKFVQSGSHLLFSAESFCWPEWGLAEQ 156
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG G R+LNSGGFIG+A I +++ + K+++DDQL+Y L+LD LR K K+ LD
Sbjct: 157 YPEVGMGKRFLNSGGFIGFAPTIHQIVRQWNYKDDDDDQLFYTQLYLDPGLREKLKLSLD 216
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 217 HKSRIFQNLNGALDEVILKFDQNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 275
Query: 268 W-KTSGCTRCN-LIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN ++ L +P P VL++VF+++PT FL FL ++ L+YP +IS+
Sbjct: 276 WTPQGGCGFCNQTLRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRISL 333
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFY 384
F++N++ YH P D + F VK + +++ EAR++A+++ +FYF
Sbjct: 334 FLHNSEVYHEPHIADAWPQLQDHFSAVKLVGPEEALSAGEARDMAMDSCRQNPECEFYFS 393
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L NP+ L+ L+ +N +IAP+L R K WSNFWGAL+ + +YARS DY+ ++
Sbjct: 394 LDADAVLTNPETLRVLIEQNRKVIAPMLSRHGKLWSNFWGALSPNEYYARSEDYVELVQR 453
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ K +++ + D DMAFC ++R+KGI L + +
Sbjct: 454 KR--VGVWNVPYISQAYVIRGETLRTELPQKEVFSSSDTDPDMAFCKSVRDKGIFLHLSN 511
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 512 QHEFGRLLATSRYDTDHLHPDLWQIFDNPVDWREQYIHENYSRALDGEGLVEQPCPDVYW 571
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP++TE+ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 572 FPLLTEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 631
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN GVDYE
Sbjct: 632 TYVGPMTEYLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGVDYE 690
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 691 GGGCRFLRYDCRISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741
>gi|24662591|ref|NP_648451.1| procollagen lysyl hydroxylase, isoform A [Drosophila melanogaster]
gi|24662595|ref|NP_729687.1| procollagen lysyl hydroxylase, isoform B [Drosophila melanogaster]
gi|7294743|gb|AAF50079.1| procollagen lysyl hydroxylase, isoform A [Drosophila melanogaster]
gi|23093644|gb|AAN11883.1| procollagen lysyl hydroxylase, isoform B [Drosophila melanogaster]
Length = 721
Score = 700 bits (1806), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/705 (48%), Positives = 471/705 (66%), Gaps = 12/705 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
DK V TVA+ TDGY R+I+SA V ++V TLGL + W GGDM GGG+K+NLL+ +
Sbjct: 27 DKIKVFTVATEPTDGYTRYIRSARVYDIEVTTLGLGEEWKGGDMQKPGGGFKLNLLREAI 86
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+ IIL TDSYDVII +++I E+F A I+F AE+ CWPD SL + YP V
Sbjct: 87 APYKNEPETIILFTDSYDVIITTTLDEIFEKFKESGAKILFSAEKYCWPDKSLANDYPEV 146
Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
G R+LNSG FIGYA + L+ + I++ DDQLY+ +FLDET R K + LD +
Sbjct: 147 EGKASRFLNSGAFIGYAPQVFALLVD-PIEDTADDQLYFTKIFLDETKRAKLGLKLDVQS 205
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
LFQNL+G+ D+KL DL+ L N + T P IIHGNG SK++LN++GNYLA+++
Sbjct: 206 RLFQNLHGAKNDVKLKVDLESNQGVLQNVDFMTTPSIIHGNGLSKVDLNAYGNYLARTF- 264
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
C C ++L L+ P + +++ + +P F ++FL I +LNYP +K+ + +Y+
Sbjct: 265 NGVCLLCQ--ENLLDLEETNLPVISLALMVTQPVPFFDQFLEGIESLNYPKEKLHLLIYS 322
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
N +H +++ + K+ ++ ++ R LA++ + D+ F+VD+D+
Sbjct: 323 NVAFHDDDIKSFVNKHAKEYATAKFALSTDELDERQGRQLALDKARLHQSDYIFFVDADA 382
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
H+D+ +VL+ L+ N+ +AP+ + + WSNFWGAL+ G+YARS DY++I+ +
Sbjct: 383 HIDDGEVLRELLRLNKQFVAPIFSKHKELWSNFWGALSEGGYYARSHDYVDIVKREL--I 440
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
G++NVP++T+ YL+K + A + K D DMA C +LRN GI + + + +GH
Sbjct: 441 GMFNVPHVTSIYLVKKTAFDAISFKH----KEFDPDMAMCESLRNAGIFMYASNLRIFGH 496
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
LV++++F+ T P+ Y L N +DW +YIHP Y L QPCPDV+WF IV++
Sbjct: 497 LVNADDFNTTVTRPDFYTLFSNEIDWTEKYIHPNYSLQLNESNKIQQPCPDVYWFQIVSD 556
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
FC + V IMEA+ WSDG+NND RLE GYEAVPTRDIHMKQVGL ++ +FL+ +V PL
Sbjct: 557 AFCDDLVAIMEAHNGWSDGSNNDNRLEGGYEAVPTRDIHMKQVGLERLYLKFLQMFVRPL 616
Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
QER F GY H P RA M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DY+GGGCRF
Sbjct: 617 QERAFTGYFHNPPRALMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRF 676
Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
IRYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISF+DP
Sbjct: 677 IRYNCSVTDTKKGWMLMHPGRLTHYHEGLLVTNGTRYIMISFIDP 721
>gi|338712640|ref|XP_001504506.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Equus
caballus]
Length = 829
Score = 699 bits (1803), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 328/711 (46%), Positives = 481/711 (67%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY+RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 125 VNPEKLLVITVATAETEGYRRFLRSAEFFNYTVRTLGLGEDWRGGDVARTVGGGQKVRWL 184
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DM+I+ DSYDVI+ G +++L++F + ++F AE CWP+ L ++
Sbjct: 185 KKEMEKYADREDMVIMFVDSYDVILAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 244
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 245 YPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 304
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+ K
Sbjct: 305 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPKG 363
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC C+L + L +P P VL++VF+++PT FL FL ++ L+YP ++++
Sbjct: 364 WTPEGGCGYCDLDRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVAL 421
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ YH P D + F VK + + EAR++A+++ +FYF
Sbjct: 422 FLHNNEVYHEPHIADSWPQLQDHFSAVKLVGPEEALTPGEARDMAMDSCRQDPKCEFYFS 481
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ + NP L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 482 LDADAVITNPQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 541
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ K +++ + D DMAFC +LR++GI L + +
Sbjct: 542 KR--VGVWNVPYISQAYVIRGETLRTELPQKEVFSSSDTDPDMAFCKSLRDQGIFLHLSN 599
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D +P+++++ NPLDW +YIH Y ++L + QPCPDV+W
Sbjct: 600 RHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGKGLVEQPCPDVYW 659
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++++ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 660 FPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGFEDQWLQLLR 719
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 720 TYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 778
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 779 GGGCRFLRYDCVVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 829
>gi|195493375|ref|XP_002094389.1| GE20228 [Drosophila yakuba]
gi|194180490|gb|EDW94101.1| GE20228 [Drosophila yakuba]
Length = 727
Score = 698 bits (1802), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/705 (47%), Positives = 470/705 (66%), Gaps = 12/705 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
+K V TVA+ TDGY R+ +SA V ++V TLGL + W GGDM GGG+K+NLL+ +
Sbjct: 33 EKIKVFTVATEPTDGYNRYARSARVYDIEVTTLGLGEEWKGGDMQRPGGGFKLNLLREAI 92
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+ IIL TDSYDVII +++I E+F A I+F AE+ CWPD SL + YP V
Sbjct: 93 APYKNDPETIILFTDSYDVIITTTLDEIFEKFKEAGAKILFSAEKYCWPDKSLANDYPEV 152
Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
G R+LNSG F+GYA + L+ + I++ DDQLY+ +FLDE RTK + LD +
Sbjct: 153 EGKASRFLNSGAFMGYAPQVYALLED-PIEDTADDQLYFTKIFLDEAKRTKLGLKLDVKS 211
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
LFQNL+G+ D+KL DLD L N + T P IIHGNG SK++LN++GNYLA+++
Sbjct: 212 RLFQNLHGAKNDVKLKVDLDSNQGVLQNVDFMTTPAIIHGNGLSKVDLNAYGNYLARTF- 270
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+ C C ++L L+ + P + +++ + + F ++FL I LNYP K+ + +Y+
Sbjct: 271 SGVCLLCQ--ENLLDLEETKLPVISLALMVTQAVPFFDQFLEGIETLNYPKDKLHLLIYS 328
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
N +H +++ + K+ ++ ++ R LAV+ + D+ F+VD+D+
Sbjct: 329 NVAFHDDDIKSFVNKHAKEYATAKFALSTDELDERQGRQLAVDKARLHQSDYIFFVDADA 388
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
H+D+ +VL+ L+ N+ +AP+ +P + WSNFWGAL+ G+YARS DY++I+ +
Sbjct: 389 HIDDSEVLRELLRLNKQFVAPIFSKPKELWSNFWGALSEGGYYARSHDYVDIVKREL--I 446
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
G++NVP++T+ YL+K++ A + K D DMA C +LRN GI + + + +GH
Sbjct: 447 GMFNVPHVTSIYLVKSTAFDAISFKH----KEFDPDMAMCESLRNAGIFMYASNLRIFGH 502
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
LV++++F+ T P+ Y L N +DW +YIHP Y L QPCPDV+WF IV++
Sbjct: 503 LVNADDFNTTVTRPDFYTLFSNEIDWTEKYIHPNYSLQLNESYKLQQPCPDVYWFQIVSD 562
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
FC + V IMEA+ WSDG+N+D RLE GYEAVPTRDIH KQVGL ++ +FL+ +V PL
Sbjct: 563 AFCDDLVAIMEAHNGWSDGSNSDSRLEGGYEAVPTRDIHTKQVGLERLYLKFLQLFVRPL 622
Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
QER F GY H P RA M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DY+GGGCRF
Sbjct: 623 QERAFTGYFHNPPRALMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRF 682
Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
IRYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISF+DP
Sbjct: 683 IRYNCSVTETKKGWMLMHPGRLTHYHEGLLVTNGTRYIMISFIDP 727
>gi|357619307|gb|EHJ71931.1| putative procollagen-lysine,2-oxoglutarate 5-dioxygenase [Danaus
plexippus]
Length = 660
Score = 697 bits (1800), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/665 (50%), Positives = 461/665 (69%), Gaps = 10/665 (1%)
Query: 75 MSSLGGGYKVNLLKNELDEMDITDD--MIILVTDSYDVIIDGGVNDILERFNTFDANIVF 132
M GGG+KVNLLK++L M I +D IIL TDSYDV+ G +++I+++F ++F
Sbjct: 1 MKHEGGGHKVNLLKDKLSSMKIPEDRDQIILFTDSYDVMFLGSLDEIVQKFLAMSVRVLF 60
Query: 133 GAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALL 192
AE CWPD+SL +YP +LNSGGFIGY ++ ++++ ++ N++DDQL+Y +
Sbjct: 61 SAEPFCWPDSSLASQYPDSQQLNPFLNSGGFIGYLPELLKILNYETVGNKDDDQLFYTKV 120
Query: 193 FLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFD-LDEFVHLTNTKYNTNPVIIHGNG 251
+LDE R +I LD + +FQNL+G+L D++L + DE+ +L N P+I+HGNG
Sbjct: 121 YLDEDYRESLRISLDHKSAIFQNLHGALSDVQLVANSTDEWPYLVNVVTKQRPLIVHGNG 180
Query: 252 KSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFL 310
+K+ LN+ NYLAKSW S GC C+ + + L D+ P V++SVFI+ T F+EEF
Sbjct: 181 PAKLTLNNLSNYLAKSWSVSEGCVLCDEKRIV--LDEDKLPKVMLSVFIEVATPFIEEFF 238
Query: 311 NKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLA 370
I ++YP +KI +F+ N EYH +++ + + K I V EARN+A
Sbjct: 239 QSILAIDYPKQKIHLFIRNGVEYHESEVENFYQAHSSEYFTAKRIKSTDLVGEAEARNIA 298
Query: 371 VENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADG 430
+ + D+ F +DS + ++ PD L YL++ ++APLLVR +AWSNFWGA+N+ G
Sbjct: 299 KDRCIGSDCDYLFCLDSHARVE-PDTLHYLLSTGYDVVAPLLVRSGQAWSNFWGAINSVG 357
Query: 431 FYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-YTLNSMDYDMAFC 489
FY+RS DYM+I+N + +GIWNVP+I NCYLM S+ + + K + Y D DMAFC
Sbjct: 358 FYSRSADYMDIVN--RSIEGIWNVPFINNCYLMNISLFRKPSAKHVSYLKEDTDPDMAFC 415
Query: 490 TNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLL 549
+LR+ GI + + + +E+GHLV+SE FD +TNP++Y++I N LDW+ RY+HP+Y +
Sbjct: 416 ASLRSAGIMMYVSNEKEFGHLVNSETFDVSRTNPDIYQVIDNKLDWEQRYLHPKYHEIFA 475
Query: 550 PDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHM 609
PCPDV+WFP+++ +FC E++++MEA+GQWSDG+NNDKRLE+GYEAVPTRDIHM
Sbjct: 476 NKEKQLMPCPDVYWFPLMSMRFCKEWIEVMEAFGQWSDGSNNDKRLESGYEAVPTRDIHM 535
Query: 610 KQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY 669
QVGL W L+ YV PLQE F GY+H P + M+FVVRYRPDEQPSLRPHHDSSTY
Sbjct: 536 NQVGLDIQWLRILKDYVRPLQELVFTGYYHNPPVSVMNFVVRYRPDEQPSLRPHHDSSTY 595
Query: 670 TINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
TIN+ALN +DYEGGGCRFIRYNC+V T+ GW+LMHPGRLTH+HEGL VT+GTRYIMI
Sbjct: 596 TINLALNTPHLDYEGGGCRFIRYNCSVKDTKPGWLLMHPGRLTHFHEGLLVTKGTRYIMI 655
Query: 730 SFVDP 734
SFVDP
Sbjct: 656 SFVDP 660
>gi|223647994|gb|ACN10755.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Salmo
salar]
Length = 735
Score = 695 bits (1793), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/736 (45%), Positives = 488/736 (66%), Gaps = 13/736 (1%)
Query: 8 NCLILSCVVFFISVHCN---KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTL 64
+C+ + CV+ + + + + I D LVITVA+ +TDG+ RF+++++ VK L
Sbjct: 4 SCIAVVCVLLLGWMQSSLGAEQRVISPDNLLVITVATEDTDGFTRFMRTSKEFNYTVKVL 63
Query: 65 GLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
GL + W GGD++ ++GGG KV LK EL + D++IL DSYDVI+ G ++L +F
Sbjct: 64 GLGEQWKGGDVARTVGGGQKVRWLKTELLKHSDKKDLVILFVDSYDVILASGPEELLWKF 123
Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
+ +VF AE CWPD L KYPAV +G RYLNSGGFIG+A ++ E++ K+ +
Sbjct: 124 SRLGHRMVFSAEGFCWPDQKLAPKYPAVHTGKRYLNSGGFIGFAPELSEIVQQWKHKDND 183
Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
DDQL+Y ++LD+ RTK+ + LD + +FQNL G++E++ L F+ V N Y+T
Sbjct: 184 DDQLFYTKIYLDKVQRTKYNMTLDHRSRIFQNLNGAIEEVVLKFEKAR-VRARNVAYDTL 242
Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDK 301
PVIIHGNG +K++LN GNY+ +W +GC C+ + +L+ + P V +SVFI +
Sbjct: 243 PVIIHGNGPTKLQLNYLGNYVPTAWTHETGCGICDDDLVYLNDTPDEDMPLVYLSVFIVQ 302
Query: 302 PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTV 361
PT FLEEFL ++ +LNYP +I +F++NN YH + + +F + + +
Sbjct: 303 PTPFLEEFLERLTSLNYPTSRIRLFIHNNVVYHEQHIQRFWEKHRVLFPEARLVGPEENL 362
Query: 362 NSKEARNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
+AR +AVE ++YF +D+D + NPDVL+ L+ N+S+IAP+L R K WS
Sbjct: 363 QQDQARTMAVEACQKDHQCEYYFSIDADVVIVNPDVLRVLIEENKSVIAPMLSRHGKLWS 422
Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTL 479
NFWGAL+ +GFY+RS DY++I+ G + G+WNVPYIT Y++K SV++ + ++Y+
Sbjct: 423 NFWGALSPEGFYSRSEDYIDIVQGKR--IGLWNVPYITQVYMIKGSVLRGRLSQVSLYSQ 480
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRY 539
MD DM FC +R++G+ + + + E+G LV S N++ + +P+++++ NPLDW +Y
Sbjct: 481 EGMDPDMVFCRAVRDQGVFMFVSNRDEFGRLVSSSNYNTSRLHPDMWQIFDNPLDWKDKY 540
Query: 540 IHPEYQKSLLPD-TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETG 598
IH Y + + TV QPCPDV+WFP T++ C + V+ ME +G+WS G + D+RL G
Sbjct: 541 IHENYSQIFEDNQTVVEQPCPDVYWFPSFTDRMCDDLVETMEDFGEWSGGRHTDERLAGG 600
Query: 599 YEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQP 658
YE VPT DIHM Q+G W +FL++Y+ P+ E+ + GY+ + +A M+FVVRYRPDEQP
Sbjct: 601 YENVPTVDIHMNQIGFEKEWLKFLKEYISPVTEKLYPGYYPK-AQAVMNFVVRYRPDEQP 659
Query: 659 SLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGL 718
SLRPHHDSST+TIN+ALN G+DY+GGGCRF+RY+CNV A R GW MHPGRLTHYHEGL
Sbjct: 660 SLRPHHDSSTFTINVALNNKGLDYQGGGCRFLRYDCNVEAPRKGWSFMHPGRLTHYHEGL 719
Query: 719 QVTQGTRYIMISFVDP 734
T GTRYIM+SFVDP
Sbjct: 720 PTTSGTRYIMVSFVDP 735
>gi|300795072|ref|NP_001180184.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Bos
taurus]
gi|296473078|tpg|DAA15193.1| TPA: procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Bos
taurus]
Length = 751
Score = 694 bits (1790), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 326/711 (45%), Positives = 479/711 (67%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY+RF+QSAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 47 VNPEKMLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 106
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DM+I+ DSYDV++ G +++L++F + ++F AE CWP+ L ++
Sbjct: 107 KKEMEKYAEREDMVIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 166
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 167 YPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLGLSLD 226
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L F + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 227 HKSRIFQNLNGALDEVVLKFGRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 285
Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P VL++VF+++PT FL FL ++ L+YP ++++
Sbjct: 286 WTPEGGCGFCNQGRRPLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 343
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ YH P D+ + F VK + + EAR++A++ +FYF
Sbjct: 344 FLHNNEVYHEPHIDESWPQLQDHFSAVKLVGPEEALTPGEARDMAMDICRQDPKCEFYFS 403
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ + NP L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 404 LDADTVITNPQTLRILIEANRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 463
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC +LR+KGI L + +
Sbjct: 464 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 521
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D +P+++++ NPLDW +YIH Y ++L + + QPCPDV+W
Sbjct: 522 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYTRALEGEGLVEQPCPDVYW 581
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 582 FPLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 641
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 642 TYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 700
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 701 GGGCRFLRYDCVISSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 751
>gi|440908420|gb|ELR58434.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Bos grunniens
mutus]
Length = 751
Score = 694 bits (1790), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 326/711 (45%), Positives = 479/711 (67%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY+RF+QSAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 47 VNPEKMLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 106
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DM+I+ DSYDV++ G +++L++F + ++F AE CWP+ L ++
Sbjct: 107 KKEMEKYAEREDMVIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 166
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 167 YPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLGLSLD 226
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L F + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 227 HKSRIFQNLNGALDEVVLKFGRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 285
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P VL++VF+++PT FL FL ++ L+YP ++++
Sbjct: 286 WTPEGGCGFCNHGRRPLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 343
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ YH P D+ + F VK + + EAR++A++ +FYF
Sbjct: 344 FLHNNEVYHEPHIDESWPQLQDHFSAVKLVGPEEALTPGEARDMAMDICRQDPKCEFYFS 403
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ + NP L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 404 LDADTVITNPQTLRILIEANRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 463
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC +LR+KGI L + +
Sbjct: 464 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 521
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D +P+++++ NPLDW +YIH Y ++L + + QPCPDV+W
Sbjct: 522 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYTRALEGEGLVEQPCPDVYW 581
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 582 FPLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 641
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 642 SYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 700
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 701 GGGCRFLRYDCVISSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 751
>gi|194868950|ref|XP_001972362.1| GG13929 [Drosophila erecta]
gi|190654145|gb|EDV51388.1| GG13929 [Drosophila erecta]
Length = 727
Score = 693 bits (1789), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/705 (47%), Positives = 470/705 (66%), Gaps = 12/705 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
+K V TVA+ TDGY R+ +SA V ++V TLGL + W GGDM GGG+K+NLL+ +
Sbjct: 33 EKIKVFTVATEPTDGYTRYFRSARVYDIEVTTLGLGEEWKGGDMQRPGGGFKLNLLREAI 92
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+ IIL TDSYDVII +++I E+F A I+F AE+ CWPD SL + YP V
Sbjct: 93 APYKNDPETIILFTDSYDVIITTTLDEIFEKFKEAGAKILFSAEKFCWPDKSLANDYPEV 152
Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
G R+LNSG F+GYA + L+ + I++ DDQLY+ +FLDE RTK + LD +
Sbjct: 153 EGKASRFLNSGAFMGYAPQVFALLED-PIEDTADDQLYFTKIFLDEAKRTKLGLKLDVKS 211
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
LFQNL+G+ D+KL DL+ L N + T P IIHGNG SK++LN++ NYLA+++
Sbjct: 212 RLFQNLHGAKNDVKLKVDLESNQGVLQNVDFMTTPAIIHGNGLSKVDLNAYSNYLARTF- 270
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+ C C ++L L+ P + +++ + + F ++FL I +LNYP +K+ + +Y+
Sbjct: 271 SGVCLLCQ--ENLLDLEETNLPVISVALMVTQAVPFFDQFLKGIESLNYPKEKLHLLIYS 328
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
N +H +++ + + K+ ++ ++ R LA++ + D+ F+VD+D+
Sbjct: 329 NVAFHDDDIKSFVNKYAKEYATAKFALSTDELDERQGRQLALDKARLHQSDYIFFVDADA 388
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
H+D+ +VL+ L+ N+ +AP+ + + WSNFWGAL+ G+YARS DY++I+ +
Sbjct: 389 HIDDSEVLRELLRLNKQFVAPIFSKHNELWSNFWGALSEGGYYARSHDYVDIVKREL--I 446
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
G++NVP++T+ YL+K + A + Y D DMA C +LRN GI + + + +GH
Sbjct: 447 GMFNVPHVTSIYLVKNTAFDAIS----YKHKEFDPDMAMCESLRNAGIFMYASNLRIFGH 502
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
LV++++F+ T P+ Y L N +DW +YIHP Y L QPCPDV+WF IV++
Sbjct: 503 LVNADDFNTTVTRPDFYTLFSNEIDWTEKYIHPNYSLQLNESNKLQQPCPDVYWFQIVSD 562
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
FC + V IMEA+ WSDG+N+D RLE GYEAVPTRDIHMKQVGL ++ +FL+ +V PL
Sbjct: 563 AFCDDLVAIMEAHNGWSDGSNSDNRLEGGYEAVPTRDIHMKQVGLERLYLKFLQLFVRPL 622
Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
QER F GY H P RA M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DY+GGGCRF
Sbjct: 623 QERAFTGYFHNPPRALMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRF 682
Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
IRYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISF+DP
Sbjct: 683 IRYNCSVTETKKGWMLMHPGRLTHYHEGLLVTNGTRYIMISFIDP 727
>gi|403285799|ref|XP_003934198.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
[Saimiri boliviensis boliviensis]
Length = 736
Score = 693 bits (1789), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/711 (45%), Positives = 478/711 (67%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 32 VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 91
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ GG ++L++F + ++F AE CWP+ L ++
Sbjct: 92 KKEMEKYADREDMIIMFVDSYDVILAGGPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 151
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A + ++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 152 YPEVGTGKRFLNSGGFIGFATTVHHIVRQWRYKDDDDDQLFYTRLYLDPGLREKLGLSLD 211
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 212 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 270
Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P V ++VF+++PT FL FL ++ L+YP ++I++
Sbjct: 271 WTPEGGCGFCNRDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPHERITL 328
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ +H P +D + F K + ++ EAR++A++ +FYF
Sbjct: 329 FLHNNEVFHEPHIEDSWPQLQDHFAATKLVGPEEALSPGEARDMAMDMCRQDPECEFYFS 388
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L NP L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 389 LDADAVLTNPQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 448
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI L + +
Sbjct: 449 KR--VGVWNVPYISQAYVIRGETLRMELPQREVFSGSDTDPDMAFCKSFRDKGIFLHLSN 506
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D + +P+++++ NP+DW +Y+H Y ++L + + QPCPDV+W
Sbjct: 507 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWQEQYVHENYSRALEGEGIVEQPCPDVYW 566
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 567 FPLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 626
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 627 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 685
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RYNC +++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 686 GGGCRFLRYNCVISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 736
>gi|417404299|gb|JAA48909.1| Putative procollagen-lysine2-oxoglutarate 5-dioxygenase 3 [Desmodus
rotundus]
Length = 741
Score = 691 bits (1783), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/711 (45%), Positives = 479/711 (67%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ +T+GY+RF+QSAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 37 VNPEKLLVITVATAKTEGYRRFLQSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 96
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DM+IL DSYDVI+ G +++L++F + ++F AE CWP+ L ++
Sbjct: 97 KKEMEKYADQEDMVILFVDSYDVILAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 156
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 157 YPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 216
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 217 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 275
Query: 268 WKTSG-CTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W G C C+ + L +P P VL++VF+++PT FL FL ++ L+YP +I++
Sbjct: 276 WTPGGGCGFCDRDRRILPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLMLLDYPPDRITL 333
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ YH P D + F VK + + EAR++A+++ +FYF
Sbjct: 334 FLHNNEVYHEPHIADSWPQLQDHFSAVKLVGPEEALTPGEARDMAMDSCRQDPECEFYFS 393
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ + N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 394 LDADAVITNLQALRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 453
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC +LR KGI L + +
Sbjct: 454 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLREKGIFLHLSN 511
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D +P+++++ NPLDW +YIH Y ++L + + QPCPDV+W
Sbjct: 512 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHKNYSRALEGEGLVEQPCPDVYW 571
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++++ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 572 FPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 631
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E+ F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 632 TYVGPMTEKLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 690
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C ++A R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 691 GGGCRFLRYDCVISAPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741
>gi|344289632|ref|XP_003416546.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
[Loxodonta africana]
Length = 741
Score = 689 bits (1779), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/712 (45%), Positives = 480/712 (67%), Gaps = 11/712 (1%)
Query: 28 NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNL 86
++ ++ LVITVA+ ET+GY+RF++SAE V+TLGL + W GGD++ ++GGG KV
Sbjct: 36 RVNPERLLVITVATAETEGYRRFLRSAEFFNYTVRTLGLGKEWRGGDVARTVGGGQKVRW 95
Query: 87 LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
LK E+++ +DM+I+ DSYDVI+ G ++L++F ++++F AE CWP+ L +
Sbjct: 96 LKKEMEKYRDQEDMVIMFVDSYDVILAGSPTELLKKFVQSGSHLLFSAESFCWPEWGLAE 155
Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
+YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K ++ L
Sbjct: 156 QYPEVGTGKRFLNSGGFIGFAPIIHQIVRQWKYKDDDDDQLFYTQLYLDPGLREKLRLSL 215
Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
D + +FQNL G+L+++ L FD + V + N Y+T PV+IHGNG +K++LN GNY+
Sbjct: 216 DHKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVIHGNGPTKLQLNYLGNYVPS 274
Query: 267 SW-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKIS 324
W GC CN + L +P P VL++VF+++PT FL FL ++ L+YP +++
Sbjct: 275 GWTPEGGCGFCNKDQRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVT 332
Query: 325 MFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK-GVDFYF 383
+F++NN+ YH P D + F VK + + EAR++A+++ +FYF
Sbjct: 333 LFLHNNEVYHEPHIADAWPQLQDHFSVVKLVGPEEALTPGEARDMAMDSCRQDLSCEFYF 392
Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
+D+D+ + N L+ L+ + +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 393 SLDADAVITNQQTLRILIEEDRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQ 452
Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+ G+WNVPYI+ Y+++ ++ K +++ + D DMAFC +LR+KG+ L +
Sbjct: 453 RKR--VGVWNVPYISQAYVIRGETLRTELPQKEVFSSSDTDPDMAFCKSLRDKGVFLHLS 510
Query: 503 STQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
+ E+G L+ + +D +P+++++ NPLDW +YIH Y ++L + + QPCPDV+
Sbjct: 511 NQHEFGRLLATSRYDIDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGMVEQPCPDVY 570
Query: 563 WFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFL 622
WFP+++++ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + L
Sbjct: 571 WFPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLL 630
Query: 623 RKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDY 682
R YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DY
Sbjct: 631 RTYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDY 689
Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
EGGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 690 EGGGCRFLRYDCVVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741
>gi|444715597|gb|ELW56462.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Tupaia
chinensis]
Length = 744
Score = 689 bits (1778), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/712 (45%), Positives = 482/712 (67%), Gaps = 11/712 (1%)
Query: 28 NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNL 86
++ +K LVITVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV
Sbjct: 39 RVNPEKLLVITVATAETEGYHRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRW 98
Query: 87 LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
LK E+++ +DM+I+ DSYDV++ G +++L++F ++++F AE CWP+ L +
Sbjct: 99 LKKEMEKYADREDMVIMFVDSYDVVLAGSPSELLKKFVQSGSHLLFSAESFCWPEWGLAE 158
Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
+YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD +R K ++ L
Sbjct: 159 QYPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGVREKLRLNL 218
Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
D + +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 219 DHKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPN 277
Query: 267 SW-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKIS 324
W GC CN + L +P P VL++VF+++PT FL FL ++ L+YP +++
Sbjct: 278 GWTPEGGCGFCNRDRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVT 335
Query: 325 MFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYF 383
+F++NN+ YH P D+ + F +VK + ++ EAR++A+++ +FYF
Sbjct: 336 LFLHNNEVYHEPHIADFWPELQDHFSDVKLVGPEEALSPGEARDMAMDSCRQDPECEFYF 395
Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
+D+D+ L N L+ L+ + +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 396 SLDADTVLTNQQTLRILIEEDRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQ 455
Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC LR+KGI L +
Sbjct: 456 RKR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKTLRDKGIFLHLS 513
Query: 503 STQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
+ E+G L+ + +D +P+++++ NPLDW +YIH Y ++L + + QPCPDV+
Sbjct: 514 NQHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGIVEQPCPDVY 573
Query: 563 WFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFL 622
WFP+++++ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + L
Sbjct: 574 WFPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLL 633
Query: 623 RKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDY 682
R YV P+ E F GYH + RA M+FVVRYRPDEQPSL+PHHDSST+T+N+ALN G+DY
Sbjct: 634 RTYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLQPHHDSSTFTLNVALNHKGLDY 692
Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
EGGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 693 EGGGCRFLRYDCVVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 744
>gi|431898207|gb|ELK06902.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Pteropus alecto]
Length = 743
Score = 689 bits (1778), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 328/711 (46%), Positives = 478/711 (67%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ +T+GY+RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 39 VNLEKLLVITVATAQTEGYRRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 98
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DM+I+ DSYDVI+ G +++L++F + ++F AE CWP+ L ++
Sbjct: 99 KKEMEKYADREDMVIMFVDSYDVILAGSPSELLKKFIQSGSRLLFSAEGFCWPEWGLAEQ 158
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 159 YPEVGTGKRFLNSGGFIGFAPTIHQIVHQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 218
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 219 HKSRIFQNLNGALDEVILKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 277
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC C+L + L KP P VL++VF+++PT FL FL ++ L+YP +I++
Sbjct: 278 WTPEGGCGFCDLDRRTLPGGKPP--PRVLLAVFVEQPTPFLPRFLQRLMLLDYPPNRITL 335
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ YH P D + F VK + + EAR++A+++ +FYF
Sbjct: 336 FLHNNEVYHEPHIADSWPQLQNHFSAVKLVGPEEALTPGEARDMAMDSCRQDPECEFYFS 395
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ + NP L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 396 LDADAVITNPKTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 455
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC R+KGI L + +
Sbjct: 456 KR--VGVWNVPYISQAYVIQGETLRTELPQREVFSGSDTDPDMAFCKTWRDKGIFLHLSN 513
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D +P+++++ NPLDW +YIH Y +L + + QPCPDV+W
Sbjct: 514 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSLALEGEGLVEQPCPDVYW 573
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++++ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 574 FPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 633
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 634 TYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 692
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C V+A R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 693 GGGCRFLRYDCVVSAPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 743
>gi|327285113|ref|XP_003227279.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
[Anolis carolinensis]
Length = 741
Score = 689 bits (1778), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/716 (45%), Positives = 471/716 (65%), Gaps = 10/716 (1%)
Query: 24 NKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGY 82
N V + K LV+T A++ET+GY+RF+++A+ VKTLGL + W GGD++ ++GGG
Sbjct: 31 NPVDPVSPGKLLVLTAATDETEGYQRFLRTAKFFNYTVKTLGLGEDWKGGDVARTVGGGQ 90
Query: 83 KVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
KV LKNE+ + +D+I++ DSYDVI+ G ++L +F F + +VF AE CWP+
Sbjct: 91 KVRWLKNEMKKYANEEDLIVMFVDSYDVILAGSPIELLWKFRHFKSKLVFSAESFCWPEW 150
Query: 143 SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKH 202
SL +KYPAV G R+LNSGGFIGYA I ++ K+ +DDQL+Y ++LD LR KH
Sbjct: 151 SLAEKYPAVAVGKRFLNSGGFIGYAPTINRIVQMWKYKDNDDDQLFYTRIYLDPGLREKH 210
Query: 203 KIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGN 262
I LD + +FQNL G++E++ L F+ V N Y+T PV+IHGNG +K++LN GN
Sbjct: 211 GITLDHKSKIFQNLNGAIEEVVLKFEPTR-VRARNVAYDTLPVVIHGNGPTKLQLNYLGN 269
Query: 263 YLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPA 320
Y+ +W GC C+ + L L + +P VL+ VFI++PT F +FL ++ +YP
Sbjct: 270 YVPNAWTYEGGCGTCDQGLLDLSDLPDESYPRVLVGVFIEQPTPFFPQFLQRLLTFDYPY 329
Query: 321 KKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGV- 379
+S+F++N YH + F ++K + ++ EAR++A++
Sbjct: 330 SHLSLFIHNRVVYHEQHIQAEWEQLREAFDSIKLVGPEEDISEGEARDIAMDLCRQDTTC 389
Query: 380 DFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYM 439
D+YF +D+D + NP++L+ L+ N+ +IAP++ R K WSNFWGAL+ D +YARS DY+
Sbjct: 390 DYYFSLDADVVVTNPEILQILIQENKKVIAPMMSRHGKLWSNFWGALSPDEYYARSEDYV 449
Query: 440 NIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIH 498
++ G + G+WNVPYI+ YL++ ++ + I+TL+ D DMAFC ++R KGI
Sbjct: 450 ELVQGKR--IGMWNVPYISQAYLLRGETLRQELPQRNIFTLDDTDPDMAFCKSVREKGIF 507
Query: 499 LKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPC 558
L I + E+G L+ + ++ + +P+++++ NPLDW +YIH Y + +L + QPC
Sbjct: 508 LHISNRDEFGRLLSTSRYNTSRLHPDLWQISENPLDWQDKYIHENYSR-VLEGEYHEQPC 566
Query: 559 PDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW 618
PDV+WFP+ T++ C E V+ E +GQWS G + D RL GYE VPT DIHM Q+ W
Sbjct: 567 PDVYWFPVFTDQMCDELVEEAENFGQWSGGKHEDTRLAGGYENVPTVDIHMNQISFEKEW 626
Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
+FLR Y+ P+ E+ + GY+ + RA M+F+VRYRPDEQPSLRPHHDSST+TIN+ALN
Sbjct: 627 LQFLRDYIAPVTEKLYPGYYTK-ARAIMNFMVRYRPDEQPSLRPHHDSSTFTINVALNHK 685
Query: 679 GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
G+DYEGGGCRFIRYNC V + R GW LMHPGRLTHYHEGL T GTRYIM+SFVDP
Sbjct: 686 GIDYEGGGCRFIRYNCQVESPRKGWSLMHPGRLTHYHEGLPTTSGTRYIMVSFVDP 741
>gi|194748144|ref|XP_001956509.1| GF24561 [Drosophila ananassae]
gi|190623791|gb|EDV39315.1| GF24561 [Drosophila ananassae]
Length = 730
Score = 689 bits (1778), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/707 (48%), Positives = 465/707 (65%), Gaps = 16/707 (2%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
DK V TVA+ TDGY R+ +SA V ++V TLGL + W GGDM GGG+K+NLL+ +
Sbjct: 36 DKVKVFTVATEPTDGYNRYYRSARVYDIEVTTLGLGKEWKGGDMQHPGGGFKLNLLRKAI 95
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+ IIL TDSYDV+I + +I+E+F A ++ AE+ CWPD SL + YP V
Sbjct: 96 SPFKNDPEKIILFTDSYDVLITAPLEEIVEKFKDSGAKVLISAEKYCWPDKSLANAYPEV 155
Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
G R+LNSG FIGYA + L+ + I++ DDQLY +FL++ R+K + LDT +
Sbjct: 156 EGKASRFLNSGAFIGYAPQVYGLLED-PIEDTADDQLYLTKVFLNDAKRSKLGLKLDTQS 214
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
LFQNL+G+ D+KL DL+ L N + T P I+HGNG SK++LN++ NYLAK++
Sbjct: 215 KLFQNLHGAKNDVKLKVDLESNQGVLQNVDFMTTPAILHGNGLSKVDLNAYANYLAKTF- 273
Query: 270 TSGCTRCNLIKHLDSLKPDQ--FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
C C H + L+ D P + +++ +P F + FL I +NYP K + +F+
Sbjct: 274 NGVCLLC----HENRLELDNTNLPVISLALIAPQPVPFYDRFLEGIRKINYPKKSLHLFI 329
Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
Y+N H Y+ + + KYI ++ ++ R LA++ + D+ FYVD+
Sbjct: 330 YSNAALHDDDTKSYVEKHGEEYASAKYILSTDELDERQGRQLALDKARLHNSDYMFYVDA 389
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ +D+ +VL+ L+ N+ + PL + + WSNFWGAL+ G+YARS DY++I+ D
Sbjct: 390 DALIDDGEVLRELLALNKQFVGPLFTKHHELWSNFWGALSDGGYYARSHDYVDIVKRDL- 448
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
GI+NVP++T+ YL+K+S A + + D DMA C +LRN GI + I + + +
Sbjct: 449 -LGIFNVPHVTSIYLVKSSAFDAMSFQH----KEFDPDMALCESLRNAGIFMYISNQRYF 503
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
GHLV+++NF+ T P+ Y L N DW +YIHP Y + L QPCPDVFWF IV
Sbjct: 504 GHLVNTDNFNSTVTRPDFYTLFSNRYDWTEKYIHPNYSQQLNATYPIPQPCPDVFWFQIV 563
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
T+ FC + V IMEA+G WSDG+N+D RLE GYEAVPTRDIHMKQVGL ++ +FL+ +V
Sbjct: 564 TDAFCDDLVAIMEAHGSWSDGSNSDARLEGGYEAVPTRDIHMKQVGLEPLYLKFLQMFVR 623
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
PLQER F GY H P R+ M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N G+DYEGGGC
Sbjct: 624 PLQERVFTGYFHNPPRSLMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNHAGIDYEGGGC 683
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RF+RYNC+V T+ GWMLMHPGRLTHYHEGL VT+GTRYIMISF+DP
Sbjct: 684 RFLRYNCSVVDTKKGWMLMHPGRLTHYHEGLLVTEGTRYIMISFIDP 730
>gi|73957734|ref|XP_536856.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
isoform 1 [Canis lupus familiaris]
Length = 740
Score = 689 bits (1777), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 326/711 (45%), Positives = 476/711 (66%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY+RF+ SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 36 VNPEKLLVITVATAETEGYRRFLWSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 95
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DM+I+ DSYDVI+ G ++L++F + ++F AE CWP+ L ++
Sbjct: 96 KKEMEKYADREDMVIMFVDSYDVILAGSPAELLKKFVQSGSRLLFSAEGFCWPEWGLAEQ 155
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 156 YPEVGTGKRFLNSGGFIGFAPTIHKVVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 215
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K+ LN GNY+
Sbjct: 216 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLHLNYLGNYVPNG 274
Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC C + L +P P VL++VF+++PT FL FL ++ L+YP ++++
Sbjct: 275 WTPQGGCGFCGRDRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 332
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ YH P D + F VK + + EAR++A+++ +FYF
Sbjct: 333 FLHNNEVYHEPHIADSWPQLQDHFSAVKLVGPEEALTPGEARDMAMDSCRQDPECEFYFS 392
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ + NP L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 393 LDADAVITNPQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 452
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC +LR+KGI L + +
Sbjct: 453 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 510
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D +P+++++ NPLDW +YIH Y ++L + + QPCPDV+W
Sbjct: 511 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGLVEQPCPDVYW 570
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++++ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 571 FPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 630
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 631 TYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 689
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 690 GGGCRFLRYDCIVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 740
>gi|395842838|ref|XP_003794215.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
[Otolemur garnettii]
Length = 744
Score = 688 bits (1776), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/711 (45%), Positives = 477/711 (67%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY+RF+QSAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 40 VNPEKLLVITVATAETEGYRRFLQSAEFFNYSVRTLGLGEEWRGGDVARTVGGGQKVRWL 99
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DM+I+ DSYDVI+ G +++L++F + ++F AE CWP L ++
Sbjct: 100 KREMEKYADQEDMVIMFVDSYDVILAGSPSELLKKFVQSGSRLLFSAEGFCWPQWGLAEQ 159
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I ++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 160 YPEVGTGKRFLNSGGFIGFAPTIHHIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 219
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L++I L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 220 HKSRIFQNLNGALDEIVLKFDRNH-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 278
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC C+ + L +P P VL++VF+++PT FL FL ++ L+YP ++++
Sbjct: 279 WSPEGGCGFCSRDRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 336
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ +H P D + F VK + ++ EAR++A+++ +FYF
Sbjct: 337 FLHNNEVFHEPHIADAWPQLQDHFSAVKLVGPEEALSPGEARDMAMDSCRQDPKCEFYFS 396
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 397 LDADAVLTNRQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 456
Query: 445 DQGGKGIWNVPYITNCYLMKTSVI-KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ + K K +++ + D DMAFC +LR+KGI L + +
Sbjct: 457 KR--VGVWNVPYISQAYVIRGETLRKELPQKEVFSGSDTDPDMAFCKSLRDKGIFLHLSN 514
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D +P+++++ NPLDW +YIH Y ++L + QPCPDV+W
Sbjct: 515 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGKEIVEQPCPDVYW 574
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++++ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 575 FPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 634
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQP+LRPHHDSST+T+N+ALN G+DYE
Sbjct: 635 TYVGPMTEHLFPGYHTK-TRAVMNFVVRYRPDEQPALRPHHDSSTFTLNVALNHKGLDYE 693
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 694 GGGCRFLRYDCIISSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 744
>gi|113931556|ref|NP_001039229.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 precursor
[Xenopus (Silurana) tropicalis]
gi|89272476|emb|CAJ83048.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Xenopus
(Silurana) tropicalis]
Length = 733
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/711 (45%), Positives = 478/711 (67%), Gaps = 10/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
+ DK LV+TVA++ T+GY+RF+++A V+TLGL W GGD++ ++GGG KV L
Sbjct: 28 VRPDKLLVVTVATDTTEGYERFLRTARHFNYTVRTLGLGHEWKGGDVARTVGGGQKVRWL 87
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K EL++ DD++I+ DSYDV+I G ++L +F+ F+ +VF AE CWP+ SL +
Sbjct: 88 KEELEKHSEQDDLVIMFVDSYDVVIAGTPTELLWKFHQFEHKVVFSAEGFCWPEWSLAES 147
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP + +G R+LNSGGFIG+A + ++ K+++DDQL+Y ++LDE+LR K I LD
Sbjct: 148 YPPISNGKRFLNSGGFIGFAPQLYRMVQLWKYKDDDDDQLFYTKVYLDESLREKFDIALD 207
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+++++ L F+ ++ V N Y+T PV+IHGNG +K++LN GNY+ S
Sbjct: 208 HKSKIFQNLNGAIDEVVLKFERNK-VRARNVAYDTIPVVIHGNGPTKLQLNYLGNYVPNS 266
Query: 268 WK-TSGCTRCNLIKHLDSLKPD-QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC C+ S+ D P VL+ VFI++PT FL +FL ++ L+YP ++S+
Sbjct: 267 WTHEGGCEVCDDDLLDLSMLEDDALPQVLLGVFIEQPTPFLPQFLERLVQLDYPRNRLSL 326
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
+++N++ +H + N K F ++K + ++ EAR++ ++ + D+YF
Sbjct: 327 YIHNSEVFHEKHIQAFWENNKDSFTSIKIVGPEEALSQGEARDMGMDLCRQDETCDYYFS 386
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
VD+D L NPD L L+ N+ +IAP++ R K WSNFWGAL+ +G+YARS DY++I+ G
Sbjct: 387 VDADVVLTNPDTLYILIQENKKVIAPMVSRSGKLWSNFWGALSPEGYYARSEDYVDIVQG 446
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G +WNVPYI + YL+K ++ + K I+TL MD DMAFC ++R+K + L + +
Sbjct: 447 KRAG--VWNVPYIAHVYLIKGETLRNELSNKNIFTLPQMDSDMAFCKSIRDKSVFLHLSN 504
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + ++ + + +++++ NP+DW +YIH Y K D QPCPDV+W
Sbjct: 505 RDEFGRLISTSKYNTSRLHNDLWQIFENPVDWREKYIHENYTKIFEEDYFE-QPCPDVYW 563
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+ E C EFV+ ME +GQWS G N D+RL GYE VPT DIHM QVG W +FL+
Sbjct: 564 FPVFKEVMCDEFVEEMENFGQWSGGKNTDQRLAGGYENVPTVDIHMTQVGYQEEWLKFLQ 623
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
+Y+ P+ E+ F GY+ + +A ++F+VRYRPDEQPSLRPHHDSST+TINIALN G+DYE
Sbjct: 624 EYIAPVTEKLFPGYYTK-AKALLNFIVRYRPDEQPSLRPHHDSSTFTINIALNNKGIDYE 682
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RYNC V + R GW MHPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 683 GGGCRFLRYNCRVESPRKGWSFMHPGRLTHYHEGLPTTRGTRYIMVSFVDP 733
>gi|147899260|ref|NP_001080446.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 precursor
[Xenopus laevis]
gi|27696396|gb|AAH43893.1| Plod-prov protein [Xenopus laevis]
Length = 733
Score = 686 bits (1770), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/708 (45%), Positives = 478/708 (67%), Gaps = 10/708 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
DK LV+TVA+ T+GY RF+++A V+TLGL W GGD++ ++GGG KV LK+E
Sbjct: 31 DKLLVVTVATEATEGYLRFLRTARHFNYTVRTLGLGHEWKGGDVARTVGGGQKVRWLKHE 90
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ DD+II+ DSYDV+I G ++L +F F+ +VF AE CWP+ SL + YP
Sbjct: 91 LEQHKDQDDLIIMFVDSYDVVISGSPTELLWKFQRFEHKVVFSAEGFCWPEWSLAESYPP 150
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
+ +G R+LNSGGFIG+A + +++ K+ +DDQL+Y ++LDE++R K I LD +
Sbjct: 151 ITNGKRFLNSGGFIGFAPQLYQMVQLWKYKDNDDDQLFYTKIYLDESMREKFDITLDHKS 210
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
N+FQNL G+++++ L F+ ++ V N Y+T PV+IHGNG +K++LN GNY+ SW
Sbjct: 211 NIFQNLNGAIDEVVLKFESNK-VRARNVAYDTIPVVIHGNGPTKLQLNYLGNYVPNSWTH 269
Query: 270 TSGCTRCNLIKHLDSLKPD-QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
GC C+ S+ D P VL+ VFI++PT F+ +FL ++ L+YP ++S++++
Sbjct: 270 EGGCEVCDDDLLDLSMLEDDALPHVLLGVFIEQPTPFIPQFLQRLVQLDYPRNRLSLYIH 329
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++ YH + + +K F ++K + ++ EAR++ ++ + D+YF VD+
Sbjct: 330 NSEVYHERHIEVFYKKYKDSFTSIKIVGPEEAMSQGEARDMGMDLCRQDQTCDYYFSVDA 389
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L NPD L L+ N+ +IAP++ R K WSNFWGAL+ +G+YARS DY++I+ +
Sbjct: 390 DVALTNPDTLYILIQENKKVIAPMVSRSGKLWSNFWGALSPEGYYARSEDYVDIVQAKRA 449
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G +WNVPYI + YL+K ++A + K I+TL MD DM+ C ++R+K + L I + E
Sbjct: 450 G--VWNVPYIAHVYLIKGETLRAELSNKNIFTLPQMDPDMSVCKSIRDKNVFLHISNRDE 507
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+G L+ + ++ + + +++++ NP+DW +YIH Y K + + QPCPDV+WFP+
Sbjct: 508 FGRLLSTSKYNTSRLHNDLWQIFENPVDWKEKYIHENYSK-IFEEDYYQQPCPDVYWFPV 566
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
+E C EFV+ ME +GQWS G N D+RL GYE VPT DIHM Q+G W +FL++Y+
Sbjct: 567 FSEVMCDEFVEEMENFGQWSGGKNQDQRLAGGYENVPTVDIHMTQIGYQEEWLKFLQEYI 626
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ F GY+ + +A ++F+VRYRPDEQPSLRPHHDSST+T+NIALN G+DYEGGG
Sbjct: 627 APVTEKLFPGYYTK-AKALLNFIVRYRPDEQPSLRPHHDSSTFTVNIALNNKGIDYEGGG 685
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC V + R GW MHPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 686 CRFLRYNCRVESPRKGWSFMHPGRLTHYHEGLPTTRGTRYIMVSFVDP 733
>gi|207080302|ref|NP_001128871.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Pongo
abelii]
gi|62900717|sp|Q5R6K5.1|PLOD3_PONAB RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3;
AltName: Full=Lysyl hydroxylase 3; Short=LH3; Flags:
Precursor
gi|55731802|emb|CAH92605.1| hypothetical protein [Pongo abelii]
Length = 738
Score = 685 bits (1768), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/711 (45%), Positives = 475/711 (66%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 34 VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ G ++L++F + ++F AE CWP+ L ++
Sbjct: 94 KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+ K
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPKG 272
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P V ++VF+++PT FL FL ++ L+YP ++++
Sbjct: 273 WTPEGGCGFCNQDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ +H P D + F VK + ++ EAR++A++ +FYF
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 391 LDADTVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 450
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D + +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 509 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 569 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 628
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738
>gi|395533681|ref|XP_003768883.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
[Sarcophilus harrisii]
Length = 845
Score = 685 bits (1768), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/707 (45%), Positives = 471/707 (66%), Gaps = 9/707 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
+K LVIT A+ ET+GY RF+QSA+ V+TLGL + W GGD++ ++GGG KV LK E
Sbjct: 144 NKLLVITAATEETEGYLRFLQSAKFFNYSVQTLGLGEEWRGGDVARTVGGGQKVRWLKKE 203
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
+++ DM+I+ DSYDV++ G ++L +F + ++F AE CWP+ L ++YP
Sbjct: 204 MEKYAERKDMVIMFVDSYDVLLAGSPKELLWKFLQSGSRLLFSAESFCWPEWGLAERYPT 263
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
VG+G R+LNSGGFIG+A I ++ K+++DDQL+Y L+LD LR K + LD +
Sbjct: 264 VGNGKRFLNSGGFIGFAPTIHHIVRQWKYKDDDDDQLFYTRLYLDSKLREKLGLALDHKS 323
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT 270
+FQNL G+++++ L FD ++ V + N Y+T PV+IHGNG +K++LN GNY+ W
Sbjct: 324 RIFQNLNGAIDEVVLKFDRNQ-VRIRNVAYDTLPVVIHGNGPTKLQLNYLGNYIPNGWSP 382
Query: 271 -SGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
GC C+ + +D + FP V +SVF+++PT FL FL ++ ++YP +KI++F++N
Sbjct: 383 EGGCGFCDRDR-IDLQEQQVFPKVFLSVFVEQPTPFLPRFLQRLLLIDYPPEKITLFLHN 441
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFYVDSD 388
N+ +H P + F VK + + +AR++A++N +FYF +D+D
Sbjct: 442 NEVHHEPHIAAAWPQLQDHFSAVKLVGPEEALTPAQARDMAMDNCRQDSECEFYFSLDAD 501
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
+ + NP L+ L+ N +IAP+L R K WSNFWGAL+ + +YARS DY+ ++ +
Sbjct: 502 AVITNPQTLRNLIEENRKVIAPMLSRHGKLWSNFWGALSPEEYYARSEDYVELVQRKR-- 559
Query: 449 KGIWNVPYITNCYLMKTSVI-KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
G+WNVPY++ YL+K + K + +++ + D DMAFC +R+KGI L + + +E+
Sbjct: 560 VGVWNVPYVSQAYLIKGETLRKELPQREMFSQSESDPDMAFCKTVRDKGIFLHLSNQEEF 619
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
G L+ + + P+++++ NPLDW +YIH Y +L D + QPCPDV+WFP++
Sbjct: 620 GRLLSTARYRTDHLYPDLWQIFDNPLDWQEQYIHENYSWALDGDGIVEQPCPDVYWFPLL 679
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
+E+ C E V+ ME +GQWS G + D RL GYE VPT DIHMKQ+G W +FLR YV
Sbjct: 680 SEQMCDELVEEMENFGQWSGGKHEDSRLAGGYENVPTVDIHMKQLGYEDEWLQFLRTYVG 739
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+NIALN G+DYEGGGC
Sbjct: 740 PMTENLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNIALNNKGLDYEGGGC 798
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RF+RY+C V++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 799 RFLRYDCIVSSPRKGWGLLHPGRLTHYHEGLPTTKGTRYIMVSFVDP 845
>gi|193786792|dbj|BAG52115.1| unnamed protein product [Homo sapiens]
Length = 738
Score = 685 bits (1767), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/711 (45%), Positives = 474/711 (66%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 34 VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ G ++L++F + ++F AE CWP+ L ++
Sbjct: 94 KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+YA L+LD LR K + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYARLYLDPGLREKLSLNLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T P+++HGNG +K++LN GNY+
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNG 272
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P V ++VF+++PT FL FL ++ L+YP ++++
Sbjct: 273 WTPEGGCGFCNQDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ +H P D + F VK + ++ EAR++A++ +FYF
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 391 LDADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 450
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D + +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 509 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 569 FPLLSEQMCDELVAEMERYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYGDQWLQLLR 628
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY C +++ R GW L+HPGRLTHYHEGL T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYGCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738
>gi|292627353|ref|XP_002666609.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Danio
rerio]
gi|126631809|gb|AAI33840.1| Plod3 protein [Danio rerio]
Length = 730
Score = 685 bits (1767), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/732 (45%), Positives = 483/732 (65%), Gaps = 14/732 (1%)
Query: 10 LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
+IL+ ++ I + + + +E LVIT A+ TDGY RF+++ ++ LGL +
Sbjct: 6 VILTVILAVIQLSRTEPRKPNE--LLVITAATEVTDGYLRFMRTIRQFNYTIQVLGLGEQ 63
Query: 70 WLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
W GGD++ ++GGG KV LK EL++ + +I+ DSYDVI+ G ++L +F+ F
Sbjct: 64 WRGGDVARTVGGGQKVRWLKTELEKHKDKQNTVIMFVDSYDVILASGPVELLRKFSRFSH 123
Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
+VF AE CWPD L KYPAV G RYLNSGGFIG+A +I ++ K+++DDQL+
Sbjct: 124 RVVFSAEGFCWPDQRLASKYPAVHHGKRYLNSGGFIGFAPEIHAIVQQWKYKDDDDDQLF 183
Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIH 248
Y ++LD+ R K + LD + +FQNL G++E++ L F+ V + N Y+T PV+IH
Sbjct: 184 YTRIYLDKEKRRKFNMTLDHRSQIFQNLNGAIEEVVLKFEKSR-VRVRNVAYDTLPVVIH 242
Query: 249 GNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFL 306
GNG +K++LN GNY+ +W +GC C + L L ++ P V ++VFI++P FL
Sbjct: 243 GNGPTKLQLNYLGNYVPTAWTYENGCGICEEDLLDLSHLSDEEMPLVHVAVFIEQPMPFL 302
Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
EEFL ++A LNYP +I +F++NN YH + + +++F + + + +A
Sbjct: 303 EEFLERLATLNYPHTRIRLFLHNNVVYHEQHVERFWTRHRSLFTGARIVGPEENLKHDQA 362
Query: 367 RNLAVENSLHKGV--DFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
R +AVE + K V D++F +D+D L NPDVL+ L+ N+S+IAP+L R K WSNFWG
Sbjct: 363 RTMAVE-ACKKDVSCDYFFSLDADVALTNPDVLRILIEENKSVIAPMLSRHGKLWSNFWG 421
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMD 483
AL+ +GFY+R+ DY++I+ + G+WNVPYIT YL++ +++ ++Y MD
Sbjct: 422 ALSPEGFYSRAEDYIDIVQSKR--VGLWNVPYITQVYLIRGETLRSRLAAVSLYQQEGMD 479
Query: 484 YDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
DM+FC ++R +GI + + + E+G LV S N++ + +P+++++ NP+DW +YIH
Sbjct: 480 PDMSFCKSVREQGIFMFVSNRDEFGRLVSSANYNISRLHPDMWQIFDNPVDWREKYIHEN 539
Query: 544 YQKSLLPD-TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAV 602
Y + D +V QPCPDV+WFP +E+ C + V+ ME +GQWS G + D+RL GYE V
Sbjct: 540 YSRIFEDDESVVEQPCPDVYWFPAFSERMCDDLVETMEEFGQWSGGGHKDERLSGGYENV 599
Query: 603 PTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRP 662
PT DIHM Q+ W +FL++Y+VP+ E+ + GY+ + +A M+FVVRYRPDEQPSLRP
Sbjct: 600 PTVDIHMNQIQFEKEWLKFLKEYIVPVTEKLYPGYYPK-AQAVMNFVVRYRPDEQPSLRP 658
Query: 663 HHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
HHDSST+TINIALN GVDYEGGGCRF+RYNC V + R GW MHPGRLTHYHEGL T+
Sbjct: 659 HHDSSTFTINIALNSKGVDYEGGGCRFLRYNCKVESPRKGWSFMHPGRLTHYHEGLPTTR 718
Query: 723 GTRYIMISFVDP 734
GTRYIM+SFVDP
Sbjct: 719 GTRYIMVSFVDP 730
>gi|113195568|ref|NP_001037808.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Danio
rerio]
gi|67973229|gb|AAY84150.1| procollagen lysine 2-oxoglutarate 5-dioxygenase 3 [Danio rerio]
gi|190337538|gb|AAI63451.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Danio rerio]
Length = 730
Score = 684 bits (1765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/710 (46%), Positives = 473/710 (66%), Gaps = 12/710 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
++ LVIT A+ TDGY RF+++ ++ LGL + W GGD++ ++GGG KV LK E
Sbjct: 26 NELLVITAATEVTDGYLRFMRTIRQFNYTIQVLGLGEQWRGGDVARTVGGGQKVRWLKTE 85
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ + +I+ DSYDVI+ G ++L +F+ F +VF AE CWPD L KYPA
Sbjct: 86 LEKHKDKQNTVIMFVDSYDVILASGPVELLRKFSRFSHRVVFSAEGFCWPDQRLASKYPA 145
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G RYLNSGGFIG+A +I ++ K+++DDQL+Y ++LD+ R K + LD +
Sbjct: 146 VHHGKRYLNSGGFIGFAPEIHAIVQQWKYKDDDDDQLFYTRIYLDKEKRRKFNMTLDHRS 205
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G++E++ L F+ V + N Y+T PV+IHGNG +K++LN GNY+ +W
Sbjct: 206 QIFQNLNGAIEEVVLKFEKSR-VRVRNVAYDTLPVVIHGNGPTKLQLNYLGNYVPTAWTY 264
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC C + L L ++ P V ++VFI++P FLEEFL ++A LNYP +I +F++
Sbjct: 265 ENGCGICEEDLLDLSHLSDEEMPLVHVAVFIEQPMPFLEEFLERLATLNYPHTRIRLFLH 324
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGV--DFYFYVD 386
NN YH + + +++F + + + +AR +AVE + K V D++F +D
Sbjct: 325 NNVVYHEQHVERFWTRHRSLFTGARIVGPEENLKHDQARTMAVE-ACKKDVSCDYFFSLD 383
Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
+D L NPDVL+ L+ N+S+IAP+L R K WSNFWGAL+ +GFY+R+ DY++I+ +
Sbjct: 384 ADVALTNPDVLRILIEENKSVIAPMLSRHGKLWSNFWGALSPEGFYSRAEDYIDIVQSKR 443
Query: 447 GGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
G+WNVPYIT YL++ +++ ++Y MD DM+FC ++R +GI + + +
Sbjct: 444 --VGLWNVPYITQVYLIRGETLRSRLAAVSLYQQEGMDPDMSFCKSVREQGIFMFVSNRD 501
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPD-TVNNQPCPDVFWF 564
E+G LV S N++ + +P+++++ NP+DW +YIH Y + D +V QPCPDV+WF
Sbjct: 502 EFGRLVSSANYNISRLHPDMWQIFDNPVDWREKYIHENYSRIFEDDESVVEQPCPDVYWF 561
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
P +E+ C + V+ ME +GQWS G + D+RL GYE VPT DIHM Q+ W +FL++
Sbjct: 562 PAFSERMCDDLVETMEEFGQWSGGGHKDERLSGGYENVPTVDIHMNQIQFEKEWLKFLKE 621
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
Y+VP+ E+ + GY+ + +A M+FVVRYRPDEQPSLRPHHDSST+TINIALN GVDYEG
Sbjct: 622 YIVPVTEKLYPGYYPK-AQAVMNFVVRYRPDEQPSLRPHHDSSTFTINIALNSKGVDYEG 680
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGCRF+RYNC V + R GW MHPGRLTHYHEGL TQGTRYIM+SFVDP
Sbjct: 681 GGCRFLRYNCKVESPRKGWSFMHPGRLTHYHEGLPTTQGTRYIMVSFVDP 730
>gi|148225280|ref|NP_001088601.1| uncharacterized protein LOC495489 precursor [Xenopus laevis]
gi|54648179|gb|AAH85074.1| LOC495489 protein [Xenopus laevis]
Length = 729
Score = 684 bits (1765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/708 (46%), Positives = 472/708 (66%), Gaps = 10/708 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
DK LV+TVA+ T+GY RF+++A V+TLGL W GGD++ ++GGG KV LK+E
Sbjct: 27 DKLLVVTVATEATEGYLRFLRTARHYNYTVRTLGLGHEWKGGDVARTVGGGQKVRWLKHE 86
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ D +II+ DSYDV+I G ++L +F + +VF AE CWP+ SL + YP
Sbjct: 87 LEQHKDQDQLIIMFVDSYDVVIAGTPTELLWKFQQLEHKVVFSAEGFCWPEWSLAESYPP 146
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V +G R+LNSGGFIG+A I ++ + K+ +DDQL+Y ++LDE+LR + I LD +
Sbjct: 147 VSNGKRFLNSGGFIGFAPQIYGMVQLWNYKDNDDDQLFYTKIYLDESLRERFNIALDHKS 206
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
N+FQNL G+++++ L F+ ++ V N Y+T PV+IHGNG +K++LN GNY+ SW
Sbjct: 207 NIFQNLNGAIDEVVLKFERNK-VRARNVAYDTIPVVIHGNGPTKLQLNYLGNYVPNSWTH 265
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
GC C+ + L L+ D P VL+ VFI++PT F+ +FL ++ L+YP ++S++++
Sbjct: 266 EGGCEVCDDDLFDLSMLEDDALPHVLLGVFIEQPTPFMSQFLERLVQLDYPQNRLSLYIH 325
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++ YH + K F +K + ++ EAR++ ++ + D+YF VDS
Sbjct: 326 NSEPYHERHIQAFYERHKDRFTTIKIVGPEEAMSQGEARDMGMDLCRQDETCDYYFSVDS 385
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
+ L NPD L L+ N+ +IAP++ R K WSNFWGAL+ +G+YARS DY +I+ +
Sbjct: 386 NVALTNPDTLYILIQENKKVIAPMVSRSGKLWSNFWGALSPEGYYARSEDYADIVQAKR- 444
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI N YL+K ++A + K I+TL MD DM+ C ++R+K + L I + E
Sbjct: 445 -VGVWNVPYIANVYLIKGETLRAELSNKNIFTLPQMDPDMSVCKSIRDKNVFLHISNRDE 503
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+G L+ + ++ + + +++++ NPLDW +YIH Y K + + QPCPDV+WFP+
Sbjct: 504 FGRLLSTSKYNTSRLHNDLWQIFENPLDWKEKYIHENYSK-IFEEDYYEQPCPDVYWFPV 562
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
E C EFV+ ME +GQWS G N D+RL GYE VPT DIHM QVG W +FL++Y+
Sbjct: 563 FKEIMCDEFVEEMENFGQWSGGKNQDQRLAGGYENVPTVDIHMTQVGYQEEWLKFLQEYI 622
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ F GY+ + +A ++F+VRYRPDEQPSLRPHHDSST+TINIALN G+DYEGGG
Sbjct: 623 GPVTEKLFPGYYTK-AKALLNFIVRYRPDEQPSLRPHHDSSTFTINIALNNKGIDYEGGG 681
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
C F+RYNC V + R GW MHPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 682 CHFLRYNCRVESPRKGWSFMHPGRLTHYHEGLPTTRGTRYIMVSFVDP 729
>gi|301791363|ref|XP_002930648.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
[Ailuropoda melanoleuca]
gi|281349526|gb|EFB25110.1| hypothetical protein PANDA_021154 [Ailuropoda melanoleuca]
Length = 740
Score = 684 bits (1765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/711 (45%), Positives = 477/711 (67%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ +T+GY+RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 36 VNPEKLLVITVATAKTEGYRRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 95
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DM+I+ DSYDVI+ G +++L++F ++++F AE CWP+ L ++
Sbjct: 96 KKEMEKYADREDMVIMFVDSYDVILAGSPSELLKKFVQSGSHLLFSAEGFCWPEWGLAEQ 155
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A + +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 156 YPEVGTGKRFLNSGGFIGFAPTVHQVVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLGLD 215
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 216 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 274
Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC C + L +P P VL++VF+++ T FL FL ++ L+YP ++++
Sbjct: 275 WTPQGGCGFCGRDRRTLPGGQPP--PRVLLAVFVEQATPFLPRFLQRLLLLDYPPDRVTL 332
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ YH P D + F VK + + EAR++A++ +FYF
Sbjct: 333 FLHNNEVYHEPHIADSWSQLQDHFSAVKLVGPEEALTPGEARDMAMDTCRQDPECEFYFS 392
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ + N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 393 LDADAVITNQQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 452
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC +LR+KGI L + +
Sbjct: 453 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 510
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D +P+++++ NPLDW +YIH Y ++L + + QPCPDV+W
Sbjct: 511 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGLVEQPCPDVYW 570
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++++ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 571 FPLLSDQMCDELVEEMELYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 630
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 631 TYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 689
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 690 GGGCRFLRYDCIVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 740
>gi|197097730|ref|NP_001126103.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Pongo
abelii]
gi|55730366|emb|CAH91905.1| hypothetical protein [Pongo abelii]
Length = 738
Score = 684 bits (1765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/711 (45%), Positives = 474/711 (66%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY F++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 34 VNPEKLLVITVATAETEGYLHFLRSAEFFNYTVRTLGLGEQWRGGDVARTVGGGQKVRWL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ G ++L++F + ++F AE CWP+ L ++
Sbjct: 94 KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+ K
Sbjct: 214 HKSRIFQNLNGALDEVALKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPKG 272
Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P V ++VF+++PT FL FL ++ L+YP ++++
Sbjct: 273 WTPEGGCGFCNQDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ +H P D + F VK + ++ EAR++A++ +FYF
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 391 LDADTVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSVDYVELVQR 450
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D + +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 509 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 569 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 628
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738
>gi|348517144|ref|XP_003446095.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
[Oreochromis niloticus]
Length = 734
Score = 684 bits (1765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/716 (45%), Positives = 471/716 (65%), Gaps = 13/716 (1%)
Query: 27 KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVN 85
+ + + LVITVA+ ETDGY RF+++A VK LGL + W GGD++ ++GGG KV
Sbjct: 24 RKLSPENLLVITVATEETDGYLRFMRTAREFNYTVKVLGLGEEWKGGDVARTVGGGQKVR 83
Query: 86 LLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLY 145
LK E+ + +++I+ DSYDVI G ++L +F+ ++F AE CWPD L
Sbjct: 84 WLKKEVQKHSEKTELVIMFVDSYDVIFASGPEELLSKFSRMGHKVIFSAEGFCWPDQRLA 143
Query: 146 DKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIV 205
KYP V +G RYLNSGGFIGYA +I ++ K+ +DDQL+Y ++LD+T RTK +
Sbjct: 144 SKYPEVRTGKRYLNSGGFIGYAPEISAIVQQWKYKDSDDDQLFYTRIYLDKTHRTKFNMT 203
Query: 206 LDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLA 265
LD + +FQNL G+++++ L F+ + V N Y+T PV+IHGNG +K++LN GNY+
Sbjct: 204 LDHRSRIFQNLNGAVDEVVLKFERAK-VRARNVAYDTLPVVIHGNGPTKLQLNYLGNYVP 262
Query: 266 KSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKI 323
+W +GC C+ + + + +Q P V ++VFI+ T F+EEFL +++ LNYP +I
Sbjct: 263 TAWTYETGCGICDDDVLFFNEVPDEQMPLVYVAVFIEHATPFMEEFLERLSTLNYPKTRI 322
Query: 324 SMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE-NSLHKGVDFY 382
+F++NN YH + +++F + + + + EAR +AVE D+Y
Sbjct: 323 RLFIHNNVVYHERHIQKFWERHRSLFPDARVVGPEENLKEDEARTMAVEVCKKDPECDYY 382
Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
F +DSD L NPD+L+ L+ N+S+IAP+L + K WSNFWGAL+ +G+Y+RS DY+ I+
Sbjct: 383 FSIDSDVALTNPDILRILIEENKSVIAPMLSKHGKLWSNFWGALSPEGYYSRSEDYIEIV 442
Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVI--KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLK 500
G + G+WNVPYIT YL+K S++ K + + MD DM FC ++R++G+ +
Sbjct: 443 QGKR--VGLWNVPYITQAYLLKGSMLRTKLSQVSLYMDEGGMDADMVFCRSIRDQGVFMY 500
Query: 501 IDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN--NQPC 558
+ + E+G LV S NF+ + +P+++++ NP+DW +Y+H Y K + D QPC
Sbjct: 501 VSNRDEFGRLVASSNFNTSRLHPDMWQIFDNPVDWKEKYVHENYSK-IFEDEKKYVEQPC 559
Query: 559 PDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW 618
PDV+WFP ++K C V+ ME +G+WS GT+ D+RL GYE VPT DIHM Q+G W
Sbjct: 560 PDVYWFPAFSDKMCDHMVETMEDHGEWSGGTHKDERLAGGYENVPTVDIHMNQIGFEKEW 619
Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
+FL+ Y+ P+ E+ + GY+ +A M+FVVRYRPDEQP LRPHHDSST+TINIALN+
Sbjct: 620 LKFLKDYISPVTEKLYPGYYPR-AQAIMNFVVRYRPDEQPLLRPHHDSSTFTINIALNRK 678
Query: 679 GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+DYEGGGCRF+RY+C V + R GW MHPGRLTHYHEGL+VT+GTRYIM+SFVDP
Sbjct: 679 DIDYEGGGCRFLRYDCKVESPRKGWSFMHPGRLTHYHEGLRVTKGTRYIMVSFVDP 734
>gi|4505891|ref|NP_001075.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Homo
sapiens]
gi|6093731|sp|O60568.1|PLOD3_HUMAN RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3;
AltName: Full=Lysyl hydroxylase 3; Short=LH3; Flags:
Precursor
gi|5630086|gb|AAD45831.1|AC004876_4 lysyl hydroxylase 3 [Homo sapiens]
gi|3153235|gb|AAC39753.1| lysyl hydroxylase isoform 3 [Homo sapiens]
gi|3551836|gb|AAC34808.1| lysyl hydroxylase 3 [Homo sapiens]
gi|7546824|gb|AAF63701.1| lysyl hydroxylase 3 [Homo sapiens]
gi|15079714|gb|AAH11674.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Homo sapiens]
gi|28975434|gb|AAO61775.1| lysyl hydroxylase 3 [Homo sapiens]
gi|119570590|gb|EAW50205.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Homo sapiens]
gi|189053447|dbj|BAG35613.1| unnamed protein product [Homo sapiens]
Length = 738
Score = 682 bits (1760), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/711 (45%), Positives = 474/711 (66%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 34 VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ G ++L++F + ++F AE CWP+ L ++
Sbjct: 94 KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T P+++HGNG +K++LN GNY+
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNG 272
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P V ++VF+++PT FL FL ++ L+YP ++++
Sbjct: 273 WTPEGGCGFCNQDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ +H P D + F VK + ++ EAR++A++ +FYF
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 391 LDADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 450
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D + +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 509 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 569 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 628
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738
>gi|426254759|ref|XP_004021044.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Ovis
aries]
Length = 752
Score = 682 bits (1760), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/712 (45%), Positives = 474/712 (66%), Gaps = 12/712 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY+RF+QSAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 47 VNPEKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 106
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DM+I+ DSYDV++ GG +++L++F + ++F AE CWP+ L ++
Sbjct: 107 KKEMEKYAEREDMVIMFVDSYDVVLAGGPSELLKKFIQSGSRLLFSAESFCWPEWGLAEQ 166
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 167 YPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLGLSLD 226
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L F + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 227 HKSRIFQNLNGALDEVVLKFGRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 285
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P VL++VF+++PT FL FL ++ L+Y +
Sbjct: 286 WTPEGGCGFCNQDRRPLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYRGGGKGL 343
Query: 326 FVYNNQE-YHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYF 383
F + QE YH P DD + F VK + + EAR++A++ +FYF
Sbjct: 344 FSPHLQEVYHEPHIDDSWPQLQDHFSAVKLVGPEEALTPGEARDMAMDICRQDPKCEFYF 403
Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
+D+D+ + NP L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 404 SLDADTVITNPQTLRILIEANRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQ 463
Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC +LR+KGI L +
Sbjct: 464 RKR--VGVWNVPYISQAYVIRGETLRMELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLS 521
Query: 503 STQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
+ E+G L+ + +D +P+++++ NPLDW +YIH Y ++L + + QPCPDV+
Sbjct: 522 NQHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGLVEQPCPDVY 581
Query: 563 WFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFL 622
WFP+++E+ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + L
Sbjct: 582 WFPLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLL 641
Query: 623 RKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDY 682
R YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DY
Sbjct: 642 RTYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDY 700
Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
EGGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 701 EGGGCRFLRYDCVISSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 752
>gi|426357331|ref|XP_004045997.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
[Gorilla gorilla gorilla]
Length = 738
Score = 682 bits (1760), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/711 (45%), Positives = 474/711 (66%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 34 VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ G ++L++F + ++F AE CWP+ L ++
Sbjct: 94 KKEMEKYTDREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T P++IHGNG +K++LN GNY+
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPIVIHGNGPTKLQLNYLGNYVPNG 272
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P V ++VF+++PT FL FL ++ L+YP ++++
Sbjct: 273 WTPEGGCGFCNQDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ +H P D + F VK + ++ EAR++A++ +FYF
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 391 LDADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 450
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D + +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 509 QYEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 569 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 628
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738
>gi|114615112|ref|XP_001153684.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
isoform 1 [Pan troglodytes]
gi|397471318|ref|XP_003807243.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Pan
paniscus]
gi|410222176|gb|JAA08307.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Pan
troglodytes]
gi|410306108|gb|JAA31654.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Pan
troglodytes]
gi|410349965|gb|JAA41586.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Pan
troglodytes]
Length = 738
Score = 682 bits (1759), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/711 (45%), Positives = 474/711 (66%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 34 VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVQTLGLGEEWRGGDVARTVGGGQKVRWL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ G ++L++F + ++F AE CWP+ L ++
Sbjct: 94 KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T P+++HGNG +K++LN GNY+
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNG 272
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P V ++VF+++PT FL FL ++ L+YP ++++
Sbjct: 273 WTPEGGCGFCNQDRRALPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ +H P D + F VK + ++ EAR++A++ +FYF
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 391 LDADTVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 450
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D + +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 509 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 569 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 628
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738
>gi|355712274|gb|AES04295.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Mustela
putorius furo]
Length = 749
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/718 (45%), Positives = 476/718 (66%), Gaps = 18/718 (2%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY+RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 38 VNPEKLLVITVATAETEGYRRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 97
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K +++ +DM+I+ DSYDVI+ G +++L++F + ++F AE CWP+ L ++
Sbjct: 98 KKAMEKYADREDMVIMFVDSYDVILAGSPSELLKKFVQTGSRLLFSAEGFCWPEWGLAEQ 157
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 158 YPEVGTGKRFLNSGGFIGFAPTIHQVVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 217
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 218 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 276
Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC C + L +P P VL++VF+++PT FL FL ++ L+YP +I++
Sbjct: 277 WTPQGGCGFCGRDRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRITL 334
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ YH P D + F VK + + EAR++A++ +FYF
Sbjct: 335 FLHNNEVYHEPHIADSWPQLQDHFSAVKLVGPEEALTPGEARDIAMDTCRQDPECEFYFS 394
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 395 LDADAVLTNQQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 454
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC +LR+KGI L + +
Sbjct: 455 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 512
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D +P+++++ NPLDW +YIH Y ++L + + QPCPDV+W
Sbjct: 513 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGLVEQPCPDVYW 572
Query: 564 FPI-------VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAG 616
FP+ ++E+ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG
Sbjct: 573 FPLDVYWFPLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYED 632
Query: 617 VWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALN 676
W + LR YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN
Sbjct: 633 EWLQLLRTYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALN 691
Query: 677 QVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
G+DYEGGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 692 HKGLDYEGGGCRFLRYDCIVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 749
>gi|384950168|gb|AFI38689.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Macaca
mulatta]
Length = 737
Score = 681 bits (1756), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/711 (45%), Positives = 475/711 (66%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LV+TVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 33 VNPEKLLVMTVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 92
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +D II+ DSYDV++ G +++L++F + ++F AE CWP+ L ++
Sbjct: 93 KKEMEKYADREDTIIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 152
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 153 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 212
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 213 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 271
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P V ++VF+++PT FL FL ++ L+YP+ ++++
Sbjct: 272 WTPEGGCGFCNRDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPSDRVTL 329
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ +H P D + F VK + ++ EAR++A++ +FYF
Sbjct: 330 FLHNNEVFHEPHIADSWPQLQDHFAVVKLVGPEEALSPGEARDMAMDMCRQDPECEFYFS 389
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 390 LDADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 449
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI L + +
Sbjct: 450 KR--VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 507
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D + +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 508 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 567
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 568 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 627
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 628 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 686
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C +++ R GW L HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 687 GGGCRFLRYDCVISSPRKGWALRHPGRLTHYHEGLPTTRGTRYIMVSFVDP 737
>gi|380817708|gb|AFE80728.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Macaca
mulatta]
Length = 737
Score = 681 bits (1756), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/711 (45%), Positives = 475/711 (66%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LV+TVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 33 VNPEKLLVMTVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 92
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +D II+ DSYDV++ G +++L++F + ++F AE CWP+ L ++
Sbjct: 93 KKEMEKYADREDTIIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 152
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 153 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 212
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 213 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 271
Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P V ++VF+++PT FL FL ++ L+YP ++++
Sbjct: 272 WTPEGGCGFCNRDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 329
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ +H P D + F VK + ++ EAR++A++ +FYF
Sbjct: 330 FLHNNEVFHEPHIADSWPQLQDHFAVVKLVGPEEALSPGEARDMAMDMCRQDPECEFYFS 389
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 390 LDADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 449
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI L + +
Sbjct: 450 KR--VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 507
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D + +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 508 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 567
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 568 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 627
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 628 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 686
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 687 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 737
>gi|151301193|ref|NP_001093075.1| lysyl hydroxylase 3 precursor [Takifugu rubripes]
gi|146325990|dbj|BAF61137.1| lysyl hydroxylase 3 [Takifugu rubripes]
Length = 731
Score = 679 bits (1751), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/714 (46%), Positives = 466/714 (65%), Gaps = 12/714 (1%)
Query: 27 KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVN 85
+++ + LVIT A+ ETDG+ RF+++A VK LGL + W GGD++ ++GGG KV
Sbjct: 24 QSLSPENLLVITAATEETDGFNRFMRTAREFNYTVKVLGLGEEWRGGDVARTVGGGQKVR 83
Query: 86 LLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLY 145
LK EL + ++M+I+ DSYDVI+ G + L +F+ +VF AE CWPD L
Sbjct: 84 WLKKELSKHSDKENMVIMFVDSYDVILAAGPEEPLYKFSRLGHKVVFSAEGFCWPDQRLA 143
Query: 146 DKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIV 205
KYP V SG RYLNSGGFIG A ++ ++ K+ +DDQL+Y ++LD+ RTK +
Sbjct: 144 SKYPEVHSGKRYLNSGGFIGLASELSAIVQQWKYKDNDDDQLFYTRIYLDKVQRTKFNMT 203
Query: 206 LDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLA 265
LD + +FQNL G+++++ L F+ + V N Y+T PV+IHGNG +K++LN GNY+
Sbjct: 204 LDHRSRIFQNLNGAVDEVVLKFERSK-VRARNVAYDTLPVVIHGNGPTKLQLNYLGNYVP 262
Query: 266 KSWK-TSGCTRCNLIKHLDSLKPDQ-FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKI 323
+W GC C+ L L D+ P V + VFI+K T FLEEFL ++ ++YP ++
Sbjct: 263 TAWTFAGGCGICD--DELRLLNEDEEMPLVHVGVFIEKATPFLEEFLERLTAMSYPTARL 320
Query: 324 SMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFY 382
+F++NN YH + + +F + + + + +ARN+A E +FY
Sbjct: 321 RLFIHNNVFYHERHIHRFWERHRALFLDAQLVGPEENLPESKARNMAAEACKKDPRCEFY 380
Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
F +DSD L NPD L+ L+ N+S+IAP+L + K WSNFWGAL+ +G+Y+RS DY+ I+
Sbjct: 381 FSIDSDVALTNPDTLRILIEENKSVIAPMLSQHGKLWSNFWGALSPEGYYSRSEDYIEIV 440
Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-TIYTLNSMDYDMAFCTNLRNKGIHLKI 501
G + G+WNVPYIT YL+K SV+++ + +++ MD DM FC N+R++GI L +
Sbjct: 441 QGKR--IGLWNVPYITQVYLIKGSVLRSKLSQLSLFVDEEMDSDMVFCRNIRDQGIFLFV 498
Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLP-DTVNNQPCPD 560
+ E+G LV S NF+ + +P+++++ NPLDW +YIH Y K ++ QPCPD
Sbjct: 499 SNRDEFGRLVTSTNFNTSRLHPDMWQIFDNPLDWKEKYIHENYSKVFEEQESFVEQPCPD 558
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
V+WFP +EK C V+ ME GQWS G + D+RL GYE VPT DIHM Q+G W +
Sbjct: 559 VYWFPAFSEKMCDHLVETMEDNGQWSSGGHRDERLSGGYENVPTVDIHMNQIGFEKEWLK 618
Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGV 680
FL++Y+ P+ ER + GY+ + +A M+FVVRY PDEQP LRPHHDSST+TINIALN+ +
Sbjct: 619 FLKEYIAPVTERLYPGYYPK-AQAIMNFVVRYHPDEQPFLRPHHDSSTFTINIALNRKNI 677
Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
DYEGGGCRF+RYNCNV + R GW MHPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 678 DYEGGGCRFLRYNCNVESPRKGWSFMHPGRLTHYHEGLPTTKGTRYIMVSFVDP 731
>gi|410267362|gb|JAA21647.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Pan
troglodytes]
Length = 738
Score = 679 bits (1751), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/711 (45%), Positives = 473/711 (66%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG K L
Sbjct: 34 VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVQTLGLGEEWRGGDVARTVGGGQKGRGL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ G ++L++F + ++F AE CWP+ L ++
Sbjct: 94 KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T P+++HGNG +K++LN GNY+
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNG 272
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P V ++VF+++PT FL FL ++ L+YP ++++
Sbjct: 273 WTPEGGCGFCNQDRRALPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ +H P D + F VK + ++ EAR++A++ +FYF
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 391 LDADTVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 450
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D + +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 509 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 569 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 628
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738
>gi|355747555|gb|EHH52052.1| hypothetical protein EGM_12420 [Macaca fascicularis]
Length = 738
Score = 678 bits (1750), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/708 (45%), Positives = 472/708 (66%), Gaps = 11/708 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
+K LV+TVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV LK E
Sbjct: 37 EKLLVMTVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWLKKE 96
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
+++ +D II+ DSYDV++ G +++L++F + ++F AE CWP+ L ++YP
Sbjct: 97 MEKYADREDTIIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQYPE 156
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD +
Sbjct: 157 VGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLDHKS 216
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT 270
+FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+ W
Sbjct: 217 RIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNGWTP 275
Query: 271 -SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
GC CN + L +P P V ++VF+++PT FL FL ++ L+YP ++++F++
Sbjct: 276 EGGCGFCNRDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTLFLH 333
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
NN+ +H P D + F VK + ++ EAR++A++ +FYF +D+
Sbjct: 334 NNEVFHEPHIADSWPQLQDHFAVVKLVGPEEALSPGEARDMAMDMCRQDPECEFYFSLDA 393
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++ +
Sbjct: 394 DTVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQRKR- 452
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI L + + E
Sbjct: 453 -VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSNQHE 511
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+G L+ + +D + +P+++++ NP+DW +YIH Y ++L + QPCPDV+WFP+
Sbjct: 512 FGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGAGIVEQPCPDVYWFPL 571
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR YV
Sbjct: 572 LSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLRTYV 631
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYEGGG
Sbjct: 632 GPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYEGGG 690
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RY+C +++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 691 CRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 738
>gi|334323439|ref|XP_001371229.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
[Monodelphis domestica]
Length = 785
Score = 675 bits (1742), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/703 (45%), Positives = 470/703 (66%), Gaps = 9/703 (1%)
Query: 36 VITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDEM 94
VIT A+ ET+GY RF+Q+A+ V+TLGL + W GGD++ ++GGG KV LK E+++
Sbjct: 88 VITAATEETEGYLRFLQTAKFFNYTVQTLGLGEEWRGGDVARTVGGGQKVRWLKKEMEKY 147
Query: 95 DITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSG 154
+DM+I+ DSYDV++ G ++L +F + ++F AE CWP+ L ++YP+VG+G
Sbjct: 148 AERNDMVIMFVDSYDVLLAGSPKELLWKFLQSGSRLLFSAESFCWPEWGLAERYPSVGNG 207
Query: 155 YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQ 214
R+LNSGGFIG+A I ++ K+++DDQL+Y L+LD LR K + LD + +FQ
Sbjct: 208 KRFLNSGGFIGFAPTIHHIVRQWKYKDDDDDQLFYTRLYLDSKLREKLGLALDHKSRVFQ 267
Query: 215 NLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT-SGC 273
NL G+L+++ L FD ++ V + N Y+T PV+IHGNG +K++LN GNY+ W GC
Sbjct: 268 NLNGALDEVVLKFDRNQ-VRIRNVAYDTLPVVIHGNGPTKLQLNYLGNYIPNGWTPEGGC 326
Query: 274 TRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEY 333
C+ + +D + FP VL+SVF+++PT FL FL ++ ++YP ++IS+F++NN+ +
Sbjct: 327 GFCDRDR-IDLQEGQPFPRVLLSVFVEQPTPFLPRFLQRLLLIDYPPEQISLFLHNNEVH 385
Query: 334 HAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFYVDSDSHLD 392
H P + F VK + + +AR++A+++ +FYF +D+D+ +
Sbjct: 386 HEPHIAAAWPQLQDHFFAVKLVGPEEALTPAQARDMAMDSCRQDSECEFYFSLDADAIIT 445
Query: 393 NPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIW 452
N L+ L+ N +IAP+L R K WSNFWGAL+ + +YARS DY+ ++ + G+W
Sbjct: 446 NSQTLRNLIEENRKVIAPMLSRHGKLWSNFWGALSPEEYYARSEDYVELVQRKR--VGVW 503
Query: 453 NVPYITNCYLMKTSVI-KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLV 511
NVPYI+ YL+K + K + +++ + D DMAFC +R+KGI L + + +E+G L+
Sbjct: 504 NVPYISQAYLIKGETLRKELPQREVFSRSESDPDMAFCKTIRDKGIFLHLSNQEEFGRLL 563
Query: 512 DSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKF 571
+ + P+++++ NPLDW +YIH Y +L + + QPCPDV+WFP+++E+
Sbjct: 564 STARYKTDHLYPDLWQIFDNPLDWQEQYIHENYTWALDGEGMVEQPCPDVYWFPLLSEQM 623
Query: 572 CHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE 631
C E V+ ME +GQWS G + D RL GYE VPT DIHMKQ+G W +FLR YV P+ E
Sbjct: 624 CDELVEEMENFGQWSGGKHEDSRLAGGYENVPTVDIHMKQLGYEDEWLQFLRTYVGPMTE 683
Query: 632 REFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIR 691
F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+NIALN G+DYEGGGCRF+R
Sbjct: 684 NLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNIALNSKGLDYEGGGCRFLR 742
Query: 692 YNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
Y+C V++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 743 YDCIVSSPRKGWGLLHPGRLTHYHEGLPTTKGTRYIMVSFVDP 785
>gi|395521882|ref|XP_003765043.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
[Sarcophilus harrisii]
Length = 733
Score = 671 bits (1732), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/724 (45%), Positives = 479/724 (66%), Gaps = 14/724 (1%)
Query: 20 SVHCNKVKNIDED----KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM 75
SV +KV N E LV+TVA+ ET+G++RF +SA+ +V+ LGL + W GG+
Sbjct: 15 SVILSKVLNPPESHLPYNLLVLTVATKETEGFRRFKRSAQFFNYKVQVLGLGEDWQGGEK 74
Query: 76 S-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGA 134
+LGGG KV LLK L++ +D +IL TDSYDV+ G ++L++F + +VF A
Sbjct: 75 EITLGGGQKVRLLKTALEKYADKEDQVILFTDSYDVVFASGPRELLKKFRQAKSRVVFSA 134
Query: 135 ERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFL 194
E L +PD L KYP V G R+L SGGFIGYA ++ +++++ ++ + DQL+Y +FL
Sbjct: 135 EELIYPDRRLEVKYPQVHDGKRFLGSGGFIGYAPNLSKMVASWDGQDSDSDQLFYTKIFL 194
Query: 195 DETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSK 254
D R K I LD +FQNL G+L+++ L F+ + V N +Y+T PV+IHGNG +K
Sbjct: 195 DPEQRAKINITLDHRCRIFQNLDGALDEVVLKFETAQ-VRARNLEYDTLPVLIHGNGPTK 253
Query: 255 IELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNK 312
++LN GNY+ + W +GCT C+ ++ L ++ + PSVLI +FI++PT FL F +
Sbjct: 254 LQLNYLGNYIPRFWTFETGCTVCDEGLRSLKAIGDEALPSVLIGIFIEQPTPFLSLFFKR 313
Query: 313 IANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE 372
+ NL YP K++ +F++N++E+H + +I + + + VK + V +ARN+ +
Sbjct: 314 LLNLRYPRKRLRLFIHNHEEHHEDQVEQFIADHGSEYHMVKLVGPEQRVRGADARNMGAD 373
Query: 373 NSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGF 431
+ +Y +D++ L NPD L+ L+ +N+++IAPL++R + WSNFWGAL+ADGF
Sbjct: 374 LCRQDRDCTYYLSMDAEVALTNPDALRLLIEQNKAVIAPLVIRAGRLWSNFWGALSADGF 433
Query: 432 YARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCT 490
YARS DY++I+ G + G+WNVPYI+N YL+K S ++ K ++ + +D DMAFC+
Sbjct: 434 YARSEDYVDIVQGRR--VGVWNVPYISNIYLIKGSTLRGDLQQKDLFHSSKLDADMAFCS 491
Query: 491 NLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLP 550
N+R + I L+I++ +G L+ +N+ + +++E+ NP DW +YIH Y ++L
Sbjct: 492 NVREQVIFLRINNRHSFGRLLSVDNYQTTHLHNDLWEIFNNPEDWKEKYIHENYTEALKG 551
Query: 551 DTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMK 610
V PCPDV+WFPI TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM
Sbjct: 552 KLVET-PCPDVYWFPIFTETACDELVEEMEHFGQWSAGDNKDTRIQGGYENVPTIDIHMN 610
Query: 611 QVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYT 670
Q+ W +FL +Y+ P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+T
Sbjct: 611 QIKFEREWHKFLVEYIAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFT 669
Query: 671 INIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMIS 730
INIALN+VGVDYEGGGCRFIRYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +S
Sbjct: 670 INIALNRVGVDYEGGGCRFIRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVS 729
Query: 731 FVDP 734
FVDP
Sbjct: 730 FVDP 733
>gi|432098106|gb|ELK27993.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Myotis davidii]
Length = 737
Score = 671 bits (1730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 319/707 (45%), Positives = 469/707 (66%), Gaps = 9/707 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
D LV+TVA+ ET+G++RF +SA+ +++ LGL + W G +S GGG KV LLK L
Sbjct: 36 DNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWHGEKATSAGGGLKVRLLKKAL 95
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
++ ++++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP V
Sbjct: 96 EKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPVV 155
Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
G R+L SGGFIGYA + +L++ ++ + DQL+Y +FLD+ R + I LD
Sbjct: 156 SDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLFYTKIFLDKEKRERINITLDHRCR 215
Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 216 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFE 274
Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GC C+ ++ L + + PSVL+ VFI++PT FL F ++ L+YP K++ +F++N
Sbjct: 275 TGCVVCDEGLRSLKGIGDEALPSVLVGVFIEQPTPFLSLFFKRLLRLHYPQKRMRLFIHN 334
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
++++H + ++ +++VK + V + +ARN+ V+ G +YF VD+D
Sbjct: 335 HEQHHKVQVEQFLAEHGGEYQSVKLVGPEVQVANADARNMGVDLCRQDHGCTYYFSVDAD 394
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L P +L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 395 VALTEPQILRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 452
Query: 449 KGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
G+WNVPYI+N YLMK S ++A +T ++ + +D DMAFC N+R +G+ + + + +
Sbjct: 453 VGVWNVPYISNIYLMKGSALRAELQQTDLFHHSKLDPDMAFCANVRQQGVFMFLTNRHTF 512
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
GHL+ +N+ + +++E+ NP DW +YIH Y K L V PCPDV+WFPI
Sbjct: 513 GHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKVLAGKLV-EMPCPDVYWFPIF 571
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 572 TETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYIA 631
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 632 PVTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGC 690
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 691 RFLRYNCSIRAPRKGWALMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 737
>gi|348533600|ref|XP_003454293.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
[Oreochromis niloticus]
Length = 730
Score = 669 bits (1727), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/737 (44%), Positives = 483/737 (65%), Gaps = 13/737 (1%)
Query: 2 LSNLHLNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQV 61
LS+L L I+ C + ++ ++V+ I EDK LV+TVA+ ETDGY+RF+++A+ V
Sbjct: 3 LSSLLLFSGIVVCAL--SALVNSEVEGIPEDKLLVVTVATKETDGYRRFLRTAKHFNYTV 60
Query: 62 KTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDIL 120
K LG Q W GGD MS+ GGG KV LL L EM D IIL DSYDV+ G ++L
Sbjct: 61 KVLGRGQKWKGGDYMSAPGGGQKVRLLNEGLKEMK-DDHQIILFIDSYDVVFASGPKELL 119
Query: 121 ERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIK 180
++F +VF +E L WPD L DKYP V G R+L SGGFIGY +IKEL++N +
Sbjct: 120 KKFQQAKHRVVFSSETLIWPDRHLEDKYPHVREGNRFLGSGGFIGYLPNIKELVANWTGD 179
Query: 181 NEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKY 240
+ + DQL++ ++ D++ R I LD LFQNL+G+L+D+ L F+ D V + N Y
Sbjct: 180 DGDSDQLFFTKIYTDQSKRKSINITLDNKCRLFQNLHGALDDVVLKFE-DHQVRVRNVLY 238
Query: 241 NTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVF 298
+T PVIIHGNG +K+++N GNY+ +W SGCT C ++ L +L+ +++P V+I +F
Sbjct: 239 DTLPVIIHGNGPTKLQINYLGNYIPNTWTFESGCTVCREDLRSLSALQENEYPLVVIGIF 298
Query: 299 IDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHN 358
I +PT F+ F ++ L YP K+ +F++N + +H ++ ++ ++++ V I
Sbjct: 299 IQQPTPFVTVFFERLLKLQYPKNKLKLFIFNKEAHHQRQVQSFLKDYGSLYEKVTVIEPE 358
Query: 359 STVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFK 417
++ +RNL ++ + D++F +D + L N D LK L+ +N ++AP++ R +
Sbjct: 359 EEMDGAASRNLGLDLCRRDQDCDYFFSLDIEVVLKNKDTLKILIEQNLPIVAPMITRAGR 418
Query: 418 AWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIY 477
WSNFWGAL+ DG+YARS DY++I+ G + G+WNVPY++N YL+K +++ +K
Sbjct: 419 LWSNFWGALSGDGYYARSEDYVDIVQGRR--VGVWNVPYVSNVYLVKAGLLQ-RELKDYE 475
Query: 478 TLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDL 537
+S D DMAFC N+RNKGI + + + +G ++ +EN+ + +++++ NP+DW+
Sbjct: 476 LFSSSDPDMAFCHNIRNKGIFMYVTNMHTFGRILSTENYQTGHLHNDLWQIFENPVDWEE 535
Query: 538 RYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET 597
RYIH Y + ++ D + PCPDV+WFP+ + C+ ++ ME YG+WS G N D R+
Sbjct: 536 RYIHENYTR-IMKDKLIENPCPDVYWFPVFSSVACNHMIEEMEHYGKWSGGANVDNRIHG 594
Query: 598 GYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQ 657
GYE VPT DIHM Q+ W +FL +YVVP+ E+ F GY+ + ++FVVRY+PDEQ
Sbjct: 595 GYENVPTIDIHMTQINFEKDWQKFLVEYVVPITEKMFPGYYTK-AHFELAFVVRYKPDEQ 653
Query: 658 PSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEG 717
PSLRPHHD+ST+T+NIALNQVG+DY+GGGCRF+RY+C++ A R GW L+HPGRLTHYHEG
Sbjct: 654 PSLRPHHDASTFTVNIALNQVGLDYQGGGCRFLRYDCSIQAPRKGWALLHPGRLTHYHEG 713
Query: 718 LQVTQGTRYIMISFVDP 734
L T G RYI +SFVDP
Sbjct: 714 LPTTAGVRYIAVSFVDP 730
>gi|73950912|ref|XP_544565.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
isoform 1 [Canis lupus familiaris]
Length = 727
Score = 668 bits (1723), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/708 (44%), Positives = 469/708 (66%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +S + +++ LGL + W G +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATTETEGFRRFKRSGQFFNYKIQALGLGEDWTGEKGTSAGGGLKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F +VF AE L +PD L KYPA
Sbjct: 85 LEKHADKEDLVILFTDSYDVVFASGPRELLKKFRQARGQVVFSAEELIYPDRRLEAKYPA 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA + +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLFYTQIFLDPEKRERINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC+ C+ ++ L + + P+VL+ VFI++PT FL F ++ +L+YP K++ +F++
Sbjct: 264 ETGCSVCDEGLRSLRGIGEEALPTVLVGVFIEQPTPFLSLFFRRLLHLHYPRKQMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H + ++ + +++VK + V + +ARN+ + +G +YF VD+
Sbjct: 324 NHEQHHKAQVEQFLAEHGSEYQSVKLVGPEVRVANADARNMGADLCRQDRGCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++A T ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRAELQHTDLFHHSRLDPDMAFCANIRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 FGHLLSLDSYQTSHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLV-EMPCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 561 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|410966044|ref|XP_003989548.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Felis
catus]
Length = 727
Score = 665 bits (1716), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/708 (44%), Positives = 470/708 (66%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +S + +++ LGL + W G +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSGQFFNYKIQALGLGEDWNGEKGASSGGGLKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ ++++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYPA
Sbjct: 85 LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPA 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA + +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPSLSKLVAEWEGQDGDSDQLFYTKIFLDPEKRERINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC C+ ++ L + + P+VL+ VFI++PT FL F ++ L+YP K++ +F++
Sbjct: 264 ETGCAVCDEGLRSLRGIGDEALPTVLVGVFIEQPTPFLSLFFQRLLRLHYPQKQMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H + ++ + +++VK + V + +ARN+ + +G +YF VD+
Sbjct: 324 NHEQHHKAQVEQFLAEHGSEYQSVKLVGPEVRVANADARNVGADLCRQDRGCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++A ++T ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRAELLQTDLFHHSKLDPDMAFCANIRQQDVFMYLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 FGHLLSLDSYQTSHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLV-EMPCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 561 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|334328372|ref|XP_001371352.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
[Monodelphis domestica]
Length = 725
Score = 665 bits (1715), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/706 (44%), Positives = 471/706 (66%), Gaps = 10/706 (1%)
Query: 34 FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNLLKNELD 92
LV+TVA+ ET+G++RF +SA+ ++ LGL + W GG+ ++LGGG KV LLK L+
Sbjct: 25 LLVLTVATKETEGFRRFKRSAQFFNYNIQVLGLGEDWHGGEKETTLGGGQKVRLLKAALE 84
Query: 93 EMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG 152
+ +D IIL TDSYDV+ G ++L++F + +VF AE L +PD L KYP V
Sbjct: 85 KYAEKEDQIILFTDSYDVLFASGPKELLKKFRQTKSRVVFSAEELIYPDRRLEAKYPQVH 144
Query: 153 SGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANL 212
G R+L SGGFIGYA ++ +L+++ ++ + DQL+Y +FLD R K I LD +
Sbjct: 145 DGKRFLGSGGFIGYAPNLSKLVASWQGQDSDSDQLFYTKIFLDPEQREKINITLDHRCRI 204
Query: 213 FQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TS 271
FQNL G+L+++ L F+ + V N +Y+T PV+IHGNG +K++LN GNY+ + W +
Sbjct: 205 FQNLDGALDEVVLKFETAQ-VRARNLEYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFET 263
Query: 272 GCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN 330
GCT C+ ++ L L P+VLI +FI++PT FL F ++ +L YP K++ +F++N+
Sbjct: 264 GCTVCDEGLRSLKGLGDKALPTVLIGIFIEQPTPFLSLFFKRLLSLRYPRKQLRLFIHNH 323
Query: 331 QEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDS 389
+E+H + ++ + + + VK + + + + +ARN+ + + +Y +D++
Sbjct: 324 EEHHEAQVEQFLEDHGSEYHTVKLVGPDQRMKNADARNMGADLCRQDRDCTYYLSMDAEV 383
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NPD L+ L+ +N+++IAPL++R + WSNFWGAL+ADGFYARS DY++I+ G +
Sbjct: 384 ALTNPDALRILIEQNKAVIAPLVIRAGRLWSNFWGALSADGFYARSEDYVDIVQGRR--V 441
Query: 450 GIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYG 508
G+WNVPYI+N YL+K S +++ K ++ + +D DMAFC+N+R + + L + + +G
Sbjct: 442 GVWNVPYISNIYLIKGSTLRSDLRQKDLFHSSKLDADMAFCSNVREQNVFLFVTNQHSFG 501
Query: 509 HLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVT 568
L+ +N+ + +++E+ NP DW +YIH Y ++L V PCPDV+WFPI T
Sbjct: 502 RLLSVDNYQTTHLHNDLWEIFNNPEDWKEKYIHENYTEALKGKLVET-PCPDVYWFPIFT 560
Query: 569 EKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVP 628
E C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+ P
Sbjct: 561 ETACDELVEEMEHFGQWSAGDNKDTRIQGGYENVPTIDIHMNQIKFEREWHKFLVEYIAP 620
Query: 629 LQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCR 688
+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGCR
Sbjct: 621 MTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGCR 679
Query: 689 FIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
FIRYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 FIRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 725
>gi|149695386|ref|XP_001491381.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
[Equus caballus]
Length = 727
Score = 665 bits (1715), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/708 (44%), Positives = 469/708 (66%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ E++G++RF +SA+ +++ LGL + W G +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKESEGFRRFKRSAQFFNYKIQALGLGEDWDGDKETSAGGGLKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ ++++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKRERINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLVGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P VL+ VFI++PT FL F ++ L+YP K++ +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPVVLVGVFIEQPTPFLSLFFQRLLRLHYPRKQLRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H + ++ +K+VK + V + +ARN+ + +G +YF VD+
Sbjct: 324 NHEQHHKAQVEQFLAEHGGEYKSVKLVGPEVRVANADARNMGADLCRQDRGCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P L+ L+ +N+++IAPL+ R + WSNFWGA++ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGAMSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++A +T ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRAELQQTDLFHHSKLDADMAFCANIRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL+ +++ + +++E+ NP DW +YIH Y ++L V PCPDV+WFPI
Sbjct: 502 FGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTRALAGKLV-EMPCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 561 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 727
>gi|296192333|ref|XP_002744029.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
[Callithrix jacchus]
Length = 753
Score = 663 bits (1711), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/679 (45%), Positives = 455/679 (67%), Gaps = 11/679 (1%)
Query: 61 VKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDI 119
V+TLGL + W GGD++ ++GGG KV LK E+++ +DMII+ DSYDV++ GG +++
Sbjct: 81 VRTLGLGEEWRGGDVARTVGGGQKVRWLKKEMEKYADREDMIIMFVDSYDVVLAGGPSEL 140
Query: 120 LERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSI 179
L++F + ++F AE CWP+ L ++YP VG+G R+LNSGGF+G+A I +++
Sbjct: 141 LKKFVQSGSRLLFSAESFCWPEWGLAEQYPEVGTGKRFLNSGGFVGFATTIHQIVRQWKY 200
Query: 180 KNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTK 239
K+++DDQL+Y L+LD LR K + LD + +FQNL G+L+++ L FD + V + N
Sbjct: 201 KDDDDDQLFYTRLYLDPGLREKLGLNLDHKSRIFQNLNGALDEVVLKFDRNR-VRIRNVA 259
Query: 240 YNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKH-LDSLKPDQFPSVLISV 297
Y+T PV++HGNG +K++LN GNY+ W GC CN + L +P P V ++V
Sbjct: 260 YDTLPVVVHGNGPTKLQLNYLGNYVPNGWTPEGGCGFCNRDRRTLPGGQPP--PRVFLAV 317
Query: 298 FIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAH 357
F+++PT FL FL ++ L+YP +I++F++NN+ +H P D + F K +
Sbjct: 318 FVEQPTPFLPSFLQRLLLLDYPHDRITLFLHNNEVFHEPHIADSWPQLQEHFAATKLVGP 377
Query: 358 NSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPF 416
++ EAR++A++ +FYF +D+D+ L NP L+ L+ N +IAP+L R
Sbjct: 378 EEALSPGEARDMAMDMCRQDPECEFYFSLDADAVLTNPQTLRILIEENRKVIAPMLSRHG 437
Query: 417 KAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKT 475
K WSNFWGAL+ D +YARS DY+ ++ + G+WNVPYI+ Y+++ ++ +
Sbjct: 438 KLWSNFWGALSPDEYYARSEDYVELVQRKR--VGVWNVPYISQAYVIQGETLRMELPQRE 495
Query: 476 IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDW 535
+++ + D DMAFC + R+KGI L + + E+G L+ + +D + +P+++++ NP+DW
Sbjct: 496 VFSGSDTDPDMAFCKSFRDKGIFLHLSNQHEFGRLLATSRYDTEHLHPDLWQIFDNPVDW 555
Query: 536 DLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL 595
+YIH Y ++L + + QPCPDV+WFP+++E+ C E V+ ME YGQWS G + D RL
Sbjct: 556 QEQYIHENYSRALEGEGIVEQPCPDVYWFPLLSEQMCDELVEEMEHYGQWSGGRHEDSRL 615
Query: 596 ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPD 655
GYE VPT DIHMKQVG W + LR YV P+ E F GYH + RA M+FVVRYRPD
Sbjct: 616 AGGYENVPTVDIHMKQVGYEDQWLQLLRTYVGPMTESLFPGYHTK-ARAVMNFVVRYRPD 674
Query: 656 EQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYH 715
EQPSLRPHHDSST+T+N+ALN G+DYEGGGCRF+RYNC +++ R GW L+HPGRLTHYH
Sbjct: 675 EQPSLRPHHDSSTFTLNVALNHKGLDYEGGGCRFLRYNCVISSPRKGWALLHPGRLTHYH 734
Query: 716 EGLQVTQGTRYIMISFVDP 734
EGL TQGTRYIM+SFVDP
Sbjct: 735 EGLPTTQGTRYIMVSFVDP 753
>gi|225690536|ref|NP_001071210.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Danio
rerio]
Length = 730
Score = 662 bits (1709), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/734 (43%), Positives = 480/734 (65%), Gaps = 17/734 (2%)
Query: 7 LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGL 66
L CL S F +HCN+ +I E LV+TVA+ ETDG++RF++SA+ +K LG
Sbjct: 8 LACLFAS----FPPLHCNQQGSIPEGDLLVLTVATQETDGFRRFLRSAKHFNYTIKVLGR 63
Query: 67 HQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNT 125
+ W GGD M++ GGG KV LLK+ L+++ + +IL DSYDVI G ++L++F
Sbjct: 64 GETWRGGDYMTAPGGGQKVRLLKSALEDIQ-EEKKVILFVDSYDVIFSSGPKELLKKFQQ 122
Query: 126 FDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDD 185
+VF AE L WPD L DK+P V G R+L +GGFIGYA ++K+++S+ S + + D
Sbjct: 123 AKHKVVFSAETLIWPDRHLEDKHPHVREGKRFLGAGGFIGYAANLKKMLSDWSGADGDSD 182
Query: 186 QLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPV 245
QL+Y +++++ R I LD+ LFQNL+G+L+++ L F+ D V N Y+T PV
Sbjct: 183 QLFYTKIYINKEKRKSINITLDSKCRLFQNLHGALDEVVLKFE-DGRVRARNVLYDTLPV 241
Query: 246 IIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNLIKHLDS-LKPDQFPSVLISVFIDKPT 303
IIHGNG +K+++N GNY+ W +GCT CN + L S L+ ++P V+I +FI +PT
Sbjct: 242 IIHGNGPTKLQINYLGNYIPNLWTFETGCTMCNQDRRLLSGLQESEYPVVVIGIFIQQPT 301
Query: 304 AFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNS 363
F+ F ++ NL YP ++ +F+YN + +H ++ + ++ ++ VK I ++
Sbjct: 302 PFVTVFFERLFNLKYPKNRLKLFIYNQETHHEQHIHAFLDSHESEYQGVKLIGPEEDIDP 361
Query: 364 KEARNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNF 422
+RNL + +++F +D D L N D L+ L+ N+ IAP+L +P + W+NF
Sbjct: 362 VSSRNLGFDMCREDIDCEYFFSIDVDVVLKNEDTLRILIEHNKPFIAPMLTKPGRLWTNF 421
Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKT--IYTLN 480
WGAL+ADGFYARS DY++I+ G + G+WNVPY+++ +L+K ++ T++K ++
Sbjct: 422 WGALSADGFYARSEDYVDIVQGHR--VGLWNVPYVSHIFLIKADTLR-TDLKDPDLFKST 478
Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
++D DMAFC +RNKG+ + + + +G ++ ++N+ + +++++ NP++W+ RYI
Sbjct: 479 TLDPDMAFCEKIRNKGVFMFVTNMDTFGRVLSTDNYQTNHLHNDLWQIFENPVEWEERYI 538
Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYE 600
HP Y + +L D PCPDV+WFPI +E C V+ ME +GQWS G N D R++ GYE
Sbjct: 539 HPNYSR-VLKDEFIETPCPDVYWFPIFSEVACDHLVEEMENFGQWSGGANVDNRIQGGYE 597
Query: 601 AVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSL 660
VPT DIHM QVG W +FL Y+ P+ E+ F GY+ + ++FVVRY+PDEQPSL
Sbjct: 598 NVPTIDIHMNQVGYEKEWQKFLLDYIAPVTEKMFPGYYTR-AQFDLAFVVRYKPDEQPSL 656
Query: 661 RPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQV 720
RPHHD+ST+TINIALN VG+D++GGGCRF+RY+C++ + R GW MHPGRLTHYHEGL
Sbjct: 657 RPHHDASTFTINIALNHVGIDFQGGGCRFLRYDCSIRSPRKGWAFMHPGRLTHYHEGLPT 716
Query: 721 TQGTRYIMISFVDP 734
T+G RYI +SFVDP
Sbjct: 717 TEGVRYIAVSFVDP 730
>gi|402863091|ref|XP_003895867.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
5-dioxygenase 3 [Papio anubis]
Length = 736
Score = 662 bits (1708), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 319/714 (44%), Positives = 469/714 (65%), Gaps = 18/714 (2%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LV+TVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 33 VNPEKLLVMTVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 92
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ G +++L++F + ++F AE CWP+ L ++
Sbjct: 93 KKEMEKYADREDMIIMFVDSYDVILAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 152
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 153 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 212
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 213 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 271
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P V ++VF+++PT FL FL ++ L+YP ++++
Sbjct: 272 WTPEGGCGFCNRDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 329
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDF---- 381
F++NN+ +H P D + F VK + ++ EAR++A+ + +
Sbjct: 330 FLHNNEVFHEPHIADSWPQLQDHFAVVKLVGPEEALSPGEARDMAIRRAPQQARTHFCLG 389
Query: 382 YFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
Y + S L +P L R +IAP+L R K WSNFWGAL+ D +YARS DY+ +
Sbjct: 390 YGAPQACSRLPSPH--PRLSXRK--VIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVEL 445
Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLK 500
+ + G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI L
Sbjct: 446 VQRKR--VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIFLH 503
Query: 501 IDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPD 560
+ + E+G L+ + +D + +P+++++ NP+DW +YIH Y ++L + + QPCPD
Sbjct: 504 LSNQHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPD 563
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
V+WFP+++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG W +
Sbjct: 564 VYWFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQ 623
Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGV 680
LR YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+
Sbjct: 624 LLRTYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGL 682
Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
DYEGGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 683 DYEGGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 736
>gi|110825733|sp|O77588.2|PLOD1_BOVIN RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1;
AltName: Full=Lysyl hydroxylase 1; Short=LH1; Flags:
Precursor
gi|95767528|gb|ABF57309.1| lysyl hydroxylase precursor [Bos taurus]
Length = 726
Score = 661 bits (1705), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/708 (44%), Positives = 470/708 (66%), Gaps = 10/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W G M + GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWPGEAMLA-GGGLKVRLLKKA 83
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ ++++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 84 LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 143
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 144 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 203
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ + V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 204 RIFQNLDGALDEVVLKFEMGQ-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 262
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC C+ ++ L + + P+VL+ VFI++PT FL F ++ L+YP K++ +F++
Sbjct: 263 ETGCAVCDEGLRSLKGIGDEALPAVLVGVFIEQPTPFLSLFFQRLLRLHYPQKRLRLFIH 322
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H + ++ +++VK + V + +ARN+ + +G +YF VD+
Sbjct: 323 NHEQHHKAQVEQFLAEHGDEYQSVKLVGPEVRVANADARNMGADLCRQDRGCTYYFSVDA 382
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 383 DVALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 441
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++A +T ++ + +D DMAFC N+R + + + + +
Sbjct: 442 -VGVWNVPYISNIYLIKGSALRAELQETDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHS 500
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 501 FGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKMVE-MPCPDVYWFPI 559
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 560 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQINFEREWHKFLVEYI 619
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 620 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLVPHHDASTFTINIALNRVGVDYEGGG 678
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 679 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 726
>gi|344283497|ref|XP_003413508.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
[Loxodonta africana]
Length = 727
Score = 660 bits (1703), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/708 (43%), Positives = 471/708 (66%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W G + GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNSGQETPAGGGQKVRLLKRA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ ++++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L+S ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVSEWEGQDSDSDQLFYTKIFLDPEKRERINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V + N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRVRNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L ++ + P+VL+ +FI++PT FL F ++ L+YP K++ +F++
Sbjct: 264 ETGCTVCDEGLRLLKGIRDEALPTVLVGIFIEQPTPFLVLFFQRLLRLHYPWKQMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H + ++ + +++VK + + + +ARNL + + +YF +D+
Sbjct: 324 NHEQHHKAQVEQFLAEHGSKYQSVKLVGPEIRMANADARNLGADLCRKDQSCTYYFSMDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L PD L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALKEPDTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S +++ K ++ + +D DM+FC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRSDLQQKDLFHHSKLDPDMSFCANVRQQAVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 FGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALEGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 561 FTEAACDELVEEMEHYGQWSRGDNKDNRIQGGYENVPTIDIHMNQINFEREWHKFLVEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW L+HPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLLHPGRLTHYHEGLPTTKGTRYIAVSFVDP 727
>gi|70779497|gb|AAZ08241.1| procollagen lysine 2-oxoglutarate 5-dioxygenase 1 [Danio rerio]
gi|116487779|gb|AAI25831.1| Zgc:152876 [Danio rerio]
Length = 730
Score = 660 bits (1702), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/734 (43%), Positives = 480/734 (65%), Gaps = 17/734 (2%)
Query: 7 LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGL 66
L CL S F ++CN+ +I E LV+TVA+ ETDG++RF++SA+ +K LG
Sbjct: 8 LACLFAS----FPPLYCNQQGSIPEGDLLVLTVATQETDGFRRFLRSAKHFNYTIKVLGR 63
Query: 67 HQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNT 125
+ W GGD M++ GGG KV LLK+ L+++ + +IL DSYDVI G ++L++F
Sbjct: 64 GETWRGGDYMTAPGGGQKVRLLKSALEDIQ-EEKKVILFVDSYDVIFSSGPKELLKKFQQ 122
Query: 126 FDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDD 185
+VF AE L WPD L DK+P V G R+L +GGFIGYA ++K+++S+ S + + D
Sbjct: 123 AKHKVVFSAETLIWPDRHLEDKHPHVREGKRFLGAGGFIGYAANLKKMLSDWSGADGDSD 182
Query: 186 QLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPV 245
QL+Y +++++ R I LD+ LFQNL+G+L+++ L F+ D V N Y+T PV
Sbjct: 183 QLFYTKIYINKEKRKSINITLDSKCRLFQNLHGALDEVVLKFE-DGRVRARNVLYDTLPV 241
Query: 246 IIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNLIKHLDS-LKPDQFPSVLISVFIDKPT 303
IIHGNG +K+++N GNY+ W +GCT CN + L S L+ ++P V+I +FI +PT
Sbjct: 242 IIHGNGPTKLQINYLGNYIPNLWTFETGCTMCNQDRRLLSGLQESEYPVVVIGIFIQQPT 301
Query: 304 AFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNS 363
F+ F ++ NL YP ++ +F+YN + +H ++ + ++ ++ VK I ++
Sbjct: 302 PFVTVFFERLFNLKYPKNRLKLFIYNQETHHEQHIHAFLDSHESEYQGVKLIGPEEDIDP 361
Query: 364 KEARNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNF 422
+RNL + +++F +D D L N D L+ L+ N+ IAP+L +P + W+NF
Sbjct: 362 VSSRNLGFDMCREDIDCEYFFSIDVDVVLKNEDTLRILIEHNKPFIAPMLTKPGRLWTNF 421
Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKT--IYTLN 480
WGAL+ADGFYARS DY++I+ G + G+WNVPY+++ +L+K ++ T++K ++
Sbjct: 422 WGALSADGFYARSEDYVDIVQGHR--VGLWNVPYVSHIFLIKADTLR-TDLKDPDLFKST 478
Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
++D DMAFC +RNKG+ + + + +G ++ ++N+ + +++++ NP++W+ RYI
Sbjct: 479 TLDPDMAFCEKIRNKGVFMFVTNMDTFGRVLSTDNYQTNHLHNDLWQIFENPVEWEERYI 538
Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYE 600
HP Y + +L D PCPDV+WFPI +E C V+ ME +GQWS G N D R++ GYE
Sbjct: 539 HPNYSR-VLKDEFIETPCPDVYWFPIFSEVACDHLVEEMENFGQWSGGANVDNRIQGGYE 597
Query: 601 AVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSL 660
VPT DIHM QVG W +FL Y+ P+ E+ F GY+ + ++FVVRY+PDEQPSL
Sbjct: 598 NVPTIDIHMNQVGYEKEWQKFLLDYIAPVTEKMFPGYYTR-AQFDLAFVVRYKPDEQPSL 656
Query: 661 RPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQV 720
RPHHD+ST+TINIALN VG+D++GGGCRF+RY+C++ + R GW MHPGRLTHYHEGL
Sbjct: 657 RPHHDASTFTINIALNHVGIDFQGGGCRFLRYDCSIRSPRKGWAFMHPGRLTHYHEGLPT 716
Query: 721 TQGTRYIMISFVDP 734
T+G RYI +SFVDP
Sbjct: 717 TEGVRYIAVSFVDP 730
>gi|291410569|ref|XP_002721561.1| PREDICTED: lysyl hydroxylase 1 [Oryctolagus cuniculus]
Length = 727
Score = 660 bits (1702), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/708 (44%), Positives = 467/708 (65%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W S GGG KV LL+
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWPVEKGLSAGGGQKVRLLRKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D+++L TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVVLFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA +++L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPSLRKLVAEWEGQDSDSDQLFYTKIFLDPEKRERINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC C+ ++ L + + P+VL+ VFI++PT FL F ++ L+YP K++ +FV+
Sbjct: 264 ETGCAVCDEGLRSLKGIGEEALPTVLVGVFIEQPTPFLSLFFQRLLRLHYPRKQMRLFVH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H + ++ +++VK + + + + +ARNL + + +YF +D+
Sbjct: 324 NHEQHHKAQVEQFLLEHGDEYQSVKLVGPEARMANADARNLGADLCRQDRACTYYFSMDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L PD L+ L+ +N++++APL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPDSLRLLIEQNKNVLAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++A + +Y + +D DMAFC NLR + + + + +
Sbjct: 443 -VGVWNVPYISNVYLIKGSALRAELHSPDLYRYSKLDPDMAFCANLRKQEVFMFLTNRHS 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL+ +N+ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 FGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHEGYGKALAGKLV-EMPCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWALMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 727
>gi|355712268|gb|AES04293.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1 [Mustela
putorius furo]
Length = 727
Score = 660 bits (1702), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/708 (44%), Positives = 465/708 (65%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +S + +++ LGL + W G SS GGG KV LLK
Sbjct: 25 EDNLLVLTVATRETEGFRRFKRSGQFFNYKIQALGLGEDWSGEKGSSAGGGLKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ ++++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYPA
Sbjct: 85 LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPA 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKRERINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCN-LIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC C+ ++ L + + P+VL+ VFI++PT FL F ++ L+YP K++ +F++
Sbjct: 264 ETGCAVCDESLRSLRGIGDEALPTVLVGVFIEQPTPFLSLFFQRLLRLHYPRKQMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++ +H + ++ + +VK + + + +ARN+ + + +YF VD+
Sbjct: 324 NHEPHHKVQVEQFLAEHGDEYPSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++A +T ++ +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRAELRQTDLFHHRKLDPDMAFCANIRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL+ +N+ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 FGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 561 FTEAACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQINFEREWHKFLVEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|311258478|ref|XP_003127625.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
[Sus scrofa]
Length = 725
Score = 659 bits (1700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/708 (44%), Positives = 470/708 (66%), Gaps = 11/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W + SS GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNEKEASS-GGGLKVRLLKKA 83
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L E ++++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYPA
Sbjct: 84 L-EKHADENLVILFTDSYDVVFASGPRELLKKFRQAKSQVVFSAEELIYPDRRLEAKYPA 142
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 143 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKRERINITLDHRC 202
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ + V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 203 RIFQNLDGALDEVVLKFEMGQ-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 261
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC+ C+ ++ L + + P+VL+ +FI++PT FL F ++ L YP K++ +F++
Sbjct: 262 ETGCSVCDEGLRSLKGIGDEALPTVLVGLFIEQPTPFLSLFFQRLLRLQYPRKRMRLFIH 321
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H L + ++ +++VK + V + +ARN+ + + +YF VD+
Sbjct: 322 NHEQHHKALVEQFLAEHGDEYQSVKLVGPEVRVANADARNMGADLCRQDRSCTYYFSVDA 381
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P L+ L+ +N++++APL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 382 DVALTEPKTLRLLIEQNKNVLAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 440
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++A +T ++ + +D DMAFC N+R + + + + +
Sbjct: 441 -VGVWNVPYISNVYLIKGSALRAELQQTDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHA 499
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 500 FGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWFPI 558
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 559 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 618
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 619 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 677
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 678 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 725
>gi|426240317|ref|XP_004014056.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Ovis
aries]
Length = 703
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/705 (44%), Positives = 468/705 (66%), Gaps = 10/705 (1%)
Query: 34 FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDE 93
LV+TVA+ ET+G++RF +SA+ +++ LGL + W G MS+ GGG KV LLK L++
Sbjct: 5 LLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWPGEAMSA-GGGLKVRLLKKALEK 63
Query: 94 MDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGS 153
++++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP V
Sbjct: 64 HADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPVVSD 123
Query: 154 GYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLF 213
G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD +F
Sbjct: 124 GKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCRIF 183
Query: 214 QNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSG 272
QNL G+L+++ L F++ + V N Y+T PV+IHGNG +K++LN GNY+ + W +G
Sbjct: 184 QNLDGALDEVVLKFEMGQ-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFETG 242
Query: 273 CTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQ 331
C C+ ++ L + + P+VL+ VFI++PT FL F ++ L+YP K++ +F++N++
Sbjct: 243 CAVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFQRLLRLHYPRKRLRLFIHNHE 302
Query: 332 EYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSH 390
++H + ++ +++VK + + S +ARN+ + +G +YF VD+D
Sbjct: 303 QHHKAQVEQFLAEHGDEYQSVKLVGPEVRMASADARNMGADLCRQDRGCTYYFSVDADVA 362
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L P L+ L+ +N+++I PL+ R + WSNFWGAL+ADG+YARS DY++I+ G + G
Sbjct: 363 LTEPRTLRLLIEQNKNVITPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR--VG 420
Query: 451 IWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
+WNVPYI+N YL+K S ++A +T ++ + +D DMAFC N+R + + + + + +GH
Sbjct: 421 VWNVPYISNIYLIKGSALRAELQETDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHSFGH 480
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
L+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI TE
Sbjct: 481 LLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKMVE-MPCPDVYWFPIFTE 539
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+ P+
Sbjct: 540 TACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQINFEREWHKFLVEYIAPM 599
Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGCRF
Sbjct: 600 TEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLVPHHDASTFTINIALNRVGVDYEGGGCRF 658
Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 659 LRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 703
>gi|444728170|gb|ELW68634.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Tupaia
chinensis]
Length = 727
Score = 658 bits (1697), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/709 (44%), Positives = 465/709 (65%), Gaps = 11/709 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGMSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHANKEDLVILFTDSYDVVFASGPRELLKKFRQAKSQVVFSAEELIYPDRRLEAKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++K+L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLKKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINISLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +KI+LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKIQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC C+ + L + + P VL+ VFI++PT FL F ++ L YP K++ +F++
Sbjct: 264 ETGCAVCDEGLGSLKGIGDEALPIVLVGVFIEQPTPFLSLFFQRLRRLRYPQKRMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H + ++ + +++VK + V + +ARN+ + + +YF VD+
Sbjct: 324 NHEQHHKAQVERFLAEHGSEYQSVKLVGPEVRVATADARNMGADLCRQDRSCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L PD L+ L+ +N+++IAPLL R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPDSLRLLIEQNKNVIAPLLTRQGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT--IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
G+WNVPYI+N YL+K S ++ T ++T ++ +D DMAFC NLR + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALR-TELQTTDLFHHRKLDPDMAFCANLRQQDAFMFLTNRH 500
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
+GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFP
Sbjct: 501 TFGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWFP 559
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I T+ C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y
Sbjct: 560 IFTDAACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEY 619
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGG
Sbjct: 620 IAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGG 678
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GCRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 679 GCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 727
>gi|281349272|gb|EFB24856.1| hypothetical protein PANDA_011823 [Ailuropoda melanoleuca]
Length = 702
Score = 658 bits (1697), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/707 (44%), Positives = 464/707 (65%), Gaps = 9/707 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
D LV+TVA+ ET+G++RF +S + +++ LGL + W G SS GGG KV LLK L
Sbjct: 1 DNLLVLTVATRETEGFRRFKRSGQFFNYKIQALGLGEDWSGEKGSSAGGGLKVRLLKKAL 60
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
++ ++++IL DSYDV+ G ++L++F + +VF AE L +PD L KYPAV
Sbjct: 61 EKHADKENLVILFIDSYDVLFASGPRELLKKFRQAKSQVVFSAEELIYPDRRLEAKYPAV 120
Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 121 SDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCR 180
Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 181 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFE 239
Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GC C+ ++ L + + P+VL+ VFI++PT FL F ++ L YP K++ +F++N
Sbjct: 240 TGCAVCDEGLRSLRGIGDEALPTVLVGVFIEQPTPFLSLFFQRLLRLRYPRKQMRLFIHN 299
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
++++H + ++ +++VK + V + +ARN+ + + +YF VD+D
Sbjct: 300 HEQHHKAQVEQFLAEHGGEYQSVKLVGPEVRVANADARNMGADLCRQDRSCTYYFSVDAD 359
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L P L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 360 VALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 417
Query: 449 KGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
G+WNVPYI+N YL+K S ++ +T ++ + +D DMAFC N+R + + + + + +
Sbjct: 418 VGVWNVPYISNVYLIKGSALRGELQQTDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHTF 477
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
GHL+ +N+ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 478 GHLLSLDNYQTSHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLV-EMPCPDVYWFPIF 536
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
TE C E V+ ME YG+WS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 537 TEAACDELVEEMEHYGRWSLGNNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYIA 596
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 597 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGC 655
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 656 RFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 702
>gi|301774781|ref|XP_002922812.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
[Ailuropoda melanoleuca]
Length = 737
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/707 (44%), Positives = 464/707 (65%), Gaps = 9/707 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
D LV+TVA+ ET+G++RF +S + +++ LGL + W G SS GGG KV LLK L
Sbjct: 36 DNLLVLTVATRETEGFRRFKRSGQFFNYKIQALGLGEDWSGEKGSSAGGGLKVRLLKKAL 95
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
++ ++++IL DSYDV+ G ++L++F + +VF AE L +PD L KYPAV
Sbjct: 96 EKHADKENLVILFIDSYDVLFASGPRELLKKFRQAKSQVVFSAEELIYPDRRLEAKYPAV 155
Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 156 SDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCR 215
Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 216 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFE 274
Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GC C+ ++ L + + P+VL+ VFI++PT FL F ++ L YP K++ +F++N
Sbjct: 275 TGCAVCDEGLRSLRGIGDEALPTVLVGVFIEQPTPFLSLFFQRLLRLRYPRKQMRLFIHN 334
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
++++H + ++ +++VK + V + +ARN+ + + +YF VD+D
Sbjct: 335 HEQHHKAQVEQFLAEHGGEYQSVKLVGPEVRVANADARNMGADLCRQDRSCTYYFSVDAD 394
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L P L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 395 VALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 452
Query: 449 KGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
G+WNVPYI+N YL+K S ++ +T ++ + +D DMAFC N+R + + + + + +
Sbjct: 453 VGVWNVPYISNVYLIKGSALRGELQQTDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHTF 512
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
GHL+ +N+ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 513 GHLLSLDNYQTSHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLV-EMPCPDVYWFPIF 571
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
TE C E V+ ME YG+WS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 572 TEAACDELVEEMEHYGRWSLGNNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYIA 631
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 632 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGC 690
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 691 RFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 737
>gi|443689849|gb|ELT92140.1| hypothetical protein CAPTEDRAFT_182861 [Capitella teleta]
Length = 701
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/707 (47%), Positives = 451/707 (63%), Gaps = 15/707 (2%)
Query: 37 ITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSS-LGGGYKVNLLKNELDEMD 95
+TVA++ TDG++RFI+S E L VK LG+ Q W GGD+ GGG KVNLLK L+E+
Sbjct: 1 MTVATDNTDGFQRFIRSTETFNLDVKVLGMGQKWEGGDIVKYAGGGQKVNLLKEGLEELK 60
Query: 96 ITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG-SG 154
D+I++ DSYDVI+D G + IL F FDA +VF AE CWPD SL +YP V S
Sbjct: 61 EKKDLIVMFVDSYDVIMDAGADAILAAFKKFDARVVFSAEGFCWPDASLAHEYPEVKMSE 120
Query: 155 YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQ 214
RYLNSGGFIGYA DI +LI S+++++DDQL+Y FLD+TLR K I LD+ +FQ
Sbjct: 121 KRYLNSGGFIGYATDIYKLIGGSSLRSDDDDQLFYTKSFLDKTLREKLGIKLDSKGEIFQ 180
Query: 215 NLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGC 273
NL G+L+D+K+ F +L N K P++IHGNG K N+ NYL W T GC
Sbjct: 181 NLNGALDDVKVKFKGSS-SYLYNMKTGVTPLVIHGNGPIKHHFNALTNYLGGHWTPTGGC 239
Query: 274 TRC-NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQE 332
C HLD+ K + +P V++++FI++PTAFL+EF I NL+YP KI ++++ + E
Sbjct: 240 NGCKQRTIHLDATKTENYPQVMMAIFIEQPTAFLQEFFYNIGNLSYPKSKIDLYLHYSDE 299
Query: 333 YHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLD 392
D+++ F + + S +N ARN A+E K ++ F VD D L+
Sbjct: 300 SSKKYVDEFLERNGDEFGSKQIETPVSELNDWTARNKALEKCNSKKCEYLFTVDGDVQLE 359
Query: 393 NPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIW 452
+ + L L+ N S+IAPLL RP K WSNFWG+L+ DGFY RS DY I+ G Q KG W
Sbjct: 360 DHNTLVDLIQYNRSVIAPLLSRPGKLWSNFWGSLSPDGFYKRSDDYAEIVTGRQ--KGQW 417
Query: 453 NVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVD 512
NVPYI+ L+ ++ + + YT + +D DMA C +R KGI + +D+ ++YG LVD
Sbjct: 418 NVPYISQSLLIHGYLVPS--LLGGYTDSDLDSDMAICKRMREKGIFMYVDNQKKYGLLVD 475
Query: 513 SENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSL--LPDTVNNQPCPDVFWFPIVTEK 570
SE FDP K + ++Y + N W+ +Y+HPE+ + L +P + QPCPDVFW PIVT K
Sbjct: 476 SEQFDPSKAHGDLYMIFDNREMWEKKYLHPEFNRYLNTVPFSELEQPCPDVFWLPIVTTK 535
Query: 571 FCHEFVQIMEAYGQWSDGTNN---DKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
FC + + ME YG+WS G + D RL YE VPT DIH Q+ W E L+ Y+
Sbjct: 536 FCWDLIDEMEHYGKWSGGGHQPAVDDRLGGSYENVPTVDIHTNQIDWEPQWLEILKSYIG 595
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
P + F GY+ E RA M+FVVRY P EQ L+PH DSS+YTINIALN+ G+D+ GGG
Sbjct: 596 PYSGKVFEGYYTE-ARAHMNFVVRYTPGEQDRLKPHSDSSSYTINIALNRPGIDFTGGGT 654
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RFIR NC+VT R GW+LMH GRLTH+HEGL T GTRYIM+SF+DP
Sbjct: 655 RFIRQNCSVTNARQGWLLMHAGRLTHFHEGLPTTGGTRYIMVSFIDP 701
>gi|390465346|ref|XP_003733390.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
[Callithrix jacchus]
Length = 727
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/708 (44%), Positives = 468/708 (66%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LVITVA+ ET+G++RF +SA+ +++ LGL + W + GGG KV LLK
Sbjct: 25 EDNLLVITVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKRTLAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYPA
Sbjct: 85 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPA 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWDGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L ++ + P+VL+ VFI++PT F+ F ++ L+YP K I +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIEDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPRKHIRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H ++++ + +++VK + + + +ARN+ + + +YF VD+
Sbjct: 324 NHEQHHKTQVEEFLAEHGSEYQSVKLVGPEVRMVNADARNMGADLCRQDRSCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++A ++ +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRAELQSPDLFHHRKLDPDMAFCANIRQQDVFMFLTNRHG 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
GHL+ +N+ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 LGHLLSLDNYRTTHLHNDLWEVFSNPEDWKEKYIHVNYTKALAGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|118403473|ref|NP_001072340.1| lysyl hydroxylase 1 precursor [Xenopus (Silurana) tropicalis]
gi|111305666|gb|AAI21423.1| lysyl hydroxylase [Xenopus (Silurana) tropicalis]
Length = 722
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/719 (44%), Positives = 472/719 (65%), Gaps = 19/719 (2%)
Query: 20 SVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLG 79
S++C N D LV+T+A+ ETDG KRF +SA +VK LGL + WLG
Sbjct: 19 SMNC---ANASADNLLVLTIATEETDGLKRFQRSAHSFNYKVKVLGLGEEWLGE------ 69
Query: 80 GGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCW 139
G KV L+K L+ +D+IIL T+SYDVI G ++L++F + +VF AE + +
Sbjct: 70 -GQKVRLMKFALEPYADKEDLIILFTESYDVIFASGPGELLKKFRQAKSKVVFSAESVAY 128
Query: 140 PDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLR 199
PD L KYPAVG G R+L SG FIGYA + +++++ K+++ DQL+Y LFLD R
Sbjct: 129 PDRHLESKYPAVGEGKRFLGSGAFIGYATHLYKMVADWDGKDKDSDQLFYTKLFLDPVKR 188
Query: 200 TKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNS 259
K I LD +FQNLYGS ED+ L F+ + V Y+T PV+IHGNG +K+ LN
Sbjct: 189 GKINITLDHRCRIFQNLYGSAEDVALKFE-NGRVRARYLVYDTLPVLIHGNGPTKLHLNY 247
Query: 260 FGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLN 317
GNY+ + W SGC C+ +++L+ L D FP V+I +FI++PT F+ EF ++ NLN
Sbjct: 248 LGNYIPRVWTFESGCNVCDEGVRNLEGLTVDTFPLVVIGIFIEQPTPFVSEFFKRLNNLN 307
Query: 318 YPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK 377
YP K+I +++ N++ +H ++++ + T + VK + + + ++RN ++
Sbjct: 308 YPKKRIQLYISNHEPHHQRRVENFLQAYGTQYSFVKTVGPDENSDFADSRNKGMDMCRQT 367
Query: 378 G-VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSF 436
++YF +D+ L N ++L+ L+ +N+S+IAPL+ R WSNFWGAL++DG+YARS
Sbjct: 368 PECEYYFSIDAPVVLKNINILRILIEQNKSVIAPLVSRTANLWSNFWGALSSDGYYARSE 427
Query: 437 DYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNK 495
DY++I+ + G+WNVPYI++ YL+K S++++ + I+ + D+DMAFC N+R +
Sbjct: 428 DYIHIVQRQR--IGVWNVPYISSVYLVKGSILRSKLSQNDIFHSGTQDFDMAFCHNIRQQ 485
Query: 496 GIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNN 555
GI + + + QE+GH++ EN+ + +++E+ N DW +YIHP + ++L V
Sbjct: 486 GIFMFVTNRQEFGHILSLENYKTTHLHNDLWEIFENTEDWKEKYIHPNHSEALKGKLVE- 544
Query: 556 QPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLA 615
PCPDV+WFPI +E C+E V+ ME +G+WS G+N D RL+ GYE VPT DIHM Q+G
Sbjct: 545 MPCPDVYWFPIFSETTCNELVEEMENFGKWSSGSNKDNRLQGGYENVPTIDIHMNQIGYE 604
Query: 616 GVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIAL 675
W + L ++ PL E+ F GY+ + ++FVVRY+PDEQP L PHHD+ST+TINIAL
Sbjct: 605 KEWHKILLDFIAPLTEKLFPGYYTR-AQFDLAFVVRYKPDEQPLLEPHHDASTFTINIAL 663
Query: 676 NQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
N VG DYEGGGCRF+RYNC+V A R GW LMHPGRLTHYHEGL+VT+GTRYI++SFVDP
Sbjct: 664 NSVGQDYEGGGCRFLRYNCSVRAPRKGWALMHPGRLTHYHEGLRVTKGTRYIVVSFVDP 722
>gi|397502970|ref|XP_003822109.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
isoform 1 [Pan paniscus]
Length = 727
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/708 (44%), Positives = 467/708 (65%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVDKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P+VL+ VFI++PT F+ F ++ L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H ++++ + +++VK + + + +ARN+ + + +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC+V A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|190074|gb|AAA60116.1| lysyl hydroxylase [Homo sapiens]
Length = 727
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/708 (43%), Positives = 467/708 (65%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P+VL+ VFI++PT F+ F ++ L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H ++++ + +++VK + + + +ARN+ + + +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|351713694|gb|EHB16613.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Heterocephalus
glaber]
Length = 764
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/710 (44%), Positives = 467/710 (65%), Gaps = 9/710 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLK 88
+ D LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG K+ LLK
Sbjct: 60 LGSDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWGVERGTSAGGGQKIRLLK 119
Query: 89 NELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKY 148
L++ + +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KY
Sbjct: 120 KALEKHEDKEDLVILFTDSYDVVFASGPRELLKKFRQARSRVVFSAEELIYPDRRLEAKY 179
Query: 149 PAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDT 208
P V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 180 PMVSDGKRFLGSGGFIGYAPNLSKLVAKWEGQDSDSDQLFYTKIFLDPEKREQINISLDH 239
Query: 209 LANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW 268
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ W
Sbjct: 240 RCRIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPHFW 298
Query: 269 K-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
+GCT C+ ++ L + + P VL+ VFI++PT FL F ++ L+YP ++ +F
Sbjct: 299 TFETGCTVCDEGLRSLKGIGDEALPMVLVGVFIEQPTPFLSLFFQRLLRLHYPRSRMRLF 358
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE-NSLHKGVDFYFYV 385
+++++++H + ++ +++VK + +++ ARN+ + + +YF V
Sbjct: 359 IHSHEQHHKAQVEQFLAEHGGEYQSVKLVGPEVRMSNANARNMGADLCRQEQTCTYYFSV 418
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L PD L+ L+ +N+++IAPL++RP + WSNFWGAL+ADG+YARS DY++I++G
Sbjct: 419 DADVALTEPDSLRLLIEQNKNVIAPLMMRPGRLWSNFWGALSADGYYARSEDYVDIVHGR 478
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
+ GIWNVPYI+N YL+K S ++A T ++ +D DMAFC N+R + + + + +
Sbjct: 479 R--VGIWNVPYISNIYLIKGSALRAELQHTDLFHHRKLDPDMAFCANIRQQEVFMFLTNR 536
Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
+GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WF
Sbjct: 537 HSFGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWF 595
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
PI TE C E VQ ME +GQWS G N D R++ GYE VPT DIHM Q+ W +FL +
Sbjct: 596 PIFTEVACDELVQEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVE 655
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
Y+ PL E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VG DYEG
Sbjct: 656 YIAPLTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRVGEDYEG 714
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGCRF+RYNC+V A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 715 GGCRFLRYNCSVQAPRKGWALMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 764
>gi|16741721|gb|AAH16657.1| Procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1 [Homo sapiens]
Length = 727
Score = 655 bits (1691), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/708 (43%), Positives = 467/708 (65%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQSRSQVVFSAEELIYPDRRLETKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P+VL+ VFI++PT F+ F ++ L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H ++++ + +++VK + + + +ARN+ + + +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|410340317|gb|JAA39105.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1 [Pan
troglodytes]
Length = 727
Score = 655 bits (1690), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/708 (44%), Positives = 467/708 (65%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVDKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P+VL+ VFI++PT F+ F ++ L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H ++++ + +++VK + + + +ARN+ + + +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC+V A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|114554002|ref|XP_001142788.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
isoform 7 [Pan troglodytes]
Length = 727
Score = 655 bits (1690), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/708 (44%), Positives = 467/708 (65%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVDKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPNKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P+VL+ VFI++PT F+ F ++ L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H ++++ + +++VK + + + +ARN+ + + +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC+V A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|307748828|gb|ADN91862.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1 [Canis lupus
familiaris]
Length = 727
Score = 655 bits (1689), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/708 (44%), Positives = 468/708 (66%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +S + +++ LGL + W G +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATTETEGFRRFKRSGQFFNYKIQALGLGEDWTGEKGTSAGGGLKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F +VF AE L +PD L KYPA
Sbjct: 85 LEKHADKEDLVILFTDSYDVVFASGPRELLKKFRQARGQVVFSAEELIYPDRRLEAKYPA 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA + +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLFYTQIFLDPEKRERINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC+ C+ ++ L + + P+VL+ VFI++PT FL F ++ L+YP K++ +F++
Sbjct: 264 ETGCSVCDEGLRSLRGIGEEALPTVLVGVFIEQPTPFLSLFFLRLLRLHYPRKQMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H + ++ + +++VK + V + +ARN+ + +G +YF VD+
Sbjct: 324 NHEQHHKAQVEQFLAEHGSEYQSVKLVGPEVRVANADARNMGADLCRQDRGCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++A T ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRAELQHTDLFHHSRLDPDMAFCANIRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 FGHLLSLDSYQTSHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLV-EMPCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 561 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|380813818|gb|AFE78783.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Macaca
mulatta]
gi|384943152|gb|AFI35181.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Macaca
mulatta]
Length = 727
Score = 654 bits (1688), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/708 (43%), Positives = 467/708 (65%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P+VL+ +FI++PT F+ F ++ L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGMFIEQPTPFVSLFFQRLLQLHYPRKHMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H ++++ + +++VK + + + +ARN+ + + +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -IGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTAHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 561 FTEAACDELVEEMEHFGQWSLGDNKDSRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|410262758|gb|JAA19345.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1 [Pan
troglodytes]
gi|410302672|gb|JAA29936.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1 [Pan
troglodytes]
Length = 727
Score = 654 bits (1688), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/708 (44%), Positives = 466/708 (65%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVDKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P VL+ VFI++PT F+ F ++ L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPMVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H ++++ + +++VK + + + +ARN+ + + +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC+V A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|32307144|ref|NP_000293.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Homo
sapiens]
gi|78099790|sp|Q02809.2|PLOD1_HUMAN RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1;
AltName: Full=Lysyl hydroxylase 1; Short=LH1; Flags:
Precursor
gi|20149013|gb|AAM12752.1| lysyl hydroxylase [Homo sapiens]
gi|119592130|gb|EAW71724.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1, isoform CRA_a
[Homo sapiens]
gi|119592131|gb|EAW71725.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1, isoform CRA_a
[Homo sapiens]
gi|168277976|dbj|BAG10966.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor
[synthetic construct]
Length = 727
Score = 654 bits (1688), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/708 (43%), Positives = 466/708 (65%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL DSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFADSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P+VL+ VFI++PT F+ F ++ L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H ++++ + +++VK + + + +ARN+ + + +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|197102026|ref|NP_001127428.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Pongo
abelii]
gi|62900718|sp|Q5R9N3.1|PLOD1_PONAB RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1;
AltName: Full=Lysyl hydroxylase 1; Short=LH1; Flags:
Precursor
gi|55729596|emb|CAH91527.1| hypothetical protein [Pongo abelii]
Length = 727
Score = 654 bits (1688), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/708 (43%), Positives = 466/708 (65%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTRIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P+VL+ VFI++PT F+ F ++ L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H ++++ + +++VK + + + +ARN+ + + +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPSSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|395528052|ref|XP_003766147.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
[Sarcophilus harrisii]
Length = 737
Score = 654 bits (1688), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/712 (44%), Positives = 464/712 (65%), Gaps = 12/712 (1%)
Query: 28 NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNL 86
NI DK LVITVA+ ETDGY RF+QSA+ VK LG + W GGD ++++GGG KV L
Sbjct: 33 NIPTDKLLVITVATKETDGYHRFMQSAKYFNYTVKVLGKGEEWKGGDKVNAIGGGQKVRL 92
Query: 87 LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
LK + +D+++ T+ YDVI GG ++L++F + +VF A+ + WPD L D
Sbjct: 93 LKEAMGSYADQEDLVVFFTECYDVIFAGGPEELLKKFQKINHKVVFSADGILWPDKRLAD 152
Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
KYP V G R+LNSGGF+GYA I ++ ++++ +DDQL+Y +++D R I L
Sbjct: 153 KYPIVHIGKRFLNSGGFVGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREALNITL 212
Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
D +FQ L G+++++ L F+ + NT Y T PV+I+GNG +KI+LN FGNY+
Sbjct: 213 DHKCRIFQALNGAIDEVLLKFENGK-ARAKNTFYETLPVVINGNGPTKIQLNYFGNYIPN 271
Query: 267 SW-KTSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKIS 324
+W + +GCT C+L + L +LK +P V + VFI++PT FL FL+ + L YP + +
Sbjct: 272 AWTQENGCTLCDLDVIDLSTLK--DYPRVTVGVFIEQPTPFLPRFLDLLLTLTYPKEALK 329
Query: 325 MFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK-GVDFYF 383
+F++N++ YH ++ K + KN+K + ++ EARN+ ++ D+YF
Sbjct: 330 LFIHNSEVYHEKHIKEFWEKAKDVIKNIKIVGPEENLSQAEARNMGMDLCRQDDKCDYYF 389
Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
+D+D L NP L+ L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+
Sbjct: 390 SLDADVVLTNPKTLEILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQ 449
Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKID 502
G + G+WN+PY+ N YL+K +++ N + + + +D DMA C N R G+ + I
Sbjct: 450 GSR--VGVWNIPYMANVYLIKGQTLRSEMNERNYFVRDKLDPDMALCRNAREMGVFMYIS 507
Query: 503 STQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
+ E+G L+ + N++ N +++++ NP+DW +YI+P Y K + + + QPCPDVF
Sbjct: 508 NRHEFGRLLSTANYNISHYNNDLWQIFENPVDWKEKYINPNYSK-IFTENLVEQPCPDVF 566
Query: 563 WFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFL 622
WFPI +EK C E V+ ME YGQWS G ++D R+ GYE VPT DIHMKQ+GL W F+
Sbjct: 567 WFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIGLENEWLHFI 626
Query: 623 RKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDY 682
R+++ P+ + F GY+ + A ++FVV+Y PD Q SLRPHHDSST+TINIALN VG D+
Sbjct: 627 REFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRSLRPHHDSSTFTINIALNNVGQDF 685
Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+GGGC+F+RYNC++ + R GW MHPGRLTH HEGL + GTRYI +SF+DP
Sbjct: 686 QGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPIINGTRYIAVSFIDP 737
>gi|417404209|gb|JAA48874.1| Putative procollagen-lysine2-oxoglutarate 5-dioxygenase 1 [Desmodus
rotundus]
Length = 728
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/709 (44%), Positives = 468/709 (66%), Gaps = 10/709 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W G +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWPGEKETSAGGGLKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ ++++IL TDSYDV+ G ++L++F + +VF A+ L +PD L KYP
Sbjct: 85 LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAQELIYPDRRLEAKYPM 144
Query: 151 VGS-GYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPAKREQINITLDHR 204
Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 CRIFQNLDGALDEVVLKFEMGH-VRARNLVYDTLPVLIHGNGPTKLQLNYLGNYIPRFWT 263
Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
+GC C+ ++ L + + P VL+ VFI++PT FL F ++ L+YP +++ +F+
Sbjct: 264 FETGCVVCDEGLRSLKGIGDEALPVVLVGVFIEQPTPFLSLFFQRLLRLHYPRRRMRLFI 323
Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
+N++++H + ++ + +++VK + V + +ARN+ + + +YF VD
Sbjct: 324 HNHEKHHKTQVEQFLAEHGSEYQSVKLVGPEVRVANADARNMGADLCRQDRSCTYYFSVD 383
Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
+ L P +L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 AAVALTEPKILRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR 443
Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
G+WNVPYI+N YL+K S ++A +T ++ + +D DMAFC N+R + + + + +
Sbjct: 444 --VGVWNVPYISNIYLIKGSALRAELQQTDLFHHSKLDPDMAFCANIRQQDVFMFLTNRH 501
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
+GHL+ +N+ + +++E+ NP DW +YIH Y K+L V PCPDV+WFP
Sbjct: 502 TFGHLLSLDNYQTTHLHNDLWEVFNNPEDWKEKYIHKNYTKALAGKLVE-MPCPDVYWFP 560
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y
Sbjct: 561 IFTETACDELVEEMEHYGQWSLGDNKDSRIQGGYENVPTIDIHMNQISFEREWHKFLVEY 620
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGG
Sbjct: 621 IAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLVPHHDASTFTINIALNRVGVDYEGG 679
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GCRF+RYNC+V A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 GCRFLRYNCSVRAPRKGWALMHPGRLTHYHEGLPTTKGTRYISVSFVDP 728
>gi|189053347|dbj|BAG35176.1| unnamed protein product [Homo sapiens]
Length = 727
Score = 654 bits (1686), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/708 (43%), Positives = 465/708 (65%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LVITVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVITVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL DSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFADSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P+VL+ VFI++PT F+ F ++ L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H ++++ + +++VK + + + +ARN+ + + +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGVLQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHT 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFPRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727
>gi|397502972|ref|XP_003822110.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
isoform 2 [Pan paniscus]
Length = 774
Score = 653 bits (1685), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/707 (43%), Positives = 466/707 (65%), Gaps = 9/707 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
D LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK L
Sbjct: 73 DNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVDKGTSAGGGQKVRLLKKAL 132
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP V
Sbjct: 133 EKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPVV 192
Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 193 SDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCR 252
Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 253 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFE 311
Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GCT C+ ++ L + + P+VL+ VFI++PT F+ F ++ L+YP K + +F++N
Sbjct: 312 TGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIHN 371
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
++++H ++++ + +++VK + + + +ARN+ + + +YF VD+D
Sbjct: 372 HEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDAD 431
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 432 VALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 489
Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 490 VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTL 549
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 550 GHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIF 608
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 609 TEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYIA 668
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 669 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGC 727
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RF+RYNC+V A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 728 RFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 774
>gi|431906315|gb|ELK10512.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Pteropus alecto]
Length = 727
Score = 652 bits (1683), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/708 (43%), Positives = 467/708 (65%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W ++S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKVTSAGGGLKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ ++++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQAKSQVVFSAEELIYPDRRLEVKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWKGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC C+ ++ L + + P VL+ VFI++PT FL F ++ +YP K++ +F++
Sbjct: 264 ETGCVVCDEGLRSLKGIGDEALPIVLVGVFIEQPTPFLSLFFQRLLRFHYPRKRMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H + ++ ++++K + V S +ARN+ + +G +YF VD+
Sbjct: 324 NHEQHHKAQVEQFLAEHGDEYQSMKLVGPEVRVASADARNMGADLCRQDRGCTYYFSVDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P +L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ +
Sbjct: 384 DVALTEPKILRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQRRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI++ YL+K S ++A +T ++ + +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISSVYLIKGSALRAELQQTDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHI 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL+ +N+ + +++E+ NP +W +YIH Y K+L V PCPDV+WFPI
Sbjct: 502 FGHLLSLDNYQTTHLHNDLWEVFNNPEEWKEKYIHENYTKALAGKLV-EMPCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 561 FTETACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 727
>gi|326932538|ref|XP_003212372.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
[Meleagris gallopavo]
Length = 730
Score = 652 bits (1683), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/709 (44%), Positives = 473/709 (66%), Gaps = 10/709 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKN 89
E+ LV+TVA+ +T+G++RF +SA+ +++ LGL + W GGD GGG KV LLK+
Sbjct: 27 EENLLVLTVATKQTEGFRRFRRSAQFFNYKIQVLGLDEEWKGGDDKKPAGGGQKVRLLKS 86
Query: 90 ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
L + +D+IIL T+SYDV+ G ++L++F + +VF AE +PD L KYP
Sbjct: 87 ALKQHADKEDLIILFTESYDVLFASGPTELLKKFKQAKSKVVFSAENYIYPDRKLEAKYP 146
Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V G R+L SGGFIGYA ++K+L+ K+++ DQL+Y +FLD R I LD
Sbjct: 147 PVRDGKRFLGSGGFIGYAPNLKKLVEEWKGKDDDSDQLFYTKIFLDPEKRENINISLDHR 206
Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
+ +FQNL G+L+++ L F+ + V N Y+T PVIIHGNG +K++LN GNY+ + W
Sbjct: 207 SRIFQNLNGALDEVVLKFE-NARVRARNLLYDTLPVIIHGNGPTKLQLNYLGNYIPQIWT 265
Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
+GCT C+ ++ L +K + P +LI +FI++PT FL +F ++ NL+YP ++I +F+
Sbjct: 266 FETGCTVCDEGLRSLTGIKDEALPMILIGIFIEQPTPFLSQFFLRLRNLHYPKQRIQIFI 325
Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
+N++++H+ D +++ + +K I + + + EARNL ++ D+YF +D
Sbjct: 326 HNHEQHHSMQVDSFVNEHSKEYLAMKVIGPDDEMENAEARNLGMDLCRKDPDCDYYFSLD 385
Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
++ L N + L+ L+ +N+S+IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ +
Sbjct: 386 AEVVLKNTETLRILIEQNKSVIAPLVSRHEKLWSNFWGALSPDGYYARSEDYVDIVQRRR 445
Query: 447 GGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
G+WNVPYI++ Y++K V+++ + ++ +D DMAFC N+RN+G+ + + +
Sbjct: 446 --VGLWNVPYISSVYMVKGKVLRSELDEGDLFHGGKLDADMAFCHNVRNQGVFMYLTNRH 503
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
++GH++ EN+ + +++++ NP DW +YIH Y +L V PCPDV+WFP
Sbjct: 504 QFGHILSLENYQTSHLHNDLWQIFSNPEDWREKYIHENYTAALKGKLVE-MPCPDVYWFP 562
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I T+ C E V+ ME YG+WS G N D R++ GYE VPT DIHM Q+G W +FL Y
Sbjct: 563 IFTDTACDELVEEMEHYGKWSTGDNTDSRIQGGYENVPTIDIHMNQIGFEREWYKFLLDY 622
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ P+ E+ + GY+ + + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG+DYEGG
Sbjct: 623 IAPITEKLYPGYYTK-TQFELAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGIDYEGG 681
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GCRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SF+DP
Sbjct: 682 GCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFIDP 730
>gi|114553998|ref|XP_514394.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
isoform 12 [Pan troglodytes]
Length = 774
Score = 652 bits (1683), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/707 (43%), Positives = 466/707 (65%), Gaps = 9/707 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
D LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK L
Sbjct: 73 DNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVDKGTSAGGGQKVRLLKKAL 132
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP V
Sbjct: 133 EKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPVV 192
Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 193 SDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCR 252
Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 253 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPNKLQLNYLGNYIPRFWTFE 311
Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GCT C+ ++ L + + P+VL+ VFI++PT F+ F ++ L+YP K + +F++N
Sbjct: 312 TGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIHN 371
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
++++H ++++ + +++VK + + + +ARN+ + + +YF VD+D
Sbjct: 372 HEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDAD 431
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 432 VALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 489
Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 490 VGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTL 549
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 550 GHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIF 608
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 609 TEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYIA 668
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 669 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGC 727
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RF+RYNC+V A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 728 RFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 774
>gi|194386778|dbj|BAG61199.1| unnamed protein product [Homo sapiens]
gi|221045958|dbj|BAH14656.1| unnamed protein product [Homo sapiens]
Length = 774
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 309/707 (43%), Positives = 465/707 (65%), Gaps = 9/707 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
D LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK L
Sbjct: 73 DNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKAL 132
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
++ +D++IL DSYDV+ G ++L++F + +VF AE L +PD L KYP V
Sbjct: 133 EKHADKEDLVILFADSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPVV 192
Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 193 SDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCR 252
Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 253 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFE 311
Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GCT C+ ++ L + + P+VL+ VFI++PT F+ F ++ L+YP K + +F++N
Sbjct: 312 TGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIHN 371
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
++++H ++++ + +++VK + + + +ARN+ + + +YF VD+D
Sbjct: 372 HEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDAD 431
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 432 VALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 489
Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 490 VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHTL 549
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 550 GHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIF 608
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 609 TEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYIA 668
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 669 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGC 727
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 728 RFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 774
>gi|54111425|ref|NP_001005618.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Gallus
gallus]
gi|126651|sp|P24802.1|PLOD1_CHICK RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1;
AltName: Full=Lysyl hydroxylase 1; Short=LH1; Flags:
Precursor
gi|212282|gb|AAA48945.1| lysyl hydroxylase [Gallus gallus]
Length = 730
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/709 (44%), Positives = 471/709 (66%), Gaps = 10/709 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKN 89
E+ LV+TVA+ +T+G++RF +SA+ +++ LGL + W GGD GGG KV LLK+
Sbjct: 27 EENLLVLTVATKQTEGFRRFRRSAQFFNYKIQVLGLDEEWKGGDDKKPAGGGQKVRLLKS 86
Query: 90 ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
L + +D++IL +SYDV+ G ++L++F + +VF AE +PD L KYP
Sbjct: 87 ALKQHADKEDLVILFIESYDVLFASGPTELLKKFKQAKSKVVFSAENYIYPDRKLEAKYP 146
Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V G R+L SGGFIGYA ++K+L+ K+++ DQL+Y +FLD R I LD
Sbjct: 147 PVRDGKRFLGSGGFIGYAPNLKKLVEEWKGKDDDSDQLFYTKIFLDPEKRENINISLDHR 206
Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
+ +FQNL G+L+++ L F+ + V N Y+T PVIIHGNG +K++LN GNY+ + W
Sbjct: 207 SRIFQNLNGALDEVVLKFE-NARVRARNLLYDTLPVIIHGNGPTKLQLNYLGNYIPQIWT 265
Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
+GCT C+ ++ L +K + P +LI +FI++PT FL +F ++ NL+YP ++I +F+
Sbjct: 266 FETGCTVCDEGLRSLTGIKDEALPMILIGIFIEQPTPFLSQFFLRLRNLHYPKQRIQIFI 325
Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
+N++E+H+ D ++ + +K I + V + EARNL ++ D+YF +D
Sbjct: 326 HNHEEHHSMQVDSFVKEHSKEYLAMKVIGPDDEVENAEARNLGMDLCRKDPDCDYYFSLD 385
Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
++ L N + L+ L+ +N+S+IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ +
Sbjct: 386 AEVVLKNTETLRILIEQNKSVIAPLVSRHEKLWSNFWGALSPDGYYARSEDYVDIVQRRR 445
Query: 447 GGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
G+WNVPYI++ Y++K V+++ + ++ +D DMAFC N+RN+G+ + + +
Sbjct: 446 --VGLWNVPYISSVYMVKGKVLRSELDEGDLFHGGKLDADMAFCHNVRNQGVFMYLTNRH 503
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
++GH++ EN+ + +++++ NP DW +YIH Y +L V PCPDV+WFP
Sbjct: 504 QFGHILSLENYQTTHLHNDLWQIFSNPEDWREKYIHENYTAALKGKLVE-MPCPDVYWFP 562
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I T+ C E V+ ME YG+WS G N D R++ GYE VPT DIHM Q+G W +FL Y
Sbjct: 563 IFTDTACDELVEEMEHYGKWSTGDNTDSRIQGGYENVPTIDIHMNQIGFEREWYKFLLDY 622
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ P+ E+ + GY+ + + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG+DYEGG
Sbjct: 623 IAPITEKLYPGYYTK-TQFELAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGIDYEGG 681
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GCRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SF+DP
Sbjct: 682 GCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFIDP 730
>gi|27806477|ref|NP_776573.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Bos
taurus]
gi|3283055|gb|AAC25107.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase precursor [Bos
taurus]
Length = 726
Score = 651 bits (1680), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/708 (43%), Positives = 466/708 (65%), Gaps = 10/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W G M + GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWPGEAMLA-GGGLKVRLLKKA 83
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ ++++IL TDSYDV+ G ++L++F + +VF AE L +PD L YP
Sbjct: 84 LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEANYPV 143
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 144 VSDGKRFLGSGGFIGYAPNLIKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 203
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQN +G+L+++ L F++ + V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 204 RIFQNFHGALDEVVLKFEMGQ-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 262
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC C+ ++ L + + P+VL+ VFI++PT FL F ++ L+YP K+ +F++
Sbjct: 263 ETGCAVCDEGLRSLKGIGDEALPAVLVGVFIEQPTPFLSLFFQRLLLLHYPQKRFRLFIH 322
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H + ++ +++VK + V + +ARN+ + +G +YF VD+
Sbjct: 323 NHEQHHKAQVEQFLAEHGDEYQSVKLVGPEVRVANADARNMGADLCRQDRGCTYYFSVDA 382
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P L+ L+ +N+++I PL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 383 DVALTEPKTLRLLIEQNKNVITPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 441
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++A +T ++ + +D DMAFC N+R + + + + +
Sbjct: 442 -VGVWNVPYISNIYLIKGSALRAELQETDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHS 500
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 501 FGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKMVE-MPCPDVYWFPI 559
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 560 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQINYEREWHKFLVEYI 619
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINI LN+VGVDYEGGG
Sbjct: 620 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLVPHHDASTFTINIGLNRVGVDYEGGG 678
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEG+ T+GTRYI +SFVDP
Sbjct: 679 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGVPTTKGTRYIAVSFVDP 726
>gi|151301032|ref|NP_001093074.1| lysyl hydroxylase 1 precursor [Takifugu rubripes]
gi|146325988|dbj|BAF61136.1| lysyl hydroxylase 1 [Takifugu rubripes]
Length = 729
Score = 650 bits (1677), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/730 (42%), Positives = 475/730 (65%), Gaps = 12/730 (1%)
Query: 10 LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
L +S FI C + + I E+K LV+TVA+ +TDG++RF++SA+ VK +G +
Sbjct: 7 LWISVCALFILTSCEE-QRIPEEKLLVVTVATKDTDGFRRFLRSAKHFNYTVKVVGRDEK 65
Query: 70 WLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
W+GG+ M + GGG KV LLK+ L+EM D IIL TDSYDV+ G ++L++F
Sbjct: 66 WIGGNYMGAPGGGQKVRLLKSALEEMK-NQDKIILFTDSYDVVFASGPKELLKKFQQARH 124
Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
+VF +E L WPD L DKYP V G R+L SGGFIGY +++E+++ S ++++ DQL+
Sbjct: 125 KVVFSSESLIWPDRHLEDKYPHVREGNRFLGSGGFIGYLANVREMVAEWSGEDDDSDQLF 184
Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIH 248
+ +++D R I LD+ LFQNL GSL+++ L F+ D V N ++T PVIIH
Sbjct: 185 FTRIYIDAAKRKSINITLDSKCRLFQNLLGSLDEVVLKFE-DGRVRARNLLHDTLPVIIH 243
Query: 249 GNGKSKIELNSFGNYLAKSWK-TSGCTRCN-LIKHLDSLKPDQFPSVLISVFIDKPTAFL 306
GNG +K+++N GNY+ +W +GCT C+ + L +L+ ++P V+I +FI++PT F+
Sbjct: 244 GNGPTKLQVNYLGNYIPNAWTFETGCTVCHEEFQPLTALQESEYPLVVIGIFIERPTPFV 303
Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
F ++ L YP + + +N + +H + ++ + +++ V+ + ++ +
Sbjct: 304 SVFFERLLKLQYPKEHAQVVDFNKEAHHEQHVNSFLQEHRNLYRAVELLGPEEAMDGVTS 363
Query: 367 RNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGA 425
RNLA + + D++F VD D L N + LK L+ ++AP++ R + WS+FWGA
Sbjct: 364 RNLAFDMCRQDQNCDYFFSVDIDVVLKNENALKILIEHTLPIVAPMITRTGRLWSSFWGA 423
Query: 426 LNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-TIYTLNSMDY 484
L+ DG+YARS DY++I+ + G+WNVPY++N YL+K ++++ ++ + +D
Sbjct: 424 LSPDGYYARSEDYVDIVQRRR--VGVWNVPYVSNVYLLKGGLLRSELTDFELFNSHILDP 481
Query: 485 DMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY 544
DMAFC N+R+KGI + + + +GH++ +EN+ + +++++ NPLDW RYIHP Y
Sbjct: 482 DMAFCHNIRSKGIFMYVTNLHTFGHILSTENYQTGHLHNDLWQIFENPLDWQERYIHPNY 541
Query: 545 QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPT 604
++ D + PCPDV+WFPI T++ C V+ ME +G+WS G N D R++ GYE VPT
Sbjct: 542 -THIMKDHLIETPCPDVYWFPIFTDEACDHIVEEMENFGRWSGGANTDPRIQGGYENVPT 600
Query: 605 RDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH 664
DIHM QV W +FL +Y+ P+ E+ + GY+ + + ++FVVRY+PDEQP LRPHH
Sbjct: 601 IDIHMNQVNFEKEWHKFLLEYIAPITEKMYPGYYTK-AQFDLAFVVRYKPDEQPFLRPHH 659
Query: 665 DSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGT 724
D+ST+TINIALNQVG+DY+GGGCRF+RYNC+V A R GW LMHPGRLTHYHEGL T G
Sbjct: 660 DASTFTINIALNQVGLDYQGGGCRFLRYNCSVEAPRKGWALMHPGRLTHYHEGLPTTAGV 719
Query: 725 RYIMISFVDP 734
RYI +SF+DP
Sbjct: 720 RYISVSFIDP 729
>gi|190338002|gb|AAI62509.1| Plod2 protein [Danio rerio]
Length = 733
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/730 (43%), Positives = 471/730 (64%), Gaps = 14/730 (1%)
Query: 10 LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
++++CV + + NK +I +K LV+TVA+ ETDG+ RF+QSA VK LG+ +
Sbjct: 13 MLVTCVHCTLGMETNK--DIPTEKLLVLTVATQETDGFLRFMQSANYFNFNVKVLGMGEE 70
Query: 70 WLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
W GGD+ S+GGG KV LLK ++ +D +D+++L DSYD+I GG +IL +F +
Sbjct: 71 WKGGDVGRSIGGGQKVRLLKEAMESLDQQEDLVVLFVDSYDLIFAGGAEEILRKFQQSNH 130
Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
+VF AE + WPD+ L +KYP+V SG R+LNSGG IGYA I++L+S + + +DDQL+
Sbjct: 131 KVVFAAEGIIWPDSQLAEKYPSVRSGKRFLNSGGIIGYAPYIQKLVSQWDLHDNDDDQLF 190
Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIH 248
Y +++D R K + LD +FQNL G+L+++ L F E V + NT YN+ P +IH
Sbjct: 191 YTKIYVDPIQREKLNMTLDHKCEIFQNLNGALDEVLLKFG-TERVRVRNTIYNSLPAVIH 249
Query: 249 GNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFL 306
GN +K+ N NY+ +W GCT C+ + L LK +FP V + V+I++PT FL
Sbjct: 250 GNVNTKVYFNYLANYIPNAWNYERGCTICDQDMVDLSQLK--EFPQVTVGVYIEQPTPFL 307
Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
EFL ++ +L+YP K+++F++N++ YH + K +F + K + + EA
Sbjct: 308 PEFLERLLSLDYPKDKLNIFIHNSEVYHEKHIQKFWEENKDVFGSFKAVGPEENLTQGEA 367
Query: 367 RNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGA 425
RN+ ++ D++F +D+D L N LK L+ +N +IAPL+ R K WSNFWGA
Sbjct: 368 RNMGMDVCRRDPSCDYFFNIDADVMLTNRQTLKLLIEQNRKIIAPLVTRHGKLWSNFWGA 427
Query: 426 LNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDY 484
L+ DG+YARS DY++I+ G + G+WN+P++ + YL+K ++ + ++ L +D
Sbjct: 428 LSLDGYYARSEDYIDIVQGKR--VGVWNIPFLAHVYLIKGQTLRNELKERNVFVLEKLDP 485
Query: 485 DMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY 544
DMA C N R+ G+ + + + E+G L+ + N++ N +++++ NPLDW +YIH Y
Sbjct: 486 DMAMCRNARDLGLFMYLTNRHEFGRLISTANYNTSHYNNDLWQIFENPLDWREKYIHANY 545
Query: 545 QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPT 604
+ + + + QPCPDVFWFP+++EK C+E V+ ME +G WS G + DKR+ GYE+VPT
Sbjct: 546 TR-IFTENLLEQPCPDVFWFPVLSEKACNELVEEMENHGTWSGGKHEDKRITGGYESVPT 604
Query: 605 RDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH 664
DIHMKQ+ W F+R+++ P+ + F GY+ + A M+FVV+Y PD Q LRPHH
Sbjct: 605 DDIHMKQINYDQEWLHFIREFISPVTLKVFSGYYTKGY-AIMNFVVKYTPDRQAYLRPHH 663
Query: 665 DSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGT 724
DSST+TINIALN G+D+ GGGCRF RYNC++ + R GW MHPGRLTH HEGL VT GT
Sbjct: 664 DSSTFTINIALNNKGLDFLGGGCRFHRYNCSIESPRKGWSFMHPGRLTHLHEGLPVTNGT 723
Query: 725 RYIMISFVDP 734
RYI +SFVDP
Sbjct: 724 RYIAVSFVDP 733
>gi|194440678|ref|NP_001007378.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 1
precursor [Danio rerio]
gi|70779499|gb|AAZ08242.1| procollagen lysine 2-oxoglutarate 5-dioxygenase 2a isoform [Danio
rerio]
Length = 733
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/730 (43%), Positives = 471/730 (64%), Gaps = 14/730 (1%)
Query: 10 LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
++++CV + + NK +I +K LV+TVA+ ETDG+ RF+QSA VK LG+ +
Sbjct: 13 MLVTCVHCTLGMETNK--DIPTEKLLVLTVATQETDGFLRFMQSANYFNFNVKVLGMGEE 70
Query: 70 WLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
W GGD+ S+GGG KV LLK ++ +D +D+++L DSYD+I GG +IL +F +
Sbjct: 71 WKGGDVGHSIGGGQKVRLLKEAMESLDQQEDLVVLFVDSYDLIFAGGAEEILRKFQQSNH 130
Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
+VF AE + WPD+ L +KYP+V SG R+LNSGG IGYA I++L+S + + +DDQL+
Sbjct: 131 KVVFAAEGIIWPDSQLAEKYPSVRSGKRFLNSGGIIGYAPYIQKLVSQWDLHDNDDDQLF 190
Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIH 248
Y +++D R K + LD +FQNL G+L+++ L F E V + NT YN+ P +IH
Sbjct: 191 YTKIYVDPIQREKLNMTLDHKCEIFQNLNGALDEVLLKFG-TERVRVRNTIYNSLPAVIH 249
Query: 249 GNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFL 306
GN +K+ N NY+ +W GCT C+ + L LK +FP V + V+I++PT FL
Sbjct: 250 GNVNTKVYFNYLANYIPNAWNYERGCTICDQDMVDLSQLK--EFPQVTVGVYIEQPTPFL 307
Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
EFL ++ +L+YP K+++F++N++ YH + K +F + K + + EA
Sbjct: 308 PEFLERLLSLDYPKDKLNIFIHNSEVYHEKHIQKFWEENKDVFGSFKAVGPEENLTQGEA 367
Query: 367 RNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGA 425
RN+ ++ D++F +D+D L N LK L+ +N +IAPL+ R K WSNFWGA
Sbjct: 368 RNMGMDVCRRDPSCDYFFNIDADVMLTNRQTLKLLIEQNRKIIAPLVTRHGKLWSNFWGA 427
Query: 426 LNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDY 484
L+ DG+YARS DY++I+ G + G+WN+P++ + YL+K ++ + ++ L +D
Sbjct: 428 LSLDGYYARSEDYIDIVQGKR--VGVWNIPFLAHVYLIKGQTLRNELKERNVFVLEKLDP 485
Query: 485 DMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY 544
DMA C N R+ G+ + + + E+G L+ + N++ N +++++ NPLDW +YIH Y
Sbjct: 486 DMAMCRNARDLGLFMYLTNRHEFGRLISTANYNTSHYNNDLWQIFENPLDWREKYIHANY 545
Query: 545 QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPT 604
+ + + + QPCPDVFWFP+++EK C+E V+ ME +G WS G + DKR+ GYE+VPT
Sbjct: 546 TR-IFTENLLEQPCPDVFWFPVLSEKACNELVEEMENHGTWSGGKHEDKRITGGYESVPT 604
Query: 605 RDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH 664
DIHMKQ+ W F+R+++ P+ + F GY+ + A M+FVV+Y PD Q LRPHH
Sbjct: 605 DDIHMKQINYDQEWLHFIREFISPVTLKVFSGYYTKGY-AIMNFVVKYTPDRQAYLRPHH 663
Query: 665 DSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGT 724
DSST+TINIALN G+D+ GGGCRF RYNC++ + R GW MHPGRLTH HEGL VT GT
Sbjct: 664 DSSTFTINIALNNKGLDFLGGGCRFHRYNCSIESPRKGWSFMHPGRLTHLHEGLPVTNGT 723
Query: 725 RYIMISFVDP 734
RYI +SFVDP
Sbjct: 724 RYIAVSFVDP 733
>gi|196007006|ref|XP_002113369.1| hypothetical protein TRIADDRAFT_50405 [Trichoplax adhaerens]
gi|190583773|gb|EDV23843.1| hypothetical protein TRIADDRAFT_50405 [Trichoplax adhaerens]
Length = 702
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/708 (45%), Positives = 464/708 (65%), Gaps = 14/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
++ L +TVAS+ TDG++RF +S + L K LG+++ W GG M GGGYK+NLL+ E
Sbjct: 5 KETLLTLTVASDCTDGFQRFNRSCRIYDLNCKILGMNKIWKGGSMEFPGGGYKINLLRRE 64
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L+ + DD II+VTDSYDVI G +ILE+F+ F+AN+VFGAE CWP+ L YP
Sbjct: 65 LERLKDKDD-IIIVTDSYDVIYTAGTQEILEKFHQFNANVVFGAEPYCWPNQELASHYPV 123
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V SG R+LNSGGFIG+A+ I E+I+ RSI++ +DDQLYY ++LD LR K I LD +
Sbjct: 124 VSSGKRFLNSGGFIGHARTIYEIITYRSIEDSDDDQLYYTEIYLDSKLRDKWNIKLDHKS 183
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSK-IELNSFGNYLAKSWK 269
LF NL G+ E++ L D L N Y T P+ +HGNG +K + LN FGNYLA W
Sbjct: 184 VLFHNLNGAQEEVNLIPDNGGKYRLFNEVYQTLPIAVHGNGPTKEVSLNYFGNYLANYWS 243
Query: 270 -TSGCTRC--NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
GC C N I+H LK D +P + +++FI K F + FL +++N YP KI +F
Sbjct: 244 FNDGCIACKENTIEH---LKKDHYPRLSLAIFIHKSAPFTDVFLQRLSNQQYPKDKIDLF 300
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVD 386
++ + ++H + + +++ + + I + +N +ARNLA+E +++F ++
Sbjct: 301 LHISIDHHLKDTLVWWKKYSSLYASQELIVPSDKINPSKARNLAMEQCQSSNCEYFFSIE 360
Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
+D L N + K L++ N ++++PLL + WSNFWGA++ +G+YARS DY++I+ G +
Sbjct: 361 NDCMLTNNETFKLLMHYNSTIVSPLLFISGRLWSNFWGAIDQNGYYARSKDYIDIVEGRK 420
Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
+GIWNVPYI Y++ S +K ++ D+DM +C +R G + +++ Q
Sbjct: 421 --RGIWNVPYIRGAYMINKSHLKMPDLAFD---EEGDFDMKWCAKMRKSGTFMYVNNMQI 475
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL++ +++ + ++Y++I N DW+ +YIH Y +L D PC DV+WFP+
Sbjct: 476 FGHLLNLKSYSIDHLHNDLYQIIDNQPDWEAKYIHENYSINLRDDHEIQMPCSDVYWFPV 535
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
V+E FC V+ ME +GQWS G + D RL+ GYE VPTRDIH+KQ+ L W FL+KY+
Sbjct: 536 VSEIFCKHLVEEMENFGQWSAGGHKDSRLDGGYENVPTRDIHLKQINLEQQWLYFLQKYI 595
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
VP+Q + + G++ + A M+FVVRY P QPSLRPHHD+STYTINIAL + G+D+EGGG
Sbjct: 596 VPIQAKVYPGFYSKG-HAFMNFVVRYHPTGQPSLRPHHDASTYTINIALTRAGIDHEGGG 654
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+R NC+V T +GW LMHPGRLTHYHEGL T+GTRYIM+SF+DP
Sbjct: 655 CRFLRQNCSVVNTMLGWSLMHPGRLTHYHEGLPTTKGTRYIMVSFIDP 702
>gi|348571365|ref|XP_003471466.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
[Cavia porcellus]
Length = 727
Score = 649 bits (1675), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/708 (44%), Positives = 460/708 (64%), Gaps = 9/708 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ E++G++RF +SA+ +V+ LGL + W + GGG KV LLK
Sbjct: 25 EDNLLVLTVATKESEGFRRFKRSAQFFNYKVQALGLGEDWDVERGTMTGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYPA
Sbjct: 85 LEKHADKEDLVILFTDSYDVVFASGPRELLKKFRQARSRVVFSAEDLIYPDRRLEAKYPA 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGF+GYA + +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSEGKRFLGSGGFVGYAPSLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINISLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C ++ L ++ P+VL+ VFI++PT FL F ++ L YP ++ +F++
Sbjct: 264 ETGCTVCEEGLRSLKGMEDRALPTVLVGVFIEQPTPFLSLFFQRLLRLRYPRSQMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H + ++ +++VK + + S ARNL + +YF +D+
Sbjct: 324 NHEQHHKAQVEQFLAEHGGEYQSVKLVGPEVRMESANARNLGADLCRQDHTCTYYFSMDA 383
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L PD L+ L+ +N+++IAPLL R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 DVALTEPDSLRLLIEQNKNVIAPLLTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++ +T ++ +D DMAFC N+R + + + + +
Sbjct: 443 -VGVWNVPYISNIYLIKGSSLRTELQRTDLFHHRKLDPDMAFCANIRQQEVFMFLTNRHS 501
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL+ +++ + +++E+ NP DW +YIH Y ++L V PCPDV+WFPI
Sbjct: 502 FGHLLSLDSYQTTHLHNDLWEIFSNPEDWKEKYIHENYTEALAGKLVET-PCPDVYWFPI 560
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E VQ ME +GQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 561 FTETACDELVQEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHDSST+T+NIALN+VG DYEGGG
Sbjct: 621 APVTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDSSTFTVNIALNRVGEDYEGGG 679
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWALMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 727
>gi|449268427|gb|EMC79291.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Columba livia]
Length = 730
Score = 649 bits (1675), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/709 (44%), Positives = 468/709 (66%), Gaps = 10/709 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKN 89
E+ LV+TVA+ +T+G++RF +SA+ +++ LGL + W GGD GGG KV LLK+
Sbjct: 27 EENLLVLTVATKQTEGFQRFRRSAQFFNYKIQVLGLDEEWQGGDDKKPAGGGQKVRLLKS 86
Query: 90 ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
L + +D+IIL DSYDV+ G ++L++F + +VF AE +PD L KYP
Sbjct: 87 ALKQYADKEDLIILFIDSYDVLFASGPTELLKKFKQAKSKVVFSAENYIYPDRKLEAKYP 146
Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V G R+L SGGFIGYA ++K+L+ K+++ DQL+Y +FLD R I LD
Sbjct: 147 QVRDGKRFLGSGGFIGYAPNLKKLVEEWKGKDDDSDQLFYTNVFLDPEKRESINISLDQR 206
Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
+ +FQNL G+L+++ + F+ + V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 207 SRIFQNLNGALDEVVMKFE-NSRVRARNLLYDTLPVVIHGNGPTKLQLNYLGNYIPQIWT 265
Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
+GCT C+ ++ L K + P +LI +FI++PT FL +F ++ NL+YP ++I +F+
Sbjct: 266 FETGCTVCDEGLRSLTGFKDEALPVILIGIFIEQPTPFLSQFFLRLRNLHYPKQRIQLFI 325
Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
+N++E+H+ D ++ + VK I + V + ARNL ++ D+YF +D
Sbjct: 326 HNHEEHHSMQVDSFVEEHGKEYLAVKVIGPDDEVENAVARNLGMDLCRKDPDCDYYFSLD 385
Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
++ L N + L+ L+ +N+ +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ +
Sbjct: 386 AEIVLKNTETLRILIEQNKMVIAPLVSRHEKLWSNFWGALSPDGYYARSEDYVDIVQRRR 445
Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
G+WNVPYI++ Y++K +++ +T ++ +D DMAFC N+RN+G+ + + +
Sbjct: 446 --VGLWNVPYISSVYMIKAKALRSELDQTDLFHSGKLDADMAFCHNVRNQGVFMYLTNRH 503
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
++GH++ EN+ + +++++ NP DW +YIH Y +L V PCPDV+WFP
Sbjct: 504 QFGHILSLENYQTTHLHNDLWQIFSNPEDWREKYIHENYTAALKGKLVE-VPCPDVYWFP 562
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I T+ C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+G W +FL Y
Sbjct: 563 IFTDTACDELVEEMEHYGQWSTGDNTDSRIQGGYENVPTIDIHMNQIGFEREWYKFLLDY 622
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ P+ E+ + GY+ + + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG+DYEGG
Sbjct: 623 IAPITEKLYPGYYTK-AQFELAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGIDYEGG 681
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GCRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SF+DP
Sbjct: 682 GCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFLDP 730
>gi|218931165|ref|NP_036091.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 2
precursor [Mus musculus]
gi|341941297|sp|Q9R0B9.2|PLOD2_MOUSE RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2;
AltName: Full=Lysyl hydroxylase 2; Short=LH2; Flags:
Precursor
Length = 737
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/710 (44%), Positives = 459/710 (64%), Gaps = 10/710 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
I DK LVITVA+ E DG+ RF+ SA+ VK LG Q W GGD M+S+GGG KV LL
Sbjct: 34 IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K ++ +D++IL T+ +DV+ GG ++L++F + IVF A+ L WPD L DK
Sbjct: 94 KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G RYLNSGGFIGYA I L+ ++++ +DDQL+Y +++D R I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQ L G+ +++ L F+ + + NT Y T PV I+GNG +KI LN FGNY+ S
Sbjct: 214 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
W + +GC C+ + +D D P V + VFI++PT FL FLN + L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
++N + YH ++ K ++K + ++ EARN+ ++ + D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L NP LK+L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKFLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
+ GIWNVPY+ N YL++ +++ N + + + +D DMA C N R+ G+ + I +
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMGVFMYISNR 509
Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
E+G L+ + N++ N + +++ NP+DW +YI+ +Y K + + + QPCPDVFWF
Sbjct: 510 HEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWF 568
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
PI +E+ C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQ+GL VW F+R+
Sbjct: 569 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQIGLENVWLHFIRE 628
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 629 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 687
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 688 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737
>gi|363737318|ref|XP_422695.3| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
[Gallus gallus]
Length = 881
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/707 (43%), Positives = 459/707 (64%), Gaps = 10/707 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
D LV TVA+ ETDG+ RF+Q+A+ VK LG + W GG+++ S+GGG KV LLK
Sbjct: 181 DNLLVFTVATKETDGFHRFMQTAKHFNYTVKVLGKGEEWKGGELANSIGGGQKVRLLKEG 240
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
+ +D+I++ + YDVI GG ++L++F + +VF A+ L WPD L DKYP
Sbjct: 241 IQSYADQEDLIVMFVECYDVIFAGGPEELLKKFQETNHKVVFAADGLIWPDKRLADKYPV 300
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V SG R+LNSGGFIGYA I ++ ++++ +DDQL+Y +++D R + I LD
Sbjct: 301 VRSGKRFLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYVDPLARERLNITLDHKC 360
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
+FQ L G+++++ LNF+ + V N+ Y T P+ + GNG +KI LN GNY+ +W +
Sbjct: 361 AIFQTLNGAVDEVHLNFEEGK-VRARNSVYETLPITVLGNGPTKIYLNYLGNYIPNAWTR 419
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GC C+L LD ++P V I VFI++PT FL +FL+++ L+YP + +S+F++N
Sbjct: 420 ETGCNICDL-DMLDLSTVTEYPRVKIGVFIEQPTPFLPKFLDRLLTLDYPKEALSVFIHN 478
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
N+ YH + K + +N+K + ++ EARN+ ++ + ++YF +D+D
Sbjct: 479 NEVYHEKHIKKFWEKAKNIIRNIKIVGPEENLSQAEARNMGMDLCRQDEACEYYFSIDAD 538
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++
Sbjct: 539 VVLTNPKTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYIDIVQGNR-- 596
Query: 449 KGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
G+WN+PY+ N YL+K +++ K + + +D DMA C N R G+ + I + E+
Sbjct: 597 VGVWNIPYMANIYLIKGQTLRSEMKEKNYFMRDKLDPDMALCRNAREMGVFMYITNRHEF 656
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
G L+ + N++ N +++++ NP+DW YI+P Y K + D + QPCPDVFWFPI
Sbjct: 657 GRLISTANYNTSHYNNDLWQIFENPVDWKETYINPNYSK-IFTDNIVEQPCPDVFWFPIF 715
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
++ C E V+ ME +GQWS G + D R+ GYE VPT DIHMKQ+GL W F+R+++
Sbjct: 716 SDTACDELVEEMEHFGQWSGGKHQDSRISGGYENVPTDDIHMKQIGLDNEWLHFIREFIA 775
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
P+ + F GY+ + A ++FVV+Y PD Q SLRPHHDSST+TINIALN+VG D++GGGC
Sbjct: 776 PVTLKVFAGYYTKGY-ALLNFVVKYSPDRQRSLRPHHDSSTFTINIALNKVGEDFQGGGC 834
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+F+RYNC++ + R GW MHPGRLTH HEGL + GTRYI +SF+DP
Sbjct: 835 KFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPILNGTRYIAVSFIDP 881
>gi|348568796|ref|XP_003470184.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
5-dioxygenase 3-like [Cavia porcellus]
Length = 737
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/711 (44%), Positives = 464/711 (65%), Gaps = 11/711 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY+RF+QSAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 33 VNPEKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 92
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ G +++L++F + ++F AE CWPD L ++
Sbjct: 93 KKEMEKYADQEDMIIMFVDSYDVILAGSPSELLKKFVQSGSRLLFSAEGFCWPDWGLAEQ 152
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+ +DDQL+Y L+LD +R K + LD
Sbjct: 153 YPEVGTGKRFLNSGGFIGFAPTIHQIVHQWKYKDNDDDQLFYTRLYLDPGVREKFSLNLD 212
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 213 HKSRIFQNLNGALDEVVLKFDQNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 271
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFP-SVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + +L Q P F+++PT FL L + L A+++S+
Sbjct: 272 WTPQGGCGFCN--RDRRTLPGGQLPPGCCWPCFVEQPTPFLPCVLAALLLLRLTARQVSL 329
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F+++++ YH P D + F +VK + ++ EAR++A++ +FYF
Sbjct: 330 FLHDSEVYHEPHIADAWPQLQDHFASVKLLGPEEALSPGEARDMAMDICRQDPECEFYFS 389
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L N L+ L+ +N +IAP+L R K WSNFWGAL+ + +YARS DY+ ++
Sbjct: 390 LDADAVLTNQQTLRILIEQNRKVIAPMLSRHGKLWSNFWGALSPEEYYARSEDYVELVQR 449
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI Y+++ ++ + + +++ + MD DMAFC NLR++GI L + +
Sbjct: 450 KR--LGVWNVPYIAQAYVIRGETLRTELSQREVFSGSDMDPDMAFCMNLRDRGIFLHLSN 507
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D +P+++++ NP+DW +YIH Y ++L + + QPCPDV+W
Sbjct: 508 QHEFGRLLATSRYDTDHLHPDLWQIFNNPVDWKEQYIHENYSRALHGEGLVEQPCPDVYW 567
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 568 FPLLSEQMCDELVEEMENYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 627
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+D
Sbjct: 628 TYVGPMTEHLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDMR 686
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+RY+C +++ R GW L+HPGRLTHYHEGL T GTRYIM+SFVDP
Sbjct: 687 XAAAALLRYDCIISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 737
>gi|148688975|gb|EDL20922.1| procollagen lysine, 2-oxoglutarate 5-dioxygenase 2, isoform CRA_a
[Mus musculus]
Length = 737
Score = 647 bits (1668), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/710 (44%), Positives = 458/710 (64%), Gaps = 10/710 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
I DK LVITVA+ E DG+ RF+ SA+ VK LG Q W GGD M+S+GGG KV LL
Sbjct: 34 IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K ++ +D++IL T+ +DV+ GG ++L++F + IVF A+ L WPD L DK
Sbjct: 94 KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G RYLNSGGFIGYA I L+ ++++ +DDQL+Y +++D R I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQ L G+ +++ L F+ + + NT Y T PV I+GNG +KI LN FGNY+ S
Sbjct: 214 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
W + +GC C+ + +D D P V + VFI++PT FL FLN + L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
++N + YH ++ K ++K + ++ EARN+ ++ + D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
+ GIWNVPY+ N YL++ +++ N + + + +D DMA C N R+ G+ + I +
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMGVFMYISNR 509
Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
E+G L+ + N++ N + +++ NP+DW +YI+ +Y K + + + QPCPDVFWF
Sbjct: 510 HEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWF 568
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
PI +E+ C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQ+GL VW F+R+
Sbjct: 569 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQIGLENVWLHFIRE 628
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 629 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 687
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 688 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737
>gi|126338250|ref|XP_001371794.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
[Monodelphis domestica]
Length = 758
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/755 (42%), Positives = 470/755 (62%), Gaps = 37/755 (4%)
Query: 10 LILSCVVFFI----SVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLG 65
L+L+ + F+ + K I DK LV+TVA+ ETDGY RF+QSA+ VK LG
Sbjct: 11 LLLALSLHFVKACAAAEAQKPSIIPTDKLLVLTVATQETDGYHRFMQSAKYFNYTVKVLG 70
Query: 66 LHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFN 124
+ W GGD + ++GGG KV LLK + +D+I+ T YDVI GG ++L++F
Sbjct: 71 KGEEWKGGDKANTIGGGQKVRLLKEAMGSYADQEDLIVFFTQCYDVIFAGGPEELLKKFQ 130
Query: 125 TFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEED 184
+ +VF A+ + WPD L DKYP V G R+LNSGGF+GYA I ++ ++++ +D
Sbjct: 131 KINHKVVFSADGILWPDKKLADKYPIVHIGKRFLNSGGFVGYAPYINHIVQQWNLQDNDD 190
Query: 185 DQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNP 244
DQL+Y +++D R I LD +FQ L G+++++ L F+ + NT Y T P
Sbjct: 191 DQLFYTKIYIDPLKREALNITLDHKCRIFQALNGAIDEVLLKFENGK-ARAKNTFYETLP 249
Query: 245 VIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKP 302
VII+GNG +KI+LN FGNY+ +W + +GCT C+L + L +L + +P V I VFI++P
Sbjct: 250 VIINGNGPTKIQLNYFGNYVPNAWTQENGCTLCDLDVIDLSTL--EDYPRVTIGVFIEQP 307
Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
T FL FL + L+YP + I +F++N + YH ++ K + KN+K + ++
Sbjct: 308 TPFLPRFLELLLTLDYPKEAIKLFIHNKEVYHEKHIKEFWEKAKDVIKNIKIVGPEENLS 367
Query: 363 SKEARNLAVENSLHKG-VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
EARN+ ++ G D+YF +D+D L NP LK L+ +N +IAPL+ R K WSN
Sbjct: 368 QAEARNMGMDLCRQDGQCDYYFSLDADVVLTNPKTLKILIEQNRKIIAPLVTRHGKLWSN 427
Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLN 480
FWGAL+ DG+YARS DY++I+ G + G+WNVPY+ N YL+K +++ N + + +
Sbjct: 428 FWGALSPDGYYARSEDYVDIVQGSR--VGVWNVPYMANVYLIKGQTLRSEMNERNYFVRD 485
Query: 481 SMDYDMAFCTNLRN---------------------KGIHLKIDSTQEYGHLVDSENFDPQ 519
+D DMA C N R KG+ + I + E+G L+ + N++
Sbjct: 486 KLDPDMALCRNAREMTIQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTS 545
Query: 520 KTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIM 579
N +++++ NP+DW RYI+ Y K + + + QPCPDVFWFPI +EK C E V+ M
Sbjct: 546 HYNNDLWQIFENPVDWKERYINHNYSK-IFTENLVEQPCPDVFWFPIFSEKACDELVEEM 604
Query: 580 EAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHH 639
E YGQWS G ++D R+ GYE VPT DIHMKQ+GL W F+R+++ P+ + F GY+
Sbjct: 605 EHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIGLENEWLHFIREFIAPVTLKVFAGYYT 664
Query: 640 EPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTAT 699
+ A ++FVV+Y PD Q SLRPHHDSST+TINIALN VG D++GGGC+F+RYNC++ +
Sbjct: 665 KGF-ALLNFVVKYSPDRQRSLRPHHDSSTFTINIALNNVGEDFQGGGCKFLRYNCSIESP 723
Query: 700 RMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
R GW MHPGRLTH HEGL + GTRYI +SF+DP
Sbjct: 724 RKGWSFMHPGRLTHLHEGLPIINGTRYIAVSFIDP 758
>gi|148230120|ref|NP_001088279.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 1 precursor
[Xenopus laevis]
gi|54038674|gb|AAH84287.1| LOC495112 protein [Xenopus laevis]
Length = 725
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/711 (45%), Positives = 463/711 (65%), Gaps = 16/711 (2%)
Query: 28 NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLL 87
N D+DK LV+TVA+ ETDG +RF +SA +VK LGL WLG G KV L+
Sbjct: 27 NPDDDKLLVLTVATEETDGLRRFQRSAHSFNYKVKVLGLGGQWLGE-------GQKVQLM 79
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K L+ +D+IIL T+SYDVI G ++L++F + +VF AE + +PD L K
Sbjct: 80 KLALEPYADKEDLIILFTESYDVIFAAGPGELLKKFRQAKSKVVFSAESVAYPDRHLESK 139
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G R+L SGGFIGYA + +++++ +++ DQL+Y LFLD R K I LD
Sbjct: 140 YPVVPEGKRFLGSGGFIGYAAYLYKMVADWDGTDKDSDQLFYTKLFLDPVKRGKVNITLD 199
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQNLYGS ED+ L F+ V N Y+T PV+IHGNG +K+ LN GNY+
Sbjct: 200 HRCRIFQNLYGSAEDVVLKFEHGR-VRARNLVYDTLPVLIHGNGPTKLHLNYLGNYIPHV 258
Query: 268 WK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W SGC C+ +++L+SL D P V+I +FI++PT F+ EF ++ NLNYP +I +
Sbjct: 259 WTFESGCNVCDEGLRNLESLSVDTLPLVVIGIFIEQPTPFVSEFFKRLNNLNYPKNRIQL 318
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFY 384
++ N++ +H ++++ + T + VK + + +ARN ++ ++YF
Sbjct: 319 YISNHEPHHQRRVENFLQDHGTQYNFVKTVGPEENSDFADARNKGMDMCRQTPECEYYFS 378
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+ L NP+VL+ L+ +N+S+IAPL+ R WSNFWGAL++DG+YARS DY++I+
Sbjct: 379 IDAPVVLKNPNVLRILIEQNKSVIAPLVSRNANLWSNFWGALSSDGYYARSEDYIDIVQR 438
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI++ YL+K S++++ + ++ ++D DM FC N+R +GI + + +
Sbjct: 439 QR--IGVWNVPYISSVYLVKGSILRSKLSQNDMFHSGTLDSDMVFCDNVRQQGIFMFVTN 496
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
QE+GH++ EN+ + +++E+ N DW +YIHP Y ++L V PCPDV+W
Sbjct: 497 RQEFGHILSLENYKTTHLHNDLWEIFENTEDWKEKYIHPNYSEALKGKLVE-MPCPDVYW 555
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+ TE C+E V+ ME +G+WS G N D+RL+ GYE VPT DIHM Q+ W + L
Sbjct: 556 FPLFTETTCNEIVEEMENFGKWSGGGNKDERLQGGYENVPTIDIHMNQIDYEKEWHKILL 615
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
++ PL ++ F GY+ + ++FVVRY+PDEQP L PHHD+ST+TINIALN VG DYE
Sbjct: 616 DFIAPLTQKMFPGYYTS-AQFDLAFVVRYKPDEQPLLEPHHDASTFTINIALNSVGQDYE 674
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RYNC+V A R GW LMHPGRLTHYHEGL+VT+GTRYI +SFVDP
Sbjct: 675 GGGCRFLRYNCSVRAPRKGWALMHPGRLTHYHEGLRVTKGTRYIAVSFVDP 725
>gi|5852295|gb|AAD53987.1|AF080572_1 lysyl hydroxylase isoform 2 [Mus musculus]
Length = 737
Score = 646 bits (1666), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/710 (44%), Positives = 458/710 (64%), Gaps = 10/710 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
I DK LVITVA+ E DG+ RF+ SA+ VK LG Q W GGD M+S+GGG KV LL
Sbjct: 34 IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K ++ +D++IL T+ +DV+ GG ++L++F + IVF A+ L WPD L DK
Sbjct: 94 KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G RYLNSGGFIGYA I L+ ++++ +DDQL+Y +++D R I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQ L G+ +++ L F+ + + NT Y T PV I+GNG +KI LN FGNY+ S
Sbjct: 214 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
W + +GC C+ + +D D P V + VFI++PT FL FLN + L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
++N + YH ++ K ++K + ++ EARN+ ++ + D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
+ GIWNVPY+ N YL++ +++ N + + + +D DMA C N R+ G+ + I +
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMGVFMYISNR 509
Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
E+G L+ + N++ N + +++ NP+DW +YI+ +Y K + + + QPCPDVFWF
Sbjct: 510 HEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWF 568
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
PI +E+ C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQ+GL VW F+R+
Sbjct: 569 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQIGLENVWLHFIRE 628
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 629 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 687
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 688 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737
>gi|224079514|ref|XP_002194070.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
[Taeniopygia guttata]
Length = 730
Score = 646 bits (1666), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/709 (44%), Positives = 465/709 (65%), Gaps = 10/709 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKN 89
E+ LV+TVA+ +T+G++RF +SA+ +V+ LGL + W GGD GGG KV LLK+
Sbjct: 27 EENLLVLTVATKQTEGFQRFRRSAQFFNYKVQVLGLDEEWQGGDDQQPAGGGQKVRLLKS 86
Query: 90 ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
L + +D+IIL +SYDV+ G ++L++F + +VF AE +PD + KYP
Sbjct: 87 ALQQYVDKEDLIILFVESYDVLFASGPTELLKKFKQAKSKVVFSAENYIYPDRKVEAKYP 146
Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V G R+L SGGFIGYA +K+L+ ++++ DQL+Y +FLD R I LD
Sbjct: 147 QVRDGKRFLGSGGFIGYAPYLKKLVEEWKGQDDDSDQLFYTNIFLDPEKRESINISLDHR 206
Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
+ +FQNL G+L++I L F+ + V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 207 SRIFQNLNGALDEIVLKFE-NSRVRARNLLYDTLPVVIHGNGPTKLQLNYLGNYIPQIWT 265
Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
+GCT C+ ++ L K + P +LI +FI++PT FL +F ++ NL+YP ++I +F+
Sbjct: 266 FETGCTVCDEGLRSLSGFKDEALPMILIGIFIEQPTPFLSQFFLRLRNLHYPKQRIQLFI 325
Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
+N++E+H D ++ + V+ I + V + EARNL ++ D+YF +D
Sbjct: 326 HNHEEHHLMEVDSFVEEHGREYLTVQVIGPDDEVENAEARNLGMDLCRKDPDCDYYFSLD 385
Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
++ L N + L+ L+ +N+ +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ +
Sbjct: 386 AEVVLKNTETLRILIEQNKLVIAPLVSRHEKLWSNFWGALSPDGYYARSEDYVDIVQRRR 445
Query: 447 GGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
G+WNVPYI++ YL+K +++ ++ +D DMAFC N+RN+G+ + + +
Sbjct: 446 --VGLWNVPYISSVYLVKGKALRSELEQGDLFHSGKLDADMAFCHNIRNQGVFMYLTNQH 503
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
++GH++ EN+ + +++++ NP DW +YIH Y +L V PCPDV+WFP
Sbjct: 504 QFGHILSLENYQTSHLHNDLWQIFSNPEDWREKYIHENYTAALKGKLVE-MPCPDVYWFP 562
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I T+ C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+G W +FL Y
Sbjct: 563 IFTDTACDELVEEMEHYGQWSTGDNTDSRIQGGYENVPTIDIHMNQIGFEREWYKFLLDY 622
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ P+ E+ + GY+ + + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG+DYEGG
Sbjct: 623 IAPITEKLYPGYYTK-TQFELAFVVRYKPDEQPSLVPHHDASTFTINIALNRVGIDYEGG 681
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GCRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SF+DP
Sbjct: 682 GCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFLDP 730
>gi|426218182|ref|XP_004003328.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 1 [Ovis aries]
Length = 739
Score = 645 bits (1665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/720 (43%), Positives = 463/720 (64%), Gaps = 10/720 (1%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD +++
Sbjct: 26 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINT 85
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ +D+++L T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 86 IGGGQKVRLMKEVMEHYANQEDLVVLFTECFDVIFAGGPEEVLKKFQKSNHKVVFAADGI 145
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D
Sbjct: 146 LWPDKRLADKYPIVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 205
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + N Y T PV+I+GNG +KI L
Sbjct: 206 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVMINGNGPTKILL 264
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ +W + +GCT C + +D + D +P+V I VFI++PT FL FLN + L
Sbjct: 265 NYFGNYIPNAWTQDNGCTLCE-VDTIDLSEVDVYPNVTIGVFIEQPTPFLPRFLNTLLTL 323
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP K + F++N + YH + K +K + ++ EARN+ ++
Sbjct: 324 DYPKKALKFFIHNKEVYHEKDIKVFFDKAKHEITTIKIVGPEENLSQAEARNMGMDFCRQ 383
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ ++YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 384 DENCEYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 443
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ GIWNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 444 EDYVDIVQGNR--VGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 501
Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
G+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + +
Sbjct: 502 MGVFMYISNRHEFGRLLSTANYNISHFNNDLWQIFENPVDWKEKYINRDYAK-IFTENIV 560
Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+ GYE VPT DIHMKQ+GL
Sbjct: 561 EQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIGL 620
Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 621 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 679
Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
LN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 680 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 739
>gi|18204027|gb|AAH21352.1| Procollagen lysine, 2-oxoglutarate 5-dioxygenase 2 [Mus musculus]
Length = 737
Score = 645 bits (1663), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/710 (44%), Positives = 457/710 (64%), Gaps = 10/710 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
I DK LVITVA+ E DG+ RF+ SA+ VK LG Q W GGD M+S+GGG KV LL
Sbjct: 34 IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K ++ +D++IL T+ +DV+ GG ++L++F + IVF A+ L WPD L DK
Sbjct: 94 KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G RYLNSGGFIGYA I L+ ++++ +DDQL+Y +++D R I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQ L G+ +++ L F+ + + NT Y T PV I+GNG +KI LN FGNY+ S
Sbjct: 214 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
W + +GC C+ + +D D P V + VFI++PT FL FLN + L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
++N + YH ++ K ++K + ++ EARN+ ++ + D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
+ GIWNVPY+ N YL++ +++ N + + + +D DMA C N R+ G+ + I +
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMGVFMYISNR 509
Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
E+G L+ + N++ N + +++ NP+DW +YI+ +Y K + + + QPCPDVFWF
Sbjct: 510 HEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWF 568
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
PI +E+ C E V+ ME YG+WS G ++D R+ GYE VPT D HMKQ+GL VW F+R+
Sbjct: 569 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDTHMKQIGLENVWLHFIRE 628
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 629 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 687
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 688 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737
>gi|402861310|ref|XP_003895041.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 2 [Papio anubis]
Length = 737
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/721 (43%), Positives = 461/721 (63%), Gaps = 10/721 (1%)
Query: 18 FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MS 76
++ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++
Sbjct: 23 YLGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGIN 82
Query: 77 SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAER 136
S+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+
Sbjct: 83 SIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADG 142
Query: 137 LCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDE 196
+ WPD L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D
Sbjct: 143 ILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDP 202
Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE 256
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI
Sbjct: 203 LKREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKIL 261
Query: 257 LNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIAN 315
LN FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ +
Sbjct: 262 LNYFGNYVPNSWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLT 320
Query: 316 LNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSL 375
L+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 321 LDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCR 380
Query: 376 H-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYAR 434
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YAR
Sbjct: 381 QDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYAR 440
Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLR 493
S DY++I+ G++ G+WNVPY+ N YL+K ++ N + + + +D DMA C N R
Sbjct: 441 SEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAR 498
Query: 494 NKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTV 553
G+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + +
Sbjct: 499 EMGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENI 557
Query: 554 NNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVG 613
QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV
Sbjct: 558 VEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVD 617
Query: 614 LAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINI 673
L VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINI
Sbjct: 618 LENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINI 676
Query: 674 ALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
ALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+D
Sbjct: 677 ALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFID 736
Query: 734 P 734
P
Sbjct: 737 P 737
>gi|17535123|ref|NP_496170.1| Protein LET-268 [Caenorhabditis elegans]
gi|6093732|sp|Q20679.1|PLOD_CAEEL RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase;
AltName: Full=Lethal protein 268; AltName: Full=Lysyl
hydroxylase; Short=LH; Flags: Precursor
gi|3877389|emb|CAA91321.1| Protein LET-268 [Caenorhabditis elegans]
Length = 730
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/718 (43%), Positives = 470/718 (65%), Gaps = 21/718 (2%)
Query: 30 DEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLK 88
D + +V+TVA+ TDG KR ++SA+ + ++ LGL + W GGD GGG K+ +L
Sbjct: 21 DLPELVVVTVATENTDGLKRLLESAKAFDINIEVLGLGEKWNGGDTRIEQGGGQKIRILS 80
Query: 89 NELDEMDITDDMIILVTDSYDVIIDGGVNDILERF--NTFDANIVFGAERLCWPDTSLYD 146
+ +++ D +I+ D+YDV+ + IL +F + + ++FGAE CWPD SL
Sbjct: 81 DWIEKYKDASDTMIMFVDAYDVVFNADSTTILRKFFEHYSEKRLLFGAEPFCWPDQSLAP 140
Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
+YP V G R+LNSG F+GY ++ +++ +S+++++DDQLYY +++LDE LR + + L
Sbjct: 141 EYPIVEFGKRFLNSGLFMGYGPEMHKILKLKSVEDKDDDQLYYTMIYLDEKLRKELNMDL 200
Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
D+++ +FQNL G +ED++L F D N YNT P+I+HGNG SK LN GNYL
Sbjct: 201 DSMSKIFQNLNGVIEDVELQFKEDGTPEAYNAAYNTKPLIVHGNGPSKSHLNYLGNYLGN 260
Query: 267 SWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W + GC C L + + ++ P + +++FI KP F+EE L KIA +YP +KI++
Sbjct: 261 RWNSQLGCRTCGL----EVKESEEVPLIALNLFISKPIPFIEEVLQKIAEFDYPKEKIAL 316
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
++YNNQ + D++ + + I + + +EARN A+E + + V+F F +
Sbjct: 317 YIYNNQPFSIKNIQDFLQKHGKSYYTKRVINGVTEIGDREARNEAIEWNKARNVEFAFLM 376
Query: 386 DSDSHLDNPDVLKYLVNRNES----LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
D D++ P V+K L+ +++ +IAP++ +P K ++NFWGA+ A+G+YARS DYM I
Sbjct: 377 DGDAYFSEPKVIKDLIQYSKTYDVGIIAPMIGQPGKLFTNFWGAIAANGYYARSEDYMAI 436
Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN-SMDYDMAFCTNLRNKGIHLK 500
+ G++ G WNVP+IT+ L ++A +K Y+ N ++D DM+ C R+ G L
Sbjct: 437 VKGNR--VGYWNVPFITSAVLFNKEKLEA--MKDAYSYNKNLDPDMSMCKFARDNGHFLY 492
Query: 501 IDSTQEYGHLVDSENFDPQKT----NPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQ 556
ID+ + YG L+ S+ + T +PE++++ N W+ RYIHP Y K + P+ V +Q
Sbjct: 493 IDNEKYYGFLIVSDEYAETVTEGKWHPEMWQIFENRELWEARYIHPGYHKIMEPEHVVDQ 552
Query: 557 PCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAG 616
CPDV+ FP+++E+FC E ++ ME +G+WSDG+NNDKRL GYE VPTRDIHM QVG
Sbjct: 553 ACPDVYDFPLMSERFCEELIEEMEGFGRWSDGSNNDKRLAGGYENVPTRDIHMNQVGFER 612
Query: 617 VWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALN 676
W F+ YV P+QE+ FIGY+H+PV + M FVVRY+P+EQPSLRPHHD+ST++I+IALN
Sbjct: 613 QWLYFMDTYVRPVQEKTFIGYYHQPVESNMMFVVRYKPEEQPSLRPHHDASTFSIDIALN 672
Query: 677 QVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+ G DYEGGG R+IRYNC V A +G+ +M PGRLTH HEGL T+GTRYIM+SF++P
Sbjct: 673 KKGRDYEGGGVRYIRYNCTVPADEVGYAMMFPGRLTHLHEGLATTKGTRYIMVSFINP 730
>gi|380813822|gb|AFE78785.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 2
precursor [Macaca mulatta]
Length = 737
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/721 (43%), Positives = 461/721 (63%), Gaps = 10/721 (1%)
Query: 18 FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MS 76
++ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++
Sbjct: 23 YLGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGIN 82
Query: 77 SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAER 136
S+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+
Sbjct: 83 SIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADG 142
Query: 137 LCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDE 196
+ WPD L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D
Sbjct: 143 ILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDP 202
Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE 256
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI
Sbjct: 203 LKREAINITLDHKCKVFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKIL 261
Query: 257 LNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIAN 315
LN FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ +
Sbjct: 262 LNYFGNYVPNSWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLT 320
Query: 316 LNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSL 375
L+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 321 LDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCR 380
Query: 376 H-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYAR 434
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YAR
Sbjct: 381 QDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYAR 440
Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLR 493
S DY++I+ G++ G+WNVPY+ N YL+K ++ N + + + +D DMA C N R
Sbjct: 441 SEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAR 498
Query: 494 NKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTV 553
G+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + +
Sbjct: 499 EMGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENI 557
Query: 554 NNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVG 613
QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV
Sbjct: 558 VEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVD 617
Query: 614 LAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINI 673
L VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINI
Sbjct: 618 LENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINI 676
Query: 674 ALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
ALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+D
Sbjct: 677 ALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFID 736
Query: 734 P 734
P
Sbjct: 737 P 737
>gi|297288059|ref|XP_002808395.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
5-dioxygenase 3-like [Macaca mulatta]
Length = 741
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/716 (43%), Positives = 459/716 (64%), Gaps = 16/716 (2%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LV+TVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 32 VNPEKLLVMTVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 91
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +D II+ DSYDV++ G +++L++F + ++F AE CWP+ L ++
Sbjct: 92 KKEMEKYADREDTIIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 151
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 152 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 211
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 212 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 270
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P V ++VF+++PT FL FL ++ L+YP ++++
Sbjct: 271 WTPEGGCGFCNRDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 328
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAV------ENSLHKGV 379
F++NN+ +H P D + F VK + ++ EAR++A+ E + G
Sbjct: 329 FLHNNEVFHEPHIADSWPQLQDHFAVVKLVGPEEALSPGEARDMAMXAGACGEEGVGXGC 388
Query: 380 DFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYM 439
L +P S L R K WS FWGAL+ D +YARS DY+
Sbjct: 389 RVGVAATXCLALTSPVTRPAPPEPPHSXXXXXLSRHGKLWSXFWGALSPDEYYARSEDYV 448
Query: 440 NIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIH 498
++ + G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI
Sbjct: 449 ELVQRKR--VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIF 506
Query: 499 LKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPC 558
L + + E+G L+ + +D + +P+++++ NP+DW +YIH Y ++L + + QPC
Sbjct: 507 LHLSNQHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPC 566
Query: 559 PDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW 618
PDV+WFP+++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG W
Sbjct: 567 PDVYWFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQW 626
Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
+ LR YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN
Sbjct: 627 LQLLRTYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHK 685
Query: 679 GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
G+DYEGGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 686 GLDYEGGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741
>gi|2138314|gb|AAB58363.1| lysyl hydroxylase isoform 2 [Homo sapiens]
Length = 737
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/720 (43%), Positives = 462/720 (64%), Gaps = 10/720 (1%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA + ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKVYIDPL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499
Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
G+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + +
Sbjct: 500 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 558
Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV L
Sbjct: 559 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 618
Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
VW +F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 619 ENVWLDFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 677
Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
LN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 678 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737
>gi|397512440|ref|XP_003826553.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 1 [Pan paniscus]
Length = 737
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/720 (43%), Positives = 461/720 (64%), Gaps = 10/720 (1%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLMKEVMERYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA + ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKIFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499
Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
G+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + +
Sbjct: 500 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 558
Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV L
Sbjct: 559 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 618
Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 619 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 677
Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
LN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 678 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737
>gi|403278815|ref|XP_003930980.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 2 [Saimiri boliviensis boliviensis]
Length = 737
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/720 (43%), Positives = 461/720 (64%), Gaps = 10/720 (1%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ + K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGANSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K + DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLMKEIMGHYADQDDLVVMFTECFDVIFAGGPEELLKKFQKANHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTL 321
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 382 DENCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499
Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
G+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + +
Sbjct: 500 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 558
Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV L
Sbjct: 559 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 618
Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 619 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 677
Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
LN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 678 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737
>gi|332818216|ref|XP_516801.3| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Pan
troglodytes]
Length = 737
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/720 (43%), Positives = 461/720 (64%), Gaps = 10/720 (1%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA + ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKIFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499
Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
G+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + +
Sbjct: 500 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 558
Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV L
Sbjct: 559 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 618
Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 619 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 677
Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
LN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 678 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737
>gi|410254494|gb|JAA15214.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [Pan
troglodytes]
Length = 737
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/720 (43%), Positives = 461/720 (64%), Gaps = 10/720 (1%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA + ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKIFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499
Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
G+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + +
Sbjct: 500 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 558
Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV L
Sbjct: 559 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 618
Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 619 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 677
Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
LN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 678 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKSGTRYIAVSFIDP 737
>gi|62089344|dbj|BAD93116.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 isoform b
variant [Homo sapiens]
Length = 796
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/720 (43%), Positives = 461/720 (64%), Gaps = 10/720 (1%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 83 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 142
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 143 IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 202
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA + ++ ++++ +DDQL+Y +++D
Sbjct: 203 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKVYIDPL 262
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 263 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 321
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 322 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 380
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 381 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 440
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 441 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 500
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 501 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 558
Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
G+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + +
Sbjct: 559 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 617
Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV L
Sbjct: 618 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 677
Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 678 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 736
Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
LN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 737 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 796
>gi|62739166|ref|NP_000926.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 2
precursor [Homo sapiens]
gi|62906878|sp|O00469.2|PLOD2_HUMAN RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2;
AltName: Full=Lysyl hydroxylase 2; Short=LH2; Flags:
Precursor
gi|119599347|gb|EAW78941.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2, isoform CRA_c
[Homo sapiens]
gi|261858130|dbj|BAI45587.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [synthetic
construct]
Length = 737
Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/720 (43%), Positives = 461/720 (64%), Gaps = 10/720 (1%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA + ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKVYIDPL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499
Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
G+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + +
Sbjct: 500 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 558
Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV L
Sbjct: 559 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 618
Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 619 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 677
Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
LN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 678 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737
>gi|27924281|gb|AAH45041.1| LOC398437 protein, partial [Xenopus laevis]
Length = 727
Score = 641 bits (1654), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/711 (44%), Positives = 462/711 (64%), Gaps = 16/711 (2%)
Query: 28 NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLL 87
N D+D LV+TVA+ ET+G +RF +SA +VK LGL + WLG G K+ L+
Sbjct: 29 NPDDDNLLVLTVATEETEGLRRFQRSAHSFNYKVKVLGLGEEWLGD-------GQKIQLM 81
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K L+ +D+IIL T+SYDVI G ++L++F + +VF AE + +PD L K
Sbjct: 82 KLALEPYSDKEDLIILFTESYDVIFASGHGELLKKFRQAKSKVVFSAESVAYPDRHLESK 141
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G RYL SG FIGYA + +++++ ++ DQL+Y LFLD R K I LD
Sbjct: 142 YPVVREGKRYLGSGAFIGYAAHLYKMVADWDGTDKSSDQLFYTKLFLDPVKRGKINITLD 201
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQNLYGS ED+ L F+ V N Y+T PV+IHGNG +K+ LN NY+ +
Sbjct: 202 HRCRIFQNLYGSAEDVVLKFEYGR-VRARNLVYDTLPVLIHGNGPTKLHLNYLSNYIPRV 260
Query: 268 WK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W SGC C+ +++LD L D P V+I ++I++PT F+ EF ++ NLNYP +I +
Sbjct: 261 WTFESGCNVCDEGLRNLDGLTVDTLPLVVIGIYIEQPTPFVSEFFKRLNNLNYPKNRIQL 320
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK-GVDFYFY 384
++ N++ +H + ++ + T + VK + + +ARN ++ ++YF
Sbjct: 321 YISNHEPHHQKRVEHFLQDHGTQYNFVKTVGPEENSDFADARNKGMDMCRQTPECEYYFS 380
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+ L N +VL+ L+ +N+S+IAPL+ R WSNFWGALN+DG+YARS DY++++
Sbjct: 381 IDAPVVLKNTNVLRSLIEQNKSVIAPLVSRNANLWSNFWGALNSDGYYARSEDYIDVVQR 440
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI++ YL+K S++++ N ++ ++D DMAFC N+R +G+ + + +
Sbjct: 441 QR--NGVWNVPYISSVYLVKGSILRSKLNQNDLFHSGTLDSDMAFCHNVRQQGVFMFVTN 498
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
QE+GH++ +N+ + +++E+ N DW +YIH + ++L V PCPDV+W
Sbjct: 499 RQEFGHILSLKNYKTTHLHNDLWEIFENTEDWKEKYIHHNHSEALKGKLVE-MPCPDVYW 557
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+ TE C+E V+ ME++G+WS G+N D+RL+ GYE VPT DIHM Q+G W + L
Sbjct: 558 FPVFTETTCNEIVEEMESFGKWSTGSNTDQRLQGGYENVPTIDIHMNQIGYEKEWQKILL 617
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
++ PL ++ F GY+ + ++FVVRY+PDEQP L PHHD+ST+T+NIALN VG DYE
Sbjct: 618 DFIAPLTQKMFPGYY-TMAQFDLAFVVRYKPDEQPLLEPHHDASTFTVNIALNSVGQDYE 676
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RYNC+V A R GW LMHPGRLTHYHEGL+VT+GTRYI++SFVDP
Sbjct: 677 GGGCRFLRYNCSVRALRKGWALMHPGRLTHYHEGLRVTKGTRYIVVSFVDP 727
>gi|301778999|ref|XP_002924916.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like
isoform 2 [Ailuropoda melanoleuca]
Length = 736
Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/714 (43%), Positives = 462/714 (64%), Gaps = 10/714 (1%)
Query: 25 KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYK 83
K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S+GGG K
Sbjct: 29 KPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQK 88
Query: 84 VNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
V L+K ++ +D++IL T+ ++VI GG ++L++F + +VF A+ + WPD
Sbjct: 89 VRLMKEVMEHYANQEDLVILFTECFNVIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKR 148
Query: 144 LYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHK 203
L DKYP V G RYLNSGGFIGYA +I +++ ++++ +DDQL+Y +++D R
Sbjct: 149 LADKYPIVHIGKRYLNSGGFIGYAPNINQIVQQWNLQDNDDDQLFYTKIYIDPLKREAIN 208
Query: 204 IVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNY 263
I LD +FQ L G+++++ L F+ + N Y T PV ++GNG +KI LN FGNY
Sbjct: 209 ITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVAVNGNGPTKILLNYFGNY 267
Query: 264 LAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
+ +W + +GCT C+L +D D P+V I VFI++PT FL FL+ + L+YP +
Sbjct: 268 VPNAWTQDNGCTLCDL-DTIDLSTVDVHPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEA 326
Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDF 381
+ +F++N + YH + K +K + ++ EARN+ ++ + D+
Sbjct: 327 LKLFIHNKEVYHEKDIKVFFDKAKREISTIKIVGPEENLSQAEARNMGMDFCRQDENCDY 386
Query: 382 YFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
YF +D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I
Sbjct: 387 YFSMDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDI 446
Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLK 500
+ G++ GIWNVPY+ N YL+K +++ N + + + +D DMA C N R G+ +
Sbjct: 447 VQGNR--VGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMGVFMY 504
Query: 501 IDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPD 560
I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + + QPCPD
Sbjct: 505 ISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPD 563
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
VFWFPI +EK C E V+ ME YGQWS G ++D R+ GYE VPT DIHMKQ+ L VW
Sbjct: 564 VFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLH 623
Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGV 680
F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG
Sbjct: 624 FIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGE 682
Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 683 DFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 736
>gi|62900635|sp|Q811A3.1|PLOD2_RAT RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2;
AltName: Full=Lysyl hydroxylase 2; Short=LH2; Flags:
Precursor
gi|28400783|emb|CAD23630.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2, short variant
[Rattus norvegicus]
Length = 737
Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/710 (44%), Positives = 456/710 (64%), Gaps = 10/710 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
I DK LVITVA+ E DG+ RF+ SA+ VK LG Q W GGD M+S+GGG KV L+
Sbjct: 34 IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLM 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K ++ DD++IL T+ +DVI GG ++L++F + IVF A+ L WPD L DK
Sbjct: 94 KEAMEHYAGQDDLVILFTECFDVIFAGGPEELLKKFQKTNHKIVFAADALLWPDKRLADK 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G RYLNSGGFIGYA I L+ ++++ +DDQL+Y +++D R I LD
Sbjct: 154 YPGVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREALNITLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQ L G+ +++ L F+ + + NT Y T PV I+GNG SKI LN FGNY+ S
Sbjct: 214 HRCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPSKILLNYFGNYVPNS 272
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
W + +GC C+ D D +P V + VFI++PT F FL+ + L+YP + + +F
Sbjct: 273 WTQENGCALCDFDAS-DLSTVDVYPKVTLGVFIEQPTPFQPRFLDLLLTLDYPKEALRLF 331
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
V+N + YH ++ K ++K + ++ EARN+ ++ + D+YF V
Sbjct: 332 VHNKEVYHEKDIKAFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
+ GIWNVPY+ N YL++ +++ + + + + +D DM+ C N R+ G+ + I +
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMSERNYFVRDKLDPDMSLCRNARDMGVFMYISNR 509
Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + + QPCPDVFWF
Sbjct: 510 HEFGRLISTANYNTSHLNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWF 568
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
PI +E+ C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQ+ L VW F+R+
Sbjct: 569 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIRE 628
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 629 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 687
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 688 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737
>gi|308509144|ref|XP_003116755.1| CRE-LET-268 protein [Caenorhabditis remanei]
gi|308241669|gb|EFO85621.1| CRE-LET-268 protein [Caenorhabditis remanei]
Length = 734
Score = 640 bits (1651), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/736 (42%), Positives = 471/736 (63%), Gaps = 16/736 (2%)
Query: 11 ILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW 70
+L + FF+ D + +V+TVA+ TDG KR ++SA+ ++++ L L + W
Sbjct: 3 VLPLLPFFLIPVILATTITDLPELVVVTVATENTDGLKRLLESAKAFDIKIEVLALGEKW 62
Query: 71 LGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF--NTFD 127
GGD GGG K+ +L +++ D II+ D+YDV+ + IL +F + +
Sbjct: 63 NGGDTRVEQGGGQKIRILSEWIEKYKDASDTIIMFVDAYDVVFNADATTILRKFFEHYSE 122
Query: 128 ANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQL 187
++FGAE CWPD +L YP V G R+LNSG F+GY ++ +++ + +++++DDQL
Sbjct: 123 KRLLFGAEPFCWPDQTLAPDYPIVEFGKRFLNSGLFMGYGPEVYKILKLKPVEDKDDDQL 182
Query: 188 YYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVII 247
YY +++LD+ LR + K+ LD+++ +FQNL G +ED++L F D N YNT P+II
Sbjct: 183 YYTMIYLDDKLRKELKMDLDSMSKIFQNLNGVIEDVELQFKDDGTPEAYNAAYNTKPLII 242
Query: 248 HGNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFL 306
HGNG SK LN GNYL W + GC C + ++ D P + +++FI KP F+
Sbjct: 243 HGNGPSKSHLNYLGNYLGNRWNSELGCRNCGQEEEKETADED-LPLIALNLFISKPIPFI 301
Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
EE L K++ +YP KI++++YNNQ + D++ + + I + + +EA
Sbjct: 302 EEVLQKVSEFDYPKNKIALYIYNNQPFSIKNIQDFLKEHGKSYYTKRVINGVTEIGEREA 361
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNES----LIAPLLVRPFKAWSNF 422
RN A+E + V++ F++D+D++ +P ++K LV+ +E+ +IAP++ +P K ++NF
Sbjct: 362 RNEAIEWDKQRNVEYGFFMDADAYFTDPKIVKDLVHHSETYDVGIIAPMVGQPGKLFTNF 421
Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM 482
WGA+ A+G+YARS DYM I+ G++ G WNVP+IT+ L+ + A Y N +
Sbjct: 422 WGAIAANGYYARSEDYMAIVKGNR--VGYWNVPFITSAVLLNKEKLVAMKDSFSYNKN-L 478
Query: 483 DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKT----NPEVYELIRNPLDWDLR 538
D DM+ C R+ G + ID+ + YG+L+ S+ F T +PE++++ N W+ R
Sbjct: 479 DPDMSMCQFARDHGHFMYIDNEKSYGYLIVSDEFSETVTQGKWHPEMWQIFENRELWEAR 538
Query: 539 YIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETG 598
YIHP Y K + PD + +Q CPDV+ +P+++E+FC E ++ ME +G+WSDG+NNDKRL G
Sbjct: 539 YIHPGYHKIMEPDHIVDQACPDVYDYPLMSERFCAELIEEMEGFGRWSDGSNNDKRLAGG 598
Query: 599 YEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQP 658
YE VPTRDIHM QVG W FL YV P+QE+ FIGY+H+PV + M FVVRY+P+EQ
Sbjct: 599 YENVPTRDIHMNQVGFERQWLYFLDTYVRPVQEKTFIGYYHQPVESNMMFVVRYKPEEQA 658
Query: 659 SLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGL 718
SLRPHHD+ST++I+IALN+ G DYEGGG R+IRYNC V A +G+ +M PGRLTH HEGL
Sbjct: 659 SLRPHHDASTFSIDIALNKKGRDYEGGGVRYIRYNCTVQADEVGYAMMFPGRLTHMHEGL 718
Query: 719 QVTQGTRYIMISFVDP 734
T+GTRYIM+SF++P
Sbjct: 719 ATTKGTRYIMVSFINP 734
>gi|432928329|ref|XP_004081145.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like
[Oryzias latipes]
Length = 737
Score = 640 bits (1651), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 309/713 (43%), Positives = 461/713 (64%), Gaps = 10/713 (1%)
Query: 27 KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVN 85
++I ++K L++TVA+ ETDG+ RF++SA VK LG+ + W GGD+ S+GGG KV
Sbjct: 30 ESIPKEKLLILTVATEETDGFLRFMRSANYFNYTVKVLGMGEKWKGGDVGHSIGGGQKVR 89
Query: 86 LLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLY 145
LLK ++ + +D++IL DSYD+I GG +IL++F + ++F AE L WPD L
Sbjct: 90 LLKKAMEALADQEDLVILSVDSYDLIFAGGPEEILKKFKQANHKVLFAAEGLIWPDKRLT 149
Query: 146 DKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIV 205
DKYP+V SG R+LNSGG IGYA I ++S ++ + +DDQL+Y ++LD R +
Sbjct: 150 DKYPSVRSGKRFLNSGGIIGYAPYINRIVSEWNLHDNDDDQLFYTKIYLDPLKRETLNMT 209
Query: 206 LDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLA 265
LD +FQNL G+++++ L F V + NT Y++ P+++HGNGK+K+ LN GNY+
Sbjct: 210 LDHKCQIFQNLNGAVDEVLLKFGTGR-VRVRNTMYDSLPIVVHGNGKTKMYLNYLGNYVP 268
Query: 266 KSWK-TSGCTRCNLIKHLDS-LKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKI 323
+W +GC+ C+ + L+ ++P+VL+ VFI++PT FL EFL+++ L+YP K+
Sbjct: 269 NAWNYENGCSGCDDDLLDLTQLEVCEYPNVLVGVFIEQPTPFLPEFLHRLLTLDYPKDKL 328
Query: 324 SMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGV-DFY 382
+FV+NN+ YH + K +F + K + ++ EARN+ ++ D+Y
Sbjct: 329 QVFVHNNEVYHEKHIQTFWEESKNVFGSFKVVGPEENLSQGEARNMGMDLCRKDATCDYY 388
Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
F +DSD L N LK L+ +N +I PL+ R K WSNFWGAL+ DG+YARS DY++I+
Sbjct: 389 FSIDSDVMLTNRQTLKLLIEQNRKIIGPLVTRHGKLWSNFWGALSLDGYYARSEDYVDIV 448
Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVI-KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
+ G+WN+PY+ + YL+K +V+ K + + L +D DMA C N R G+ + I
Sbjct: 449 QRKR--VGVWNIPYMAHVYLIKGAVLRKELKERNYFVLEKLDPDMALCRNAREMGVFMFI 506
Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDV 561
+ ++G L+ + +++ N +++++ NP+DW +YIH Y + + + QPCPDV
Sbjct: 507 TNRHDFGRLISTASYNTSHYNNDLWQIFENPVDWKEKYIHQNYTQIFTHNYLE-QPCPDV 565
Query: 562 FWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEF 621
+WFP+++EK C E V+ ME YG WS G + DKR+ GYE VPT DIHMKQ+G W F
Sbjct: 566 YWFPVLSEKACDEIVEEMEHYGSWSGGKHEDKRISGGYETVPTDDIHMKQIGFDKEWLHF 625
Query: 622 LRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVD 681
+R+++ P+ + F GY+ + A M+FVV+Y P+ Q LRPHHDSST+TINIALN G D
Sbjct: 626 IREFISPVTLKVFSGYYTKGY-ALMNFVVKYTPERQAYLRPHHDSSTFTINIALNNKGSD 684
Query: 682 YEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
++GGGCRF RYNC++ + R GW MHPGRLTH HEGL T GTRYI +SF+DP
Sbjct: 685 FQGGGCRFHRYNCSIESPRKGWSFMHPGRLTHLHEGLPTTNGTRYIAVSFIDP 737
>gi|119599346|gb|EAW78940.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2, isoform CRA_b
[Homo sapiens]
Length = 740
Score = 640 bits (1650), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/715 (43%), Positives = 460/715 (64%), Gaps = 10/715 (1%)
Query: 24 NKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGY 82
+ + + DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S+GGG
Sbjct: 32 SSIPTVFADKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQ 91
Query: 83 KVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ + WPD
Sbjct: 92 KVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGILWPDK 151
Query: 143 SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKH 202
L DKYP V G RYLNSGGFIGYA + ++ ++++ +DDQL+Y +++D R
Sbjct: 152 RLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKVYIDPLKREAI 211
Query: 203 KIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGN 262
I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI LN FGN
Sbjct: 212 NITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILLNYFGN 270
Query: 263 YLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAK 321
Y+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L+YP +
Sbjct: 271 YVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTLDYPKE 329
Query: 322 KISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVD 380
+ +F++N + YH + K K +K + ++ EARN+ ++ + D
Sbjct: 330 ALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQDEKCD 389
Query: 381 FYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMN 440
+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++
Sbjct: 390 YYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVD 449
Query: 441 IINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHL 499
I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R G+ +
Sbjct: 450 IVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMGVFM 507
Query: 500 KIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCP 559
I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + + QPCP
Sbjct: 508 YISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCP 566
Query: 560 DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
DVFWFPI +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV L VW
Sbjct: 567 DVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWL 626
Query: 620 EFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG
Sbjct: 627 HFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVG 685
Query: 680 VDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 686 EDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 740
>gi|147898979|ref|NP_001082933.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 2
precursor [Danio rerio]
gi|70779501|gb|AAZ08243.1| procollagen lysine 2-oxoglutarate 5-dioxygenase 2b isoform [Danio
rerio]
Length = 754
Score = 640 bits (1650), Expect = e-180, Method: Compositional matrix adjust.
Identities = 317/751 (42%), Positives = 472/751 (62%), Gaps = 35/751 (4%)
Query: 10 LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
++++CV + + NK +I +K LV+TVA+ ETDG+ RF+QSA VK LG+ +
Sbjct: 13 MLVTCVHCTLGMETNK--DIPTEKLLVLTVATQETDGFLRFMQSANYFNFNVKVLGMGEE 70
Query: 70 WLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
W GGD+ S+GGG KV LLK ++ +D +D+++L DSYD+I GG +IL +F +
Sbjct: 71 WKGGDVGHSIGGGQKVRLLKEAMESLDQQEDLVVLFVDSYDLIFAGGAEEILRKFQQSNH 130
Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
+VF AE + WPD+ L +KYP+V SG R+LNSGG IGYA I++L+S + + +DDQL+
Sbjct: 131 KVVFAAEGIIWPDSQLAEKYPSVRSGKRFLNSGGIIGYAPYIQKLVSQWDLHDNDDDQLF 190
Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIH 248
Y +++D R K + LD +FQNL G+L+++ L F E V + NT YN+ P +IH
Sbjct: 191 YTKIYVDPIQREKLNMTLDHKCEIFQNLNGALDEVLLKFG-TERVRVRNTIYNSLPAVIH 249
Query: 249 GNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFL 306
GN +K+ N NY+ +W GCT C+ + L LK +FP V + V+I++PT FL
Sbjct: 250 GNVNTKVYFNYLANYIPNAWNYERGCTICDQDMVDLSQLK--EFPQVTVGVYIEQPTPFL 307
Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
EFL ++ +L+YP K+++F++N++ YH + K +F + K + + EA
Sbjct: 308 PEFLERLLSLDYPKDKLNIFIHNSEVYHEKHIQKFWEENKDVFGSFKAVGPEENLTQGEA 367
Query: 367 RNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGA 425
RN+ ++ D++F +D+D L N LK L+ +N +IAPL+ R K WSNFWGA
Sbjct: 368 RNMGMDVCRRDPSCDYFFNIDADVMLTNRQTLKLLIEQNRKIIAPLVTRHGKLWSNFWGA 427
Query: 426 LNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDY 484
L+ DG+YARS DY++I+ G + G+WN+P++ + YL+K ++ + ++ L +D
Sbjct: 428 LSLDGYYARSEDYIDIVQGKR--VGVWNIPFLAHVYLIKGQTLRNELKERNVFVLEKLDP 485
Query: 485 DMAFCTNLRN---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNP 523
DMA C N R+ KG+ + + + E+G L+ + N++ N
Sbjct: 486 DMAMCRNARDLTVHRERESPSPESFHMLRSPKGLFMYLTNRHEFGRLISTANYNTSHYNN 545
Query: 524 EVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYG 583
+++++ NPLDW +YIH Y + + + + QPCPDVFWFP+++EK C+E V+ ME +G
Sbjct: 546 DLWQIFENPLDWREKYIHANYTR-IFTENLLEQPCPDVFWFPVLSEKACNELVEEMENHG 604
Query: 584 QWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVR 643
WS G + DKR+ GYE+VPT DIHMKQ+ W F+R+++ P+ + F GY+ +
Sbjct: 605 TWSGGKHEDKRITGGYESVPTDDIHMKQINYDQEWLHFIREFISPVTLKVFSGYYTKGY- 663
Query: 644 APMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
A M+FVV+Y PD Q LRPHHDSST+TINIALN G+D+ GGGCRF RYNC++ + R GW
Sbjct: 664 AIMNFVVKYTPDRQAYLRPHHDSSTFTINIALNNKGLDFLGGGCRFHRYNCSIESPRKGW 723
Query: 704 MLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
MHPGRLTH HEGL VT GTRYI +SFVDP
Sbjct: 724 SFMHPGRLTHLHEGLPVTNGTRYIAVSFVDP 754
>gi|348503412|ref|XP_003439258.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
[Oreochromis niloticus]
Length = 756
Score = 639 bits (1649), Expect = e-180, Method: Compositional matrix adjust.
Identities = 319/755 (42%), Positives = 465/755 (61%), Gaps = 36/755 (4%)
Query: 10 LILSC-VVFFISVHCNKVKN----IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTL 64
L+ SC + F SV + V I ++K LV+TVA+ ETDG+ RF++SA+ VK L
Sbjct: 8 LVFSCWIKFAASVLSSDVPQAPVPIPKEKLLVLTVATEETDGFLRFMRSADYFNYTVKVL 67
Query: 65 GLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
G+ + W GGD+ S+GGG KV LLKN ++ + +D+++L DSYD+I GG +IL +F
Sbjct: 68 GMGEAWKGGDVGRSIGGGQKVRLLKNAMEALADQEDLVVLSVDSYDLIFAGGPEEILRKF 127
Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
+ ++F AE L WPD L DKYP V +G RYLNSGG IGYA I ++S ++ + +
Sbjct: 128 QQANHKVLFAAEGLVWPDKQLADKYPLVRTGKRYLNSGGIIGYAPYINRIVSQWNLHDND 187
Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
DDQL+Y ++LD R + LD +FQNL G+++++ L F D V + NT Y++
Sbjct: 188 DDQLFYTKIYLDPLQRESLNMTLDHKCQIFQNLNGAVDEVLLKFGTDR-VRVRNTAYDSL 246
Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKP 302
PV++HGNG +K+ LN NY+ +W GC+ C+ +D + ++P+VL+ VFI++P
Sbjct: 247 PVVVHGNGNTKMYLNYLANYVPNAWNYEHGCSHCD-DDVVDFSQLKEYPNVLVGVFIEQP 305
Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
T FL EF ++ L+YP K+ +FV+NN+ YH + + +F + K + ++
Sbjct: 306 TPFLPEFFQRLLTLDYPKDKLKLFVHNNEVYHEKHIQRFWEENRNVFNSFKVVGPEENLS 365
Query: 363 SKEARNLAVENSLHKGV-DFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
EARN+A++ D+YF +DSD L N LK L+ +N +I PL+ R K WSN
Sbjct: 366 QGEARNMAMDLCRQDATCDYYFSIDSDVMLTNRQTLKLLIEQNRKIIGPLVTRHGKLWSN 425
Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLN 480
FWGAL+ DG+YARS DY++I+ + G+WN+PY+ + YL+K S ++ + + L
Sbjct: 426 FWGALSLDGYYARSEDYVDIVQRKR--VGVWNIPYMAHVYLVKGSALRNELKERNYFVLE 483
Query: 481 SMDYDMAFCTNLRN---------------------KGIHLKIDSTQEYGHLVDSENFDPQ 519
+D DMA C N R KG+ + I + E+G L+ + N++
Sbjct: 484 KLDPDMALCRNAREMTSHREKDSPSPESFHMLRPPKGVFMYITNRHEFGRLISTANYNIS 543
Query: 520 KTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIM 579
N +++++ NP+DW +YIH Y + + + QPCPDVFWFP+ +EK C E V+ M
Sbjct: 544 HYNNDLWQIFENPVDWKEKYIHSNYTR-IFTENYLEQPCPDVFWFPVFSEKACDELVEEM 602
Query: 580 EAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHH 639
E YG WS G + DKR+ GYE VPT DIHMKQ+G W F+R+++ P+ + F GY+
Sbjct: 603 EHYGSWSGGKHQDKRIAGGYETVPTDDIHMKQIGFEKEWLHFIREFISPVTLKVFSGYYT 662
Query: 640 EPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTAT 699
+ A M+FVV+Y P+ Q LRPHHDSST+TINIALN G+D++GGGCRF RYNC + +
Sbjct: 663 KGY-AIMNFVVKYTPERQAYLRPHHDSSTFTINIALNNKGIDFQGGGCRFHRYNCTIESP 721
Query: 700 RMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
R GW MHPGRLTH HEGL T GTRYI +SF+DP
Sbjct: 722 RKGWSFMHPGRLTHLHEGLPTTGGTRYIAVSFIDP 756
>gi|354504767|ref|XP_003514445.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like,
partial [Cricetulus griseus]
Length = 709
Score = 639 bits (1649), Expect = e-180, Method: Compositional matrix adjust.
Identities = 311/711 (43%), Positives = 466/711 (65%), Gaps = 9/711 (1%)
Query: 28 NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLL 87
++ D LV+TVA+ ET+G++RF +SA+ +++ LGL + W S GGG KV LL
Sbjct: 4 SLSTDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWSVDSGPSAGGGQKVRLL 63
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L K
Sbjct: 64 KKALEKHAHKEDLVILFTDSYDVVFASGPRELLKKFQQAKSRVVFSAEELIYPDRRLEAK 123
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I L
Sbjct: 124 YPTVSDGKRFLGSGGFIGYAPNLNKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINISLG 183
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
++FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ +
Sbjct: 184 HSCSIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVIHGNGPTKLQLNYLGNYIPRF 242
Query: 268 WK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W +GCT C+ ++ L + + P+VL+ VFI++PT FL F ++ L YP K++ +
Sbjct: 243 WTFETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFLRLLRLRYPQKRMRL 302
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++N++++H + ++ T +++VK + + + +ARN+ + + +YF
Sbjct: 303 FIHNHEQHHKLEVEKFLAEHGTEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFS 362
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
VD+D L PD L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G
Sbjct: 363 VDADVALTEPDSLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQG 422
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+N YL+K S ++A ++ + +D DM+FC N+R + + + + +
Sbjct: 423 RR--VGVWNVPYISNIYLIKGSALRAELQHVDLFHYSKLDADMSFCANVRQQEVFMFLTN 480
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
+GHL+ +N+ + +++E+ NP DW +YIH Y K+L V PCPDV+W
Sbjct: 481 RHTFGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALEGKLV-EMPCPDVYW 539
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FPI TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL
Sbjct: 540 FPIFTEAACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREWHKFLV 599
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
+Y+ P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VG DYE
Sbjct: 600 EYIAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRVGQDYE 658
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 659 GGGCRFLRYNCSIRAPRKGWALMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 709
>gi|74178654|dbj|BAE34000.1| unnamed protein product [Mus musculus]
Length = 758
Score = 639 bits (1648), Expect = e-180, Method: Compositional matrix adjust.
Identities = 317/731 (43%), Positives = 460/731 (62%), Gaps = 31/731 (4%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
I DK LVITVA+ E DG+ RF+ SA+ VK LG Q W GGD M+S+GGG KV LL
Sbjct: 34 IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K ++ +D++IL T+ +DV+ GG ++L++F + IVF A+ L WPD L DK
Sbjct: 94 KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G RYLNSGGFIGYA I L+ ++++ +DDQL+Y +++D R I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQ L G+ +++ L F+ + + NT Y T PV I+GNG +KI LN FGNY+ S
Sbjct: 214 HKCKIFQALNGATDEVVLKFE-NGISRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
W + +GC C+ + +D D P V + VFI++PT FL FLN + L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
++N + YH ++ K ++K + ++ EARN+ ++ + D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L NP LK+L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKFLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------- 494
+ GIWNVPY+ N YL++ +++ N + + + +D DMA C N R+
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMTLQREKDSP 509
Query: 495 -----------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
KG+ + I + E+G L+ + N++ N + +++ NP+DW +YI+ +
Sbjct: 510 TPETFQMLSPPKGVFMYISNRHEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRD 569
Query: 544 YQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVP 603
Y K + + + QPCPDVFWFPI +E+ C E V+ ME YG+WS G ++D R+ GYE VP
Sbjct: 570 YSK-IFTENIVEQPCPDVFWFPIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVP 628
Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
T DIHMKQ+GL VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPH
Sbjct: 629 TDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPH 687
Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
HD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V G
Sbjct: 688 HDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNG 747
Query: 724 TRYIMISFVDP 734
TRYI +SF+DP
Sbjct: 748 TRYIAVSFIDP 758
>gi|449509755|ref|XP_002186557.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
[Taeniopygia guttata]
Length = 774
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 309/729 (42%), Positives = 462/729 (63%), Gaps = 31/729 (4%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNLLKN 89
+DK LV TVA+ ETDG+ RF+++A+ VK LG + W GG++ +S+GGG KV LLK
Sbjct: 52 KDKLLVFTVATKETDGFHRFMRTAKHFNYTVKVLGKGEEWKGGELPNSIGGGQKVRLLKE 111
Query: 90 ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
++ +D+++L + YDVI GG ++L++F + +VF A+ L WPD L DKYP
Sbjct: 112 GIESYADQEDLVVLFVECYDVIFAGGPEELLKKFQETNHKVVFAADGLIWPDKRLADKYP 171
Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V +G R+LNSGGFIGYA I ++ ++++ +DDQL+Y +++D R I LD
Sbjct: 172 VVQTGKRFLNSGGFIGYAPSINRIVQQWNLQDNDDDQLFYTKIYVDPLAREHINITLDHK 231
Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
+FQ L G+++++ L F+ + N+ Y+T PV IHGNG +KI+LN GNY+ +W
Sbjct: 232 CTIFQTLNGAVDEVLLKFEEGK-ARARNSVYDTLPVTIHGNGPTKIQLNYLGNYIPNAWT 290
Query: 270 -TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC+ C+L LD ++P V I VFI++PT FL +FL+++ L+YP + +S+FV+
Sbjct: 291 WETGCSVCDL-DLLDLSAVKEYPRVKIGVFIEQPTPFLTKFLDRLLTLDYPREALSIFVH 349
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
NN+ YH + K + +N+K + ++ EARN+ ++ K ++Y +D+
Sbjct: 350 NNEVYHEKHIKKFWEKAKNIIRNIKIVGPEENLSQAEARNMGMDLCRQDKTCEYYLSIDA 409
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L NP L+ L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++
Sbjct: 410 DVVLTNPKTLRLLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR- 468
Query: 448 GKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------------ 494
G+WN+PY+ N YL+K +++ K + + +D DMA C N R
Sbjct: 469 -VGVWNIPYMANIYLIKGQTLRSEMKEKNYFMRDKLDPDMALCRNAREMTLQREKDSPSS 527
Query: 495 ---------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQ 545
KG+ + I + E+G L+ + N++ N +++++ NP+DW YI+P Y
Sbjct: 528 ETFHMLRAPKGVFMYITNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKETYINPNYS 587
Query: 546 KSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTR 605
K + D + QPCPDVFWFPI ++ C E V+ ME +GQWS G + D R+ GYE VPT
Sbjct: 588 K-IFTDNIVEQPCPDVFWFPIFSDTACDELVEEMEHFGQWSGGKHQDSRISGGYENVPTD 646
Query: 606 DIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHD 665
DIHMKQ+GL W F+R+++ P+ + F GY+ + A ++FVV+Y PD Q SLRPHHD
Sbjct: 647 DIHMKQIGLDNEWLHFIREFIAPVTLKVFAGYYTKGY-ALLNFVVKYSPDRQRSLRPHHD 705
Query: 666 SSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
SST+TINIALN+VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL + GTR
Sbjct: 706 SSTFTINIALNKVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPILNGTR 765
Query: 726 YIMISFVDP 734
YI +SF+DP
Sbjct: 766 YIAVSFIDP 774
>gi|354490579|ref|XP_003507434.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2,
partial [Cricetulus griseus]
Length = 725
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 312/710 (43%), Positives = 455/710 (64%), Gaps = 10/710 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
I DK LVITVA+ E DG+ RF+ SA+ VK LG Q W GGD ++S+GGG KV L+
Sbjct: 22 IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGINSIGGGQKVRLM 81
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K + + +D++IL T+ +DV+ GG ++L++F + IVF A+ + WPD L +K
Sbjct: 82 KEAMAQYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGILWPDKRLAEK 141
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G RYLNSGGFIGYA I L+ ++++ +DDQL+Y +++D R I LD
Sbjct: 142 YPVVHIGKRYLNSGGFIGYAPYISHLVQEWNLQDNDDDQLFYTKVYIDPVKREAFNITLD 201
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQ L G+ +++ L F+ + + NT Y T PV I+GNG +KI LN FGNY+ S
Sbjct: 202 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 260
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
W + GC C+ +D D P V I VFI++PT FL FLN + +L+YP + + +F
Sbjct: 261 WTQEHGCALCDF-DTIDLSAVDVHPKVTIGVFIEQPTPFLPRFLNLLLSLDYPKEALKLF 319
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
++N + YH + K +K + ++ EARN+ ++ + D+YF V
Sbjct: 320 IHNKEVYHEKDIKVFFDKAKHEISTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 379
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G
Sbjct: 380 DADVVLTNPRTLKNLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGK 439
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
+ GIWNVPY+ N YL++ +++ + + + + +D DMA C N R G+ + I +
Sbjct: 440 R--VGIWNVPYMANVYLIQGKTLRSEMSERNYFVRDKLDPDMALCRNAREMGMFMYISNR 497
Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + +++ QPCPDVFWF
Sbjct: 498 HEFGRLLSTANYNTSHLNNDLWQIFENPVDWKEKYINRDYSK-IFTESIVEQPCPDVFWF 556
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
PI +E+ C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQ+GL VW F+R+
Sbjct: 557 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQIGLENVWLHFIRE 616
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 617 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 675
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 676 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 725
>gi|218931167|ref|NP_001136388.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 1
precursor [Mus musculus]
Length = 758
Score = 637 bits (1644), Expect = e-180, Method: Compositional matrix adjust.
Identities = 317/731 (43%), Positives = 460/731 (62%), Gaps = 31/731 (4%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
I DK LVITVA+ E DG+ RF+ SA+ VK LG Q W GGD M+S+GGG KV LL
Sbjct: 34 IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K ++ +D++IL T+ +DV+ GG ++L++F + IVF A+ L WPD L DK
Sbjct: 94 KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G RYLNSGGFIGYA I L+ ++++ +DDQL+Y +++D R I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQ L G+ +++ L F+ + + NT Y T PV I+GNG +KI LN FGNY+ S
Sbjct: 214 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
W + +GC C+ + +D D P V + VFI++PT FL FLN + L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
++N + YH ++ K ++K + ++ EARN+ ++ + D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L NP LK+L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKFLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------- 494
+ GIWNVPY+ N YL++ +++ N + + + +D DMA C N R+
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMTLQREKDSP 509
Query: 495 -----------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
KG+ + I + E+G L+ + N++ N + +++ NP+DW +YI+ +
Sbjct: 510 TPETFQMLSPPKGVFMYISNRHEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRD 569
Query: 544 YQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVP 603
Y K + + + QPCPDVFWFPI +E+ C E V+ ME YG+WS G ++D R+ GYE VP
Sbjct: 570 YSK-IFTENIVEQPCPDVFWFPIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVP 628
Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
T DIHMKQ+GL VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPH
Sbjct: 629 TDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPH 687
Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
HD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V G
Sbjct: 688 HDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNG 747
Query: 724 TRYIMISFVDP 734
TRYI +SF+DP
Sbjct: 748 TRYIAVSFIDP 758
>gi|355557553|gb|EHH14333.1| hypothetical protein EGK_00241 [Macaca mulatta]
Length = 882
Score = 637 bits (1643), Expect = e-180, Method: Compositional matrix adjust.
Identities = 310/743 (41%), Positives = 467/743 (62%), Gaps = 44/743 (5%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 145 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 204
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 205 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 264
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 265 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 324
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSK---------------- 254
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K
Sbjct: 325 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKYPPGARNTYLGACYEL 383
Query: 255 -------------------IELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSV 293
++LN GNY+ + W +GCT C+ ++ L + + P+V
Sbjct: 384 TISVLTSELSVVPSLPAVLLQLNYLGNYIPRFWTFETGCTVCDEGLRSLKGIGDEALPTV 443
Query: 294 LISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVK 353
L+ +FI++PT F+ F ++ L+YP K + +F++N++++H ++++ + +++VK
Sbjct: 444 LVGMFIEQPTPFVSLFFQRLLQLHYPRKHMRLFIHNHEQHHKAQVEEFLAEHGSEYQSVK 503
Query: 354 YIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL 412
+ + + +ARN+ + + +YF VD+D L P+ L+ L+ +N+++IAPL+
Sbjct: 504 LVGPEVRMANADARNMGADLCRQDRSCTYYFSVDADVALTEPNSLRLLIQQNKNVIAPLM 563
Query: 413 VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT- 471
R + WSNFWGAL+ADG+YARS DY++I+ G + G+WNVPYI+N YL+K S ++
Sbjct: 564 TRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR--IGVWNVPYISNIYLIKGSALRGEL 621
Query: 472 NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRN 531
++ + +D DMAFC N+R + + + + + GHL+ +++ + +++E+ N
Sbjct: 622 QSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTLGHLLSLDSYRTAHLHNDLWEVFSN 681
Query: 532 PLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNN 591
P DW +YIH Y K+L V PCPDV+WFPI TE C E V+ ME +GQWS G N
Sbjct: 682 PEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIFTEAACDELVEEMEHFGQWSLGDNK 740
Query: 592 DKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVR 651
D R++ GYE VPT DIHM Q+G W +FL +Y+ P+ E+ + GY+ + ++FVVR
Sbjct: 741 DSRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYIAPMTEKLYPGYYTR-AQFDLAFVVR 799
Query: 652 YRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRL 711
Y+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGCRF+RYNC++ A R GW LMHPGRL
Sbjct: 800 YKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGCRFLRYNCSIRAPRKGWTLMHPGRL 859
Query: 712 THYHEGLQVTQGTRYIMISFVDP 734
THYHEGL T+GTRYI +SFVDP
Sbjct: 860 THYHEGLPTTRGTRYIAVSFVDP 882
>gi|221120650|ref|XP_002157097.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
[Hydra magnipapillata]
Length = 717
Score = 637 bits (1643), Expect = e-180, Method: Compositional matrix adjust.
Identities = 314/707 (44%), Positives = 458/707 (64%), Gaps = 12/707 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNLLKN 89
E F ++TVA+ +TDG+KRF++SA V L V+ GL++ W GGD+ + GGG K+N+LK
Sbjct: 20 EISFKLVTVATEQTDGFKRFMRSANVFGLDVEVYGLNEKWEGGDLENGPGGGQKINILKE 79
Query: 90 ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
L + ++++++ TDSYDV+I+ G ++IL+RF +A I+ AE CWPD SL KYP
Sbjct: 80 ALRKYKNNENLVLMFTDSYDVVINAGSDEILKRFLKTEAKILISAEDYCWPDKSLAVKYP 139
Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V GY+YL SGG IGYA + E++S + + + +DDQLYY ++L+ R K+ I LD
Sbjct: 140 KVNVGYKYLCSGGIIGYANKVYEVLSAKPVNHTDDDQLYYTQIYLEH--REKYNIKLDNK 197
Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
A LFQNL G+ +D++L FD D HL N ++ T P++IHGNG SK L+ GNYL W
Sbjct: 198 AELFQNLNGNQDDVELRFDGDN--HLWNKRFGTYPIVIHGNGPSKDYLSHLGNYLGDYWT 255
Query: 270 -TSGCTRCNLIKHL-DSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
GC C L ++ +P VLI +FI PT F+ +L I+NL YP +KI +F+
Sbjct: 256 YADGCKSCKENTFLLQDVEVTNWPKVLIGLFIPAPTPFVTSYLEHISNLEYPKEKIDIFI 315
Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
++ +H P +D++ F+T + +V Y + + KE R+LA E+ H D+Y VDS
Sbjct: 316 HSVDPHHDPHVEDWLKRFETKYLSVTYKRPTAFLTEKETRHLAFEHCKHVKCDYYLSVDS 375
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
L N L+ L+ +N + I+P++ +P K +SNFWG + DGFY RS DY++I+ ++
Sbjct: 376 IVTLSNTKTLQMLIEQNRTFISPMISKPGKLFSNFWGKVGQDGFYERSPDYIDIVKYNR- 434
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
+G+WNVP+++N YL+++ +K ++ + +D +M+FC+N R G+ + I + +
Sbjct: 435 -RGVWNVPFVSNVYLIQSDTLKKFK-SNPFSSDELDQEMSFCSNARKLGMFMYITNLDYF 492
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
GH+ + E++ + ++Y++ N +DW+ RY+HPE L P + N PCPDV+WFP+
Sbjct: 493 GHIKEDESYTTHHKHNDLYQIFDNRIDWEDRYLHPEMMSYLNPTSTPNMPCPDVYWFPLT 552
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
T+ F E V+ ME +G+WS G N D R+ GYE VPT DIHM QVGL W + L+ Y+
Sbjct: 553 TKNFTKELVEEMENFGKWSGGGNKDDRISGGYENVPTVDIHMNQVGLEKQWLKILKDYIA 612
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
P+ R F GY+ + RA M+FVV+Y + Q LRPHHDSSTYTIN+ALN V +YEGGG
Sbjct: 613 PMSSRYFTGYNSD-ARAIMNFVVKYTTNGQYYLRPHHDSSTYTINMALNNVN-EYEGGGA 670
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RF RYNC+V T+ GW LMHPGRLTH HEGL + +GTRYIM+SFVDP
Sbjct: 671 RFTRYNCSVAKTKEGWALMHPGRLTHQHEGLPILKGTRYIMVSFVDP 717
>gi|148688976|gb|EDL20923.1| procollagen lysine, 2-oxoglutarate 5-dioxygenase 2, isoform CRA_b
[Mus musculus]
Length = 758
Score = 636 bits (1640), Expect = e-179, Method: Compositional matrix adjust.
Identities = 317/731 (43%), Positives = 459/731 (62%), Gaps = 31/731 (4%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
I DK LVITVA+ E DG+ RF+ SA+ VK LG Q W GGD M+S+GGG KV LL
Sbjct: 34 IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K ++ +D++IL T+ +DV+ GG ++L++F + IVF A+ L WPD L DK
Sbjct: 94 KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G RYLNSGGFIGYA I L+ ++++ +DDQL+Y +++D R I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQ L G+ +++ L F+ + + NT Y T PV I+GNG +KI LN FGNY+ S
Sbjct: 214 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
W + +GC C+ + +D D P V + VFI++PT FL FLN + L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
++N + YH ++ K ++K + ++ EARN+ ++ + D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------- 494
+ GIWNVPY+ N YL++ +++ N + + + +D DMA C N R+
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMTLQREKDSP 509
Query: 495 -----------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
KG+ + I + E+G L+ + N++ N + +++ NP+DW +YI+ +
Sbjct: 510 TPETFQMLSPPKGVFMYISNRHEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRD 569
Query: 544 YQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVP 603
Y K + + + QPCPDVFWFPI +E+ C E V+ ME YG+WS G ++D R+ GYE VP
Sbjct: 570 YSK-IFTENIVEQPCPDVFWFPIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVP 628
Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
T DIHMKQ+GL VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPH
Sbjct: 629 TDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPH 687
Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
HD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V G
Sbjct: 688 HDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNG 747
Query: 724 TRYIMISFVDP 734
TRYI +SF+DP
Sbjct: 748 TRYIAVSFIDP 758
>gi|426218184|ref|XP_004003329.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 2 [Ovis aries]
gi|426218186|ref|XP_004003330.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 3 [Ovis aries]
Length = 760
Score = 636 bits (1640), Expect = e-179, Method: Compositional matrix adjust.
Identities = 315/741 (42%), Positives = 464/741 (62%), Gaps = 31/741 (4%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD +++
Sbjct: 26 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINT 85
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ +D+++L T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 86 IGGGQKVRLMKEVMEHYANQEDLVVLFTECFDVIFAGGPEEVLKKFQKSNHKVVFAADGI 145
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D
Sbjct: 146 LWPDKRLADKYPIVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 205
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + N Y T PV+I+GNG +KI L
Sbjct: 206 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVMINGNGPTKILL 264
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ +W + +GCT C + +D + D +P+V I VFI++PT FL FLN + L
Sbjct: 265 NYFGNYIPNAWTQDNGCTLCE-VDTIDLSEVDVYPNVTIGVFIEQPTPFLPRFLNTLLTL 323
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP K + F++N + YH + K +K + ++ EARN+ ++
Sbjct: 324 DYPKKALKFFIHNKEVYHEKDIKVFFDKAKHEITTIKIVGPEENLSQAEARNMGMDFCRQ 383
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ ++YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 384 DENCEYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 443
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ GIWNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 444 EDYVDIVQGNR--VGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 501
Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
KG+ + I + E+G L+ + N++ N +++++ NP+
Sbjct: 502 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNISHFNNDLWQIFENPV 561
Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D
Sbjct: 562 DWKEKYINRDYAK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDS 620
Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
R+ GYE VPT DIHMKQ+GL VW F+R+++ P+ + F GY+ + A ++FVV+Y
Sbjct: 621 RISGGYENVPTDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 679
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH
Sbjct: 680 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 739
Query: 714 YHEGLQVTQGTRYIMISFVDP 734
HEGL V GTRYI +SF+DP
Sbjct: 740 LHEGLPVKNGTRYIAVSFIDP 760
>gi|6093729|sp|Q63321.1|PLOD1_RAT RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1;
AltName: Full=Lysyl hydroxylase 1; Short=LH1; Flags:
Precursor
gi|409059|gb|AAA41550.1| lysyl hydroxylase [Rattus norvegicus]
gi|1584463|prf||2123247A Lys hydroxylase
Length = 728
Score = 635 bits (1639), Expect = e-179, Method: Compositional matrix adjust.
Identities = 311/709 (43%), Positives = 462/709 (65%), Gaps = 10/709 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGY-KVNLLKN 89
ED LV+TVA+ ET+G++RF +SA+ ++++LGL + W S GG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWSAAGGPSAAGGGQKVRLLKK 84
Query: 90 ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
L + +D++IL DSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 ALKKYADKEDLVILFVDSYDVVFASGPRELLKKFQQAKSRVVFSAEELIYPDRRLEAKYP 144
Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V G R+L SGGFIGYA + +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 TVPDGKRFLGSGGFIGYAPSLSKLVAEWEGQDNDSDQLFYTKIFLDPEKREQINISLDHR 204
Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K+++N GNY+ + W
Sbjct: 205 CRIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVIHGNGPTKLQVNYLGNYIPRFWT 263
Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
+GCT C+ ++ L + + P+VL+ VFI++PT FL F ++ +L YP K++ +F+
Sbjct: 264 FETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFRRLLHLRYPQKQMRLFI 323
Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
+N +++H + ++ +++VK + + + +ARN+ + + +YF VD
Sbjct: 324 HNQEQHHKLQVEQFLAEHGGEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFSVD 383
Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
+D L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 ADVALTEPNSLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR 443
Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
G+WNVPYI+N YL+K S ++A ++ + +D DM+FC N+R + + + + +
Sbjct: 444 --VGVWNVPYISNIYLIKGSALRAELRHVDLFHYSKLDPDMSFCANVRQQEVFMFLTNRH 501
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
+GHL+ +N+ + +++E+ NP DW +YIH Y K+L V PCPDV+WFP
Sbjct: 502 TFGHLLSLDNYQTTHLHNDLWEVFSNPQDWKEKYIHENYTKALAGKLVET-PCPDVYWFP 560
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y
Sbjct: 561 IFTEVACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREWHKFLVEY 620
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ PL E+ + GY+ + + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG DYEGG
Sbjct: 621 IAPLTEKLYPGYYTK-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGEDYEGG 679
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GCRF+RYNC+V A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 GCRFLRYNCSVRAPRKGWALMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 728
>gi|400153797|ref|NP_446279.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Rattus
norvegicus]
gi|149024588|gb|EDL81085.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 1 [Rattus
norvegicus]
Length = 728
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 311/709 (43%), Positives = 462/709 (65%), Gaps = 10/709 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGY-KVNLLKN 89
ED LV+TVA+ ET+G++RF +SA+ ++++LGL + W S GG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWSAAGGPSAAGGGQKVRLLKK 84
Query: 90 ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
L + +D++IL DSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 ALKKYADKEDLVILFVDSYDVVFASGPRELLKKFQQAKSRVVFSAEELIYPDRRLEAKYP 144
Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V G R+L SGGFIGYA + +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 TVPDGKRFLGSGGFIGYAPSLSKLVAEWEGQDNDSDQLFYTKIFLDPEKREQINISLDHR 204
Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K+++N GNY+ + W
Sbjct: 205 CRIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVIHGNGPTKLQVNYLGNYIPRFWT 263
Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
+GCT C+ ++ L + + P+VL+ VFI++PT FL F ++ +L YP K++ +F+
Sbjct: 264 FETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFRRLLHLRYPQKQMRLFI 323
Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
+N +++H + ++ +++VK + + + +ARN+ + + +YF VD
Sbjct: 324 HNQEQHHKLQVEQFLAEHGGEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFSVD 383
Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
+D L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 ADVALTEPNSLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR 443
Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
G+WNVPYI+N YL+K S ++A ++ + +D DM+FC N+R + + + + +
Sbjct: 444 --VGVWNVPYISNIYLIKGSALRAELRHVDLFHYSKLDPDMSFCANVRQQEVFMFLTNRH 501
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
+GHL+ +N+ + +++E+ NP DW +YIH Y K+L V PCPDV+WFP
Sbjct: 502 TFGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWFP 560
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y
Sbjct: 561 IFTEVACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREWHKFLVEY 620
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ PL E+ + GY+ + + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG DYEGG
Sbjct: 621 IAPLTEKLYPGYYTK-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGEDYEGG 679
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GCRF+RYNC+V A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 680 GCRFLRYNCSVRAPRKGWALMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 728
>gi|355559970|gb|EHH16698.1| hypothetical protein EGK_12027, partial [Macaca mulatta]
Length = 744
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 316/742 (42%), Positives = 462/742 (62%), Gaps = 31/742 (4%)
Query: 18 FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MS 76
++ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++
Sbjct: 9 YLGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGIN 68
Query: 77 SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAER 136
S+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+
Sbjct: 69 SIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADG 128
Query: 137 LCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDE 196
+ WPD L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D
Sbjct: 129 ILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDP 188
Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE 256
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI
Sbjct: 189 LKREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKIL 247
Query: 257 LNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIAN 315
LN FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ +
Sbjct: 248 LNYFGNYVPNSWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLT 306
Query: 316 LNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSL 375
L+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 307 LDYPKEALKLFIHNKEVYHEKDIKAFFEKAKHEIKTIKIVGPEENLSQAEARNMGMDFCR 366
Query: 376 H-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYAR 434
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YAR
Sbjct: 367 QDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYAR 426
Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLR 493
S DY++I+ G++ G+WNVPY+ N YL+K ++ N + + + +D DMA C N R
Sbjct: 427 SEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAR 484
Query: 494 N---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNP 532
KG+ + I + E+G L+ + N++ N +++++ NP
Sbjct: 485 EMTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENP 544
Query: 533 LDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNND 592
+DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D
Sbjct: 545 VDWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHD 603
Query: 593 KRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRY 652
R+ GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y
Sbjct: 604 SRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKY 662
Query: 653 RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLT 712
P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLT
Sbjct: 663 SPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLT 722
Query: 713 HYHEGLQVTQGTRYIMISFVDP 734
H HEGL V GTRYI +SF+DP
Sbjct: 723 HLHEGLPVKNGTRYIAVSFIDP 744
>gi|153792754|ref|NP_001093153.1| lysyl hydroxylase 2 precursor [Takifugu rubripes]
gi|146325992|dbj|BAF61138.1| lysyl hydroxylase 2 [Takifugu rubripes]
Length = 756
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 318/756 (42%), Positives = 456/756 (60%), Gaps = 36/756 (4%)
Query: 9 CLILSCVV-----FFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKT 63
C + SC + + I ++K LV+TVA+ ETDG++RF+QSA VK
Sbjct: 7 CFVFSCCLKIAFSLLSTETAQAPAPIPKEKLLVLTVATEETDGFQRFLQSARYFNYSVKV 66
Query: 64 LGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILER 122
LG+ + W GGD+ S+GGG KV LLK ++ + DD+++L DSYD+I GG +IL +
Sbjct: 67 LGMGEAWKGGDVGHSIGGGQKVRLLKEAMEALADQDDLVVLFVDSYDLIFAGGPEEILRK 126
Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNE 182
F + ++F AE L WPD L DKYP V SG RYLNSGGFIGYA I ++S RS+ +
Sbjct: 127 FQQANHKVLFAAEGLIWPDKRLADKYPLVHSGKRYLNSGGFIGYASQINRIVSQRSLHDN 186
Query: 183 EDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNT 242
+DDQL+YA ++LD R + LD +F L G+ +++ L F V + NT +++
Sbjct: 187 DDDQLFYAKIYLDPLQRQTLNMTLDHKCQIFLTLNGAADEVLLKFGTGR-VRVRNTAHDS 245
Query: 243 NPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDK 301
PV++HGN +KI LN GNY+ W GC+ C+ LD + ++FPSVL+ VFI+K
Sbjct: 246 LPVVVHGNRNTKIFLNYLGNYVPNMWNYEHGCSLCDK-DILDLSRLNEFPSVLVGVFIEK 304
Query: 302 PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTV 361
PT FL EF ++ +L+YP ++ +FV+NN+ +H + + F + K + +
Sbjct: 305 PTPFLPEFFQRLLSLDYPKDRLKLFVHNNEVFHEKHIQKFWEEHRNTFSDFKIVGPEENL 364
Query: 362 NSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
+ EARN+ ++ DFYF VDSD L N LK LV +N +I PL+ R K WS
Sbjct: 365 SQGEARNMGMDLCRKDAACDFYFSVDSDVMLTNSQTLKLLVEQNRKIIGPLVTRHGKLWS 424
Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTL 479
WGAL+ DG+YARS DY++I+ + G+WN+PY+ + YL+K S ++ N + + L
Sbjct: 425 YLWGALSPDGYYARSEDYIDIVQRKR--VGVWNIPYMAHVYLVKGSALRNELNERNHFVL 482
Query: 480 NSMDYDMAFCTNLRN---------------------KGIHLKIDSTQEYGHLVDSENFDP 518
+D DMAFC N R KG+ + I ++ E+G L+ + N++
Sbjct: 483 EKLDPDMAFCRNAREMTSQREKDSPSPESFHMLRPPKGVFMYITNSHEFGRLISTANYNI 542
Query: 519 QKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQI 578
N +++++ NP+DW +YIH Y + + + +PCPDVFWFP+ T+K C E V+
Sbjct: 543 SHYNNDLWQIFENPVDWKEKYIHENYTR-IFTENYMEEPCPDVFWFPVFTQKACDEIVEE 601
Query: 579 MEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYH 638
ME YG WS G + DKR+ GYE VPT DIHMKQ+G W F+R+++ P+ + F GY+
Sbjct: 602 MEHYGSWSGGKHEDKRITGGYETVPTDDIHMKQIGFDKEWLHFIREFISPVTLKVFSGYY 661
Query: 639 HEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTA 698
+ A M+FVV+Y P+ Q LRPHHDSST+TINIALN D++GGGCRF YNC++ +
Sbjct: 662 TKGY-AIMNFVVKYTPERQAYLRPHHDSSTFTINIALNNKDTDFQGGGCRFHGYNCSIES 720
Query: 699 TRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
R GW MHP RLTH HEGL T GTRYI +SF+DP
Sbjct: 721 PRKGWSFMHPERLTHLHEGLPTTNGTRYIAVSFIDP 756
>gi|402861308|ref|XP_003895040.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 1 [Papio anubis]
Length = 758
Score = 635 bits (1637), Expect = e-179, Method: Compositional matrix adjust.
Identities = 316/742 (42%), Positives = 462/742 (62%), Gaps = 31/742 (4%)
Query: 18 FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MS 76
++ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++
Sbjct: 23 YLGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGIN 82
Query: 77 SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAER 136
S+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+
Sbjct: 83 SIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADG 142
Query: 137 LCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDE 196
+ WPD L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D
Sbjct: 143 ILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDP 202
Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE 256
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI
Sbjct: 203 LKREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKIL 261
Query: 257 LNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIAN 315
LN FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ +
Sbjct: 262 LNYFGNYVPNSWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLT 320
Query: 316 LNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSL 375
L+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 321 LDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCR 380
Query: 376 H-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYAR 434
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YAR
Sbjct: 381 QDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYAR 440
Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLR 493
S DY++I+ G++ G+WNVPY+ N YL+K ++ N + + + +D DMA C N R
Sbjct: 441 SEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAR 498
Query: 494 N---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNP 532
KG+ + I + E+G L+ + N++ N +++++ NP
Sbjct: 499 EMTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENP 558
Query: 533 LDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNND 592
+DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D
Sbjct: 559 VDWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHD 617
Query: 593 KRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRY 652
R+ GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y
Sbjct: 618 SRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKY 676
Query: 653 RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLT 712
P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLT
Sbjct: 677 SPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLT 736
Query: 713 HYHEGLQVTQGTRYIMISFVDP 734
H HEGL V GTRYI +SF+DP
Sbjct: 737 HLHEGLPVKNGTRYIAVSFIDP 758
>gi|380813820|gb|AFE78784.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 1
precursor [Macaca mulatta]
Length = 758
Score = 634 bits (1636), Expect = e-179, Method: Compositional matrix adjust.
Identities = 316/742 (42%), Positives = 462/742 (62%), Gaps = 31/742 (4%)
Query: 18 FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MS 76
++ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++
Sbjct: 23 YLGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGIN 82
Query: 77 SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAER 136
S+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+
Sbjct: 83 SIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADG 142
Query: 137 LCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDE 196
+ WPD L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D
Sbjct: 143 ILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDP 202
Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE 256
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI
Sbjct: 203 LKREAINITLDHKCKVFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKIL 261
Query: 257 LNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIAN 315
LN FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ +
Sbjct: 262 LNYFGNYVPNSWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLT 320
Query: 316 LNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSL 375
L+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 321 LDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCR 380
Query: 376 H-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYAR 434
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YAR
Sbjct: 381 QDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYAR 440
Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLR 493
S DY++I+ G++ G+WNVPY+ N YL+K ++ N + + + +D DMA C N R
Sbjct: 441 SEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAR 498
Query: 494 N---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNP 532
KG+ + I + E+G L+ + N++ N +++++ NP
Sbjct: 499 EMTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENP 558
Query: 533 LDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNND 592
+DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D
Sbjct: 559 VDWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHD 617
Query: 593 KRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRY 652
R+ GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y
Sbjct: 618 SRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKY 676
Query: 653 RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLT 712
P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLT
Sbjct: 677 SPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLT 736
Query: 713 HYHEGLQVTQGTRYIMISFVDP 734
H HEGL V GTRYI +SF+DP
Sbjct: 737 HLHEGLPVKNGTRYIAVSFIDP 758
>gi|354477608|ref|XP_003501011.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
[Cricetulus griseus]
Length = 658
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 315/708 (44%), Positives = 446/708 (62%), Gaps = 59/708 (8%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
DK LVITVA+ ET+GY+RF+QSAE V+TLGL W GGD++ ++GGG KV LK E
Sbjct: 5 DKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGHEWRGGDVARTVGGGQKVRWLKKE 64
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
+++ +DMII+ DSYDVI+ ++L++F ++++F AE CWP+ L ++YP
Sbjct: 65 MEKYANREDMIIMFVDSYDVILASSPAELLKKFVQSGSHLLFSAEGFCWPEWGLAEQYPE 124
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
VG G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K K+ LD +
Sbjct: 125 VGMGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLKLNLDHKS 184
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
+FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+ W
Sbjct: 185 RIFQNLNGALDEVVLKFDQNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNGWTP 243
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
GC CN + L +P P VL++VF
Sbjct: 244 QGGCGFCNQNQRTLPGGQPP--PRVLLAVF------------------------------ 271
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
F K + ++ EAR++A+++ +FYF +D+
Sbjct: 272 ------------------AHFSAAKLVGPEEALSPGEARDMAMDSCRQDPKCEFYFSLDA 313
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP+ L+ L+ +N +I P+L R K WSNFWGAL+ D +YARS DY+ ++ +
Sbjct: 314 DAVLTNPETLRILIEQNRKVICPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQRKR- 372
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+ Y+++ ++ K +++ + D DMAFC +LR+KGI L + + E
Sbjct: 373 -VGVWNVPYISQAYVIRGETLRTELPQKEVFSGSDTDPDMAFCKSLRDKGIFLHLSNQHE 431
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+G L+ + +D +P+++++ NP+DW +YIH Y ++L + QPCPDV+WFP+
Sbjct: 432 FGRLLATSRYDTDHLHPDLWQIFDNPVDWKEQYIHENYSRALDGQGLVEQPCPDVYWFPL 491
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
+TE+ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR YV
Sbjct: 492 LTEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLRTYV 551
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN GVDYEGGG
Sbjct: 552 GPMTEYLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGVDYEGGG 610
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RY+C +++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 611 CRFLRYDCRISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 658
>gi|340378758|ref|XP_003387894.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
[Amphimedon queenslandica]
Length = 718
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 312/708 (44%), Positives = 453/708 (63%), Gaps = 14/708 (1%)
Query: 36 VITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSL-GGGYKVNLLKNELDEM 94
VITVA+ ETDG+KRF++SA + V+ +G+ + W GGD+ GGG+K+NLLK L++
Sbjct: 16 VITVATEETDGFKRFMKSAAYYGISVEIVGMGEEWKGGDIQRYPGGGFKLNLLKPVLEKW 75
Query: 95 DITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSG 154
D++++ DSYDVI ILE+F F N+VF AE+ CWPD SL +YP VG G
Sbjct: 76 RERKDLVVMFVDSYDVIFAANSEKILEKFKDFRTNLVFSAEQFCWPDQSLASRYPKVGLG 135
Query: 155 YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQ 214
R+L SGG+IGYA + +I++ I + +DDQL+Y ++LD R K+ + LD +++FQ
Sbjct: 136 KRFLCSGGYIGYASQMYSIITDSEISDTDDDQLFYTKIYLDPHKRDKYGMRLDHRSHIFQ 195
Query: 215 NLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT-SGC 273
NL G+ ++I L+ +E + TNT YNT I+HGNG SK LN GNY+ + GC
Sbjct: 196 NLNGAEDEIDLHVTSNE-SYATNTLYNTRAAILHGNGGSKNFLNFLGNYIPNQYNVDEGC 254
Query: 274 TRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEE-FLNKIANLNYPAKKISMFVYNNQE 332
C+ H ++P+V + +F+ PT FL E L ++ LNYP I ++VYN
Sbjct: 255 LHCSEGLHELPEDSSKWPTVFVGLFVMSPTPFLREAILKSLSELNYPKNLIHLWVYNKNS 314
Query: 333 YHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLD 392
YH L + K+ + +V+Y + EAR A++ SL K D+Y +DS D
Sbjct: 315 YHEDLLSKWSDEVKSEYASVQYTGSYRDITEIEARTTAMKESLSKKSDYYLMLDSTGVFD 374
Query: 393 NPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIW 452
+PD L+ L+ N+ +IAP+L RP K W+NFWG++ DGFYARS DY I+ + KGIW
Sbjct: 375 DPDALRKLITLNKHVIAPILGRPDKYWTNFWGSIAKDGFYARSRDYFVIVESKR--KGIW 432
Query: 453 NVPYITNCYLMKTSVI--KATNIKTIYTLNSMDY--DMAFCTNLRNKGIHLKIDSTQEYG 508
NVP+I+ L + + ++T+ K + + S ++ DMA C +RN G + + + Q+YG
Sbjct: 433 NVPFISTAILFEGEWLLKRSTDAKGLPSFASEEFEPDMALCQWMRNNGHFMYVSNLQKYG 492
Query: 509 HLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSL--LPDTVNNQPCPDVFWFPI 566
HL+ + N++ + ++Y + N +W+ +Y+H Y +L PD ++ QPC DV+WFP+
Sbjct: 493 HLISTSNYEIHHLHNDIYNIFENRQEWEKKYLHENYSVALNAGPDDIS-QPCTDVYWFPL 551
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
++ + E ++ +E +G+WS+G N+D RL+ GYE VPTRDIHM QVG W + L KYV
Sbjct: 552 LSPAYTKEIIEELEKFGKWSNGENDDPRLDGGYENVPTRDIHMNQVGFEKQWLDILAKYV 611
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
VP+Q + F GY+ RA ++FVV+Y P QP LRPHHDSST+TIN+AL + G+D++GGG
Sbjct: 612 VPIQIKVFPGYYSR-ARADLNFVVKYHPQGQPDLRPHHDSSTFTINVALTRPGIDHQGGG 670
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+R NC+V T++GW LMHPGRLTHYHEGL T GTRYIM+SFVDP
Sbjct: 671 CRFVRQNCSVVDTKLGWALMHPGRLTHYHEGLPTTSGTRYIMVSFVDP 718
>gi|327284629|ref|XP_003227039.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like
[Anolis carolinensis]
Length = 759
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 313/731 (42%), Positives = 465/731 (63%), Gaps = 31/731 (4%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGG-DMSSLGGGYKVNLL 87
I DK LV+T+A+ ETDG+ RF+QSA+ VK LG + W GG ++S+GGG KV LL
Sbjct: 35 IPTDKLLVLTIATKETDGFHRFMQSAKHFNYTVKILGEGEKWKGGKSLNSIGGGQKVRLL 94
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K+ LD +D++++ D YDVI G ++L++F + +VF A+ L WPD L DK
Sbjct: 95 KSALDIYADQEDLVVMYVDCYDVIFAAGPEELLKKFQQANHKVVFAADGLIWPDKRLSDK 154
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V SG R+LN+GGFIGY+ + ++ +++ +DDQL+Y +++D R + I LD
Sbjct: 155 YPVVRSGKRFLNAGGFIGYSPSVNSIVQQWDLQDNDDDQLFYTKIYIDPLKRERINITLD 214
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
N+FQ L G+++++ L F+ + N+ Y+T PV +HGNG +K+ LN FGNY+
Sbjct: 215 HKCNIFQTLNGAVDEVLLKFE-EGRARARNSVYDTLPVTLHGNGPTKLNLNYFGNYIPNG 273
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
W + +GC CN LD + P+V+I VFI++PT FL FL+++ L+Y +K+S F
Sbjct: 274 WTRETGCIACNK-DLLDLATLTETPTVIIGVFIEQPTPFLARFLDRLLTLDYAKEKLSFF 332
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE-NSLHKGVDFYFYV 385
++NN+ YH + K M K +K + ++ +ARN+ +E +K D+YF +
Sbjct: 333 IHNNEVYHEKHIKKFWEKAKNMIKTIKIVGPEENLSQADARNMGMEICRQNKECDYYFSI 392
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L NPD LK L+ +N +IAPL++R K WSNFWGAL+ADG+YARS DY++I+ G+
Sbjct: 393 DADVVLTNPDTLKILIEQNRKIIAPLVMRHGKLWSNFWGALSADGYYARSEDYIDIVQGN 452
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------- 494
+ G+WNVP++ N YL+K +++ + + +D DMA C N R
Sbjct: 453 R--VGLWNVPFVANIYLIKGQTLRSEMKERNYFARERLDSDMALCRNAREMTLQREKDSP 510
Query: 495 -----------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
KG+ + I + E+G L+ + N++ N +++++ NP+DW YI+P
Sbjct: 511 SAETFHMLRPPKGVFMYITNRHEFGRLLSTANYNISHYNNDLWQIFENPVDWKEVYINPN 570
Query: 544 YQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVP 603
Y K + + + QPCPDVFWFPI +E C E V+ ME +GQWS G ++D R+ GYE VP
Sbjct: 571 YSK-IFTEKIVEQPCPDVFWFPIFSEAACDELVEEMEHFGQWSGGRHHDSRISGGYENVP 629
Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
T DIHMKQ+GL VW F+R+++ P+ + F GY+ + A ++FVV+Y PD Q SLRPH
Sbjct: 630 TDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKG-HALLNFVVKYSPDRQRSLRPH 688
Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
HD+ST+TINIALN+V D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL + +G
Sbjct: 689 HDASTFTINIALNKVEEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPILKG 748
Query: 724 TRYIMISFVDP 734
TRYI +SF+DP
Sbjct: 749 TRYIAVSFIDP 759
>gi|155372023|ref|NP_001094619.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 precursor [Bos
taurus]
gi|154425678|gb|AAI51391.1| PLOD2 protein [Bos taurus]
gi|296491069|tpg|DAA33152.1| TPA: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Bos taurus]
Length = 762
Score = 633 bits (1633), Expect = e-178, Method: Compositional matrix adjust.
Identities = 315/741 (42%), Positives = 464/741 (62%), Gaps = 31/741 (4%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD +++
Sbjct: 28 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINT 87
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ +D+++L T+ ++VI GG ++L++F + +VF A+ +
Sbjct: 88 IGGGQKVRLMKEIMEHYANQEDLVVLFTECFNVIFAGGPEEVLKKFQKSNHKVVFAADGI 147
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D
Sbjct: 148 LWPDKRLADKYPIVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 207
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + N Y T PV+I+GNG +KI L
Sbjct: 208 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVMINGNGPTKILL 266
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ +W + +GCT C + +D D +P+V I VFI++PT FL FLN + L
Sbjct: 267 NYFGNYIPNAWTQDNGCTFCE-VDTIDLSAVDVYPNVTIGVFIEQPTPFLPRFLNTLLTL 325
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + F++N + YH + K +K + ++ EARN+ ++
Sbjct: 326 DYPKEALKFFIHNKEVYHEKDIKVFFDKAKHEITTIKIVGPEENLSQAEARNMGMDFCRQ 385
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
K ++YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 386 DKNCEYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 445
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTL-NSMDYDMAFCTNLRN 494
DY++I+ G++ GIWNVPY+ N YL+K +++ I+ Y + + +D DMA C N R
Sbjct: 446 EDYVDIVQGNR--VGIWNVPYMANVYLIKGKTLRSEMIERNYFVRDKLDPDMALCRNARE 503
Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
KG+ + I + E+G L+ + N++ N +++++ NP+
Sbjct: 504 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNISHFNNDLWQIFENPV 563
Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D
Sbjct: 564 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDS 622
Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
R+ GYE VPT DIHMKQ+GL VW F+R+++ P+ + F GY+ + A ++FVV+Y
Sbjct: 623 RISGGYENVPTDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 681
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH
Sbjct: 682 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 741
Query: 714 YHEGLQVTQGTRYIMISFVDP 734
HEGL V GTRYI +SF+DP
Sbjct: 742 LHEGLPVKNGTRYIAVSFIDP 762
>gi|332232414|ref|XP_003265401.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 1 [Nomascus leucogenys]
Length = 758
Score = 633 bits (1632), Expect = e-178, Method: Compositional matrix adjust.
Identities = 315/741 (42%), Positives = 462/741 (62%), Gaps = 31/741 (4%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA + ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDSL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLAL 321
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499
Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
KG+ + I + E+G L+ + N++ N +++++ NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 559
Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618
Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
R+ GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737
Query: 714 YHEGLQVTQGTRYIMISFVDP 734
HEGL V GTRYI +SF+DP
Sbjct: 738 LHEGLPVKNGTRYIAVSFIDP 758
>gi|403278813|ref|XP_003930979.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 1 [Saimiri boliviensis boliviensis]
Length = 758
Score = 632 bits (1631), Expect = e-178, Method: Compositional matrix adjust.
Identities = 316/741 (42%), Positives = 462/741 (62%), Gaps = 31/741 (4%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ + K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGANSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K + DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLMKEIMGHYADQDDLVVMFTECFDVIFAGGPEELLKKFQKANHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTL 321
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 382 DENCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499
Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
KG+ + I + E+G L+ + N++ N +++++ NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 559
Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618
Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
R+ GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737
Query: 714 YHEGLQVTQGTRYIMISFVDP 734
HEGL V GTRYI +SF+DP
Sbjct: 738 LHEGLPVKNGTRYIAVSFIDP 758
>gi|410254496|gb|JAA15215.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [Pan
troglodytes]
Length = 758
Score = 632 bits (1631), Expect = e-178, Method: Compositional matrix adjust.
Identities = 315/741 (42%), Positives = 462/741 (62%), Gaps = 31/741 (4%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA + ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKIFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499
Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
KG+ + I + E+G L+ + N++ N +++++ NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 559
Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618
Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
R+ GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737
Query: 714 YHEGLQVTQGTRYIMISFVDP 734
HEGL V GTRYI +SF+DP
Sbjct: 738 LHEGLPVKSGTRYIAVSFIDP 758
>gi|268529772|ref|XP_002630012.1| C. briggsae CBR-LET-268 protein [Caenorhabditis briggsae]
Length = 733
Score = 632 bits (1630), Expect = e-178, Method: Compositional matrix adjust.
Identities = 301/717 (41%), Positives = 466/717 (64%), Gaps = 18/717 (2%)
Query: 30 DEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLK 88
D + +V+TVA+ TDG KR ++SA+ + ++ LGL + W GGD GGG K+ +L
Sbjct: 23 DLPELVVVTVATENTDGLKRLLESAKAFDINIEVLGLGEKWNGGDTRVEKGGGQKIRILS 82
Query: 89 NELDEMDITDDMIILVTDSYDVIIDGGVNDILERF--NTFDANIVFGAERLCWPDTSLYD 146
+++ D II+ D+YDV+ + +IL++F + ++FGAE CWPD +L
Sbjct: 83 KWIEKYKDASDTIIMFVDAYDVVFNADSKNILQKFLEHYPGKQLLFGAEPFCWPDQTLAP 142
Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
YP V G R+LNSG F+GY + ++++ +S+++++DDQLYY +++LDE LR + + L
Sbjct: 143 DYPIVEFGKRFLNSGLFMGYGPQVHKILTLKSVEDKDDDQLYYTMIYLDEKLRKELNMDL 202
Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
D+++ +FQNL G +ED++L F D N YNT P+I+HGNG SK LN GNYL
Sbjct: 203 DSMSKIFQNLNGVIEDVELQFKEDGTPEAYNAAYNTKPLIVHGNGPSKSHLNYLGNYLGN 262
Query: 267 SWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W + GC C+ + + +FP + +++FI KP F+EE L K++ +YP +I++
Sbjct: 263 RWNSQLGCRTCD---QEGAKEQTEFPLIGLNLFISKPVPFIEEVLQKVSEFDYPKNRIAL 319
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
++YNNQ + D++ + + + I + + ++ARN A++ + +F F++
Sbjct: 320 YIYNNQPFSIKNIQDFLKDHGKSYYTKRIINGVTEIGERQARNEAIDWCKQRDTEFAFFM 379
Query: 386 DSDSHLDNPDVLKYLVNRNES----LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
D D++ P V+K L++ ++S +I+P++ +P K ++NFWGA+ A+G+YARS DYM I
Sbjct: 380 DGDAYFTEPTVIKDLIHYSKSYDVGIISPMVGQPGKLFTNFWGAIAANGYYARSEDYMAI 439
Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
+ G++ G WNVP++T+ LM + A + Y N +D DM+ C R+ G + I
Sbjct: 440 VKGNR--VGYWNVPFVTSALLMSKEKLGAMSGAYTYNKN-LDPDMSLCQFARDNGHFMYI 496
Query: 502 DSTQEYGHLVDSENFDPQKT----NPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQP 557
++ + +G+L+ S+ F T +PE++++ N W+ RYIHP Y K + PD + +Q
Sbjct: 497 NNEKYFGYLIVSDEFSETVTEGKWHPEMWQIFENRELWEARYIHPGYHKIMEPDHIIDQA 556
Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGV 617
CPDV+ +P+++E+FC E ++ ME +G+WSDG+NNDKRL GYE VPTRDIHM QVG
Sbjct: 557 CPDVYDYPLMSERFCEELIEEMEGFGRWSDGSNNDKRLAGGYENVPTRDIHMNQVGFERQ 616
Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
W FL YV P+QE+ FIGY+H+PV + M FVVRY+P+EQ SLRPHHD+ST++I+IALN+
Sbjct: 617 WLYFLDTYVRPVQEKTFIGYYHQPVESNMMFVVRYKPEEQASLRPHHDASTFSIDIALNK 676
Query: 678 VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
G DYEGGG R++RYNC V A +G+ +M PGRLTH HEGL T+GTRYIM+SF++P
Sbjct: 677 KGRDYEGGGVRYVRYNCTVEADEVGYAMMFPGRLTHLHEGLATTKGTRYIMVSFINP 733
>gi|397512442|ref|XP_003826554.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 2 [Pan paniscus]
Length = 758
Score = 632 bits (1630), Expect = e-178, Method: Compositional matrix adjust.
Identities = 315/741 (42%), Positives = 462/741 (62%), Gaps = 31/741 (4%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLMKEVMERYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA + ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKIFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499
Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
KG+ + I + E+G L+ + N++ N +++++ NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 559
Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618
Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
R+ GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737
Query: 714 YHEGLQVTQGTRYIMISFVDP 734
HEGL V GTRYI +SF+DP
Sbjct: 738 LHEGLPVKNGTRYIAVSFIDP 758
>gi|410307702|gb|JAA32451.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [Pan
troglodytes]
Length = 758
Score = 632 bits (1630), Expect = e-178, Method: Compositional matrix adjust.
Identities = 315/741 (42%), Positives = 462/741 (62%), Gaps = 31/741 (4%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA + ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKIFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499
Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
KG+ + I + E+G L+ + N++ N +++++ NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 559
Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618
Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
R+ GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737
Query: 714 YHEGLQVTQGTRYIMISFVDP 734
HEGL V GTRYI +SF+DP
Sbjct: 738 LHEGLPVKNGTRYIAVSFIDP 758
>gi|33636742|ref|NP_891988.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 1
precursor [Homo sapiens]
gi|22713625|gb|AAH37169.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [Homo sapiens]
gi|119599345|gb|EAW78939.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2, isoform CRA_a
[Homo sapiens]
Length = 758
Score = 632 bits (1630), Expect = e-178, Method: Compositional matrix adjust.
Identities = 315/741 (42%), Positives = 462/741 (62%), Gaps = 31/741 (4%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA + ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKVYIDPL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499
Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
KG+ + I + E+G L+ + N++ N +++++ NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 559
Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618
Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
R+ GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737
Query: 714 YHEGLQVTQGTRYIMISFVDP 734
HEGL V GTRYI +SF+DP
Sbjct: 738 LHEGLPVKNGTRYIAVSFIDP 758
>gi|440899349|gb|ELR50661.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2, partial [Bos
grunniens mutus]
Length = 726
Score = 632 bits (1629), Expect = e-178, Method: Compositional matrix adjust.
Identities = 313/728 (42%), Positives = 460/728 (63%), Gaps = 31/728 (4%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNE 90
DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++++GGG KV L+K
Sbjct: 5 DKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINTIGGGQKVRLMKEV 64
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
++ +D+++L T+ ++VI GG ++L++F + +VF A+ + WPD L DKYP
Sbjct: 65 MEHYANQEDLVVLFTECFNVIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKRLADKYPI 124
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D R I LD
Sbjct: 125 VHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKC 184
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
+FQ L G+++++ L F+ + N Y T PV+I+GNG +KI LN FGNY+ +W +
Sbjct: 185 KIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVMINGNGPTKILLNYFGNYIPNAWTQ 243
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GCT C + +D D +P+V I VFI++PT FL FLN + L+YP + + F++N
Sbjct: 244 DNGCTFCE-VDTIDLSAVDVYPNVTIGVFIEQPTPFLPRFLNTLLTLDYPKEALKFFIHN 302
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
+ YH + K +K + ++ EARN+ ++ K ++YF VD+D
Sbjct: 303 KEVYHEKDIKVFFDKAKHEITTIKIVGPEENLSQAEARNMGMDFCRQDKNCEYYFSVDAD 362
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++
Sbjct: 363 VVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR-- 420
Query: 449 KGIWNVPYITNCYLMKTSVIKATNIKTIYTL-NSMDYDMAFCTNLRN------------- 494
GIWNVPY+ N YL+K +++ I+ Y + + +D DMA C N R
Sbjct: 421 VGIWNVPYMANVYLIKGKTLRSEMIERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPE 480
Query: 495 --------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
KG+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K
Sbjct: 481 TFQMLSPPKGVFMYISNRHEFGRLLSTANYNISHFNNDLWQIFENPVDWKEKYINRDYSK 540
Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRD 606
+ + + QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+ GYE VPT D
Sbjct: 541 -IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDD 599
Query: 607 IHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
IHMKQ+GL VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+
Sbjct: 600 IHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDA 658
Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRY
Sbjct: 659 STFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRY 718
Query: 727 IMISFVDP 734
I +SF+DP
Sbjct: 719 IAVSFIDP 726
>gi|431899781|gb|ELK07728.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Pteropus alecto]
Length = 785
Score = 632 bits (1629), Expect = e-178, Method: Compositional matrix adjust.
Identities = 314/728 (43%), Positives = 458/728 (62%), Gaps = 31/728 (4%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNE 90
DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S+GGG KV L+K
Sbjct: 64 DKLLVITVATKESDGFHRFMQSAQYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEV 123
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
++ +D+++L T+ +DVI GG ++L++F + +VF A+ + WPD L DKYP
Sbjct: 124 MEHYASQEDLVVLFTECFDVIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKRLADKYPI 183
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D R I LD
Sbjct: 184 VHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKC 243
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
+FQ L G+++++ L F+ + N Y T PV I+GNG +KI LN FGNY+ +W +
Sbjct: 244 KIFQALNGAVDEVVLKFENGK-ARAKNVFYETLPVAINGNGPTKILLNYFGNYVPNAWTQ 302
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GCT C L +D + +P+V I VFI++PT FL FL+ + L+YP + + +F++N
Sbjct: 303 DNGCTLCEL-DTIDLSAVNVYPNVTIGVFIEQPTPFLSRFLDVLLTLDYPKEALKVFIHN 361
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
+ YH + K +K + ++ EARN+ ++ K D+YF VD+D
Sbjct: 362 KEVYHEKDIKVFFDKAKHEISTIKVVGPEENLSQAEARNMGMDFCRQDKNCDYYFSVDAD 421
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++
Sbjct: 422 VVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR-- 479
Query: 449 KGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------------- 494
GIWNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 480 IGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPE 539
Query: 495 --------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
KG+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K
Sbjct: 540 TFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK 599
Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRD 606
+ + + QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+ GYE VPT D
Sbjct: 600 -IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDD 658
Query: 607 IHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
IHMKQ+ L VW F+R+++ P+ + F GY+ + A ++FVV+Y PD Q SLRPHHD+
Sbjct: 659 IHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRSLRPHHDA 717
Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRY
Sbjct: 718 STFTINIALNSVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRY 777
Query: 727 IMISFVDP 734
I +SF+DP
Sbjct: 778 IAVSFIDP 785
>gi|296227892|ref|XP_002759562.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 1 [Callithrix jacchus]
Length = 758
Score = 631 bits (1628), Expect = e-178, Method: Compositional matrix adjust.
Identities = 316/741 (42%), Positives = 461/741 (62%), Gaps = 31/741 (4%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K + DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLVKEVMGHYADQDDLVVMFTECFDVIFAGGPEELLKKFQKANHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTL 321
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 382 DENCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499
Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
KG+ + I + E+G L+ + N++ N +++++ NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRNEFGRLLSTANYNTSHYNNDLWQIFENPV 559
Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618
Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
R+ GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737
Query: 714 YHEGLQVTQGTRYIMISFVDP 734
HEGL V GTRYI +SF+DP
Sbjct: 738 LHEGLPVKNGTRYIAVSFIDP 758
>gi|355746993|gb|EHH51607.1| hypothetical protein EGM_11017, partial [Macaca fascicularis]
Length = 723
Score = 631 bits (1628), Expect = e-178, Method: Compositional matrix adjust.
Identities = 314/728 (43%), Positives = 457/728 (62%), Gaps = 31/728 (4%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNE 90
DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S+GGG KV L+K
Sbjct: 2 DKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEV 61
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
++ DD++++ T+ +DVI GG ++L++F + +VF A+ + WPD L DKYP
Sbjct: 62 MEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGILWPDKRLADKYPV 121
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D R I LD
Sbjct: 122 VHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKC 181
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
+FQ L G+++++ L F+ + NT Y T PV I+GNG +KI LN FGNY+ SW +
Sbjct: 182 KIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQ 240
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GCT C +D D P+V I VFI++PT FL FL+ + L+YP + + +F++N
Sbjct: 241 GNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHN 299
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
+ YH + K K +K + ++ EARN+ ++ + D+YF VD+D
Sbjct: 300 KEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSVDAD 359
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++
Sbjct: 360 VVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR-- 417
Query: 449 KGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLRN------------- 494
G+WNVPY+ N YL+K ++ N + + + +D DMA C N R
Sbjct: 418 VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPE 477
Query: 495 --------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
KG+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K
Sbjct: 478 TFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK 537
Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRD 606
+ + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+ GYE VPT D
Sbjct: 538 -IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDD 596
Query: 607 IHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
IHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+
Sbjct: 597 IHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDA 655
Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRY
Sbjct: 656 STFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRY 715
Query: 727 IMISFVDP 734
I +SF+DP
Sbjct: 716 IAVSFIDP 723
>gi|301778997|ref|XP_002924915.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like
isoform 1 [Ailuropoda melanoleuca]
Length = 757
Score = 631 bits (1627), Expect = e-178, Method: Compositional matrix adjust.
Identities = 313/741 (42%), Positives = 464/741 (62%), Gaps = 31/741 (4%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 23 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 82
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ +D++IL T+ ++VI GG ++L++F + +VF A+ +
Sbjct: 83 IGGGQKVRLMKEVMEHYANQEDLVILFTECFNVIFAGGPEEVLKKFQKSNHKVVFAADGI 142
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA +I +++ ++++ +DDQL+Y +++D
Sbjct: 143 LWPDKRLADKYPIVHIGKRYLNSGGFIGYAPNINQIVQQWNLQDNDDDQLFYTKIYIDPL 202
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + N Y T PV ++GNG +KI L
Sbjct: 203 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVAVNGNGPTKILL 261
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ +W + +GCT C+L +D D P+V I VFI++PT FL FL+ + L
Sbjct: 262 NYFGNYVPNAWTQDNGCTLCDL-DTIDLSTVDVHPNVTIGVFIEQPTPFLPRFLDILLTL 320
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K +K + ++ EARN+ ++
Sbjct: 321 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKREISTIKIVGPEENLSQAEARNMGMDFCRQ 380
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF +D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 381 DENCDYYFSMDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 440
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ GIWNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 441 EDYVDIVQGNR--VGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 498
Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
KG+ + I + E+G L+ + N++ N +++++ NP+
Sbjct: 499 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 558
Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D
Sbjct: 559 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDS 617
Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
R+ GYE VPT DIHMKQ+ L VW F+R+++ P+ + F GY+ + A ++FVV+Y
Sbjct: 618 RISGGYENVPTDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 676
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH
Sbjct: 677 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 736
Query: 714 YHEGLQVTQGTRYIMISFVDP 734
HEGL V GTRYI +SF+DP
Sbjct: 737 LHEGLPVKNGTRYIAVSFIDP 757
>gi|28400781|emb|CAD23629.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2, long variant
[Rattus norvegicus]
Length = 758
Score = 630 bits (1626), Expect = e-178, Method: Compositional matrix adjust.
Identities = 315/731 (43%), Positives = 457/731 (62%), Gaps = 31/731 (4%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
I DK LVITVA+ E DG+ RF+ SA+ VK LG Q W GGD M+S+GGG KV L+
Sbjct: 34 IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLM 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K ++ DD++IL T+ +DVI GG ++L++F + IVF A+ L WPD L DK
Sbjct: 94 KEAMEHYAGQDDLVILFTECFDVIFAGGPEELLKKFQKTNHKIVFAADALLWPDKRLADK 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G RYLNSGGFIGYA I L+ ++++ +DDQL+Y +++D R I LD
Sbjct: 154 YPGVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREALNITLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQ L G+ +++ L F+ + + NT Y T PV I+GNG SKI LN FGNY+ S
Sbjct: 214 HRCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPSKILLNYFGNYVPNS 272
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
W + +GC C+ D D +P V + VFI++PT F FL+ + L+YP + + +F
Sbjct: 273 WTQENGCALCDFDAS-DLSTVDVYPKVTLGVFIEQPTPFQPRFLDLLLTLDYPKEALRLF 331
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
V+N + YH ++ K ++K + ++ EARN+ ++ + D+YF V
Sbjct: 332 VHNKEVYHEKDIKAFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------- 494
+ GIWNVPY+ N YL++ +++ + + + + +D DM+ C N R+
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMSERNYFVRDKLDPDMSLCRNARDMTLQREKDSP 509
Query: 495 -----------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
KG+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +
Sbjct: 510 TPETFQMLSPPKGVFMYISNRHEFGRLISTANYNTSHLNNDLWQIFENPVDWKEKYINRD 569
Query: 544 YQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVP 603
Y K + + + QPCPDVFWFPI +E+ C E V+ ME YG+WS G ++D R+ GYE VP
Sbjct: 570 YSK-IFTENIVEQPCPDVFWFPIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVP 628
Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
T DIHMKQ+ L VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPH
Sbjct: 629 TDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPH 687
Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
HD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V G
Sbjct: 688 HDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNG 747
Query: 724 TRYIMISFVDP 734
TRYI +SF+DP
Sbjct: 748 TRYIAVSFIDP 758
>gi|395840986|ref|XP_003793331.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
5-dioxygenase 1 [Otolemur garnettii]
Length = 721
Score = 630 bits (1625), Expect = e-178, Method: Compositional matrix adjust.
Identities = 310/707 (43%), Positives = 454/707 (64%), Gaps = 13/707 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWKVEKGISAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEVKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA + +L++ + + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPSLSKLVAEWEGHDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L L D P VL+ VFI++PT FL F ++ +L+YP ++ +F++
Sbjct: 264 ETGCTVCDEGLRSLKGLGDDALPLVLVGVFIEQPTPFLSLFFRRLLHLHYPRNRMRLFIH 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSD 388
N++++H + ++ + +++VK + V + +ARN+ + + ++
Sbjct: 324 NHEKHHKAQVEKFLAEHGSEYQSVKLVGPEVRVENADARNMGA-XVVGPAHPYXWWWAEG 382
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
P +L N +IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 383 PWCQEPYSAPFLRN----VIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 436
Query: 449 KGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
G+WNVPYI+N YL+K S ++A T ++ + +D DMAFC N+R + + + + + +
Sbjct: 437 VGVWNVPYISNIYLIKGSALRAELQSTDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTF 496
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 497 GHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWFPIF 555
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
TE+ C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 556 TEEACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYIA 615
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 616 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLVPHHDASTFTINIALNRVGVDYEGGGC 674
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RF+RYNC+V A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 675 RFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLATTKGTRYIAVSFVDP 721
>gi|344288962|ref|XP_003416215.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 1 [Loxodonta africana]
Length = 737
Score = 630 bits (1625), Expect = e-178, Method: Compositional matrix adjust.
Identities = 315/714 (44%), Positives = 461/714 (64%), Gaps = 10/714 (1%)
Query: 25 KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYK 83
K +I DK LVITVA+ E+DG+ RF++SAE VK LG + W GGD ++S+GGG K
Sbjct: 30 KPSSIPTDKLLVITVATKESDGFHRFMKSAEYFNYTVKVLGQGEEWRGGDGINSIGGGQK 89
Query: 84 VNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
V L+K ++ +D+++L T+ +DVI GG ++L++F + +VF A+ + WPD
Sbjct: 90 VRLMKEVMEHYANQEDLVVLFTECFDVIFAGGPEEVLKKFLKTNHKVVFAADGILWPDKR 149
Query: 144 LYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHK 203
L DKYP V G RYLNSGGFIGYA I ++ +++ +DDQL+Y +++D R
Sbjct: 150 LADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWDLQDNDDDQLFYTKIYIDPLKREALN 209
Query: 204 IVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNY 263
I LD +FQ L G+++++ L F+ + N Y T PV I+GNG +KI LN FGNY
Sbjct: 210 ITLDHKCKIFQALNGAVDEVVLKFENGK-ARAKNVFYETLPVAINGNGPTKIVLNYFGNY 268
Query: 264 LAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
+ SW + SGCT C+L +D + D +P+V I +FI++PT FL FL+ + L+YP
Sbjct: 269 VPNSWTQDSGCTLCDL-NVIDLSQVDVYPNVTIGIFIEQPTPFLPRFLDTLLTLDYPKDA 327
Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDF 381
+ +FV+N + YH + K ++K + ++ EARN+ ++ + ++
Sbjct: 328 LKLFVHNREVYHEKDIKAFFDKAKHEISSIKIVGPEEDLSQAEARNMGMDLCRQDEKCNY 387
Query: 382 YFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I
Sbjct: 388 YFSVDADVVLTNPRTLKLLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDI 447
Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLK 500
+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R G+ +
Sbjct: 448 VQGNR--VGVWNVPYMANVYLIKGDTLRSEMNERNYFVRDKLDPDMALCRNAREMGVFMY 505
Query: 501 IDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPD 560
I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + + QPCPD
Sbjct: 506 ISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENLVEQPCPD 564
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
VFWFPI +EK C E V+ ME YGQWS G ++D R+ GYE VPT DIHMKQ+ L VW
Sbjct: 565 VFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLH 624
Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGV 680
F+R+++ P+ + F GY+ + A ++FVV+Y PD Q SLRPHHD+ST+TINIALN VG
Sbjct: 625 FIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRSLRPHHDASTFTINIALNNVGQ 683
Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 684 DFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737
>gi|198434968|ref|XP_002131164.1| PREDICTED: similar to Plod3 protein [Ciona intestinalis]
Length = 729
Score = 629 bits (1623), Expect = e-177, Method: Compositional matrix adjust.
Identities = 306/734 (41%), Positives = 466/734 (63%), Gaps = 12/734 (1%)
Query: 7 LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGL 66
++ L +S +F + + K+ + + L++TVA++ETDG+ RF +S + L V +G+
Sbjct: 2 VSLLSVSVTLFCLFLSIEHAKSQETTELLIVTVATDETDGFVRFKESLDYFNLTVLVIGM 61
Query: 67 HQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNT 125
H+ W+GGD+S +GGG K+N+LK L+ ++++ TDSYDV+ GG +I+ +FN
Sbjct: 62 HEEWVGGDLSRGMGGGQKINMLKRSLESYKDNTNLVLFFTDSYDVVFTGGKEEIMSKFNK 121
Query: 126 FDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDD 185
F+A +VF AE WPD SL D YP V G R+L SGG IGYA E I+ + I + DD
Sbjct: 122 FNAKLVFSAESTIWPDASLKDLYPEVTVGKRFLCSGGIIGYAPTFWEAINMQDISDTFDD 181
Query: 186 QLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPV 245
QLYY ++L+ TLR K LD + L QN+ + ++++ + + NT Y T PV
Sbjct: 182 QLYYTKIYLNTTLRAKLNATLDHTSQLVQNINFAKSELEI-VQQGDLSRIQNTVYRTYPV 240
Query: 246 IIHGNGKSKIELNSFGNYLAKSWKTS-GCTRC--NLIKHLDSLKPDQFPSVLISVFIDKP 302
+IHGNG SK+ELN NY+ W ++ GC +C NL++ ++ + P+V +++FI+
Sbjct: 241 VIHGNGPSKLELNYMANYIPDGWHSNFGCRKCEWNLLQLPEA--EENLPTVQLAIFIEPN 298
Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
T F+ EFL++I L+YP KI++F++ N+E ++ + ++ V+ I+ + V+
Sbjct: 299 TPFIPEFLSRIQQLDYPKSKITLFIHTNEENTERYVSQFLLRHRVKYQGVQVISPHDGVH 358
Query: 363 SKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNF 422
ARN+A+++ + K D+ +D + + N ++K+L+ +N+ ++ PL+ K WSNF
Sbjct: 359 EATARNMALDHCILKNCDYQLSIDGNVQITNSSLIKFLMTKNKQVVGPLVKLHEKLWSNF 418
Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK--ATNIKTIYTLN 480
WGALNADG+YARS DY++I+N ++ GIWN+P+I++ YLMK+ I+ + + Y
Sbjct: 419 WGALNADGYYARSADYISIVNRER--TGIWNIPFISSVYLMKSETIRFLLSRVPQPYFYE 476
Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
MD DMAFC ++R +GI L + + E+G L+ N +P +P+++++ N DW+ +YI
Sbjct: 477 DMDADMAFCAHVRQEGIFLHVTNEAEFGRLLSKANVNPGPVHPDLWQIETNKKDWEEKYI 536
Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYE 600
HP++ L +T +QPCPDV+ FP+ TE+ V +ME +G+WS G N D RL GYE
Sbjct: 537 HPDFWNLTLENTEVSQPCPDVYMFPLFTEEMADAIVDVMENHGEWSGGKNKDDRLAGGYE 596
Query: 601 AVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSL 660
VPT DIHM QV W L Y + ++ + GY+ + + M FVVRYRP EQ L
Sbjct: 597 NVPTVDIHMNQVNYEKQWLHMLATYPTHIIQKVYPGYYTK-ASSIMMFVVRYRPSEQSFL 655
Query: 661 RPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQV 720
RPHHDSST+T+N+ALN G DYEGGGCRF+RY+C+VT G+ L+HPGRLTHYHEGLQ
Sbjct: 656 RPHHDSSTWTMNVALNTYGEDYEGGGCRFLRYDCSVTQIPKGYALVHPGRLTHYHEGLQT 715
Query: 721 TQGTRYIMISFVDP 734
+GTRYI +SFVDP
Sbjct: 716 MEGTRYIAVSFVDP 729
>gi|218931163|ref|NP_001136387.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 2
precursor [Rattus norvegicus]
gi|149018892|gb|EDL77533.1| rCG25923, isoform CRA_b [Rattus norvegicus]
Length = 737
Score = 629 bits (1622), Expect = e-177, Method: Compositional matrix adjust.
Identities = 314/710 (44%), Positives = 457/710 (64%), Gaps = 10/710 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
I DK LVITVA+ E DG+ RF+ SA+ VK LG Q W GGD M+S+GGG KV L+
Sbjct: 34 IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLM 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K ++ DD++IL T+ +DVI GG ++L++F + IVF A+ L WPD L DK
Sbjct: 94 KEAMEHYAGQDDLVILFTECFDVIFAGGPEELLKKFQKTNHKIVFAADALLWPDKRLADK 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G RYLNSGGFIGYA I L+ +++ +DDQL+Y +++D R I LD
Sbjct: 154 YPGVHIGKRYLNSGGFIGYAPYISRLVQQWDLQDNDDDQLFYTKVYIDPLKREALNITLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQ L G+ +++ L F+ + + NT Y T PV I+GNG +KI LN FGNY+ S
Sbjct: 214 HRCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
W + +GC C+ +D D +P V + VFI++PT FL FL+ + L+YP + + +F
Sbjct: 273 WTQENGCALCDF-DTIDLSTVDVYPKVTLGVFIEQPTPFLPRFLDLLLTLDYPKEALRLF 331
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
V+N + YH ++ K ++K + ++ EARN+ ++ + D+YF V
Sbjct: 332 VHNKEVYHEKDIKAFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
+ GIWNVPY+ N YL++ +++ + + + + +D DM+ C N R+ G+ + I +
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMSERNYFVRDKLDPDMSLCRNARDMGVFMYISNR 509
Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + + QPCPDVFWF
Sbjct: 510 HEFGRLISTANYNTSHLNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWF 568
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
PI +E+ C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQ+ L VW F+R+
Sbjct: 569 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIRE 628
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 629 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 687
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 688 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737
>gi|281337516|gb|EFB13100.1| hypothetical protein PANDA_014326 [Ailuropoda melanoleuca]
Length = 726
Score = 629 bits (1622), Expect = e-177, Method: Compositional matrix adjust.
Identities = 311/728 (42%), Positives = 460/728 (63%), Gaps = 31/728 (4%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNE 90
DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S+GGG KV L+K
Sbjct: 5 DKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEV 64
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
++ +D++IL T+ ++VI GG ++L++F + +VF A+ + WPD L DKYP
Sbjct: 65 MEHYANQEDLVILFTECFNVIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKRLADKYPI 124
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G RYLNSGGFIGYA +I +++ ++++ +DDQL+Y +++D R I LD
Sbjct: 125 VHIGKRYLNSGGFIGYAPNINQIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKC 184
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
+FQ L G+++++ L F+ + N Y T PV ++GNG +KI LN FGNY+ +W +
Sbjct: 185 KIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVAVNGNGPTKILLNYFGNYVPNAWTQ 243
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GCT C+L +D D P+V I VFI++PT FL FL+ + L+YP + + +F++N
Sbjct: 244 DNGCTLCDL-DTIDLSTVDVHPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHN 302
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
+ YH + K +K + ++ EARN+ ++ + D+YF +D+D
Sbjct: 303 KEVYHEKDIKVFFDKAKREISTIKIVGPEENLSQAEARNMGMDFCRQDENCDYYFSMDAD 362
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++
Sbjct: 363 VVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR-- 420
Query: 449 KGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------------- 494
GIWNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 421 VGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPE 480
Query: 495 --------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
KG+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K
Sbjct: 481 TFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK 540
Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRD 606
+ + + QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+ GYE VPT D
Sbjct: 541 -IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDD 599
Query: 607 IHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
IHMKQ+ L VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+
Sbjct: 600 IHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDA 658
Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRY
Sbjct: 659 STFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRY 718
Query: 727 IMISFVDP 734
I +SF+DP
Sbjct: 719 IAVSFIDP 726
>gi|403278817|ref|XP_003930981.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 3 [Saimiri boliviensis boliviensis]
Length = 791
Score = 629 bits (1622), Expect = e-177, Method: Compositional matrix adjust.
Identities = 315/739 (42%), Positives = 460/739 (62%), Gaps = 31/739 (4%)
Query: 21 VHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLG 79
+H + DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S+G
Sbjct: 59 IHPPALPRGRPDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIG 118
Query: 80 GGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCW 139
GG KV L+K + DD++++ T+ +DVI GG ++L++F + +VF A+ + W
Sbjct: 119 GGQKVRLMKEIMGHYADQDDLVVMFTECFDVIFAGGPEELLKKFQKANHKVVFAADGILW 178
Query: 140 PDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLR 199
PD L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D R
Sbjct: 179 PDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKR 238
Query: 200 TKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNS 259
I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI LN
Sbjct: 239 EAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILLNY 297
Query: 260 FGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNY 318
FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L+Y
Sbjct: 298 FGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTLDY 356
Query: 319 PAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-K 377
P + + +F++N + YH + K K +K + ++ EARN+ ++ +
Sbjct: 357 PKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQDE 416
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS D
Sbjct: 417 NCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSED 476
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN-- 494
Y++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 477 YVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMT 534
Query: 495 -------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDW 535
KG+ + I + E+G L+ + N++ N +++++ NP+DW
Sbjct: 535 LQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDW 594
Query: 536 DLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL 595
+YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+
Sbjct: 595 KEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRI 653
Query: 596 ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPD 655
GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y P+
Sbjct: 654 SGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPE 712
Query: 656 EQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYH 715
Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH H
Sbjct: 713 RQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLH 772
Query: 716 EGLQVTQGTRYIMISFVDP 734
EGL V GTRYI +SF+DP
Sbjct: 773 EGLPVKNGTRYIAVSFIDP 791
>gi|351707541|gb|EHB10460.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Heterocephalus
glaber]
Length = 758
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 315/728 (43%), Positives = 456/728 (62%), Gaps = 31/728 (4%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNE 90
DK LVITVA+ E+DGY RF+QSA+ VK LG + W GGD ++S+GGG KV L+K
Sbjct: 37 DKLLVITVATKESDGYYRFMQSAKYFNYTVKVLGQGEEWRGGDGLNSIGGGQKVRLMKEV 96
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
++ +DM+IL T+ +DVI GG ++L++F + +VF A+ + WPD L DKYP
Sbjct: 97 MEHYANQEDMVILFTECFDVIFAGGPEEVLKKFQKTNHKVVFAADGILWPDKRLADKYPI 156
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D R I LD
Sbjct: 157 VHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLQRQALNITLDHKC 216
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
+FQ L G+ +++ L F+ + NT Y T PV I+GNG +KI LN FGNY+ SW +
Sbjct: 217 KIFQALNGATDEVVLKFENGK-TRAKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQ 275
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GC C +D D +P+V I VFI++PT FL FL+ + L+YP + + +F++N
Sbjct: 276 DNGCALCEF-DTIDLSAVDVYPNVTIGVFIEQPTPFLPRFLDILLALDYPKEALKLFIHN 334
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
+ YH + K +K + ++ EARN+ ++ + D+YF +D+D
Sbjct: 335 KEVYHEKDIKVFFDKAKHEISTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSLDAD 394
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L N LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++
Sbjct: 395 VVLTNSRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR-- 452
Query: 449 KGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------------- 494
GIWNVPYI N YL+K ++++ N + + + +D DMA C N R
Sbjct: 453 VGIWNVPYIANVYLIKGKMLRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPE 512
Query: 495 --------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
KG+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K
Sbjct: 513 TFQMLIPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK 572
Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRD 606
+ + + QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+ GYE VPT D
Sbjct: 573 -IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDD 631
Query: 607 IHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
IHMKQ+ L VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+
Sbjct: 632 IHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDA 690
Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRY
Sbjct: 691 STFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRY 750
Query: 727 IMISFVDP 734
I +SF+DP
Sbjct: 751 IAVSFIDP 758
>gi|417412584|gb|JAA52670.1| Putative procollagen-lysine2-oxoglutarate 5-dioxygenase 2, partial
[Desmodus rotundus]
Length = 757
Score = 626 bits (1614), Expect = e-176, Method: Compositional matrix adjust.
Identities = 313/735 (42%), Positives = 456/735 (62%), Gaps = 31/735 (4%)
Query: 25 KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYK 83
K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S+GGG K
Sbjct: 29 KPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQK 88
Query: 84 VNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
V L+K + +D+++L T+ +DVI GG ++L +F + +VF A+ + WPD
Sbjct: 89 VRLMKEVMGHYADQEDLVVLFTECFDVIFAGGPEEVLRKFQKSNHKVVFAADGILWPDKR 148
Query: 144 LYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHK 203
L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D R
Sbjct: 149 LADKYPVVHIGKRYLNSGGFIGYAPYIHHIVQQWNLQDNDDDQLFYTKIYIDPLKREALN 208
Query: 204 IVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNY 263
I LD +FQ L G+++++ L F+ + N Y T PV I+GNG +KI LN FGNY
Sbjct: 209 ITLDHKCKIFQALNGAVDEVVLKFENGK-ARAKNVFYETLPVAINGNGPTKILLNYFGNY 267
Query: 264 LAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
+ +W + GCT C L +D + +P+V I VFI++PT FL FL+ + L+YP +
Sbjct: 268 VPNAWTQDKGCTFCEL-DTVDLSAVNVYPNVTIGVFIEQPTPFLPRFLDTLLTLDYPKEA 326
Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK-GVDF 381
+ +F++N + YH + K +K + ++ EARN+ ++ +
Sbjct: 327 LKIFIHNKEVYHEKDIKVFFDKAKHEISTIKIVGPEENLSQAEARNMGMDFCRQDDSCGY 386
Query: 382 YFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I
Sbjct: 387 YFSVDADVVLTNPKTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDI 446
Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------ 494
+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 447 VQGNR--VGLWNVPYMANVYLIKGKTLRSEMNERNYFVRDRLDPDMALCRNAREMTVQRE 504
Query: 495 ---------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRY 539
KG+ + I + E+G L+ + N++ N +++++ NP+DW +Y
Sbjct: 505 KDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKY 564
Query: 540 IHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGY 599
I+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+ GY
Sbjct: 565 INRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGY 623
Query: 600 EAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPS 659
E VPT DIHMKQ+ L VW F+R+++ P+ + F GY+ + A ++FVV+Y PD Q S
Sbjct: 624 ENVPTDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRS 682
Query: 660 LRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQ 719
LRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL
Sbjct: 683 LRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLP 742
Query: 720 VTQGTRYIMISFVDP 734
V GTRYI +SF+DP
Sbjct: 743 VKNGTRYIAVSFIDP 757
>gi|432090515|gb|ELK23936.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Myotis davidii]
Length = 730
Score = 624 bits (1610), Expect = e-176, Method: Compositional matrix adjust.
Identities = 312/732 (42%), Positives = 455/732 (62%), Gaps = 31/732 (4%)
Query: 28 NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNL 86
+I DK LVIT+A+ E DG+ RF+QSA+ VK LG + W GGD ++S+GGG KV L
Sbjct: 5 SIFPDKLLVITIATKENDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRL 64
Query: 87 LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
+K ++ +DM++L T+ +DVI G ++L++F + +VF A+ + WPD L D
Sbjct: 65 MKEVMEHYASHEDMVVLFTECFDVIFAGSPEEVLKKFQKSNHKVVFAADGILWPDKRLAD 124
Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
KYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D R I L
Sbjct: 125 KYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREALNITL 184
Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
D +FQ L G+++++ L F+ + N Y T PV I+GNG +KI LN FGNY+
Sbjct: 185 DHKCKIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVAINGNGPTKILLNYFGNYVPN 243
Query: 267 SW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
+W + SGCT C L +D + +P+V I VFI++PT FL FL+ + L+YP + + +
Sbjct: 244 AWTQDSGCTLCEL-DTIDLSAVNVYPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEALKV 302
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK-GVDFYFY 384
F++N + YH + K +K + ++ EARN+ ++ D+YF
Sbjct: 303 FIHNKEVYHEKHIKVFFDKAKHEINTIKIVGPEENLSQAEARNMGMDFCRQDDSCDYYFS 362
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG++AR DY++I+ G
Sbjct: 363 VDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYFARYEDYVDIVQG 422
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN--------- 494
++ GIWNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 423 NR--VGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDRLDPDMALCRNAREMTLQREKDS 480
Query: 495 ------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHP 542
KG+ + I + E+G L+ + N++ N +++++ NP+DW +YI
Sbjct: 481 PTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYISH 540
Query: 543 EYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAV 602
+Y K + + + QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+ GYE V
Sbjct: 541 DYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENV 599
Query: 603 PTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRP 662
PT DIHMKQ+ L VW F+R+++ P+ + F GY+ + A ++FVV+Y PD Q SLRP
Sbjct: 600 PTDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRSLRP 658
Query: 663 HHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
HHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V
Sbjct: 659 HHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKN 718
Query: 723 GTRYIMISFVDP 734
GTRYI +SF+DP
Sbjct: 719 GTRYIAVSFIDP 730
>gi|341900474|gb|EGT56409.1| CBN-LET-268 protein [Caenorhabditis brenneri]
Length = 746
Score = 621 bits (1601), Expect = e-175, Method: Compositional matrix adjust.
Identities = 313/756 (41%), Positives = 473/756 (62%), Gaps = 38/756 (5%)
Query: 7 LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGL 66
+ L L +VF + + ++ E +V+TVA+ TDG KR ++SA+ + V+ L L
Sbjct: 1 MRVLPLFPLVFIPVILATTITDLPE--LVVVTVATENTDGLKRLLESAKAFGINVEVLAL 58
Query: 67 HQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF-- 123
+ W GGD GGG K+ +L +++ D +I+ D+YDVI + IL +F
Sbjct: 59 GERWNGGDTRIEQGGGQKIRILSEWIEKYKDASDTMIMFVDAYDVIFNADSTTILRKFFE 118
Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
+ D ++FGAE CWPD +L YP V G R+LNSG F+GY ++ +++ + +++++
Sbjct: 119 HYSDKRLLFGAEPFCWPDQTLAPDYPIVEFGKRFLNSGLFMGYGPEVYKVLKLKPVEDKD 178
Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
DDQLYY ++LD LR + K+ LD+++ +FQNL G +ED++L F D N YNT
Sbjct: 179 DDQLYYTRVYLDNKLRKELKMDLDSMSKIFQNLNGVIEDVELQFKEDGTPEAYNAAYNTK 238
Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKP 302
P+I+HGNG SK LN GNYL W + GC C + DS D+ P + +++FI KP
Sbjct: 239 PLIVHGNGPSKSHLNYLGNYLGNRWNSQLGCRTCGQ-EMKDS---DELPLIGLNIFIAKP 294
Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
F+EE L K+A +YP KI++++YNNQ + D++ + + I + +
Sbjct: 295 IPFIEEVLQKVAEFDYPKDKIALYIYNNQPFSIKNIQDFLKEHGKSYYTKRVINGVTEIG 354
Query: 363 SKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNES----LIAPLLVRPFKA 418
+EARN A+E + V+F F +D D++ P V+K LV+ +++ +IAP++ + K
Sbjct: 355 EREARNEAIEWDKQRNVEFAFLMDGDAYFTEPKVIKDLVHYSKTYDVGIIAPMVGQIGKL 414
Query: 419 WSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYT 478
++NFWGA+ A+G+YARS DYM I+ G++ G WNVP+IT+ L+ + A +K Y+
Sbjct: 415 FTNFWGAVAANGYYARSEDYMAIVKGNR--IGYWNVPFITSALLLNKEKLSA--LKDAYS 470
Query: 479 LN-SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKT----NPEVYELIRNPL 533
N ++D DM+ C R+ G + ID+ ++YG+L+ S+ F T +PE++++ N
Sbjct: 471 YNKNLDPDMSMCQFARDNGHFMYIDNEKQYGYLIVSDEFSETVTEGKWHPEMWQIFENRD 530
Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
W+ RY+HP Y K + PD + +Q CPDV+ +P+++E+FC E ++ ME +G+WSDG+NNDK
Sbjct: 531 LWEARYVHPGYHKIMEPDHIIDQACPDVYDYPLMSERFCEELIEEMEGFGRWSDGSNNDK 590
Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHH-------------- 639
RL GYE VPTRDIHM QVG W FL YV P+QE+ FIGY+H
Sbjct: 591 RLAGGYENVPTRDIHMNQVGFERQWLYFLDTYVRPVQEKTFIGYYHQVEPISYFFIPTII 650
Query: 640 -EPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTA 698
+PV + M FVVRY+P+EQ SLRPHHD+ST++I++ALN+ G DYEGGG R++RYNC V A
Sbjct: 651 FQPVESNMMFVVRYKPEEQASLRPHHDASTFSIDVALNKKGRDYEGGGVRYVRYNCTVEA 710
Query: 699 TRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+G+ +M PGRLTH HEGL T+GTRYIM+SF++P
Sbjct: 711 DEVGYAMMFPGRLTHLHEGLATTKGTRYIMVSFINP 746
>gi|326925903|ref|XP_003209146.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like
[Meleagris gallopavo]
Length = 774
Score = 620 bits (1599), Expect = e-175, Method: Compositional matrix adjust.
Identities = 310/735 (42%), Positives = 462/735 (62%), Gaps = 38/735 (5%)
Query: 32 DKFLVITVASNETDGYKRFIQSAE-------VNKLQVKTLGLHQPWLGGDMS-SLGGGYK 83
D LV TVA+ ETDG+ RF+Q+A+ V + V + G + W GG+++ S+GGG K
Sbjct: 46 DNLLVFTVATKETDGFHRFMQTAKHFNYTVKVPYVLVPSTGKGEEWKGGELANSIGGGQK 105
Query: 84 VNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
V LLK + +D+I++ + YDVI GG ++L++F + +VF A+ L WPD
Sbjct: 106 VRLLKEGIQGYADQEDLIVMFVECYDVIFAGGPEELLKKFQETNHKVVFAADGLIWPDKR 165
Query: 144 LYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHK 203
L DKYPAV SG R+LNSGGFIGYA I ++ +++ +DDQL+Y +++D R
Sbjct: 166 LADKYPAVRSGKRFLNSGGFIGYAPYINRIVQQWDLQDNDDDQLFYTKIYVDPLARESLN 225
Query: 204 IVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNY 263
I LD +FQ L G+++++ LNF+ + V N+ Y T P+ + GNG +KI LN GNY
Sbjct: 226 ITLDHKCAIFQTLNGAVDEVHLNFEEGK-VRARNSAYETLPITVLGNGPTKIYLNYLGNY 284
Query: 264 LAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
+ +W + +GC+ C+L LD ++PSV I VFI++PT FL +FL+++ L+YP +
Sbjct: 285 IPNAWTRETGCSICDL-DMLDLSTVTEYPSVKIGVFIEQPTPFLPKFLDRLLTLDYPKEA 343
Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDF 381
+S+F++NN+ YH + K + +N+K + ++ EARN+ ++ + ++
Sbjct: 344 LSVFIHNNEVYHEKHIKKFWEKAKNIIRNIKIVGPEENLSQAEARNMGMDLCRQDEACEY 403
Query: 382 YFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
YF +D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I
Sbjct: 404 YFSIDADVVLTNPKTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYIDI 463
Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------ 494
+ G++ G+WN+PY+ N YL+K +++ K + + +D DMA C N R
Sbjct: 464 VQGNR--VGVWNIPYMANIYLIKGQTLRSEMKEKNYFMRDKLDPDMALCRNAREMTLQRE 521
Query: 495 ---------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRY 539
KG+ + I + E+G L+ + N++ N +++++ NP+DW Y
Sbjct: 522 KDSPSSETFHMLRPPKGVFMYITNRHEFGRLISTANYNTSHYNNDLWQIFENPVDWKETY 581
Query: 540 IHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGY 599
I+P Y K + D + QPCPDVFWFPI ++ C E V+ ME +GQWS G + D R+ GY
Sbjct: 582 INPNYSK-IFTDNIVEQPCPDVFWFPIFSDTACDELVEEMEHFGQWSGGKHQDSRISGGY 640
Query: 600 EAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPS 659
E VPT DIHMKQ+GL W F+R+++ P+ + F GY+ + A ++FVV+Y PD Q S
Sbjct: 641 ENVPTDDIHMKQIGLDNEWLHFIREFIAPVTLKVFAGYYTKGY-ALLNFVVKYSPDRQRS 699
Query: 660 LRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQ 719
LRPHHDSST+TINIALN+VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL
Sbjct: 700 LRPHHDSSTFTINIALNKVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLP 759
Query: 720 VTQGTRYIMISFVDP 734
+ GTRYI +SF+DP
Sbjct: 760 ILNGTRYIAVSFIDP 774
>gi|344288964|ref|XP_003416216.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 2 [Loxodonta africana]
Length = 758
Score = 620 bits (1598), Expect = e-174, Method: Compositional matrix adjust.
Identities = 316/735 (42%), Positives = 462/735 (62%), Gaps = 31/735 (4%)
Query: 25 KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYK 83
K +I DK LVITVA+ E+DG+ RF++SAE VK LG + W GGD ++S+GGG K
Sbjct: 30 KPSSIPTDKLLVITVATKESDGFHRFMKSAEYFNYTVKVLGQGEEWRGGDGINSIGGGQK 89
Query: 84 VNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
V L+K ++ +D+++L T+ +DVI GG ++L++F + +VF A+ + WPD
Sbjct: 90 VRLMKEVMEHYANQEDLVVLFTECFDVIFAGGPEEVLKKFLKTNHKVVFAADGILWPDKR 149
Query: 144 LYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHK 203
L DKYP V G RYLNSGGFIGYA I ++ +++ +DDQL+Y +++D R
Sbjct: 150 LADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWDLQDNDDDQLFYTKIYIDPLKREALN 209
Query: 204 IVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNY 263
I LD +FQ L G+++++ L F+ + N Y T PV I+GNG +KI LN FGNY
Sbjct: 210 ITLDHKCKIFQALNGAVDEVVLKFENGK-ARAKNVFYETLPVAINGNGPTKIVLNYFGNY 268
Query: 264 LAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
+ SW + SGCT C+L +D + D +P+V I +FI++PT FL FL+ + L+YP
Sbjct: 269 VPNSWTQDSGCTLCDL-NVIDLSQVDVYPNVTIGIFIEQPTPFLPRFLDTLLTLDYPKDA 327
Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDF 381
+ +FV+N + YH + K ++K + ++ EARN+ ++ + ++
Sbjct: 328 LKLFVHNREVYHEKDIKAFFDKAKHEISSIKIVGPEEDLSQAEARNMGMDLCRQDEKCNY 387
Query: 382 YFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I
Sbjct: 388 YFSVDADVVLTNPRTLKLLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDI 447
Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------ 494
+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 448 VQGNR--VGVWNVPYMANVYLIKGDTLRSEMNERNYFVRDKLDPDMALCRNAREMTLQRE 505
Query: 495 ---------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRY 539
KG+ + I + E+G L+ + N++ N +++++ NP+DW +Y
Sbjct: 506 KDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKY 565
Query: 540 IHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGY 599
I+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+ GY
Sbjct: 566 INRDYSK-IFTENLVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGY 624
Query: 600 EAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPS 659
E VPT DIHMKQ+ L VW F+R+++ P+ + F GY+ + A ++FVV+Y PD Q S
Sbjct: 625 ENVPTDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRS 683
Query: 660 LRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQ 719
LRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL
Sbjct: 684 LRPHHDASTFTINIALNNVGQDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLP 743
Query: 720 VTQGTRYIMISFVDP 734
V GTRYI +SF+DP
Sbjct: 744 VKNGTRYIAVSFIDP 758
>gi|218931161|ref|NP_787065.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 1
precursor [Rattus norvegicus]
gi|149018891|gb|EDL77532.1| rCG25923, isoform CRA_a [Rattus norvegicus]
Length = 758
Score = 619 bits (1597), Expect = e-174, Method: Compositional matrix adjust.
Identities = 315/731 (43%), Positives = 458/731 (62%), Gaps = 31/731 (4%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
I DK LVITVA+ E DG+ RF+ SA+ VK LG Q W GGD M+S+GGG KV L+
Sbjct: 34 IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLM 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K ++ DD++IL T+ +DVI GG ++L++F + IVF A+ L WPD L DK
Sbjct: 94 KEAMEHYAGQDDLVILFTECFDVIFAGGPEELLKKFQKTNHKIVFAADALLWPDKRLADK 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP V G RYLNSGGFIGYA I L+ +++ +DDQL+Y +++D R I LD
Sbjct: 154 YPGVHIGKRYLNSGGFIGYAPYISRLVQQWDLQDNDDDQLFYTKVYIDPLKREALNITLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+FQ L G+ +++ L F+ + + NT Y T PV I+GNG +KI LN FGNY+ S
Sbjct: 214 HRCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272
Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
W + +GC C+ +D D +P V + VFI++PT FL FL+ + L+YP + + +F
Sbjct: 273 WTQENGCALCDF-DTIDLSTVDVYPKVTLGVFIEQPTPFLPRFLDLLLTLDYPKEALRLF 331
Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
V+N + YH ++ K ++K + ++ EARN+ ++ + D+YF V
Sbjct: 332 VHNKEVYHEKDIKAFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------- 494
+ GIWNVPY+ N YL++ +++ + + + + +D DM+ C N R+
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMSERNYFVRDKLDPDMSLCRNARDMTLQREKDSP 509
Query: 495 -----------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
KG+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +
Sbjct: 510 TPETFQMLSPPKGVFMYISNRHEFGRLISTANYNTSHLNNDLWQIFENPVDWKEKYINRD 569
Query: 544 YQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVP 603
Y K + + + QPCPDVFWFPI +E+ C E V+ ME YG+WS G ++D R+ GYE VP
Sbjct: 570 YSK-IFTENIVEQPCPDVFWFPIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVP 628
Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
T DIHMKQ+ L VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPH
Sbjct: 629 TDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPH 687
Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
HD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V G
Sbjct: 688 HDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNG 747
Query: 724 TRYIMISFVDP 734
TRYI +SF+DP
Sbjct: 748 TRYIAVSFIDP 758
>gi|6755106|ref|NP_035252.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Mus
musculus]
gi|25008938|sp|Q9R0E2.1|PLOD1_MOUSE RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1;
AltName: Full=Lysyl hydroxylase 1; Short=LH1; Flags:
Precursor
gi|5880315|gb|AAD54617.1|AF046782_1 lysyl hydroxylase 1 [Mus musculus]
gi|13879264|gb|AAH06599.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 1 [Mus musculus]
gi|148682841|gb|EDL14788.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 1 [Mus musculus]
Length = 728
Score = 618 bits (1594), Expect = e-174, Method: Compositional matrix adjust.
Identities = 307/710 (43%), Positives = 465/710 (65%), Gaps = 12/710 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW-LGGDMSSLGGGYKVNLLKN 89
ED LV+TVA+ ET+G++RF +SA+ ++++LGL + W + G ++ GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWSVDGGPAAAGGGQKVRLLKK 84
Query: 90 ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
L++ +D++IL DSYDV+ G ++L++F + +VF AE +PD L KYP
Sbjct: 85 ALEKHADKEDLVILFVDSYDVVFASGPRELLKKFQQAKSQVVFSAEEHIYPDRRLEAKYP 144
Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V G R+L SGGFIGYA + +L++ ++ + DQL+Y +FL+ R + I LD
Sbjct: 145 TVPDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLFYTKIFLNPEKREQINISLDHR 204
Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
+FQNL G+L+++ L F++ V N Y+T PV++HGNG +K++LN GNY+ + W
Sbjct: 205 CRIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVVHGNGPTKLQLNYLGNYIPRFWT 263
Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
+GCT C+ ++ L + + P+VL+ VFI++PT FL F ++ L YP K++ +F+
Sbjct: 264 FETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFLRLLRLRYPQKQMRLFI 323
Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
+N + +H + ++ + +++VK + + + +ARN+ + + +YF VD
Sbjct: 324 HNQERHHKLQVEQFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFSVD 383
Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
+D L P+ L+ L+ +N+++IAPL+ R + WSNFWG L+ADG+YARS DY++I+ G +
Sbjct: 384 ADVALTEPNSLRLLIEQNKNVIAPLMTRHGRLWSNFWGGLSADGYYARSEDYVDIVQGRR 443
Query: 447 GGKGIWNVPYITNCYLMKTSVIKA--TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
G+WNVPYI+N YL+K S ++A N+ ++ + +D DM+FC N+R + + + + +
Sbjct: 444 --VGVWNVPYISNIYLIKGSALRAELQNVD-LFHYSKLDSDMSFCANVRQQEVFMFLTNR 500
Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
+GHL+ +N+ + +++E+ NP DW +YIH Y K+L V PCPDV+WF
Sbjct: 501 HTFGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWF 559
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
PI TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +
Sbjct: 560 PIFTEAACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREWHKFLVE 619
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
Y+ P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VG DYEG
Sbjct: 620 YIAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRVGEDYEG 678
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGCRF+RYNC+V A R GW L+HPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 679 GGCRFLRYNCSVRAPRKGWALLHPGRLTHYHEGLPTTKGTRYIAVSFVDP 728
>gi|74207958|dbj|BAE29100.1| unnamed protein product [Mus musculus]
Length = 728
Score = 617 bits (1592), Expect = e-174, Method: Compositional matrix adjust.
Identities = 307/710 (43%), Positives = 464/710 (65%), Gaps = 12/710 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW-LGGDMSSLGGGYKVNLLKN 89
ED LV+TVA+ ET+G++RF +SA+ ++++LGL + W + G ++ GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWSVDGGPAAAGGGQKVRLLKK 84
Query: 90 ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
L++ +D++IL DSYDV+ G ++L++F + +VF AE +PD L KYP
Sbjct: 85 ALEKHADKEDLVILFVDSYDVVFASGPRELLKKFQQAKSQVVFSAEEHIYPDRRLESKYP 144
Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V G R+L SGGFIGYA + +L++ ++ + DQL+Y +FL+ R + I LD
Sbjct: 145 TVPDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLFYTKIFLNPEKREQINISLDHR 204
Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
+FQNL G+L+++ L F++ V N Y+T PV++HGNG +K++LN GNY+ + W
Sbjct: 205 CRIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVVHGNGPTKLQLNYLGNYIPRFWT 263
Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
+GCT C+ ++ L + + P+VL+ VFI++PT FL F ++ L YP K++ +F+
Sbjct: 264 FETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFLRLLRLRYPQKQMRLFI 323
Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
+N + +H + ++ + +++VK + + + +ARN+ + + +YF VD
Sbjct: 324 HNQERHHKLQVEQFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFSVD 383
Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
+D L P+ L+ L+ +N+++IAPL+ R + WSNFWG L+ADG+YARS DY++I+ G +
Sbjct: 384 ADVALTEPNSLRLLIEQNKNVIAPLMTRHGRLWSNFWGGLSADGYYARSEDYVDIVQGRR 443
Query: 447 GGKGIWNVPYITNCYLMKTSVIKAT--NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
G+WNVPYI+N YL+K S ++A N+ ++ + +D DM+FC N+R + + + + +
Sbjct: 444 --VGVWNVPYISNIYLIKGSALRAVLQNVD-LFHYSKLDSDMSFCANVRQQEVFMFLTNR 500
Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
+GHL+ +N+ + +++E+ NP DW +YIH Y K+L V PCPDV+WF
Sbjct: 501 HTFGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWF 559
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
PI TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +
Sbjct: 560 PIFTEAACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREWHKFLVE 619
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
Y+ P+ E+ + GY+ + ++FVVRY PDEQPSL PHHD+ST+T+NIALN+VG DYEG
Sbjct: 620 YIAPMTEKLYPGYYTR-AQFDLAFVVRYNPDEQPSLMPHHDASTFTVNIALNRVGEDYEG 678
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGCRF+RYNC+V A R GW L+HPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 679 GGCRFLRYNCSVRAPRKGWALLHPGRLTHYHEGLPTTKGTRYIAVSFVDP 728
>gi|297286704|ref|XP_002808385.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
5-dioxygenase 2-like [Macaca mulatta]
Length = 946
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 315/751 (41%), Positives = 456/751 (60%), Gaps = 49/751 (6%)
Query: 10 LILSCVVF-----FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTL 64
L+L +VF ++ K +I DK LVITVA+ E+DG+ RF+QSA+ VK L
Sbjct: 219 LLLLALVFHPWNPYLGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVL 278
Query: 65 GLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
G + W GGD ++S+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F
Sbjct: 279 GQGEEWRGGDGINSIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKF 338
Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
+ +VF A+ + WPD L DKYP V G RYLNSGGFIGYA I ++ ++++ +
Sbjct: 339 QKANHKVVFAADGILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDND 398
Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
DDQL+Y +++D R I LD +FQ L G+++++ L F+ + NT Y T
Sbjct: 399 DDQLFYTKIYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETL 457
Query: 244 PVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKP 302
PV I+GNG +KI LN FGNY+ SW + +GCT C +D D P+V I VFI++P
Sbjct: 458 PVAINGNGPTKILLNYFGNYVPNSWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQP 516
Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
T FL FL+ + L+YP + + +F++N + YH + K K +K + ++
Sbjct: 517 TPFLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLS 576
Query: 363 SKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
EARN+ ++ + D+YF VD+D L NP LK L+ +N +IAPL+ R K WSN
Sbjct: 577 QAEARNMGMDFCRQDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSN 636
Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLN 480
FWGAL+ DG+YARS DY++I+ G++ G+WNVPY+ N YL+K ++ N + + +
Sbjct: 637 FWGALSPDGYYARSEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRD 694
Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNP-------- 532
+D DMA C N R + + DS PE ++++ P
Sbjct: 695 KLDPDMALCRNAREMTLQREKDSP-----------------TPETFQMLSPPKVLFLLIL 737
Query: 533 ---------LDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYG 583
+DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG
Sbjct: 738 FIFVYLIFDIDWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYG 796
Query: 584 QWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVR 643
+WS G ++D R+ GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ +
Sbjct: 797 KWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF- 855
Query: 644 APMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW
Sbjct: 856 ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGW 915
Query: 704 MLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 916 SFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 946
>gi|402594065|gb|EJW87992.1| procollagen-lysine [Wuchereria bancrofti]
Length = 733
Score = 610 bits (1572), Expect = e-171, Method: Compositional matrix adjust.
Identities = 316/746 (42%), Positives = 475/746 (63%), Gaps = 26/746 (3%)
Query: 2 LSNLHLNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQV 61
++ + L L LS V+ + +V K + E LV+TVA+ ETDG +R ++A++N +++
Sbjct: 1 MTGMILWVLTLSTVLMYGTVTMEKTSGMPE--LLVVTVATEETDGLRRLKRTADINDVRL 58
Query: 62 KTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDIL 120
+ G+ + W GGD+ GGG K+ +L+ L++ +D+IIL D+YDVI G IL
Sbjct: 59 EVFGMGEQWRGGDIRVDEGGGQKIRILRKSLEKYKDRNDLIILFVDAYDVIFLGNEEQIL 118
Query: 121 ERFNTF--DANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRS 178
+F TF +VF +E CWP+ +L KYP V GYRYLNSG F+G+A +I +LIS +
Sbjct: 119 RKFFTFFDGFRLVFSSEPFCWPNRNLAPKYPLVNFGYRYLNSGIFMGFAPEIWKLISYKD 178
Query: 179 IKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDE-----FV 233
+++ +DDQLYY L+LDE +R K+ LD+++ LFQNL G+ D+KL DE F+
Sbjct: 179 VEDNDDDQLYYTHLYLDEQIRISLKMTLDSMSILFQNLNGASNDVKLEMS-DERSGAYFI 237
Query: 234 HLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSV 293
+ N YNT P++IHGNG SK+ LN FGNY+ T+ T+ + +L+ + P +
Sbjct: 238 Y--NFIYNTYPLVIHGNGPSKLHLNYFGNYVDPLRITTAKTQHTTM----NLEKIELPRL 291
Query: 294 LISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVK 353
+SV I KP F+ EF I +L Y +KI ++VY NQ + ++ + K ++++
Sbjct: 292 FLSVVISKPIPFIREFFENIKSLAYADEKIDLYVYCNQNFLEKETSGFVEDVKGRYQSLL 351
Query: 354 YIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVN----RNESLIA 409
Y + + +EAR +++ SL G D+ +D D HL+N + L +++ +N ++A
Sbjct: 352 YDDSTTELGEREARAFSLKQSLALGDDYLIMIDGDVHLNNSEALLLMIHTVKEKNSEILA 411
Query: 410 PLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK 469
PL+ +P K ++NFWGA++++G+YARS +Y++II D GIWNVP+I + ++ K
Sbjct: 412 PLVGQPHKLFTNFWGAISSNGYYARSENYLDII--DHKEVGIWNVPFINSILIIAKE--K 467
Query: 470 ATNIKTIYTLN-SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYEL 528
++ Y N +D DM+FC+ R+KG L +D++ YG LV SE+ + K +P++YE+
Sbjct: 468 LASLSNAYYYNDKLDPDMSFCSFARDKGHFLYLDNSYHYGFLVVSEDVESSKVHPDMYEI 527
Query: 529 IRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDG 588
N W+ RYIHP Y +L + C DV+ FP+++E+FC E ++ E YG+WSDG
Sbjct: 528 FNNKELWEKRYIHPNYFAALNGSVQILEICQDVYDFPLMSERFCAELIEECEYYGKWSDG 587
Query: 589 TNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSF 648
+ D+RL GYE VPTRDIHM Q+G W L +YV P+QE+ FIGY+ +PV + M F
Sbjct: 588 KHKDERLVGGYENVPTRDIHMNQIGFERHWLYMLDEYVRPIQEKLFIGYYKQPVESVMMF 647
Query: 649 VVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHP 708
VVRY+P+EQ SLRPHHD+STY+I+IALN+ GVDYEGGG RF+RYNC A +G ++ P
Sbjct: 648 VVRYKPEEQASLRPHHDASTYSIDIALNKRGVDYEGGGVRFLRYNCTFDADTVGHSMIFP 707
Query: 709 GRLTHYHEGLQVTQGTRYIMISFVDP 734
GRLTH HEGL+ TQGTRYI +SF++P
Sbjct: 708 GRLTHLHEGLETTQGTRYIAVSFINP 733
>gi|291399931|ref|XP_002716643.1| PREDICTED: procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2
[Oryctolagus cuniculus]
Length = 877
Score = 607 bits (1566), Expect = e-171, Method: Compositional matrix adjust.
Identities = 314/739 (42%), Positives = 458/739 (61%), Gaps = 31/739 (4%)
Query: 21 VHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW-LGGDMSSLG 79
V N + +K LVITVA+ E DG+ RF+QSA+ VK LG + W GG ++S+G
Sbjct: 145 VDINYIVQFRHNKLLVITVATKENDGFHRFMQSAKYFNYTVKVLGQGEEWRGGGGINSIG 204
Query: 80 GGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCW 139
GG KV L+K ++ DD++IL T+ + VI GG ++L++F + +VF A+ L W
Sbjct: 205 GGQKVRLMKEVMEHYGNQDDLVILFTECFHVIFAGGPEEVLKKFQKTNHKVVFAADGLLW 264
Query: 140 PDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLR 199
PD L +KYP V SG YLNSGGFIGYA I ++ ++++ +DDQL+Y +++D R
Sbjct: 265 PDKRLAEKYPIVHSGKPYLNSGGFIGYAPYINRIVQQWTLQDNDDDQLFYTKIYIDPLKR 324
Query: 200 TKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNS 259
I LD +FQ L G+++++ L F+ ++ + NT Y T PV+I+GNG +KI LN
Sbjct: 325 EAFNITLDHKCKIFQTLNGAVDEVVLKFENNK-TRVKNTFYETLPVVINGNGPTKIVLNY 383
Query: 260 FGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNY 318
FGNY+ W + +GC C +D D P+V I VFI++PT FL FL+ + L+Y
Sbjct: 384 FGNYVPNLWTQNNGCLLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTLDY 442
Query: 319 PAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-K 377
P + + +F++N + YH + K +K + ++ +ARN+ ++ +
Sbjct: 443 PKEALKLFIHNKEVYHEKDLKVFFDKAKHEISTIKIVGPEENLSQAKARNMGMDFCRQDE 502
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
D+YF +D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS D
Sbjct: 503 KCDYYFSLDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSED 562
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN-- 494
Y++I+ G + GIWNVPYI N YL+K +++ N + + + +D DMA C N R
Sbjct: 563 YVDIVQGKR--VGIWNVPYIANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMT 620
Query: 495 -------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDW 535
KG+ + I + E+G ++ + N++ N +++++ NP+DW
Sbjct: 621 LQREKDSPTPETFQMLSPPKGMFMYISNRHEFGRILSTANYNISHYNNDLWQIFENPVDW 680
Query: 536 DLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL 595
+YI+ +Y K + D + QPCPDVFWFPI +EK C E VQ ME YGQWS G ++D R+
Sbjct: 681 KEKYINRDYSK-IFTDNIVEQPCPDVFWFPIFSEKACDELVQEMEHYGQWSGGKHHDSRI 739
Query: 596 ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPD 655
GYE VPT DIHMKQ+GL VW F+R+++ P+ + F GY+ + A ++FVV+Y P+
Sbjct: 740 SGGYENVPTDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPE 798
Query: 656 EQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYH 715
Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH H
Sbjct: 799 RQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLH 858
Query: 716 EGLQVTQGTRYIMISFVDP 734
EGL V GTRYI +SF+DP
Sbjct: 859 EGLPVKNGTRYIAVSFIDP 877
>gi|317419977|emb|CBN82013.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Dicentrarchus
labrax]
Length = 682
Score = 605 bits (1559), Expect = e-170, Method: Compositional matrix adjust.
Identities = 294/689 (42%), Positives = 432/689 (62%), Gaps = 12/689 (1%)
Query: 51 IQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
+QSA VK LG+ + W GGD+ S+GGG KV LLK ++ + +D+++L DSYD
Sbjct: 1 MQSARYFNYTVKVLGMGEAWKGGDVGRSIGGGQKVRLLKEAMEALADQEDLVVLSVDSYD 60
Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
+I GG +IL +F + ++F AE L WPD L DKYP++ SG RYLNSGG IGYA
Sbjct: 61 LIFAGGPEEILRKFQQANHKVLFAAEGLIWPDKRLADKYPSIRSGKRYLNSGGIIGYAPY 120
Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
I ++S ++ + +DDQL+Y ++LD R + LD +FQNL G+++++ L F
Sbjct: 121 INRVVSQWNLHDNDDDQLFYTKIYLDPLRRETLNMTLDHKCQIFQNLNGAVDEVLLKFGT 180
Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKP 287
D V + NT Y++ PV++HGNG +K+ LN NY+ +W GC+ C + + L LK
Sbjct: 181 DR-VRVRNTVYDSLPVVVHGNGNTKMYLNYMANYVPNTWNYEHGCSHCDDDVVDLSQLK- 238
Query: 288 DQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKT 347
++P+VL+ VFI++PT FL EF ++ L+YP K+ +FV+NN+ YH + +
Sbjct: 239 -EYPNVLVGVFIEQPTPFLPEFFQRLLTLDYPKDKLKVFVHNNEVYHEKHIQKFWEENRN 297
Query: 348 MFKNVKYIAHNSTVNSKEARNLAVENSLHKGV-DFYFYVDSDSHLDNPDVLKYLVNRNES 406
+F + K + ++ EARN+ ++ +YF +DSD L N LK L+ +N
Sbjct: 298 VFNSFKVVGPEENLSQGEARNMGMDLCRKDTTCAYYFSIDSDVMLTNRQTLKLLIEQNRK 357
Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS 466
+I PL+ R K WSNFWGAL+ DG+YARS DY++I+ + G+WN+PY+ + Y++K S
Sbjct: 358 IIGPLVTRHSKLWSNFWGALSLDGYYARSEDYVDIVQKKR--VGVWNIPYMAHVYMVKGS 415
Query: 467 VIK-ATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEV 525
++ + + L +D DM+ C N R G+ + I + ++G L+ + N++ N ++
Sbjct: 416 TLRNELKERNYFVLEKLDPDMSLCRNAREMGVFMYITNRHDFGRLISTANYNISHYNNDL 475
Query: 526 YELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQW 585
+++ NP+DW +YIH Y K + + +PCPDVFWFP+ +EK C E V ME YG W
Sbjct: 476 WQIYENPVDWKEKYIHKNYSK-IFTENYMEEPCPDVFWFPVFSEKACDEIVGEMEHYGTW 534
Query: 586 SDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAP 645
S G + DKR+ GYE VPT DIHMKQ+G W F+R+++ P+ + F GY+ + A
Sbjct: 535 SGGRHMDKRIAGGYETVPTDDIHMKQIGFDKEWLHFIREFISPVTLKVFSGYYTKGY-AV 593
Query: 646 MSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWML 705
M+FVV+Y P+ Q LRPHHDSST+TINIALN D++GGGCRF RYNC++ + R GW
Sbjct: 594 MNFVVKYTPERQAYLRPHHDSSTFTINIALNNKDTDFQGGGCRFHRYNCSINSPRKGWSF 653
Query: 706 MHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
MHPGRLTH HEGL T GTRYI +SF+DP
Sbjct: 654 MHPGRLTHLHEGLPTTNGTRYIAVSFIDP 682
>gi|390358384|ref|XP_781719.3| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
[Strongylocentrotus purpuratus]
Length = 715
Score = 605 bits (1559), Expect = e-170, Method: Compositional matrix adjust.
Identities = 306/707 (43%), Positives = 444/707 (62%), Gaps = 33/707 (4%)
Query: 33 KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNEL 91
K LV+TVA+ ETD ++R++ SAE + VK +G+ Q W GGD+ GGG+K+NLL+ L
Sbjct: 37 KLLVVTVATKETDAFRRYMDSAEAFGINVKVVGMDQEWKGGDIERGPGGGFKINLLREAL 96
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+ +D++I+ TDSYDV+ +++L +F + N++F AE WP+ SL +KYP V
Sbjct: 97 TQYKDDEDLVIMFTDSYDVLFLADADEMLRKFKAYQINLLFSAETYIWPEKSLANKYPKV 156
Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
+GY YL SG ++GYA I + +S + I++ DDQL++ L+L E
Sbjct: 157 ENGYPYLCSGLYMGYAPYIYKALSYKPIEDIADDQLFFTELYLAE--------------- 201
Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
+ DI L F+ + L NTKYNT P ++HGNG +K+ LN GNYL W
Sbjct: 202 -------RVTDITLRFEGGNNL-LHNTKYNTVPCVLHGNGPTKVYLNHLGNYLPNKWTFD 253
Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
GC C+L L L + +PSV+I++F+ PT F EFL+ + LNYP KI +F++N
Sbjct: 254 GGCQNCDLDTFDLQGLPVEDYPSVVIAIFVGVPTPFFAEFLDLLTKLNYPKNKIDIFIHN 313
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
+H + + + ++ ++K I + + RN V++ + D+YF VDSD
Sbjct: 314 RAMFHYHMLEKFREEKGPLYNSIKIILPAEMLGDAKGRNRGVDHCMSMECDYYFSVDSDV 373
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NPDVL+ L+ N+ ++AP++ + K WSNFWG LN+ G+YARS DY++++ ++ +
Sbjct: 374 QLTNPDVLRLLMETNKQIVAPVVSKQGKLWSNFWGDLNSQGYYARSEDYVDLVRRNR--R 431
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE-YG 508
G+WNVPYI N YL K ++K K + + +D DMA C +LR+KGI L + + ++ YG
Sbjct: 432 GVWNVPYINNVYLAKGEMVKT--YKPNFEIEDLDTDMAICMDLRSKGIFLYVVNMEDSYG 489
Query: 509 HLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN-NQPCPDVFWFPIV 567
H+V +N++ + +++EL N DW+ +Y+ P+Y D N PC DV+ FP++
Sbjct: 490 HIVTLDNYETTHLHNDMWELWNNKEDWEAKYLSPDYFVVKEMDRNNITMPCTDVYTFPLM 549
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
+ + E ++ ME +G+WS G N DKRL GYE VPTRDIHM Q+G W FLR+YVV
Sbjct: 550 SRTWAKELIEEMEHFGEWSGGGNQDKRLNGGYENVPTRDIHMNQIGFEQHWLYFLREYVV 609
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
P+ E + GY+ + A M+FVVRY+PDEQ SLRPHHDSSTYTIN+ALN+ DYEGGG
Sbjct: 610 PICENVYPGYYSK-AYAIMNFVVRYKPDEQASLRPHHDSSTYTINVALNERETDYEGGGA 668
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RFIRYNC+V +G+ +MHPGRLTHYHEGL T GTRYIM+SF+DP
Sbjct: 669 RFIRYNCSVVGLPVGYSIMHPGRLTHYHEGLPTTNGTRYIMVSFIDP 715
>gi|397512444|ref|XP_003826555.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 3 [Pan paniscus]
Length = 703
Score = 604 bits (1557), Expect = e-170, Method: Compositional matrix adjust.
Identities = 300/709 (42%), Positives = 442/709 (62%), Gaps = 31/709 (4%)
Query: 51 IQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
+QSA+ VK LG + W GGD ++S+GGG KV L+K ++ DD++++ T+ +D
Sbjct: 1 MQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEVMERYADQDDLVVMFTECFD 60
Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
VI GG ++L++F + +VF A+ + WPD L DKYP V G RYLNSGGFIGYA
Sbjct: 61 VIFAGGPEEVLKKFQKANHKVVFAADGILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPY 120
Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
+ ++ ++++ +DDQL+Y +++D R I LD +FQ L G+++++ L F+
Sbjct: 121 VNRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFEN 180
Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
+ NT Y T PV I+GNG +KI LN FGNY+ SW + +GCT C +D D
Sbjct: 181 GK-ARAKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVD 238
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
P+V I VFI++PT FL FL+ + L+YP + + +F++N + YH + K
Sbjct: 239 VHPNVSIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIKIFFDKAKHE 298
Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESL 407
K +K + ++ EARN+ ++ + D+YF VD+D L NP LK L+ +N +
Sbjct: 299 IKTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSVDADVVLTNPRTLKILIEQNRKI 358
Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++ G+WNVPY+ N YL+K
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGVWNVPYMANVYLIKGKT 416
Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQ 505
+++ N + + + +D DMA C N R KG+ + I +
Sbjct: 417 LRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYISNRH 476
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + + QPCPDVFWFP
Sbjct: 477 EFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFP 535
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV L VW F+R++
Sbjct: 536 IFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREF 595
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GG
Sbjct: 596 IAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGG 654
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 655 GCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 703
>gi|194378198|dbj|BAG57849.1| unnamed protein product [Homo sapiens]
Length = 690
Score = 604 bits (1557), Expect = e-170, Method: Compositional matrix adjust.
Identities = 295/708 (41%), Positives = 445/708 (62%), Gaps = 46/708 (6%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +++I +PD L KYP
Sbjct: 85 LEKHADKEELI-------------------------------------YPDRRLETKYPV 107
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 108 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 167
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 168 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 226
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P+VL+ VFI++PT F+ F ++ L+YP K + +F++
Sbjct: 227 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 286
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE-NSLHKGVDFYFYVDS 387
N++++H ++++ + +++VK + + + +ARN+ + ++ +YF VD+
Sbjct: 287 NHEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQYRSCTYYFSVDA 346
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 347 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 405
Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + +
Sbjct: 406 -VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHT 464
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 465 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 523
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+
Sbjct: 524 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 583
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 584 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 642
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 643 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 690
>gi|296227894|ref|XP_002759563.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
isoform 2 [Callithrix jacchus]
Length = 703
Score = 602 bits (1553), Expect = e-169, Method: Compositional matrix adjust.
Identities = 301/709 (42%), Positives = 441/709 (62%), Gaps = 31/709 (4%)
Query: 51 IQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
+QSA+ VK LG + W GGD ++S+GGG KV L+K + DD++++ T+ +D
Sbjct: 1 MQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLVKEVMGHYADQDDLVVMFTECFD 60
Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
VI GG ++L++F + +VF A+ + WPD L DKYP V G RYLNSGGFIGYA
Sbjct: 61 VIFAGGPEELLKKFQKANHKVVFAADGILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPY 120
Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
I ++ ++++ +DDQL+Y +++D R I LD +FQ L G+++++ L F+
Sbjct: 121 INRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFEN 180
Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
+ NT Y T PV I+GNG +KI LN FGNY+ SW + +GCT C +D D
Sbjct: 181 GK-ARAKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVD 238
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
P+V I VFI++PT FL FL+ + L+YP + + +F++N + YH + K
Sbjct: 239 VHPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHE 298
Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESL 407
K +K + ++ EARN+ ++ + D+YF VD+D L NP LK L+ +N +
Sbjct: 299 IKTIKIVGPEENLSQAEARNMGMDFCRQDENCDYYFSVDADVVLTNPRTLKILIEQNRKI 358
Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++ G+WNVPY+ N YL+K
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGVWNVPYMANVYLIKGKT 416
Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQ 505
+++ N + + + +D DMA C N R KG+ + I +
Sbjct: 417 LRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYISNRN 476
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + + QPCPDVFWFP
Sbjct: 477 EFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFP 535
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV L VW F+R++
Sbjct: 536 IFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREF 595
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GG
Sbjct: 596 IAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGG 654
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 655 GCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 703
>gi|338715149|ref|XP_001493153.3| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Equus
caballus]
Length = 703
Score = 602 bits (1551), Expect = e-169, Method: Compositional matrix adjust.
Identities = 301/709 (42%), Positives = 440/709 (62%), Gaps = 31/709 (4%)
Query: 51 IQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
+QSA+ VK LG + W GGD ++S+GGG KV L+K ++ +D+++L T+ +D
Sbjct: 1 MQSAKYFNYTVKVLGQGEDWRGGDGINSIGGGQKVRLMKEVMEHYADQEDLVVLFTECFD 60
Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
VI GG ++L++F + +VF A+ + WPD L DKYP V G RYLNSGGFIGYA
Sbjct: 61 VIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKRLADKYPIVHIGKRYLNSGGFIGYAPY 120
Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
I ++ ++++ +DDQL+Y ++++ R I LD +FQ L G+++++ L F+
Sbjct: 121 INRIVQEWNLQDNDDDQLFYTKIYVNPLKREAINITLDHKCKIFQTLNGAVDEVVLKFEN 180
Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
+ N Y T PV I+GNG +KI LN FGNY+ +W + GCT C L +D +
Sbjct: 181 GK-ARAKNVFYETLPVAINGNGPTKILLNYFGNYVPNAWTQDKGCTLCEL-DTIDLSAVN 238
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
P+V I VFI++PT FL FLN + L+YP + + +F++N + YH + K
Sbjct: 239 VHPNVTIGVFIEQPTPFLPRFLNLLLTLDYPKEALKLFIHNKEVYHEKDIKIFFDKAKHE 298
Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESL 407
+K + ++ EARN+ ++ K D+YF VD+D L NP LK L+ +N +
Sbjct: 299 ISTIKIVGPEENLSQAEARNMGMDFCRQDKNCDYYFSVDADVVLTNPRTLKILIEQNRKI 358
Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++ GIWNVPY+ N YL+K
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGIWNVPYMANVYLIKGKT 416
Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQ 505
+++ N + + + +D DMA C N R KG+ + I +
Sbjct: 417 LRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYISNRH 476
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + + QPCPDVFWFP
Sbjct: 477 EFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFP 535
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I +EK C E V+ ME YGQWS G ++D R+ GYE VPT DIHMKQ+ L VW F+R++
Sbjct: 536 IFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIREF 595
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ P+ + F GY+ + A ++FVV+Y PD Q SLRPHHD+ST+TINIALN VG D++GG
Sbjct: 596 IAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRSLRPHHDASTFTINIALNNVGEDFQGG 654
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 655 GCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 703
>gi|393910403|gb|EJD75866.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase [Loa loa]
Length = 729
Score = 602 bits (1551), Expect = e-169, Method: Compositional matrix adjust.
Identities = 306/712 (42%), Positives = 454/712 (63%), Gaps = 19/712 (2%)
Query: 33 KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNEL 91
K LV+TVA+ ETDG +R ++A N +++ G+ + W GG+ GGG K+ +L+ L
Sbjct: 27 KLLVVTVATEETDGLRRLKRTAHTNHFRLEVFGMGEEWRGGNTRVEQGGGQKIRILRKSL 86
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTF--DANIVFGAERLCWPDTSLYDKYP 149
+ DD+IIL D+YDVI+ G IL +F TF +VF +E CWP+ +L KYP
Sbjct: 87 GKYKDRDDLIILFVDAYDVILLGNEEQILRKFFTFFNGFRVVFSSEPFCWPNRNLAPKYP 146
Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V G+RYLNSG F+G+A +I LIS R +++ +DDQLYY L+LD+ +R K+ LD++
Sbjct: 147 LVNFGHRYLNSGVFMGFAPEIWSLISYRDVEDNDDDQLYYTRLYLDKQIRLSLKMTLDSM 206
Query: 210 ANLFQNLYGSLEDIKLNFDLDE--FVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
LFQNL G+ D+KL + + N YNT+P++IHGNG SK+ LN GNY+
Sbjct: 207 TVLFQNLNGASNDVKLEMSGERSGMYFIYNFIYNTHPLVIHGNGPSKLYLNHLGNYIDPL 266
Query: 268 WKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
+ T+ + + + P + +S+ I KP F+ EF I L Y +KI +FV
Sbjct: 267 RIATSKTQSITM----DFEKIELPKLFLSIIISKPIPFIREFFGNIKKLAYTDEKIDLFV 322
Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
Y NQ++ D++ + K ++++ Y ++ + +EAR+ +++ SL G D+ VD
Sbjct: 323 YCNQKFLTKEVSDFVEDVKKRYRSLLY-DDSTEMEEREARSFSLKQSLALGDDYLIMVDG 381
Query: 388 DSHLDNPDVLKYLVN----RNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
D HL+N + L ++V+ + ++APL+ +P K ++NFWGA++++G+YARS +Y++II
Sbjct: 382 DVHLNNSEALLFMVHTMKEKEPEILAPLIRQPHKLFTNFWGAISSNGYYARSENYLDII- 440
Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-MDYDMAFCTNLRNKGIHLKID 502
D GIWNVP+I + ++ K T++ Y + +D DM+FC+ R+KG L +D
Sbjct: 441 -DHKEVGIWNVPFIGSILIIAKE--KLTSLSRAYHYDEKLDPDMSFCSFARDKGHFLYLD 497
Query: 503 STQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
++ YG LV SEN + K +PE+YE+ N W+ RYIHP Y +L T + C DV+
Sbjct: 498 NSHHYGFLVVSENVESSKVHPEMYEIFNNKELWEKRYIHPNYFTALNGSTPIPEICQDVY 557
Query: 563 WFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFL 622
FP+++E+FC E ++ E YG+WSDG + D+RL GYE VPTRDIHMKQ+ W L
Sbjct: 558 DFPLMSERFCAELIEECEYYGKWSDGKHKDERLVGGYENVPTRDIHMKQIDFERHWLYML 617
Query: 623 RKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDY 682
+YV P+QE+ FIGY+ +PV + M FVVRY+P+EQ SLRPHHD+STY+I+IALN+ GVDY
Sbjct: 618 DEYVRPIQEKLFIGYYKQPVESVMMFVVRYKPEEQASLRPHHDASTYSIDIALNKRGVDY 677
Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
EGGG RF+RYNC A +G ++ PGRLTH HEGL+ T+GTRYI +SF++P
Sbjct: 678 EGGGVRFLRYNCTFDADVVGHSMIFPGRLTHLHEGLETTRGTRYIAVSFINP 729
>gi|194379148|dbj|BAG58125.1| unnamed protein product [Homo sapiens]
Length = 703
Score = 601 bits (1550), Expect = e-169, Method: Compositional matrix adjust.
Identities = 299/709 (42%), Positives = 441/709 (62%), Gaps = 31/709 (4%)
Query: 51 IQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
+QSA+ VK LG + W GGD ++S+GGG KV L+K ++ DD++++ T+ +D
Sbjct: 1 MQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEVMEHYADQDDLVVMFTECFD 60
Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
VI GG ++L++F + +VF A+ + WPD L DKYP V G RYLNSGGFIGYA
Sbjct: 61 VIFAGGPEEVLKKFQKANHKVVFAADGILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPY 120
Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
+ ++ ++++ +DDQL+Y +++D R I LD +FQ L G+++++ L F+
Sbjct: 121 VNRIVQQWNLQDNDDDQLFYTKVYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFEN 180
Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
+ NT Y T PV I+GNG +KI LN FGNY+ SW + +GCT C +D D
Sbjct: 181 GK-ARAKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVD 238
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
P+V I VFI++PT FL FL+ + L+YP + + +F++N + YH + K
Sbjct: 239 VHPNVSIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHE 298
Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESL 407
K +K + ++ EARN+ ++ + D+YF VD+D L NP LK L+ +N +
Sbjct: 299 IKTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSVDADVVLTNPRTLKILIEQNRKI 358
Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++ G+WNVPY+ N YL+K
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGVWNVPYMANVYLIKGKT 416
Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQ 505
+++ N + + + +D D A C N R KG+ + I +
Sbjct: 417 LRSEMNERNYFVRDKLDPDTALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYISNRH 476
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + + QPCPDVFWFP
Sbjct: 477 EFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFP 535
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV L VW F+R++
Sbjct: 536 IFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREF 595
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GG
Sbjct: 596 IAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGG 654
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 655 GCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 703
>gi|410984588|ref|XP_003998609.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Felis
catus]
Length = 698
Score = 601 bits (1549), Expect = e-169, Method: Compositional matrix adjust.
Identities = 293/711 (41%), Positives = 439/711 (61%), Gaps = 53/711 (7%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ +T+GY+RF++SAE V+TLGL Q W GGD++ ++GGG KV L
Sbjct: 36 VNPEKLLVITVATAKTEGYRRFLRSAEFFNYTVRTLGLGQEWRGGDVARTVGGGQKVRWL 95
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DM+I+ DSYDVI+ G +++L++F + ++F AE CWP+ L ++
Sbjct: 96 KKEMEKYADREDMVIMFVDSYDVILAGSPSELLKKFVQSGSRLLFSAEGFCWPEWGLAEQ 155
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I ++ ++++DDQL+Y L+LD LR K + LD
Sbjct: 156 YPEVGTGKRFLNSGGFIGFAPTIHRVVRQWKYEDDDDDQLFYTRLYLDPGLREKLSLNLD 215
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L++I L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 216 HKSRIFQNLNGALDEIVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 274
Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC C + L +P P VL++VF+++PT FL FL ++ L+YP ++++
Sbjct: 275 WTPQGGCGFCGRDRRILPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 332
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ YH P D + F VK + + EAR++A+++ +FYF
Sbjct: 333 FLHNNEVYHEPHIADSWPQLQGHFSAVKLVGPEEALTPGEARDMAMDSCRQDPECEFYFS 392
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ + N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 393 LDADAVITNQQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 452
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC +LR+KGI L + +
Sbjct: 453 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 510
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
QE+G L+ + +D +P+++++ NPLDW +YIH Y ++L + QPCPDV+W
Sbjct: 511 QQEFGRLLATSRYDTDHLHPDLWQIFDNPLDWREQYIHENYSRALEGQGLVEQPCPDVYW 570
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++++ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 571 FPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 630
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
YV P+ E F GYH +
Sbjct: 631 TYVGPMTESLFPGYHT-------------------------------------------K 647
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 648 GGGCRFLRYDCIVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 698
>gi|345789311|ref|XP_542822.3| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Canis
lupus familiaris]
Length = 703
Score = 600 bits (1548), Expect = e-169, Method: Compositional matrix adjust.
Identities = 297/709 (41%), Positives = 443/709 (62%), Gaps = 31/709 (4%)
Query: 51 IQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
+QSA VK LG + W GGD ++S+GGG KV L+K ++ +D+++L T+ ++
Sbjct: 1 MQSARYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEVMEHYANQEDLVVLFTECFN 60
Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
VI GG ++L++F + +VF A+ + WPD L DKYP V G RYLNSGGFIGYA +
Sbjct: 61 VIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKRLADKYPIVHIGKRYLNSGGFIGYAPN 120
Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
I +++ ++++ +DDQL+Y +++D R I LD +FQ L G+++++ L F+
Sbjct: 121 INQIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFEN 180
Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
+ N Y T PV ++GNG +KI LN FGNY+ +W + +GCT C+L +D D
Sbjct: 181 GK-ARAKNVFYETLPVAVNGNGPTKILLNYFGNYVPNAWTQDNGCTLCDL-DTIDLSTVD 238
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
P+V I VFI++PT FL FL+ + L+YP + + +F++N + YH + K
Sbjct: 239 VHPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHE 298
Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESL 407
+K + ++ EARN+ ++ + D+YF +D+D L NP LK L+ +N +
Sbjct: 299 INTIKIVGPEENLSQAEARNMGMDFCRQDENCDYYFSMDADVVLTNPRTLKILIEQNRKI 358
Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++ GIWNVPY+ N YL+K
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGIWNVPYMANVYLIKGKT 416
Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQ 505
+++ N + + + +D DMA C N R KG+ + I +
Sbjct: 417 LRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYISNRH 476
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + + QPCPDVFWFP
Sbjct: 477 EFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFP 535
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I +EK C E V+ ME YGQWS G ++D R+ GYE VPT DIHMKQ+ L VW F+R++
Sbjct: 536 IFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIREF 595
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GG
Sbjct: 596 IAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGG 654
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 655 GCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 703
>gi|432960048|ref|XP_004086421.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
[Oryzias latipes]
Length = 695
Score = 600 bits (1546), Expect = e-168, Method: Compositional matrix adjust.
Identities = 294/714 (41%), Positives = 442/714 (61%), Gaps = 46/714 (6%)
Query: 27 KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVN 85
+ I E+K LV+TVA+ ETDG++RF++SA VK LG Q W GGD MS+ GGG KV
Sbjct: 22 QRIPEEKLLVLTVATKETDGFRRFLKSARNFNYTVKVLGRGQKWSGGDYMSAPGGGQKVR 81
Query: 86 LLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLY 145
LLK L E ++D ++L TDSYD + G ++L +F +VF +ERL WPD L
Sbjct: 82 LLKEALQETK-SEDQVLLFTDSYDAVFASGPKELLRKFQQAKHKVVFSSERLIWPDRHLE 140
Query: 146 DKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIV 205
DK+P V G R+L SGGF+GY +I+E+++N + ++ + DQL++ +++D R I
Sbjct: 141 DKHPHVREGNRFLGSGGFMGYLSNIREMVANWTGEDADSDQLFFTKIYVDPDKRKSINIT 200
Query: 206 LDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLA 265
+D+ LFQNL+G+L+D+ L F+ D V N Y+T PV+IHGNG +K+++N GNY+
Sbjct: 201 VDSRCRLFQNLHGALDDVVLKFE-DGRVRARNVLYDTLPVLIHGNGPTKLQINYMGNYIP 259
Query: 266 KSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKI 323
KSW +GCT C + ++ L +LK ++P V I +FI +PT F+ F ++ L YP ++
Sbjct: 260 KSWTFENGCTVCQDDLRSLAALKDSEYPLVSIGIFIQQPTPFVSVFFERLLKLEYPKDRL 319
Query: 324 SMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFY 382
+F+YN + +H P ++ + ++++V+ I ++S AR+L ++ D++
Sbjct: 320 KLFIYNQEPHHEPQVSSFLRDHGGLYQDVRSITPKEDMDSAAARDLVLDICRKDTDCDYF 379
Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
F +D + L N LK L+ +N ++AP++ R + WSNFWGA++ADG+YARS DY++I+
Sbjct: 380 FNLDIEVVLKNEKTLKILIEQNLPVVAPMITRAARLWSNFWGAVSADGYYARSEDYVDIV 439
Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRN--KGIHLK 500
G + K + L++ KG
Sbjct: 440 QGRRVSK------------------------------------QSLRGELQDHLKGSFHV 463
Query: 501 IDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPD 560
+ Q +G ++ +EN+ + +++++ NP+DW+ RYIH Y + ++ D + PCPD
Sbjct: 464 CNHMQTFGRILSTENYQSTHLHNDLWQIFENPVDWEERYIHQNYSR-IMRDKLIETPCPD 522
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
V+WFP+ ++ C V+ ME +G+WS G N D R++ GYE VPT DIHM Q+ W +
Sbjct: 523 VYWFPVFSDVACTHLVEEMEHFGKWSGGGNTDTRIQGGYENVPTIDIHMNQINFEKEWQK 582
Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGV 680
FL +YV P+ E+ + GY+ + ++FVVRY+PDEQPSLRPHHD+ST+TINIALNQ+G
Sbjct: 583 FLVEYVAPITEKMYPGYYTK-AHFELAFVVRYKPDEQPSLRPHHDASTFTINIALNQLGQ 641
Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
DY+GGGCRF+RY C++ A R GW LMHPGRLTHYHEGL T GTRYI +SFVDP
Sbjct: 642 DYQGGGCRFLRYGCSIQAPRKGWALMHPGRLTHYHEGLPTTAGTRYIAVSFVDP 695
>gi|410971254|ref|XP_003992085.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Felis
catus]
Length = 703
Score = 597 bits (1540), Expect = e-168, Method: Compositional matrix adjust.
Identities = 296/709 (41%), Positives = 441/709 (62%), Gaps = 31/709 (4%)
Query: 51 IQSAEVNKLQVKTLGLHQPWLGG-DMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
+QSA+ VK LG + W GG ++S+GGG KV L+K ++ +D+++L T+ +D
Sbjct: 1 MQSAKYFNYTVKVLGQGEEWRGGAGINSIGGGQKVRLMKEVMEHYANQEDLVVLFTECFD 60
Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
VI GG ++L++F + +VF A+ + WPD L DKYP V G RYLNSGGFIGYA
Sbjct: 61 VIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKRLADKYPIVHFGKRYLNSGGFIGYAPY 120
Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
I ++ ++++ +DDQL+Y +++D R I LD +FQ L G+++++ L F+
Sbjct: 121 INRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFEN 180
Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
+ N Y T PV ++GNG +KI LN FGNY+ +W + +GCT C+L +D D
Sbjct: 181 GK-ARAKNVFYETLPVALNGNGPTKILLNYFGNYVPNAWTQDNGCTLCDL-DTIDLSTVD 238
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
P+V I +FI++PT FL FL+ + L+YP + + +F++N + YH + K
Sbjct: 239 VHPNVTIGIFIEQPTPFLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHE 298
Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLHKGV-DFYFYVDSDSHLDNPDVLKYLVNRNESL 407
+K + ++ EARN+ ++ + D+YF +D+D L NP LK L+ +N +
Sbjct: 299 ISTIKIVGPEENLSQAEARNMGMDFCRQDEICDYYFSIDADVVLTNPRTLKILIEQNRKI 358
Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++ GIWNVPY+ N YL+K
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGIWNVPYMANVYLIKGKT 416
Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQ 505
+++ N + + + +D DMA C N R KG+ + I +
Sbjct: 417 LRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYISNRH 476
Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + + QPCPDVFWFP
Sbjct: 477 EFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFP 535
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
I +EK C E V+ ME YGQWS G ++D R+ GYE VPT DIHMKQ+ L VW F+R++
Sbjct: 536 IFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIREF 595
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
+ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GG
Sbjct: 596 IAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGG 654
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 655 GCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 703
>gi|317419976|emb|CBN82012.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Dicentrarchus
labrax]
Length = 703
Score = 594 bits (1532), Expect = e-167, Method: Compositional matrix adjust.
Identities = 295/710 (41%), Positives = 433/710 (60%), Gaps = 33/710 (4%)
Query: 51 IQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
+QSA VK LG+ + W GGD+ S+GGG KV LLK ++ + +D+++L DSYD
Sbjct: 1 MQSARYFNYTVKVLGMGEAWKGGDVGRSIGGGQKVRLLKEAMEALADQEDLVVLSVDSYD 60
Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
+I GG +IL +F + ++F AE L WPD L DKYP++ SG RYLNSGG IGYA
Sbjct: 61 LIFAGGPEEILRKFQQANHKVLFAAEGLIWPDKRLADKYPSIRSGKRYLNSGGIIGYAPY 120
Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
I ++S ++ + +DDQL+Y ++LD R + LD +FQNL G+++++ L F
Sbjct: 121 INRVVSQWNLHDNDDDQLFYTKIYLDPLRRETLNMTLDHKCQIFQNLNGAVDEVLLKFGT 180
Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKP 287
D V + NT Y++ PV++HGNG +K+ LN NY+ +W GC+ C + + L LK
Sbjct: 181 DR-VRVRNTVYDSLPVVVHGNGNTKMYLNYMANYVPNTWNYEHGCSHCDDDVVDLSQLK- 238
Query: 288 DQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKT 347
++P+VL+ VFI++PT FL EF ++ L+YP K+ +FV+NN+ YH + +
Sbjct: 239 -EYPNVLVGVFIEQPTPFLPEFFQRLLTLDYPKDKLKVFVHNNEVYHEKHIQKFWEENRN 297
Query: 348 MFKNVKYIAHNSTVNSKEARNLAVENSLHKGV-DFYFYVDSDSHLDNPDVLKYLVNRNES 406
+F + K + ++ EARN+ ++ +YF +DSD L N LK L+ +N
Sbjct: 298 VFNSFKVVGPEENLSQGEARNMGMDLCRKDTTCAYYFSIDSDVMLTNRQTLKLLIEQNRK 357
Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS 466
+I PL+ R K WSNFWGAL+ DG+YARS DY++I+ + G+WN+PY+ + Y++K S
Sbjct: 358 IIGPLVTRHSKLWSNFWGALSLDGYYARSEDYVDIVQKKR--VGVWNIPYMAHVYMVKGS 415
Query: 467 VIK-ATNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDST 504
++ + + L +D DM+ C N R KG+ + I +
Sbjct: 416 TLRNELKERNYFVLEKLDPDMSLCRNAREMTSHREKDSPSPESFHMLRPPKGVFMYITNR 475
Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
++G L+ + N++ N +++++ NP+DW +YIH Y K + + +PCPDVFWF
Sbjct: 476 HDFGRLISTANYNISHYNNDLWQIYENPVDWKEKYIHKNYSK-IFTENYMEEPCPDVFWF 534
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
P+ +EK C E V ME YG WS G + DKR+ GYE VPT DIHMKQ+G W F+R+
Sbjct: 535 PVFSEKACDEIVGEMEHYGTWSGGRHMDKRIAGGYETVPTDDIHMKQIGFDKEWLHFIRE 594
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
++ P+ + F GY+ + A M+FVV+Y P+ Q LRPHHDSST+TINIALN D++G
Sbjct: 595 FISPVTLKVFSGYYTKGY-AVMNFVVKYTPERQAYLRPHHDSSTFTINIALNNKDTDFQG 653
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGCRF RYNC++ + R GW MHPGRLTH HEGL T GTRYI +SF+DP
Sbjct: 654 GGCRFHRYNCSINSPRKGWSFMHPGRLTHLHEGLPTTNGTRYIAVSFIDP 703
>gi|226490282|emb|CAX69383.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Schistosoma
japonicum]
Length = 721
Score = 594 bits (1532), Expect = e-167, Method: Compositional matrix adjust.
Identities = 307/716 (42%), Positives = 456/716 (63%), Gaps = 37/716 (5%)
Query: 35 LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDE 93
LV+TVA+ + D RF++S +N +VK LG W GG+++ S GGG KVN+LK+EL +
Sbjct: 27 LVLTVATEKNDALDRFLRSCSLNGFEVKVLGEGSYWKGGNVAKSTGGGQKVNILKDELAK 86
Query: 94 MDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGS 153
D ++L DSYDV+ V ++L+ + F++ ++F AE CWP SL YP V
Sbjct: 87 STYRPDQLVLFVDSYDVVFMQNVANLLKEYERFESKVIFSAEEFCWPQPSLKSLYPEVKP 146
Query: 154 G-YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANL 212
G RYLNSGGFIG ++ +++++ I +++DDQLYY +FLD LRT + I LD + +
Sbjct: 147 GEKRYLNSGGFIGPVANLIKIVNHTPINDDDDDQLYYTNIFLDSKLRTLYDIELDKTSRI 206
Query: 213 FQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TS 271
FQNL G+ +D++L+F+ DE +L N ++T P+I HGNG K+E NS NYL SW T
Sbjct: 207 FQNLNGAFDDVELHFN-DETGYLFNKIFSTTPIIAHGNGPIKVEFNSLSNYLVHSWTPTH 265
Query: 272 GCTRCNLIK-HLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF--VY 328
GC CN L+ L +P V++ +F+++ T F+E+F +IA L+YP ++ + +
Sbjct: 266 GCQHCNEDNVELNDLS--DYPLVVMGIFVEQATPFIEKFFERIAALSYPKSRLHVVGHMA 323
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARN------LAVENSLHKGVDFY 382
+ ++ + + F + +V ++ N + + AR LA+E+ +
Sbjct: 324 ESTKFQLSASESFNQTFGHEYLSVSWLEEN--LEEEIARKKVFGYCLAIED-----CKYV 376
Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
F VDS + L+NP+ L +L+ N S+IAPLL KAWSNFWGAL DG+YARS DYM+II
Sbjct: 377 FAVDSIAQLENPETLDHLIRMNRSIIAPLLTIRGKAWSNFWGALGTDGYYARSSDYMDII 436
Query: 443 NGDQGGKGIWNVPYITNCYLM-KTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
+ + GIWNVP + + YL+ + +V+K +I +M F RNK I + +
Sbjct: 437 SYNM--TGIWNVPLVRSAYLITRPAVLKLIDITNT--------EMNFAYEARNKNIFMFV 486
Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNN---QPC 558
D+ +G+L++++N+ K + ++++++ NP DW+ +YIHP+Y + P+ + QPC
Sbjct: 487 DNQVNFGYLINADNYTKGKLHNDLWQIMDNPQDWEEKYIHPQYFNTAKPEVMMTDVAQPC 546
Query: 559 PDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW 618
PDVFWFP+++E FC ++ +E YGQWS+G N D RLE GYE VPTRDIHM+Q+G W
Sbjct: 547 PDVFWFPLLSETFCKHLIEEVENYGQWSNGDNYDPRLEGGYENVPTRDIHMRQIGWEEHW 606
Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
L KYV +Q++ F GY +P A M+FVVRY+PDEQPSLRPHHD+S+YT+NI LNQ
Sbjct: 607 LHVLEKYVHKIQKKLFQGYDDKP-WARMNFVVRYKPDEQPSLRPHHDASSYTLNIGLNQP 665
Query: 679 GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
DY+GGG F RYNC++ TR+GW ++ PGR+TH HEGL T+GTRYI ++FV+P
Sbjct: 666 EKDYKGGGVHFNRYNCSIIDTRVGWAVVSPGRVTHLHEGLPTTEGTRYIFVTFVNP 721
>gi|341902492|gb|EGT58427.1| hypothetical protein CAEBREN_29667, partial [Caenorhabditis
brenneri]
Length = 689
Score = 585 bits (1509), Expect = e-164, Method: Compositional matrix adjust.
Identities = 286/684 (41%), Positives = 433/684 (63%), Gaps = 36/684 (5%)
Query: 79 GGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF--NTFDANIVFGAER 136
GGG K+ +L +++ D +I+ D+YDVI + IL +F + D ++FGAE
Sbjct: 14 GGGQKIRILSEWIEKYKDASDTMIMFVDAYDVIFNADSTTILRKFFEHYSDKRLLFGAEP 73
Query: 137 LCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDE 196
CWPD +L YP V G R+LNSG F+GY ++ +++ + +++++DDQLYY ++LD
Sbjct: 74 FCWPDQTLAPDYPIVEFGKRFLNSGLFMGYGPEVYKVLKLKPVEDKDDDQLYYTRVYLDN 133
Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE 256
LR + K+ LD+++ +FQNL G +ED++L F D N YNT P+I+HGNG SK
Sbjct: 134 KLRKELKMDLDSMSKIFQNLNGVIEDVELQFKEDGTPEAYNAAYNTKPLIVHGNGPSKSH 193
Query: 257 LNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIAN 315
LN GNYL W + GC C + ++ P + +++FI KP F+EE L K+A
Sbjct: 194 LNYLGNYLGNRWNSQLGCRTCGQ----EMKDSEELPLIGLNIFIAKPIPFIEEVLQKVAE 249
Query: 316 LNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSL 375
+YP KI++++YNNQ + D++ + + I + + +EARN A+E
Sbjct: 250 FDYPKDKIALYIYNNQPFSIKNIQDFLKEHGKSYYTKRVINGVTEIGEREARNEAIEWDK 309
Query: 376 HKGVDFYFYVDSDSHLDNPDVLKYLVNRNES----LIAPLLVRPFKAWSNFWGALNADGF 431
+ V+F F +D D++ P V+K L++ +++ +IAP++ + K ++NFWGA+ A+G+
Sbjct: 310 QRNVEFAFLMDGDAYFTEPKVIKDLIHYSKTYDVGIIAPMVGQIGKLFTNFWGAIAANGY 369
Query: 432 YARSFDYMNIINGDQGGKGIWNV----------------PYITNCYLMKTSVIKATNIKT 475
YARS DYM I+ G++ G WNV P+IT+ L+ + A +K
Sbjct: 370 YARSEDYMAIVKGNR--IGYWNVRQKLRNVSNNNFLFQVPFITSALLLNKEKLSA--LKD 425
Query: 476 IYTLN-SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQ----KTNPEVYELIR 530
Y+ N ++D DM+ C R+ G + ID+ ++YG+L+ S+ F K +PE++++
Sbjct: 426 AYSYNKNLDPDMSMCQFARDNGHFMYIDNEKQYGYLIVSDEFSETVTEGKWHPEMWQIFE 485
Query: 531 NPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN 590
N W+ RY+HP Y K + PD + +Q CPDV+ +P+++E+FC E ++ ME +G+WSDG+N
Sbjct: 486 NRDLWEARYVHPGYHKIMEPDHIIDQACPDVYDYPLMSERFCEELIEEMEGFGRWSDGSN 545
Query: 591 NDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVV 650
NDKRL GYE VPTRDIHM QVG W FL YV P+QE+ FIGY+H+PV + M FVV
Sbjct: 546 NDKRLAGGYENVPTRDIHMNQVGFERQWLYFLDTYVRPVQEKTFIGYYHQPVESNMMFVV 605
Query: 651 RYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGR 710
RY+P+EQ SLRPHHD+ST++I++ALN+ G DYEGGG R++RYNC V A +G+ +M PGR
Sbjct: 606 RYKPEEQASLRPHHDASTFSIDVALNKKGRDYEGGGVRYVRYNCTVEADEVGYAMMFPGR 665
Query: 711 LTHYHEGLQVTQGTRYIMISFVDP 734
LTH HEGL T+GTRYIM+SF++P
Sbjct: 666 LTHLHEGLATTKGTRYIMVSFINP 689
>gi|156385144|ref|XP_001633491.1| predicted protein [Nematostella vectensis]
gi|156220562|gb|EDO41428.1| predicted protein [Nematostella vectensis]
Length = 729
Score = 582 bits (1501), Expect = e-163, Method: Compositional matrix adjust.
Identities = 302/741 (40%), Positives = 448/741 (60%), Gaps = 23/741 (3%)
Query: 5 LHLNCLILSCVV------FFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNK 58
+ + LI SCV + ++ ++ E + LV+TVA+ ETDGY RF++S
Sbjct: 1 MSVKALISSCVFLLASLSYLVNADNGFSRDPKELELLVLTVATEETDGYTRFMRSCSHYD 60
Query: 59 LQVKTLGLHQPWLGGDMSS-LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVN 117
+ V+ +G++ W GG++ + GG +K+NLLK+ + E +++++ +DSYD I
Sbjct: 61 VPVRVIGMNTSWKGGNVRTDPGGAHKINLLKDAVAEYKDKKNLVLMFSDSYDAIFLARAE 120
Query: 118 DILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNR 177
+++F F A++VF AE CWPD L DKYP VG G RYL SGGFIGYA ++I+ +
Sbjct: 121 AFIKKFLEFKAHVVFSAEGFCWPDRWLVDKYPEVGHGKRYLCSGGFIGYAPVFHQIINEK 180
Query: 178 SIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTN 237
+K+E+DDQL+Y ++LD+ R K + LD A +F NL G+ E+++L F+ E V L N
Sbjct: 181 PVKDEDDDQLFYTNIYLDKEKRDKFNMKLDHKAEIFMNLNGAEEEVQLKFE-GEKVWLYN 239
Query: 238 TKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLIS 296
Y+T P+ +HGNG SK+ LN GNYL W K GC CN K +P V+++
Sbjct: 240 KVYSTTPLWVHGNGPSKVHLNYIGNYLPAMWNKEKGCLVCNEDTIKLPEKESDYPKVMMA 299
Query: 297 VFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYI-HNFKTMFKNVKYI 355
+FI +PT F+ EF +I L+YP KKI+++++N + H ++++ + ++ +V Y
Sbjct: 300 IFISRPTPFVPEFFKRIEALDYPKKKIALYIHNLMDGHTKEVNEWLTEEIRGLYHSVTY- 358
Query: 356 AHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRP 415
ARN AV + G D+ F VD++ N LK L+ +N L+ P + +
Sbjct: 359 -QGPGTFEAAARNKAV----YSGSDYLFVVDANVVYTNKKSLKLLIEQNRPLLVPKMSKH 413
Query: 416 FKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKT 475
K WSNFWG + DG+YAR+ DY++I+ + GIWN Y+T YL++ V+ +K
Sbjct: 414 AKLWSNFWGTIGDDGYYARAEDYIDIVEYRR--VGIWNSAYVTGSYLIQKDVL--PKLKH 469
Query: 476 IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDW 535
Y+ +++ D++F LR+ GI + + + +G L +++ + +++++ N +DW
Sbjct: 470 AYSYGNLEPDLSFSKYLRDNGIFMYVTNMHYFGRLKETDTVTTNHLHNDLWQIFDNQIDW 529
Query: 536 DLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN--NDK 593
+ RY+HP Y ++L PC DVFWFP+++E + ++ ME YG+WS G + D
Sbjct: 530 EERYLHPNYSQNLNKSIPLKMPCNDVFWFPLMSETWATHMIEEMEHYGKWSGGKHEPQDA 589
Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
RL GYE VPT DIHM QVG W L+ Y+VP+ R F GY+ E RA M+FVV+Y
Sbjct: 590 RLNGGYENVPTVDIHMNQVGWEREWLHLLKTYIVPVNTRIFPGYYSEG-RAIMNFVVKYT 648
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
P Q LRPHHDSSTYTINI LN+ G+ Y GGG RFIR +C VT T++GW LMHPGRLTH
Sbjct: 649 PSGQYYLRPHHDSSTYTINIGLNKPGIHYGGGGSRFIRQDCAVTDTQVGWALMHPGRLTH 708
Query: 714 YHEGLQVTQGTRYIMISFVDP 734
YHEGL T GTRYIM+ FVDP
Sbjct: 709 YHEGLPTTWGTRYIMVCFVDP 729
>gi|90079137|dbj|BAE89248.1| unnamed protein product [Macaca fascicularis]
Length = 645
Score = 580 bits (1495), Expect = e-162, Method: Compositional matrix adjust.
Identities = 281/651 (43%), Positives = 415/651 (63%), Gaps = 9/651 (1%)
Query: 87 LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ + WPD L D
Sbjct: 1 MKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGILWPDKRLAD 60
Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
KYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D R I L
Sbjct: 61 KYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITL 120
Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
D +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI LN FGNY+
Sbjct: 121 DHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILLNYFGNYVPN 179
Query: 267 SW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
SW + +GCT C +D D P+V I VFI++PT FL FL+ + L+YP + + +
Sbjct: 180 SWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEALKL 238
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++N + YH + K K +K + ++ EARN+ ++ + D+Y
Sbjct: 239 FIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYSS 298
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G
Sbjct: 299 VDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQG 358
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
++ G+WNVPY+ N YL+K ++ N + + + +D DMA C N R G+ + I +
Sbjct: 359 NR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAREMGVFMYISN 416
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + + QPCPDVFW
Sbjct: 417 RHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFW 475
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FPI +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV L VW F+R
Sbjct: 476 FPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIR 535
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++
Sbjct: 536 EFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQ 594
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 595 GGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 645
>gi|395833069|ref|XP_003789568.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
[Otolemur garnettii]
Length = 942
Score = 580 bits (1495), Expect = e-162, Method: Compositional matrix adjust.
Identities = 292/707 (41%), Positives = 427/707 (60%), Gaps = 52/707 (7%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNE 90
DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++++GGG KV L+K
Sbjct: 284 DKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINTIGGGQKVRLMKEV 343
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
+++ DD++IL T+ +DVI GG ++L++F + +VF A+ L WPD L DKYP
Sbjct: 344 MEQYANEDDLVILFTECFDVIFAGGPEEVLKKFQKTNHKVVFAADGLLWPDKRLADKYPI 403
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D R I LD
Sbjct: 404 VHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKC 463
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
+FQ L G+++++ L F+ + + NT Y T PV ++GNG +KI LN FGNY+ SW +
Sbjct: 464 KIFQTLNGAVDEVVLKFENGK-ARVKNTFYETLPVAVNGNGPTKILLNYFGNYIPNSWTQ 522
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
GCT C +D D P+V I VFI++PT FL FL+ + L+YP + + +F++N
Sbjct: 523 DKGCTLCE-SDTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDLLLTLDYPKEALKVFIHN 581
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
+ YH + K +K + ++ EARN+ ++ + D+YF VD+D
Sbjct: 582 KEVYHEKDIKVFFDKAKHEINTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSVDAD 641
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L NP LK L+ +N
Sbjct: 642 VVLTNPRTLKLLIEQN-------------------------------------------- 657
Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
+G+WNVPY+ N YL+K +++ N + + + +D DMA C N R G+ + I + E+
Sbjct: 658 RGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMGVFMYISNRHEF 717
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
G L+ + N++ N +++++ NP+DW +YI+ +Y K + +++ QPCPDVFWFPI
Sbjct: 718 GRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTESIVEQPCPDVFWFPIF 776
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
+EK C E V+ ME YGQWS G ++D R+ GYE VPT DIHMKQVGL VW F+R+++
Sbjct: 777 SEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQVGLESVWLHFIREFIA 836
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
P+ + F GY+ + A ++FVV+Y PD Q SLRPHHD+ST+TINIALN VG D++GGGC
Sbjct: 837 PVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRSLRPHHDASTFTINIALNNVGEDFQGGGC 895
Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 896 KFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 942
>gi|402852964|ref|XP_003891176.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Papio
anubis]
Length = 784
Score = 580 bits (1494), Expect = e-162, Method: Compositional matrix adjust.
Identities = 298/764 (39%), Positives = 446/764 (58%), Gaps = 64/764 (8%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSK---------------- 254
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKYPPGARNTYLGACYEL 263
Query: 255 -------------------IELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSV 293
++LN GNY+ + W +GCT C+ ++ L + + P+V
Sbjct: 264 TTSVLTSELSVVPSFPAVLLQLNYLGNYIPRFWTFETGCTVCDEGLRSLKGIGDEALPTV 323
Query: 294 LISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVK 353
L+ +FI++PT F+ F ++ L+YP K + +F++N++++H ++++ + +++VK
Sbjct: 324 LVGMFIEQPTPFVSLFFQRLLQLHYPRKHMRLFIHNHEQHHKAQVEEFLAEHGSEYQSVK 383
Query: 354 YIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL 412
+ + + +ARN+ + + +YF VD+D L P+ L+ L+ +N+++IAPL+
Sbjct: 384 LVGPEVRMANADARNMGADLCRQDRSCTYYFSVDADVALTEPNSLRLLIQQNKNVIAPLM 443
Query: 413 VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT- 471
R + WSNFWGAL+ADG+YARS DY++I+ G + G+WNVPYI+N YL+K S ++
Sbjct: 444 TRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR--IGVWNVPYISNIYLIKGSALRGEL 501
Query: 472 NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRN 531
++ + +D DMAFC N+R + + + + + GHL+ +++ + +++E+ N
Sbjct: 502 QSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTLGHLLSLDSYRTTHLHNDLWEVFSN 561
Query: 532 PLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNN 591
P DW +YIH Y K+L V PCPDV+WFPI TE C E V+ ME +GQWS G N
Sbjct: 562 PEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIFTEAACDELVEEMEHFGQWSLGDNK 620
Query: 592 DKRLETGYEAVPTR-DIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAP----- 645
L G R I + W + P H P R P
Sbjct: 621 VGTLMPGLGPQGGRLSISATHCPGSPSWLLYQPTKKAPPPAVAGQRAHSLPSRKPVPHTC 680
Query: 646 ---------------MSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI 690
++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGCRF+
Sbjct: 681 ALLHPKQPSLDAQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGCRFL 740
Query: 691 RYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 741 RYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 784
>gi|301618077|ref|XP_002938453.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like
[Xenopus (Silurana) tropicalis]
Length = 1185
Score = 575 bits (1482), Expect = e-161, Method: Compositional matrix adjust.
Identities = 290/720 (40%), Positives = 438/720 (60%), Gaps = 15/720 (2%)
Query: 24 NKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGY 82
N + N+ DK LV+TVA+ ETDG+ RF+QSA VK LG W GGD++ ++GGG
Sbjct: 472 NSIDNLPTDKLLVLTVATRETDGFHRFMQSARHFSYTVKVLGKGIEWKGGDVANTIGGGQ 531
Query: 83 KVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
KV LLK L+ ++ DD++IL TDSYDVI GG ++L +F + +VF AE L WPD
Sbjct: 532 KVRLLKEALESLEDQDDLVILFTDSYDVIFAGGPEEVLLKFQQSNHKVVFAAEGLIWPDK 591
Query: 143 SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKH 202
SL + YP + + +L GFIGY ++K+++ +++ +DDQL+Y +++D+ R
Sbjct: 592 SLKETYPFITFLFLFLCVAGFIGYLPNVKQIVQQWDLQDNDDDQLFYTKIYIDQIQRESI 651
Query: 203 KIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGN 262
I LD + LFQN+ G+L+++ L F+ D + N++Y++ PV+IHGNG +K++LN FGN
Sbjct: 652 SITLDHKSTLFQNINGALDEVILAFE-DGKARVKNSQYDSLPVLIHGNGPTKLQLNYFGN 710
Query: 263 YLAKSWK-TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAK 321
Y+ W +GC C+L D + +P V + +FI++PT FL EF N++ L+YP +
Sbjct: 711 YIPNVWAPETGCGTCDL-DTTDLSTANAYPKVTVGIFIEQPTPFLPEFFNRLLALDYPKE 769
Query: 322 KISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVD 380
++ F++N++ YH + K + N+K + + EARN+ + K D
Sbjct: 770 NMNFFIHNSEVYHEQHIVKFWEQAKNVIGNLKVVGPEEPIMQAEARNMGMNTCRQDKECD 829
Query: 381 FYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMN 440
+YF +D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL DG+YARS DY++
Sbjct: 830 YYFNIDADVMLTNPQTLKILIEQNRKIIAPLVTRHGKLWSNFWGALTPDGYYARSEDYVD 889
Query: 441 IINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHL 499
I+ G + G+WN+PY+ + YL+K +++ + +++L+ +D DMA C N R G+ +
Sbjct: 890 IVQGKR--VGLWNMPYVAHVYLIKGETLRSEMKERNLFSLDRLDPDMALCRNAREMGVFM 947
Query: 500 KIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCP 559
I + E+G L+ + N++ N +++++ NP+DW +YI+ Y K + + QPCP
Sbjct: 948 YITNRHEFGRLLSTANYNTTHYNNDLWQIFENPVDWREKYINANYSK-IFTQNIVEQPCP 1006
Query: 560 DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
DVFWFP+++EK C E V+ ME +GQWS + D R+ GYE VPT DIHM Q+GL W
Sbjct: 1007 DVFWFPVLSEKACDELVEEMENFGQWSGSAHTDTRIAGGYENVPTDDIHMNQIGLDNEWL 1066
Query: 620 EFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQ----PSLRPHH-DSSTYTINIA 674
F+R+Y+ P+ + F GY+ + A ++F P P L D +
Sbjct: 1067 HFIREYIAPITLKVFAGYYTKG-HALLNFXXXXXPCPDVFWFPVLSEKACDELVEEMENF 1125
Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
G + GGGCRF RYNC++ + R GW MHPGRLTH HEGL VT GTRYI +SF+DP
Sbjct: 1126 GQWSGSAHTGGGCRFARYNCSIESPRKGWSFMHPGRLTHLHEGLPVTNGTRYIAVSFIDP 1185
>gi|358254467|dbj|GAA55391.1| lysyl hydroxylase/galactosyltransferase/ glucosyltransferase
[Clonorchis sinensis]
Length = 694
Score = 570 bits (1469), Expect = e-159, Method: Compositional matrix adjust.
Identities = 304/710 (42%), Positives = 439/710 (61%), Gaps = 29/710 (4%)
Query: 33 KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELD 92
+ LV+TVA+ D +RF++SA N+ VK S+GGG K+NLL++EL
Sbjct: 6 ELLVLTVATETNDALERFLRSANNNQFNVK--------------SVGGGQKINLLRDELR 51
Query: 93 EMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG 152
DD++IL DSYDV+ ++E + + I+FGAE CWPD SL YP VG
Sbjct: 52 SHINLDDLLILFLDSYDVVFMDSKFRLVEEYENSNHTILFGAESFCWPDQSLEKMYPQVG 111
Query: 153 SGY-RYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
R+LNSGGFIG A + +++ IK ++DDQLYY ++L+E LR + I LDT ++
Sbjct: 112 PKENRFLNSGGFIGPASSLYRMVTEMPIKEDDDDQLYYTKIYLNEALRRELNIGLDTRSS 171
Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
+FQNL G+L+DI+L+F D +L NTK T P++ HGNG K E NS NYL +W T
Sbjct: 172 IFQNLNGALQDIELHFTNDT-GYLVNTKTGTRPIVAHGNGPIKPEFNSLTNYLDHNWTPT 230
Query: 271 SGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM--FVY 328
GC C+ +++D + +FP++ +S+FI+ PT FL+ F ++IA L+YP I + V
Sbjct: 231 QGCQHCSE-RNIDLDEQGEFPNIQLSIFIENPTPFLDVFFDRIAALSYPKSHIHLTGHVA 289
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
E L D + + + +V + + AR+ A + L F F VDS
Sbjct: 290 KKAEKQRALADTFNKTYGHEYLSVSWFDAEEVTDEAIARDYAYAHCLALDTCKFLFSVDS 349
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
L NP +++L+ N S+IAP+L R K WSNFWGAL+ DG+Y RS DY+ I+ ++
Sbjct: 350 TVQLTNPRTIEHLIQMNRSMIAPMLSRRGKLWSNFWGALSRDGYYERSDDYIEIV--ERN 407
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
GIWNVP+I + YL+ V++ + S+D +M R + + + +D+ + Y
Sbjct: 408 RTGIWNVPFIRDAYLLSRRVVRKFAEHKL--AGSIDVEMRIPAIARQENVFMTVDNLEPY 465
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPD---TVNNQPCPDVFWF 564
G+LV + + N +++++ NPLDW+ +Y+HP+Y K P+ T QPCPDVF+F
Sbjct: 466 GYLVFPDTYTTDHLNNDLWQIFDNPLDWEEQYVHPDYFKISNPEVKMTDIEQPCPDVFYF 525
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
PIV+ KFC + + +E +G WSDGTN D RLE GYE VPTRDIHM+Q+ W FL K
Sbjct: 526 PIVSAKFCRQLIAEVEEFGLWSDGTNIDPRLEGGYENVPTRDIHMRQINWEDHWLHFLVK 585
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
YV P+Q++ F GY +P A M+FVVRYRPDEQ SLRPHHD+S+Y++ IALN+ V+++G
Sbjct: 586 YVHPIQKKVFAGYDDKPW-ARMNFVVRYRPDEQSSLRPHHDASSYSLTIALNEAEVEFQG 644
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GG RF+RYNC++ +++GW M PGR+TH HEGL T GTRYI ++FV+P
Sbjct: 645 GGTRFVRYNCSLVRSKLGWTSMFPGRVTHLHEGLITTSGTRYIFVTFVNP 694
>gi|170590254|ref|XP_001899887.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase precursor, putative
[Brugia malayi]
gi|158592519|gb|EDP31117.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase precursor, putative
[Brugia malayi]
Length = 688
Score = 568 bits (1465), Expect = e-159, Method: Compositional matrix adjust.
Identities = 299/741 (40%), Positives = 448/741 (60%), Gaps = 61/741 (8%)
Query: 2 LSNLHLNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQV 61
++ + L L LS V+ + +V K+ + E LV+TVA+ ETDG +R ++A++N + +
Sbjct: 1 MTGMTLWVLTLSTVLMYGTVTMEKISGMPE--LLVVTVATEETDGLRRLKRTADINDVGL 58
Query: 62 KTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDIL 120
+ G+ + W GGD+ GGG K+ +L+ L++ +D+IIL D+YDVI+ G IL
Sbjct: 59 EVFGMGEQWRGGDVRVDKGGGQKIRILRKSLEKYKDRNDLIILFVDAYDVILLGNEEQIL 118
Query: 121 ERFNTF--DANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRS 178
F TF +VF +E CWP+ SL KYP V GYRYLNSG F+G+A +I LIS +
Sbjct: 119 RNFFTFFDGFRLVFSSEPFCWPNRSLAPKYPLVNFGYRYLNSGVFMGFAPEIWNLISYKD 178
Query: 179 IKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNT 238
+++ +DDQLYY L+LDE +R K+ LD+++ LFQNL G+ D+KL DE
Sbjct: 179 VEDNDDDQLYYTRLYLDEQIRMSLKMTLDSMSILFQNLNGASNDVKLEMS-DE------- 230
Query: 239 KYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVF 298
G Y L+ + P + +SV
Sbjct: 231 --------------------RSGTYF-------------------DLEKIELPRLFLSVI 251
Query: 299 IDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHN 358
I KP F+ EF I +L Y +KI ++VY NQ + + ++ + K ++++ Y
Sbjct: 252 ISKPIPFIREFFENIKSLVYADEKIDLYVYCNQNFLEKETNGFVEDVKGRYRSLLYDGST 311
Query: 359 STVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNR----NESLIAPLLVR 414
+ + +EAR +++ SL G D+ +D D HL+N + L +++R + ++APL+ +
Sbjct: 312 TELGEREARAFSLKQSLALGDDYLIMIDGDVHLNNSEALLLMIHRVKEKDSEILAPLVGQ 371
Query: 415 PFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK 474
P K ++NFWGA++++G+YARS +Y++II D GIWNVP+I++ ++ K T++
Sbjct: 372 PHKLFTNFWGAISSNGYYARSENYLDII--DYKEVGIWNVPFISSILIIAKE--KLTSLS 427
Query: 475 TIYTLN-SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
Y N +D DM+FC+ R+KG L +D++ YG LV SE+ + K +P++YE+ N
Sbjct: 428 NAYYYNDKLDPDMSFCSFARDKGHFLYLDNSHYYGFLVVSEDVESSKVHPDMYEIFNNKE 487
Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
W+ RYIHP Y +L + C DV+ FP+++E+FC E ++ E YG+WSDG + D+
Sbjct: 488 LWEKRYIHPNYFAALNGSIQILEICQDVYDFPLMSERFCAELIEECEYYGKWSDGKHKDE 547
Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
RL GYE VPTRDIHM Q+G W L +YV P+QE+ FIGY+ +PV + M FVVRY+
Sbjct: 548 RLVGGYENVPTRDIHMNQIGFERHWLYMLDEYVRPIQEKLFIGYYKQPVESVMMFVVRYK 607
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
P+EQ SLRPHHD+STY+I+IALN+ GVDYEGGG RF+RYNC A +G ++ PGRLTH
Sbjct: 608 PEEQASLRPHHDASTYSIDIALNKRGVDYEGGGVRFLRYNCTFDADTVGHSMIFPGRLTH 667
Query: 714 YHEGLQVTQGTRYIMISFVDP 734
HEGL+ TQGTRYI +SF++P
Sbjct: 668 LHEGLETTQGTRYIAVSFINP 688
>gi|312373903|gb|EFR21571.1| hypothetical protein AND_16831 [Anopheles darlingi]
Length = 902
Score = 567 bits (1462), Expect = e-159, Method: Compositional matrix adjust.
Identities = 282/604 (46%), Positives = 403/604 (66%), Gaps = 16/604 (2%)
Query: 35 LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDEM 94
LV TVASNET+GY R+I+SA + VKTLGL +PWLGGDM S+GGGYK+NLL+ L
Sbjct: 274 LVFTVASNETEGYLRYIRSANHYGISVKTLGLGKPWLGGDMKSVGGGYKINLLREALKPY 333
Query: 95 DITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV-GS 153
+ ++L TDSYDV+ I+++F TF+A++VFGAE CWPD SL YP + G
Sbjct: 334 RKESERLVLFTDSYDVVFLMPWEKIVQKFLTFNASVVFGAEGFCWPDESLKSLYPPLEGR 393
Query: 154 GYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLF 213
G R+LNSG F+GYA + ++ S K+ +DDQLYY ++LD+ LR + I LD +A+LF
Sbjct: 394 GMRFLNSGLFMGYADKLYLMLKTPS-KDTDDDQLYYTNVYLDKQLRNELNIKLDHMASLF 452
Query: 214 QNLYGSLEDIKLNFDLDEF-VHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSG 272
QNL G E + L+ + E L NT+Y + P ++HGNG SK+ LNS+ NYLA ++
Sbjct: 453 QNLNGVEEQVILSLEPSEAEATLKNTEYTSKPAVVHGNGPSKLALNSYANYLAGAFLDGV 512
Query: 273 CTRCNLIK-HLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQ 331
C + LD K + P V +++FI+KPT FLEE+ +KI LNYP ++ + V++
Sbjct: 513 CKTVEENRIQLDDEK--ELPLVTMALFIEKPTPFLEEWFDKITKLNYPGDRLDVLVHSGV 570
Query: 332 EYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHL 391
YH P+ ++ + ++++K I+H+ AR A ++ +G D+ F VDS+ HL
Sbjct: 571 AYHEPVVKAFLSQQEGRYRSLKSISHSDDHKEAVARAFATKHCRQRGCDYLFVVDSEGHL 630
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK-- 449
DNP+VL+ L+ N ++I+P+L RP K WSNFWGAL+ GFYARS DYM+I+ G K
Sbjct: 631 DNPNVLRALIEANRNVISPVLTRPEKVWSNFWGALSNQGFYARSNDYMDIV----GRKLL 686
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
G+WNVP+I+ YL+K S++ + Y+L D DMA C + R+KGI + + + ++YGH
Sbjct: 687 GLWNVPFISIVYLIKRSILPDVS----YSLKETDPDMAMCWHFRSKGIFMHVINMEQYGH 742
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
L+D+E FD +T+P+ Y+L N DW+ RY+ PEYQ+ L V QPCPDV+WF + T+
Sbjct: 743 LIDTEYFDMDRTHPDFYQLFNNRHDWEQRYLSPEYQQQLETTFVPKQPCPDVYWFAVGTD 802
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
+FC + +I+EA+G+WSDGT+ND RL+ GYEAVPTRDIHM QVGL VW +FL+ Y+ PL
Sbjct: 803 RFCDDLKEIVEAFGKWSDGTHNDNRLQGGYEAVPTRDIHMNQVGLEQVWLKFLQLYIRPL 862
Query: 630 QERE 633
QE++
Sbjct: 863 QEKD 866
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/68 (55%), Positives = 47/68 (69%), Gaps = 5/68 (7%)
Query: 670 TINIALNQVGVDYEGGGCRFIRY---NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
T +I +NQVG+ E +F++ TR GW+LMHPGRLTH+HEGL T+GTRY
Sbjct: 837 TRDIHMNQVGL--EQVWLKFLQLYIRPLQEKDTRKGWLLMHPGRLTHFHEGLLTTKGTRY 894
Query: 727 IMISFVDP 734
IMISFVDP
Sbjct: 895 IMISFVDP 902
>gi|146231842|gb|ABQ12996.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 precursor [Bos
taurus]
Length = 677
Score = 566 bits (1460), Expect = e-158, Method: Compositional matrix adjust.
Identities = 272/637 (42%), Positives = 413/637 (64%), Gaps = 11/637 (1%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY+RF+QSAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 47 VNPEKMLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 106
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DM+I+ DSYDV++ G +++L++F + ++F AE CWP+ L ++
Sbjct: 107 KKEMEKYAEREDMVIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 166
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 167 YPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLGLSLD 226
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L F + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 227 HKSRIFQNLNGALDEVVLKFGRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 285
Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P VL++VF+++PT FL FL ++ L+YP ++++
Sbjct: 286 WTPEGGCGFCNQGRRPLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 343
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
F++NN+ YH P D+ + F VK + + EAR++A++ +FYF
Sbjct: 344 FLHNNEVYHEPHIDESWPQLQDHFSAVKLVGPEEALTPGEARDMAMDICRQDPKCEFYFS 403
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ + NP L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++
Sbjct: 404 LDADTVITNPQTLRILIEANRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 463
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ G+WNVPYI+ Y+++ ++ + +++ + D DMAFC +LR+KGI L + +
Sbjct: 464 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 521
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + +D +P+++++ NPLDW +YIH Y ++L + + QPCPDV+W
Sbjct: 522 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYTRALEGEGLVEQPCPDVYW 581
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FP+++E+ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 582 FPLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 641
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSL 660
YV P+ E F GYH + RA M+FVVRYRPDEQPSL
Sbjct: 642 TYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSL 677
>gi|441649934|ref|XP_003276658.2| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
5-dioxygenase 3 [Nomascus leucogenys]
Length = 682
Score = 564 bits (1454), Expect = e-158, Method: Compositional matrix adjust.
Identities = 296/712 (41%), Positives = 420/712 (58%), Gaps = 69/712 (9%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 34 VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 93
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ G ++L++F + ++F AE CWP+ L ++
Sbjct: 94 KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213
Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
+ +FQNL G+L+++ L FD + V + N Y+T PV++HGNG +K++LN GNY+
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 272
Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
W GC CN + L +P P V ++VF+++PT FL FL ++
Sbjct: 273 WTPEGGCGFCNQDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRL------------ 318
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
L DY + T+F HN+ V
Sbjct: 319 -----------LLLDYPPDRVTLF------LHNNEV------------------------ 337
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNA--DGFYARSFDYMNIIN 443
P + + + A LV P +A S A D +Y RS DY+ ++
Sbjct: 338 -----FHEPHIADSWLQLQDHFSAVKLVGPEEALSPGEARDMAIPDEYYXRSEDYVELVQ 392
Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+ G+WNVPYI+ Y+++ ++ K +++ + D DMAFC + R+K I L +
Sbjct: 393 RKR--VGVWNVPYISQAYVIRGDTLRTEVPQKDVFSGSDTDPDMAFCKSFRDKCIFLHLS 450
Query: 503 STQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
+ E+G + + +D + +P DW +YIH Y ++L + + QPCPDV+
Sbjct: 451 NQYEFGWFLATSRYDTEHLHPXXXXXXXXXXDWKEQYIHENYSRALEGEGIVEQPCPDVY 510
Query: 563 WFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFL 622
WFP+++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG W + L
Sbjct: 511 WFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLL 570
Query: 623 RKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDY 682
R YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DY
Sbjct: 571 RTYVGPMTESLFPGYHTKGARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDY 630
Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
EGGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T GTRYIM+SFVDP
Sbjct: 631 EGGGCRFLRYDCMISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 682
>gi|348581630|ref|XP_003476580.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
5-dioxygenase 2-like [Cavia porcellus]
Length = 714
Score = 560 bits (1443), Expect = e-156, Method: Compositional matrix adjust.
Identities = 299/740 (40%), Positives = 434/740 (58%), Gaps = 74/740 (10%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ + K +I DK LVITVA+ E DGY RF+QSA+ VK LG + W GGD +S
Sbjct: 25 VGENAEKPASIPTDKLLVITVATKENDGYHRFMQSAKYFNYTVKVLGQGEEWKGGDGFNS 84
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ +DM+IL T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 85 IGGGQKVRLMKEVMEHYANQEDMVILFTECFDVIFAGGPEEVLKKFQKTNHKVVFAADGI 144
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L D+YP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D
Sbjct: 145 LWPDKRLADRYPVVHIGKRYLNSGGFIGYAPYINRIVQRWNLQDNDDDQLFYTKIYIDPL 204
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+ +++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 205 QREAFNITLDHKCKIFQALNGATDEVVLKFENGK-TRAKNTFYETLPVAINGNGPTKILL 263
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +F ++ +S +D+
Sbjct: 264 NYFGNYVPNSWTQDNGCTLC------------EFDTIDLSA-VDE--------------- 295
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
VY+ ++ A FD H T +K + ++ +ARN+ ++
Sbjct: 296 ----------VYHEKDIKA-FFDKAKHEIST----IKIVGPEEDLSQAKARNMGMDFCRQ 340
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF +D+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 341 DEKCDYYFSLDADVVLTNPRTLKLLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 400
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRN- 494
DY++I+ G + G NVPY+ N YL + ++ + + Y + A C N R
Sbjct: 401 EDYVDIVQGKRVAYG--NVPYMANVYLXRETL--RSEMMKDYFVRDRWIXYALCRNAREC 456
Query: 495 --------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLD 534
KG+ + I + E+G L+ + N++ N +++++ NP+D
Sbjct: 457 SLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVD 516
Query: 535 WDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKR 594
W +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R
Sbjct: 517 WKEQYINRDYSK-IFTENLVEQPCPDVFWFPIFSEKACDELVEEMENYGQWSGGKHHDSR 575
Query: 595 LETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRP 654
+ GYE VPT DIHMKQ+GL VW F+R+++ P+ + F GY+ + A ++FVV+Y P
Sbjct: 576 ISGGYENVPTDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSP 634
Query: 655 DEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHY 714
+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH
Sbjct: 635 ERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHL 694
Query: 715 HEGLQVTQGTRYIMISFVDP 734
HEGL V GTRYI +SF+DP
Sbjct: 695 HEGLPVKNGTRYIAVSFIDP 714
>gi|324502700|gb|ADY41187.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase [Ascaris suum]
Length = 731
Score = 558 bits (1439), Expect = e-156, Method: Compositional matrix adjust.
Identities = 283/737 (38%), Positives = 453/737 (61%), Gaps = 22/737 (2%)
Query: 9 CLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQ 68
+ LS +F + + N ++ V+TV D +R +SA +++Q+ L Q
Sbjct: 6 VVALSLSLFILRLDVNAATSLH-----VVTVVIEHQDALERLQRSANAHEIQLNILRHDQ 60
Query: 69 PWLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF--NTF 126
S LGGG K+ +L++ L+ D+I+L D+ II+G +IL+RF +
Sbjct: 61 L---ASSSHLGGGEKLRILRDGLEIYKDRSDLILLYVDANKAIINGRGEEILKRFMDSYS 117
Query: 127 DANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQ 186
++ IVF ++ C+PD L +YP V G R+LNS FIGYA I EL++++S++N D+Q
Sbjct: 118 NSQIVFSSDNYCFPDEELTQRYPIVEKGKRFLNSAAFIGYANKIWELLNSQSLENINDEQ 177
Query: 187 LYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVI 246
++Y FLDE LR + ++VLD+ + +F ++ S ++I L+F + ++TN + T+P+I
Sbjct: 178 IFYTHRFLDERLRNRLQMVLDSTSQIFHSVDVSKDEITLDFSDNGDAYITNVIHKTHPLI 237
Query: 247 IHGNGKSKIELNSFGNYLAKSWKTS-GCTRCNL--IKHLDSLKPDQFPSVLISVFIDKPT 303
IHG+ +K+ LN GNY+ K+W GC C+ + L ++P + +++ + KP
Sbjct: 238 IHGDESNKLMLNYLGNYIGKAWSADFGCRDCSAQRVNFLKDNAEQEWPKLTLAIMLAKPI 297
Query: 304 AFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNS 363
F+EEFL K+ L YPA KI +++Y+NQ+Y+ ++++ + + V++ + +
Sbjct: 298 PFVEEFLTKVEKLEYPASKIDLYLYSNQKYNEREVNEFLRRVRGKYSWVEWDSGEVEIGE 357
Query: 364 KEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNR----NESLIAPLLVRPFKAW 419
+EAR A++ ++ DF F +D++ H + +V+++++ N ++AP++ +P K +
Sbjct: 358 REARRTAIDAAIKANNDFVFLLDANVHFVDLNVIRWIIESALTMNLGILAPMVGKPNKFF 417
Query: 420 SNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTL 479
+NFWGA++ G+Y RS DY I+N + G+WNVP+I++ L+ ++ Y
Sbjct: 418 TNFWGAISPSGYYQRSEDYTEIVNYKR--VGVWNVPFISSAILINKQKMREIRDGFFYNT 475
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFD--PQKTNPEVYELIRNPLDWDL 537
+ +D D++FC R+ L +D+ + YG L DSE FD + +PE+Y++ N W+
Sbjct: 476 D-VDADLSFCQFARDNDHFLYVDNQRYYGFLADSETFDNGGKHLHPEMYQIFENRHLWES 534
Query: 538 RYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET 597
RY+HP+Y +L QPCPDV+ +P+++E F E ++ ME +GQWSDG N D+RL
Sbjct: 535 RYVHPDYFGALDGSGEIAQPCPDVYHYPLMSEIFARELIEEMENFGQWSDGKNEDERLAG 594
Query: 598 GYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQ 657
GYE VPT DIHM Q+ W FL +YV P+QE+ FIGY+ +PV A M FVVRY+P EQ
Sbjct: 595 GYENVPTIDIHMNQIDFQREWLYFLDEYVRPMQEKLFIGYYQKPVEALMMFVVRYQPGEQ 654
Query: 658 PSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEG 717
PSLR HHD+STYTI++ LN+ G DYEGGG R++RYNC V A ++G+ M PGRLTH HEG
Sbjct: 655 PSLRAHHDASTYTIDVPLNKRGRDYEGGGVRYVRYNCTVAADQVGYAAMFPGRLTHLHEG 714
Query: 718 LQVTQGTRYIMISFVDP 734
L VT+G RYI +SF++P
Sbjct: 715 LPVTKGIRYIAVSFLNP 731
>gi|350591622|ref|XP_003132514.3| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
5-dioxygenase 2, partial [Sus scrofa]
Length = 645
Score = 556 bits (1433), Expect = e-155, Method: Compositional matrix adjust.
Identities = 274/651 (42%), Positives = 406/651 (62%), Gaps = 30/651 (4%)
Query: 108 YDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYA 167
+DVI GG ++L++F + +VF A+ + WPD L DKYP V G RYLNSGGFIGYA
Sbjct: 1 FDVIFAGGPEEVLKKFQKSNHKVVFSADGILWPDKRLADKYPIVHIGKRYLNSGGFIGYA 60
Query: 168 KDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNF 227
I ++ ++++ +DDQL+Y +++D R I LD +FQ L G+++++ L F
Sbjct: 61 PYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREALNITLDHKCKIFQTLNGAVDEVVLKF 120
Query: 228 DLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLK 286
+ + N Y T PV I+GNG +KI LN FGNY+ +W + +GCT C+ + +D
Sbjct: 121 ENGK-ARAKNVFYETLPVAINGNGPTKILLNYFGNYVPNAWTQDNGCTLCD-VDTIDLSA 178
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFK 346
D P+V I VFI++PT FL FL+ + L+YP + + +F++N + YH + K
Sbjct: 179 VDVHPNVTIGVFIEQPTPFLPRFLDTLLTLDYPKEALKLFIHNKEVYHEKNIKVFFDKAK 238
Query: 347 TMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNE 405
+K + ++ EARN+ ++ + ++YF VD+D L NP LK L+ +N
Sbjct: 239 HEITTIKIVGPEENLSQAEARNMGMDFCRQDENCNYYFSVDADVVLTNPRTLKILIEQNR 298
Query: 406 SLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKT 465
+IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G++ GIWNVPY+ N YL+K
Sbjct: 299 KIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGIWNVPYMANVYLIKG 356
Query: 466 SVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDS 503
+++ N + + + +D DMA C N R KG+ + + +
Sbjct: 357 KTLRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYVSN 416
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
E+G L+ + N++ + +++++ NP+DW +YI+ +Y K + + + QPCPDVFW
Sbjct: 417 RHEFGRLLSTANYNISHYHNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFW 475
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
FPI +EK C E V+ ME YGQWS G ++D R+ GYE VPT DIHMKQ+ L VW F+R
Sbjct: 476 FPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIR 535
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++
Sbjct: 536 EFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQ 594
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 595 GGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 645
>gi|395734248|ref|XP_002814197.2| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
5-dioxygenase 2 [Pongo abelii]
Length = 909
Score = 552 bits (1423), Expect = e-154, Method: Compositional matrix adjust.
Identities = 279/674 (41%), Positives = 410/674 (60%), Gaps = 15/674 (2%)
Query: 66 LHQPWLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNT 125
+HQP+ G ++ K + + + + + S VI GG ++F
Sbjct: 246 IHQPFRGAGQVTV----KSDFTGEQFYDEECHESFHKWECLSLIVIFAGGPEKFXKKFLK 301
Query: 126 FDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDD 185
+ +VF A+ + WPD L DKYP V G RYLNSGGFIGYA + ++ ++++ +DD
Sbjct: 302 ANHKVVFAADGILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDD 361
Query: 186 QLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPV 245
QL+Y +++D R I LD +FQ L G+++++ L F+ + NT Y T PV
Sbjct: 362 QLFYTKIYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPV 420
Query: 246 IIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTA 304
I+GNG +KI LN FGNY+ SW + +GCT C +D D P+V I VFI++PT
Sbjct: 421 AINGNGPTKILLNYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTP 479
Query: 305 FLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSK 364
FL FL+ + L+YP + + +F++N + YH + K K +K ++
Sbjct: 480 FLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIQVFFDKAKHEIKTIKLGXPQKNLSQA 539
Query: 365 EARNLAVENSLHK---GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
EA+ + D+YF VD+D L NP LK L+ +N +IAPL+ R K WSN
Sbjct: 540 EAQKHGNGWDFCRQDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSN 599
Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLN 480
FWGAL+ DG+YARS DY++I+ G++ G+WNVPY+ N YL+K +++ N + + +
Sbjct: 600 FWGALSPDGYYARSEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRD 657
Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
+D DMA C N R G+ + I + E+G L+ + N++ N +++++ NP+DW +YI
Sbjct: 658 KLDPDMALCRNAREMGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYI 717
Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYE 600
+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+ GYE
Sbjct: 718 NRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYE 776
Query: 601 AVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSL 660
VPT DIHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SL
Sbjct: 777 NVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSL 835
Query: 661 RPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQV 720
RPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEGL V
Sbjct: 836 RPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPV 895
Query: 721 TQGTRYIMISFVDP 734
GTRYI +SF+DP
Sbjct: 896 KNGTRYIAVSFIDP 909
>gi|77745212|gb|ABB02507.1| procollagen-lysine 2-oxoglutarate 5-dioxygenase 2 [Sus scrofa]
Length = 640
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 271/645 (42%), Positives = 402/645 (62%), Gaps = 30/645 (4%)
Query: 114 GGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKEL 173
GG ++L++F + +VF A+ + WPD L DKYP V G RYLNSGGFIGYA I +
Sbjct: 2 GGPEEVLKKFQKSNHKVVFSADGILWPDKRLADKYPIVHIGKRYLNSGGFIGYAPYINRI 61
Query: 174 ISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFV 233
+ ++++ +DDQL+Y +++D R I LD +FQ L G+++++ L F+ +
Sbjct: 62 VQQWNLQDNDDDQLFYTKIYIDPLKREALNITLDHKCKIFQTLNGAVDEVVLKFENGK-A 120
Query: 234 HLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPS 292
N Y T PV I+GNG +KI LN FGNY+ +W + +GCT C+ + +D D P+
Sbjct: 121 RAKNVFYETLPVAINGNGPTKILLNYFGNYVPNAWTQDNGCTLCD-VDTIDLSAVDVHPN 179
Query: 293 VLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNV 352
V I VFI++PT FL FL+ + L+YP + + +F++N + YH + K +
Sbjct: 180 VTIGVFIEQPTPFLPRFLDTLLTLDYPKEALKLFIHNKEVYHEKNIKVFFDKAKHEITTI 239
Query: 353 KYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPL 411
K + ++ EARN+ ++ + ++YF VD+D L NP LK L+ +N +IAPL
Sbjct: 240 KIVGPEENLSQAEARNMGMDFCRQDENCNYYFSVDADVVLTNPRTLKILIEQNRKIIAPL 299
Query: 412 LVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA- 470
+ R K WSNFWGAL+ DG+YARS DY++I+ G++ GIWNVPY+ N YL+K +++
Sbjct: 300 VTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGIWNVPYMANVYLIKGKTLRSE 357
Query: 471 TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQEYGH 509
N + + + +D DMA C N R KG+ + + + E+G
Sbjct: 358 MNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYVSNRHEFGR 417
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
L+ + N++ + +++++ NP+DW +YI+ +Y K + + + QPCPDVFWFPI +E
Sbjct: 418 LLSTANYNISHYHNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSE 476
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
K C E V+ ME YGQWS G ++D R+ GYE VPT DIHMKQ+ L VW F+R+++ P+
Sbjct: 477 KACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIREFIAPV 536
Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
+ F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F
Sbjct: 537 TLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKF 595
Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+RYNC++ + R GW MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 596 LRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 640
>gi|296479150|tpg|DAA21265.1| TPA: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor
[Bos taurus]
Length = 667
Score = 545 bits (1404), Expect = e-152, Method: Compositional matrix adjust.
Identities = 265/649 (40%), Positives = 414/649 (63%), Gaps = 10/649 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W G M + GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWPGEAMLA-GGGLKVRLLKKA 83
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ ++++IL TDSYDV+ G ++L++F + +VF AE L +PD L YP
Sbjct: 84 LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEANYPV 143
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 144 VSDGKRFLGSGGFIGYAPNLIKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 203
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQN +G+L+++ L F++ + V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 204 RIFQNFHGALDEVVLKFEMGQ-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 262
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC C+ ++ L + + P+VL+ VFI++PT FL F ++ L+YP K+ +F++
Sbjct: 263 ETGCAVCDEGLRSLKGIGDEALPAVLVGVFIEQPTPFLSLFFQRLLLLHYPQKRFRLFIH 322
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
N++++H + ++ +++VK + V + +ARN+ + +G +YF VD+
Sbjct: 323 NHEQHHKAQVEQFLAEHGDEYQSVKLVGPEVRVANADARNMGADLCRQDRGCTYYFSVDA 382
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D L P L+ L+ +N+++I PL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 383 DVALTEPKTLRLLIEQNKNVITPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 441
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPYI+N YL+K S ++A +T ++ + +D DMAFC N+R + + + + +
Sbjct: 442 -VGVWNVPYISNIYLIKGSALRAELQETDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHS 500
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GHL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 501 FGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKMV-EMPCPDVYWFPI 559
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 560 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQINYEREWHKFLVEYI 619
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIAL 675
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINI L
Sbjct: 620 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLVPHHDASTFTINIGL 667
>gi|358254466|dbj|GAA55390.1| lysyl hydroxylase/galactosyltransferase/ glucosyltransferase
[Clonorchis sinensis]
Length = 623
Score = 542 bits (1396), Expect = e-151, Method: Compositional matrix adjust.
Identities = 281/628 (44%), Positives = 401/628 (63%), Gaps = 23/628 (3%)
Query: 119 ILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG-SGYRYLNSGGFIGYAKDIKELISNR 177
+LE + + ++FGAE CWPD +L D YP VG R+LNSGGFIG A + +++
Sbjct: 7 LLEEYEKSNYTVLFGAEGFCWPDKNLADMYPQVGPREKRFLNSGGFIGPASHLYRIVTET 66
Query: 178 SIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTN 237
I ++ DDQLYY ++L+ LR + I LDT + +FQNL+G+ +++L+F D +L N
Sbjct: 67 EIADDRDDQLYYTNIYLNRALREQLNIGLDTKSLIFQNLHGAFTEVELHFTNDT-GYLVN 125
Query: 238 TKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTS-GCTRCNL-IKHLDSLKPDQFPSVLI 295
TK NT P++ HGNG K E NS NYL SW S GC CN I LD K +FP++ +
Sbjct: 126 TKTNTRPIVAHGNGPIKPEFNSLTNYLDHSWTPSMGCQHCNEGIIDLD--KQGEFPTIQL 183
Query: 296 SVFIDKPTAFLEEFLNKIANLNYPAKKISM--FVYNNQEYHAPLFDDYIHNFKTMFKNVK 353
S+FI+ PT FL+ F ++IA L+YP I + + E PL +++I + ++K
Sbjct: 184 SIFIEYPTPFLDVFFDRIAALSYPKTHIHLTGHIGRKAEQQTPLVNEFIKKHGHNYLSIK 243
Query: 354 YIAHNSTVNSKEARNLAVENSLHKGVD---FYFYVDSDSHLDNPDVLKYLVNRNESLIAP 410
+ + V+ AR+ A + L VD F F VD+ L NP L++L+ N S+IAP
Sbjct: 244 WFYPDELVDEGSARDHAYAHCL--AVDTCRFMFSVDAVVQLTNPHTLEHLIRMNRSMIAP 301
Query: 411 LLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA 470
+L R K WSNFWGAL+ DG+Y RS DY+ I+ ++ GIWNVP+I + YL+ +A
Sbjct: 302 MLSRREKLWSNFWGALSRDGYYERSDDYIEIV--ERKRVGIWNVPFIRDAYLLSR---RA 356
Query: 471 TNIKTIYTLNSMD-YDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELI 529
++ L+ +D +M + R + I + +D+ + YG+LV +E + + N +++++
Sbjct: 357 VHVFAKNKLSGIDGLEMRIPSIARQENIFMTLDNMEPYGYLVQAETYTTEHVNNDLWQIF 416
Query: 530 RNPLDWDLRYIHPEYQKSLLPDTVNN---QPCPDVFWFPIVTEKFCHEFVQIMEAYGQWS 586
NPLDW+ +Y+HP+Y K L P+ + QPCPDVF+ PIVT KFC + + +E +G WS
Sbjct: 417 DNPLDWEEQYVHPDYFKYLAPEVGMSDFKQPCPDVFYLPIVTTKFCRQLIAEVEEFGLWS 476
Query: 587 DGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPM 646
DGTN D RLE GYE VPTRDIHM+Q+ W FL KYV P+Q++ F GY +P A M
Sbjct: 477 DGTNIDPRLEGGYENVPTRDIHMRQINWEDHWLHFLVKYVHPIQKKLFAGYEDKPW-ARM 535
Query: 647 SFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLM 706
+FVVRYRPDEQ SLRPHHD+S+YT+NIALN+ GVD+EGGG F+RYNC+V ++GW +
Sbjct: 536 NFVVRYRPDEQASLRPHHDASSYTLNIALNEAGVDFEGGGTGFVRYNCSVVRAKVGWAAV 595
Query: 707 HPGRLTHYHEGLQVTQGTRYIMISFVDP 734
PGR+TH HEGL T GTRYI ++F++P
Sbjct: 596 FPGRVTHLHEGLTTTSGTRYIFVTFINP 623
>gi|426327853|ref|XP_004024724.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
[Gorilla gorilla gorilla]
Length = 710
Score = 534 bits (1376), Expect = e-149, Method: Compositional matrix adjust.
Identities = 253/577 (43%), Positives = 379/577 (65%), Gaps = 9/577 (1%)
Query: 162 GFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLE 221
GFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD +FQNL G+L+
Sbjct: 139 GFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCRIFQNLDGALD 198
Query: 222 DIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-I 279
++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W +GCT C+ +
Sbjct: 199 EVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFETGCTVCDEGL 257
Query: 280 KHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFD 339
+ L + + P+VL+ VFI++PT F+ F ++ L+YP K + +F++N++++H +
Sbjct: 258 RSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIHNHEQHHKAQVE 317
Query: 340 DYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLK 398
+++ + +++VK + + + +ARN+ + + +YF VD+D L P+ L+
Sbjct: 318 EFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDADVALTEPNSLR 377
Query: 399 YLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYIT 458
LV +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G + G+WNVPYI+
Sbjct: 378 LLVQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR--VGVWNVPYIS 435
Query: 459 NCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFD 517
N YL+K S ++ ++ + +D DMAFC N+R + + + + + GHL+ +++
Sbjct: 436 NIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTLGHLLSLDSYR 495
Query: 518 PQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQ 577
+ +++E+ NP DW +YIH Y K+L V PCPDV+WFPI TE C E V+
Sbjct: 496 TTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIFTEVACDELVE 554
Query: 578 IMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGY 637
ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +Y+ P+ E+ + GY
Sbjct: 555 EMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYIAPMTEKLYPGY 614
Query: 638 HHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVT 697
+ V RY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGCRF+RYNC++
Sbjct: 615 YTR-VXXXXXXXXRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGCRFLRYNCSIR 673
Query: 698 ATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 674 APRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 710
>gi|403289984|ref|XP_003936115.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
[Saimiri boliviensis boliviensis]
Length = 674
Score = 529 bits (1363), Expect = e-147, Method: Compositional matrix adjust.
Identities = 285/715 (39%), Positives = 412/715 (57%), Gaps = 76/715 (10%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LVITVA+ ET+G++RF +SA+ ++++LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVITVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWNVEKRTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYPA
Sbjct: 85 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPA 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGG + ++ + + + L+ + I LD
Sbjct: 145 VSDGKRFLGSGG----ERPGCRVLGPAEGPIHSSPRAFPYCSLVPSFLQDQINITLDHRC 200
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 201 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 259
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P+VL+ VFI++PT F+ F ++ L+YP K I +
Sbjct: 260 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPRKHIRL--- 316
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSD 388
+IHN +++ +V S L G F
Sbjct: 317 ------------FIHN---------HVSSRHSVGSS--------CGLGPGPSF------- 340
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 341 -----------------TXXXXXXXRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 381
Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
G+WNVPYI+N YL+K S ++A ++ +D DMAFC N+R + + + + +
Sbjct: 382 VGVWNVPYISNIYLIKGSALRAELQSPDLFHHRKLDPDMAFCANIRQQDVFMFLTNRHGL 441
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY--------QKSLLPDTVNNQPCP 559
GHL+ +N+ + +++E+ NP + L + H ++ Q LP PCP
Sbjct: 442 GHLLSLDNYRTTHLHNDLWEVFSNP-EVRLGWAHSDWEQRGPGILQSEALPSQNLTIPCP 500
Query: 560 DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
DV+WFPI TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W
Sbjct: 501 DVYWFPIFTEAACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQIGFEREWH 560
Query: 620 EFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
+FL +Y+ P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG
Sbjct: 561 KFLLEYIAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVG 619
Query: 680 VDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
VDYEGGGCRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 620 VDYEGGGCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 674
>gi|332250431|ref|XP_003274354.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
5-dioxygenase 1 [Nomascus leucogenys]
Length = 681
Score = 524 bits (1349), Expect = e-146, Method: Compositional matrix adjust.
Identities = 270/708 (38%), Positives = 404/708 (57%), Gaps = 81/708 (11%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LVITVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 51 EDNLLVITVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 110
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 111 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 170
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 171 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 230
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT 270
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K E
Sbjct: 231 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKDE-------------- 275
Query: 271 SGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN 330
P+VL+ VFI++PT F+ F ++ L+YP K + +F++N+
Sbjct: 276 ------------------ALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIHNH 317
Query: 331 QEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDS 389
+++H ++++ + +++VK + + + +ARN+ + + +YF VD+D
Sbjct: 318 EQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDADV 377
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L P+ L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 378 ALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR--V 435
Query: 450 GIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYG 508
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R + + + + + G
Sbjct: 436 GVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTLG 495
Query: 509 HLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVT 568
HL+ +++ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI T
Sbjct: 496 HLLSLDSYRTTHLHNDLWEVFGNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIFT 554
Query: 569 EKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR--KYV 626
E C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G W +FL +
Sbjct: 555 EVACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLAGTTSL 614
Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
P F+ P+ GGG
Sbjct: 615 CPTPGPRFVRSSRLPI-----------------------------------------GGG 633
Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
CRF+RYNC+V A R GW LMHPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 634 CRFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 681
>gi|345308392|ref|XP_001516384.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
[Ornithorhynchus anatinus]
Length = 674
Score = 523 bits (1346), Expect = e-145, Method: Compositional matrix adjust.
Identities = 250/622 (40%), Positives = 396/622 (63%), Gaps = 13/622 (2%)
Query: 27 KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVN 85
+ ++ +K LV+T A+ ET+GYKRF+++A V+TLGL + W GGD++ ++GGG KV
Sbjct: 35 ERVNPEKLLVMTAATEETEGYKRFLRTARHFNYTVRTLGLGEEWRGGDVARTVGGGQKVR 94
Query: 86 LLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLY 145
LK E+++ +D++IL DSYDV++ G ++L +F + ++F AE CWP+ SL
Sbjct: 95 WLKQEMEKHADREDLVILFVDSYDVLLAGSPLELLWKFVQSGSRLLFSAEGFCWPEWSLA 154
Query: 146 DKYP--AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHK 203
D YP + G+G R+LNSGGFIG+A + L+ K+++DDQL+Y L+LD LR KH
Sbjct: 155 DSYPPLSAGNGKRFLNSGGFIGFAPTVHRLVRQWKYKDDDDDQLFYTRLYLDPGLREKHG 214
Query: 204 IVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNY 263
+ LD + +FQNL G+L+++ L F+ + V + N Y+T PV+IHGNG +K++LN GNY
Sbjct: 215 LALDHKSRIFQNLNGALDEVVLKFEKNR-VRVRNVAYDTLPVVIHGNGPTKLQLNYLGNY 273
Query: 264 LAKSWK-TSGCTRCNLIKHLDSLKPD-QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAK 321
+ +W GC C + +L D + P VL+ +F+++PT FL +FL ++ L+YP+
Sbjct: 274 VPNAWTYEGGCGFCAQDRR--NLTGDSELPRVLLGLFVEQPTPFLPQFLQRLLLLDYPSS 331
Query: 322 KISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVD 380
++S+F++N++ YH + +T F V+ + + EAR++A+++ D
Sbjct: 332 RLSLFLHNSEVYHEAHVEALWEQLRTRFSTVQLVGPEEALTQGEARDMAMDSCRQDPSCD 391
Query: 381 FYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMN 440
FYF +D+D+ L NP L L+ + ++AP+L R K WSNFWGAL+ + +YARS DY+
Sbjct: 392 FYFSLDADAVLTNPRTLLSLIEEDRKVVAPMLSRHGKLWSNFWGALSPEEYYARSEDYVE 451
Query: 441 IINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHL 499
++ + G+WNVPY+ YL++ +++ + ++TL D DM+FC +LR+KGI L
Sbjct: 452 LVQRKR--VGLWNVPYVAQAYLVRGETLRSELPQRGVFTLEETDPDMSFCKSLRDKGIFL 509
Query: 500 KIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCP 559
+ + +E+G LV + +D +P+++++ NPLDW +YIHP Y +L + V QPCP
Sbjct: 510 HLSNQEEFGRLVSTARYDTDHLHPDLWQIFDNPLDWREKYIHPNYSLALEGEGVE-QPCP 568
Query: 560 DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
DV+WFP+++++ C E V+ ME +GQWS G + D RL GYE VPT DIHM QVG W
Sbjct: 569 DVYWFPVLSDRMCDELVEEMENFGQWSGGRHEDTRLAGGYENVPTVDIHMNQVGYEKEWL 628
Query: 620 EFLRKYVVPLQEREFIGYHHEP 641
+ L +Y+ P+ E F GYH +P
Sbjct: 629 KVLSEYIAPMTESLFPGYHTKP 650
>gi|313237914|emb|CBY13041.1| unnamed protein product [Oikopleura dioica]
Length = 747
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 294/751 (39%), Positives = 438/751 (58%), Gaps = 45/751 (5%)
Query: 18 FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS- 76
+S+ N V N E LVITVA+ +TDGY R+ +S + L+ +T G+ + WLGGD++
Sbjct: 6 LVSLLSNSVLNARE--LLVITVATEKTDGYLRWEESVRYSGLKSRTFGIGEDWLGGDLTN 63
Query: 77 SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFN------TFDANI 130
GGG+KVNLLK EL E ++ L TD+YDVII+G +I RF+ + N+
Sbjct: 64 GPGGGHKVNLLKKELAEYKGNSELYFLFTDAYDVIINGKEEEIFSRFDDIVSKVEYKTNV 123
Query: 131 VFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYA 190
+ AE L WPD SL KYP V G R+L SG + A +L+ R+I + +DDQL+Y
Sbjct: 124 LISAEDLIWPDASLEPKYPLV-LGKRFLCSGAILARADVFLDLLEYRAIGDRDDDQLFYT 182
Query: 191 LLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLT--NTKYNTNPVIIH 248
FL++ L+ K I LD A LF NL G+LE++ ++F T NTKY T P++IH
Sbjct: 183 EAFLNKELKEKFGIALDHKAELFFNLNGALEEVGIDFARSATGDNTVENTKYRTKPLVIH 242
Query: 249 GNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPS--VLISVFIDKPTAF 305
GNG SK ELN NY+ + W+ GC C+ + + + +K D S ++I+ ID T F
Sbjct: 243 GNGPSKNELNRISNYVPQGWRPDYGCPACSKVLN-EEIKEDIDTSKDIVIAFIIDGITPF 301
Query: 306 LEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKE 365
+ L +IA+L+YPA+K + +Y+N + D ++ F + +K+ K+I+ ++
Sbjct: 302 VHNSLKRIASLDYPAEKTHLLIYSNTVWADERVDTFLEVFGSSYKSTKFISSKEKMSITM 361
Query: 366 ARNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
AR A++ + K +F F+VD L NP V+ L+ N L+AP + R K WSN+WG
Sbjct: 362 ARKFALQLTDEKFSAEFVFFVDGYVQLTNPAVIGELIKTNVELVAPGMSRYGKLWSNYWG 421
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-- 482
A+ +DGFY+RS DY++I+ G + GIWN+P++ YL+ ++ A ++ I+ S
Sbjct: 422 AVASDGFYSRSDDYLDIVQGTR--VGIWNMPFVNGAYLVHKNL--AADLIDIFAGISQSP 477
Query: 483 ------DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWD 536
D D+ F +NLR GI + + + +G LVD E+ + +PE+++ N DW+
Sbjct: 478 WQGKFNDPDLDFASNLRTLGIFMHVTNQAYWGRLVDREHMPVDRIHPELWQPEWNRPDWE 537
Query: 537 LRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDG--TNNDKR 594
Y+ +Y + L P+T ++PCPDV FP ++ K + ++ ME YG+WS G + D+R
Sbjct: 538 EDYLDTDYWRVLEPETEMDEPCPDVVAFPFLSSKGGFDMIEEMEHYGKWSGGNEAHTDER 597
Query: 595 LETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAP-MSFVVRYR 653
L GYE VPT DIHM Q+GL W ++ Y P+ + + GY+ P P + FVVRY+
Sbjct: 598 LAGGYENVPTVDIHMNQIGLHDEWMYVVKTYAAPMVSKFYTGYN--PDNKPNLMFVVRYK 655
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTAT-----------RMG 702
P EQ LRPHHDSST+T IALN+ +D+EGGG F RY C+V + + G
Sbjct: 656 PGEQDRLRPHHDSSTWTFQIALNRPNIDFEGGGTYFTRYKCSVVGSATEQDSRSLEVKQG 715
Query: 703 WMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
PGRLTH H GL T+GTRYI+++F+D
Sbjct: 716 MGFAFPGRLTHQHAGLPTTKGTRYILVNFMD 746
>gi|313247226|emb|CBY36038.1| unnamed protein product [Oikopleura dioica]
Length = 747
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 293/751 (39%), Positives = 437/751 (58%), Gaps = 45/751 (5%)
Query: 18 FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS- 76
+S+ N V N E LVITVA+ +TDGY R+ +S + L+ +T G + WLGGD++
Sbjct: 6 LVSLLSNSVLNARE--LLVITVATEKTDGYLRWEESVRYSGLKSRTFGTGEDWLGGDITN 63
Query: 77 SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFN------TFDANI 130
GGG+KVNLLK EL ++ L TD+YDVII+G +I RF+ + N+
Sbjct: 64 GPGGGHKVNLLKKELAGYKGNSELYFLFTDAYDVIINGKEEEIFSRFDDIVSKVEYKTNV 123
Query: 131 VFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYA 190
+ AE L WPD SL KYP V G R+L SG + A +L+ R+I + +DDQL+Y
Sbjct: 124 LISAEDLIWPDASLEPKYPLV-LGKRFLCSGAILARADVFLDLLEYRAIGDRDDDQLFYT 182
Query: 191 LLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLT--NTKYNTNPVIIH 248
+L++ L+ K I LD A LF NL G+LE++ ++F T NTKY T P++IH
Sbjct: 183 EAYLNKELKEKFGIALDHKAELFFNLNGALEEVGIDFARSATGDNTVENTKYRTKPLVIH 242
Query: 249 GNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPS--VLISVFIDKPTAF 305
GNG SK ELN NY+ + W+ GC C+ + + + +K D S ++I+ ID T F
Sbjct: 243 GNGPSKNELNRISNYVPQGWRPDYGCPACSKVLN-EEIKEDIDTSKEIVIAFIIDGITPF 301
Query: 306 LEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKE 365
+ L +IA+L+YPA+K + +Y+N + D ++ F + +K+ K+I+ ++
Sbjct: 302 VHNSLKRIASLDYPAEKTHLLIYSNTVWADERVDTFLEVFGSSYKSTKFISSKEKMSVTM 361
Query: 366 ARNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
AR A++ + K V+F F+VD L NP V+ L+ N L+AP + R K WSN+WG
Sbjct: 362 ARKFALQKTYEKFSVEFVFFVDGYVQLTNPAVIGELIKTNVELVAPGMSRYGKLWSNYWG 421
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-- 482
A+ +DGFY+RS DY++I+ G + GIWN+P++ YL+ ++ A ++ I+ S
Sbjct: 422 AVASDGFYSRSDDYLDIVQGTR--VGIWNMPFVNGAYLVHKNL--AADLIDIFAGISQSP 477
Query: 483 ------DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWD 536
D D+ F +NLR GI + + + +G LVD E+ + +PE+++ N DW+
Sbjct: 478 WQGKFNDPDLDFASNLRTLGIFMHVTNQAYWGRLVDREHMPVDRIHPELWQPEWNRPDWE 537
Query: 537 LRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDG--TNNDKR 594
Y+ +Y + L P+T ++PCPDV FP ++ K + ++ ME YG+WS G + D+R
Sbjct: 538 EDYLDSDYWRVLEPETEMDEPCPDVVAFPFLSSKGGFDMIEEMEHYGKWSGGNEAHTDER 597
Query: 595 LETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAP-MSFVVRYR 653
L GYE VPT DIHM Q+GL W ++ Y P+ + + GY+ P P + FVVRY+
Sbjct: 598 LAGGYENVPTVDIHMNQIGLQDEWLYVVKTYAAPMVSKFYTGYN--PDNKPNLMFVVRYK 655
Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTAT-----------RMG 702
P EQ LRPHHDSST+T IALN+ +D+EGGG F RY C+V + + G
Sbjct: 656 PGEQDRLRPHHDSSTWTFQIALNRPNIDFEGGGTYFTRYKCSVVGSATEQDSRSLEVKQG 715
Query: 703 WMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
PGRLTH H GL T+GTRYI+++F+D
Sbjct: 716 MGFAFPGRLTHQHAGLPTTKGTRYILVNFMD 746
>gi|426342442|ref|XP_004037854.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
[Gorilla gorilla gorilla]
Length = 783
Score = 509 bits (1311), Expect = e-141, Method: Compositional matrix adjust.
Identities = 254/624 (40%), Positives = 385/624 (61%), Gaps = 9/624 (1%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 125 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 184
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K ++ DD++++ T+ +DVI GG ++L++F + +VF A+ +
Sbjct: 185 IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 244
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA + ++ ++++ +DDQL+Y +++D
Sbjct: 245 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 304
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+++++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 305 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 363
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C +D D P+V I VFI++PT FL FL+ + L
Sbjct: 364 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 422
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
+YP + + +F++N + YH + K K +K + ++ EARN+ ++
Sbjct: 423 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 482
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 483 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 542
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 543 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 600
Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
G+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + +
Sbjct: 601 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 659
Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+ GYE VPT DIHMKQV L
Sbjct: 660 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 719
Query: 615 AGVWAEFLRKYVVPLQEREFIGYH 638
VW F+R+++ P+ + F GY+
Sbjct: 720 ENVWLHFIREFIAPVTLKVFAGYY 743
>gi|351698763|gb|EHB01682.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Heterocephalus
glaber]
Length = 828
Score = 498 bits (1282), Expect = e-138, Method: Compositional matrix adjust.
Identities = 241/549 (43%), Positives = 348/549 (63%), Gaps = 41/549 (7%)
Query: 221 EDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKI------------------------- 255
+++ L FD + V + N Y+T PV++HGNG +K+
Sbjct: 286 DEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKVPPSSLCPCLPQALGTLSFLSSHYSP 344
Query: 256 ------ELNSFGNYLAKSW-KTSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLE 307
+LN GNY+ W GC CN + L +P P VL++VF+++PT FL
Sbjct: 345 APATELQLNYLGNYVPNGWTPQGGCGFCNRDQRTLPGGQPP--PRVLLAVFVEQPTPFLP 402
Query: 308 EFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEAR 367
FL ++ L+YP ++++F++N++ YH P D + F++VK + ++ EAR
Sbjct: 403 RFLQRLLLLDYPPDRVTLFLHNSEVYHEPHIADSWPQLQDHFESVKLVGPEEDLSPGEAR 462
Query: 368 NLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
++A++ +FYF +D+D+ L N L+ L+ +N +IAP+L R K WSNFWGAL
Sbjct: 463 DMAMDTCRQDPECEFYFSLDADAVLTNQQTLRILIEQNRKVIAPMLSRHGKLWSNFWGAL 522
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYD 485
+ D +YARS DY+ ++ + G+WNVPYI+ YL++ ++ + +++ + MD D
Sbjct: 523 SPDEYYARSEDYVELVQRKR--LGVWNVPYISQAYLIQGETLRTELPQREVFSSSDMDPD 580
Query: 486 MAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQ 545
MAFC NLR++GI L + + QE+G L+ + +D +P+++++ NP+DW +YIH Y
Sbjct: 581 MAFCMNLRDRGIFLHLSNQQEFGRLLATSRYDTDHLHPDLWQIFDNPVDWKEQYIHENYS 640
Query: 546 KSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTR 605
++L + QPCPDV+WFP+++E+ C E V+ ME YGQWS G + D RL GYE VPT
Sbjct: 641 QALDGKDLVEQPCPDVYWFPLLSEQMCDELVEEMENYGQWSGGRHEDSRLAGGYENVPTV 700
Query: 606 DIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHD 665
DIHMKQVG W + LR YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHD
Sbjct: 701 DIHMKQVGYEDQWLQLLRTYVGPMTEHLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHD 759
Query: 666 SSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
SST+T+N+ALN G+DYEGGGCRF+RYNC +++ R GW L+HPGRLTHYHEGL T+GTR
Sbjct: 760 SSTFTLNVALNHKGLDYEGGGCRFLRYNCIISSPRKGWGLLHPGRLTHYHEGLPTTRGTR 819
Query: 726 YIMISFVDP 734
YIM+SFVDP
Sbjct: 820 YIMVSFVDP 828
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 90/193 (46%), Positives = 134/193 (69%), Gaps = 1/193 (0%)
Query: 29 IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
++ +K LVITVA+ ET+GY+RF+Q+AE V+TLGL + W GGD++ ++GGG KV L
Sbjct: 37 VNPEKLLVITVATAETEGYRRFLQTAEFFNYTVRTLGLGKEWRGGDVARTVGGGQKVRWL 96
Query: 88 KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
K E+++ +DMII+ DSYDVI+ G ++L++F + ++F AE CWPD L ++
Sbjct: 97 KKEMEKYADQEDMIIMFVDSYDVILAGSPTELLKKFVQSSSRLLFSAEGFCWPDWGLAEQ 156
Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
YP VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD
Sbjct: 157 YPEVGTGKRFLNSGGFIGFAPTIHQIVHQWKYKDDDDDQLFYTRLYLDPGLREKFSLNLD 216
Query: 208 TLANLFQNLYGSL 220
+ +FQNL G+L
Sbjct: 217 HKSRIFQNLNGAL 229
>gi|344256859|gb|EGW12963.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Cricetulus
griseus]
Length = 659
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 257/656 (39%), Positives = 401/656 (61%), Gaps = 37/656 (5%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
D LV+TVA+ ET+G++RF +SA+ +++ + P L S G+
Sbjct: 19 DNLLVLTVATKETEGFRRFKRSAQFFNYKIQWV----PSLDPASPSPRFGH--------- 65
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
SYDV+ G ++L++F + +VF AE L +PD L KYP V
Sbjct: 66 ---------------SYDVVFASGPRELLKKFQQAKSRVVFSAEELIYPDRRLEAKYPTV 110
Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I L +
Sbjct: 111 SDGKRFLGSGGFIGYAPNLNKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINISLGHSCS 170
Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 171 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVIHGNGPTKLQLNYLGNYIPRFWTFE 229
Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GCT C+ ++ L + + P+VL+ VFI++PT FL F ++ L YP K++ +F++N
Sbjct: 230 TGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFLRLLRLRYPQKRMRLFIHN 289
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
++++H + ++ T +++VK + + + +ARN+ + + +YF VD+D
Sbjct: 290 HEQHHKLEVEKFLAEHGTEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFSVDAD 349
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L PD L+ L+ +N+++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 350 VALTEPDSLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 407
Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
G+WNVPYI+N YL+K S ++A ++ + +D DM+FC N+R + + + + + +
Sbjct: 408 VGVWNVPYISNIYLIKGSALRAELQHVDLFHYSKLDADMSFCANVRQQEVFMFLTNRHTF 467
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
GHL+ +N+ + +++E+ NP DW +YIH Y K+L V PCPDV+WFPI
Sbjct: 468 GHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALEGKLV-EMPCPDVYWFPIF 526
Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W +FL +Y+
Sbjct: 527 TEAACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREWHKFLVEYIA 586
Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VG DYE
Sbjct: 587 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRVGQDYE 641
>gi|312082545|ref|XP_003143488.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Loa loa]
Length = 569
Score = 491 bits (1265), Expect = e-136, Method: Compositional matrix adjust.
Identities = 248/578 (42%), Positives = 372/578 (64%), Gaps = 16/578 (2%)
Query: 164 IGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDI 223
+G+A +I LIS R +++ +DDQLYY L+LD+ +R K+ LD++ LFQNL G+ D+
Sbjct: 1 MGFAPEIWSLISYRDVEDNDDDQLYYTRLYLDKQIRLSLKMTLDSMTVLFQNLNGASNDV 60
Query: 224 KLNFDLDE--FVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKH 281
KL + + N YNT+P++IHGNG SK+ LN GNY+ + T+ +
Sbjct: 61 KLEMSGERSGMYFIYNFIYNTHPLVIHGNGPSKLYLNHLGNYIDPLRIATSKTQSITM-- 118
Query: 282 LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDY 341
+ + P + +S+ I KP F+ EF I L Y +KI +FVY NQ++ D+
Sbjct: 119 --DFEKIELPKLFLSIIISKPIPFIREFFGNIKKLAYTDEKIDLFVYCNQKFLTKEVSDF 176
Query: 342 IHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLV 401
+ + K ++++ Y ++ + +EAR+ +++ SL G D+ VD D HL+N + L ++V
Sbjct: 177 VEDVKKRYRSLLY-DDSTEMEEREARSFSLKQSLALGDDYLIMVDGDVHLNNSEALLFMV 235
Query: 402 N----RNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYI 457
+ + ++APL+ +P K ++NFWGA++++G+YARS +Y++II D GIWNVP+I
Sbjct: 236 HTMKEKEPEILAPLIRQPHKLFTNFWGAISSNGYYARSENYLDII--DHKEVGIWNVPFI 293
Query: 458 TNCYLMKTSVIKATNIKTIYTLNS-MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENF 516
+ ++ K T++ Y + +D DM+FC+ R+KG L +D++ YG LV SEN
Sbjct: 294 GSILIIAKE--KLTSLSRAYHYDEKLDPDMSFCSFARDKGHFLYLDNSHHYGFLVVSENV 351
Query: 517 DPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFV 576
+ K +PE+YE+ N W+ RYIHP Y +L T + C DV+ FP+++E+FC E +
Sbjct: 352 ESSKVHPEMYEIFNNKELWEKRYIHPNYFTALNGSTPIPEICQDVYDFPLMSERFCAELI 411
Query: 577 QIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIG 636
+ E YG+WSDG + D+RL GYE VPTRDIHMKQ+ W L +YV P+QE+ FIG
Sbjct: 412 EECEYYGKWSDGKHKDERLVGGYENVPTRDIHMKQIDFERHWLYMLDEYVRPIQEKLFIG 471
Query: 637 YHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNV 696
Y+ +PV + M FVVRY+P+EQ SLRPHHD+STY+I+IALN+ GVDYEGGG RF+RYNC
Sbjct: 472 YYKQPVESVMMFVVRYKPEEQASLRPHHDASTYSIDIALNKRGVDYEGGGVRFLRYNCTF 531
Query: 697 TATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
A +G ++ PGRLTH HEGL+ T+GTRYI +SF++P
Sbjct: 532 DADVVGHSMIFPGRLTHLHEGLETTRGTRYIAVSFINP 569
>gi|312082547|ref|XP_003143489.1| hypothetical protein LOAG_07909 [Loa loa]
Length = 719
Score = 483 bits (1242), Expect = e-133, Method: Compositional matrix adjust.
Identities = 273/711 (38%), Positives = 413/711 (58%), Gaps = 26/711 (3%)
Query: 33 KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELD 92
K +++ DG +R SAE + K L L Q + + G + +L EL
Sbjct: 26 KLAAFALSTGSNDGLERLKCSAEHYNIDFKILDLGQNSIDHE-DKEDTGKLLRMLTTELG 84
Query: 93 EMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG 152
+ I++ I+L+ D ++ II ++I+ +F DA + A L P T + + G
Sbjct: 85 VLRISNSTILLIIDGFNAIITSDESNIICQF--LDACGNYRA--LLTPKTVSAQRSSSFG 140
Query: 153 SGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANL 212
+ + S IG+ DI ++ I +++ + L Y L+ + ++ T + D L
Sbjct: 141 LLFSEVRSVALIGFVPDILDVFD--FIGSQDGNTLSYTSLYSNYSVDTL-GLTFDVKGIL 197
Query: 213 FQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTS- 271
FQN+ + +I L FD + ++ N NT P +I G+ K LN GNY+ K+W
Sbjct: 198 FQNVDSANSEIMLLFDDSGYAYVNNFVQNTRPSVILGSTKGSQLLNHLGNYVGKAWSAED 257
Query: 272 GCTRCNLIKHLDSLKPDQ--FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
G +C+ SLK + +PSV +++FI KP F+ EFL ++ ++YP KI ++ YN
Sbjct: 258 GYLQCSTT----SLKTSENTWPSVTLALFITKPIPFIREFLATVSRISYPTSKIDIYFYN 313
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
NQ+Y+ + ++ N K +++ V+Y ++ + +EAR A+ + DF F +D D
Sbjct: 314 NQKYNEEEIEKFLQNAKKLYQTVEYDNSDTELGEREARKAALTFAKEMLNDFIFMLDGDV 373
Query: 390 HLDNPDVLKYLVNRNES----LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
HL P+ L+ LV+ + +IAPL+ K +SNFWGAL+++G+Y RS DY+ I++G
Sbjct: 374 HLITPETLQLLVDTAIAGKFGIIAPLVTLHGKLFSNFWGALDSNGYYLRSEDYIEIVDGK 433
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-DYDMAFCTNLRNKGIHLKIDST 504
+ GIWNVPYI+ L+ IK ++ YT N M D DM+FC R G + +D+
Sbjct: 434 R--TGIWNVPYISKAILISKEKIKV--LENSYTYNVMVDADMSFCEYAREMGYFMYVDNQ 489
Query: 505 QEYGHLVDSENF-DPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
YG LVD+E+F ++ +PE+YE+ +N W+ RYIHP+Y ++L + QPCPDV+
Sbjct: 490 HYYGFLVDAEDFVSDERLHPEMYEIFKNRYVWEQRYIHPKYYEALNSRNIP-QPCPDVYN 548
Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
+P+++E F E ++ ME YG WS G N D RL GYE VPT DIHMKQ+ W FL
Sbjct: 549 YPLMSENFTKELIEEMEHYGLWSSGKNEDNRLAGGYENVPTVDIHMKQISFEKEWLYFLD 608
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
+YV P+QE+ FIGY+ +PV A M FVVRY+ EQ SL+ HHD+STYT++I LN+ G DYE
Sbjct: 609 EYVRPMQEKLFIGYYQQPVEAVMMFVVRYKQGEQSSLQAHHDASTYTVDIPLNKRGRDYE 668
Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGG R++RYNC V A ++G+ M PGRLTH HEGL VT G RYI +SF++P
Sbjct: 669 GGGIRYVRYNCTVPADQIGYAAMFPGRLTHLHEGLPVTSGIRYIAVSFLNP 719
>gi|297282211|ref|XP_002802231.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
[Macaca mulatta]
Length = 640
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 267/730 (36%), Positives = 391/730 (53%), Gaps = 140/730 (19%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 144
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SGGFIGYA ++ +L++ ++ + DQL+Y +FLD R + I LD
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F++ V N Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT C+ ++ L + + P+VL+ +FI++PT F +S+F
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGMFIEQPTPF-----------------VSLFFQ 306
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSD 388
+ H P K+++ HN V+S+ +
Sbjct: 307 RLLQLHYPR------------KHMRLFIHNH-VSSRHSEG-------------------- 333
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
++IAPL+ R + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 334 -----------------NVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 374
Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNK------------ 495
G+WNVPYI+N YL+K S ++ ++ + +D DMAFC N+R +
Sbjct: 375 IGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQVSQQWAAQDTPR 434
Query: 496 -----------GIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY 544
+ + + + GHL+ +++ + +++E+ NP DW +YIH Y
Sbjct: 435 PRLFHWACFPQDVFMFLTNRHTLGHLLSLDSYRTAHLHNDLWEVFSNPEDWKEKYIHQNY 494
Query: 545 QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPT 604
K+L V PCPDV+WFPI TE C E V+ ME +GQWS G N D R++ GYE VPT
Sbjct: 495 TKALAGKLVET-PCPDVYWFPIFTEAACDELVEEMEHFGQWSLGDNKDSRIQGGYENVPT 553
Query: 605 RDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH 664
DIHM Q+G W +FL +Y+ P+ E+ + GY
Sbjct: 554 IDIHMNQIGFEREWHKFLLEYIAPMTEKLYPGY--------------------------- 586
Query: 665 DSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGT 724
YT GGGCRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GT
Sbjct: 587 ----YT------------RGGGCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGT 630
Query: 725 RYIMISFVDP 734
RYI +SFVDP
Sbjct: 631 RYIAVSFVDP 640
>gi|47210803|emb|CAF89795.1| unnamed protein product [Tetraodon nigroviridis]
Length = 607
Score = 473 bits (1218), Expect = e-130, Method: Compositional matrix adjust.
Identities = 266/680 (39%), Positives = 372/680 (54%), Gaps = 108/680 (15%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
+ LVIT A+ ETDG+ RF+++A VK LGL + W GGD++ ++GGG KV LK E
Sbjct: 1 ENLLVITAATEETDGFHRFMRTAREFNYTVKVLGLGEEWRGGDVARTVGGGQKVRWLKEE 60
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L + D ++L DSYDVI+ G ++L +F+ +VF AE CWPD L KYP
Sbjct: 61 LRKHS-DQDTVVLFVDSYDVILASGPEELLSKFSRLAHRVVFSAEGFCWPDQRLAPKYPE 119
Query: 151 VGSGYRYLNSGG---------------------------FIGYAKDIKELISNRSIKNEE 183
V SG RYLNSGG FIG+A ++ ++ ++++
Sbjct: 120 VPSGKRYLNSGGPRLPPVRVRRRWRLDQPVCVCVCVCSGFIGFASELSAIVQQWKYRDDD 179
Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
DDQL+Y ++LD+ RTK + LD + +FQNL G+++++ L F+ + V N Y+T
Sbjct: 180 DDQLFYTRIYLDKVQRTKFNMTLDHRSRIFQNLNGAVDEVVLKFERSK-VRARNVAYDTL 238
Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCN----LIKHLDSLKPDQ-FPSVLISV 297
PV+IHGNG +K++LN NY+ +W GC C+ L+ H+ PD+ P V + V
Sbjct: 239 PVVIHGNGPTKLQLNYLANYVPSAWTFQGGCGVCDDDLLLLNHV----PDEDMPLVHVGV 294
Query: 298 FIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAH 357
FI+K T FLEEFL ++ +NYP + A
Sbjct: 295 FIEKATPFLEEFLERLTLMNYPTAQS--------------------------------AS 322
Query: 358 NSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFK 417
+ST R+ D+YF +DSD L NPD L+ L+ N+S+IAP+L + K
Sbjct: 323 SSTTTEACLRD--------PECDYYFSLDSDVALTNPDTLRILMEENKSVIAPMLSKHGK 374
Query: 418 AWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIY 477
WSNFWGAL+ +GFY+RS DY+ I+ G + G+WNVPYIT YL+K SV+++
Sbjct: 375 LWSNFWGALSPEGFYSRSEDYIEIVQGKR--IGLWNVPYITQVYLIKGSVLRS------- 425
Query: 478 TLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDL 537
R + L+ S E +P +P E + DW
Sbjct: 426 ---------------RLSQLSLRWTSRHRSSAGTSREQDEPWSCDPPRREDAAD--DWKE 468
Query: 538 RYIHPEYQKSLLP-DTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLE 596
+Y+H Y + ++ QPCPDV+WFP +EK C V+ MEA+GQWS G + D+RL
Sbjct: 469 KYVHENYSRIFEEQESFVEQPCPDVYWFPAFSEKMCDHLVETMEAHGQWSSGGHKDERLS 528
Query: 597 TGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDE 656
GYE VPT D HM Q+G W FLR Y+VP+ E+ + GY+ +A M+FVVRYRPDE
Sbjct: 529 GGYENVPTVDTHMNQIGFEKEWLRFLRDYIVPVTEKLYPGYYPR-AQAIMNFVVRYRPDE 587
Query: 657 QPSLRPHHDSSTYTINIALN 676
QPSLRPHHDSST+TINIALN
Sbjct: 588 QPSLRPHHDSSTFTINIALN 607
>gi|393910404|gb|EFO20581.2| hypothetical protein LOAG_07909 [Loa loa]
Length = 633
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 261/658 (39%), Positives = 390/658 (59%), Gaps = 34/658 (5%)
Query: 86 LLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLY 145
+L EL + I++ I+L+ D ++ II ++I+ +F DA C +L
Sbjct: 1 MLTTELGVLRISNSTILLIIDGFNAIITSDESNIICQF--LDA---------CGNYRALL 49
Query: 146 DKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIV 205
P S R + S IG+ DI ++ I +++ + L Y L+ + ++ T +
Sbjct: 50 T--PKTVSAQREVRSVALIGFVPDILDVFD--FIGSQDGNTLSYTSLYSNYSVDTL-GLT 104
Query: 206 LDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLA 265
D LFQN+ + +I L FD + ++ N NT P +I G+ K LN GNY+
Sbjct: 105 FDVKGILFQNVDSANSEIMLLFDDSGYAYVNNFVQNTRPSVILGSTKGSQLLNHLGNYVG 164
Query: 266 KSWKTS-GCTRCNLIKHLDSLKPDQ--FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
K+W G +C+ SLK + +PSV +++FI KP F+ EFL ++ ++YP K
Sbjct: 165 KAWSAEDGYLQCSTT----SLKTSENTWPSVTLALFITKPIPFIREFLATVSRISYPTSK 220
Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFY 382
I ++ YNNQ+Y+ + ++ N K +++ V+Y ++ + +EAR A+ + DF
Sbjct: 221 IDIYFYNNQKYNEEEIEKFLQNAKKLYQTVEYDNSDTELGEREARKAALTFAKEMLNDFI 280
Query: 383 FYVDSDSHLDNPDVLKYLVNRNES----LIAPLLVRPFKAWSNFWGALNADGFYARSFDY 438
F +D D HL P+ L+ LV+ + +IAPL+ K +SNFWGAL+++G+Y RS DY
Sbjct: 281 FMLDGDVHLITPETLQLLVDTAIAGKFGIIAPLVTLHGKLFSNFWGALDSNGYYLRSEDY 340
Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-DYDMAFCTNLRNKGI 497
+ I++G + GIWNVPYI+ L+ IK ++ YT N M D DM+FC R G
Sbjct: 341 IEIVDGKR--TGIWNVPYISKAILISKEKIKV--LENSYTYNVMVDADMSFCEYAREMGY 396
Query: 498 HLKIDSTQEYGHLVDSENF-DPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQ 556
+ +D+ YG LVD+E+F ++ +PE+YE+ +N W+ RYIHP+Y ++L + Q
Sbjct: 397 FMYVDNQHYYGFLVDAEDFVSDERLHPEMYEIFKNRYVWEQRYIHPKYYEALNSRNIP-Q 455
Query: 557 PCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAG 616
PCPDV+ +P+++E F E ++ ME YG WS G N D RL GYE VPT DIHMKQ+
Sbjct: 456 PCPDVYNYPLMSENFTKELIEEMEHYGLWSSGKNEDNRLAGGYENVPTVDIHMKQISFEK 515
Query: 617 VWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALN 676
W FL +YV P+QE+ FIGY+ +PV A M FVVRY+ EQ SL+ HHD+STYT++I LN
Sbjct: 516 EWLYFLDEYVRPMQEKLFIGYYQQPVEAVMMFVVRYKQGEQSSLQAHHDASTYTVDIPLN 575
Query: 677 QVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+ G DYEGGG R++RYNC V A ++G+ M PGRLTH HEGL VT G RYI +SF++P
Sbjct: 576 KRGRDYEGGGIRYVRYNCTVPADQIGYAAMFPGRLTHLHEGLPVTSGIRYIAVSFLNP 633
>gi|393910405|gb|EJD75867.1| hypothetical protein, variant 1 [Loa loa]
gi|393910406|gb|EJD75868.1| hypothetical protein, variant 2 [Loa loa]
Length = 511
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 227/519 (43%), Positives = 327/519 (63%), Gaps = 18/519 (3%)
Query: 225 LNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLD 283
L FD + ++ N NT P +I G+ K LN GNY+ K+W G +C+
Sbjct: 2 LLFDDSGYAYVNNFVQNTRPSVILGSTKGSQLLNHLGNYVGKAWSAEDGYLQCSTT---- 57
Query: 284 SLKPDQ--FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDY 341
SLK + +PSV +++FI KP F+ EFL ++ ++YP KI ++ YNNQ+Y+ + +
Sbjct: 58 SLKTSENTWPSVTLALFITKPIPFIREFLATVSRISYPTSKIDIYFYNNQKYNEEEIEKF 117
Query: 342 IHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLV 401
+ N K +++ V+Y ++ + +EAR A+ + DF F +D D HL P+ L+ LV
Sbjct: 118 LQNAKKLYQTVEYDNSDTELGEREARKAALTFAKEMLNDFIFMLDGDVHLITPETLQLLV 177
Query: 402 NRNES----LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYI 457
+ + +IAPL+ K +SNFWGAL+++G+Y RS DY+ I++G + GIWNVPYI
Sbjct: 178 DTAIAGKFGIIAPLVTLHGKLFSNFWGALDSNGYYLRSEDYIEIVDGKR--TGIWNVPYI 235
Query: 458 TNCYLMKTSVIKATNIKTIYTLNSM-DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENF 516
+ L+ IK ++ YT N M D DM+FC R G + +D+ YG LVD+E+F
Sbjct: 236 SKAILISKEKIKV--LENSYTYNVMVDADMSFCEYAREMGYFMYVDNQHYYGFLVDAEDF 293
Query: 517 -DPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEF 575
++ +PE+YE+ +N W+ RYIHP+Y ++L + QPCPDV+ +P+++E F E
Sbjct: 294 VSDERLHPEMYEIFKNRYVWEQRYIHPKYYEALNSRNIP-QPCPDVYNYPLMSENFTKEL 352
Query: 576 VQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFI 635
++ ME YG WS G N D RL GYE VPT DIHMKQ+ W FL +YV P+QE+ FI
Sbjct: 353 IEEMEHYGLWSSGKNEDNRLAGGYENVPTVDIHMKQISFEKEWLYFLDEYVRPMQEKLFI 412
Query: 636 GYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCN 695
GY+ +PV A M FVVRY+ EQ SL+ HHD+STYT++I LN+ G DYEGGG R++RYNC
Sbjct: 413 GYYQQPVEAVMMFVVRYKQGEQSSLQAHHDASTYTVDIPLNKRGRDYEGGGIRYVRYNCT 472
Query: 696 VTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
V A ++G+ M PGRLTH HEGL VT G RYI +SF++P
Sbjct: 473 VPADQIGYAAMFPGRLTHLHEGLPVTSGIRYIAVSFLNP 511
>gi|47205471|emb|CAF94612.1| unnamed protein product [Tetraodon nigroviridis]
Length = 559
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 216/565 (38%), Positives = 351/565 (62%), Gaps = 12/565 (2%)
Query: 33 KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW-LGGDMSSLGGGYKVNLLKNEL 91
+ LV+TVA+ +TDG++RF+ SA+ VK LG + W GG + GGG KV LLK +
Sbjct: 1 RLLVLTVATGDTDGFRRFLSSAQHFNYTVKVLGRDEAWSGGGYAGAPGGGQKVRLLKAAV 60
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+EM+ D I+L TDSYD + G ++L++F +VF +E L WPD L DK+P V
Sbjct: 61 EEME-NQDAILLFTDSYDAVFSSGPRELLKKFQQAGHQVVFSSEPLIWPDRHLEDKHPHV 119
Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
G R+L SGGFIGY +IKEL+++ + ++++ DQL++ +++D R I LD+
Sbjct: 120 REGNRFLGSGGFIGYLANIKELVADWTGEDDDSDQLFFTRIYIDAAKRKSINITLDSKCR 179
Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
LFQNL GSL+++ L F+ + V N ++T PV+IHGNG +K+++N GNY+ W
Sbjct: 180 LFQNLLGSLDEVVLKFEEGK-VRARNLVHDTLPVLIHGNGPTKLQINYLGNYIPNVWTFE 238
Query: 271 SGCTRCN-LIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
+GC C ++ L +L+ +P VL+ VFI++PT F+ F ++ L YP ++ + +YN
Sbjct: 239 AGCRVCQEELRPLGALQESDYPLVLVGVFIEQPTPFVSAFFQRLLELQYPKTRLKVLIYN 298
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
+ +H ++ ++++ V + ++++ARNLA++ + D++F VD D
Sbjct: 299 KEAHHEQHVSAFLQKHQSLYAAVDLLRPEDPADARDARNLALDMCRQDQSCDYFFSVDVD 358
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L N L+ L+ N ++AP+L R + WSNFWGAL+ DG+YARS DY++I+ +
Sbjct: 359 VVLKNQSTLRTLIEHNLPIVAPMLTRAGRLWSNFWGALSPDGYYARSEDYVDIVQRRR-- 416
Query: 449 KGIWNVPYITNCYLMKTSVIKA--TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPY++ L+K ++++ T+ + ++ + +D DMAFC N+RNKGI + + +
Sbjct: 417 VGVWNVPYVSKVVLLKGVLLRSELTDFE-LFDSHILDPDMAFCHNVRNKGIFMYVTNVHT 475
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+GH++ +EN+ Q + +++++ NPLDW RYIHP Y + +L D + PCPDV+WFP+
Sbjct: 476 FGHILSTENYQTQHLHNDLWQIFENPLDWQERYIHPNYSR-ILRDQLIETPCPDVYWFPV 534
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNN 591
TE+ C V+ ME +G+WS G N
Sbjct: 535 FTEEACDHMVEEMEHFGRWSGGANT 559
>gi|28380952|gb|AAO41443.1| RE30068p [Drosophila melanogaster]
Length = 595
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 218/541 (40%), Positives = 331/541 (61%), Gaps = 12/541 (2%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
DK V TVA+ TDGY R+I+SA V ++V TLGL + W GGDM GGG+K+NLL+ +
Sbjct: 27 DKIKVFTVATEPTDGYTRYIRSARVYDIEVTTLGLGEEWKGGDMQKPGGGFKLNLLREAI 86
Query: 92 DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
+ IIL TDSYDVII +++I E+F A I+F AE+ CWPD SL D YP V
Sbjct: 87 APYKNEPETIILFTDSYDVIITTTLDEIFEKFKESGAKILFSAEKYCWPDKSLADDYPEV 146
Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
G R+LNSG FIGYA + L+ + I++ DDQLY+ +FLDET R K + LD +
Sbjct: 147 EGKASRFLNSGAFIGYAPQVFALLVD-PIEDTADDQLYFTKIFLDETKRAKLGLKLDVQS 205
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
LFQNL+G+ D+KL DL+ L N + T P IIHGNG SK++LN++GNYLA+++
Sbjct: 206 RLFQNLHGAKNDVKLKVDLESNQGVLQNVDFMTTPSIIHGNGLSKVDLNAYGNYLARTF- 264
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
C C ++L L+ P + +++ + +P F ++FL I +LNYP +K+ + +Y+
Sbjct: 265 NGVCLLCQ--ENLLDLEETNLPVISLALMVTQPVPFFDQFLEGIESLNYPKEKLHLLIYS 322
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
N +H +++ + K+ ++ ++ R LA++ + D+ F+VD+D+
Sbjct: 323 NVAFHDDDIKSFVNKHAKEYATAKFALSTDELDERQGRQLALDKARLHQSDYIFFVDADA 382
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
H+D+ +VL+ L+ N+ +AP+ + + WSNFWGAL+ G+YARS DY++I+ +
Sbjct: 383 HIDDGEVLRELLRLNKQFVAPIFSKHKELWSNFWGALSEGGYYARSHDYVDIVKREL--I 440
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
G++NVP++T+ YL+K + A + K D DMA C +LRN GI + + + +GH
Sbjct: 441 GMFNVPHVTSIYLVKKTAFDAISFKH----KEFDPDMAMCESLRNAGIFMYASNLRIFGH 496
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
LV++++F+ T P+ Y L N +DW +YIHP Y L QPCPDV+WF IV+E
Sbjct: 497 LVNADDFNTTVTRPDFYTLFSNEIDWTEKYIHPNYSLQLNESNKIQQPCPDVYWFQIVSE 556
Query: 570 K 570
+
Sbjct: 557 R 557
>gi|432957744|ref|XP_004085857.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like,
partial [Oryzias latipes]
Length = 363
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 190/357 (53%), Positives = 255/357 (71%), Gaps = 5/357 (1%)
Query: 380 DFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYM 439
+F+F +DSD L NPD L+ L+ N+S+IAP+L + K WSNFWGAL+ +GFY+RS DY+
Sbjct: 10 EFFFSLDSDVALTNPDTLRILMEENKSVIAPMLSKHGKLWSNFWGALSPEGFYSRSEDYI 69
Query: 440 NIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIH 498
I+ + G+WNVPYI+ YL+K SV+++ + ++ MD DM FC N+R++G+
Sbjct: 70 EIVQAKR--VGLWNVPYISQVYLVKGSVLRSKLSHLNLFVDQGMDPDMVFCKNVRDQGVF 127
Query: 499 LKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLL-PDTVNNQP 557
+ + + E+G LV S NF+ + +P+++++ NPLDW +YIH Y K D QP
Sbjct: 128 MFVSNRDEFGRLVASSNFNTSRLHPDMWQIFDNPLDWREKYIHENYSKIFEDQDGFVEQP 187
Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGV 617
CPDV+WFP +EK C + V+ ME YG WS G++ D+RL GYE VPT DIHM Q+G
Sbjct: 188 CPDVYWFPAFSEKMCDQLVETMEDYGVWSGGSHKDERLSGGYENVPTVDIHMNQIGFEKE 247
Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
W +FL+ Y+ P+ E+ + GY + +A M+FVVRYRPDEQPSLRPHHDSST+TINIALN+
Sbjct: 248 WLKFLKDYIAPVTEKLYPGYFPK-AQAIMNFVVRYRPDEQPSLRPHHDSSTFTINIALNR 306
Query: 678 VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GVDYEGGGCRF+RY+C V + R GW MHPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 307 KGVDYEGGGCRFLRYDCKVESPRKGWSFMHPGRLTHYHEGLPTTRGTRYIMVSFVDP 363
>gi|149420843|ref|XP_001508185.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like,
partial [Ornithorhynchus anatinus]
Length = 402
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/406 (46%), Positives = 277/406 (68%), Gaps = 6/406 (1%)
Query: 331 QEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDS 389
+++H + ++ + V+ + + V + +ARN+ + + +YF VD+D
Sbjct: 1 EQHHKAQVERFVAEHGGEYHAVQLVGPDQRVENAQARNMGADLCRKDRDCTYYFSVDADV 60
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NP+ L+ L+ +N+++IAP++ RP + WSNFWGAL+ DGFYARS DY++I+ G +
Sbjct: 61 ALKNPETLRLLIEQNKAVIAPMMSRPGRLWSNFWGALSVDGFYARSEDYVDIVQGRR--V 118
Query: 450 GIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYG 508
G+WNVPYI++ YL+K S +++ + ++ +D DMAFC+N+R + + + + + Q +G
Sbjct: 119 GVWNVPYISSIYLVKGSSLRSDLRQEDLFHSGKLDPDMAFCSNVRQQDVFMFLTNRQPFG 178
Query: 509 HLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVT 568
HL+ EN+ + +++E+ NP DW +YIH Y ++L + PCPDV+WFPI T
Sbjct: 179 HLLSLENYQTTHLHNDLWEVFSNPEDWKEKYIHENY-TAVLKGKLVETPCPDVYWFPIFT 237
Query: 569 EKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVP 628
E C E V+ ME +GQWS G N D RL+ GYE VPT DIHM Q+ W +FL +Y+ P
Sbjct: 238 EVACDELVEEMEHFGQWSAGDNKDSRLQGGYENVPTIDIHMNQISFEREWHKFLVEYIAP 297
Query: 629 LQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCR 688
+ E+ + GY+ + + ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VGVDYEGGGCR
Sbjct: 298 ITEKLYPGYYTK-AQFDLAFVVRYKPDEQPSLMPHHDASTFTLNIALNRVGVDYEGGGCR 356
Query: 689 FIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
F+RYNC+V A R GW LMHPGRLTHYHEGL T+GTRYI +SF+DP
Sbjct: 357 FLRYNCSVKAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFLDP 402
>gi|4884200|emb|CAB43221.1| hypothetical protein [Homo sapiens]
Length = 365
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 182/357 (50%), Positives = 250/357 (70%), Gaps = 4/357 (1%)
Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDY 438
+FYF +D+D+ L N L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY
Sbjct: 12 CEFYFSLDADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDY 71
Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGI 497
+ ++ + G+WNVPYI+ Y+++ ++ + +++ + D DMAFC + R+KGI
Sbjct: 72 VELVQRKR--VGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGI 129
Query: 498 HLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQP 557
L + + E+G L+ + +D + +P+++++ NP+ W +YIH Y ++L + + QP
Sbjct: 130 FLHLSNQHEFGRLLATSRYDTEHLHPDLWQIFDNPVGWKEQYIHENYSRALEGEGIVEQP 189
Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGV 617
CPDV+WFP+++E+ C E V ME YGQWS G + D RL GYE VPT DIHMKQVG
Sbjct: 190 CPDVYWFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQ 249
Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
W + LR YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN
Sbjct: 250 WLQLLRTYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNH 308
Query: 678 VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
G+DYEGGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL T GTRYIM+SFVDP
Sbjct: 309 KGLDYEGGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 365
>gi|449512121|ref|XP_002188714.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like,
partial [Taeniopygia guttata]
Length = 466
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/467 (41%), Positives = 302/467 (64%), Gaps = 9/467 (1%)
Query: 221 EDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNL- 278
++I L F+ + V N Y+T PV+IHGNG +K++LN GNY+ + W +GCT C+
Sbjct: 5 DEIVLKFE-NSRVRARNLLYDTLPVVIHGNGPTKLQLNYLGNYIPQIWTFETGCTVCDEG 63
Query: 279 IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLF 338
++ L K + P +LI +FI++PT FL +F ++ NL+YP ++I +F++N++E+H
Sbjct: 64 LRSLLGFKDEALPMILIGIFIEQPTPFLSQFFLRLRNLHYPKQRIQLFIHNHEEHHLMEV 123
Query: 339 DDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVL 397
D ++ + V+ I + V + EARNL ++ D+YF +D++ L N + L
Sbjct: 124 DSFVEEHGREYLTVQVIGPDDEVENAEARNLGMDLCRKDPDCDYYFSLDAEVVLKNTETL 183
Query: 398 KYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYI 457
+ L+ +N+ +IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ + G+WNVPYI
Sbjct: 184 RILIEQNKLVIAPLVSRHEKLWSNFWGALSPDGYYARSEDYVDIVQRRR--VGLWNVPYI 241
Query: 458 TNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENF 516
++ YL+K +++ ++ +D DMAFC N+RN+G+ + + + ++GH++ EN+
Sbjct: 242 SSVYLVKGKALRSELEQGDLFHSGKLDADMAFCHNIRNQGVFMYLTNQHQFGHILSLENY 301
Query: 517 DPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFV 576
+ +++++ NP DW +YIH Y +L V PCPDV+WFPI T+ C E V
Sbjct: 302 QTSHLHNDLWQIFSNPEDWREKYIHENYTAALKGKLVE-MPCPDVYWFPIFTDTACDELV 360
Query: 577 QIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIG 636
+ ME YGQWS G N D R++ GYE VPT DIHM Q+G W +FL Y+ P+ E+ + G
Sbjct: 361 EEMEHYGQWSTGDNTDSRIQGGYENVPTIDIHMNQIGFEREWYKFLLDYIAPITEKLYPG 420
Query: 637 YHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
Y+ + + ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG+DYE
Sbjct: 421 YYTK-TQFELAFVVRYKPDEQPSLVPHHDASTFTINIALNRVGIDYE 466
>gi|16307441|gb|AAH10268.1| Plod1 protein [Mus musculus]
Length = 364
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 182/356 (51%), Positives = 252/356 (70%), Gaps = 7/356 (1%)
Query: 381 FYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMN 440
+YF VD+D L P+ L+ L+ +N+++IAPL+ R + WSNFWG L+ADG+YARS DY++
Sbjct: 14 YYFSVDADVALTEPNSLRLLIEQNKNVIAPLMTRHGRLWSNFWGGLSADGYYARSEDYVD 73
Query: 441 IINGDQGGKGIWNVPYITNCYLMKTSVIKA--TNIKTIYTLNSMDYDMAFCTNLRNKGIH 498
I+ G + G+WNVPYI+N YL+K S ++A N+ ++ + +D DM+FC N+R + +
Sbjct: 74 IVQGRR--VGVWNVPYISNIYLIKGSALRAELQNVD-LFHYSKLDSDMSFCANVRQQEVF 130
Query: 499 LKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPC 558
+ + + +GHL+ +N+ + +++E+ NP DW +YIH Y K+L V PC
Sbjct: 131 MFLTNRHTFGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PC 189
Query: 559 PDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW 618
PDV+WFPI TE C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+ W
Sbjct: 190 PDVYWFPIFTEAACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREW 249
Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
+FL +Y+ P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+V
Sbjct: 250 HKFLVEYIAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRV 308
Query: 679 GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
G DYEGGGCRF+RYNC+V A R GW L+HPGRLTHYHEGL T+GTRYI +SFVDP
Sbjct: 309 GEDYEGGGCRFLRYNCSVRAPRKGWALLHPGRLTHYHEGLPTTKGTRYIAVSFVDP 364
>gi|339522069|gb|AEJ84199.1| procollagen-lysine 2-oxoglutarate 5-dioxygenase 1 [Capra hircus]
Length = 365
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 179/357 (50%), Positives = 245/357 (68%), Gaps = 4/357 (1%)
Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDY 438
+FYF +DSD+ + NP L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY
Sbjct: 12 CEFYFSLDSDTVITNPQPLRILIEANRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDY 71
Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGI 497
+ ++ + G+WNVPYI+ Y+++ ++ + +++ D DMAFC +LR+KGI
Sbjct: 72 VELVQRKR--VGVWNVPYISQAYVIRGEPLRTELPQREVFSGGDTDPDMAFCKSLRDKGI 129
Query: 498 HLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQP 557
L + + E+ L+ + +D +P+++++ NPLDW +YIH Y ++L + + QP
Sbjct: 130 FLHLSNQHEFARLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGLVEQP 189
Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGV 617
CPDV+WFP+ +E+ C E V+ ME +GQWS G + D RL GYE VPT DIH KQVG
Sbjct: 190 CPDVYWFPLPSERMCDELVEEMEHFGQWSGGRHEDSRLAGGYENVPTVDIHRKQVGYEAQ 249
Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
W + LR YV P+ E YH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN
Sbjct: 250 WLQLLRTYVGPMTESLSPAYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNH 308
Query: 678 VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
G+DYEGGGCRF RY+C +++ R GW L+HPGRLTHYHEGL T+G RY M+SFVDP
Sbjct: 309 KGLDYEGGGCRFRRYDCVISSPRKGWGLLHPGRLTHYHEGLPTTRGPRYTMVSFVDP 365
>gi|344245759|gb|EGW01863.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Cricetulus
griseus]
Length = 322
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 174/325 (53%), Positives = 232/325 (71%), Gaps = 4/325 (1%)
Query: 411 LLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA 470
+L R K WSNFWGAL+ D +YARS DY+ ++ + G+WNVPYI+ Y+++ ++
Sbjct: 1 MLSRHGKLWSNFWGALSPDEYYARSEDYVELVQRKR--VGVWNVPYISQAYVIRGETLRT 58
Query: 471 T-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELI 529
K +++ + D DMAFC +LR+KGI L + + E+G L+ + +D +P+++++
Sbjct: 59 ELPQKEVFSGSDTDPDMAFCKSLRDKGIFLHLSNQHEFGRLLATSRYDTDHLHPDLWQIF 118
Query: 530 RNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGT 589
NP+DW +YIH Y ++L + QPCPDV+WFP++TE+ C E V+ ME YGQWS G
Sbjct: 119 DNPVDWKEQYIHENYSRALDGQGLVEQPCPDVYWFPLLTEQMCDELVEEMEHYGQWSGGR 178
Query: 590 NNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFV 649
+ D RL GYE VPT DIHMKQVG W + LR YV P+ E F GYH + RA M+FV
Sbjct: 179 HEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLRTYVGPMTEYLFPGYHTK-TRAVMNFV 237
Query: 650 VRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPG 709
VRYRPDEQPSLRPHHDSST+T+N+ALN GVDYEGGGCRF+RY+C +++ R GW L+HPG
Sbjct: 238 VRYRPDEQPSLRPHHDSSTFTLNVALNHKGVDYEGGGCRFLRYDCRISSPRKGWALLHPG 297
Query: 710 RLTHYHEGLQVTQGTRYIMISFVDP 734
RLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 298 RLTHYHEGLPTTRGTRYIMVSFVDP 322
>gi|193785082|dbj|BAG54235.1| unnamed protein product [Homo sapiens]
Length = 418
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 178/377 (47%), Positives = 250/377 (66%), Gaps = 26/377 (6%)
Query: 380 DFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYM 439
D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS DY+
Sbjct: 46 DYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYV 105
Query: 440 NIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---- 494
+I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 106 DIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMTLQ 163
Query: 495 -----------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDL 537
KG+ + I + E+G L+ + N++ N +++++ NP+DW
Sbjct: 164 REKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKE 223
Query: 538 RYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET 597
+YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+
Sbjct: 224 KYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISG 282
Query: 598 GYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQ 657
GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q
Sbjct: 283 GYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQ 341
Query: 658 PSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEG 717
SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRLTH HEG
Sbjct: 342 RSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEG 401
Query: 718 LQVTQGTRYIMISFVDP 734
L V GTRYI +SF+DP
Sbjct: 402 LPVKNGTRYIAVSFIDP 418
>gi|313240887|emb|CBY33173.1| unnamed protein product [Oikopleura dioica]
Length = 590
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 219/588 (37%), Positives = 340/588 (57%), Gaps = 29/588 (4%)
Query: 18 FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS- 76
+S+ N V N E LVITVA+ +TDGY R+ +S + L+ +T G+ + WLGGD++
Sbjct: 6 LVSLLSNSVLNARE--LLVITVATEKTDGYLRWEESVRYSGLKSRTFGIGEDWLGGDLTN 63
Query: 77 SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFN------TFDANI 130
GGG+KVNLLK EL E ++ L TD+YDVII+G +I RF+ + N+
Sbjct: 64 GPGGGHKVNLLKKELAEYKGNSELYFLFTDAYDVIINGKEEEIFSRFDDIVSKVEYKTNV 123
Query: 131 VFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYA 190
+ AE L WPD SL KYP V G R+L SG + A +L+ R+I + +DDQL+Y
Sbjct: 124 LISAEDLIWPDASLEPKYPLV-LGKRFLCSGAILARADVFLDLLEYRAIGDRDDDQLFYT 182
Query: 191 LLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLT--NTKYNTNPVIIH 248
FL++ L+ K I LD A LF NL G+LE++ ++F T NTKY T P++IH
Sbjct: 183 EAFLNKELKEKFGIALDHKAELFFNLNGALEEVGIDFARSATGDNTVENTKYRTKPLVIH 242
Query: 249 GNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPS--VLISVFIDKPTAF 305
GNG SK ELN NY+ + W+ GC C+ + + + +K D S ++I+ ID T F
Sbjct: 243 GNGPSKNELNRISNYVPQGWRPDYGCPACSKVLN-EEIKEDIDTSKDIVIAFIIDGITPF 301
Query: 306 LEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKE 365
++ L +IA+L+YPA+K + +Y+N + D ++ F + +K+ K+I+ ++
Sbjct: 302 VQNSLKRIASLDYPAEKTHLLIYSNTVWADERVDTFLEVFGSSYKSTKFISSKEKMSVTM 361
Query: 366 ARNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
AR A++ + K +F FYVD L NP V+ L+ N L+AP + R K WSN+WG
Sbjct: 362 ARKFALQLTDEKFSAEFVFYVDGYVQLTNPAVIGELIKTNVELVAPGMSRYGKLWSNYWG 421
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-- 482
A+ +DGFY+RS DY++I+ G + GIWN+P++ YL+ ++ A ++ I+ S
Sbjct: 422 AVASDGFYSRSDDYLDIVQGTR--VGIWNMPFVNGAYLVHKNL--AADLIDIFAGISQSP 477
Query: 483 ------DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWD 536
D D+ F +NLR GI + + + +G LVD E+ + +PE+++ N DW+
Sbjct: 478 WQGKFNDPDLDFASNLRTLGIFMHVTNQAYWGRLVDREHMPVDRIHPELWQPEWNRPDWE 537
Query: 537 LRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQ 584
Y+ +Y + L P+T ++PCPDV FP ++ K + ++ ME YG+
Sbjct: 538 EDYLDTDYWRVLEPETEMDEPCPDVVAFPFLSSKGGFDMIEEMEHYGK 585
>gi|21428536|gb|AAM49928.1| LD37702p [Drosophila melanogaster]
Length = 280
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 171/284 (60%), Positives = 212/284 (74%), Gaps = 4/284 (1%)
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
++NVP++T+ YL+K + A + K D DMA C +LRN GI + + + +GHL
Sbjct: 1 MFNVPHVTSIYLVKKTAFDAISFKH----KEFDPDMAMCESLRNAGIFMYASNLRIFGHL 56
Query: 511 VDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEK 570
V++++F+ T P+ Y L N +DW +YIHP Y L QPCPDV+WF IV++
Sbjct: 57 VNADDFNTTVTRPDFYTLFSNEIDWTEKYIHPNYSLQLNESNKIQQPCPDVYWFQIVSDA 116
Query: 571 FCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQ 630
FC + V IMEA+ WSDG+NND RLE GYEAVPTRDIHMKQVGL ++ +FL+ +V PLQ
Sbjct: 117 FCDDLVAIMEAHNGWSDGSNNDNRLEGGYEAVPTRDIHMKQVGLERLYLKFLQMFVRPLQ 176
Query: 631 EREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI 690
ER F GY H P RA M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DY+GGGCRFI
Sbjct: 177 ERAFTGYFHNPPRALMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRFI 236
Query: 691 RYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
RYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISF+DP
Sbjct: 237 RYNCSVTDTKKGWMLMHPGRLTHYHEGLLVTNGTRYIMISFIDP 280
>gi|291413228|ref|XP_002722881.1| PREDICTED: procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3
[Oryctolagus cuniculus]
Length = 530
Score = 352 bits (902), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 154/290 (53%), Positives = 209/290 (72%), Gaps = 2/290 (0%)
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
+ +G+WNVPYI Y+++ ++ + +++ + D DMAFC +LR++GI L + +
Sbjct: 242 RASRGVWNVPYIAQAYVIRGETLRTELPQREVFSSSDTDPDMAFCKSLRDQGIFLHLSNR 301
Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
E+G L+ + +D +P+++++ NP+DW +YIH Y ++L D + QPCPDV+WF
Sbjct: 302 HEFGRLLATSRYDTDHLHPDLWQIFDNPVDWKEQYIHENYSRALEGDGMVEQPCPDVYWF 361
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
P+++E+ C E V+ ME YGQWS G + D RL GYE VPT DIHMKQVG W + LR
Sbjct: 362 PLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLRT 421
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
YV P+ E F GYH + RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN G+DYEG
Sbjct: 422 YVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYEG 480
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
GGCRF+RY+C ++A R GW L+HPGRLTHYHEGL T+GTRYIM+SFVDP
Sbjct: 481 GGCRFLRYDCVISAPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 530
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 92/193 (47%), Positives = 137/193 (70%), Gaps = 1/193 (0%)
Query: 30 DEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLK 88
+ +K LVITVA+ ET+GY+RF++SAEV V+TLGL Q W GGD++ ++GGG KV LK
Sbjct: 42 EPEKLLVITVATAETEGYRRFLRSAEVFNYTVRTLGLGQEWRGGDVARTVGGGQKVRWLK 101
Query: 89 NELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKY 148
E+++ +DM+I+ DSYDVI+ G +++L++F + ++F AE CWP+ L ++Y
Sbjct: 102 KEMEQYADREDMVIMFVDSYDVILAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQY 161
Query: 149 PAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDT 208
P VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K K+ LD
Sbjct: 162 PEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLKLNLDH 221
Query: 209 LANLFQNLYGSLE 221
+ +FQNL G+LE
Sbjct: 222 KSRIFQNLNGALE 234
>gi|47223418|emb|CAG04279.1| unnamed protein product [Tetraodon nigroviridis]
Length = 561
Score = 341 bits (875), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 174/415 (41%), Positives = 254/415 (61%), Gaps = 5/415 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNLLKNE 90
+K LV+TVA+ ETDG++RF+QSA VK LG+ + W GGD+ +S+GGG KV LLK
Sbjct: 1 EKLLVLTVATEETDGFQRFMQSAHYFNYSVKVLGMGEAWKGGDVGNSIGGGQKVRLLKEA 60
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
+ + +D+++L DSYD+I GG +IL +F + ++F AE L WPD L DKYP
Sbjct: 61 MKALADQEDLVVLFVDSYDLIFAGGPEEILRKFQQANHKVLFAAEGLIWPDKRLADKYPL 120
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V SG RYLNSGGF+GYA I +L+S ++ + +DDQL+Y +++D R + LD
Sbjct: 121 VRSGKRYLNSGGFMGYAPPINQLVSQWNLHDNDDDQLFYTKIYVDPLQRQTLNMTLDHKC 180
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+F L G+ +++ L F D V + NT +++ P ++HGN +KI LN GNY+ W
Sbjct: 181 QIFLTLNGAADEVLLKFGTDR-VRVRNTAHDSLPAVVHGNRNTKIFLNYLGNYVPHMWNY 239
Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
GC+ C+ LD + +PSVL+ VFI+KPT FL EF ++ +L+YP K+ +F++N
Sbjct: 240 EHGCSHCD-KDILDLAQLKDYPSVLVGVFIEKPTPFLPEFFQRLLSLDYPKDKMKVFIHN 298
Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGV-DFYFYVDSD 388
N+ YH + + F N K + ++ EARN+ ++ DFYF +DSD
Sbjct: 299 NEVYHEKHIQKFWEENRNTFINFKIVGPEENLSQGEARNMGMDLCRKDATCDFYFSLDSD 358
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
L N LK LV +N +I PL+ R K WSNFWGAL+ DG+YARS DY++I+
Sbjct: 359 VMLTNSQTLKLLVEQNRKIIGPLVTRHSKLWSNFWGALSPDGYYARSEDYIDIVQ 413
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 48/71 (67%), Positives = 56/71 (78%)
Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
HDSST+TINIALN D++GGGCRF RYNC++ + R GW MHPGRLTH HEGL T G
Sbjct: 491 HDSSTFTINIALNNKETDFQGGGCRFHRYNCSIESPRKGWSFMHPGRLTHLHEGLPTTNG 550
Query: 724 TRYIMISFVDP 734
TRYI +SF+DP
Sbjct: 551 TRYIAVSFIDP 561
>gi|344254154|gb|EGW10258.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Cricetulus
griseus]
Length = 587
Score = 336 bits (861), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 177/447 (39%), Positives = 266/447 (59%), Gaps = 8/447 (1%)
Query: 51 IQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
+ SA+ VK LG Q W GGD ++S+GGG KV L+K + + +D++IL T+ +D
Sbjct: 1 MNSAKYFNYTVKVLGQGQEWRGGDGINSIGGGQKVRLMKEAMAQYASQEDLVILFTECFD 60
Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
V+ GG ++L++F + IVF A+ + WPD L +KYP V G RYLNSGGFIGYA
Sbjct: 61 VVFAGGPEEVLKKFQKTNHKIVFAADGILWPDKRLAEKYPVVHIGKRYLNSGGFIGYAPY 120
Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
I L+ ++++ +DDQL+Y +++D R I LD +FQ L G+ +++ L F+
Sbjct: 121 ISHLVQEWNLQDNDDDQLFYTKVYIDPVKREAFNITLDHKCKIFQALNGATDEVVLKFEN 180
Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
+ + NT Y T PV I+GNG +KI LN FGNY+ SW + GC C+ +D D
Sbjct: 181 GK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQEHGCALCDF-DTIDLSAVD 238
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
P V I VFI++PT FL FLN + +L+YP + + +F++N + YH + K
Sbjct: 239 VHPKVTIGVFIEQPTPFLPRFLNLLLSLDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHE 298
Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESL 407
+K + ++ EARN+ ++ + D+YF VD+D L NP LK L+ +N +
Sbjct: 299 ISTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSVDADVVLTNPRTLKNLIEQNRKI 358
Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
IAPL+ R K WSNFWGAL+ DG+YARS DY++I+ G + GIWNVPY+ N YL++
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGKR--VGIWNVPYMANVYLIQGKT 416
Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLR 493
+++ + + + + +D DMA C N R
Sbjct: 417 LRSEMSERNYFVRDKLDPDMALCRNAR 443
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 84/143 (58%), Positives = 109/143 (76%), Gaps = 1/143 (0%)
Query: 592 DKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVR 651
D R+ GYE VPT DIHMKQ+GL VW F+R+++ P+ + F GY+ + A ++FVV+
Sbjct: 446 DSRISGGYENVPTDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVK 504
Query: 652 YRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRL 711
Y P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW MHPGRL
Sbjct: 505 YSPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRL 564
Query: 712 THYHEGLQVTQGTRYIMISFVDP 734
TH HEGL V GTRYI +SF+DP
Sbjct: 565 THLHEGLPVKNGTRYIAVSFIDP 587
>gi|326428759|gb|EGD74329.1| hypothetical protein PTSG_12434 [Salpingoeca sp. ATCC 50818]
Length = 853
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 227/736 (30%), Positives = 361/736 (49%), Gaps = 98/736 (13%)
Query: 77 SLGGGYKVNLLKNELDEMDITD--DMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGA 134
S G G L++ +++ T+ D+++ + D+ D ++ ++ +FN D I+F A
Sbjct: 135 SAGAGLGSKELRDVAEQLANTNPNDVVLFIGDAEDTVLLAEATELTRKFNALDCGILFPA 194
Query: 135 ERLCWPDTSLYDKYPAVGSGY-RYLNSGGFIGYAKDIKELISN----RSIKN-EEDDQLY 188
C ++ +P V G+ R+L F+ K L+ + ++ N E DQL
Sbjct: 195 AIRCKRRCAM--DWPLVPEGHGRFLVPSAFMAKGDKFKLLVDSFPDLSTVPNMSESDQLI 252
Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLED-------IKLNFDLDEFVHLTNTKYN 241
AL D R + + LDT +FQ L+G ++ + F +E L N +
Sbjct: 253 -ALFMTD---RDFYGMKLDTNFAVFQPLFGYKDEWPAAVFAAEYEFHSEEDTRLRNKDTS 308
Query: 242 TNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDK 301
P I+ +G +K+ L NY+ W T+ K L+S+ P+ ++ +V +
Sbjct: 309 EYPGILISHGNTKL-LTQLNNYMPLKWHPD--TQSLTSKTLESVDPNAAVTIAFNVLPES 365
Query: 302 PTAFLEEFLNKIANLN----YPAKKISMFVYNNQEYHAPLFDDYIHNF----KTMFKNVK 353
P FL+ L+ IA + +P ++ V N HA + + + NF K +F V
Sbjct: 366 P--FLQLVLDGIAAQDLLRTHPVTFVAAVVDNP---HAHTYVELVQNFTRDNKQLFAGVT 420
Query: 354 YIAHNSTVNSKEA-RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL 412
+ H +S++A R L + Y S + L N VL L+ ++ +++P++
Sbjct: 421 -VLHEPQEDSEQAMRKLFTVAMETQPTTHVLYHTSAARLMNSTVLGELLAQDLRVVSPMM 479
Query: 413 VRPFKAWSNFWGALNADG------------------------------------------ 430
R +SNFWGA D
Sbjct: 480 TREASFFSNFWGAATGDRDAQCFDDSAQCEAWAVAGECTKNEPYMKKHCQRSCEVCHAQG 539
Query: 431 -----FYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS----VIKATNIKTIYTLNS 481
Y RS DYM+II +Q G W VP ++ LMK + V+KA + +
Sbjct: 540 APENIKYRRSADYMSIIKAEQ--TGTWAVPLVSEVILMKLNAFNIVVKALSQLETQPGSP 597
Query: 482 MDYDMAFCT----NLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDL 537
+ +D LR+ + L +D+ YG L++ + F+ +P+V+ L N W
Sbjct: 598 LRFDFPLTAYLLDQLRSSKVKLHVDNRHFYGLLINPDGFNANSVHPDVFLLAGNEQHWRD 657
Query: 538 RYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET 597
YIHP+Y+ + V + C D++ FP+ +E FC F+ + EA G WS G+N+D RL++
Sbjct: 658 LYIHPDYEPYKKLEFVQGR-CWDIYNFPLFSELFCAHFIDVSEAVGTWSSGSNSDDRLKS 716
Query: 598 GYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQ 657
GYE VPTRDIH Q+G W LR++V P+ E +++GY + R + FVV+Y+P+ Q
Sbjct: 717 GYEPVPTRDIHFNQMGFQETWTAILRRFVAPVAETQWVGYKLDG-RVTLDFVVKYQPEGQ 775
Query: 658 PSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEG 717
P LR HHD+ST+++N+ALN++G D+EGGG RF R NC V +MG L+HPGRLTH HEG
Sbjct: 776 PFLRKHHDASTFSLNVALNRIGEDFEGGGTRFTRQNCTVLTNKMGHALIHPGRLTHQHEG 835
Query: 718 LQVTQGTRYIMISFVD 733
L VT+GTRYI++SFVD
Sbjct: 836 LYVTKGTRYIIVSFVD 851
>gi|345322955|ref|XP_001506269.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
[Ornithorhynchus anatinus]
Length = 501
Score = 332 bits (852), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 169/465 (36%), Positives = 271/465 (58%), Gaps = 28/465 (6%)
Query: 106 DSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIG 165
+SYDVI GG ++L +F + +VF A+ L WPD L DKYP V G R+LNSGGFIG
Sbjct: 26 ESYDVIFAGGPEELLRKFQKINHKVVFAADGLLWPDKRLADKYPIVHIGKRFLNSGGFIG 85
Query: 166 YAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKL 225
Y + +++ ++++ +DDQL+Y +++D R I LD +FQNL G+++++ L
Sbjct: 86 YGPSVNQIVQQWNLQDSDDDQLFYTKIYIDSIKRKAINITLDHKCRIFQNLNGAIDEVLL 145
Query: 226 NFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT-SGCTRCNLIKHLDS 284
F+ + V N+ Y T PV I+GNG +K +LN FGNY+ +W +GCT CNL +D
Sbjct: 146 KFENGK-VRAKNSFYETLPVAINGNGPTKNQLNYFGNYIPNAWTIENGCTTCNL-DMIDL 203
Query: 285 LKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHN 344
+P V I VFI++PT FL FL+ + L+YP + +S+F++NN+ YH +
Sbjct: 204 TSSKDYPKVTIGVFIEQPTPFLPRFLDLLLTLDYPKEALSLFIHNNEVYHEKHIKAFWEK 263
Query: 345 FKTMFKNVKYIAHNSTVNSKEARNLAVE-NSLHKGVDFYFYVDSDSHLDNPDVLKYLVNR 403
K + +K + +++ EARN+ ++ ++ D+YF +D+D L NP L+ L+ +
Sbjct: 264 AKNIITTIKIVGPEESLSQAEARNMGMDVCRQNEHCDYYFSLDADVVLTNPSTLRLLIEQ 323
Query: 404 NESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
N +IAPL+ R K WSNFWG L+ DG+YARS DY++I+ G++ G+WN+PY+ N YL+
Sbjct: 324 NRKIIAPLVTRHGKLWSNFWGTLSPDGYYARSEDYVDIVQGNR--VGLWNIPYMANVYLI 381
Query: 464 KTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKI 501
K ++A + + + +D DMA C N R KG+ + I
Sbjct: 382 KGQTLRAEMKERNYFVRDKLDPDMALCKNAREMTLQREKDSPSPETFHMLRPPKGVFMYI 441
Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
+ E+G L+ + N++ N +++++ NP+DW +YI+ Y K
Sbjct: 442 SNRHEFGRLLSTANYNITHYNNDLWQIFENPVDWKEKYINRNYSK 486
>gi|241633659|ref|XP_002408696.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase, putative [Ixodes
scapularis]
gi|215501230|gb|EEC10724.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase, putative [Ixodes
scapularis]
Length = 285
Score = 328 bits (840), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 154/286 (53%), Positives = 199/286 (69%), Gaps = 2/286 (0%)
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
G+WNVP+I YL+ +++ + + + +D DMAFC N+R KGI + + + YGH
Sbjct: 1 GLWNVPFINTVYLINGTLLHSKDKFPSFISGLLDPDMAFCKNMREKGIFMYVTNMDTYGH 60
Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
LV+ E FD + NP+ YE+ N +DW+ RYIH Y K L PD + PCPDV+WFP+VT+
Sbjct: 61 LVNPETFDLKLKNPDFYEIYSNQMDWERRYIHENYSKVLEPDFKVDMPCPDVYWFPVVTD 120
Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYE-AVPTRDIHMKQVGLAGVWAEFLRKYVVP 628
FC ++IME +GQWS G N K L Y+ + + IH G+ W FLR+Y+ P
Sbjct: 121 IFCRHMIEIMENFGQWSSGKNEVKFLFFLYQQSSANKFIHFFIKGVQH-WLFFLREYIKP 179
Query: 629 LQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCR 688
+QE+ F+GY H+P RA M+FVVRY PDEQ LRPHHDSSTYTINIALN+ +DYEGGGC
Sbjct: 180 VQEKVFLGYFHDPPRAIMNFVVRYHPDEQYFLRPHHDSSTYTINIALNRPKIDYEGGGCN 239
Query: 689 FIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
F+RYNC+V + GW LMHPGRLTHYHEGL VT+GTRYIM+SFVDP
Sbjct: 240 FLRYNCSVVDLKRGWSLMHPGRLTHYHEGLPVTKGTRYIMVSFVDP 285
>gi|444510095|gb|ELV09466.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Tupaia
chinensis]
Length = 558
Score = 319 bits (817), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 182/479 (37%), Positives = 265/479 (55%), Gaps = 50/479 (10%)
Query: 19 ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
+ V K +I DK LVITVA+ E+DG+ RF+QSA+ VK LG + W GGD ++S
Sbjct: 24 LGVDSEKPVSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83
Query: 78 LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
+GGG KV L+K L + DD+++L T+ +DV+ GG ++L++F + +VF A+ +
Sbjct: 84 IGGGQKVRLMKEALGQYASQDDLVVLFTECFDVVFAGGPEEVLKKFQKTNHKVVFAADGI 143
Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
WPD L DKYP V G RYLNSGGFIGYA I ++ ++++ +DDQL+Y +++D
Sbjct: 144 LWPDKRLADKYPTVHFGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 203
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R I LD +FQ L G+ +++ L F+ + NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGATDEVVLKFENGK-ARAKNTFYETLPVTINGNGPTKILL 262
Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
N FGNY+ SW + +GCT C D++ V
Sbjct: 263 NYFGNYIPNSWTQENGCTHC----ESDTINLSAVDEV----------------------- 295
Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
Y K I +F FD H T +K + + EARN+ ++
Sbjct: 296 -YHEKDIKVF-----------FDKAKHEIST----IKIVGPEENLRQAEARNMGMDFCRQ 339
Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+YF VD+D L NP LK L+ +N +IAPL+ R K WSNFWGAL+ DG+YARS
Sbjct: 340 DEKCDYYFSVDADVVLTNPKTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 399
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLR 493
DY++I+ G++ G+WNVPY+ N YL+K +++ N + + + +D DMA C N R
Sbjct: 400 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAR 456
Score = 108 bits (271), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 50/92 (54%), Positives = 67/92 (72%), Gaps = 1/92 (1%)
Query: 592 DKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVR 651
D R+ GYE VPT DIHMKQ+ L VW F+R+++ P+ + F GY+ + A ++FVV+
Sbjct: 459 DSRISGGYENVPTDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVK 517
Query: 652 YRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
Y PD Q SLRPHHD+ST+TINIALN VG D++
Sbjct: 518 YSPDRQRSLRPHHDASTFTINIALNNVGEDFQ 549
>gi|339235621|ref|XP_003379365.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Trichinella
spiralis]
gi|316977983|gb|EFV61016.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Trichinella
spiralis]
Length = 1093
Score = 315 bits (808), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 193/554 (34%), Positives = 280/554 (50%), Gaps = 139/554 (25%)
Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
+YP V SG RYLNSG FIGYA DI ++I+ RS+++++DDQLYY +FLD LR KHKI L
Sbjct: 296 EYPVVKSGKRYLNSGAFIGYAPDIYKIITERSLRDDDDDQLYYTHIFLDPALREKHKIKL 355
Query: 207 DTLANLFQNLYGSLEDIKLNFDLD----EFVHLTNTKYNTNPVIIHGNGKSKIELNSFGN 262
D+ + +FQNL+G+++D+ L+F V L N Y T PVIIHGNGKSK+ LN GN
Sbjct: 356 DSTSAIFQNLHGAVDDVDLDFSPSGHRMRQVRLANLAYGTEPVIIHGNGKSKMHLNYLGN 415
Query: 263 YLAKSWK-TSGCTRCN-LIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPA 320
Y+ W T GC CN + L+S + FP V+++ FI+ T FL+++ I L+YP
Sbjct: 416 YIGNWWNPTDGCVACNDDLLELNSDNENDFPFVVLACFINSGTPFLDKYFESILRLDYPK 475
Query: 321 KKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKE--ARNLAVENSLHKG 378
+I + ++N HA + +++ M ++ +S ++ E AR+ AV
Sbjct: 476 SRIGIVIFNRP--HAVKVEHFVN---LMDGEYHFVQADSAISLTERNARDRAV------- 523
Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDY 438
SLIAP+++R WSNFWGALN DGFYARS DY
Sbjct: 524 ---------------------------SLIAPMMIRGEALWSNFWGALNDDGFYARSDDY 556
Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-DYDMAFCTNLRNKGI 497
++I ++ G+WN+P+ + YL++ + + + + Y+ N D DM+F R K
Sbjct: 557 ISIAKRER--LGLWNIPHFSTAYLIRKD--RLSLLLSAYSYNGKNDPDMSFTQFCREK-- 610
Query: 498 HLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQP 557
+W+ RY+ +Y +L D P
Sbjct: 611 ------------------------------------EWEERYLDEKYWDTLSNDYEFELP 634
Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGV 617
CPDV+ FP+ +++FC E + +ME YG+WS G+N
Sbjct: 635 CPDVYHFPLFSKQFCKEMIAVMENYGRWSSGSN--------------------------- 667
Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
P A M+FVVRY+PDEQP+LRPHHD+STYT++IALN+
Sbjct: 668 ----------------------LPPHAIMNFVVRYKPDEQPALRPHHDASTYTVDIALNK 705
Query: 678 VGVDYEGGGCRFIR 691
G D+E R R
Sbjct: 706 AGEDFEVQMSRVGR 719
>gi|363539911|ref|YP_004894378.1| mg327 gene product [Megavirus chiliensis]
gi|350611076|gb|AEQ32520.1| putative procollagen-lysine 2-oxoglutarate 5-dioxygenase [Megavirus
chiliensis]
Length = 889
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 230/731 (31%), Positives = 362/731 (49%), Gaps = 116/731 (15%)
Query: 30 DEDK-FLVITVASNETDGYKRFIQSAEVNKLQVKTL-GLHQPWLGGDMSSLGGGYKVNLL 87
D DK F +I + D + RFI+ E+ L L ++ P D+S++
Sbjct: 248 DTDKEFSIIYIGPTNGDSFARFIEYCELYSLPRIILDSINMP----DISTI--------- 294
Query: 88 KNELDEMDITDDMIILV-----TDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
+L E+D +D + +V D+ + I +I+ +FN+ I + P+
Sbjct: 295 HKKLAEIDNLEDKLFVVISVLPNDNCNFIPTAPPTEIINKFNS----ICHNKNGIIIPN- 349
Query: 143 SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISN-----RSIKNEEDDQLYYALLFLDET 197
G + + F G+ I+ +I + +KN + L A++F T
Sbjct: 350 ---------GETSKTI----FCGWGNRIQRMIQDYLDKVDIVKNITNAALSTAIIF--NT 394
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
T I+ D +F + S +DI N + H K+ T P I+ N + L
Sbjct: 395 FITS-DIIKDDTCQIFCCV-NSEDDIVYNTTKSKISH---KKFGTTPSILFSNEIGNLVL 449
Query: 258 NSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQ-FPSVLISVFIDKPTAFLEEFLNKIANL 316
N NY +W R + +P P+V IS+ DK + ++ I +
Sbjct: 450 NRIQNYTGNNWNEYYGYR-------NHTEPKTILPTVYISILSDKNPSVVD----IIQTI 498
Query: 317 NYPAKKISMFV----YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE 372
+YP + +++ + N+ Y L KYIA N
Sbjct: 499 DYPRELLTVVITKGTINDNYYQEDL--------------EKYIATN-------------- 530
Query: 373 NSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFY 432
++YF+++ D L NP+VLK L+N N+ +IAPL+ R ++W+NFWG L+ +G+Y
Sbjct: 531 ------CEYYFFINHDCILVNPNVLKELINLNKKIIAPLIRRGDESWTNFWGDLDKNGYY 584
Query: 433 ARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-MDYDMAFCTN 491
RS DY +IING++ +G WNVP++ YL+ SVI++ + ++T N+ +D DM C N
Sbjct: 585 KRSHDYFDIINGER--RGCWNVPHVFGTYLIHRSVIES--VPDMFTKNTDIDADMRMCHN 640
Query: 492 LRNKGIHLKIDSTQEYGHLVDSENFDPQ-KTNPE--VYELIRNPLDWDLRYIHPEY--QK 546
+R IH+ + + YG++ P+ TN E V++ +W+ +Y+HPEY K
Sbjct: 641 IRQHDIHIYLSNLNSYGYIQTELQIAPEIDTNKEVTVFDFSTRRSEWEKKYLHPEYFLNK 700
Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGT--NNDKRL-ETGYEAVP 603
+ L + + C DVF FP+ + +FC E +Q ME YG+WS G N D RL YE VP
Sbjct: 701 NNLKNLRCTELCNDVFNFPLFSREFCSELIQTMEKYGKWSGGAGHNIDHRLGHNYYENVP 760
Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
T+DI + +VGL W + +Y+ PL + Y + V ++FVVRY +Q L+ H
Sbjct: 761 TQDIQLFEVGLDKHWESIVNEYIAPLVRIVYSNYKTKSVH--LAFVVRYHWQQQSELQEH 818
Query: 664 HDSSTYTINIALNQV-GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
HD+STYTINIALN+ G DYEGGG RFIR N + +G +HPG+ THYH+GL+ T
Sbjct: 819 HDASTYTINIALNEGDGKDYEGGGSRFIRQNYSSINQEIGTANLHPGKCTHYHKGLKTTA 878
Query: 723 GTRYIMISFVD 733
G RYI++SF++
Sbjct: 879 GIRYILVSFIN 889
>gi|448825278|ref|YP_007418209.1| putative procollagen-lysine 2-oxoglutarate 5-dioxygenase [Megavirus
lba]
gi|444236463|gb|AGD92233.1| putative procollagen-lysine 2-oxoglutarate 5-dioxygenase [Megavirus
lba]
Length = 889
Score = 302 bits (773), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 228/730 (31%), Positives = 361/730 (49%), Gaps = 114/730 (15%)
Query: 30 DEDK-FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLK 88
D DK F +I + D + RFI+ E+ L P + D ++ ++ +
Sbjct: 248 DTDKEFSIIYIGPTNGDSFARFIEYCELYSL---------PRIILDSINIPD---ISTIH 295
Query: 89 NELDEMDITDDMIILV-----TDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
+L E+D +D + +V D+ + I +I+ +FN+ I + P+
Sbjct: 296 KKLAEIDNLEDKLFVVISVLPNDNCNFIPTAPPTEIINKFNS----ICHNKNGIIIPN-- 349
Query: 144 LYDKYPAVGSGYRYLNSGGFIGYAKDIKELISN-----RSIKNEEDDQLYYALLFLDETL 198
G + + F G+ I+ +I + +KN + L A++F T
Sbjct: 350 --------GETSKTI----FCGWGNRIQRMIQDYLDKVDIVKNITNAALSTAIIF--NTF 395
Query: 199 RTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELN 258
T I+ D +F + S +DI N + H K+ T P I+ N + LN
Sbjct: 396 ITS-DIIKDDTCQIFCCV-NSEDDIVYNTTKSKISH---KKFGTTPSILFSNEIGNLVLN 450
Query: 259 SFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQ-FPSVLISVFIDKPTAFLEEFLNKIANLN 317
NY +W R + +P P+V IS+ DK + ++ I ++
Sbjct: 451 RIQNYTGNNWNEYYGYR-------NHTEPKTILPTVYISILSDKNPSVVD----IIQTID 499
Query: 318 YPAKKISMFV----YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVEN 373
YP + +++ + N+ Y L KYIA N
Sbjct: 500 YPRELLTVVITKGTINDNYYQEDL--------------EKYIATN--------------- 530
Query: 374 SLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYA 433
++YF+++ D L NP+VLK L+N N+ +IAPL+ R ++W+NFWG L+ +G+Y
Sbjct: 531 -----CEYYFFINHDCILVNPNVLKELINLNKKIIAPLIRRGDESWTNFWGDLDKNGYYK 585
Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-MDYDMAFCTNL 492
RS DY +IING++ +G WNVP++ YL+ SVI++ + ++T N+ +D DM C N+
Sbjct: 586 RSHDYFDIINGER--RGCWNVPHVFGTYLIHRSVIES--VPDMFTKNTDIDADMRMCHNI 641
Query: 493 RNKGIHLKIDSTQEYGHLVDSENFDPQ-KTNPE--VYELIRNPLDWDLRYIHPEY--QKS 547
R IH+ + + YG++ P+ TN E V++ +W+ +Y+HPEY K+
Sbjct: 642 RQHDIHIYLSNLNSYGYIQTELQIAPEIDTNKEVTVFDFSTRRSEWEKKYLHPEYFLNKN 701
Query: 548 LLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGT--NNDKRL-ETGYEAVPT 604
L + + C DVF FP+ + +FC E +Q ME YG+WS G N D RL YE VPT
Sbjct: 702 NLKNLRCTELCNDVFNFPLFSREFCSELIQTMEKYGKWSGGAGHNIDHRLGHNYYENVPT 761
Query: 605 RDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH 664
+DI + +VGL W + +Y+ PL + Y + V ++FVVRY +Q L+ HH
Sbjct: 762 QDIQLFEVGLDKHWESIVNEYIAPLVRIVYSNYKTKSVH--LAFVVRYHWQQQSELQEHH 819
Query: 665 DSSTYTINIALNQV-GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
D+STYTINIALN+ G DYEGGG RFIR N + +G +HPG+ THYH+GL+ T G
Sbjct: 820 DASTYTINIALNEGDGKDYEGGGSRFIRQNYSSINQEIGTANLHPGKCTHYHKGLKTTAG 879
Query: 724 TRYIMISFVD 733
RYI++SF++
Sbjct: 880 IRYILVSFIN 889
>gi|371943602|gb|AEX61430.1| putative procollagen-lysine [Megavirus courdo7]
Length = 889
Score = 302 bits (773), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 229/731 (31%), Positives = 362/731 (49%), Gaps = 116/731 (15%)
Query: 30 DEDK-FLVITVASNETDGYKRFIQSAEVNKLQVKTL-GLHQPWLGGDMSSLGGGYKVNLL 87
D DK F +I + D + RFI+ E+ L L ++ P D+S++
Sbjct: 248 DTDKEFSIIYIGPTNGDSFARFIEYCELYSLPRIILDSINMP----DISTI--------- 294
Query: 88 KNELDEMDITDDMIILV-----TDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
+L E+D +D + +V D+ + I +I+ +FN+ I + P+
Sbjct: 295 HKKLAEIDNLEDKLFVVISVLPNDNCNFIPTAPPTEIINKFNS----ICHNKNGIIIPN- 349
Query: 143 SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISN-----RSIKNEEDDQLYYALLFLDET 197
G + + F G+ I+ +I + +KN + L A++F T
Sbjct: 350 ---------GETSKTI----FCGWGNRIQRMIQDYLDKVDIVKNITNAALSTAIMF--NT 394
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
T I+ D +F + S +DI N + H K+ T P I+ N + L
Sbjct: 395 FITS-DIIKDDTCQIFCCV-NSEDDIIYNTTKSKISH---KKFGTTPSILFSNEIGNLVL 449
Query: 258 NSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQ-FPSVLISVFIDKPTAFLEEFLNKIANL 316
N NY +W R + +P P+V IS+ DK ++ ++ I +
Sbjct: 450 NRIQNYTGNNWNEYYGYR-------NHTEPKTILPTVYISILSDKNSSVVD----IIQTI 498
Query: 317 NYPAKKISMFV----YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE 372
+YP + +++ + N+ Y L KYIA N
Sbjct: 499 DYPRELLTVVITKGTINDNYYQEDL--------------EKYIATN-------------- 530
Query: 373 NSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFY 432
++YF+++ D L NP+VLK L+N N+ +IAPL+ R ++W+NFWG L+ +G+Y
Sbjct: 531 ------CEYYFFINHDCILVNPNVLKELINLNKKIIAPLIRRGDESWTNFWGDLDKNGYY 584
Query: 433 ARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-MDYDMAFCTN 491
RS DY +IING++ +G WNVP++ YL+ SVI++ + ++T N+ +D DM C N
Sbjct: 585 KRSHDYFDIINGER--RGCWNVPHVFGTYLIHRSVIES--VPDMFTKNTDIDADMRMCHN 640
Query: 492 LRNKGIHLKIDSTQEYGHLVDSENFDPQ-KTNPE--VYELIRNPLDWDLRYIHPEY--QK 546
+R IH+ + + YG++ P+ N E V++ +W+ +Y+HPEY K
Sbjct: 641 IRQHDIHIYLSNLNSYGYIQTELQIAPEIDINKEVTVFDFSTRRSEWEKKYLHPEYFLNK 700
Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGT--NNDKRL-ETGYEAVP 603
+ L + + C DVF FP+ + +FC E +Q ME YG+WS G N D RL YE VP
Sbjct: 701 NNLKNLRCTELCNDVFNFPLFSREFCSELIQTMEKYGKWSGGAGHNIDHRLGHNYYENVP 760
Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
T+DI + +VGL W + +Y+ PL + Y + V ++FVVRY +Q L+ H
Sbjct: 761 TQDIQLFEVGLDKHWESIVNEYIAPLVRIVYSNYKTKSVH--LAFVVRYHWQQQSELQEH 818
Query: 664 HDSSTYTINIALNQV-GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
HD+STYTINIALN+ G DYEGGG RFIR N + +G +HPG+ THYH+GL+ T
Sbjct: 819 HDASTYTINIALNEGDGKDYEGGGSRFIRQNYSSINQEIGTANLHPGKCTHYHKGLKTTA 878
Query: 723 GTRYIMISFVD 733
G RYI++SF++
Sbjct: 879 GIRYILVSFIN 889
>gi|451927620|gb|AGF85498.1| family 25 protein [Moumouvirus goulette]
Length = 890
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 188/510 (36%), Positives = 280/510 (54%), Gaps = 59/510 (11%)
Query: 235 LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPD-QFPSV 293
+T K T P +++ +G S I LN NY +W R +S +P +P++
Sbjct: 429 ITYNKTGTMPCVLYSSGMSNIILNRIQNYTGNNWNEYYGYR-------NSSEPLLTYPTI 481
Query: 294 LISVFIDK-PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNV 352
+S +DK PT N I NL YP + +++ + Q LF Y +
Sbjct: 482 YLSFRLDKNPT-----ITNIIENLEYPKELVTINIETGQ--GGDLF--YQQDI------- 525
Query: 353 KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL 412
N +NSK ++YF+V+ D + NP +LK L+ + ++APL+
Sbjct: 526 -----NKFLNSK--------------CEYYFFVNHDCVIVNPKILKELLELGKKVVAPLV 566
Query: 413 VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATN 472
+ ++WSNFWG L+ +G+Y RS DY +I+NG++ +G WNVPYI+ YL+ SVI+
Sbjct: 567 RKGTESWSNFWGDLDKNGYYNRSHDYFDILNGER--RGCWNVPYISGVYLIHRSVIEL-- 622
Query: 473 IKTIYTLNS-MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQK--TNP-EVYEL 528
+ I++ N +D DM C NLR IHL + + YG + + DP T P +++
Sbjct: 623 VPNIFSDNEKIDIDMRMCHNLREHDIHLYVSNINSYGFIQEEIKIDPNLDLTKPLTIHDF 682
Query: 529 IRNPLDWDLRYIHPEY--QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWS 586
+W+ +Y+HPE+ K+ L + + C DVF FP+ +++FC E +QIME YG+WS
Sbjct: 683 STRRDEWERKYLHPEFYLNKNNLKNLRCPELCSDVFNFPLFSKEFCSELIQIMEKYGKWS 742
Query: 587 DGT--NNDKRL-ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVR 643
GT N D RL YE VPT+DI + +VGL W + Y+ PL + + Y + V
Sbjct: 743 GGTGHNIDHRLGHNYYENVPTQDIQLFEVGLDKHWETIVMDYIAPLVKIIYGNYKTKSVH 802
Query: 644 APMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
++FVVRY Q L+ HHD+STYT+NIALN+ G DYEGGGC FIR +G
Sbjct: 803 --LAFVVRYHWQFQNELQEHHDASTYTVNIALNECGTDYEGGGCEFIRQKYVAKNQEIGT 860
Query: 704 MLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
+HPGRLTH H+GL+ T GTRYI++SF++
Sbjct: 861 SNIHPGRLTHLHKGLKTTNGTRYILVSFIN 890
>gi|441432191|ref|YP_007354233.1| Glycosyltransferase family 25 fused to procollagen lysine
2-oxoglutarate 5-dioxygenase [Acanthamoeba polyphaga
moumouvirus]
gi|440383271|gb|AGC01797.1| Glycosyltransferase family 25 fused to procollagen lysine
2-oxoglutarate 5-dioxygenase [Acanthamoeba polyphaga
moumouvirus]
Length = 889
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 185/510 (36%), Positives = 274/510 (53%), Gaps = 59/510 (11%)
Query: 235 LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPD-QFPSV 293
+T+ K + P I++ +G S I LN NY +W R +S +P +P+V
Sbjct: 428 ITHNKTGSMPCILYSSGISNIILNRIQNYTGNNWNEYYGFR-------NSSEPLLTYPTV 480
Query: 294 LISVFIDK-PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNV 352
+S +DK PT + I L YP + +++ + N + I+ F
Sbjct: 481 YLSFRLDKNPT-----ITDIIEKLEYPKELMTINIENGST-EDLFYQKDINKF------- 527
Query: 353 KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL 412
L ++YF+V+ D L NP +LK L+ + +IAPL+
Sbjct: 528 ----------------------LESKCEYYFFVNHDCVLINPKILKELLELGKKVIAPLV 565
Query: 413 VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATN 472
+ ++WSNFWG + +G+Y RS DY +I+NG++ +G WNVPYI+ YL+ SVIK+
Sbjct: 566 RKGTESWSNFWGDIQENGYYNRSHDYFDILNGER--RGCWNVPYISGVYLIHRSVIKS-- 621
Query: 473 IKTIYTLNS-MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDP--QKTNP-EVYEL 528
I I+ N +D DM C NLR IH+ + + YG + + DP T P +++L
Sbjct: 622 IPNIFIDNEKIDVDMRICHNLRQHDIHMYVSNINSYGFIQEEIKIDPTIDLTKPVTIHDL 681
Query: 529 IRNPLDWDLRYIHPEY--QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWS 586
+W+ +Y+HPEY K+ L + + C DVF FP+ +++FC E +QIME YG+WS
Sbjct: 682 FTRRDEWERKYLHPEYYLNKNNLKNLRCPELCSDVFNFPLFSKEFCSELIQIMENYGKWS 741
Query: 587 DGTNN--DKRL-ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVR 643
GT + D RL YE VPT+DI + +VGL W + Y+ PL + Y + V
Sbjct: 742 GGTGHHIDHRLGHNYYENVPTQDIQLFEVGLDKHWETIVMDYISPLVRIIYGNYKTKSVH 801
Query: 644 APMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
++FVVRY Q L+ HHD+STYT+NIALN+ G DYEGGGC FIR +G
Sbjct: 802 --LAFVVRYHWQLQNELQEHHDASTYTVNIALNECGTDYEGGGCEFIRQKYIAKNQEVGT 859
Query: 704 MLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
+HPGRLTH H+GL+ T G RYI++SF++
Sbjct: 860 SNIHPGRLTHLHKGLKTTNGIRYILVSFIN 889
>gi|371945194|gb|AEX63014.1| putative procollagen-lysine [Moumouvirus Monve]
Length = 889
Score = 295 bits (756), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 184/510 (36%), Positives = 274/510 (53%), Gaps = 59/510 (11%)
Query: 235 LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPD-QFPSV 293
+T+ K + P I++ +G S I LN NY +W R +S +P +P+V
Sbjct: 428 ITHNKTGSMPCILYSSGISNIILNRIQNYTGNNWNEYYGFR-------NSSEPLLTYPTV 480
Query: 294 LISVFIDK-PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNV 352
+S +DK PT + I L YP + +++ + N + I+ F
Sbjct: 481 YLSFRLDKNPT-----ITDIIEKLEYPKELMTINIENGST-EDLFYQKDINKF------- 527
Query: 353 KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL 412
L ++YF+V+ D L NP +LK L+ + +IAPL+
Sbjct: 528 ----------------------LESKCEYYFFVNHDCVLINPKILKELLELGKKVIAPLV 565
Query: 413 VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATN 472
+ ++WSNFWG + +G+Y RS DY +I+NG++ +G WNVPYI+ YL+ SVI++
Sbjct: 566 RKGTESWSNFWGDIQENGYYNRSHDYFDILNGER--RGCWNVPYISGVYLIHRSVIES-- 621
Query: 473 IKTIYTLNS-MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDP--QKTNP-EVYEL 528
I I+ N +D DM C NLR IH+ + + YG + + DP T P +++L
Sbjct: 622 IPNIFIDNEKIDVDMRICHNLRQHDIHMYVSNINSYGFIQEEIKIDPTIDLTKPVTIHDL 681
Query: 529 IRNPLDWDLRYIHPEY--QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWS 586
+W+ +Y+HPEY K+ L + + C DVF FP+ +++FC E +QIME YG+WS
Sbjct: 682 FTRRDEWERKYLHPEYYLNKNNLKNLRCPELCSDVFNFPLFSKEFCSELIQIMENYGKWS 741
Query: 587 DGTNN--DKRL-ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVR 643
GT + D RL YE VPT+DI + +VGL W + Y+ PL + Y + V
Sbjct: 742 GGTGHHIDHRLGHNYYENVPTQDIQLFEVGLDKHWETIVMDYISPLVRIIYGNYKTKSVH 801
Query: 644 APMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
++FVVRY Q L+ HHD+STYT+NIALN+ G DYEGGGC FIR +G
Sbjct: 802 --LAFVVRYHWQLQNELQEHHDASTYTVNIALNECGTDYEGGGCEFIRQKYIAKNQEVGT 859
Query: 704 MLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
+HPGRLTH H+GL+ T G RYI++SF++
Sbjct: 860 SNIHPGRLTHLHKGLKTTNGIRYILVSFIN 889
>gi|326426536|gb|EGD72106.1| PLOD2 protein [Salpingoeca sp. ATCC 50818]
Length = 527
Score = 291 bits (745), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 166/480 (34%), Positives = 258/480 (53%), Gaps = 47/480 (9%)
Query: 282 LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNY--PAKKISMFVYNNQEYH----- 334
LDS++ P V ++V + + + FLE L+ + +Y A I++ + ++H
Sbjct: 66 LDSIEDIHVP-VHMAVLVYEGSPFLEYVLSSLEQQHYVKDALTITLLLAPGMDWHLNHQL 124
Query: 335 -APLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDN 393
+ D+ + + + + H++ V E+ H +DS S L N
Sbjct: 125 ASSWQSDHSNKYAAIHVHSPASLHDAVVELVES---------HDSAQHLLLMDSRSRLTN 175
Query: 394 PDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN------------ADGFYARSFDYMNI 441
PD LK+L++ ++ +AP+LVR K WSNFW A + A+ Y RS Y++I
Sbjct: 176 PDTLKHLISLDKPAVAPMLVRQGKWWSNFWDAASQFHDVSPADFSPANVGYVRSNRYLDI 235
Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNL----RNKGI 497
+ D+ G++ VP C L++ I A N+ + + F L + +
Sbjct: 236 V--DRKQTGVFIVPLAFGCLLVRPDTIPAMKRALSAMPNTANAEWVFHLTLAYYLHQQQV 293
Query: 498 HLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY----QKSLLPDTV 553
+ + + EYGHL++ FD K +P+++ + NP +W +Y++ Y + L+P+
Sbjct: 294 PIAVSNLLEYGHLINPTGFDSTKAHPDLFLVEENPAEWADKYLNELYWSFEEHGLIPN-- 351
Query: 554 NNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVG 613
C DVF P+ + F ++ E +GQWS+G N D+R++ GYE VPT+DIH Q+G
Sbjct: 352 ----CTDVFKVPMFSPAFARNLIEECEHFGQWSNGDNKDERIQGGYEPVPTQDIHFNQIG 407
Query: 614 LAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINI 673
W LR+++ P+ + GY E R + FVVRYRPD+Q LRPHHD+ST T+N+
Sbjct: 408 FNNAWRFILRRFLRPVTSHYYTGYTLEG-RTTLDFVVRYRPDKQNYLRPHHDASTVTLNV 466
Query: 674 ALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
ALNQ GVDY+GGG RFIR NC + T GW + PGRLTH HEGL+ T GTRYI++SF+D
Sbjct: 467 ALNQGGVDYQGGGTRFIRQNCTLINTPPGWGTLSPGRLTHLHEGLKTTAGTRYILVSFID 526
>gi|324519915|gb|ADY47513.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase [Ascaris suum]
Length = 249
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 132/249 (53%), Positives = 178/249 (71%)
Query: 486 MAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQ 545
M+FC R+ G + +D+ YG LV S++FD K +PE+Y++ NP W+ RYIH +Y
Sbjct: 1 MSFCEFARHSGHFMYVDNRNYYGFLVVSDDFDTTKLHPEMYQIFDNPDLWESRYIHEKYF 60
Query: 546 KSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTR 605
+ ++PC DVF FP+++E FC E ++ ME YGQWS G N D RL GYE VPTR
Sbjct: 61 HARDGRIAIDEPCQDVFDFPLMSEAFCSELIEEMEHYGQWSSGKNQDDRLAGGYENVPTR 120
Query: 606 DIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHD 665
DIHM Q+G W L +YV P+QE+ FIGY +PV+A M FVVRYRPDEQ SL+PHHD
Sbjct: 121 DIHMNQIGFERHWLYMLDEYVRPIQEKLFIGYSQKPVQANMMFVVRYRPDEQSSLKPHHD 180
Query: 666 SSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
+STY+I++ALN+ G+DY+GGG R++RYNC V A ++G+ ++ PGRLTH HEGL T+GTR
Sbjct: 181 ASTYSIDVALNKRGIDYQGGGVRYVRYNCTVDADQIGYSMIFPGRLTHLHEGLPTTEGTR 240
Query: 726 YIMISFVDP 734
YI +SF++P
Sbjct: 241 YIAVSFLNP 249
>gi|194379782|dbj|BAG58243.1| unnamed protein product [Homo sapiens]
Length = 391
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 130/244 (53%), Positives = 171/244 (70%), Gaps = 2/244 (0%)
Query: 491 NLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLP 550
R + + + + + GHL+ +++ + +++E+ NP DW +YIH Y K+L
Sbjct: 150 RFRQQDVFMFLTNRHTLGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAG 209
Query: 551 DTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMK 610
V PCPDV+WFPI TE C E V+ ME +GQWS G N D R++ GYE VPT DIHM
Sbjct: 210 KLVET-PCPDVYWFPIFTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMN 268
Query: 611 QVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYT 670
Q+G W +FL +Y+ P+ E+ + GY+ + ++FVVRY+PDEQPSL PHHD+ST+T
Sbjct: 269 QIGFEREWHKFLLEYIAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFT 327
Query: 671 INIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMIS 730
INIALN+VGVDYEGGGCRF+RYNC++ A R GW LMHPGRLTHYHEGL T+GTRYI +S
Sbjct: 328 INIALNRVGVDYEGGGCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVS 387
Query: 731 FVDP 734
FVDP
Sbjct: 388 FVDP 391
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 81/134 (60%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
ED LV+TVA+ ET+G++RF +SA+ +++ LGL + W +S GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL DSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 85 LEKHADKEDLVILFADSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144
Query: 151 VGSGYRYLNSGGFI 164
V G R+ F+
Sbjct: 145 VSDGKRFRQQDVFM 158
>gi|256079279|ref|XP_002575916.1| procollagen-lysine2-oxoglutarate 5-dioxygenase [Schistosoma
mansoni]
gi|360044866|emb|CCD82414.1| putative procollagen-lysine,2-oxoglutarate 5-dioxygenase
[Schistosoma mansoni]
Length = 921
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 129/245 (52%), Positives = 172/245 (70%), Gaps = 4/245 (1%)
Query: 493 RNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDT 552
R K + + +D+ +G+L D+ N+ K + ++++ + NP DW+ +YIHP+Y P+
Sbjct: 678 RRKNVFMFVDNQMSFGYLTDANNYTKGKLHNDLWQTMDNPQDWEEQYIHPQYFNFAKPEV 737
Query: 553 VNN---QPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHM 609
QPCPDVFWFP+V+E FC ++ +E YGQWS G N D RLE GYE VPTRDIHM
Sbjct: 738 TMTDIAQPCPDVFWFPLVSETFCKHLIEEVENYGQWSTGDNYDPRLEGGYENVPTRDIHM 797
Query: 610 KQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY 669
+Q+G W L KYV +Q++ F GY +P A M+FVVRY+PDEQPSLRPHHD+S+Y
Sbjct: 798 RQIGWEEHWLHVLEKYVHKIQKKLFQGYDDKPW-ARMNFVVRYKPDEQPSLRPHHDASSY 856
Query: 670 TINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
TINI LNQ G DY+GGG R+ RYNC++ TR+GW L+ PGR+TH HEGL T GTRYI +
Sbjct: 857 TINIGLNQPGKDYKGGGIRYNRYNCSIVDTRVGWALVSPGRVTHLHEGLPTTGGTRYIFV 916
Query: 730 SFVDP 734
+FV+P
Sbjct: 917 TFVNP 921
Score = 245 bits (626), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 137/356 (38%), Positives = 209/356 (58%), Gaps = 13/356 (3%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
D LV+TVA+ + D +RF++S +N +VK LG W GG ++ S GGG KVNLLK E
Sbjct: 342 DHVLVLTVATEKNDALQRFLRSCNLNGFKVKVLGEGSHWKGGHVAKSTGGGQKVNLLKEE 401
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L + D D +IL DSYDV+ V +LE + F + ++F AE CWP SL YP
Sbjct: 402 LAKGDYKPDQLILFVDSYDVVFMQNVAKLLEEYEKFKSKVIFSAEEFCWPQPSLQSSYPE 461
Query: 151 VGSG-YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V G RYLNSGGFIG ++ +++++ IK+++DDQLYY +FLD T RT + I LD
Sbjct: 462 VKPGEKRYLNSGGFIGPTANLIKIVNHEPIKDDDDDQLYYTKIFLDSTSRTLYDIELDKT 521
Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
+ +FQNL G+ D++L+F+ D +L N ++T P+I HGNG K+E NS NYLA SW
Sbjct: 522 SRIFQNLNGAFSDVELHFN-DVTGYLFNKIFSTTPIIAHGNGPIKVEFNSLSNYLAYSWS 580
Query: 270 -TSGCTRCNL--IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
T C +C+ I+ D L +P V++ +FI++ T F+E F +IA L+YP ++ +
Sbjct: 581 PTKNCQQCDEDNIEIQDIL---DYPLVVMGIFIEQGTPFIERFFERIAALSYPKSRLHVV 637
Query: 327 --VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARN--LAVENSLHKG 378
+ N + + + + + F + +V ++ N + +N + V+N + G
Sbjct: 638 GHMAENSRFQSAVAESFNQTFGHQYFSVNWLEENLDEETARRKNVFMFVDNQMSFG 693
>gi|425701200|gb|AFX92362.1| putative procollagen-lysine 2-oxoglutarate 5-dioxygenase [Megavirus
courdo11]
Length = 889
Score = 281 bits (719), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 229/731 (31%), Positives = 360/731 (49%), Gaps = 116/731 (15%)
Query: 30 DEDK-FLVITVASNETDGYKRFIQSAEVNKLQVKTL-GLHQPWLGGDMSSLGGGYKVNLL 87
D DK F +I + D + RFI+ E+ L L ++ P D+S++
Sbjct: 248 DTDKEFSIIYIGPTNGDSFARFIEYCELYSLPRIILDSINMP----DISTI--------- 294
Query: 88 KNELDEMDITDDMIILV-----TDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
+L E+D +D + +V D+ + I +I+ +FN+ I + P+
Sbjct: 295 HKKLAEIDNLEDKLFVVISVLPNDNCNFIPTAPPTEIINKFNS----ICHNKNGIIIPN- 349
Query: 143 SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISN-----RSIKNEEDDQLYYALLFLDET 197
G + + F G+ I+ +I + +KN + L A++F T
Sbjct: 350 ---------GETSKTI----FCGWGNRIQRMIQDYLDKVDIVKNITNAALSTAIMF--NT 394
Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
T I+ D +F + S +DI N + H K+ T P I+ N + L
Sbjct: 395 FITS-DIIKDDTCQIFCCV-NSEDDIIYNTTKSKISH---KKFGTTPSILFSNEIGNLVL 449
Query: 258 NSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQ-FPSVLISVFIDKPTAFLEEFLNKIANL 316
N NY +W R + +P P+V IS+ DK + ++ I +
Sbjct: 450 NRIQNYTGNNWNEYYGYR-------NHTEPKTILPTVYISILSDKNPSVVD----IIQTI 498
Query: 317 NYPAKKISMFV----YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE 372
+YP + +++ + N+ Y L KYIA N
Sbjct: 499 DYPRELLTVVITKGTINDNYYQEDL--------------EKYIATN-------------- 530
Query: 373 NSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFY 432
++YF+++ D L NP+VLK L+N N+ +IAPL+ R ++W+NFWG L+ +G+Y
Sbjct: 531 ------CEYYFFINHDCILVNPNVLKELINLNKKIIAPLIRRGDESWTNFWGDLDKNGYY 584
Query: 433 ARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-MDYDMAFCTN 491
RS DY +IING++ +G WNVP++ YL+ SVI++ + ++T N+ +D DM C N
Sbjct: 585 KRSHDYFDIINGER--RGCWNVPHVFGTYLIHRSVIES--VPDMFTKNTDIDADMRMCHN 640
Query: 492 LRNKGIHLKIDSTQEYGHLVDSENFDPQ-KTNPE--VYELIRNPLDWDLRYIHPEY--QK 546
+R IH+ + + YG++ P+ N E V++ +W+ +Y+HPEY K
Sbjct: 641 IRQHDIHIYLSNLNSYGYIQTELQIAPEIDINKEVTVFDFSTRRSEWEKKYLHPEYFLNK 700
Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGT--NNDKRL-ETGYEAVP 603
+ L + C DVF FP+ + +FC E +Q ME YG+WS G N D RL YE VP
Sbjct: 701 NNLKHLRCTELCNDVFNFPLFSREFCSELIQTMEKYGKWSGGAGHNIDHRLGHNYYENVP 760
Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
T+DI + +VGL W + +Y+ PL + Y + V ++FVVRY +Q L+ H
Sbjct: 761 TQDIQLFEVGLDKHWESIVNEYIAPLVRIVYSNYKTKSVH--LAFVVRYHWQQQSELQEH 818
Query: 664 HDSSTYTINIALNQ-VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
HD+STYTINIALN+ G DYEGGG RFIR N + +G +HPG+ THYH+GL+ T
Sbjct: 819 HDASTYTINIALNEGGGKDYEGGGSRFIRQNYSSINQEIGTANLHPGKCTHYHKGLKTTA 878
Query: 723 GTRYIMISFVD 733
G RYI++SF++
Sbjct: 879 GIRYILVSFIN 889
>gi|432962878|ref|XP_004086761.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like,
partial [Oryzias latipes]
Length = 375
Score = 279 bits (714), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 133/341 (39%), Positives = 211/341 (61%), Gaps = 4/341 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
+ LVIT A+ ETDG++RF+++A +V+ LGL + W GGD++ ++GGG KV LK E
Sbjct: 36 ENLLVITAATEETDGFRRFMRTAREFNYKVQVLGLGEDWRGGDVARTVGGGQKVRWLKKE 95
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L + +++I+ DSYDV++ G ++L +F+ +VF AE CWPD L KYP
Sbjct: 96 LLKHSEEAELVIMFVDSYDVVLAAGPGELLAKFSRLGHRVVFSAEGFCWPDQRLASKYPQ 155
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V SG RYLNSGGFIG+A D+ ++ ++K+++DDQL+Y ++LD R K I LD +
Sbjct: 156 VHSGKRYLNSGGFIGFAADLSAIVQQWTLKDDDDDQLFYTRIYLDRNQRNKFNITLDHRS 215
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+++++ L F+ + N Y+T PV+IHGNG +K++LN GNY+ +W
Sbjct: 216 QIFQNLNGAIDEVVLKFEKGR-ARVRNVAYDTLPVVIHGNGPTKLQLNYLGNYVPTAWTY 274
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GC C++ ++ D +Q P V ++VFI+ PT F+EEFL ++ LNYP ++ +F++
Sbjct: 275 ENGCGVCDIDLRLFDDTPDEQMPLVHLAVFIEHPTPFMEEFLERLTTLNYPHSRLRLFIH 334
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNL 369
NN YH ++ K +F N + + +AR +
Sbjct: 335 NNVVYHEQHIQNFWLRHKNLFPNALLVGPEENLEENQARTM 375
>gi|311977606|ref|YP_003986726.1| probable procollagen-lysine,2-oxoglutarate 5-dioxygenase
[Acanthamoeba polyphaga mimivirus]
gi|82000136|sp|Q5UQC3.1|PLOD_MIMIV RecName: Full=Procollagen lysyl hydroxylase and
glycosyltransferase; Short=LHGT; AltName: Full=Lysyl
hydroxylase; AltName:
Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase
gi|55416853|gb|AAV50503.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase [Acanthamoeba
polyphaga mimivirus]
gi|308204267|gb|ADO18068.1| probable procollagen-lysine,2-oxoglutarate 5-dioxygenase
[Acanthamoeba polyphaga mimivirus]
gi|339061161|gb|AEJ34465.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase [Acanthamoeba
polyphaga mimivirus]
Length = 895
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 212/732 (28%), Positives = 339/732 (46%), Gaps = 108/732 (14%)
Query: 28 NIDEDK-FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNL 86
N D DK F ++ + + + + RF + ++ L K + + D SL +
Sbjct: 246 NFDTDKQFRIVYIGPTKGNSFHRFTEYCKLYLLPYKVIDEKET---NDFVSLRSELQ--- 299
Query: 87 LKNELDEMDITDDMIILVT----DSYDVIIDGGVNDILERFN--TFDANIVFGAERLCWP 140
L E D+ ++++V+ D + I N+ ++++ T D N + A
Sbjct: 300 ---SLSEQDLNTTLMLVVSVNHNDFCNTIPCAPTNEFIDKYKQLTTDTNSIVSA------ 350
Query: 141 DTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIK----NEEDDQLYYALLFLDE 196
V +G N FIG+A I E I++ K N E D LL +
Sbjct: 351 ----------VQNG---TNKTMFIGWANKISEFINHYHQKLTESNAETDINLANLLLISS 397
Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSK-I 255
+ +V D NLFQ L DI + N K P +++ N S I
Sbjct: 398 ISSDFNCVVEDVEGNLFQ-LINEESDIVFSTTTSR----VNNKLGKTPSVLYANSDSSVI 452
Query: 256 ELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIA- 314
LN NY W H+ +K D P + +S+ I K + KIA
Sbjct: 453 VLNKVENYTGYGWNEY------YGYHVYPVKFDVLPKIYLSIRIVKNAN-----VTKIAE 501
Query: 315 NLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENS 374
L+YP + I++ + ++ H + I F
Sbjct: 502 TLDYPKELITVSISRSE--HDSFYQADIQKF----------------------------- 530
Query: 375 LHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN-ADGFYA 433
L G D+YFY+ D + P +LK L+ N+ + PL+ + ++W+N+WG ++ ++G+Y
Sbjct: 531 LLSGADYYFYISGDCIITRPTILKELLELNKDFVGPLMRKGTESWTNYWGDIDPSNGYYK 590
Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-----MDYDMAF 488
RSFDY +II D+ G WNVPY+ + YL+K SVI+ + ++T NS + DM
Sbjct: 591 RSFDYFDIIGRDR--VGCWNVPYLASVYLIKKSVIE--QVPNLFTENSHMWNGSNIDMRL 646
Query: 489 CTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPE---VYELIRNPLDWDLRYIHPEYQ 545
C NLR + + + + + YGH+ DS N + P +Y+L +W+ +Y+HPE+
Sbjct: 647 CHNLRKNNVFMYLSNLRPYGHIDDSINLEVLSGVPTEVTLYDLPTRKEEWEKKYLHPEFL 706
Query: 546 KSL--LPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNN--DKRLETGYEA 601
L D + C DV+ FP+ T FC E +++M+ WS G ++ D R+ G E+
Sbjct: 707 SHLQNFKDFDYTEICNDVYSFPLFTPAFCKEVIEVMDKANLWSKGGDSYFDPRI-GGVES 765
Query: 602 VPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLR 661
PT+D + +VGL W + YV P + Y + + ++FVV+Y + Q L
Sbjct: 766 YPTQDTQLYEVGLDKQWHYVVFNYVAPFVRHLYNNYKTKDIN--LAFVVKYDMERQSELA 823
Query: 662 PHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVT 721
PHHDSSTYT+NIALN+ G +Y GGC FIR+ ++G+ +H G+L YH L +T
Sbjct: 824 PHHDSSTYTLNIALNEYGKEYTAGGCEFIRHKFIWQGQKVGYATIHAGKLLAYHRALPIT 883
Query: 722 QGTRYIMISFVD 733
G RYI++SFV+
Sbjct: 884 SGKRYILVSFVN 895
>gi|351737377|gb|AEQ60412.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase [Acanthamoeba
castellanii mamavirus]
gi|398257080|gb|EJN40688.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase [Acanthamoeba
polyphaga lentillevirus]
Length = 895
Score = 275 bits (702), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 210/732 (28%), Positives = 339/732 (46%), Gaps = 108/732 (14%)
Query: 28 NIDEDK-FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNL 86
N D DK F ++ + + + + RF + ++ L K + + D SL +
Sbjct: 246 NFDTDKQFRIVYIGPTKGNSFHRFTEYCKLYLLPYKVIDEKET---NDFVSLRSELQ--- 299
Query: 87 LKNELDEMDITDDMIILVT----DSYDVIIDGGVNDILERFN--TFDANIVFGAERLCWP 140
L E D+ ++++V+ D + I N+ ++++ T D N + A
Sbjct: 300 ---SLSEQDLNTTLMLVVSVNHNDFCNTIPCAPTNEFIDKYKQLTTDTNSIVSA------ 350
Query: 141 DTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIK----NEEDDQLYYALLFLDE 196
V +G N F+G+A I E I++ K N E D LL +
Sbjct: 351 ----------VQNG---TNKTMFVGWANKISEFINHYHQKLTESNAETDINLANLLLISS 397
Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSK-I 255
+ +V D NLFQ L DI + N K P +++ N S I
Sbjct: 398 ISSDFNCVVEDIEGNLFQ-LINEESDIVFSTTTSR----VNNKLGKTPSVLYANSDSSVI 452
Query: 256 ELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIA- 314
LN NY W H+ +K D P + +S+ I K + KIA
Sbjct: 453 VLNKVENYTGYGWNEY------YGYHVYPVKFDVLPKIYLSIRILKNAN-----VTKIAE 501
Query: 315 NLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENS 374
L+YP + +++ + ++ H + I F
Sbjct: 502 TLDYPKELVTVSISRSE--HDNFYQADIQKF----------------------------- 530
Query: 375 LHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN-ADGFYA 433
L G D+YFY+ D + P +LK L+ N+ + PL+ + ++W+N+WG ++ ++G+Y
Sbjct: 531 LLSGADYYFYISGDCIITRPSILKELLELNKDFVGPLMRKGTESWTNYWGDIDPSNGYYK 590
Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-----MDYDMAF 488
RSFDY +II D+ G WNVPY+ + YL+K SVI+ + ++T NS + DM
Sbjct: 591 RSFDYFDIIGRDR--VGCWNVPYLASVYLIKKSVIE--QVPNLFTENSHMWNGSNIDMRL 646
Query: 489 CTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPE---VYELIRNPLDWDLRYIHPEYQ 545
C NLR + + + + + YGH+ DS N + P +Y+L +W+ +Y+HPE+
Sbjct: 647 CHNLRKNNVFMYLSNLRPYGHIDDSINLEVLSGVPTEVTLYDLPTRKEEWEKKYLHPEFL 706
Query: 546 KSL--LPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNN--DKRLETGYEA 601
L D + C DV+ FP+ T FC E +++M+ WS G ++ D R+ G E+
Sbjct: 707 NHLQNFKDFDYTEICNDVYSFPLFTPAFCKEVIEVMDKANLWSKGGDSYFDPRI-GGVES 765
Query: 602 VPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLR 661
PT+D + +VGL W + YV P + Y + + ++FVV+Y + Q L
Sbjct: 766 YPTQDTQLYEVGLDKQWHYVVFNYVAPFVRHLYNNYKTKDIN--LAFVVKYDMERQSELA 823
Query: 662 PHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVT 721
PHHDSSTYT+N+ALN+ G Y GGGC FIR+ ++G+ +H G+L YH L +T
Sbjct: 824 PHHDSSTYTLNVALNEYGSQYMGGGCEFIRHKFIWQGQKVGYATIHAGKLLAYHRALPIT 883
Query: 722 QGTRYIMISFVD 733
G RYI++SFV+
Sbjct: 884 SGKRYILVSFVN 895
>gi|167522232|ref|XP_001745454.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776412|gb|EDQ90032.1| predicted protein [Monosiga brevicollis MX1]
Length = 399
Score = 272 bits (695), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 142/369 (38%), Positives = 207/369 (56%), Gaps = 27/369 (7%)
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGF------------- 431
+ S + L N L +L+ + +IAPLLVR K WSNFWG+ A GF
Sbjct: 36 IHSHARLTNSSALSHLMATDYDVIAPLLVRQNKYWSNFWGS--ASGFAPAVAAQALADAD 93
Query: 432 ---YARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV---IKATNIKTIYTLNSMDYD 485
Y RS DY I+ Q G+W VP + ++ V +K Y
Sbjct: 94 RLGYMRSPDYYEIVERHQ--TGVWTVPVVFGAVVLSERVHDTLKEAAQDLAEGEAGWFYG 151
Query: 486 MAFCTNLRNK-GIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY 544
MA G +++ + +GH+++++ +D +P++Y NP +W+ Y+H EY
Sbjct: 152 MAALAAHLRHAGSLVRVTNEHRFGHMINTDAYDASHLHPDMYLAQDNPAEWEAVYLHEEY 211
Query: 545 QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPT 604
+ + + + C DV+ P ++ +F E ++ E G+WS+G + D RL+ GYE VPT
Sbjct: 212 NQ--FRELGDMEDCTDVYRVPALSARFAREMIEECENLGEWSNGQHTDNRLKGGYEPVPT 269
Query: 605 RDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH 664
+DIH +Q+G W FLR Y+ P+ ++GYH + R + FVVRYRPD+Q LRPHH
Sbjct: 270 QDIHFEQIGFKDTWQHFLRTYLGPVANHHYMGYHIQG-RTTLDFVVRYRPDKQSFLRPHH 328
Query: 665 DSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGT 724
D+ST T+N+ALNQ GVDY+GGG F+R NC + GW + PGRLTHYHEGL+ T GT
Sbjct: 329 DASTVTLNVALNQGGVDYQGGGTHFLRQNCTIKDAPPGWGTLSPGRLTHYHEGLKTTAGT 388
Query: 725 RYIMISFVD 733
RYI++SF+D
Sbjct: 389 RYILVSFID 397
>gi|16877124|gb|AAH16834.1| PLOD2 protein, partial [Homo sapiens]
Length = 210
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 115/211 (54%), Positives = 156/211 (73%), Gaps = 2/211 (0%)
Query: 524 EVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYG 583
+++++ NP+DW +YI+ +Y K + + + QPCPDVFWFPI +EK C E V+ ME YG
Sbjct: 2 DLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYG 60
Query: 584 QWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVR 643
+WS G ++D R+ GYE VPT DIHMKQV L VW F+R+++ P+ + F GY+ +
Sbjct: 61 KWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF- 119
Query: 644 APMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW
Sbjct: 120 ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGW 179
Query: 704 MLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
MHPGRLTH HEGL V GTRYI +SF+DP
Sbjct: 180 SFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 210
>gi|299930635|gb|ADJ58533.1| seminal fluid protein HACP031 [Heliconius erato]
Length = 332
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 123/287 (42%), Positives = 189/287 (65%), Gaps = 6/287 (2%)
Query: 36 VITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDEMD 95
V TVA++ G +RF++SA+V + V+ LG+ + W+GG+M GGG K+NLLK +L ++
Sbjct: 47 VFTVATHNNHGLERFLRSAKVYGINVEVLGMGKKWVGGNMDHPGGGQKINLLKQKLKSLE 106
Query: 96 ITDDM--IILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGS 153
+D IIL TDS+DV+ + +I+++F ++F AE CWPD +L KYP
Sbjct: 107 KLEDRDRIILFTDSFDVMFLANLKEIVDKFTNMFVRVLFSAESFCWPDPTLSSKYPDTSM 166
Query: 154 GYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLF 213
+LNSGGFIGY D+ +++ + ++N +DDQLYY +++LDE R KH+I LD + +F
Sbjct: 167 TNAFLNSGGFIGYYSDVMAILNYKKVRNNDDDQLYYTMVYLDEEYRLKHRIALDHDSEIF 226
Query: 214 QNLYGSLEDIKLNFD-LDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTS- 271
QNL G+L D++L + +++ ++ N N P+I+HGNG SK++LN F NYLA +W S
Sbjct: 227 QNLNGALSDVELVLNSTEDYPYIKNVVSNERPLIVHGNGPSKLKLNQFSNYLANAWSVSK 286
Query: 272 GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNY 318
GC C+ + LK D P+VL++VFI++PT FLEEFL +I ++Y
Sbjct: 287 GCKMCD--EKYTVLKDDALPNVLMAVFIEQPTPFLEEFLTQIEKVDY 331
>gi|74180451|dbj|BAE34174.1| unnamed protein product [Mus musculus]
Length = 398
Score = 245 bits (625), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 134/375 (35%), Positives = 227/375 (60%), Gaps = 5/375 (1%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW-LGGDMSSLGGGYKVNLLKN 89
ED LV+TVA+ ET+G++RF +SA+ ++++LGL + W + G ++ GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWSVDGGPAAAGGGQKVRLLKK 84
Query: 90 ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
L++ +D++IL DSYDV+ G ++L++F + +VF AE +PD L KYP
Sbjct: 85 ALEKHADKEDLVILFVDSYDVVFASGPRELLKKFQQAKSQVVFSAEEHIYPDRRLEAKYP 144
Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
V G R+L SGGFIGYA + +L++ ++ + DQL+Y +FL+ R + I LD
Sbjct: 145 TVPDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLFYTKIFLNPEKREQINISLDHR 204
Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
+FQNL G+L+++ L F++ V N Y+T PV++HGNG +K++LN GNY+ + W
Sbjct: 205 CRIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVVHGNGPTKLQLNYLGNYIPRFWT 263
Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
+GCT C+ ++ L + + P+VL+ VFI++PT FL F ++ L YP K++ +F+
Sbjct: 264 FETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFLRLLRLRYPQKQMRLFI 323
Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
+N + +H + ++ + +++VK + + + +ARN+ + + +YF VD
Sbjct: 324 HNQERHHKLQVEQFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFSVD 383
Query: 387 SDSHLDNPDVLKYLV 401
+D L P+ L+ L+
Sbjct: 384 ADVALTEPNSLRLLI 398
>gi|345309303|ref|XP_001514467.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like,
partial [Ornithorhynchus anatinus]
Length = 302
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 192/302 (63%), Gaps = 4/302 (1%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNLLKNE 90
+ LV+TVA+ ET+G++RF +SA+ +V+ LGL + W D ++ GGG K+ LLK+
Sbjct: 2 ENLLVLTVATRETEGFRRFKRSAQFFNYKVQVLGLGEDWSSEDEPTAAGGGQKIRLLKSA 61
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
L++ +D++IL TDSYDV+ G ++L++F + +VF AE L +PD L KYP
Sbjct: 62 LEKHADKEDLVILFTDSYDVVFASGPKELLKKFKQAKSRVVFSAEELIYPDRRLEAKYPT 121
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
V G R+L SG FIGYA ++ +L+++ + + DQL+Y +FLD R I LD
Sbjct: 122 VRDGKRFLGSGAFIGYAPNLSKLVADWKGLDNDSDQLFYTQVFLDPEKREAINISLDHRC 181
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
+FQNL G+L+++ L F+ + V N +Y+T PV+IHGNG +K++LN GNY+ + W
Sbjct: 182 RIFQNLNGALDEVVLKFE-NAQVRARNLEYDTLPVLIHGNGPTKLQLNYLGNYIPRVWTF 240
Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
+GCT+C+ ++ L + D P VL+ VFI++PT FL F ++ L YP K++ +F++
Sbjct: 241 ETGCTQCDEGLRSLKGFEDDALPLVLVGVFIEQPTPFLSLFFRRLQALQYPKKQLQLFIH 300
Query: 329 NN 330
N+
Sbjct: 301 NH 302
>gi|345321580|ref|XP_003430455.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like,
partial [Ornithorhynchus anatinus]
Length = 183
Score = 242 bits (617), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 106/182 (58%), Positives = 137/182 (75%), Gaps = 1/182 (0%)
Query: 553 VNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV 612
V +PCPDVFWFPI +EK C E V+ ME +GQWS G ++D R+ GYE VPT DIHM+Q+
Sbjct: 3 VVERPCPDVFWFPIFSEKACDELVEEMEHFGQWSGGKHHDSRISGGYENVPTDDIHMRQI 62
Query: 613 GLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
GL W F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHDSST+TIN
Sbjct: 63 GLENEWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDSSTFTIN 121
Query: 673 IALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
IALN VG D++GGGC+FIRYNC++ + R GW MHPGRLTH HEGL + GTRYI +SF+
Sbjct: 122 IALNSVGEDFQGGGCKFIRYNCSIESPRKGWSFMHPGRLTHLHEGLPIKNGTRYIAVSFI 181
Query: 733 DP 734
DP
Sbjct: 182 DP 183
>gi|355712271|gb|AES04294.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [Mustela
putorius furo]
Length = 213
Score = 241 bits (614), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 105/215 (48%), Positives = 152/215 (70%), Gaps = 2/215 (0%)
Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
KG+ + I + E+G L+ + N++ N +++++ NP+DW +YI+ +Y K + + +
Sbjct: 1 KGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 59
Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+ GYE VPT DIHMKQ+ L
Sbjct: 60 EQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDL 119
Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 120 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 178
Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPG 709
LN VG D++GGGC+F+RYNC++ + R GW MHPG
Sbjct: 179 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPG 213
>gi|324502308|gb|ADY41016.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase [Ascaris suum]
Length = 379
Score = 219 bits (557), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 119/368 (32%), Positives = 211/368 (57%), Gaps = 13/368 (3%)
Query: 9 CLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQ 68
+ LS +F + + N ++ V+TV D +R +SA +++Q+ L Q
Sbjct: 6 VVALSLSLFILRLDVNAATSLH-----VVTVVIEHQDALERLQRSANAHEIQLNILRHDQ 60
Query: 69 PWLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF--NTF 126
S LGGG K+ +L++ L+ D+I+L D+ II+G +IL+RF +
Sbjct: 61 L---ASSSHLGGGEKLRILRDGLEIYKDRSDLILLYVDANKAIINGREEEILKRFMDSYS 117
Query: 127 DANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQ 186
++ IVF ++ C+PD L +YP V G R+LNS FIGYA I EL++++S++N D+Q
Sbjct: 118 NSQIVFSSDNYCFPDEELTQRYPIVEKGKRFLNSAAFIGYANKIWELLNSQSLENINDEQ 177
Query: 187 LYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVI 246
++Y FLDE LR + ++VLD+ + +F ++ S ++I L+F + ++TN + T+P+I
Sbjct: 178 IFYTHRFLDERLRNRLQMVLDSTSQIFHSVDVSKDEITLDFSDNGDAYITNVIHKTHPLI 237
Query: 247 IHGNGKSKIELNSFGNYLAKSWKTS-GCTRCNL--IKHLDSLKPDQFPSVLISVFIDKPT 303
IHG+ +K+ LN GNY+ K+W GC C+ + L ++P + +++ + KP
Sbjct: 238 IHGDESNKLMLNYLGNYIGKAWSADFGCRDCSAQRVNFLKDNAEQEWPKLTLAIMLAKPI 297
Query: 304 AFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNS 363
F+EEFL K+ L YPA KI +++Y+NQ+Y+ ++++ + + V++ + +
Sbjct: 298 PFVEEFLTKVEKLEYPASKIDLYLYSNQKYNEREVNEFLRRVRGKYSWVEWDSGEVEIGE 357
Query: 364 KEARNLAV 371
+EAR A+
Sbjct: 358 REARRTAI 365
>gi|194387172|dbj|BAG59952.1| unnamed protein product [Homo sapiens]
Length = 333
Score = 214 bits (545), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 102/225 (45%), Positives = 155/225 (68%), Gaps = 2/225 (0%)
Query: 32 DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
+K LVITVA+ ET+GY RF++SAE V+TLGL + W GGD++ ++GGG KV LK E
Sbjct: 41 EKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWLKKE 100
Query: 91 LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
+++ +DMII+ DSYDVI+ G ++L++F + ++F AE CWP+ L ++YP
Sbjct: 101 MEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQYPE 160
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
VG+G R+LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD +
Sbjct: 161 VGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLDHKS 220
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKI 255
+FQNL G+L+++ L FD + V + N Y+T P+++HGNG +K+
Sbjct: 221 RIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPIVVHGNGPTKV 264
>gi|311978107|ref|YP_003987227.1| putative procollagen-lysine,2-oxoglutarate dioxygenase
[Acanthamoeba polyphaga mimivirus]
gi|81999712|sp|Q5UNV6.1|YR699_MIMIV RecName: Full=Uncharacterized protein R699
gi|55417310|gb|AAV50960.1| unknown [Acanthamoeba polyphaga mimivirus]
gi|308204997|gb|ADO18798.1| putative procollagen-lysine,2-oxoglutarate dioxygenase
[Acanthamoeba polyphaga mimivirus]
gi|339061637|gb|AEJ34941.1| hypothetical protein MIMI_R699 [Acanthamoeba polyphaga mimivirus]
gi|351737875|gb|AEQ60910.1| hypothetical protein [Acanthamoeba castellanii mamavirus]
gi|398257501|gb|EJN41109.1| hypothetical protein lvs_R606 [Acanthamoeba polyphaga
lentillevirus]
Length = 455
Score = 206 bits (525), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 153/493 (31%), Positives = 254/493 (51%), Gaps = 54/493 (10%)
Query: 30 DEDKFLV--ITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSS-LGGGYKVNL 86
++D LV I ++ ++TDG RF + + + LQ +G + W GG++ S GGG K+N
Sbjct: 6 NDDNLLVLGIGISVHKTDGVLRFEKYCQAHNLQYMIVGEGKKWNGGNLESEAGGGQKINE 65
Query: 87 LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILE--RFNTFDANIVFGAERLCWPDTSL 144
L L+ I D+ +I+V D+YD+I G +IL RF T D +VF +E CWPD SL
Sbjct: 66 LLIALES--IKDNKLIVVCDTYDLIPLSGPEEILRKYRFLTPDNKVVFSSELYCWPDASL 123
Query: 145 YDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKI 204
++YP V + Y+YLNSG F+GY DI E+I N +K+ +DDQL++++ F++ KI
Sbjct: 124 VERYPKVDTKYKYLNSGAFMGYRDDIYEMIKN-GVKDRDDDQLFFSIKFIETD-----KI 177
Query: 205 VLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSF-GNY 263
VLD LFQ +Y D+ ++ + + N N+ PV HGNG +K LN G +
Sbjct: 178 VLDYKCELFQAMYRCNSDLVVHKN-----RIFNGYTNSYPVFAHGNGPAKKLLNHMEGYF 232
Query: 264 LAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDK-PTAFLEEFLNKIANLNYPAKK 322
+ + S T +++ K D P V ++++D + L++FL K+A++ Y K
Sbjct: 233 MTEPIDGSSNT-------INTFKLDNEPKVFFALYVDSNDLSALKQFLGKVASIQYGNKV 285
Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFY 382
I ++ ++ E + L N+ T KY+ ++ FY
Sbjct: 286 IYLYDRSDNEQNRKLIQISYPNYHTGV--TKYV---------------FDDFKKSDAQFY 328
Query: 383 FYVDSDSHLDNPDVLKYL---VNRNESLIAPLL-VRPFKAWSNFWGALNADGFYARSFDY 438
F ++ + + D+L L V N +I+P++ +NFWG + DG+Y RS +Y
Sbjct: 329 FLLEQNCIITKKDILHELIMQVKDNHRVISPMIGYEQNSTRTNFWGDI-EDGYYKRSENY 387
Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIH 498
+++ +G+WNVPY+ LM SV++ ++ + D DM C +LR I
Sbjct: 388 LDL--AKHKVRGLWNVPYVYGVILMHESVVRNWDLSMV---KYNDKDMDLCFSLRKHTIF 442
Query: 499 LKIDSTQEYGHLV 511
+ + + YG++V
Sbjct: 443 MYMINNNNYGYMV 455
>gi|451927695|gb|AGF85573.1| hypothetical protein glt_00768 [Moumouvirus goulette]
Length = 449
Score = 198 bits (503), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 150/484 (30%), Positives = 244/484 (50%), Gaps = 51/484 (10%)
Query: 35 LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSS-LGGGYKVNLLKNELDE 93
L I V+ N+ DG RF + L K +G + W GGDMS+ +GGG KVN L L+E
Sbjct: 7 LGIGVSPNKNDGVLRFETYCKAFNLPYKIVGDGKIWNGGDMSAGVGGGQKVNELLRTLNE 66
Query: 94 MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKYPA 150
I ++ +++V D++D+ G +I E++ + +I+F +E CWPD SL + YP
Sbjct: 67 --INENKLLVVCDTFDLFPVSGAKEIYEKYMKLCNGNKSIIFSSEVYCWPDKSLVNVYPV 124
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
S Y+YLNSG FIGY D+++L+SN I + +DDQLYY FL I+LD
Sbjct: 125 TESKYKYLNSGSFIGYRDDLQKLVSN--ILDTDDDQLYYTKKFL-----RGENIILDYNC 177
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT 270
LFQ + G D+ ++ + + N + P+ +HGNG SK LN NY+
Sbjct: 178 QLFQAINGCKSDLIVHKN-----RVFNKYTKSYPIFLHGNGSSKTYLNHLENYIEP---- 228
Query: 271 SGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN 330
+LI + Q P + I++++D LN KKI Y+N
Sbjct: 229 -----LSLIDMPQDIITHQ-PKIFIALYVD------TSLLNNFTQFFESVKKID---YDN 273
Query: 331 QEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
+ + ++D + ++ N+ + S + + E + ++ G DFY ++ +
Sbjct: 274 KNIY--VYDKFQNDQMEQLINLLGFVYKSNITNYEFNDF-----INSGCDFYCLMEQNYI 326
Query: 391 LDNPDVLKY---LVNRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
LK L+N N ++APLL+ + +SNFWG+L+ G+Y RS DY+N++ ++
Sbjct: 327 TTRTTFLKEIIPLLNNNHRIVAPLLISKSNSCFSNFWGSLDNKGYYERSEDYLNLMTREK 386
Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
G+WNVPY++ + S+I ++K Y D DM C NLR + + + +
Sbjct: 387 --IGLWNVPYVSGLIIFDKSIILNWDLKQ-YNDYKNDRDMNLCFNLRKHTLFMYMCNLDN 443
Query: 507 YGHL 510
YG++
Sbjct: 444 YGYI 447
>gi|425701123|gb|AFX92285.1| putative procollagen-lysine 2-oxoglutarate dioxygenase [Megavirus
courdo11]
Length = 453
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 155/491 (31%), Positives = 251/491 (51%), Gaps = 58/491 (11%)
Query: 35 LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDE 93
L I V+ + DG RF + ++ L +G + W GGDMS GGG K+N L L+
Sbjct: 7 LGIGVSLKKNDGVLRFEKYCQIFDLPYTIVGDGKIWKGGDMSVGAGGGQKINELLIALET 66
Query: 94 MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKY-- 148
ITD+ +I+V D++D+ +IL +++ +VF +E CWP+ +L + Y
Sbjct: 67 --ITDNKLIIVCDTFDLFPVANKQEILNKYHQICREKERVVFSSEVYCWPEKNLANIYTQ 124
Query: 149 --PAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
P + S YRYLNSG F+G DI L++N I + +DDQLY+ +L + I+L
Sbjct: 125 IYPKIISKYRYLNSGSFMGRRNDICALLNN--ILDTDDDQLYFTKKYLQSS-----NIIL 177
Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
DT LFQ + GS +DI ++ D ++ TK T P+ IHGNG +K LN N L
Sbjct: 178 DTECQLFQAINGSTDDIGIH---DNRIYNKYTK--TFPIFIHGNGPAKTFLNYLENNLHP 232
Query: 267 SWKTSGCTRCNLIKHLDS-LKPDQFPSVLISVFIDKPT-AFLEEFLNKIANLNYPAKKIS 324
+L+ +++ L DQ+ V I+++ID + + L+ FL+ + +N K I
Sbjct: 233 K---------SLVNIMNTKLISDQYK-VFIALYIDSNSISELKTFLDSVTKINCTNKIIY 282
Query: 325 MFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFY 384
++ ++ +Y FK + + + +I S N + + D+YF
Sbjct: 283 VYDKSHSDY-----------FKQLLEMLGFIY------SSNVSNYVFIDFIKSNCDYYFL 325
Query: 385 VDSDSHLDNP---DVLKYLVNRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMN 440
++ + L N ++L +L N +++PLL+ + ++NFWGAL+ +G+Y RS DY+N
Sbjct: 326 LEQNCILTNSMTLEILIHLCQNNNRIVSPLLIGKENTNFANFWGALDKNGYYKRSDDYLN 385
Query: 441 IINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLK 500
II Q G+WNVPYI L S+I N+ Y + D DM C NLR + +
Sbjct: 386 IIR--QEKIGLWNVPYIYGVILFNKSIINDWNLSQ-YEKHKDDRDMNLCFNLRKHTLFMY 442
Query: 501 IDSTQEYGHLV 511
+ YG+++
Sbjct: 443 TCNLDCYGYII 453
>gi|371945290|gb|AEX63110.1| putative procollagen-lysine [Moumouvirus Monve]
Length = 451
Score = 193 bits (490), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 152/486 (31%), Positives = 247/486 (50%), Gaps = 53/486 (10%)
Query: 35 LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDE 93
L I V+ N+ DG RF + L K +G + W GGDMS +GGG KVN L L+E
Sbjct: 7 LGIGVSPNKNDGVLRFETYCKAFNLSYKIVGDGKIWNGGDMSVGMGGGQKVNELLQVLNE 66
Query: 94 MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKYPA 150
+ ++ +++V D++D+ GV +I E++ + +I+F +E CWPD +L + YP
Sbjct: 67 --VNENKLLIVCDTFDLFPVSGVEEIYEKYKKLCNGNKSIIFSSEVYCWPDKNLANFYPL 124
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
S Y+YLNSG F+GY D+ +L+SN I + +DDQLYY FL I+LD
Sbjct: 125 TESKYKYLNSGSFMGYRDDLHKLVSN--ILDNDDDQLYYTKKFLQ-----GENIILDQNC 177
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE--LNSFGNYLAKSW 268
LFQ + G D+ ++ + + N + P+ IHGNG SK + LN NY+
Sbjct: 178 QLFQAINGCKSDLIVHKN-----RIFNKYTKSYPIFIHGNGPSKTKKFLNRLENYIEPLL 232
Query: 269 KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
L+ +++ Q P V I++++D LN KKI Y
Sbjct: 233 ---------LVDIPKTIETPQ-PKVFIALYVD------TSLLNNFTQFFESVKKID---Y 273
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSD 388
+N+E + ++D + +N N+ + S + + E ++ ++ G D+Y ++ +
Sbjct: 274 DNKEIY--IYDKFQNNQIEQLINLLGFVYKSNITNYE-----FDDFINSGCDYYCLMEQN 326
Query: 389 SHLDNPDVLKYLV---NRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNIING 444
+ + LK ++ N + +IAPLLV + ++NFWG+L+ G+Y RS +Y++ I
Sbjct: 327 YIVTKTNFLKEIIPLCNNHHRIIAPLLVSKSNNYFTNFWGSLDKKGYYKRSKNYLSWIMR 386
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
++ G+WNVPYIT + SVI N+K Y D DM C NLR + + + +
Sbjct: 387 EK--IGLWNVPYITGVIIFDKSVILNWNLKQ-YDNYKNDRDMNLCFNLRKHTLFMYMCNL 443
Query: 505 QEYGHL 510
YG +
Sbjct: 444 DNYGFI 449
>gi|441432117|ref|YP_007354159.1| hypothetical protein Moumou_00179 [Acanthamoeba polyphaga
moumouvirus]
gi|440383197|gb|AGC01723.1| hypothetical protein Moumou_00179 [Acanthamoeba polyphaga
moumouvirus]
Length = 451
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 150/486 (30%), Positives = 247/486 (50%), Gaps = 53/486 (10%)
Query: 35 LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSS-LGGGYKVNLLKNELDE 93
L I V+ N+ DG RF + L K +G + W GGDMS+ GGG KVN L L+E
Sbjct: 7 LGIGVSPNKNDGVLRFETYCKSFNLPYKIVGDGKIWNGGDMSAGAGGGQKVNELLQVLNE 66
Query: 94 MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKYPA 150
+ ++ +++V D++D+ GV +I E++ + +I+F +E CWPD +L + YP
Sbjct: 67 --VNENKLLIVCDTFDLFPVSGVEEIYEKYKKLCNGNKSIIFSSEVYCWPDKNLANFYPL 124
Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
S Y+YLNSG F+GY D+ +L+SN I + +DDQLYY FL I+LD
Sbjct: 125 TESKYKYLNSGSFMGYRDDLHKLVSN--ILDNDDDQLYYTKKFL-----QGENIILDHNC 177
Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE--LNSFGNYLAKSW 268
LFQ + G DI ++ + + N + P+ IHGNG SK + LN NY+
Sbjct: 178 QLFQAINGCKSDIVVHKN-----RIFNKYTKSYPIFIHGNGPSKTKKFLNRLENYIEPLL 232
Query: 269 KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
L+ +++ Q P V I++++D LN KKI Y
Sbjct: 233 ---------LVDIPKTIETPQ-PKVFIALYVD------TSLLNNFTQFFESVKKID---Y 273
Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSD 388
+N+E + ++D + +N N+ + S + + E ++ ++ G D+Y ++ +
Sbjct: 274 DNKEIY--IYDKFQNNQIEQLINLLGFVYKSNITNYE-----FDDFINSGCDYYCLMEQN 326
Query: 389 SHLDNPDVLKYLV---NRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNIING 444
+ LK ++ N + +I+PLL+ + ++NFWG+L+ G+Y RS DY++++
Sbjct: 327 YVITKTTFLKEIIPLFNNHHRIISPLLMSKNNSCFTNFWGSLDDKGYYERSEDYLSLVAR 386
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
++ G+WNVPYI+ + SVI N+K Y D DM C NLR + + + +
Sbjct: 387 EK--IGLWNVPYISGVIIFDKSVILNWNLKQ-YDNYKNDRDMNLCFNLRKYTLFMYMCNL 443
Query: 505 QEYGHL 510
YG +
Sbjct: 444 DNYGFI 449
>gi|448825199|ref|YP_007418130.1| putative procollagen-lysine 2-oxoglutarate dioxygenase [Megavirus
lba]
gi|444236384|gb|AGD92154.1| putative procollagen-lysine 2-oxoglutarate dioxygenase [Megavirus
lba]
Length = 453
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 154/490 (31%), Positives = 248/490 (50%), Gaps = 56/490 (11%)
Query: 35 LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDE 93
L I V+ + DG RF + ++ L +G + W GGDMS GGG K+N L L+
Sbjct: 7 LGIGVSLKKNDGVLRFEKYCQIFDLPYIIVGDGKIWKGGDMSVGAGGGQKINELLIALET 66
Query: 94 MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKY-- 148
ITD+ +I+V D++D+ +IL +++ +VF +E CWP+ +L + Y
Sbjct: 67 --ITDNKLIIVCDTFDLFPVANKQEILNKYHQICGEKERVVFSSEVYCWPEKNLANIYTQ 124
Query: 149 --PAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
P + S YRYLNSG F+G DI L++N I + +DDQL++ +L + I+L
Sbjct: 125 IYPKIISKYRYLNSGSFMGRRNDICALLNN--ILDTDDDQLFFTKKYLQSS-----NIIL 177
Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
DT LFQ + GS +DI ++ D ++ TK T P+ IHGNG +K LN N L
Sbjct: 178 DTECQLFQAINGSTDDIGIH---DNRIYNKYTK--TFPIFIHGNGPAKTFLNYLENNLHP 232
Query: 267 SWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTA-FLEEFLNKIANLNYPAKKISM 325
+ N++ L DQ+ V I+++ID + L+ FL+ + +N K I +
Sbjct: 233 K------SLVNIVN--TKLVSDQYK-VFIALYIDSNSINELKIFLDSVTKINCTNKIIYV 283
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
+ ++ +Y FK + + + +I S N + + D+YF +
Sbjct: 284 YDKSHSDY-----------FKQLLEMLGFIY------SSNVSNYVFVDFIKSDCDYYFLL 326
Query: 386 DSDSHLDNP---DVLKYLVNRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNI 441
+ + L N ++L +L N +++PLL+ + ++NFWGAL+ +G+Y RS DY+NI
Sbjct: 327 EQNCILTNSMTLEILIHLCQNNNRIVSPLLIGKENTNFANFWGALDKNGYYKRSDDYLNI 386
Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
I Q G+WNVPYI L S+I N+ Y + D DM C NLR + +
Sbjct: 387 IR--QEKIGLWNVPYIYGVILFNKSIINDWNLSQ-YEKHKDDRDMNLCFNLRKHTLFMYT 443
Query: 502 DSTQEYGHLV 511
+ YG+++
Sbjct: 444 CNLDCYGYII 453
>gi|371943512|gb|AEX61341.1| putative procollagen-lysine [Megavirus courdo7]
Length = 453
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 154/490 (31%), Positives = 248/490 (50%), Gaps = 56/490 (11%)
Query: 35 LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDE 93
L I V+ + DG RF + ++ L +G + W GGDMS GGG K+N L L+
Sbjct: 7 LGIGVSLKKNDGVLRFEKYCQIFDLPYIIVGDGKIWKGGDMSVGAGGGQKINELLIALET 66
Query: 94 MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKY-- 148
ITD+ +I+V D++D+ +IL +++ +VF +E CWP+ +L + Y
Sbjct: 67 --ITDNKLIIVCDTFDLFPVANKQEILNKYHQICGEKERVVFSSEVYCWPEKNLANIYTQ 124
Query: 149 --PAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
P + S YRYLNSG F+G DI L++N I + +DDQL++ +L + I+L
Sbjct: 125 IYPKIISKYRYLNSGSFMGRRNDICALLNN--ILDTDDDQLFFTKKYLQSS-----NIIL 177
Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
DT LFQ + GS +DI ++ D ++ TK T P+ IHGNG +K LN N L
Sbjct: 178 DTECQLFQAINGSTDDIGIH---DNRIYNKYTK--TFPIFIHGNGPAKTFLNYLENNLHP 232
Query: 267 SWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTA-FLEEFLNKIANLNYPAKKISM 325
+ N++ L DQ+ V I+++ID + L+ FL+ + +N K I +
Sbjct: 233 K------SLVNIVN--TKLVSDQYK-VFIALYIDSNSINELKIFLDSVTKINCTNKIIYV 283
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
+ ++ +Y FK + + + +I S N + + D+YF +
Sbjct: 284 YDKSHSDY-----------FKQLLEMLGFIY------SSNVSNYVFVDFIKSDCDYYFLL 326
Query: 386 DSDSHLDNP---DVLKYLVNRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNI 441
+ + L N ++L +L N +++PLL+ + ++NFWGAL+ +G+Y RS DY+NI
Sbjct: 327 EQNCILTNSMTLEILIHLCQNNNRIVSPLLIGKENTNFANFWGALDKNGYYKRSDDYLNI 386
Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
I Q G+WNVPYI L S+I N+ Y + D DM C NLR + +
Sbjct: 387 IR--QEKIGLWNVPYIYGVILFNKSIINDWNLSQ-YEKHKDDRDMNLCFNLRKHTLFMYT 443
Query: 502 DSTQEYGHLV 511
+ YG+++
Sbjct: 444 CNLDCYGYII 453
>gi|363540743|ref|YP_004894301.1| mg250 gene product [Megavirus chiliensis]
gi|350611908|gb|AEQ33352.1| putative procollagen-lysine 2-oxoglutarate dioxygenase [Megavirus
chiliensis]
Length = 453
Score = 189 bits (481), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 154/490 (31%), Positives = 248/490 (50%), Gaps = 56/490 (11%)
Query: 35 LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDE 93
L I V+ + DG RF + ++ L +G + W GGDMS GGG K+N L L+
Sbjct: 7 LGIGVSLKKNDGVLRFEKYCQIFDLPYIIVGDGKIWKGGDMSVGAGGGQKINELLIALET 66
Query: 94 MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKY-- 148
ITD+ +I+V D++D+ +IL +++ +VF +E CWP+ +L + Y
Sbjct: 67 --ITDNKLIIVCDTFDLFPVANKQEILNKYHQICGEKERVVFSSEVYCWPEKNLANIYTQ 124
Query: 149 --PAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
P + S YRYLNSG F+G DI L++N I + +DDQL++ +L + I+L
Sbjct: 125 IYPKIISKYRYLNSGSFMGRRNDICALLNN--ILDTDDDQLFFTKKYLQSS-----NIIL 177
Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
DT LFQ + GS +DI ++ D ++ TK T P+ IHGNG +K LN N L
Sbjct: 178 DTECQLFQAINGSTDDIGIH---DNRIYNKYTK--TFPIFIHGNGPAKTFLNYLENNLHP 232
Query: 267 SWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTA-FLEEFLNKIANLNYPAKKISM 325
+ N++ L DQ+ V I+++ID + L+ FL+ + +N K I +
Sbjct: 233 K------SLVNIVN--TKLVSDQYK-VFIALYIDSNSINELKIFLDSVTKINCTNKIIYV 283
Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
+ ++ +Y FK + + + +I S N + + D+YF +
Sbjct: 284 YDKSHSDY-----------FKQLLEMLGFIY------SSNVSNYVFVDFIKSDCDYYFLL 326
Query: 386 DSDSHLDNP---DVLKYLVNRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNI 441
+ + L N ++L +L N +++PLL+ + ++NFWGAL+ +G+Y RS DY+NI
Sbjct: 327 EQNCILTNSMTLEILIHLCQNNNRIVSPLLIGKENTNFANFWGALDKNGYYKRSDDYLNI 386
Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
I Q G+WNVPYI L S+I N+ Y + D DM C NLR + +
Sbjct: 387 IR--QEKIGLWNVPYIYGVILFNKSIINDWNLSQ-YEKHKDDRDMNLCFNLRKYTLFMYT 443
Query: 502 DSTQEYGHLV 511
+ YG+++
Sbjct: 444 CNLDCYGYII 453
>gi|167535270|ref|XP_001749309.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772175|gb|EDQ85830.1| predicted protein [Monosiga brevicollis MX1]
Length = 623
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 149/593 (25%), Positives = 255/593 (43%), Gaps = 102/593 (17%)
Query: 109 DVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS--------LYDKYPAVGSG-YRYLN 159
+ ++ G + ++ F A I+ A LC + L D +P V G RY +
Sbjct: 38 EALLLGEMVELQRNFQQQPARILMAATHLCRSACAFAGATSWRLTDDWPDVARGSARYGD 97
Query: 160 SGGFIGYAKDIKELISNRSIKNEEDDQLYYAL------LFLDETLRTKHKIVLDTLANLF 213
+ + Y+ D++ L+ +R N+ A L+LD+ R + + +D +
Sbjct: 98 ASALVAYSADMQALL-DRIAPNQPTSAFRVAASKQIISLYLDDASRAQLGLDVDASSAFV 156
Query: 214 QNLYG---SLEDIKLNFDLDEF----VHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
Q+L G S + L FD L NT P ++ G K+ L++ NY+
Sbjct: 157 QHLRGLGDSFDTRYLRFDFHHRGTNDTRLINTVTRQLPWLVTAGGNGKL-LDAISNYVPM 215
Query: 267 SWKTS-GCTRC--NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKI 323
W GC C + +H P+ V++++ ++ + FL L ++A + +++
Sbjct: 216 KWHQDLGCLHCVNDATQH-----PEH--KVVMALVVELRSPFLRAVLERLAQQSLSPQQM 268
Query: 324 SMFVYNNQE----YHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEAR-NLAVENSLHKG 378
++ V + + L ++ FK F +++ +A +EA A S +G
Sbjct: 269 ALIVGIEEGDMSVTYTSLVQNFTEEFKDSFASIQIVAGLKGRALREALFQGAAAVSGFQG 328
Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGF------- 431
F + S ++L NP+ +L+N N+S++AP+L R K +SNFWGA++ D
Sbjct: 329 AS-TFLISSLTYLTNPNTTAHLLNENQSVLAPVLPRHQKLYSNFWGAIDGDARSHCHDFH 387
Query: 432 ----------------------------------------YARSFDYMNIINGDQGGKGI 451
Y RS+DY +I + G
Sbjct: 388 ATCPAWQLAGECETNEVWMSNNCAKACQACQVPGDVQGVRYKRSWDYRDIATREVQG--- 444
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQE 506
L+K + A + + +D+D+ L +K+D+ +
Sbjct: 445 ------VCALLLKPTAALALQQQLSTSPEHENYLPVDWDLKLTEWLHAAKFEVKVDNQES 498
Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
+G L+D NFD +KT+P+++ + NP W YIHP+YQ D V + C D++ FP+
Sbjct: 499 FGTLIDPTNFDSRKTHPDMFLVEANPEPWADIYIHPDYQPYKKLDFVQGR-CWDIYNFPL 557
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
+E+FC E +Q E WS G N DKRL+ GYE VPTRDIH Q+ W+
Sbjct: 558 FSEQFCGEMIQWAETMNLWSGGDNKDKRLKGGYEPVPTRDIHFNQMDFQSAWS 610
>gi|194391238|dbj|BAG60737.1| unnamed protein product [Homo sapiens]
Length = 243
Score = 172 bits (435), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 79/181 (43%), Positives = 120/181 (66%), Gaps = 2/181 (1%)
Query: 98 DDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRY 157
+DMII+ DSYDVI+ G ++L++F + ++F AE CWP+ L ++YP VG+G R+
Sbjct: 8 EDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQYPEVGTGKRF 67
Query: 158 LNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLY 217
LNSGGFIG+A I +++ K+++DDQL+Y L+LD LR K + LD + +FQNL
Sbjct: 68 LNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLDHKSRIFQNLN 127
Query: 218 GSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT-SGCTRC 276
G+L+++ L FD + V + N Y+T P+++HGNG +K++LN GNY+ W GC C
Sbjct: 128 GALDEVVLKFDRNR-VRIRNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNGWTPEGGCGFC 186
Query: 277 N 277
N
Sbjct: 187 N 187
>gi|209736298|gb|ACI69018.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Salmo
salar]
Length = 317
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 86/204 (42%), Positives = 126/204 (61%), Gaps = 9/204 (4%)
Query: 8 NCLILSCVVFFISVHCN---KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTL 64
+C+ + CV+ + + + + I D LVITVA+ +TDG+ S+ + VK L
Sbjct: 4 SCIAMVCVLLLGWMQSSLGAEQRVISPDNLLVITVATEDTDGF-----SSSSSNYTVKVL 58
Query: 65 GLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
GL + W GGD++ ++GGG KV LK EL + D++IL DSYDVI+ G ++L +F
Sbjct: 59 GLGEQWKGGDVARTVGGGQKVRWLKTELLKHSDKKDLVILFVDSYDVILASGPEELLWKF 118
Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
+ +VF AE CWPD L KYPAV +G RYLNSGGFIGYA ++ E++ K+ +
Sbjct: 119 SRLGHRMVFSAEGFCWPDQKLAPKYPAVHTGKRYLNSGGFIGYAPELSEIVQQWKHKDND 178
Query: 184 DDQLYYALLFLDETLRTKHKIVLD 207
DDQL+Y ++LD+ RTK+ + LD
Sbjct: 179 DDQLFYTKIYLDKVQRTKYNMTLD 202
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/97 (49%), Positives = 71/97 (73%), Gaps = 2/97 (2%)
Query: 374 SLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYA 433
+L ++YF +D+D + NPDVL+ L+ N+S+IAP+L R K WSNFWGAL+ +GFY+
Sbjct: 200 TLDHQCEYYFSIDADVVIVNPDVLRVLIEENKSVIAPMLSRHGKLWSNFWGALSPEGFYS 259
Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA 470
RS DY++I+ G + G+WNVPYIT Y++K SV++
Sbjct: 260 RSEDYIDIVQGKR--IGLWNVPYITQVYMIKGSVLRG 294
>gi|410931371|ref|XP_003979069.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like,
partial [Takifugu rubripes]
Length = 195
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 81/191 (42%), Positives = 123/191 (64%), Gaps = 3/191 (1%)
Query: 10 LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
L +S FI C + + I E+K LV+TVA+ +TDG++RF++SA+ VK +G +
Sbjct: 7 LWISVCALFILTSCEE-QRIPEEKLLVVTVATKDTDGFRRFLRSAKHFNYTVKVVGRDEK 65
Query: 70 WLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
W+GG+ M + GGG KV LLK+ L+EM D IIL TDSYDV+ G ++L++F
Sbjct: 66 WIGGNYMGAPGGGQKVRLLKSALEEMK-NQDKIILFTDSYDVVFASGPXELLKKFQQARH 124
Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
+VF +E L WPD L DKYP V G R+L SGGFIGY +++E+++ S ++++ DQL+
Sbjct: 125 KVVFSSESLIWPDRHLEDKYPHVREGNRFLGSGGFIGYLANVREMVAEWSGEDDDSDQLF 184
Query: 189 YALLFLDETLR 199
+ +++D R
Sbjct: 185 FTRIYIDAAKR 195
>gi|147744648|gb|ABQ51191.1| procollagen-lysine 2-oxoglutarate 5-dioxygenase 3, partial [Capra
hircus]
Length = 216
Score = 154 bits (388), Expect = 2e-34, Method: Composition-based stats.
Identities = 77/218 (35%), Positives = 128/218 (58%), Gaps = 7/218 (3%)
Query: 272 GCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN 330
GC CN + L +P P VL++VF+++PT FL FL ++ L+YP ++++F++NN
Sbjct: 3 GCGFCNQDRRPLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTLFLHNN 60
Query: 331 QEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK-GVDFYFYVDSDS 389
+ YH P DD + F VK + + EAR++A++ +FYF +D+D+
Sbjct: 61 EVYHEPHIDDSWPQLQDHFSAVKLVGPEEALTPGEARDMAMDICRQDPKCEFYFSLDADT 120
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
+ NP L+ L+ N +IAP+L R K WSNFWGAL+ D +YARS DY+ ++ +
Sbjct: 121 VITNPQTLRILIEANRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQRKR--V 178
Query: 450 GIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDM 486
G+WNVPYI+ Y+++ ++ + +++ + D DM
Sbjct: 179 GVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDM 216
>gi|364023677|gb|AEW46913.1| seminal fluid protein CSSFP065 [Chilo suppressalis]
Length = 178
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 76/167 (45%), Positives = 111/167 (66%), Gaps = 4/167 (2%)
Query: 63 TLGLHQPWLGGDMSSLGGGYKVNLLKNELDEMDITDD---MIILVTDSYDVIIDGGVNDI 119
L + W GGDM GGG K+N+LK+EL ++ +DD IIL TDSYD++ + DI
Sbjct: 1 VLAKGKEWTGGDMKYAGGGQKINILKDELSKLMKSDDNKDRIILFTDSYDIMFLSTLEDI 60
Query: 120 LERFNTF-DANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRS 178
L++F +F D ++F AE+ CWPD+ L YP YLNSG FIGY ++ E+++++
Sbjct: 61 LKKFKSFKDTRVLFSAEQFCWPDSKLAGHYPKTEVANPYLNSGAFIGYLPELLEILNHKP 120
Query: 179 IKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKL 225
IK+++DDQLYY ++LD+ LR KI LD + +FQNLYG+L D++L
Sbjct: 121 IKDQDDDQLYYTKIYLDKELRHNLKISLDHDSKIFQNLYGALSDVQL 167
>gi|55250037|gb|AAH85460.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [Danio rerio]
Length = 165
Score = 147 bits (372), Expect = 2e-32, Method: Composition-based stats.
Identities = 70/154 (45%), Positives = 103/154 (66%), Gaps = 3/154 (1%)
Query: 10 LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
++++CV + + NK +I +K LV+TVA+ ETDG+ RF+QSA VK LG+ +
Sbjct: 13 MLVTCVHCTLGMETNK--DIPTEKLLVLTVATQETDGFLRFMQSANYFNFNVKVLGMGEE 70
Query: 70 WLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
W GGD+ S+GGG KV LLK ++ +D +D+++L DSYD+I GG +IL +F +
Sbjct: 71 WKGGDVGRSIGGGQKVRLLKEAMESLDQQEDLVVLFVDSYDLIFAGGAEEILRKFQQSNH 130
Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGG 162
+VF AE + WPD+ L +KYP+V SG R+LNSGG
Sbjct: 131 KVVFAAEGIIWPDSQLAEKYPSVRSGKRFLNSGG 164
>gi|76162576|gb|AAX30505.2| SJCHGC04226 protein [Schistosoma japonicum]
Length = 179
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 72/169 (42%), Positives = 107/169 (63%), Gaps = 2/169 (1%)
Query: 34 FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELD 92
LV+TVA+ + D RF++S +N +VK LG W GG+++ S GGG KVN+LK+EL
Sbjct: 7 ILVLTVATEKNDALDRFLRSCSLNGFEVKVLGEGSYWKGGNVAKSTGGGQKVNILKDELA 66
Query: 93 EMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG 152
+ D ++L DSYDV+ V ++L+ + F++ ++F AE CWP SL YP V
Sbjct: 67 KSTYRPDQLVLFVDSYDVVFMQNVANLLKGYERFESKVIFSAEEFCWPQPSLKSLYPEVK 126
Query: 153 SG-YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRT 200
G RYLNSGGFIG ++ +++++ I +++DDQLYY +FLD LR
Sbjct: 127 PGERRYLNSGGFIGPVANLIKIVNHTPINDDDDDQLYYTNIFLDSKLRV 175
>gi|74199791|dbj|BAE20730.1| unnamed protein product [Mus musculus]
Length = 218
Score = 123 bits (308), Expect = 4e-25, Method: Composition-based stats.
Identities = 65/159 (40%), Positives = 102/159 (64%), Gaps = 1/159 (0%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW-LGGDMSSLGGGYKVNLLKN 89
ED LV+TVA+ ET+G++RF +SA+ ++++LGL + W + G ++ GGG KV LLK
Sbjct: 25 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWSVDGGPAAAGGGQKVRLLKK 84
Query: 90 ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
L++ +D++IL DSYDV+ G ++L++F + +VF AE +PD L KYP
Sbjct: 85 ALEKHADKEDLVILFVDSYDVVFASGPRELLKKFQQAKSQVVFSAEEHIYPDRRLEAKYP 144
Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
V G R+L SGGFIGYA + +L++ ++ + DQL+
Sbjct: 145 TVPDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLF 183
>gi|313240888|emb|CBY33174.1| unnamed protein product [Oikopleura dioica]
Length = 136
Score = 122 bits (306), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 61/137 (44%), Positives = 82/137 (59%), Gaps = 14/137 (10%)
Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAP-MSFVVRYRPDEQPSLRPHHDSS 667
M Q+GL W ++ Y P+ + + GY+ P P + FVVRY+P EQ LRPHHDSS
Sbjct: 1 MNQIGLQDEWLYVVKTYAAPMVSKFYTGYN--PDNKPNLMFVVRYKPGEQDRLRPHHDSS 58
Query: 668 TYTINIALNQVGVDYEGGGCRFIRYNCNVTAT-----------RMGWMLMHPGRLTHYHE 716
T+T IALN+ +D+EGGG F RY C+V + + G PGRLTH H
Sbjct: 59 TWTFQIALNRPNIDFEGGGTYFTRYKCSVVGSATEQDSRSLEVKQGMGFAFPGRLTHQHA 118
Query: 717 GLQVTQGTRYIMISFVD 733
GL T+GTRYI+++F+D
Sbjct: 119 GLPTTKGTRYILVNFMD 135
>gi|324538590|gb|ADY49540.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase, partial [Ascaris
suum]
Length = 144
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 92/142 (64%), Gaps = 3/142 (2%)
Query: 164 IGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDI 223
+GYA +I ++I+ + +++DDQLYY ++LDE LR K+ LD+++ +FQNL G EDI
Sbjct: 1 MGYATEIWQIINAYPVADKDDDQLYYTNVYLDEKLRNSLKMTLDSMSYIFQNLNGVREDI 60
Query: 224 KLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKH- 281
L FD + + N YNT+P+IIHGNG SK+ LN NY+ K+W GC C +
Sbjct: 61 ALEFDDNGDAQVANIPYNTHPLIIHGNGPSKLFLNHLANYIGKAWSAQRGCLFCETSNYV 120
Query: 282 -LDSLKPDQFPSVLISVFIDKP 302
L+ + +++PS+ +++FI KP
Sbjct: 121 NLEDIPEERWPSLTLAIFIAKP 142
>gi|89892066|gb|ABD78864.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Bubalus bubalis]
Length = 93
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 48/92 (52%), Positives = 70/92 (76%), Gaps = 1/92 (1%)
Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
+GL VW F+R+++ P+ + F GY+ + A ++FVV+Y P+ Q SLRPHHD+ST+TI
Sbjct: 1 IGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTI 59
Query: 672 NIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
NIALN VG D++GGGC+F+RYNC++ + R GW
Sbjct: 60 NIALNNVGEDFQGGGCKFLRYNCSIESPRKGW 91
>gi|47212320|emb|CAF91258.1| unnamed protein product [Tetraodon nigroviridis]
Length = 52
Score = 99.0 bits (245), Expect = 8e-18, Method: Composition-based stats.
Identities = 39/52 (75%), Positives = 43/52 (82%)
Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+GGGCRF+RYNC+V A R GW LMHPGRLTHYHEGL T G RYI +SFVDP
Sbjct: 1 QGGGCRFLRYNCSVNAPRKGWALMHPGRLTHYHEGLPTTAGVRYIAVSFVDP 52
>gi|291225618|ref|XP_002732798.1| PREDICTED: Dynein intermediate chain 2, ciliary-like [Saccoglossus
kowalevskii]
Length = 858
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/87 (51%), Positives = 62/87 (71%), Gaps = 4/87 (4%)
Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS 466
+IAPL+ RP K WSN WGAL+ DGFYARS DY++I+ G++ KG+WN+P+ITN YL++
Sbjct: 5 IIAPLVSRPGKLWSNCWGALSDDGFYARSDDYVDIVKGNR--KGVWNMPHITNLYLVQGD 62
Query: 467 VIKATNIKTIYTLNSMDYDMAFCTNLR 493
V K + IY+ +D DMA +LR
Sbjct: 63 VFKKHKVSFIYS--DLDADMALTRHLR 87
>gi|397584720|gb|EJK53060.1| hypothetical protein THAOC_27572, partial [Thalassiosira oceanica]
Length = 573
Score = 92.0 bits (227), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 85/319 (26%), Positives = 149/319 (46%), Gaps = 37/319 (11%)
Query: 24 NKVKNIDEDKFLVITVASNETD--GYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGG 81
++++ +D D L++ A+ + + GY+ +SA + + + W G
Sbjct: 267 SELEKLDGDADLIVLTAATDPEHFGYQSLKRSATYFGHSLLNVLRGKKW---------EG 317
Query: 82 YKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDAN------------ 129
Y L+ + + +IL D YD ++ G +DIL +N
Sbjct: 318 YNTKLIWTRKVLESVDSNQLILFVDGYDTMLQSGPDDILRSYNEMVEKFRSKWNCEDCVE 377
Query: 130 -IVFGAERLCWPDTSLYDKYPAVGSGYR----YLNSGGFIGYAKDIKELISNRSI-KNEE 183
+ FGAE LCWP ++ ++Y S Y YLNSG +IG A I+ ++ + K ++
Sbjct: 378 PVFFGAEHLCWPSKTVCEQYVNGTSEYSADNPYLNSGTYIGRAGSIRAILEDVDPEKPDD 437
Query: 184 DDQLYYALLFLDETLR-TKHKIVLDTLANLFQNLYGSLEDIKLN-FDLDEFVHLTNTKYN 241
DDQLYY+L + + T IVLD+ LF L G D ++ LD +++ +N K
Sbjct: 438 DDQLYYSLKLVAFVEKGTGVPIVLDSDQRLFYALLGRSSDWTISEKSLDYWLYHSNNKDT 497
Query: 242 -TNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFID 300
T P ++HG G +K L NYL ++ T+ ++ K +D+ D F +++ +F
Sbjct: 498 PTLPAVLHGQGPAKHTLIGITNYLPGAYSDFYGTKQHIHKVVDA---DSFHPLVVGLFYT 554
Query: 301 KPTAFLEE--FLNKIANLN 317
+ T+ E F++ I L+
Sbjct: 555 ELTSDKHERDFISGIKALD 573
>gi|156390789|ref|XP_001635452.1| predicted protein [Nematostella vectensis]
gi|156222546|gb|EDO43389.1| predicted protein [Nematostella vectensis]
Length = 589
Score = 90.1 bits (222), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 138/300 (46%), Gaps = 39/300 (13%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFK 346
++P+VL+SV L +L I NL+YP +IS+++ + N++ L ++ +N K
Sbjct: 34 KYPTVLLSVIARNAAHLLPNWLGCIENLDYPKDRISIWITSDHNEDNTTELLKEWANNAK 93
Query: 347 TMFKNVKYIAHNSTVNSKEA-----------------RNLAVENSLHKGVDFYFYVDSDS 389
++ V S N + R LA++ + + D+ F VD D+
Sbjct: 94 HLYHRVTMNFTGSPSNYGDVLEASDWTDERYAHVAYLRQLALDTARYWWADYLFVVDCDN 153
Query: 390 HLDNPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
L NP L+ L++ +++++P+L A+SNFWG ++ G+Y R+ Y I+N ++
Sbjct: 154 FLFNPITLRQLMHEEKTVVSPMLEVFGNKSAYSNFWGGMDESGYYKRTDQYFTILNREK- 212
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + YL+ ++ ++ Y DY + F + R G+ L I
Sbjct: 213 -VGTFEVPMVHSTYLVDLRRRASSELR--YYPPHPDYRGHHDDILVFAHSARMAGVKLHI 269
Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI--HPEYQKSLLPDTVNNQPCP 559
+ YGHL+ P + + ++ LD L Y HPE+ L P + P P
Sbjct: 270 INKHIYGHLI-----LPFEARESLEDMRIQFLDGKLGYYVDHPEHLMPLSPH-LTVPPVP 323
>gi|242027195|ref|XP_002433321.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase, putative
[Pediculus humanus corporis]
gi|212519132|gb|EEB20583.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase, putative
[Pediculus humanus corporis]
Length = 144
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 78/116 (67%), Gaps = 4/116 (3%)
Query: 164 IGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDI 223
IGYA ++ E++++RSI +++DDQL+Y +L+ETLR KI LD + +F NL+G+++++
Sbjct: 2 IGYAPELYEILTHRSIDDDDDDQLFYTQAYLNETLRNNLKIKLDHKSQIFHNLHGAMDEL 61
Query: 224 KLNFDLDEFVHLTNTKYNTNPVIIHGNGKS--KIELNSFGNYLAKSWKTS-GCTRC 276
L F E +L N + ++P+I+HGNG + K+ LN+ GNYL W T GC C
Sbjct: 62 SLKFKNHE-PYLENEQMKSHPLILHGNGPTVVKVGLNNLGNYLPNCWNTRDGCVSC 116
>gi|118094236|ref|XP_422290.2| PREDICTED: procollagen galactosyltransferase 2 [Gallus gallus]
Length = 627
Score = 85.9 bits (211), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 113/245 (46%), Gaps = 30/245 (12%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P+VL+++ + L FL + L YP +I+++V +N + + +++ N + +
Sbjct: 54 PTVLLAIIARNAASALPHFLGCVERLRYPKSRIALWVATDHNADNTTAILREWLKNVQNL 113
Query: 349 FKNVKYIAHNSTVNSKEA-----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
+ +V++ + E R A+ + K D+ ++D+D+ L
Sbjct: 114 YHDVEWRPMEDPQSYPEEMGPKHWPSSRFTHVMKLRQAALRAAREKWSDYVLFLDTDNLL 173
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
NP+ L L+ N++L+AP+L F +SNFW + G+Y R+ DY I + G
Sbjct: 174 TNPETLNLLIAENKTLVAPMLESRF-LYSNFWCGITPQGYYKRTLDYPLI--REWKRTGC 230
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDSTQ 505
+ VP I + +L+ + K + K ++ DY M F + R GI + I + +
Sbjct: 231 FAVPMIHSTFLI--DLRKEASTKLMFYPPHQDYTWSFDDIMVFAFSSRQAGIQMFICNRE 288
Query: 506 EYGHL 510
YG L
Sbjct: 289 HYGFL 293
>gi|410902625|ref|XP_003964794.1| PREDICTED: procollagen galactosyltransferase 1-like [Takifugu
rubripes]
Length = 611
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 67/287 (23%), Positives = 137/287 (47%), Gaps = 33/287 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P VL+++ L FL I LNYP ++++++V +NQ+ + D++ ++
Sbjct: 41 PRVLLALICRNSEHSLPYFLGTIERLNYPKERMALWVATDHNQDNTTVILHDWLVKMQSF 100
Query: 349 FKNVKYIAHNSTVNSKEA-----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
+ NV++ ++ ++ R +A+E++ D++ D D+ L
Sbjct: 101 YHNVEWRPKEKPIHYEDEAGPKDWTDLRYEHVMKLRQVALESAREMWADYFMLADCDNLL 160
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
NPDVL L+ N+++I+P+L A+SNFW +++ G+Y R+ Y+ I Q KG
Sbjct: 161 TNPDVLWMLMKENKTIISPML-ESRAAYSNFWCGMSSQGYYKRTPAYIPI--RKQVRKGC 217
Query: 452 WNVPYITNCYL--MKTSVIKATNIKTIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQEY 507
+ VP + + L ++ + + + S +D + F + + + + + + + Y
Sbjct: 218 FAVPMVHSTLLIDLRKEASRQLSFHPPHPEYSWAFDDIIVFAFSAQMADVQMFVCNKETY 277
Query: 508 GHL---VDSENFDPQKTNPEVYELI----RNPLDWDLRYIHPEYQKS 547
G+L + S N + + ++ L+ RNPL +YIH +K+
Sbjct: 278 GYLPVPLRSHNTLQDEADSFLHCLLEASARNPLVMPSKYIHVPRKKT 324
>gi|198415096|ref|XP_002129882.1| PREDICTED: similar to GLT25D1 protein [Ciona intestinalis]
Length = 594
Score = 82.4 bits (202), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 60/243 (24%), Positives = 118/243 (48%), Gaps = 30/243 (12%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQ------------------E 332
P+V + +F+ L FL + +LNYP K++S+++ + E
Sbjct: 55 PTVFVPIFVRNKAHALPYFLKCLYDLNYPKKRLSLWIVTDHNSDNSSQILEKWTNTVKHE 114
Query: 333 YHAPLFD--DYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
YH +F+ D +K N+ + + + R A+E + DF Y+D+D+
Sbjct: 115 YHDLVFEKPDTEWFYKEQKGNLHW-PEERHIKMLQLRQQALEKARKMWSDFILYLDADNM 173
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L NP L++L++R+ +++AP+L +++NFW + +G+Y R+ +Y I N + G
Sbjct: 174 LINPHTLQHLISRDLTIVAPMLTT-IASYANFWADQDENGYYKRADNYFEIRNRETV--G 230
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLNS-----MDYDMAFCTNLRNKGIHLKIDSTQ 505
++ VP + + +L+ K+ ++ + L+ +D + F + + I L ID+T
Sbjct: 231 VFEVPMVHSTFLVNLVARKSRKLR-FWPLHEDYYLLVDDIIVFSIHAKLADIPLYIDNTH 289
Query: 506 EYG 508
YG
Sbjct: 290 IYG 292
>gi|373955086|ref|ZP_09615046.1| hypothetical protein Mucpa_3485 [Mucilaginibacter paludis DSM
18603]
gi|373891686|gb|EHQ27583.1| hypothetical protein Mucpa_3485 [Mucilaginibacter paludis DSM
18603]
Length = 260
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/197 (34%), Positives = 94/197 (47%), Gaps = 22/197 (11%)
Query: 36 VITVASN-ETDGYKRFIQ-SAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDE 93
VITVAS+ + Y F++ S E L TL + + K LL L +
Sbjct: 3 VITVASDLKNTSYLSFLKASCEFYHLDATTLYYSDVYFSNRI-------KDALLNTHLTQ 55
Query: 94 MDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGS 153
DD IIL TD+ D + +I+++FN F+ ++F AE CWPD S+ YPA
Sbjct: 56 F--ADDEIILFTDAIDAVFVAEQKEIIDKFNHFNCPLLFSAEVNCWPDKSMEKNYPAPSV 113
Query: 154 GYRYLNSGGFIGYAKDIKELISNRSI----KNEE---DDQLYYALLFLDETLRTKHKIVL 206
+RYLNSG FIG A +K L I KN +Q Y+ L+F +E+ I L
Sbjct: 114 HFRYLNSGAFIGRAGYLKYLYEKYPIFEIGKNPAYFWSNQYYWNLVFQNESAN----IQL 169
Query: 207 DTLANLFQNLYGSLEDI 223
D LF N ++ +I
Sbjct: 170 DHSGELFFNTSITISNI 186
>gi|340378483|ref|XP_003387757.1| PREDICTED: procollagen galactosyltransferase 1-like [Amphimedon
queenslandica]
Length = 594
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/311 (24%), Positives = 138/311 (44%), Gaps = 45/311 (14%)
Query: 284 SLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYH--APLFDDY 341
SL+ + P V +++ L ++L I LNYP KI + +Y Q L ++
Sbjct: 32 SLQQESRPLVYLAILSRNAAHLLPQYLGYIEGLNYPKDKIIIGLYIGQSVDNTTNLLLEW 91
Query: 342 IHNFKTMFKNVKYIAHN-----------STVNSK-----EARNLAVENSLHKGVDFYFYV 385
N ++++ NV S +S+ + R + N+ ++ F+V
Sbjct: 92 SENVRSIYNNVLIYEDGDIFPLGDSELFSWSDSRLEYMCKLRQDVLSNARMARAEYLFFV 151
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNIING 444
D D+ L NPDVL L+ + ++APLL+ +A+SNFWG +G+Y R+ +Y+ I+
Sbjct: 152 DCDNFLINPDVLIRLIEAKKPIVAPLLIYDKERAFSNFWGGQKENGYYLRTEEYLPIVT- 210
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHL 499
+ G + VP + + L+ + + ++ T Y N +D + + R GI +
Sbjct: 211 -RSNLGCFKVPLVHSTLLIDLRTVSSESLAYWPPPTEYKWN-IDDIILLSYSARVNGIGM 268
Query: 500 KIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCP 559
I +T +G+L+ + + + IR +W L+ I VN+ P P
Sbjct: 269 YILNTDVFGYLLKTGEY------ASLEHAIRETDNWKLKTI------------VNHYPVP 310
Query: 560 DVFWFPIVTEK 570
+ I +EK
Sbjct: 311 VSQFISIHSEK 321
>gi|402593476|gb|EJW87403.1| hypothetical protein WUBG_01688, partial [Wuchereria bancrofti]
Length = 69
Score = 80.1 bits (196), Expect = 4e-12, Method: Composition-based stats.
Identities = 32/58 (55%), Positives = 43/58 (74%)
Query: 677 QVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
+ G DYEGGG R+ RYNC V+A ++G+ M P +LTH HEG +T GTRYI +SF++P
Sbjct: 12 ESGRDYEGGGIRYARYNCTVSADQIGYAAMFPAQLTHMHEGFPITSGTRYIAVSFLNP 69
>gi|3043692|dbj|BAA25510.1| KIAA0584 protein [Homo sapiens]
Length = 738
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 114/249 (45%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL++V L FL + L+YP +++++ +N + +F +++ N
Sbjct: 161 PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEIFREWLKN 220
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I S+ A R A+ + K D+ ++D
Sbjct: 221 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 280
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L L+ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 281 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKR- 338
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 339 -TGCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 395
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 396 CNREHYGYL 404
>gi|427798775|gb|JAA64839.1| Putative procollagen-lysine 2-oxoglutarate 5-dioxygenase, partial
[Rhipicephalus pulchellus]
Length = 344
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 115/255 (45%), Gaps = 34/255 (13%)
Query: 283 DSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDD 340
D L+P P+VLI+V + L F + +YP +IS+++Y +N + A + D
Sbjct: 31 DKLEP---PTVLIAVILRNKAHVLPHFFGYLEQQSYPKSRISLWIYTDHNVDQTAEMVDT 87
Query: 341 YIHNFKTMFKNVKYIAHNSTV-----------------NSKEARNLAVENSLHKGVDFYF 383
+ + NV + + + R A++ + DF F
Sbjct: 88 WAEAVSNEYHNVNVTSEDGEAFFPDEEGSQKWTAQRYWHVIRLREEAIQVARTLWADFIF 147
Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
++D D+ L NP ++ LV N ++IAP+L A+SNFW +N G+Y R+ +YM I+
Sbjct: 148 FLDGDAMLSNPKTIQDLVEENRTIIAPML-DSRSAYSNFWCGMNEKGYYERTDEYMPILE 206
Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-------DYDMAFCTNLRNKG 496
++ G++ V + + L+ + A + K Y + D + F + +
Sbjct: 207 KEK--VGVFPVVMVHSATLINLN--HANSRKLTYDPQKLEGYTGPNDDVITFAHSAKFAA 262
Query: 497 IHLKIDSTQEYGHLV 511
+ + I + +YGH++
Sbjct: 263 VEMFISNKDQYGHIL 277
>gi|16506820|ref|NP_055916.1| procollagen galactosyltransferase 2 precursor [Homo sapiens]
gi|74750765|sp|Q8IYK4.1|GT252_HUMAN RecName: Full=Procollagen galactosyltransferase 2; AltName:
Full=Glycosyltransferase 25 family member 2; AltName:
Full=Hydroxylysine galactosyltransferase 2; Flags:
Precursor
gi|12620188|gb|AAG60609.1|AF288389_1 C1orf17 [Homo sapiens]
gi|23273043|gb|AAH35672.1| Glycosyltransferase 25 domain containing 2 [Homo sapiens]
gi|119611578|gb|EAW91172.1| glycosyltransferase 25 domain containing 2, isoform CRA_c [Homo
sapiens]
gi|168278659|dbj|BAG11209.1| glycosyltransferase 25 domain-containing protein 2 [synthetic
construct]
gi|325463379|gb|ADZ15460.1| glycosyltransferase 25 domain containing 2 [synthetic construct]
Length = 626
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 114/249 (45%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL++V L FL + L+YP +++++ +N + +F +++ N
Sbjct: 49 PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEIFREWLKN 108
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I S+ A R A+ + K D+ ++D
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L L+ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKRT 227
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 228 --GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 284 CNREHYGYL 292
>gi|332811368|ref|XP_524994.3| PREDICTED: procollagen galactosyltransferase 2 [Pan troglodytes]
gi|410298208|gb|JAA27704.1| glycosyltransferase 25 domain containing 2 [Pan troglodytes]
Length = 626
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 114/249 (45%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL++V L FL + L+YP +++++ +N + +F +++ N
Sbjct: 49 PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEIFREWLKN 108
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I S+ A R A+ + K D+ ++D
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L L+ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKRT 227
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 228 --GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 284 CNREHYGYL 292
>gi|301611908|ref|XP_002935453.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase
1-B-like [Xenopus (Silurana) tropicalis]
Length = 610
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 54/238 (22%), Positives = 118/238 (49%), Gaps = 17/238 (7%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P + P+VLI++ L E L + L+YP ++IS++V +N + + +++ N
Sbjct: 38 PLRRPTVLIALLARNSEGSLPEVLGALERLHYPKERISLWVATDHNIDNTTQMLREWLIN 97
Query: 345 FKTMFKNV--------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDV 396
+ + +V +Y +++ + ++ ++ D+ F++D+D+ L NP+
Sbjct: 98 VQNQYHHVEWRPQEHPRYWGYSACFSLSSIHPDSLTSAREMWADYIFFLDADNLLTNPET 157
Query: 397 LKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPY 456
L L+ N++++AP++ A+SNFW + G+Y R+ YM I ++ KG + VP
Sbjct: 158 LNRLIAENKTIVAPMM-ESRAAYSNFWCGMTTQGYYRRTPAYMPIRRRER--KGCFPVPM 214
Query: 457 ITNCYL--MKTSVIKATNIKTIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQEYGHL 510
+ + +L ++ + + + + YD + F + R + + I + + YG+L
Sbjct: 215 VHSTFLIDLRKEASQQLDFYPPHADYTWAYDDIIVFAFSCRQADVQMFICNKEIYGYL 272
>gi|157073889|ref|NP_001096660.1| procollagen galactosyltransferase 1-A precursor [Xenopus laevis]
gi|160385807|sp|A0JPH3.1|G251A_XENLA RecName: Full=Procollagen galactosyltransferase 1-A; AltName:
Full=Glycosyltransferase 25 family member 1-A; AltName:
Full=Hydroxylysine galactosyltransferase 1-A; Flags:
Precursor
gi|117558235|gb|AAI27423.1| Glt25d1b protein [Xenopus laevis]
Length = 611
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 56/243 (23%), Positives = 117/243 (48%), Gaps = 26/243 (10%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P+VLI++ L E L + L+YP ++IS++V +N + + +++ N +
Sbjct: 41 PTVLIALLARNSEGSLPEVLGALDTLHYPKERISLWVATDHNLDNTTEILREWLINVQNQ 100
Query: 349 FKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHL 391
+ +V K+ +H+ + R A+ ++ D+ F++D+D+ L
Sbjct: 101 YHHVEWRPQEHPRWFKDEEGPKHWSHSRYEYIMKLRQAALTSAREMWADYIFFLDADNLL 160
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
NP+ L L+ N++++AP+L A+SNFW + G+Y R+ YM I ++ +G
Sbjct: 161 TNPETLNLLIAENKTVVAPML-DSRAAYSNFWCGMTTQGYYRRTPAYMPIRRRER--RGC 217
Query: 452 WNVPYITNCYL--MKTSVIKATNIKTIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQEY 507
+ VP + + +L ++ + N + + +D + F + R + + + + + Y
Sbjct: 218 FPVPMVHSTFLIDLRKEASQQLNFYPPHADYTWAFDDIIVFAFSCRQADVQMFLCNKEIY 277
Query: 508 GHL 510
GHL
Sbjct: 278 GHL 280
>gi|194376002|dbj|BAG57345.1| unnamed protein product [Homo sapiens]
Length = 554
Score = 79.3 bits (194), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 59/254 (23%), Positives = 109/254 (42%), Gaps = 40/254 (15%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFK 346
P Q P+VL++V L FL + L+YP +++++ + D+ F+
Sbjct: 49 PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHN-----VDNTTEIFR 103
Query: 347 TMFKNVKYIAH------------------------NSTVNSKEARNLAVENSLHKGVDFY 382
KNV+ + H + + + R A+ + K D+
Sbjct: 104 ERLKNVQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYI 163
Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
++D D+ L NP L L+ N++++AP+L +SNFW + GFY R+ DY+ I
Sbjct: 164 LFIDVDNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQI- 221
Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKG 496
+ G + VP + + +L+ + K + K + DY + F + R G
Sbjct: 222 -REWKRTGCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAG 278
Query: 497 IHLKIDSTQEYGHL 510
I + + + + YG+L
Sbjct: 279 IQMYLCNREHYGYL 292
>gi|390477037|ref|XP_003735231.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase 2
[Callithrix jacchus]
Length = 831
Score = 79.3 bits (194), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 114/249 (45%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL++V L FL + L+YP +++++ +N + + +++ N
Sbjct: 254 PLQSPTVLVAVLARNAAHSLPHFLGCLERLDYPKSRMAVWAATDHNVDNTTEILREWLKN 313
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I +S+ A R A+ + K D+ ++D
Sbjct: 314 VQRLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 373
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L L+ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 374 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKR- 431
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 432 -TGCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 488
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 489 CNREHYGYL 497
>gi|395825231|ref|XP_003785842.1| PREDICTED: procollagen galactosyltransferase 2 [Otolemur garnettii]
Length = 668
Score = 79.0 bits (193), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 58/249 (23%), Positives = 115/249 (46%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL+ V L FL + L+YP +++++ +N + +F +++ N
Sbjct: 91 PLQRPTVLVVVLARNAAHALPPFLGCLERLDYPKSRMAIWAATDHNVDNTTEIFREWLKN 150
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I +S+ A R A+ + + D+ ++D
Sbjct: 151 VQKLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTARERWSDYILFIDV 210
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L ++ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 211 DNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYIQIREWKR- 268
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K ++ DY + F + R GI + +
Sbjct: 269 -LGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQMHL 325
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 326 CNREHYGYL 334
>gi|47220022|emb|CAG12170.1| unnamed protein product [Tetraodon nigroviridis]
Length = 635
Score = 79.0 bits (193), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 50/192 (26%), Positives = 95/192 (49%), Gaps = 22/192 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P +L+++ L FL I LNYP ++++++V +NQ+ A + D++ +
Sbjct: 40 PRILLALVCRNSEHSLPYFLGTIERLNYPKERMALWVATDHNQDNTAVILRDWLVKMQDF 99
Query: 349 FKNVKYIAHNSTVNSKEA-----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
+ NV++ ++ R +A+E++ D++ D D+ L
Sbjct: 100 YHNVEWRPKEKPTRYEDEAGPKDWTDPRYEHVMKLRQVALESAREMWADYFMLADCDNLL 159
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
NPDVL L+ N+++I+P+L A+SNFW +++ G+Y R+ Y+ I Q KG
Sbjct: 160 TNPDVLWMLMKENKTIISPML-ESRGAYSNFWCGMSSQGYYKRTPAYIPI--RKQVRKGC 216
Query: 452 WNVPYITNCYLM 463
+ VP + + L+
Sbjct: 217 FAVPMVHSTLLI 228
>gi|326924714|ref|XP_003208570.1| PREDICTED: procollagen galactosyltransferase 2-like [Meleagris
gallopavo]
Length = 552
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/230 (24%), Positives = 105/230 (45%), Gaps = 30/230 (13%)
Query: 306 LEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNS 363
L FL + L YP +I+++V +N + + +++ N + ++ +V++ +
Sbjct: 4 LPHFLGCVERLRYPKSRIALWVATDHNVDNTTAILREWLKNVQNLYHDVEWRPMEDPQSY 63
Query: 364 KEA-----------------RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNES 406
E R A+ + K D+ ++D+D+ L NP+ L L+ N++
Sbjct: 64 PEEMGPKHWPSSRFTHVMKLRQAALRAAREKWSDYVLFLDTDNLLTNPETLNLLIAENKT 123
Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS 466
L+AP+L F +SNFW + G+Y R+ DY I + G + VP I + +L+
Sbjct: 124 LVAPMLESRF-LYSNFWCGITPQGYYKRTLDYPLIREWKR--TGCFAVPMIHSTFLI--D 178
Query: 467 VIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDSTQEYGHL 510
+ K + K ++ DY M F + R GI + I + + YG L
Sbjct: 179 LRKEASTKLMFYPPHQDYTWSFDDIMVFAFSSRQAGIQMFICNREHYGFL 228
>gi|297281262|ref|XP_002802062.1| PREDICTED: procollagen galactosyltransferase 2-like [Macaca
mulatta]
Length = 626
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 114/249 (45%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL++V L FL + L+YP +++++ +N + + +++ N
Sbjct: 49 PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 108
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I +S+ A R A+ + K D+ ++D
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L L+ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKRS 227
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 228 --GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 284 CNREHYGYL 292
>gi|402857849|ref|XP_003893450.1| PREDICTED: procollagen galactosyltransferase 2 [Papio anubis]
Length = 626
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 114/249 (45%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL++V L FL + L+YP +++++ +N + + +++ N
Sbjct: 49 PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 108
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I +S+ A R A+ + K D+ ++D
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L L+ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKRS 227
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 228 --GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 284 CNREHYGYL 292
>gi|426333026|ref|XP_004028088.1| PREDICTED: procollagen galactosyltransferase 2 [Gorilla gorilla
gorilla]
Length = 626
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 113/249 (45%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL++V L FL + L+YP +++++ +N + +F +++ N
Sbjct: 49 PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEIFREWLKN 108
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I S+ A R A+ + K D+ ++D
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L L+ N+++ AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 169 DNFLTNPQTLNLLIAENKTIAAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKRT 227
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 228 --GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 284 CNREHYGYL 292
>gi|160395584|sp|Q7Q021.4|GLT25_ANOGA RecName: Full=Glycosyltransferase 25 family member; Flags:
Precursor
Length = 592
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/205 (22%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 288 DQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNF 345
+Q P+V+++V + L F + + +L+YP ++S+++ + N++ + ++
Sbjct: 21 EQLPTVMVAVLVRNKAHTLPYFFSYLEDLDYPKDRMSLWIRSDHNEDRSIEITKAWLKRT 80
Query: 346 KTMFKNV--KYIAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDSD 388
+++ +V KY + S++ + A++ + D+ F++D+D
Sbjct: 81 SSLYHSVDFKYRSERGKRESEKTSTHWNEERFSDVIRLKQDALQAARMMWADYIFFIDAD 140
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L N + L L+ R ++AP+LV +SNFW + +D +Y R+ DY I+N DQ G
Sbjct: 141 VFLTNSNTLGKLIERKLPIVAPMLVSD-GLYSNFWCGMTSDYYYQRTDDYKKILNYDQIG 199
Query: 449 KGIWNVPYITNCYLMKTSVIKATNI 473
+ W VP + L+ ++ + +
Sbjct: 200 Q--WPVPMVHTAVLVSLNIAQTRQL 222
>gi|350589106|ref|XP_003482786.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase
2-like [Sus scrofa]
Length = 626
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/247 (23%), Positives = 113/247 (45%), Gaps = 30/247 (12%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
Q P+VL+++ L FL + L+YP +++++ +N + + +++ N +
Sbjct: 51 QRPTVLVAILARNAAHSLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKNVQ 110
Query: 347 TMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDSDS 389
+ V++ I S+ A R A+ + K D+ ++D D+
Sbjct: 111 RAYHYVEWRPMDEPESYPDEIGPKHWPGSRFAHVMKLRQAALRTAREKWSDYILFIDVDN 170
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NP L L+ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 171 FLTNPQTLSLLMAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR--L 227
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDS 503
G + VP + + +L+ + K + K ++ DY + F + R GI + + +
Sbjct: 228 GCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYLCN 285
Query: 504 TQEYGHL 510
T+ YG+L
Sbjct: 286 TEHYGYL 292
>gi|158300399|ref|XP_320324.3| AGAP012208-PA [Anopheles gambiae str. PEST]
gi|157013141|gb|EAA00118.3| AGAP012208-PA [Anopheles gambiae str. PEST]
Length = 554
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/205 (22%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 288 DQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNF 345
+Q P+V+++V + L F + + +L+YP ++S+++ + N++ + ++
Sbjct: 21 EQLPTVMVAVLVRNKAHTLPYFFSYLEDLDYPKDRMSLWIRSDHNEDRSIEITKAWLKRT 80
Query: 346 KTMFKNV--KYIAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDSD 388
+++ +V KY + S++ + A++ + D+ F++D+D
Sbjct: 81 SSLYHSVDFKYRSERGKRESEKTSTHWNEERFSDVIRLKQDALQAARMMWADYIFFIDAD 140
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L N + L L+ R ++AP+LV +SNFW + +D +Y R+ DY I+N DQ G
Sbjct: 141 VFLTNSNTLGKLIERKLPIVAPMLVSD-GLYSNFWCGMTSDYYYQRTDDYKKILNYDQIG 199
Query: 449 KGIWNVPYITNCYLMKTSVIKATNI 473
+ W VP + L+ ++ + +
Sbjct: 200 Q--WPVPMVHTAVLVSLNIAQTRQL 222
>gi|426240022|ref|XP_004013914.1| PREDICTED: procollagen galactosyltransferase 2 [Ovis aries]
Length = 626
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 57/249 (22%), Positives = 111/249 (44%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL+ V L FL + L+YP +++++ +N + + +++ N
Sbjct: 49 PLQRPTVLVVVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 108
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
+ + V K+ + + + R A+ + K D+ ++D
Sbjct: 109 VQQSYHYVEWRPMDEPESYPDEIGPKHWPASRFAHVMKLRQAALRTAREKWSDYILFIDV 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L L+ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 226
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K ++ DY + F + R GI + +
Sbjct: 227 -LGCFPVPMVHSTFLI--DLRKEASAKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 284 CNREHYGYL 292
>gi|147899177|ref|NP_001088623.1| procollagen galactosyltransferase 1-B precursor [Xenopus laevis]
gi|82179978|sp|Q5U483.1|G251B_XENLA RecName: Full=Procollagen galactosyltransferase 1-B; AltName:
Full=Glycosyltransferase 25 family member 1-B; AltName:
Full=Hydroxylysine galactosyltransferase 1-B; Flags:
Precursor
gi|55153756|gb|AAH85226.1| Glt25d1a protein [Xenopus laevis]
Length = 611
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 57/249 (22%), Positives = 118/249 (47%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYH--APLFDDYIHN 344
P + P+VLI+V L E L + L+YP ++IS++V + + + + +++ N
Sbjct: 37 PFRSPTVLIAVLARNSEGSLPEVLGALDRLHYPKERISLWVATDHNFDNTSQILREWLIN 96
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
+ + +V K+ +H+ + R A+ ++ D+ F++D+
Sbjct: 97 VQNQYHHVEWRPQEHPRWFRDEESPKHWSHSRYEYVMKLRQAALTSAREMWADYIFFLDA 156
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L N + L L+ N++++AP+L A+SNFW + G+Y R+ YM I ++
Sbjct: 157 DNLLTNSETLNLLIAENKTVVAPML-ESRAAYSNFWCGMTTQGYYRRTPAYMPIRRRER- 214
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA------FCTNLRNKGIHLKI 501
+G + VP + + +L+ + K + + + DY A F + R + + +
Sbjct: 215 -QGCFPVPMVHSTFLI--DLRKEASQQLDFYPPHADYTWAFDDIIVFAFSCRQAEVQMFL 271
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 272 CNKEIYGYL 280
>gi|441624487|ref|XP_004088995.1| PREDICTED: procollagen galactosyltransferase 2 isoform 2 [Nomascus
leucogenys]
Length = 554
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 112/249 (44%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL++V L FL + L+YP ++++ +N + + +++ N
Sbjct: 49 PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKTTMAIWAATDHNVDNTTEILREWLKN 108
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I S+ A R A+ + K D+ ++D
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L L+ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQI--REWK 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 226 RTGCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 284 CNREHYGYL 292
>gi|195998972|ref|XP_002109354.1| hypothetical protein TRIADDRAFT_21834 [Trichoplax adhaerens]
gi|190587478|gb|EDV27520.1| hypothetical protein TRIADDRAFT_21834 [Trichoplax adhaerens]
Length = 546
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 132/270 (48%), Gaps = 37/270 (13%)
Query: 284 SLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQ--------EY 333
+L+ ++ P+V+I+V + FL L +ANLNY K+I+ ++ NNQ E+
Sbjct: 8 ALEYNRLPAVVIAVLARDASDFLPTSLACLANLNYDKKRIAFWIATDNNQDQTEEMLVEW 67
Query: 334 HAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEAR--------NLAVENSLHKGVDFYFYV 385
+ + DY H + M N + + ++ +R LA+ +L D+ +V
Sbjct: 68 KSQVESDY-HRVEIMTSNNYSLQTDLSLQWTPSRYRHLLQLRQLALAAALKYWADYVLFV 126
Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKA-WSNFWGALNADGFYARSFDYMNIING 444
D+D+ L PD L L+ N +++APLL+ + +SNFW ++ G+Y R+ DY+ +
Sbjct: 127 DADNFLTEPDTLIELIKSNRTMVAPLLIESRHSYYSNFWCGVDEQGYYRRTEDYLPTLKR 186
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-----MDYDMAFCTNLRNKGIHL 499
++ KG+ V I + +L+ + K+ + Y +S +D + F + + GI
Sbjct: 187 ER--KGVLQVAMIHSTFLIDLNR-KSVEKFSFYPPHSSYQGHIDDLLIFSYSAKMAGIPF 243
Query: 500 KIDSTQEYGHLVDSENFDPQKTNPEVYELI 529
+ + + YG+L S P+V ELI
Sbjct: 244 HLLNNKIYGYLFSS---------PQVQELI 264
>gi|68357136|ref|XP_694217.1| PREDICTED: procollagen galactosyltransferase 1 [Danio rerio]
Length = 609
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 63/245 (25%), Positives = 113/245 (46%), Gaps = 30/245 (12%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P V+I++ L FL I LNYP +I+++V +N + L D++ N + +
Sbjct: 39 PRVMIALICRNNQHSLPHFLGTIERLNYPKDRIALWVATDHNVDNTTYLLRDWLINVQKL 98
Query: 349 FKNVKYIAHN--STVNSKEA---------------RNLAVENSLHKGVDFYFYVDSDSHL 391
+ V++ S N +E R A+E++ D+ +D D+ L
Sbjct: 99 YHYVEWRPKEQPSQYNDEEGPKDWTNERYAYVMKLRQAALESAREMWADYLMMIDCDNLL 158
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
N DVL L+ N++++AP++ A+SNFW + + G+Y R+ Y+ I Q KG
Sbjct: 159 INQDVLWKLIKENKTIVAPMM-ESRAAYSNFWCGMTSQGYYKRTPAYIPI--RKQVRKGC 215
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA------FCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ + K + + + DY A F + R + + I + +
Sbjct: 216 FAVPMVHSTFLV--DLRKEASRQLAFHPPHPDYTWAFDDIIVFAFSARIAEVQMFICNRE 273
Query: 506 EYGHL 510
YGHL
Sbjct: 274 IYGHL 278
>gi|348577951|ref|XP_003474747.1| PREDICTED: procollagen galactosyltransferase 2-like [Cavia
porcellus]
Length = 623
Score = 76.6 bits (187), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 113/249 (45%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL++V L FL + L+YP +++++ +N + + +++ N
Sbjct: 46 PLQKPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 105
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I +S+ A R A+ + K D+ ++D
Sbjct: 106 VQRLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 165
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L N L L+ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 166 DNFLTNTQTLSLLIAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 223
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 224 -MGCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 280
Query: 502 DSTQEYGHL 510
+ Q YG+L
Sbjct: 281 CNRQHYGYL 289
>gi|170741659|ref|YP_001770314.1| glycosyl transferase family protein [Methylobacterium sp. 4-46]
gi|168195933|gb|ACA17880.1| glycosyl transferase family 2 [Methylobacterium sp. 4-46]
Length = 661
Score = 76.6 bits (187), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 66/264 (25%), Positives = 124/264 (46%), Gaps = 33/264 (12%)
Query: 279 IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAP 336
++ L S P+ P VL++V + L+ +L+ I L+YP I + V NN +
Sbjct: 381 LRPLRSRLPEPAPRVLLAVLAKQKEPVLDLYLDCIEALDYPKSSIVLCVRTNNNTDRTGG 440
Query: 337 LFDDYIHNFKTMFKNVKY------------IAHN------STVNSKEARNLAVENSLHKG 378
+ ++ ++ + + H + + + R+LA+ +L +
Sbjct: 441 MLRAWLDRVGGLYAGIVFDDADVPEPVQDLAVHEWTPQRFAVLGAIRQRSLAL--TLARD 498
Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYARSF 436
FYF D+D+ L P L+ LV+ N ++AP+L V+P ++NF A++A G++A S
Sbjct: 499 CAFYFVADADNFL-IPSTLRDLVSLNLPIVAPMLREVKPGSRYANFHAAVDAQGYFAESR 557
Query: 437 DYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD-MAFCTNLRNK 495
DY ++ ++ G+ VP + YL++ I Y S ++ + F + R +
Sbjct: 558 DYDALL--ERRILGVVEVPVVHCTYLVRADAIPLLR----YEDGSGRHEYVVFSDHARRR 611
Query: 496 GIHLKIDSTQEYGHLVDSENFDPQ 519
GI +D+ + YG L E+ DP+
Sbjct: 612 GIPQYLDNRRCYGCLT-LEDDDPE 634
>gi|332230643|ref|XP_003264502.1| PREDICTED: procollagen galactosyltransferase 2 isoform 1 [Nomascus
leucogenys]
Length = 626
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 112/249 (44%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL++V L FL + L+YP ++++ +N + + +++ N
Sbjct: 49 PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKTTMAIWAATDHNVDNTTEILREWLKN 108
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I S+ A R A+ + K D+ ++D
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L L+ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKRT 227
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 228 --GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 284 CNREHYGYL 292
>gi|359319948|ref|XP_849763.2| PREDICTED: procollagen galactosyltransferase 2 isoform 1 [Canis
lupus familiaris]
Length = 564
Score = 76.3 bits (186), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 58/249 (23%), Positives = 114/249 (45%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL++V L FL + L+YP +++++ +N + + +++ N
Sbjct: 49 PVQRPTVLVAVLARNAAHALPPFLGCLERLDYPKGRMAIWAATDHNVDNTTEILREWLKN 108
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ + V++ I +S+ A R A+ + K D+ ++D
Sbjct: 109 VQRFYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L ++ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 169 DNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYVQIREWKR- 226
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K ++ DY + F + R GI + +
Sbjct: 227 -LGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 284 CNREHYGYL 292
>gi|344278455|ref|XP_003411009.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase
2-like [Loxodonta africana]
Length = 763
Score = 75.9 bits (185), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 58/252 (23%), Positives = 111/252 (44%), Gaps = 30/252 (11%)
Query: 284 SLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDY 341
S P Q P ++ L FL + L+YP +++++V +N + + ++
Sbjct: 183 SESPMQNPRCSWAILARNAAHTLSHFLGCLERLDYPKSRMAIWVATDHNVDNTTEILREW 242
Query: 342 IHNFKTMFKNVKY------------IAHNSTVNSK-----EARNLAVENSLHKGVDFYFY 384
+ N + ++ V++ I NS+ R A+ + K D+ +
Sbjct: 243 LKNIQRLYHYVEWRPMDEPQSYPDEIGPKHWPNSRFTHVMRLRQAALRTAREKWSDYILF 302
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D D+ L NP L ++ N++++AP+L +SNFW + GFY R+ DY+ I
Sbjct: 303 IDVDNFLTNPKTLDLMIAENKTIVAPML-ESRSLYSNFWCGITPQGFYKRTPDYLQIREW 361
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIH 498
+ G + VP + + L+ + K + K ++ DY M F + R GI
Sbjct: 362 KR--TGCFPVPMVHSTLLI--DLRKEASDKLMFYPPHQDYTWTFDDIMVFAFSSRQAGIQ 417
Query: 499 LKIDSTQEYGHL 510
+ + + + YG+L
Sbjct: 418 MYLCNREHYGYL 429
>gi|300796728|ref|NP_001178231.1| procollagen galactosyltransferase 2 precursor [Bos taurus]
gi|296478943|tpg|DAA21058.1| TPA: glycosyltransferase 25 domain containing 2 [Bos taurus]
Length = 626
Score = 75.9 bits (185), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 57/249 (22%), Positives = 111/249 (44%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL+ V L FL + L+YP +++++ +N + + +++ N
Sbjct: 49 PLQRPTVLVVVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 108
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
+ + V K+ + + + R A+ + K D+ ++D
Sbjct: 109 VQKAYHYVEWRPMDEPESYPDEIGPKHWPASRFAHVMKLRQAALRTAREKWSDYILFIDV 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L L+ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 169 DNFLTNPQTLNLLMAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 226
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K ++ DY + F + R GI + +
Sbjct: 227 -LGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 284 CNREHYGYL 292
>gi|327277409|ref|XP_003223457.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase
2-like [Anolis carolinensis]
Length = 631
Score = 75.5 bits (184), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 60/252 (23%), Positives = 110/252 (43%), Gaps = 40/252 (15%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
Q P+V +++ L FL + L YP +++++V + D+ + K
Sbjct: 56 QKPTVFLAILARNAAGSLPHFLGCLERLRYPKPRMAVWVAKERN-----VDNTTNILKEW 110
Query: 349 FKNVKYIAH-------------------NSTVNSKEA-----RNLAVENSLHKGVDFYFY 384
KNV+ + H NS+ A R A+ + K D+ +
Sbjct: 111 LKNVQKLYHYLXWRPMEEPHSYPEEIGPKHWPNSRFAHVMKLRQAALRTAREKWSDYIMF 170
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D+ L NPDVL ++ N++++AP+L +SNFW + G+Y R+ DY I
Sbjct: 171 IDADNFLTNPDVLNLMIAENKTIVAPML-ESRNLYSNFWCGMTPQGYYKRTPDYSLIREW 229
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIH 498
+ G + VP + + +L+ + K + K ++ DY + F + R I
Sbjct: 230 KR--TGCFAVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQADIQ 285
Query: 499 LKIDSTQEYGHL 510
+ I + + YG+L
Sbjct: 286 MYICNREHYGYL 297
>gi|46447512|ref|YP_008877.1| procollagen-lysine 5-dioxygenase [Candidatus Protochlamydia
amoebophila UWE25]
gi|46401153|emb|CAF24602.1| putative procollagen-lysine 5-dioxygenase [Candidatus
Protochlamydia amoebophila UWE25]
Length = 295
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 66/267 (24%), Positives = 133/267 (49%), Gaps = 32/267 (11%)
Query: 292 SVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTMF 349
+VL+++ L FLN I +L+Y K IS++++ NN + + + ++ ++
Sbjct: 34 TVLLALLARNKEHTLPAFLNCIEHLDYDKKCISIYIHTNNNIDKTQEILEAWVKEKGNLY 93
Query: 350 KNVKYIAH--NSTVNSK-------------EARNLAVENSLHKGVDFYFYVDSDSHLDNP 394
K+V ++ N+ + ++ + RN ++E + D+YF VD D+ +
Sbjct: 94 KDVIFVKQDLNTVLTNRPHEWTPERFKILAKIRNDSLEYAKLLKSDYYFVVDCDNFI-TA 152
Query: 395 DVLKYLVNRNESLIAPLLVRPFKA---WSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
D LK L+ +++ +IAPLL R + +SNF+ A++ G+Y DY+ I++ ++ G+
Sbjct: 153 DTLKDLIKQDKPIIAPLL-RSLETNNYYSNFFCAIDETGYYGYHLDYLKIVSYEKI--GV 209
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA-FCTNLRNKGIHLKIDSTQEYGHL 510
+ VP + YL+++ + + Y S DY+ F R K + I + ++YG+L
Sbjct: 210 FKVPVVHCTYLIQSKYLDQLS----YIDGSEDYEFVIFSRKAREKNVDQYISNEKKYGYL 265
Query: 511 V---DSENFDPQKTNPEVYELIRNPLD 534
V D+ + + +K ++R D
Sbjct: 266 VHFFDNLSLEEEKERMASINILRRIAD 292
>gi|410053448|ref|XP_512497.4| PREDICTED: procollagen galactosyltransferase 1, partial [Pan
troglodytes]
Length = 484
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 118/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 48 PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K+++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283
Query: 503 STQEYGHL 510
+ +EYG L
Sbjct: 284 NKEEYGFL 291
>gi|417403445|gb|JAA48526.1| Putative procollagen galactosyltransferase 2 [Desmodus rotundus]
Length = 626
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL+++ L FL + L+YP +++++ +N + + +++ N
Sbjct: 49 PLQRPTVLVALLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 108
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
+ + V K+ + + + R A+ + K D+ ++D
Sbjct: 109 VQRAYHYVEWRPMEEPESYPDEIGPKHWPASRFAHVMKLRQAALRTARDKWSDYILFIDV 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L ++ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 169 DNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYIQIREWKR- 226
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K ++ DY + F + R GI + +
Sbjct: 227 -TGCFPVPMVHSTFLI--DLRKEASDKLMFHPPHQDYAWTFDDIIVFAFSSRQAGIQMYL 283
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 284 CNREHYGYL 292
>gi|426387749|ref|XP_004060325.1| PREDICTED: procollagen galactosyltransferase 1 [Gorilla gorilla
gorilla]
Length = 585
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/229 (23%), Positives = 107/229 (46%), Gaps = 27/229 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFK 346
P Q P VLI++ L L + L +P ++ +++ Y ++E
Sbjct: 48 PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWSYPDEE-------------- 93
Query: 347 TMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNES 406
K+ + + + + R A++++ D+ +VD+D+ + NPD L L+ N++
Sbjct: 94 ----GPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKT 149
Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS 466
++AP+L A+SNFW + + G+Y R+ Y+ I D+ +G + VP + + +L+
Sbjct: 150 VVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR--RGCFAVPMVHSTFLIDLR 206
Query: 467 VIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
+ N+ YT S D + F + + + + + + +EYG L
Sbjct: 207 KAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVCNKEEYGFL 254
>gi|334321909|ref|XP_001375578.2| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase
2-like [Monodelphis domestica]
Length = 631
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/247 (21%), Positives = 113/247 (45%), Gaps = 30/247 (12%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
Q P+V +++ L FL + L+YP +++++ +N + + +++ N +
Sbjct: 56 QRPTVFVTILARIAAHTLPHFLGCLERLDYPKDRMAIWAATDHNIDNTTEILREWLKNVQ 115
Query: 347 TMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
++ V K+ + + + R A+ + K D+ ++D D+
Sbjct: 116 KLYHYVEWRPMDDPQSYPDEIGPKHWPGSRFTHVMKLRQAALRTAREKWSDYILFIDVDN 175
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NP L +++ N++++AP+L +SNFW + G+Y R+ DY+ I + +
Sbjct: 176 FLTNPQTLNLMISENKTIVAPML-ESRSLYSNFWCGITPQGYYKRTPDYIQIREWKR--R 232
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDS 503
G + VP + + +L+ + K + K ++ DY + F + R GI + + +
Sbjct: 233 GCFPVPMVHSTFLI--DLRKEASQKLMFFPPHQDYSWTFDDIIVFAFSSRQAGIQMYLCN 290
Query: 504 TQEYGHL 510
+ YG+L
Sbjct: 291 REHYGYL 297
>gi|326431358|gb|EGD76928.1| hypothetical protein PTSG_07269 [Salpingoeca sp. ATCC 50818]
Length = 858
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 117/240 (48%), Gaps = 30/240 (12%)
Query: 28 NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLL 87
I + + LV+TVA++ + FI E+ + V +G G G G+K+ +
Sbjct: 161 GIVKPRLLVMTVATHR----EPFI---ELTEQSVGNIGKKLLVAGEGEFFKGYGWKLKKV 213
Query: 88 KNELDEMDITDDM-IILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
+ L + DD ++L TDS+D + +++++ F + +A +V AE CWP+ L
Sbjct: 214 RETL--LKYKDDYDMVLFTDSFDSFVFAEEDELIDTFRSMNAPMVVSAEVNCWPNPELAT 271
Query: 147 KYPAVGS--GYRYLNSGGFIGYAKDIKEL------ISNRSIKNEEDDQLYYALLFLDETL 198
+ P S Y Y NSGG++GY I L I ++S ++ +L A++ ++
Sbjct: 272 EMPPSSSVGHYPYPNSGGYMGYLGYILHLYNDVIAIHHKSDCCDDQGELIKAVVLDNKAF 331
Query: 199 RTKHKIVLDTLANLFQNLYGSLE-DIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
R H+ V LFQ L+GS + D+ + D +H N +T+P ++H NG K L
Sbjct: 332 RIDHQAV------LFQTLFGSAKRDVVVR---DGRIH--NQATHTSPAVVHANGWDKGPL 380
>gi|449507875|ref|XP_004176247.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase 2
[Taeniopygia guttata]
Length = 621
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/245 (22%), Positives = 108/245 (44%), Gaps = 30/245 (12%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P+VL+++ L L I L+YP +I+++ +N + + +++ N + +
Sbjct: 44 PTVLLAIIARNAAHTLPHVLGCIERLSYPKSRIALWAATDHNIDNTTAILREWLKNVQHL 103
Query: 349 FKNVKYIAHNSTVNSKEA-----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
+ +V++ + E R A+ + K D+ + D+D+ L
Sbjct: 104 YHDVEWRPMEEPPSYPEEIGPKHWPSSRFTHVMKLRQAALRTAREKWSDYILFTDADNLL 163
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
NP+ L L+ N++L+AP+L +SNFW + G+Y R+ +Y I + G
Sbjct: 164 TNPETLNLLIAENKTLVAPML-ESRSLYSNFWCGITPQGYYKRTLEYPLI--REWKRMGC 220
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDSTQ 505
+ VP I + +L+ + K + K + DY M F + R G+ + + + +
Sbjct: 221 FAVPMIHSTFLI--DLRKEASAKLAFYPPHQDYTWSFDDIMVFAFSSRQAGVQMFVCNRE 278
Query: 506 EYGHL 510
YG L
Sbjct: 279 HYGFL 283
>gi|194272156|ref|NP_001123548.1| procollagen galactosyltransferase 2 precursor [Danio rerio]
gi|159570814|emb|CAP19485.1| novel protein similar to vertebrate glycosyltransferase 25 domain
containing 1 (GLT25D1) [Danio rerio]
Length = 613
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 56/245 (22%), Positives = 112/245 (45%), Gaps = 30/245 (12%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P V+I++ L +L+ I L+YP +I+++ +N + + +++ N ++
Sbjct: 42 PKVMIAILARNSAHSLPYYLDCIDRLDYPKDRIAIWAATDHNVDNSTAMLREWLKNRQSR 101
Query: 349 FKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHL 391
+ V K+ + + + + R A++ + + D+ YVDSD+ L
Sbjct: 102 YHYVEWRPMEEPRSYTDEWGPKHWSSSRVSHVMKLRQAALKAARARWADYILYVDSDNLL 161
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
NP VL L+ N +L+AP+L +SNFW + G+Y R+ DY I + G
Sbjct: 162 TNPRVLNLLMAENLTLVAPML-DSRSLYSNFWCGITPQGYYKRTPDYQPIREWKR--LGC 218
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDSTQ 505
++VP + + +L+ + ++ + + DY M F + R G+ + + + +
Sbjct: 219 FSVPMVHSTFLL--DLRRSATLDMAFYPPHPDYSWAFDDIMVFAFSAREAGVQMYVCNRE 276
Query: 506 EYGHL 510
YG L
Sbjct: 277 HYGFL 281
>gi|395530942|ref|XP_003767545.1| PREDICTED: procollagen galactosyltransferase 2 [Sarcophilus
harrisii]
Length = 630
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/247 (21%), Positives = 113/247 (45%), Gaps = 30/247 (12%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
Q P+V +++ L FL + L+YP +++++ +N + + +++ N +
Sbjct: 55 QKPTVFVAILARNAAHTLPHFLGCLERLDYPKDRMAIWAATDHNVDNTTEILREWLKNVQ 114
Query: 347 TMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
++ V K+ + + + R A+ + K D+ ++D D+
Sbjct: 115 KLYHYVEWRPMDDPQSYPDEIGPKHWPGSRFTHVMKLRQAALRTAREKWSDYILFIDVDN 174
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NP L +++ N++++AP+L +SNFW + G+Y R+ DY+ I + +
Sbjct: 175 FLTNPQTLNLMISENKTIVAPML-ESRSLYSNFWCGITPQGYYKRTPDYIQI--REWKRR 231
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDS 503
G + VP + + +L+ + K + K ++ DY + F + R GI + + +
Sbjct: 232 GCFPVPMVHSTFLI--DLRKEASQKLMFFPPHQDYAWTFDDIIVFAFSSRQAGIQMYLCN 289
Query: 504 TQEYGHL 510
+ YG+L
Sbjct: 290 REHYGYL 296
>gi|392352745|ref|XP_222718.6| PREDICTED: procollagen galactosyltransferase 2 [Rattus norvegicus]
Length = 633
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 57/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+V + V L FL + L+YP +++++ +N + + +++ N
Sbjct: 48 PLQKPTVFVVVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 107
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I +S+ A R A+ + K D+ ++D
Sbjct: 108 VQRLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L ++ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 168 DNFLTNPQTLNLMIAENKTILAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKRT 226
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 227 --GCFPVPMVHSTFLI--DLRKEASDKLSFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 283 CNKEHYGYL 291
>gi|119605029|gb|EAW84623.1| glycosyltransferase 25 domain containing 1, isoform CRA_c [Homo
sapiens]
Length = 565
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 118/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 48 PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K+++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283
Query: 503 STQEYGHL 510
+ +EYG L
Sbjct: 284 NKEEYGFL 291
>gi|380019473|ref|XP_003693629.1| PREDICTED: glycosyltransferase 25 family member-like [Apis florea]
Length = 558
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 67/245 (27%), Positives = 114/245 (46%), Gaps = 27/245 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P+VLI + + L FL + L YP K+I ++++ NN + + +++N
Sbjct: 18 PTVLIIILVRNKAHTLPYFLTFLERLTYPKKRIHLWIHSDNNIDNSIEILSTWLNNESNK 77
Query: 349 FKNVK---------YIAHNSTVNSKEARNLAV----ENSLHKG----VDFYFYVDSDSHL 391
+ V+ + N N R L V E +L G DF + +D+D L
Sbjct: 78 YHGVQINFDENSKGFDDENGITNWSAQRFLHVINLREEALKAGRNIWADFIWMLDADVFL 137
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
NP+ L L+ +N+ +IAPLL + +SNFW + +D +Y R+ +Y I+ ++ KG
Sbjct: 138 TNPNTLDELILKNQIVIAPLL-KSDGLYSNFWAGMTSDYYYLRTKEYEPILFREK--KGC 194
Query: 452 WNVPYITNCYLM----KTSVIKATNIKTIYTLNS-MDYDMAFCTNLRNKGIHLKIDSTQE 506
+NVP I + L+ + S N K +Y N +D + F G+ L I +
Sbjct: 195 FNVPMIHSAVLINLRKQLSDFLTYNPKKLYQYNGPIDDIITFAVGANKTGVPLFICNDNI 254
Query: 507 YGHLV 511
YG ++
Sbjct: 255 YGFIM 259
>gi|148707509|gb|EDL39456.1| glycosyltransferase 25 domain containing 2, isoform CRA_a [Mus
musculus]
Length = 469
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+V + V L FL + L+YP +++++ +N + + +++ +
Sbjct: 48 PPQKPTVFVVVLARNAAHTLPYFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKS 107
Query: 345 FKTMFKNVKYIAHNST------------VNSK-----EARNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ N NS+ + R A+ + K D+ ++D
Sbjct: 108 VQRLYHYVEWRPMNEPESYPDEIGPKHWPNSRFSHVMKLRQAALRTAREKWSDYILFIDV 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L ++ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 168 DNFLTNPQTLNLMIVENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQI--REWK 224
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 225 RMGCFPVPMVHSTFLI--DLRKEASDKLAFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 283 CNKEHYGYL 291
>gi|148707510|gb|EDL39457.1| glycosyltransferase 25 domain containing 2, isoform CRA_b [Mus
musculus]
Length = 625
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+V + V L FL + L+YP +++++ +N + + +++ +
Sbjct: 48 PPQKPTVFVVVLARNAAHTLPYFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKS 107
Query: 345 FKTMFKNVKYIAHNST------------VNSK-----EARNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ N NS+ + R A+ + K D+ ++D
Sbjct: 108 VQRLYHYVEWRPMNEPESYPDEIGPKHWPNSRFSHVMKLRQAALRTAREKWSDYILFIDV 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L ++ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 168 DNFLTNPQTLNLMIVENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 226 -MGCFPVPMVHSTFLI--DLRKEASDKLAFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 283 CNKEHYGYL 291
>gi|26343025|dbj|BAC35169.1| unnamed protein product [Mus musculus]
Length = 625
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+V + V L FL + L+YP +++++ +N + + +++ +
Sbjct: 48 PPQKPTVFVVVLARNAAHTLPYFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKS 107
Query: 345 FKTMFKNVKYIAHNST------------VNSK-----EARNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ N NS+ + R A+ + K D+ ++D
Sbjct: 108 VQRLYHYVEWRPMNEPESYPDEIGPKHWPNSRFSHVMKLRQAALRTAREKWSDYILFIDV 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L ++ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 168 DNFLTNPQTLNLMIVENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 226 -MGCFPVPMVHSTFLI--DLRKEASDKLAFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 283 CNKEHYGYL 291
>gi|45768794|gb|AAH68118.1| Glycosyltransferase 25 domain containing 2 [Mus musculus]
Length = 625
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+V + V L FL + L+YP +++++ +N + + +++ +
Sbjct: 48 PPQKPTVFVVVLARNAAHTLPYFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKS 107
Query: 345 FKTMFKNVKYIAHNST------------VNSK-----EARNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ N NS+ + R A+ + K D+ ++D
Sbjct: 108 VQRLYHYVEWRPMNEPESYPDEIGPKHWPNSRFSHVMKLRQAALRTAREKWSDYILFIDV 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L ++ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 168 DNFLTNPQTLNLMIVENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 226 -MGCFPVPMVHSTFLI--DLRKEASDKLAFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 283 CNKEHYGYL 291
>gi|293341373|ref|XP_001070927.2| PREDICTED: procollagen galactosyltransferase 2 [Rattus norvegicus]
gi|149058405|gb|EDM09562.1| glycosyltransferase 25 domain containing 2 (predicted) [Rattus
norvegicus]
Length = 625
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 57/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+V + V L FL + L+YP +++++ +N + + +++ N
Sbjct: 48 PLQKPTVFVVVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 107
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I +S+ A R A+ + K D+ ++D
Sbjct: 108 VQRLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L ++ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 168 DNFLTNPQTLNLMIAENKTILAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 226 -TGCFPVPMVHSTFLI--DLRKEASDKLSFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 283 CNKEHYGYL 291
>gi|228008340|ref|NP_808424.3| procollagen galactosyltransferase 2 precursor [Mus musculus]
gi|160395572|sp|Q6NVG7.2|GT252_MOUSE RecName: Full=Procollagen galactosyltransferase 2; AltName:
Full=Glycosyltransferase 25 family member 2; AltName:
Full=Hydroxylysine galactosyltransferase 2; Flags:
Precursor
Length = 625
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+V + V L FL + L+YP +++++ +N + + +++ +
Sbjct: 48 PPQKPTVFVVVLARNAAHTLPYFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKS 107
Query: 345 FKTMFKNVKYIAHNST------------VNSK-----EARNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ N NS+ + R A+ + K D+ ++D
Sbjct: 108 VQRLYHYVEWRPMNEPESYPDEIGPKHWPNSRFSHVMKLRQAALRTAREKWSDYILFIDV 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L ++ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 168 DNFLTNPQTLNLMIVENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K + DY + F + R GI + +
Sbjct: 226 -MGCFPVPMVHSTFLI--DLRKEASDKLAFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282
Query: 502 DSTQEYGHL 510
+ + YG+L
Sbjct: 283 CNKEHYGYL 291
>gi|31377697|ref|NP_078932.2| procollagen galactosyltransferase 1 precursor [Homo sapiens]
gi|74715064|sp|Q8NBJ5.1|GT251_HUMAN RecName: Full=Procollagen galactosyltransferase 1; AltName:
Full=Glycosyltransferase 25 family member 1; AltName:
Full=Hydroxylysine galactosyltransferase 1; Flags:
Precursor
gi|22761754|dbj|BAC11684.1| unnamed protein product [Homo sapiens]
gi|80478641|gb|AAI08309.1| Glycosyltransferase 25 domain containing 1 [Homo sapiens]
gi|119605028|gb|EAW84622.1| glycosyltransferase 25 domain containing 1, isoform CRA_b [Homo
sapiens]
Length = 622
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 118/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 48 PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K+++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283
Query: 503 STQEYGHL 510
+ +EYG L
Sbjct: 284 NKEEYGFL 291
>gi|410222202|gb|JAA08320.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
gi|410222204|gb|JAA08321.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
gi|410259730|gb|JAA17831.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
gi|410259732|gb|JAA17832.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
gi|410259734|gb|JAA17833.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
gi|410259736|gb|JAA17834.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
gi|410300922|gb|JAA29061.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
gi|410300924|gb|JAA29062.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
Length = 622
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 118/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 48 PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K+++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283
Query: 503 STQEYGHL 510
+ +EYG L
Sbjct: 284 NKEEYGFL 291
>gi|301608466|ref|XP_002933810.1| PREDICTED: procollagen galactosyltransferase 2-like [Xenopus
(Silurana) tropicalis]
Length = 616
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 60/270 (22%), Positives = 121/270 (44%), Gaps = 33/270 (12%)
Query: 270 TSGCTRCNLIKHLDSLKPD---QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
T+ C + +++ P+ Q PSVLI++ L F++ I L+YP +I+++
Sbjct: 19 TNLCASAEELNIEEAVLPESSLQKPSVLIAIIARNAAHTLPYFMDCIDKLDYPKSRIAIW 78
Query: 327 VY--NNQEYHAPLFDDYIHNFKTMFKNV-----------------KYIAHNSTVNSKEAR 367
+N + + +++ + + ++ V K+ + + + R
Sbjct: 79 AATDHNIDNTTAILREWLKSVQKLYHYVEWRPMAEPQSYADELGPKHWPASRFAHVMKLR 138
Query: 368 NLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN 427
A+ + K D+ Y+D+D+ L NP L ++ N++++AP+L +SNFW +
Sbjct: 139 QAALRTAKEKWSDYVLYIDADNFLTNPQTLNLMMKENKTIVAPML-ESRTLYSNFWCGMT 197
Query: 428 ADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA 487
G+Y R+ DY+ I + G + VP + + L+ + N++ + DY A
Sbjct: 198 PQGYYKRTPDYVLI--REWKRLGCFPVPMVHSTILIDLRKEASKNLQ--FYPPQEDYTWA 253
Query: 488 ------FCTNLRNKGIHLKIDSTQEYGHLV 511
F + R GI + I + + YG+L
Sbjct: 254 FDDIIVFAFSSRQAGIQMYICNREHYGYLA 283
>gi|417411747|gb|JAA52300.1| Putative procollagen galactosyltransferase 1, partial [Desmodus
rotundus]
Length = 579
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/248 (22%), Positives = 117/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 5 PLQAPRVLIALLARNAAHALPATLGALERLQHPRERTALWVATDHNSDNTSAVLREWLVA 64
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K+++ +V K+ + + + R A++++ D+ +VDS
Sbjct: 65 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDTRYEHVMKLRQAALKSARDMWADYILFVDS 124
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 125 DNLILNPDTLTLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 182
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ T YT S D + F + + + + +
Sbjct: 183 -QGCFAVPMVHSTFLIDLRKAASRNLAFYPPHTDYTW-SFDDIIVFAFSCKQAEVQMYVC 240
Query: 503 STQEYGHL 510
+ + YG L
Sbjct: 241 NKEVYGFL 248
>gi|157823499|ref|NP_001099537.1| procollagen galactosyltransferase 1 precursor [Rattus norvegicus]
gi|149036101|gb|EDL90767.1| glycosyltransferase 25 domain containing 1 (predicted) [Rattus
norvegicus]
gi|169642770|gb|AAI60899.1| Glycosyltransferase 25 domain containing 1 [Rattus norvegicus]
Length = 617
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/248 (22%), Positives = 116/248 (46%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 43 PLQAPRVLIALLARNAAPALPATLGALERLRHPRERTALWVATDHNTDNTSAILREWLVA 102
Query: 345 FKTMFKNVKY--IAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDS 387
K ++ +V++ S+ +E R A++++ D+ +VDS
Sbjct: 103 VKGLYHSVEWRPAEEPSSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDS 162
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 163 DNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 220
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 221 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 278
Query: 503 STQEYGHL 510
+ + YG L
Sbjct: 279 NKEVYGFL 286
>gi|13470787|ref|NP_102356.1| hypothetical protein mll0582 [Mesorhizobium loti MAFF303099]
gi|14021530|dbj|BAB48142.1| mll0582 [Mesorhizobium loti MAFF303099]
Length = 931
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/241 (22%), Positives = 113/241 (46%), Gaps = 28/241 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P +L+++ + L +L I L+YP I +++ NN + + +++ +
Sbjct: 405 PRILVTILAKQKEPALPLYLECIEALDYPKASIVLYIRTNNNTDRTEHILREWVERVGHL 464
Query: 349 FKNVKYIAHNSTVNSKE----------------ARNLAVENSLHKGVDFYFYVDSDSHLD 392
+ V++ A N ++ RN+++ +L DFYF D D+ +
Sbjct: 465 YAAVEFDASNVADRVEQFGEHEWNETRFRVLGRIRNISLRKTLEHSCDFYFVADVDNFV- 523
Query: 393 NPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
P L+ LV + ++APLL + P + +SN+ ++A+G+Y + Y ++N + +G
Sbjct: 524 RPATLRELVALDVPIVAPLLRSISPGQYYSNYHAEIDANGYYMQCDQYGWVLN--RHVRG 581
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD-MAFCTNLRNKGIHLKIDSTQEYGH 509
I +P + YL++ V+ + Y + Y+ + F + R GI +D+ Q YG+
Sbjct: 582 IIEMPLVHCTYLVRADVLP----ELTYEDATSRYEYVIFADSARKAGIVQYMDNRQVYGY 637
Query: 510 L 510
+
Sbjct: 638 I 638
>gi|355755605|gb|EHH59352.1| Procollagen galactosyltransferase 1 [Macaca fascicularis]
Length = 558
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 117/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 48 PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K ++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 108 VKNLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-NSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283
Query: 503 STQEYGHL 510
+ +EYG L
Sbjct: 284 NKEEYGFL 291
>gi|355703306|gb|EHH29797.1| Procollagen galactosyltransferase 1 [Macaca mulatta]
Length = 622
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 57/248 (22%), Positives = 116/248 (46%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 48 PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107
Query: 345 FKTMFKNVKY-IAHNSTVNSKEA----------------RNLAVENSLHKGVDFYFYVDS 387
K ++ +V++ A S E R A++++ D+ +VD+
Sbjct: 108 VKNLYHSVEWRPAEEPRSYSDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283
Query: 503 STQEYGHL 510
+ +EYG L
Sbjct: 284 NKEEYGFL 291
>gi|432868112|ref|XP_004071417.1| PREDICTED: procollagen galactosyltransferase 1-like [Oryzias
latipes]
Length = 610
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 57/245 (23%), Positives = 113/245 (46%), Gaps = 30/245 (12%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P V +++ L FL I LNYP +++++V +N + + D++ + +
Sbjct: 40 PRVHVALICRNSEHSLPHFLGTIERLNYPKDRMALWVATDHNVDNTTAVLRDWLIKVQNL 99
Query: 349 FKNVKYIAHNS--TVNSKEA---------------RNLAVENSLHKGVDFYFYVDSDSHL 391
+ V++ + + +E R A+E++ D++ VD D+ L
Sbjct: 100 YHYVEWRPQEEPRSYDDEEGPKHWTDLRYEHVMKLRQAALESAREMWADYFMLVDCDNLL 159
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
NP+VL L+ N+++IAP+L A+SNFW + ++G+Y R+ Y+ I Q KG
Sbjct: 160 TNPNVLWKLIQENKTIIAPML-ESRAAYSNFWCGMTSEGYYRRTPAYIPIRR--QVRKGC 216
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA------FCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ + K + + + DY A F + R + + + + +
Sbjct: 217 FAVPMVHSTFLI--DLRKEASKQLAFYPPHPDYSWAFDDIIVFAYSARMADVQMFVCNRE 274
Query: 506 EYGHL 510
YG+
Sbjct: 275 SYGYF 279
>gi|383862287|ref|XP_003706615.1| PREDICTED: glycosyltransferase 25 family member-like [Megachile
rotundata]
Length = 570
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 116/246 (47%), Gaps = 29/246 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYI------ 342
PSVLI++ + L FL + LNYP ++I +++ NN + + ++
Sbjct: 28 PSVLITILVRNKAHTLPYFLTFLEQLNYPKQRIHLWICSDNNIDKSIEILSTWLNRTAKE 87
Query: 343 -HNFKTMF--KNVKYIAHNSTVNSKEARNLAV----ENSLHKG----VDFYFYVDSDSHL 391
H +T F K+V + N + R L V E +L+ G DF + +D+D +
Sbjct: 88 YHGVETSFDEKSVGFEDENGVAHWSMQRFLHVIKLREAALNAGRNIWADFVWMLDADVFI 147
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
NP L L++RN++ +APLL + +SNFW + D +Y R+ Y I+ ++ KG
Sbjct: 148 TNPYTLNELISRNQTAVAPLL-KSDGLYSNFWAGMTNDYYYLRTDKYEPILYREE--KGC 204
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDSTQ 505
++VP I + L+ + + T N DYD + F + GI L I +
Sbjct: 205 FSVPMIHSAVLIDLRTHLSDQL-TYNPKNLNDYDGPIDDIITFAIGAKKFGIPLFICNAN 263
Query: 506 EYGHLV 511
YG+++
Sbjct: 264 VYGYIM 269
>gi|327282249|ref|XP_003225856.1| PREDICTED: procollagen galactosyltransferase 1-like [Anolis
carolinensis]
Length = 527
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 57/249 (22%), Positives = 113/249 (45%), Gaps = 30/249 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P + P VL+++ L L + L +P + +++V +N + + +++ N
Sbjct: 33 PPRAPRVLVALLARNAAHSLPAALGCLERLRHPKDRTALWVATDHNVDNTTAVLREWLTN 92
Query: 345 FKTMFKNVKY--------------IAHNSTVNSKEA---RNLAVENSLHKGVDFYFYVDS 387
K+M+ +V++ H S + R A++ + D+ +VDS
Sbjct: 93 VKSMYHSVEWRPMELPRSYPDEEGPKHWSNFRYEHVMKLRQAALQAARDMWADYILFVDS 152
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ + D+
Sbjct: 153 DNLLTNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPVRKRDR- 210
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA------FCTNLRNKGIHLKI 501
KG + VP + + +L+ + N+ ++ DY A F R + + +
Sbjct: 211 -KGCFAVPMVHSTFLINLQKEASQNL--VFYPPHPDYTWAFDDIIVFAFACRQAEVQMYV 267
Query: 502 DSTQEYGHL 510
+ + YG L
Sbjct: 268 CNKEVYGFL 276
>gi|326664713|ref|XP_686329.4| PREDICTED: procollagen galactosyltransferase 2 [Danio rerio]
Length = 584
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 58/245 (23%), Positives = 111/245 (45%), Gaps = 29/245 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P V I++ L FL I L+YP +IS++ +N + + ++I + +
Sbjct: 27 PKVAIAILARNSEHSLPYFLGCIERLDYPKDRISIWAATDHNTDNTTGMLREWIAGVEDL 86
Query: 349 FKNV------------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
+ +V K+ + + R A++++ + D+ + DSD+
Sbjct: 87 YHSVQLHTMEQEKSSYVDELGPKHWPETRFTHVMKLRQAALKSARAQWADYVLFTDSDNL 146
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N VL L++ N +L+AP+L +SNFW + + G+Y R+ Y+ I + G
Sbjct: 147 LTNTQVLNQLISENRTLVAPML-DSRTLYSNFWCGMTSQGYYKRTPHYVPIRTWKR--TG 203
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQ 505
VP I + L+ +A+ + Y ++ ++D MAF + R G+ + I + +
Sbjct: 204 CHPVPMIHSTMLIDLRR-RASELLAFYPVHHHYLWALDDIMAFAFSARQTGVQMFICNRE 262
Query: 506 EYGHL 510
YG+L
Sbjct: 263 HYGYL 267
>gi|395847891|ref|XP_003796597.1| PREDICTED: procollagen galactosyltransferase 1 [Otolemur garnettii]
Length = 623
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 56/248 (22%), Positives = 117/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 49 PLQAPRVLIALLARNAAHALPTTLGALERLRHPPERTALWVATDHNMDNTSAVLREWLVA 108
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K+++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 109 MKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHIMKLRQAALKSARDMWADYILFVDA 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 169 DNLLLNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 226
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + I + +
Sbjct: 227 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEIQMYVC 284
Query: 503 STQEYGHL 510
+ + YG L
Sbjct: 285 NKEVYGFL 292
>gi|402904728|ref|XP_003915192.1| PREDICTED: procollagen galactosyltransferase 1 [Papio anubis]
Length = 622
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 117/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 48 PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K+++ +V K+ + + + + R A++++ D+ +VD
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDV 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283
Query: 503 STQEYGHL 510
+ +EYG L
Sbjct: 284 NKEEYGFL 291
>gi|340715525|ref|XP_003396262.1| PREDICTED: glycosyltransferase 25 family member-like [Bombus
terrestris]
Length = 569
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 95/193 (49%), Gaps = 24/193 (12%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P+VLI++ + L FL + L YP ++I +++ NN + + ++ N ++
Sbjct: 28 PTVLITILVRNKAHTLPYFLTFLEQLTYPKERIHLWICSDNNIDNSIEILSAWLKNERSK 87
Query: 349 FKNVKY--------------IAHNSTVNSKEARNLAVENSLHKG----VDFYFYVDSDSH 390
+ V+ IAH S NL E +LH G DF + +D+D
Sbjct: 88 YHGVEINFDEKSNGFEDENEIAHWSPQRFLHVINLR-EEALHAGRNIWADFIWMLDADVF 146
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L NP+ L L+ +NE+++APLL + +SNFW + +D +Y R+ Y I+ + KG
Sbjct: 147 LTNPNTLNELILKNETVVAPLL-KSDGLYSNFWAGVTSDFYYLRTEKYEPILFREI--KG 203
Query: 451 IWNVPYITNCYLM 463
+NVP I + L+
Sbjct: 204 CFNVPMIHSAVLI 216
>gi|350422829|ref|XP_003493297.1| PREDICTED: glycosyltransferase 25 family member-like [Bombus
impatiens]
Length = 569
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 57/193 (29%), Positives = 96/193 (49%), Gaps = 24/193 (12%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P+VLI++ + L FL + L YP ++I +++ NN + + ++ N ++
Sbjct: 28 PTVLITILVRNKAHTLPYFLTFLEQLTYPKERIHLWICSDNNIDNSIEILSAWLKNERSK 87
Query: 349 FKNVKYIAHNSTVNSKEARN----------LAV----ENSLHKG----VDFYFYVDSDSH 390
+ V+ I N N E N L V E +LH G DF + +D+D
Sbjct: 88 YHGVE-INFNEKSNGFEDENEISHWSPQRFLHVINLREEALHAGRNIWADFIWMLDADVF 146
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L NP+ L L+ +NE+++APLL + +SNFW + +D +Y R+ Y I+ + KG
Sbjct: 147 LTNPNTLNELILKNETVVAPLL-KSDGLYSNFWAGMTSDFYYLRTEKYEPILFREI--KG 203
Query: 451 IWNVPYITNCYLM 463
+NVP I + L+
Sbjct: 204 CFNVPMIHSAVLI 216
>gi|390347653|ref|XP_783019.3| PREDICTED: procollagen galactosyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 646
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 59/270 (21%), Positives = 124/270 (45%), Gaps = 34/270 (12%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYH--APLFDDYIHNFKTM 348
P+V I + L F + LNYP +I++++ + P+ ++I
Sbjct: 46 PTVFIPILARNKAHTLPHFFGYLERLNYPKDRITLWIRADHSVDNTIPMLREWIQRVAHY 105
Query: 349 FKNVKY-IAHNSTVNSKEA----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
+ V Y + V + E R+ A++ + + D+++ +D D+ +
Sbjct: 106 YHTVDYAFEEHPQVYALEKGPHDWPSARFNHLIDLRDQALQEARNVWADYFYTMDVDNFV 165
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
++L L++ +++IAP+L + +SNFWG + + GFY R+ +Y+ I+ + G+
Sbjct: 166 WEQNILDVLMSEKKTIIAPML-QSTTYYSNFWGGVTSKGFYKRTKEYVKIVK--RNVTGV 222
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTL----NSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
+ VP + + YL+ + +AT+ T L +D + F + + GI I + Y
Sbjct: 223 FKVPMVHSTYLINLNH-EATDKLTYKPLKDYAQDLDDMLTFAHSAKKAGISFYITNKDHY 281
Query: 508 GHLVDSENFDPQKTNP---EVYELIRNPLD 534
G ++ + P+ +P EV +++ L+
Sbjct: 282 GAML----YPPESHHPLKEEVEQMLHTKLE 307
>gi|348525092|ref|XP_003450056.1| PREDICTED: procollagen galactosyltransferase 1-like [Oreochromis
niloticus]
Length = 657
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 61/244 (25%), Positives = 109/244 (44%), Gaps = 28/244 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P V+I++ L FL I LNYP +I+++V +N + + +++ +
Sbjct: 86 PRVVIALVCRNSAHSLPLFLGTIERLNYPKDRIALWVATDHNVDNTTAILREWLIKVQNY 145
Query: 349 FKNVKY------IAHNSTVNSKEARNL-----------AVENSLHKGVDFYFYVDSDSHL 391
+ V++ A V K NL A++ + D+ D D+ L
Sbjct: 146 YHYVEWRPEDEPSAFEDEVGPKHWNNLRYEHVMKLRQAALDTAREIWADYLLVADCDNLL 205
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
N DVL L+ N++++AP+L A+SNFW + + G+Y R+ YM I Q +G
Sbjct: 206 TNQDVLWKLMRENKTIVAPML-ESRAAYSNFWCGMTSQGYYRRTPAYMPIRR--QERRGC 262
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQE 506
+ VP + + YLM +A+ Y + ++D + F + R + + I + +
Sbjct: 263 FPVPMVHSTYLMDLRK-EASRQLAFYPPHPEYSWALDDVIVFAYSARMADVQMYICNKET 321
Query: 507 YGHL 510
YGH
Sbjct: 322 YGHF 325
>gi|440908235|gb|ELR58279.1| Procollagen galactosyltransferase 2, partial [Bos grunniens mutus]
Length = 610
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 58/251 (23%), Positives = 112/251 (44%), Gaps = 32/251 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL+ V L FL + L+YP +++++ +N + + +++ N
Sbjct: 31 PLQRPTVLVVVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 90
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
+ + V K+ + + + R A+ + K D+ ++D
Sbjct: 91 VQKAYHYVEWRPMDEPESYPDEIGPKHWPASRFAHVMKLRQAALRTAREKWSDYILFIDV 150
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL--NADGFYARSFDYMNIINGD 445
D+ L NP L L+ N++++AP+L +SNFW + A GFY R+ DY+ I
Sbjct: 151 DNFLTNPQTLNLLMAENKTIVAPML-ESRGLYSNFWCGITPQASGFYKRTPDYLQIREWK 209
Query: 446 QGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHL 499
+ G + VP + + +L+ + K + K ++ DY + F + R GI +
Sbjct: 210 R--LGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQM 265
Query: 500 KIDSTQEYGHL 510
+ + + YG+L
Sbjct: 266 YLCNREHYGYL 276
>gi|312376729|gb|EFR23732.1| hypothetical protein AND_12342 [Anopheles darlingi]
Length = 332
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 48/205 (23%), Positives = 96/205 (46%), Gaps = 23/205 (11%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFK 346
Q PSV+I+V I L F + L+YP ++S+++ + N++ + ++
Sbjct: 40 QPPSVMIAVLIRNKEHTLPYFFTYLEELDYPKDRLSIWIRSDHNEDRSIEITKAWLKRST 99
Query: 347 TMFKNVKYIAHNSTVNSKEA------------------RNLAVENSLHKGVDFYFYVDSD 388
++ +V + +E+ + A++ + D+ ++D+D
Sbjct: 100 PLYHSVDFKYRTEPAGKRESEKTYTHWTEDRFADVIRLKEEALQTARKMWADYVLFLDAD 159
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L NP LK L++ ++AP+LV +SNFW + AD +Y R+ DY I+N + G
Sbjct: 160 VFLTNPRSLKALIDLKLPIVAPMLVSD-GLYSNFWCGMTADYYYHRTDDYKKILNYELVG 218
Query: 449 KGIWNVPYITNCYLMKTSVIKATNI 473
+ W VP + + L+ +V ++ +
Sbjct: 219 Q--WAVPMVHSAVLVDLNVAESRRL 241
>gi|160333551|ref|NP_001103992.1| procollagen galactosyltransferase 1 precursor [Danio rerio]
gi|160395521|sp|A5PMF6.1|GT251_DANRE RecName: Full=Procollagen galactosyltransferase 1; AltName:
Full=Glycosyltransferase 25 family member 1; AltName:
Full=Hydroxylysine galactosyltransferase 1; Flags:
Precursor
Length = 604
Score = 72.8 bits (177), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 71/308 (23%), Positives = 133/308 (43%), Gaps = 51/308 (16%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P VL+++ L L I LNYP +++++V +N + + +++ N +
Sbjct: 34 PRVLVALVCRNSAHSLPHVLGAIDRLNYPKDRMAVWVATDHNSDNTTEILREWLVNVQNF 93
Query: 349 FKNVKYIAHNS-TVNSKEA----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
+ V++ + +V E+ R A+E + D++ VD D+ L
Sbjct: 94 YHYVEWRPQDEPSVYEGESGPKHWTNLRYEHVMKLRQAALETAREMWADYFMLVDCDNLL 153
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
N DVL L+ N++++AP+L A+SNFW + + G+Y R+ YM I Q KG
Sbjct: 154 TNRDVLWKLMRENKTIVAPML-ESRAAYSNFWCGMTSQGYYKRTPAYMPIRR--QERKGC 210
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA------FCTNLRNKGIHLKIDSTQ 505
+ VP + + L+ + K + + + DY A F + R + + I + +
Sbjct: 211 FAVPMVHSTLLL--DLRKEASRQLAFFPPHPDYTWAFDDIIIFAFSARMAEVQMYICNRE 268
Query: 506 EYGHLV-----------DSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
YG+ ++E+F + ++ ++RNP I P SL+P +
Sbjct: 269 TYGYFPVPLRSQNSLQDEAESF----LHSQLEVMVRNPP------IEPSVYLSLMPKQTD 318
Query: 555 NQPCPDVF 562
+VF
Sbjct: 319 KMGFDEVF 326
>gi|397493909|ref|XP_003817838.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase 1
[Pan paniscus]
Length = 622
Score = 72.4 bits (176), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 54/248 (21%), Positives = 118/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 48 PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K+++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + +PD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 168 DNLILSPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283
Query: 503 STQEYGHL 510
+ +EYG L
Sbjct: 284 NKEEYGFL 291
>gi|119605027|gb|EAW84621.1| glycosyltransferase 25 domain containing 1, isoform CRA_a [Homo
sapiens]
Length = 645
Score = 72.4 bits (176), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 54/248 (21%), Positives = 117/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 48 PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K+++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + G +++
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAGTYMRAT 283
Query: 503 STQEYGHL 510
+ + HL
Sbjct: 284 GPRLFLHL 291
>gi|301787505|ref|XP_002929168.1| PREDICTED: procollagen galactosyltransferase 2-like, partial
[Ailuropoda melanoleuca]
Length = 630
Score = 72.4 bits (176), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 61/266 (22%), Positives = 117/266 (43%), Gaps = 32/266 (12%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL+++ L FL + L++ + M+ +N + + +++ N
Sbjct: 53 PMQRPTVLVAILARNAAHALPHFLGCLERLDFAKSPLIMWAATDHNVDNTTEILREWLKN 112
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
++ + V++ I S+ A R A+ + K D+ ++D
Sbjct: 113 VQSFYHYVEWRPMDEPESYPDEIGPKHWPGSRFAHVMKLRQAALRTAREKWSDYILFIDV 172
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NP L ++ N++++AP+L +SNFW + GFY R+ DY+ I +
Sbjct: 173 DNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 230
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
G + VP + + +L+ + K + K ++ DY + F + R GI + +
Sbjct: 231 -LGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 287
Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYE 527
+ + YG+L PQ+T E E
Sbjct: 288 CNREHYGYL--PIPLKPQQTLQEEIE 311
>gi|383421633|gb|AFH34030.1| procollagen galactosyltransferase 1 precursor [Macaca mulatta]
Length = 622
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 54/248 (21%), Positives = 117/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 48 PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K ++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 108 VKNLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + +PD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 168 DNLILSPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283
Query: 503 STQEYGHL 510
+ +EYG L
Sbjct: 284 NKEEYGFL 291
>gi|410926753|ref|XP_003976837.1| PREDICTED: procollagen galactosyltransferase 2-like [Takifugu
rubripes]
Length = 617
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/245 (22%), Positives = 113/245 (46%), Gaps = 26/245 (10%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFK 346
Q P+V+I++ L +L + LNYP +IS++ + N + + +++ +
Sbjct: 57 QPPTVVIAILARNSAHSLPYYLGALERLNYPKDRISVWAASDHNVDNTTAVLKEWLTAMQ 116
Query: 347 TMFKNVKYIAHNSTV------------NSK-----EARNLAVENSLHKGVDFYFYVDSDS 389
+ +V++ + NS+ + + A+ + + D+ Y D+D+
Sbjct: 117 QFYHHVEWRPMDQPTWYAGELGPKHWPNSRYEYVMKLKQAALGFARKRWADYILYADADN 176
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NPD L L+ N+S++AP+L A+SNFW + G+Y R+ +Y + +
Sbjct: 177 ILTNPDTLNLLIAENKSVVAPML-HSQGAYSNFWCGITPQGYYRRTAEYFPTRHRHR--L 233
Query: 450 GIWNVPYITNCYL--MKTSVIKATNIKTIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQ 505
G + VP + + L ++ +K + S YD + F + R +GI + + + +
Sbjct: 234 GCFPVPMVHSTMLLDLRKEGMKRLAFFPPHADYSWPYDDIIVFAFSCRTEGIQMYLCNKE 293
Query: 506 EYGHL 510
YG+L
Sbjct: 294 RYGYL 298
>gi|148697003|gb|EDL28950.1| glycosyltransferase 25 domain containing 1, isoform CRA_b [Mus
musculus]
Length = 478
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/247 (21%), Positives = 114/247 (46%), Gaps = 26/247 (10%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 53 PLQAPRVLIALLARNAAPALPATLGALEQLRHPRERTALWVATDHNTDNTSAILREWLVA 112
Query: 345 FKTMFKNVKY--IAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDS 387
K ++ +V++ S+ +E R A++++ D+ ++D
Sbjct: 113 VKGLYHSVEWRPAEEPSSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFMDI 172
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 173 DNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 230
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYT----LNSMDYDMAFCTNLRNKGIHLKIDS 503
+G + VP + + +L+ + N+ T S D + F + + + + + +
Sbjct: 231 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPTHPDYTWSFDDIIVFAFSCKQAEVQMYVCN 289
Query: 504 TQEYGHL 510
+ YG L
Sbjct: 290 KEVYGFL 296
>gi|348508948|ref|XP_003442014.1| PREDICTED: procollagen galactosyltransferase 1-like [Oreochromis
niloticus]
Length = 610
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 94/192 (48%), Gaps = 22/192 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P VL+++ L FL I LNYP +++++V +N++ + D++ + +
Sbjct: 40 PRVLLALICRNSEHSLPYFLGTIERLNYPKDRMALWVATDHNEDNTTAILRDWLVKVQKL 99
Query: 349 FKNVKYIAHNSTVNSKEA-----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
+ V++ + ++ R A+E++ D++ D D+ L
Sbjct: 100 YHYVEWRPKEEPRSYEDEEGPKDWIDPRYEHVMKLRQAALESAREMWADYFMLADCDNLL 159
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
N +VL+ L+ +N+++IAP+L A+SNFW + + G+Y R+ Y+ + Q KG
Sbjct: 160 TNSNVLRGLMKQNKTIIAPML-ESRAAYSNFWCGMTSQGYYKRTPAYIPV--RKQIRKGC 216
Query: 452 WNVPYITNCYLM 463
+ VP + + +L+
Sbjct: 217 FAVPMVHSTFLI 228
>gi|311249255|ref|XP_003123541.1| PREDICTED: procollagen galactosyltransferase 1-like [Sus scrofa]
gi|456752987|gb|JAA74072.1| glycosyltransferase 25 domain containing 1 [Sus scrofa]
Length = 623
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/248 (21%), Positives = 117/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 49 PLQAPRVLIALLARNAAHALPSTLGALERLRHPRERTALWVATDHNSDNTSAVLREWLVA 108
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K+++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 109 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 169 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 226
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 227 -QGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 284
Query: 503 STQEYGHL 510
+ + YG L
Sbjct: 285 NKEVYGFL 292
>gi|119611576|gb|EAW91170.1| glycosyltransferase 25 domain containing 2, isoform CRA_a [Homo
sapiens]
Length = 638
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/261 (22%), Positives = 114/261 (43%), Gaps = 42/261 (16%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P+VL++V L FL + L+YP +++++ +N + +F +++ N
Sbjct: 49 PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEIFREWLKN 108
Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
+ ++ V++ I S+ A R A+ + K D+ ++D
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNA------------DGFYARS 435
D+ L NP L L+ N++++AP+L +SNFW + GFY R+
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKAKNTTHLFALLQGFYKRT 227
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFC 489
DY+ I + G + VP + + +L+ + K + K + DY + F
Sbjct: 228 PDYVQIREWKRT--GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFA 283
Query: 490 TNLRNKGIHLKIDSTQEYGHL 510
+ R GI + + + + YG+L
Sbjct: 284 FSSRQAGIQMYLCNREHYGYL 304
>gi|355690359|gb|AER99127.1| glycosyltransferase 25 domain containing 1 [Mustela putorius furo]
Length = 579
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/248 (21%), Positives = 117/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P + +++V +N + + + +++
Sbjct: 5 PLQAPRVLIALVARNAAHALPATLGALERLRHPRGRTALWVATDHNSDNTSAVLREWLVA 64
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K+++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 65 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHIMKLRQAALKSARDMWADYILFVDA 124
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NP+ L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 125 DNLILNPNTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 182
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 183 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 240
Query: 503 STQEYGHL 510
+ +EYG L
Sbjct: 241 NKEEYGFL 248
>gi|348556988|ref|XP_003464302.1| PREDICTED: procollagen galactosyltransferase 1-like [Cavia
porcellus]
Length = 627
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/221 (23%), Positives = 106/221 (47%), Gaps = 24/221 (10%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L++P ++ +++V +N + + + +++
Sbjct: 53 PLQAPRVLIALLARNAAHALPATLGALERLHHPRERTALWVATDHNADNTSAVLREWLVA 112
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K ++ +V K+ + + + R A++ + D+ +VD+
Sbjct: 113 VKGLYHSVEWRPAEEPRSYPDEEGPKHWSDTRYEHVMKLRQAALKAARDMWADYILFVDA 172
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ L NPD L+ L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I ++
Sbjct: 173 DNLLVNPDTLRLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRER- 230
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAF 488
+G + VP + + +L+ + KA + + DY AF
Sbjct: 231 -RGCFAVPMVHSTFLL--DLRKAASRSLAFYPPHPDYTWAF 268
>gi|291229542|ref|XP_002734736.1| PREDICTED: glycosyltransferase 25 domain containing 2-like, partial
[Saccoglossus kowalevskii]
Length = 576
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/259 (21%), Positives = 128/259 (49%), Gaps = 29/259 (11%)
Query: 277 NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYH 334
N+++++ + Q P++ + + L FL I L+YP ++ +++ + N +
Sbjct: 35 NVVENVHAESEFQNPTIFLPILARNKAHTLPVFLAYIDRLDYPKSRMRIWIQSDHNIDNT 94
Query: 335 APLFDDYIHNFKTMFKNV---------KYIAHNSTVNSKEAR--------NLAVENSLHK 377
+ +++ N K ++++ KY ++ E R A++ + +
Sbjct: 95 TSILKEWVSNVKHTYRSIDESYADEPDKYSTEVGPLDWPEERFSHMIKLRQEALDEARRQ 154
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
DF F+VD D+ ++ P L L+ +++IAP++ A++NFW ++ G+Y R+ +
Sbjct: 155 WADFIFFVDCDNFIEEPQTLNLLIAEKKTIIAPMM-ESDSAYANFWCGVDDQGYYIRTPE 213
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYT-LNSM--DYD--MAFCTNL 492
Y+ + ++ KG + VP + + +L+ + +++++K + LNS DYD + F +
Sbjct: 214 YLPTLRRER--KGCFPVPMVHSTFLI--DLRRSSSLKLQFNPLNSYRGDYDDILIFAYSA 269
Query: 493 RNKGIHLKIDSTQEYGHLV 511
+ I + + +T +G L+
Sbjct: 270 KIAEIQMYVLNTWYFGMLL 288
>gi|22760716|dbj|BAC11307.1| unnamed protein product [Homo sapiens]
Length = 622
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/248 (21%), Positives = 116/248 (46%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + +++
Sbjct: 48 PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTELREWLVA 107
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K+++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ D+
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPTRKRDR- 225
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283
Query: 503 STQEYGHL 510
+ +EYG L
Sbjct: 284 NKEEYGFL 291
>gi|397640090|gb|EJK73928.1| hypothetical protein THAOC_04424 [Thalassiosira oceanica]
Length = 569
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/119 (41%), Positives = 62/119 (52%), Gaps = 10/119 (8%)
Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
AE L + L ER F G VRA FVVRY P QP+LR H DSS + NI LN
Sbjct: 309 AEKLNARLSVLMERTF-GVFRGAVRANDIFVVRYEPGGQPNLRRHTDSSFISFNIILND- 366
Query: 679 GVDYEGGGCRFIRY----NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
+EGGG RF + +V +G+ ++ + HEGL T GTRYI++ F D
Sbjct: 367 --GFEGGGTRFHSRPDGTHIDVKPPAVGYGILSNANI--LHEGLATTNGTRYILVGFDD 421
>gi|170784829|ref|NP_666323.2| procollagen galactosyltransferase 1 precursor [Mus musculus]
gi|160395574|sp|Q8K297.2|GT251_MOUSE RecName: Full=Procollagen galactosyltransferase 1; AltName:
Full=Glycosyltransferase 25 family member 1; AltName:
Full=Hydroxylysine galactosyltransferase 1; Flags:
Precursor
gi|34785210|gb|AAH56951.1| Glycosyltransferase 25 domain containing 1 [Mus musculus]
gi|148697002|gb|EDL28949.1| glycosyltransferase 25 domain containing 1, isoform CRA_a [Mus
musculus]
Length = 617
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 53/247 (21%), Positives = 114/247 (46%), Gaps = 26/247 (10%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 43 PLQAPRVLIALLARNAAPALPATLGALEQLRHPRERTALWVATDHNTDNTSAILREWLVA 102
Query: 345 FKTMFKNVKY--IAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDS 387
K ++ +V++ S+ +E R A++++ D+ ++D
Sbjct: 103 VKGLYHSVEWRPAEEPSSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFMDI 162
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 163 DNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 220
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN----SMDYDMAFCTNLRNKGIHLKIDS 503
+G + VP + + +L+ + N+ T S D + F + + + + + +
Sbjct: 221 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPTHPDYTWSFDDIIVFAFSCKQAEVQMYVCN 279
Query: 504 TQEYGHL 510
+ YG L
Sbjct: 280 KEVYGFL 286
>gi|21595163|gb|AAH32165.1| Glycosyltransferase 25 domain containing 1 [Mus musculus]
Length = 617
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 54/248 (21%), Positives = 115/248 (46%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 43 PLQAPRVLIALLARNAAPALPATLGALEQLRHPRERTALWVATDHNTDNTSAILREWLVA 102
Query: 345 FKTMFKNVKY--IAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDS 387
K ++ +V++ S+ +E R A++++ D+ ++D
Sbjct: 103 VKGLYHSVEWRPAEEPSSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFMDI 162
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 163 DNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 220
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 221 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 278
Query: 503 STQEYGHL 510
+ + YG L
Sbjct: 279 NKEVYGFL 286
>gi|74217150|dbj|BAE43293.1| unnamed protein product [Mus musculus]
Length = 617
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 53/247 (21%), Positives = 114/247 (46%), Gaps = 26/247 (10%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 43 PLQAPRVLIALLARNAAPALPATLGALEQLRHPRERTALWVATDHNTDNTSAILREWLVA 102
Query: 345 FKTMFKNVKY--IAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDS 387
K ++ +V++ S+ +E R A++++ D+ ++D
Sbjct: 103 VKGLYHSVEWRPAEEPSSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFMDI 162
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 163 DNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 220
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN----SMDYDMAFCTNLRNKGIHLKIDS 503
+G + VP + + +L+ + N+ T S D + F + + + + + +
Sbjct: 221 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPTHPDYTWSFDDIIVFAFSCKQAEVQMYVCN 279
Query: 504 TQEYGHL 510
+ YG L
Sbjct: 280 KEVYGFL 286
>gi|431921989|gb|ELK19162.1| Glycosyltransferase 25 family member 1 [Pteropus alecto]
Length = 624
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 54/248 (21%), Positives = 117/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 50 PLQEPRVLIALLARNAAHALPATLGALERLRHPRERTALWVATDHNSDNTSTVLREWLVA 109
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K ++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 110 VKNLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 169
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L++ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 170 DNLILNPDTLTLLISENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 227
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 228 -RGCFAVPMVHSTFLIDLRKSASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 285
Query: 503 STQEYGHL 510
+ + YG L
Sbjct: 286 NKEVYGFL 293
>gi|391347179|ref|XP_003747842.1| PREDICTED: glycosyltransferase 25 family member-like [Metaseiulus
occidentalis]
Length = 587
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 60/266 (22%), Positives = 128/266 (48%), Gaps = 38/266 (14%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEY-HAP-LFDDYIHNFKTM 348
P +L+ + L F + NL+YP K I +++ ++ + + P + D + K+
Sbjct: 46 PDILVVILASNEEHTLPIFFGCLENLDYPKKSIELYIRSDHNHDNTPFMLDTWCAARKSE 105
Query: 349 FKNVK------------------YIAHNSTVNSKE-ARNLAVENSLHKGVDFYFYVDSDS 389
+ ++ + + + KE A N A E KG D+ F++D+D+
Sbjct: 106 YADISLDIRMLPTHYDEKDIHWPMSRYRTMIELKEDALNYARE----KGFDYIFFLDTDA 161
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
+ N D+L L++ N++++APLL + +SNFWG ++ G+Y RS +Y I+ ++G
Sbjct: 162 FITNLDLLNDLISVNKTIVAPLL-QSASLYSNFWGDMDKKGYYLRSTNYTEIV--ERGIV 218
Query: 450 GIWNVPYITNCYLMKTSVIKATNI----KTIYTLNSMDYD--MAFCTNLRNKGIHLKIDS 503
G + V + + L+K + +T + + + S+ D + F + + + + +
Sbjct: 219 GSFPVRLVHSAVLVKLTDEASTALTFVREKVDNFESIPQDDIITFARSAQTNNVPQYVTN 278
Query: 504 TQEYGHLVDSENFDPQKTNPEVYELI 529
+E G+++ S P+ + E+ +L+
Sbjct: 279 EKENGYMLRS----PESLSQEIQDLV 300
>gi|405967145|gb|EKC32345.1| Glycosyltransferase 25 family member 1 [Crassostrea gigas]
Length = 600
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 56/245 (22%), Positives = 118/245 (48%), Gaps = 33/245 (13%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFKTM 348
P+V+I++ + L F + LNYP +IS ++ + N++ A + +++ K +
Sbjct: 45 PTVMIAILVRNKAHILPWFFGHLEKLNYPKNRISFWIRSDHNEDDSARMLREWVDANKNV 104
Query: 349 FKNVKYIA--------------HNSTVNSKEA---RNLAVENSLHKGVDFYFYVDSDSHL 391
+ ++ + H ST + R A+ + D+ F +D+D L
Sbjct: 105 YHHIDLVIEDNKDKYEDEIGPLHWSTKRFDKVIALRENALLAARRAWADYLFMLDADVVL 164
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPF-KAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
+N + L L++ + +IAP+L + +SNFWG ++ G+Y R+ Y +I+ ++ G
Sbjct: 165 ENRNTLTQLIDAKQPIIAPMLNASIGETYSNFWGGMDEMGYYKRAPGYFDIL--ERKRLG 222
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLNSMD-YD------MAFCTNLRNKGIHLKIDS 503
++ VP + L+ ++++ + +T N + YD + F N+R G+ + I +
Sbjct: 223 VFEVPMVHTALLLDMHLMESDS----FTYNKPEGYDGPHDDIIIFGLNVRKAGMVMHIMN 278
Query: 504 TQEYG 508
T+ +G
Sbjct: 279 TEYFG 283
>gi|348527790|ref|XP_003451402.1| PREDICTED: procollagen galactosyltransferase 2-like [Oreochromis
niloticus]
Length = 731
Score = 69.3 bits (168), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 57/261 (21%), Positives = 107/261 (40%), Gaps = 43/261 (16%)
Query: 283 DSLKPDQF---PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFD 339
+ +KP+ P V+I + L +L I L YP ++I+++ + D
Sbjct: 149 EQVKPESSLLKPKVMIVIVARNAAHSLPYYLGCIERLEYPKERIAIWAATDHN-----VD 203
Query: 340 DYIHNFKTMFKNVKYIAHNSTVNSKEA------------------------RNLAVENSL 375
+ + K ++I H E R A++ +
Sbjct: 204 NTTAMLREWLKRAQHIYHFVEWRPMEEPRSYTDEWGPKHWPPSRFNHVMKLRQAALKAAR 263
Query: 376 HKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
+ D+ +VDSD+ L NP VL ++ N +L+AP+L +SNFW + G+Y R+
Sbjct: 264 ERWADYILFVDSDNLLTNPRVLNLMMAENLTLVAPML-ESRSLYSNFWCGMTPQGYYKRT 322
Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFC 489
DY I + G + VP + + +L+ ++++ ++ DY M F
Sbjct: 323 PDYQPIREWKR--LGCFPVPMVHSTFLLDLRRESSSDL--VFYPPHPDYSWAFDDIMVFA 378
Query: 490 TNLRNKGIHLKIDSTQEYGHL 510
+ R G+ + + + + YG L
Sbjct: 379 FSARQAGVQMYVCNREHYGFL 399
>gi|91078804|ref|XP_970300.1| PREDICTED: similar to Glycosyltransferase 25 family member
[Tribolium castaneum]
gi|270003725|gb|EFA00173.1| hypothetical protein TcasGA2_TC002995 [Tribolium castaneum]
Length = 559
Score = 69.3 bits (168), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 92/192 (47%), Gaps = 22/192 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFKTM 348
P+VLI+V L FL + NL+YP +IS+++ + N + + +I+ K
Sbjct: 24 PTVLIAVLARNKAHTLPYFLTTLENLDYPKNRISLWIRSDHNSDKTIEILRKWINAVKDE 83
Query: 349 FKNV--KYIAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDSDSHL 391
++ + +++ N + R ++ + D+Y+ +D D L
Sbjct: 84 YRMISTEFVEENEGYPDESGPAHWTPERFNHVIDLRESSLNFARKIWADYYWTIDCDVFL 143
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
NP L L+++ +++AP+L + +SNFW + D +Y R+ DY ++N + G
Sbjct: 144 TNPKTLDILISKGYTVVAPML-KSDGLYSNFWYGMTDDYYYQRTEDYKPVVN--RENIGC 200
Query: 452 WNVPYITNCYLM 463
+NVP + +C L+
Sbjct: 201 FNVPMVHSCVLV 212
>gi|346473379|gb|AEO36534.1| hypothetical protein [Amblyomma maculatum]
Length = 315
Score = 69.3 bits (168), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/271 (22%), Positives = 112/271 (41%), Gaps = 46/271 (16%)
Query: 272 GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQ 331
G RC D L + P++LI+V + L F + +YP ++S+++Y +
Sbjct: 21 GVVRCTTRD--DKL---ESPTLLIAVVLRNKAHVLPHFFGYLERQSYPKSRVSLWIYTDH 75
Query: 332 EYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA------------------------R 367
D T + HN V ++ R
Sbjct: 76 S-----VDTTAEMVNTWAEEASGDYHNVNVTKEDGDAFFPDEDGVQKWTSERYWHIIRLR 130
Query: 368 NLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN 427
A+ + DF ++D D+ L NP ++ LV N ++IAP+L A+SNFW +N
Sbjct: 131 EEAIHVARAMWADFVLFLDGDALLSNPKTIQDLVEENRTIIAPML-DSRSAYSNFWCGMN 189
Query: 428 ADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM----- 482
G+Y R+ +YM I+ ++ G++ V + + L+ + A + K Y +
Sbjct: 190 EKGYYKRTDEYMPILEREK--IGVFPVVMVHSATLINLN--HADSRKLTYDPRKLQGYTG 245
Query: 483 --DYDMAFCTNLRNKGIHLKIDSTQEYGHLV 511
D + F + + G+ + + + +YGH++
Sbjct: 246 PNDDVITFAHSAKFAGVEMFVSNKDQYGHIL 276
>gi|323452214|gb|EGB08089.1| hypothetical protein AURANDRAFT_71687 [Aureococcus anophagefferens]
Length = 1302
Score = 69.3 bits (168), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 58/203 (28%), Positives = 93/203 (45%), Gaps = 17/203 (8%)
Query: 535 WDLRYIHPEYQKSLLPDTVNNQPCPDVFWF--PIVTEKFCHEFVQIMEAYGQWSDGTNND 592
W R + E + D +++P V+ F P+V C + + I EA+ G
Sbjct: 699 WAKRRVPFEALERPKSDDDDDEPPAYVYAFDEPVVPAASCADAIAIAEAHASHGGGWTTA 758
Query: 593 KRLETGYEAVPTRDIHMKQVGLAGVW-AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVR 651
+ AVPT D+ +++V W + LR + P G +R +F+V+
Sbjct: 759 RHF-----AVPTTDVPVREVPALLKWFNDALRSSIFPALG-ALYGLDPARLRVIDAFLVK 812
Query: 652 YRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF--IRYNCNVTATRMGWMLMHPG 709
Y Q SL H D S +I + LN DY+GGG F +R N A G ++ PG
Sbjct: 813 YSAAAQRSLPLHSDQSQISITLPLNS-SADYDGGGTYFHDLRQAVNRDA---GGLVAFPG 868
Query: 710 RLTHYHEGLQVTQGTRYIMISFV 732
L H G +T+GTR+++++F+
Sbjct: 869 FLPHA--GHAITRGTRFVVVAFL 889
>gi|73986206|ref|XP_541950.2| PREDICTED: procollagen galactosyltransferase 1 [Canis lupus
familiaris]
Length = 623
Score = 69.3 bits (168), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 53/248 (21%), Positives = 117/248 (47%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 49 PLQAPRVLIALVARNAAHALPATLGALERLRHPRERTALWVATDHNSDNTSAVLREWLVA 108
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K+++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 109 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NP+ L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 169 DNLILNPNTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 226
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 227 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 284
Query: 503 STQEYGHL 510
+ + YG L
Sbjct: 285 NKEVYGFL 292
>gi|260797405|ref|XP_002593693.1| hypothetical protein BRAFLDRAFT_107673 [Branchiostoma floridae]
gi|229278921|gb|EEN49704.1| hypothetical protein BRAFLDRAFT_107673 [Branchiostoma floridae]
Length = 384
Score = 68.9 bits (167), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 94/194 (48%), Gaps = 22/194 (11%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEY----- 333
Q+P++ +++ L L + +YP ++++++ ++ QE+
Sbjct: 2 QWPTIFVAILARNKAHSLPYTLGYLERQDYPKSRLALWIQSDHNIDNTSAVIQEWLDGVG 61
Query: 334 HAPLFDDYIH-NFKTMFKNVKYIAHNSTVNSKEA---RNLAVENSLHKGVDFYFYVDSDS 389
H D+ H + F + + H S + R A+E + + DF F +D+D+
Sbjct: 62 HLYHHVDFYHKDAPNYFPDEEGANHWSGTRLRHVIKLRQQALEYARKRWADFMFCMDADN 121
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
+ NP LK L+ +N +IAP+L A+SNFW + G+Y R+ +YM I ++ +
Sbjct: 122 LVTNPRTLKLLIAQNRPIIAPML-ESSTAYSNFWCGMTEKGYYMRTDEYMPTI--ERKRR 178
Query: 450 GIWNVPYITNCYLM 463
G++ VP + + YL+
Sbjct: 179 GVFPVPMVHSTYLV 192
>gi|149944687|ref|NP_001092425.1| procollagen galactosyltransferase 1 precursor [Bos taurus]
gi|160395520|sp|A5PK45.1|GT251_BOVIN RecName: Full=Procollagen galactosyltransferase 1; AltName:
Full=Glycosyltransferase 25 family member 1; AltName:
Full=Hydroxylysine galactosyltransferase 1; Flags:
Precursor
gi|148744100|gb|AAI42351.1| GLT25D1 protein [Bos taurus]
gi|296486064|tpg|DAA28177.1| TPA: glycosyltransferase 25 domain containing 1 precursor [Bos
taurus]
Length = 623
Score = 68.9 bits (167), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 53/248 (21%), Positives = 116/248 (46%), Gaps = 28/248 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 49 PLQAPRVLIALLARNAAHALPATLGALERLRHPRERTALWVATDHNADNTSAVLREWLVA 108
Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
K ++ +V K+ + + + + R A++++ D+ +VD+
Sbjct: 109 VKGLYHSVEWRPSEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 168
Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I ++
Sbjct: 169 DNLILNPDTLTLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRER- 226
Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 227 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 284
Query: 503 STQEYGHL 510
+ + YG L
Sbjct: 285 NKEVYGFL 292
>gi|159793543|gb|ABW99101.1| procollagen-lysine 5-dioxygenase [Drosophila melanogaster]
Length = 44
Score = 68.9 bits (167), Expect = 9e-09, Method: Composition-based stats.
Identities = 27/41 (65%), Positives = 33/41 (80%)
Query: 556 QPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLE 596
QPCPDV+WF IV++ FC + V IMEA+ WSDG+NND RLE
Sbjct: 4 QPCPDVYWFQIVSDAFCDDLVAIMEAHNGWSDGSNNDNRLE 44
>gi|395750709|ref|XP_002828943.2| PREDICTED: procollagen galactosyltransferase 1, partial [Pongo
abelii]
Length = 462
Score = 68.9 bits (167), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 56/249 (22%), Positives = 115/249 (46%), Gaps = 29/249 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
P Q P VLI++ L L + L +P ++ +++V +N + + + +++
Sbjct: 48 PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107
Query: 345 FKTMFKNVKY-IAHNSTVNSKEA-------RNLAVENSLHKGVDFYF----------YVD 386
K+++ +V++ A ++ + L +SL F +VD
Sbjct: 108 VKSLYHSVEWRPAEEPSLGPSTGFAVYLLPKALGSMDSLPPPSSFLAHADAVWGVLQFVD 167
Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
+D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+
Sbjct: 168 ADNLILNPDTLSLLIAENKTVVAPMLDS-RAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR 226
Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKI 501
+G + VP + + +L+ + N+ YT S D + F + + + + +
Sbjct: 227 --RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYV 283
Query: 502 DSTQEYGHL 510
+ +EYG L
Sbjct: 284 CNKEEYGFL 292
>gi|432848534|ref|XP_004066393.1| PREDICTED: procollagen galactosyltransferase 1-like [Oryzias
latipes]
Length = 414
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 104/229 (45%), Gaps = 27/229 (11%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFK 346
P P V++++ L L I LNYP ++++ + ++ P
Sbjct: 34 PLLAPRVVVALICRNAEHCLPLVLGAIERLNYPKDRVALCRFTDEV--GP---------- 81
Query: 347 TMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNES 406
+ N++Y + + R A+ + D+ D D+ L NPDVL L++ N++
Sbjct: 82 KHWNNLRY------EHVMKLRQAALNTAREIWADYILMTDCDNLLTNPDVLWKLMSENKT 135
Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS 466
++AP+L A+SNFW + + G+Y R+ YM I Q +G + VP + + YL+
Sbjct: 136 IVAPML-ESRAAYSNFWCGMTSQGYYKRTPAYMPIRR--QERRGCFAVPMVHSTYLVDLR 192
Query: 467 VIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
+ N+ Y + ++D + F + R + + + + + YG+L
Sbjct: 193 KEASRNL-AFYPPHEEYNWALDDVIVFAYSARMADVQMYVCNKETYGYL 240
>gi|414075459|ref|YP_006994777.1| procollagen-lysine 5-dioxygenase [Anabaena sp. 90]
gi|413968875|gb|AFW92964.1| procollagen-lysine 5-dioxygenase [Anabaena sp. 90]
Length = 239
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 97/211 (45%), Gaps = 31/211 (14%)
Query: 50 FIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
F++SA+ + VK LG W + ++ L+ EL D+ DD I+LVTD++D
Sbjct: 24 FLRSAKKQNIDVKVLGEGLEWSANSL-------RLPLILKELK--DVKDDTIVLVTDAFD 74
Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP------AVGSGYRYLNSGGF 163
V+ N I E+F I+F AE+ W + Y++Y V Y+YLN+G F
Sbjct: 75 VLYVQNANSIYEKFIQGGYKILFAAEK--WY-SHQYEEYKDFYDSIKVPYDYKYLNAGTF 131
Query: 164 IGYAKDIKELISN-----RSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYG 218
+GY K + E+I N +N D +LY F + LD ++F G
Sbjct: 132 MGYKKYVCEMIDNILSYPNFHENGSDQRLYGKYCF-----ENPETVTLDYCCDIFWCTAG 186
Query: 219 SLEDIKLNFDL-DEFVHLTNTKYNTNPVIIH 248
E + +D+ + FV N T P IIH
Sbjct: 187 EWEILPELYDIHNGFV--LNKLTGTYPAIIH 215
>gi|345482468|ref|XP_001608141.2| PREDICTED: glycosyltransferase 25 family member-like [Nasonia
vitripennis]
Length = 567
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 111/250 (44%), Gaps = 37/250 (14%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFK 350
P++L+ + L L+ + L+YP +I++++Y++ D+ I K
Sbjct: 27 PNILVGILARNKAHTLPYTLSYLEKLDYPKDRIALWIYSDNN-----VDNTIEVLKKWLT 81
Query: 351 NVK--YIAHNSTVNSK----------------------EARNLAVENSLHKGVDFYFYVD 386
K Y N+T++ + + R + + DF F +D
Sbjct: 82 VQKDNYFMVNATLDEESHGHDDEKGIADWSSKRFEHIIKLREEVLNYARRIWADFIFMLD 141
Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
+D L NP L L+ +NE+++APLL + +SNFW ++ D +Y R+ DY +I+N
Sbjct: 142 ADVFLTNPKTLDSLIRKNETVVAPLL-KSDGMYSNFWAGMSDDFYYKRTDDYESILNNKV 200
Query: 447 GGKGIWNVPYITNCYLM----KTSVIKATNIKTIYTLNS-MDYDMAFCTNLRNKGIHLKI 501
G + VP + + L+ K S N K I N +D + F + + I L +
Sbjct: 201 S--GCFPVPMVHSAVLIDLRRKNSDYLTYNFKNINNYNGPIDDIITFALSAKYSDISLNV 258
Query: 502 DSTQEYGHLV 511
+ Q+YG ++
Sbjct: 259 CNDQKYGFIM 268
>gi|328789321|ref|XP_397154.3| PREDICTED: glycosyltransferase 25 family member-like [Apis
mellifera]
Length = 567
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/245 (26%), Positives = 109/245 (44%), Gaps = 27/245 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P+VLI + + L FL + L YP K+I +++ NN + + +++N
Sbjct: 28 PTVLIIILVRNKAHTLPYFLTFLERLTYPKKRIHLWICSDNNIDNSIEILSAWLNNESNK 87
Query: 349 FKNVK---------YIAHNSTVNSKEARNLAV----ENSLHKG----VDFYFYVDSDSHL 391
+ V+ + N R L V E +L G DF + +D+D L
Sbjct: 88 YHGVQINFDEKSKGFDDEKGITNWSAQRFLHVINLREEALKAGRNMWADFIWMLDADVFL 147
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
NP+ L L+ +N+ +IAPLL + +SNFW + D +Y R+ +Y I+ ++ KG
Sbjct: 148 TNPNTLDELILKNQIVIAPLL-KSDGLYSNFWAGMTNDYYYLRTKEYEPILFREK--KGC 204
Query: 452 WNVPYITNCYLM----KTSVIKATNIKTIYTLNSMDYD-MAFCTNLRNKGIHLKIDSTQE 506
+NVP I + L+ + S N +Y N D + F G+ L I +
Sbjct: 205 FNVPMIHSAVLIDLRKQISDFLTYNPNKLYQYNGPTDDIITFAVGANKTGVPLFICNDNT 264
Query: 507 YGHLV 511
YG ++
Sbjct: 265 YGFIM 269
>gi|301630121|ref|XP_002944176.1| PREDICTED: glycosyltransferase 25 family member 3-like [Xenopus
(Silurana) tropicalis]
Length = 590
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/244 (22%), Positives = 110/244 (45%), Gaps = 28/244 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPL--FDDYIHNFKTM 348
PS++I++ L L + L+YP +IS++ + A L D++ + +
Sbjct: 31 PSLVIALIARNAAHALPYSLGALERLDYPRDRISLWCATDHNEDATLDVLQDWLEAIRPL 90
Query: 349 FKNVKYIA------HNSTVNSKE-----------ARNLAVENSLHKGVDFYFYVDSDSHL 391
+ ++++ A + K+ R A+ + K D+ YVD+D+ L
Sbjct: 91 YHSLEWKAEVAPRWYPQETGPKDWPKERYEYVMKLRQEALSYAREKKADYIMYVDADNVL 150
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
N ++ L+ N++L+AP+L +SNFW +N GFY R+ DY N + G
Sbjct: 151 TNVHTVRLLMTENKTLVAPMLDSQ-TGFSNFWCGINPQGFYRRTPDYYPTRNRQR--TGC 207
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQE 506
++VP + + +L+ ++ + Y L+ + D + F + G+ + +T
Sbjct: 208 FSVPMVHSTFLIDLQKEESHGL-AFYPLHPNYTWTFDDIIVFAYSCLAAGVQGYVCNTHR 266
Query: 507 YGHL 510
YG++
Sbjct: 267 YGYV 270
>gi|312113105|ref|YP_004010701.1| family 2 glycosyl transferase [Rhodomicrobium vannielii ATCC 17100]
gi|311218234|gb|ADP69602.1| glycosyl transferase family 2 [Rhodomicrobium vannielii ATCC 17100]
Length = 676
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/249 (23%), Positives = 114/249 (45%), Gaps = 30/249 (12%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P VL+++ + FL L I +L+YP I +++ NN + + ++ +
Sbjct: 399 PRVLVAILAKQKEEFLPLHLECIESLDYPKSSIVLYIRTNNNTDGTERILREWAKRVGHL 458
Query: 349 FKNVKYIAHNSTVNSKE----------------ARNLAVENSLHKGVDFYFYVDSDSHLD 392
+ +V++ A V ++ RN+++ +L DFYF D D+ +
Sbjct: 459 YADVEFDAEEVEVPVEQFSVHEWNETRFDVLGHIRNVSLSRALAHRCDFYFVADVDNFI- 517
Query: 393 NPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
P L+ LV + ++AP L + P +SN+ ++A G++ Y I+N + +G
Sbjct: 518 RPCTLRELVALDLPIVAPFLRSLSPDDPYSNYHAEIDASGYFEDCDQYSWILN--RWIRG 575
Query: 451 IWNVPYITNC-YLMKTSVIKATNIKTIYTLNSMDYD-MAFCTNLRNKGIHLKIDSTQEYG 508
+ VP +T+C YL++ V+ + Y + ++ + F + R GI D+ Q YG
Sbjct: 576 VIEVP-VTHCTYLIRADVLG----ELAYRDGTARHEYVIFSESARRHGIPQYFDNRQVYG 630
Query: 509 HLVDSENFD 517
++ + D
Sbjct: 631 YIAFGDGHD 639
>gi|256081803|ref|XP_002577157.1| cerebral cell adhesion molecule related [Schistosoma mansoni]
gi|350645736|emb|CCD59498.1| cerebral cell adhesion molecule related [Schistosoma mansoni]
Length = 680
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 67/287 (23%), Positives = 127/287 (44%), Gaps = 67/287 (23%)
Query: 275 RCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY------ 328
+ +L+K+L+ L P++ I V + L FLN I N YP K+I++ Y
Sbjct: 105 KISLLKNLNRL----MPTLCIGVLVRNKAHTLPYFLNGIENQQYPTKRITLIFYVDNTID 160
Query: 329 ------------NNQEYHAPLFDDYIHNFKTMFKNVK------YIAHNSTVNSK---EAR 367
N +YH + + ++ K+ ++++ + H ++ K EAR
Sbjct: 161 SSEIILNEWIQCNKDKYHRIILE--VNTTKSEYEHLSKMWTLDHYLHVISLRQKLLDEAR 218
Query: 368 NLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVN-------------------RNESLI 408
N+ DFY +D+D L NP +++L+N N ++
Sbjct: 219 NI--------WADFYLSIDADVILMNPLTIEHLINVMLDSTISTSKSNLNHKIDENIIIL 270
Query: 409 APLL-VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
APL+ + +SNFWGA++ +G+Y RS Y +I + +G++ V + + +L+
Sbjct: 271 APLMNCTSSEHYSNFWGAMSEEGYYLRSEHYFDI--QKRRIQGVYPVAMVHSIFLVNLQF 328
Query: 468 IKATNI----KTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
++ I I +D + F +++ I +D+TQ YG++
Sbjct: 329 YQSEQIGYSPAPINYTGPVDDIIIFSRSVQRAEIDFYLDNTQFYGYI 375
>gi|126322946|ref|XP_001368839.1| PREDICTED: procollagen galactosyltransferase 1-like [Monodelphis
domestica]
Length = 623
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/194 (22%), Positives = 96/194 (49%), Gaps = 22/194 (11%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
Q P VLI++ L L + L +P + +++V +N + + + +++ K
Sbjct: 51 QAPRVLIALIARNAAHALPSTLGALERLRHPRDRTALWVATDHNVDNTSAVLREWLVGVK 110
Query: 347 TMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
+++ V K+ +++ + + R A++++ D+ ++D+D+
Sbjct: 111 SLYHYVEWRPMEEPRSYPDEEGPKHWSNSRYEHVMKLRQAALKSARDMWADYILFLDADN 170
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+ +
Sbjct: 171 LLINPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRRRDR--R 227
Query: 450 GIWNVPYITNCYLM 463
G + VP + + +L+
Sbjct: 228 GCFAVPMVHSTFLI 241
>gi|403266623|ref|XP_003925468.1| PREDICTED: procollagen galactosyltransferase 2 [Saimiri boliviensis
boliviensis]
Length = 933
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 75/149 (50%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A+ + K D+ ++D D+ L NP L L+ N++++AP+L +SNFW +
Sbjct: 455 RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLLMAENKTIVAPML-ESRGLYSNFWCGI 513
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN-----S 481
GFY R+ DY+ I + G + VP + + +L+ +A+N T Y + +
Sbjct: 514 TPKGFYKRTPDYVQIREWKR--TGCFPVPMVHSTFLIDLRK-EASNKLTFYPPHQDYTWT 570
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + R GI + + + + YG+L
Sbjct: 571 FDDIIVFAFSSRQAGIQMYLCNREHYGYL 599
>gi|395512663|ref|XP_003760555.1| PREDICTED: procollagen galactosyltransferase 1 [Sarcophilus
harrisii]
Length = 611
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/194 (22%), Positives = 96/194 (49%), Gaps = 22/194 (11%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
Q P VLI++ L L + L +P + +++V +N + + + +++ K
Sbjct: 39 QAPRVLIALIARNAAHALPSTLGALERLRHPRDRTALWVATDHNVDNTSAVLREWLVGVK 98
Query: 347 TMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
+++ V K+ +++ + + R A++++ D+ ++D+D+
Sbjct: 99 SLYHYVEWRPMEEPRSYPDEDGPKHWSNSRYEHVMKLRQAALKSARDMWADYILFLDADN 158
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I D+ +
Sbjct: 159 LLINPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRRRDR--R 215
Query: 450 GIWNVPYITNCYLM 463
G + VP + + +L+
Sbjct: 216 GCFAVPMVHSTFLI 229
>gi|292618105|ref|XP_684212.3| PREDICTED: glycosyltransferase 25 family member 3 isoform 1 [Danio
rerio]
Length = 591
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 55/247 (22%), Positives = 110/247 (44%), Gaps = 30/247 (12%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
Q P+V+I++ L +L + LNYP ++IS++ +N + + +++ +
Sbjct: 31 QPPTVVIAIIARNAAHSLPHYLGALERLNYPKERISVWAATDHNIDNTTAMLREWLTVMQ 90
Query: 347 TMFKNVKY------------IAHNSTVNSK-----EARNLAVENSLHKGVDFYFYVDSDS 389
T + V++ + NS+ + + A+ + + D+ Y D+D+
Sbjct: 91 TQYHYVEWRPSDKPTSYAGELGPKHWTNSRYEYIMKLKQAALNFAKKRWADYILYSDTDN 150
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NPD L L+ N+S+IAP+L A+SN+W + G+Y R+ +Y +
Sbjct: 151 ILTNPDTLHLLMAENKSVIAPMLDSQ-SAYSNYWCGITPQGYYRRTAEYFP--TKQRQRL 207
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDS 503
G + VP + + L+ + K K + DY + F + R + + + +
Sbjct: 208 GCYPVPMVHSTVLL--DLRKQGTRKVSFHPPHKDYSWPFDDIIVFAFSCRVSEVQMYLCN 265
Query: 504 TQEYGHL 510
+ YG+L
Sbjct: 266 KERYGYL 272
>gi|380798427|gb|AFE71089.1| procollagen galactosyltransferase 2 precursor, partial [Macaca
mulatta]
Length = 551
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 73/150 (48%), Gaps = 11/150 (7%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A+ + K D+ ++D D+ L NP L L+ N++++AP+L +SNFW +
Sbjct: 73 RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGI 131
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
GFY R+ DY+ I + G + VP + + +L+ + K + K + DY
Sbjct: 132 TPKGFYKRTPDYVQIREWKRS--GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTW 187
Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
+ F + R GI + + + + YG+L
Sbjct: 188 TFDDIIVFAFSSRQAGIQMYLCNREHYGYL 217
>gi|441628755|ref|XP_003275861.2| PREDICTED: procollagen galactosyltransferase 1 [Nomascus
leucogenys]
Length = 703
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 78/149 (52%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VD+D+ + NPD L L+ N++++AP+L A+SNFW +
Sbjct: 228 RQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKTVVAPMLDS-RAAYSNFWCGM 286
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
+ G+Y R+ Y+ I D+ +G + VP + + +L+ + N+ YT S
Sbjct: 287 TSQGYYKRTPAYIPIRKRDR--RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 343
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + + + + + + +EYG L
Sbjct: 344 FDDIIVFAFSCKQAEVQMYVCNKEEYGFL 372
>gi|355558949|gb|EHH15729.1| hypothetical protein EGK_01859, partial [Macaca mulatta]
gi|355759604|gb|EHH61640.1| hypothetical protein EGM_19672, partial [Macaca fascicularis]
Length = 533
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 73/150 (48%), Gaps = 11/150 (7%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A+ + K D+ ++D D+ L NP L L+ N++++AP+L +SNFW +
Sbjct: 60 RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGI 118
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
GFY R+ DY+ I + G + VP + + +L+ + K + K + DY
Sbjct: 119 TPKGFYKRTPDYVQIREWKRS--GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTW 174
Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
+ F + R GI + + + + YG+L
Sbjct: 175 TFDDIIVFAFSSRQAGIQMYLCNREHYGYL 204
>gi|157136453|ref|XP_001656834.1| hypothetical protein AaeL_AAEL003481 [Aedes aegypti]
gi|122095142|sp|Q17FB8.1|GLT25_AEDAE RecName: Full=Glycosyltransferase 25 family member; Flags:
Precursor
gi|108881003|gb|EAT45228.1| AAEL003481-PA [Aedes aegypti]
Length = 607
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 67/271 (24%), Positives = 122/271 (45%), Gaps = 36/271 (13%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFK 346
Q P VLI I L F + + + YP +IS++ + N++ + ++
Sbjct: 27 QSPKVLIVSLIRNKEHTLPYFFSYLEDQEYPKDRISLWFRSDHNEDRSIDIIKAWLKRVT 86
Query: 347 TMFKNV---------KYIAHNSTVNSKEARNLAV----ENSLHKG----VDFYFYVDSDS 389
+ +V K S+ + E R V + +L KG DF ++D+D
Sbjct: 87 KKYHSVDFGYRSDAAKRYDEKSSTHWSEDRFADVIRLKQEALDKGRKMWADFVLFLDADV 146
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NP+ + LV+ N ++AP+L+ +SNFW + AD +Y R+ +Y I+N ++ G+
Sbjct: 147 LLTNPNTIAKLVSLNLPIVAPMLLSD-GLYSNFWCGMTADYYYHRTDEYKEILNYEKTGE 205
Query: 450 GIWNVPYITNCYLMKTSVIKATNIK-------TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
+ VP + + ++ +V ++ N+ + +D + F + I + I
Sbjct: 206 --FPVPMVHSAVMVNINVQQSLNLSFDKRRLPPGHYTGPVDDIIIFAMSANYSSIPMYIS 263
Query: 503 STQEYGH-LVDSENFDP------QKTNPEVY 526
++ YG+ LV E DP Q TN +VY
Sbjct: 264 NSASYGYILVPLEQGDPLEKDLEQLTNTKVY 294
>gi|119611577|gb|EAW91171.1| glycosyltransferase 25 domain containing 2, isoform CRA_b [Homo
sapiens]
gi|193787801|dbj|BAG53004.1| unnamed protein product [Homo sapiens]
Length = 506
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 73/150 (48%), Gaps = 11/150 (7%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A+ + K D+ ++D D+ L NP L L+ N++++AP+L +SNFW +
Sbjct: 28 RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGI 86
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
GFY R+ DY+ I + G + VP + + +L+ + K + K + DY
Sbjct: 87 TPKGFYKRTPDYVQIREWKRT--GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTW 142
Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
+ F + R GI + + + + YG+L
Sbjct: 143 TFDDIIVFAFSSRQAGIQMYLCNREHYGYL 172
>gi|397489276|ref|XP_003815656.1| PREDICTED: procollagen galactosyltransferase 2 [Pan paniscus]
Length = 506
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 73/150 (48%), Gaps = 11/150 (7%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A+ + K D+ ++D D+ L NP L L+ N++++AP+L +SNFW +
Sbjct: 28 RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGI 86
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
GFY R+ DY+ I + G + VP + + +L+ + K + K + DY
Sbjct: 87 TPKGFYKRTPDYVQIREWKRT--GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTW 142
Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
+ F + R GI + + + + YG+L
Sbjct: 143 TFDDIIVFAFSSRQAGIQMYLCNREHYGYL 172
>gi|296233252|ref|XP_002761953.1| PREDICTED: procollagen galactosyltransferase 1 [Callithrix jacchus]
Length = 738
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 78/149 (52%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VD+D+ + NPD L L+ N++++AP+L A+SNFW +
Sbjct: 263 RQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKTVVAPMLDS-RAAYSNFWCGM 321
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
+ G+Y R+ Y+ I D+ +G + VP + + +L+ + N+ YT S
Sbjct: 322 TSQGYYRRTPAYIPIRKRDR--QGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 378
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + + + + + + +EYG L
Sbjct: 379 FDDIIVFAFSCKQAEVQMYVCNKEEYGFL 407
>gi|410917374|ref|XP_003972161.1| PREDICTED: procollagen galactosyltransferase 1-like [Takifugu
rubripes]
Length = 609
Score = 65.9 bits (159), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 56/244 (22%), Positives = 113/244 (46%), Gaps = 28/244 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P V+I++ L FL I LNYP +I+++V +N++ + ++ +
Sbjct: 38 PRVVIALICRNSAHSLPLFLGTIERLNYPKDRIALWVATDHNKDNTTSILRSWLIGVQND 97
Query: 349 FKNVKYIAHN-STVNSKEA----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
+ V++ + S+ + E R A++ + D+ VD D+ L
Sbjct: 98 YHYVEWRPDDESSAFADETGPKHWNNLRYEHVMKLRQAALDTAREIWADYILVVDCDNLL 157
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
N DVL L++ N++++AP+L A+SNFW + + G+Y R+ Y+ I ++ +G
Sbjct: 158 TNQDVLWKLMSENKTIVAPML-ESRAAYSNFWCGMTSQGYYKRTPAYIPIRKRER--RGC 214
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNS-----MDYDMAFCTNLRNKGIHLKIDSTQE 506
+ VP + + YL+ +A+ Y +S +D + F + R + + + + +
Sbjct: 215 FAVPMVHSTYLVDLRK-EASRQLAFYPPHSEYSWALDDVIVFAYSARMADVQMYVCNKEI 273
Query: 507 YGHL 510
YG+
Sbjct: 274 YGYF 277
>gi|255080018|ref|XP_002503589.1| predicted protein [Micromonas sp. RCC299]
gi|226518856|gb|ACO64847.1| predicted protein [Micromonas sp. RCC299]
Length = 898
Score = 65.9 bits (159), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 54/171 (31%), Positives = 79/171 (46%), Gaps = 11/171 (6%)
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVG-LAGVWAEFLR 623
P++TE C E+V++ E G+ G + + AVPT DI + + L +W +R
Sbjct: 733 PLMTEAECAEWVRLAEKAGEARGGWTTSR-----HYAVPTTDIPVHAIPDLLPLWNALMR 787
Query: 624 KYVVPLQEREFIGYHHEP--VRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVD 681
+ L +P VR +FVVRY Q L H D S ++ +ALN G +
Sbjct: 788 DKLASLLSAACPEEMPKPSSVRVHDAFVVRYEAGAQHHLPMHADQSAVSVTLALNDEG-E 846
Query: 682 YEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
YEGGG F G ++ G L H G VT+G RYI+ +F+
Sbjct: 847 YEGGGTTFAVPVGKTVRPGRGHVVAFKGGLQHG--GSPVTRGVRYIVAAFL 895
>gi|292621863|ref|XP_002664798.1| PREDICTED: procollagen galactosyltransferase 1-like, partial [Danio
rerio]
Length = 535
Score = 65.9 bits (159), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 96/213 (45%), Gaps = 32/213 (15%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A+E + D++ VD D+ L N DVL L+ N++++AP+L A+SNFW +
Sbjct: 60 RQAALETAREMWADYFMLVDCDNLLTNRDVLWKLMRENKTIVAPML-ESRAAYSNFWCGM 118
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDM 486
+ G+Y R+ YM I Q KG + VP + + LM + K + + + DY
Sbjct: 119 TSQGYYKRTPAYMPIRR--QERKGCFAVPMVHSTLLM--DLRKEASRQLAFFPPHPDYTW 174
Query: 487 A------FCTNLRNKGIHLKIDSTQEYGHLV-----------DSENFDPQKTNPEVYELI 529
A F + R + + I + + YG+ ++E+F + ++ ++
Sbjct: 175 AFDDIIIFAFSARMAEVQMYICNRETYGYFPVPLRSQNSLQDEAESF----LHSQLEVMV 230
Query: 530 RNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
RNP I P SL+P + +VF
Sbjct: 231 RNPP------IEPSVYLSLMPKQTDKMGFDEVF 257
>gi|46446818|ref|YP_008183.1| hypothetical protein pc1184 [Candidatus Protochlamydia amoebophila
UWE25]
gi|46400459|emb|CAF23908.1| hypothetical protein pc1184 [Candidatus Protochlamydia amoebophila
UWE25]
Length = 547
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 113/249 (45%), Gaps = 29/249 (11%)
Query: 293 VLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV--YNNQEYHAPLFDDYIHNFKTMFK 350
V + ID + FL I L Y K+ + + N ++ + ++ + ++
Sbjct: 40 VWVGAIIDNHDQLIPPFLLTIEKLYYDKAKMHLQIDCCNQNKHVRKIVMQWVEKNRKFYQ 99
Query: 351 NVKYIAHNSTVNSKE-----------ARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKY 399
++ ++ H S+++ K+ +N + N + ++ + SD L P LKY
Sbjct: 100 SLVFVDHTSSIDEKKHFIEKNKVLANIKNGYLANCQQQSCNYCLILSSDM-LIAPHTLKY 158
Query: 400 LVNRNESLIAPLLVRPFKA----WSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVP 455
L+ +++ +I+PLL RPF + NF+ + +G+Y DY+ I N + G + VP
Sbjct: 159 LIEKDKPIISPLL-RPFPQPHDPYRNFFCDVTEEGYYKHHEDYLAIANRQK--LGTFQVP 215
Query: 456 YITNCYLMKTSVIKATNIKTIYTLNSMDYD-MAFCTNLRNKGIHLKIDSTQEYG---HLV 511
+ YL++ + + +T +Y+ +AF T R K + I + +E+G HL
Sbjct: 216 CVHGVYLIQAPFLSQLS----FTEGFKNYEFLAFSTYARKKNMGQFICNEREFGFLMHLS 271
Query: 512 DSENFDPQK 520
D D QK
Sbjct: 272 DDATLDQQK 280
>gi|344241371|gb|EGV97474.1| Glycosyltransferase 25 family member 1 [Cricetulus griseus]
Length = 948
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 77/149 (51%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VDSD+ + NPD L L+ N++++AP+L A+SNFW +
Sbjct: 486 RQAALKSARDMWADYIMFVDSDNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 544
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
+ G+Y R+ Y+ I D+ +G + VP + + +L+ + N+ YT S
Sbjct: 545 TSQGYYKRTPAYIPIRRRDR--RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 601
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + + + + + + + YG L
Sbjct: 602 FDDIIVFAFSCKQAEVQMYVCNKEVYGFL 630
>gi|321463619|gb|EFX74634.1| hypothetical protein DAPPUDRAFT_199801 [Daphnia pulex]
Length = 623
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/205 (23%), Positives = 96/205 (46%), Gaps = 28/205 (13%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFKTM 348
P+VL+++ + L FL L+YP ++++++ + NQ+ + + ++ + +
Sbjct: 30 PTVLVTLLVRNKAHTLPYFLKLFEELDYPKNRLTLWIKSDQNQDQSLEIMNKWVSSVE-- 87
Query: 349 FKNVKYIAHNSTVNSKEARNLAV----------------ENSLHKG----VDFYFYVDSD 388
K+ +I H T S A + + E +L KG DF ++VD D
Sbjct: 88 -KSYHHIYHELTTTSPSAVDDKIPTNWTEERFKHIINLREEALDKGRELWADFVWFVDCD 146
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
L N LK +VN N ++AP+L +SN+W + D +Y R+ +Y I ++
Sbjct: 147 VFLTNNQTLKIMVNTNYPVVAPML-DTLSLYSNYWCGMGLDYYYRRTDEYKPI--REREN 203
Query: 449 KGIWNVPYITNCYLMKTSVIKATNI 473
KG V + +C+++ +++ +
Sbjct: 204 KGCHRVIVVHSCFMVDLRQVESQRL 228
>gi|432952470|ref|XP_004085089.1| PREDICTED: procollagen galactosyltransferase 2-like [Oryzias
latipes]
Length = 592
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/245 (22%), Positives = 111/245 (45%), Gaps = 26/245 (10%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
Q P+V++++ L +L + LNYP +IS++ +N + + +++ +
Sbjct: 32 QPPTVVVAIIARNAAHALPYYLGALERLNYPKDRISVWAATDHNVDNTTAILREWLTVMQ 91
Query: 347 TMFKNVKYIAHNSTV------------NSK-----EARNLAVENSLHKGVDFYFYVDSDS 389
+ V++ + NS+ + + A+ + + D+ Y D+D+
Sbjct: 92 KYYHYVEWRPMDQPTSYAGELGPKHWPNSRYEYVMKLKQAALNFARKRWADYILYADTDN 151
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NPD L+ ++ N+S+IAP+L A+SNFW + G+Y R+ +Y + +
Sbjct: 152 ILTNPDTLQLMIAENKSVIAPMLDSQ-GAYSNFWCGITPQGYYRRTAEYFPTRHRHR--L 208
Query: 450 GIWNVPYITNCYL--MKTSVIKATNIKTIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQ 505
G + VP + + L ++ +K + S YD + F + R I + + + +
Sbjct: 209 GCFPVPMVHSTVLLNLRKEGMKKLAFYPPHKDYSWPYDDIIVFAFSCRAAEIQMYLCNKE 268
Query: 506 EYGHL 510
YG+L
Sbjct: 269 RYGYL 273
>gi|241835874|ref|XP_002415078.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase, putative [Ixodes
scapularis]
gi|215509290|gb|EEC18743.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase, putative [Ixodes
scapularis]
Length = 322
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 88/197 (44%), Gaps = 26/197 (13%)
Query: 276 CNLIKHLDSLKPD----QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN- 330
C+L+ + D + P+V I+V L F + NYP +IS+++Y +
Sbjct: 14 CSLLLATRAWASDDEKLELPTVFIAVIARNKAHVLPHFFGYLEQQNYPKSRISLWIYTDH 73
Query: 331 ---------QEYHAPLFDDYIHNFK-TMFKNVKYIAHNSTVNSKEA---------RNLAV 371
+ + DDY HN T ++ + A + V A R A+
Sbjct: 74 NSDDTEDILEAWAEAKSDDY-HNVNLTREESDAFYADENGVQKWTAERYWHVIRLREEAL 132
Query: 372 ENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGF 431
+ DF ++D D+ L +P + LV N++++AP+L A+SNFW + G+
Sbjct: 133 NLARSLWADFILFLDCDALLTSPKTILDLVRANKTVVAPML-DSRSAYSNFWCGMTEKGY 191
Query: 432 YARSFDYMNIINGDQGG 448
Y R+ DYM I+ ++ G
Sbjct: 192 YLRTDDYMPILERERVG 208
>gi|338724828|ref|XP_001489806.3| PREDICTED: procollagen galactosyltransferase 2 [Equus caballus]
Length = 572
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 77/161 (47%), Gaps = 13/161 (8%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A+ + K D+ ++D D+ L NP L ++ N++++AP+L +SNFW +
Sbjct: 94 RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGI 152
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
GFY R+ DY I + G + VP + + +L+ + K + K ++ DY
Sbjct: 153 TPQGFYKRTPDYPQIREWKR--MGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTW 208
Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKT 521
+ F + R GI + + + + YG+L PQ+T
Sbjct: 209 TFDDIIVFAFSSRQAGIQMYLCNREHYGYL--PIPLKPQQT 247
>gi|224005863|ref|XP_002291892.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220972411|gb|EED90743.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 562
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 72/142 (50%), Gaps = 24/142 (16%)
Query: 603 PTRDIHMKQVGLAGV---W-AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDE-Q 657
PT D+++ +G W A+ L + P+ ER F G VRA FVVRY + Q
Sbjct: 265 PTTDLNLVTDPFSGEDREWLAQRLDARMAPIIERAF-GISRGAVRANDIFVVRYDAEAGQ 323
Query: 658 PSLRPHHDSSTYTINIALNQVGVDYEGGGCRF--------IRYNCNVTATRMGWMLMHPG 709
P+LR H DSS + NI LN +++GGG RF I + V T + ++
Sbjct: 324 PNLRVHTDSSHLSFNILLND---EFDGGGTRFHHRIDKSHIDIHPEVGETLLSHAMI--- 377
Query: 710 RLTHYHEGLQVTQGTRYIMISF 731
+HEGL T+GTRYI++ F
Sbjct: 378 ----FHEGLPTTKGTRYILVGF 395
>gi|354473914|ref|XP_003499177.1| PREDICTED: procollagen galactosyltransferase 1 [Cricetulus griseus]
Length = 571
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 77/149 (51%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VDSD+ + NPD L L+ N++++AP+L A+SNFW +
Sbjct: 109 RQAALKSARDMWADYIMFVDSDNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 167
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
+ G+Y R+ Y+ I D+ +G + VP + + +L+ + N+ YT S
Sbjct: 168 TSQGYYKRTPAYIPIRRRDR--RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 224
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + + + + + + + YG L
Sbjct: 225 FDDIIVFAFSCKQAEVQMYVCNKEVYGFL 253
>gi|348513873|ref|XP_003444465.1| PREDICTED: procollagen galactosyltransferase 1-like [Oreochromis
niloticus]
Length = 591
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 58/245 (23%), Positives = 108/245 (44%), Gaps = 26/245 (10%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQ---------EYHAPLFD 339
Q P+V+I++ L +L + LNYP +IS++ + + +
Sbjct: 31 QPPTVVIAIIARNTAHSLPYYLGALERLNYPKDRISVWAATDHNIDNTTAILKEWLTVMQ 90
Query: 340 DYIH--NFKTMFKNVKY---IAHNSTVNSK-----EARNLAVENSLHKGVDFYFYVDSDS 389
Y H ++ M K Y + NS+ + + A+ + + D+ Y D+D+
Sbjct: 91 KYYHYVEWRPMDKPTSYAGELGPKHWPNSRYEYVMKLKQAALNFARKRWADYILYADTDN 150
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NP+ L L+ N+S+IAP+L P A+SN+W + G+Y R+ +Y + +
Sbjct: 151 ILTNPESLNLLIAENKSVIAPMLDSP-GAYSNYWCGITPQGYYRRTAEYFPTRHRHR--V 207
Query: 450 GIWNVPYITNCYL--MKTSVIKATNIKTIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQ 505
G + VP + + L ++ +K + S YD + F + R I + + +
Sbjct: 208 GCFPVPMVHSTLLLDLRKEGMKKLAFYPPHEDYSWPYDDIIVFAFSCRAAEIQMYLCNKD 267
Query: 506 EYGHL 510
YG+L
Sbjct: 268 RYGYL 272
>gi|47216930|emb|CAG04872.1| unnamed protein product [Tetraodon nigroviridis]
Length = 615
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/232 (22%), Positives = 106/232 (45%), Gaps = 26/232 (11%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
Q P+V+I++ L +L + LNYP +IS++ +N + + +++ +
Sbjct: 31 QPPTVVIAILARNSAHSLPYYLGALERLNYPKDRISVWAATDHNLDNTTAVLREWLTVMQ 90
Query: 347 TMFKNVKY------------IAHNSTVNSK-----EARNLAVENSLHKGVDFYFYVDSDS 389
+ +V++ + NS+ + + A+ + + D+ Y D+D+
Sbjct: 91 QFYHHVEWRPLEQPTSYAGELGPKHWPNSRYEYLMKLKQAALNFARKRWADYILYADTDN 150
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L NPD L+ L+ N+S+IAP+L A+SNFW + G+Y R+ +Y + +
Sbjct: 151 ILTNPDTLQLLIAENKSVIAPML-HSQGAYSNFWCGITPQGYYRRTAEYFPTRHRHR--L 207
Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTI--YTLNSMDYD--MAFCTNLRNKGI 497
G + VP + + L+ N+ + S YD + F + R++G+
Sbjct: 208 GCFPVPMVHSTLLLDLRKEGMRNLAFFPPHADYSWPYDDIIVFAFSCRSEGV 259
>gi|410986010|ref|XP_003999305.1| PREDICTED: procollagen galactosyltransferase 2 [Felis catus]
Length = 506
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 74/150 (49%), Gaps = 11/150 (7%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A+ + + D+ ++D D+ L NP L ++ N++++AP+L +SNFW +
Sbjct: 28 RQAALRTAREQWSDYILFIDVDNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGI 86
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
GFY R+ DY+ I + G + VP + + +L+ + K + K ++ DY
Sbjct: 87 TPQGFYKRTPDYLQIREWKR--LGCFPVPMVHSTFLI--DLRKEASGKLMFYPPHQDYTW 142
Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
+ F + R GI + + + + YG+L
Sbjct: 143 TFDDIIVFAFSSRQAGIQMYLCNREHYGYL 172
>gi|47213906|emb|CAF95848.1| unnamed protein product [Tetraodon nigroviridis]
Length = 601
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 106/262 (40%), Gaps = 47/262 (17%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLF--------DDYI 342
P V+I+V L +L I L YP ++I++ N + L D +
Sbjct: 13 PKVMIAVLARNAAHSLPHYLGCIEKLEYPKERIAICGLTNSGTWSQLIHMLQMAAADHNV 72
Query: 343 HNFKTMFKN-VKYIAH--------------------------NSTVNSK-EARNLAVENS 374
N M + +K+ H S N + R A++ +
Sbjct: 73 DNTTAMLREWLKWAQHVYHYVEWRPMDEPRSYTDEWGPKHWPPSRFNHLLKLRQAALKAA 132
Query: 375 LHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYAR 434
+ D+ +VDSD+ L NP VL L+ N +L+AP+L +SNFW + G+Y R
Sbjct: 133 RERWADYILFVDSDNLLTNPRVLTLLMAENLTLLAPML-ESRSLYSNFWCGVTPQGYYKR 191
Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAF 488
+ DY I + G + VP + + +L+ + + ++ + DY M F
Sbjct: 192 TPDYQPIREWKR--LGCFPVPMVHSTFLL--DLRRESSRDLAFYPPHPDYSWAFDDIMVF 247
Query: 489 CTNLRNKGIHLKIDSTQEYGHL 510
+ R G+ + + + + YG L
Sbjct: 248 AFSARQAGVQMHVCNREHYGFL 269
>gi|345325485|ref|XP_001516115.2| PREDICTED: procollagen galactosyltransferase 2-like
[Ornithorhynchus anatinus]
Length = 625
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 74/149 (49%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A+ + K D+ ++D+D+ L NP L ++ N++++AP+L +SNFW +
Sbjct: 147 RQAALRTAREKWSDYVLFIDADNFLTNPQTLNLMIAENKTIVAPML-ESRSLYSNFWCGI 205
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNS 481
G+Y R+ DY+ I + G + VP + + +L+ + + + YT +
Sbjct: 206 TPQGYYKRTPDYVQI--REWKRIGCFAVPMVHSTFLIDLRKVASDKLSFFPPHQDYTW-T 262
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + R GI + + + + YG+L
Sbjct: 263 FDDIIVFAFSSRQAGIQMYLCNREHYGYL 291
>gi|403303560|ref|XP_003942394.1| PREDICTED: procollagen galactosyltransferase 1 [Saimiri boliviensis
boliviensis]
Length = 630
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 78/149 (52%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VD+D+ + NPD L L+ N++++AP+L A+SNFW +
Sbjct: 155 RQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 213
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
+ G+Y R+ Y+ I ++ +G + VP + + +L+ + N+ YT S
Sbjct: 214 TSQGYYKRTPAYIPIRKRER--QGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 270
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + + + + + + +EYG L
Sbjct: 271 FDDIIVFAFSCKQAEVQMYVCNKEEYGFL 299
>gi|351705537|gb|EHB08456.1| Glycosyltransferase 25 family member 2 [Heterocephalus glaber]
Length = 508
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 74/152 (48%), Gaps = 13/152 (8%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A+ + K D+ ++D D+ L NP L ++ N++++AP+L +SNFW +
Sbjct: 28 RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGI 86
Query: 427 --NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDY 484
A GFY R+ DY+ I + G + VP + + +L+ + K + K + DY
Sbjct: 87 TPQASGFYKRTPDYLQIREWKR--MGCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDY 142
Query: 485 D------MAFCTNLRNKGIHLKIDSTQEYGHL 510
+ F + R GI + + + Q YG+L
Sbjct: 143 TWTFDDIIVFAFSSRQAGIQMYLCNRQHYGYL 174
>gi|354481442|ref|XP_003502910.1| PREDICTED: procollagen galactosyltransferase 2 [Cricetulus griseus]
Length = 545
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 73/149 (48%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A+ + K D+ ++D D+ L NP L ++ N +++AP+L +SNFW +
Sbjct: 67 RQAALRTAREKWSDYILFIDVDNFLTNPQTLTLMIAENRTIVAPML-ESRGLYSNFWCGI 125
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN-----S 481
GFY R+ DY+ I + G + VP + + +L+ +A+N Y + +
Sbjct: 126 TPQGFYKRTPDYLQI--REWKRIGCFPVPMVHSTFLIDLRK-EASNNLAFYPPHQDYTWT 182
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + R GI + + + + YG+L
Sbjct: 183 FDDIIVFAFSSRQAGIQMYLCNKEHYGYL 211
>gi|332025630|gb|EGI65792.1| Glycosyltransferase 25 family member [Acromyrmex echinatior]
Length = 357
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 50/192 (26%), Positives = 94/192 (48%), Gaps = 22/192 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P+VLISV + L FL+ + N +YP K+IS ++ NN + + + +I++ M
Sbjct: 28 PTVLISVLVRNKAHTLPYFLSLLENQDYPKKRISFWIRSDNNVDNSIEILNKWINSRSKM 87
Query: 349 FKNVKYIAHNSTVNSKEARNLA-------------VENSLHKG----VDFYFYVDSDSHL 391
+ ++ + S+ ++ R++A E +L DF +D+D L
Sbjct: 88 YHSMNVHLNASSTGFEDERSIADWSPRRFAHIIDLREQALDYAKEIWADFILMLDADVFL 147
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
NP ++ L+++ +++APLL R +SNFW + + +Y R+ Y I+ ++
Sbjct: 148 INPSTIRNLIHKEYTVVAPLL-RSDGMYSNFWAGMTTEHYYLRTELYEPILFREKIDCH- 205
Query: 452 WNVPYITNCYLM 463
NVP I + L+
Sbjct: 206 -NVPMIHSVVLI 216
>gi|241757469|ref|XP_002401539.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase, putative [Ixodes
scapularis]
gi|215508473|gb|EEC17927.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase, putative [Ixodes
scapularis]
Length = 52
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 35/50 (70%)
Query: 591 NDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHE 640
D+RL GYE VPTRDIHM QV W FLR+Y+ P+QE+ F+GY H+
Sbjct: 2 QDERLAGGYENVPTRDIHMNQVNFEQHWLFFLREYIKPVQEKVFLGYFHD 51
>gi|443714373|gb|ELU06820.1| hypothetical protein CAPTEDRAFT_153006 [Capitella teleta]
Length = 550
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 88/193 (45%), Gaps = 22/193 (11%)
Query: 294 LISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFKTMFKN 351
+I+ F+ + FL + L+Y +KI++++ + N + A L +I K M+ +
Sbjct: 1 MIAFFVRNKAHTIPYFLYYLEQLDYDKQKINLWIRSDHNVDLSASLIKHWIPKAKKMYNH 60
Query: 352 VKYIAHNSTVNSKEAR-----------------NLAVENSLHKGVDFYFYVDSDSHLDNP 394
V + NST + R A+ S + + FY+D D+ L N
Sbjct: 61 VSFKDDNSTSAFSDERGPFDWSADRMKHMIMLRQEALNVSRQMNLRYIFYIDVDNILVNS 120
Query: 395 DVLKYLVNRNESLIAPLL-VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWN 453
VL++L++ +AP+L +SNFW ++ GFY R+ +Y I + +G +
Sbjct: 121 QVLRHLISLQRIAVAPMLNTTASPHYSNFWAGMDEQGFYKRTLEYKPI--QLRHTQGTFQ 178
Query: 454 VPYITNCYLMKTS 466
VP I + L+ S
Sbjct: 179 VPMIHSTLLLDLS 191
>gi|47206702|emb|CAF89946.1| unnamed protein product [Tetraodon nigroviridis]
Length = 270
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 73/150 (48%), Gaps = 11/150 (7%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++ + + D+ +VDSD+ L NP VL L+ N +L+AP+L +SNFW +
Sbjct: 101 RQAALKAARERWADYILFVDSDNLLTNPRVLTLLMAENLTLLAPML-ESRSLYSNFWCGV 159
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
G+Y R+ DY I + G + VP + + +L+ + + ++ + DY
Sbjct: 160 TPQGYYKRTPDYQPIREWKR--LGCFPVPMVHSTFLL--DLRRESSRDLAFYPPHPDYSW 215
Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
M F + R G+ + + + + YG L
Sbjct: 216 AFDDIMVFAFSARQAGVQMHVCNREHYGFL 245
>gi|307213490|gb|EFN88899.1| Glycosyltransferase 25 family member [Harpegnathos saltator]
Length = 347
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 95/192 (49%), Gaps = 22/192 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIH----- 343
P+VLI++ + L FL+ + L+YP +++ +++ NN + + + +I+
Sbjct: 9 PTVLITILVRNKAHTLPYFLSLMEQLDYPKERMCLWICSDNNVDNTIEILNKWINSEGKK 68
Query: 344 --------NFKTM-FKNVKYIAHNSTVNSKEARNL---AVENSLHKGVDFYFYVDSDSHL 391
N +M F++ K I S+ NL A+ + DF + +D+D L
Sbjct: 69 YHCLNVHLNATSMGFEDEKTITDWSSRRFAHVINLREQALNYARQIWTDFIWMLDADVFL 128
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
N L+ LV + E+++APLL + +SNFW + A+ +YAR+ Y I+ ++ G
Sbjct: 129 TNSSTLRNLVLKGETVVAPLL-KSDGMYSNFWAGMTAEYYYARTDQYEPILYREE--IGC 185
Query: 452 WNVPYITNCYLM 463
NVP I + L+
Sbjct: 186 HNVPMIHSAVLI 197
>gi|444726648|gb|ELW67172.1| Procollagen galactosyltransferase 1 [Tupaia chinensis]
Length = 983
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 61/107 (57%), Gaps = 3/107 (2%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VD+D+ + NPD L L+ N++++AP+L A+SNFW +
Sbjct: 548 RQAALKSARDMWADYILFVDADNFILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 606
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNI 473
+ G+Y R+ Y+ I D+ +G + VP + + +L+ + N+
Sbjct: 607 TSQGYYKRTPAYIPIRKRDR--QGCFAVPMVHSTFLIDLRKAASRNL 651
>gi|432090315|gb|ELK23745.1| Procollagen galactosyltransferase 1 [Myotis davidii]
Length = 578
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 67/122 (54%), Gaps = 5/122 (4%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VD+D+ + NPD L L+ N++++AP+L A+SNFW +
Sbjct: 118 RQAALKSARDMWADYILFVDADNLILNPDTLTLLIAENKTVVAPML-DSRAAYSNFWCGM 176
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDM 486
+ G+Y R+ Y+ I D+ +G + VP + + +L+ + KA + + DY
Sbjct: 177 TSQGYYKRTPAYIPIRKRDR--QGCFAVPMVHSTFLI--DLRKAASRSLAFYPPHTDYTW 232
Query: 487 AF 488
AF
Sbjct: 233 AF 234
>gi|426230314|ref|XP_004009220.1| PREDICTED: procollagen galactosyltransferase 1 [Ovis aries]
Length = 618
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 77/149 (51%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VD+D+ + NPD L L+ N++++AP+L A+SNFW +
Sbjct: 143 RQAALKSARDMWADYILFVDADNLILNPDTLTLLIAENKTVVAPML-DSRAAYSNFWCGM 201
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
+ G+Y R+ Y+ I D+ +G + VP + + +L+ + N+ YT S
Sbjct: 202 TSQGYYKRTPAYIPIRKRDR--RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 258
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + + + + + + + YG L
Sbjct: 259 FDDIIVFAFSCKQAEVQMYVCNREVYGFL 287
>gi|410950910|ref|XP_003982145.1| PREDICTED: procollagen galactosyltransferase 1, partial [Felis
catus]
Length = 535
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 77/149 (51%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VD+D+ + NPD L L+ N++++AP+L A+SNFW +
Sbjct: 60 RQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 118
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
+ G+Y R+ Y+ I D+ +G + VP + + +L+ + N+ YT S
Sbjct: 119 TSQGYYKRTPAYIPIRKRDR--QGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 175
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + + + + + + + YG L
Sbjct: 176 FDDIIVFAFSCKQAEVQMYVCNKEVYGFL 204
>gi|312068784|ref|XP_003137376.1| hypothetical protein LOAG_01790 [Loa loa]
Length = 102
Score = 62.4 bits (150), Expect = 8e-07, Method: Composition-based stats.
Identities = 29/76 (38%), Positives = 46/76 (60%), Gaps = 1/76 (1%)
Query: 33 KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNEL 91
K LV+TVA+ ETDG +R ++A N +++ G+ + W GG+ GGG K+ +L+ L
Sbjct: 27 KLLVVTVATEETDGLRRLKRTAHTNHFRLEVFGMGEEWRGGNTRVEQGGGQKIRILRKSL 86
Query: 92 DEMDITDDMIILVTDS 107
+ DD+IIL D+
Sbjct: 87 GKYKDRDDLIILFVDA 102
>gi|410931648|ref|XP_003979207.1| PREDICTED: procollagen galactosyltransferase 2-like, partial
[Takifugu rubripes]
Length = 536
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 73/150 (48%), Gaps = 11/150 (7%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++ + + D+ +VDSD+ L NP VL L+ N +L+AP+L +SNFW +
Sbjct: 60 RQAALKAARERWADYILFVDSDNLLTNPRVLTLLMAENLTLVAPML-ESRSLYSNFWCGV 118
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
G+Y R+ DY I + G + VP + + +L+ + + ++ + DY
Sbjct: 119 TPQGYYKRTPDYQPIREWKR--LGCFPVPMVHSTFLL--DLRRESSRDLAFYPPHPDYSW 174
Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
M F + R G+ + + + + YG L
Sbjct: 175 AFDDIMVFAFSARQVGVQMHVCNREHYGLL 204
>gi|303291280|ref|XP_003064926.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226453597|gb|EEH50906.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 383
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 57/258 (22%), Positives = 106/258 (41%), Gaps = 39/258 (15%)
Query: 31 EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGG----DMSSLGGGYKVNL 86
++K + T + T G + SA N + LG++ ++G + L G
Sbjct: 78 QNKLVFFTYSDRVTTGLCLSMLSAASNGFLLHVLGINDTYVGDVHEPKLKKLYGMKSFLS 137
Query: 87 LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTF-----DANIVFGAERLCWP- 140
+ L+ + D+ +++ D+ DV+ G ++ L I+ ER CWP
Sbjct: 138 DRRALERYGLGDETVLVFADASDVLYLGSRDEALHTLQQLLGPLERGIILISGERNCWPF 197
Query: 141 ---DTSLY-------DKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSI---KNEEDDQL 187
D L +++P S +R+LN+G + G K ++ + N DDQL
Sbjct: 198 VHYDKELTAGGREKCEEFPHRNSSFRFLNAGAYAGAIKPMRAFLKTLHAGIPSNVSDDQL 257
Query: 188 YYALLFLDETLRTKH---KIVLDTLANLFQNLYGSLEDIKLNFDLDEFV----------- 233
+ L+ + +H ++V+D + +FQ G L ++ DE V
Sbjct: 258 VFQELYSKQVREGRHELFELVIDHASKMFQT--GHLTSLEGAGTFDEPVPMNAYFNAGIG 315
Query: 234 HLTNTKYNTNPVIIHGNG 251
+ N++ T P ++H NG
Sbjct: 316 RVVNSESETRPFLVHFNG 333
>gi|349992099|dbj|GAA36581.1| collagen beta-1 O-galactosyltransferase [Clonorchis sinensis]
Length = 673
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 60/252 (23%), Positives = 114/252 (45%), Gaps = 34/252 (13%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P+V I V + L FL+ + +Y K+I + N+ + + +I +
Sbjct: 120 PTVFIGVLVRNKAHALPYFLHGLETQDYLTKRIQLLFLADNSIDDSVNVLSQWIDSVSER 179
Query: 349 FKNVK------YIAHNSTVNSKEARNLAVE-----NSLHKG-VDFYFYVDSDSHLDNPDV 396
+ V Y+AH+ +++ ++A+ N+ K DFY +D+D L NP
Sbjct: 180 YHQVNLEIGGDYLAHSKMWSTEHYEHVALLRQRLLNAARKSWADFYLTIDADVILMNPGT 239
Query: 397 LKYLVNRNES-------LIAPL-LVRPF------KAWSNFWGALNADGFYARSFDYMNII 442
LK+LV +S L+ PL ++ P + +SNFWGA+ G+YARS Y +I
Sbjct: 240 LKHLVESAQSPGKIVSELLDPLPVISPLMNCTSSEFYSNFWGAMTETGYYARSDTYFDIQ 299
Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS----MDYDMAFCTNLRNKGIH 498
+ G++ VP + + +L+ + N++ + +D + F + + +
Sbjct: 300 R--RLVLGLFEVPMVHSIFLVNLRHKLSENLRYFPPPSGYKGPLDDLIIFARSAQLSNVP 357
Query: 499 LKIDSTQEYGHL 510
+D+ + YG+L
Sbjct: 358 FYLDNREFYGYL 369
>gi|440904329|gb|ELR54855.1| Procollagen galactosyltransferase 1, partial [Bos grunniens mutus]
Length = 544
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 77/149 (51%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VD+D+ + NPD L L+ N++++AP+L A+SNFW +
Sbjct: 69 RQAALKSARDMWADYILFVDADNLILNPDTLTLLIAENKTVVAPML-DSRAAYSNFWCGM 127
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
+ G+Y R+ Y+ I ++ +G + VP + + +L+ + N+ YT S
Sbjct: 128 TSQGYYKRTPAYIPIRKRER--RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 184
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + + + + + + + YG L
Sbjct: 185 FDDIIVFAFSCKQAEVQMYVCNKEVYGFL 213
>gi|344283113|ref|XP_003413317.1| PREDICTED: procollagen galactosyltransferase 1-like [Loxodonta
africana]
Length = 540
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 77/149 (51%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VD+D+ + NPD L L+ N++++AP+L A+SNFW +
Sbjct: 99 RQAALKSARDMWADYILFVDADNLILNPDTLTLLMAENKTVVAPML-DSRAAYSNFWCGM 157
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
+ G+Y R+ Y+ I D+ +G + VP + + +L+ + N+ YT S
Sbjct: 158 TSQGYYKRTPAYIPIRKRDR--QGCFPVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 214
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + + + + + + + YG L
Sbjct: 215 FDDIIVFAFSCKQAEVQMYVCNKEVYGFL 243
>gi|297276457|ref|XP_001114885.2| PREDICTED: procollagen galactosyltransferase 1-like [Macaca
mulatta]
Length = 474
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 70/132 (53%), Gaps = 9/132 (6%)
Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
+VD+D+ + NPD L L+ N++++AP+L A+SNFW + + G+Y R+ Y+ I
Sbjct: 26 FVDADNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRK 84
Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIH 498
D+ +G + VP + + +L+ + N+ YT S D + F + + +
Sbjct: 85 RDR--RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQ 141
Query: 499 LKIDSTQEYGHL 510
+ + + +EYG L
Sbjct: 142 MYVCNKEEYGFL 153
>gi|281343517|gb|EFB19101.1| hypothetical protein PANDA_000528 [Ailuropoda melanoleuca]
Length = 535
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 77/149 (51%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VD+D+ + NP+ L L+ N++++AP+L A+SNFW +
Sbjct: 60 RQAALKSARDMWADYILFVDADNLILNPNTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 118
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
+ G+Y R+ Y+ I D+ +G + VP + + +L+ + N+ YT S
Sbjct: 119 TSQGYYKRTPAYIPIRKRDR--RGCFAVPMVHSTFLIDLRKSASRNLAFYPPHPDYTW-S 175
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + + + + + + + YG L
Sbjct: 176 FDDIIVFAFSCKQAEVQMYVCNKEMYGFL 204
>gi|301753877|ref|XP_002912839.1| PREDICTED: procollagen galactosyltransferase 1-like [Ailuropoda
melanoleuca]
Length = 542
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 77/149 (51%), Gaps = 9/149 (6%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VD+D+ + NP+ L L+ N++++AP+L A+SNFW +
Sbjct: 67 RQAALKSARDMWADYILFVDADNLILNPNTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 125
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
+ G+Y R+ Y+ I D+ +G + VP + + +L+ + N+ YT S
Sbjct: 126 TSQGYYKRTPAYIPIRKRDR--RGCFAVPMVHSTFLIDLRKSASRNLAFYPPHPDYTW-S 182
Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + + + + + + + YG L
Sbjct: 183 FDDIIVFAFSCKQAEVQMYVCNKEMYGFL 211
>gi|149757348|ref|XP_001499949.1| PREDICTED: procollagen galactosyltransferase 1 [Equus caballus]
Length = 548
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 61/107 (57%), Gaps = 3/107 (2%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VD+D+ + NPD L L+ N++++AP+L A+SNFW +
Sbjct: 73 RQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 131
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNI 473
+ G+Y R+ Y+ I ++ +G + VP + + +L+ + N+
Sbjct: 132 TSQGYYRRTPAYIPIRKRER--RGCFAVPMVHSTFLIDLRKAASRNL 176
>gi|326402622|ref|YP_004282703.1| hypothetical protein ACMV_04740 [Acidiphilium multivorum AIU301]
gi|325049483|dbj|BAJ79821.1| hypothetical protein ACMV_04740 [Acidiphilium multivorum AIU301]
Length = 667
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 62/270 (22%), Positives = 114/270 (42%), Gaps = 40/270 (14%)
Query: 284 SLKPDQF--------PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEY 333
+LKP++ P VLI++ + FL +L+ I L+YP I +++ NN +
Sbjct: 385 ALKPERLLRSGTTTAPRVLIAILAKQKEEFLPLYLDCIEALDYPKSSIVLYIRTNNNTDR 444
Query: 334 HAPLFDDYIHNFKTMFKNVKYIAHNSTVNSK------------------EARNLAVENSL 375
+ ++I + V++ S V+ + RN ++ +
Sbjct: 445 TEEILREWIARVGHSYAAVEF--DPSDVDERVEQFGAHEWNAIRFRVLGRIRNESLRKTR 502
Query: 376 HKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYA 433
G D+YF D D+ + L+ LV ++APLL P +SN ++ +G++
Sbjct: 503 EHGCDWYFVADIDNFIRRC-TLRELVATGLPIVAPLLRDAEPSSYYSNLHAEIDDNGYFR 561
Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTN-L 492
Y I++ + +G+ VP + Y ++ VI+ N Y S Y+ ++
Sbjct: 562 DCAQYELIMS--RRIQGLIEVPLVHCTYAVRADVIEHLN----YDDGSGRYEYVILSDSA 615
Query: 493 RNKGIHLKIDSTQEYGHLVDSENFDPQKTN 522
R I D+ Q YG++ S+N D N
Sbjct: 616 RKASIPQYFDNRQVYGYITFSKNPDQYDEN 645
>gi|301758784|ref|XP_002915272.1| PREDICTED: glycosyltransferase 25 family member 3-like [Ailuropoda
melanoleuca]
Length = 590
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 52/255 (20%), Positives = 110/255 (43%), Gaps = 30/255 (11%)
Query: 280 KHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN--------- 330
K+L++ P P+V++++ L +L + L+YP +++++ +
Sbjct: 17 KNLEASPP--LPAVVLTILARNAEHSLPHYLGALERLDYPRARLALWCATDHNTDNTTQM 74
Query: 331 -QEYHAPLFDDYIHNF------KTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVD 380
+E+ A + DDY F + + + H + + E + A+ + G D
Sbjct: 75 LREWLAAVGDDYAAVFWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGAD 134
Query: 381 FYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMN 440
+ + D+D+ L N L+ L+ + ++AP+L +SNFW + G+Y R+ DY
Sbjct: 135 YILFADTDNILTNNQTLRLLIEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFP 193
Query: 441 IINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNK 495
N + +G + VP + + +L+ A + YT D + F +
Sbjct: 194 TKNRQR--RGCFRVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYACQAA 250
Query: 496 GIHLKIDSTQEYGHL 510
G+ + + + YG++
Sbjct: 251 GVTVHVCNEHRYGYM 265
>gi|313229149|emb|CBY23734.1| unnamed protein product [Oikopleura dioica]
Length = 576
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/196 (24%), Positives = 93/196 (47%), Gaps = 26/196 (13%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
PS+L+ VF+ L F + NYP +I ++ +N + + + ++
Sbjct: 29 PSILLPVFVRNKEHALPYFFGGLERQNYPKSRIRLWFVTDHNADNSLEVIKAWKEAWEME 88
Query: 349 FKNVKYIAHN------STVNSK------------EARNLAVENSLHKGVDFYFYVDSDSH 390
+ ++K + S +++ + R A+ ++ + VD+ F +D+D+
Sbjct: 89 YMDIKIEIRDPRKGFWSDADTELSWSPNRYDHILKLRQQALNHARNMLVDYLFMIDADNI 148
Query: 391 LDNPDVLKYLVNRNESLIAPLLVR--PFKAWSNFWGALNAD-GFYARSFDYMNIINGDQG 447
L P +L+ LV R++ ++ P+L PF SN+W NA+ G+Y R DY +I +Q
Sbjct: 149 LVQPSLLRKLVLRDKPIVGPMLETGVPF---SNYWTNQNAETGYYERGDDYYDIRYYEQD 205
Query: 448 GKGIWNVPYITNCYLM 463
+ VP + +CYL+
Sbjct: 206 FLNVHKVPMLHSCYLI 221
>gi|281346524|gb|EFB22108.1| hypothetical protein PANDA_019266 [Ailuropoda melanoleuca]
Length = 635
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 61/276 (22%), Positives = 114/276 (41%), Gaps = 42/276 (15%)
Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAK-KISMFVYNNQEYHAPLFDDYIHNF 345
P Q P+VL+++ L FL + L + K +N + + +++ N
Sbjct: 48 PMQRPTVLVAILARNAAHALPHFLGCLERLXHAKSLKSKAATDHNVDNTTEILREWLKNV 107
Query: 346 KTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDSD 388
++ + V++ I S+ A R A+ + K D+ ++D D
Sbjct: 108 QSFYHYVEWRPMDEPESYPDEIGPKHWPGSRFAHVMKLRQAALRTAREKWSDYILFIDVD 167
Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNA-----------DGFYARSFD 437
+ L NP L ++ N++++AP+L +SNFW + GFY R+ D
Sbjct: 168 NFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGITPQAKQSPPISFFQGFYKRTPD 226
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTN 491
Y+ I + G + VP + + +L+ + K + K ++ DY + F +
Sbjct: 227 YLQIREWKR--LGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFS 282
Query: 492 LRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYE 527
R GI + + + + YG+L PQ+T E E
Sbjct: 283 SRQAGIQMYLCNREHYGYLPIP--LKPQQTLQEEIE 316
>gi|410979348|ref|XP_003996047.1| PREDICTED: glycosyltransferase 25 family member 3 [Felis catus]
Length = 560
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 42/203 (20%), Positives = 88/203 (43%), Gaps = 22/203 (10%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 86 LPAVVLTILARNAEHSLPHYLGALERLDYPRARLALWCATDHNTDNTTQMLQEWLAAVGD 145
Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
DY + + + H + + E + A+ + G D+ + D+D+
Sbjct: 146 DYAAVVWRPEGAPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 205
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ + ++AP+L +SNFW + G+Y R+ DY N + +G
Sbjct: 206 LTNNQTLRLLIEQRLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 262
Query: 451 IWNVPYITNCYLMKTSVIKATNI 473
+ VP + + +L+ A +
Sbjct: 263 CFRVPMVHSTFLVSLRAEGAAQL 285
>gi|313215923|emb|CBY37331.1| unnamed protein product [Oikopleura dioica]
Length = 579
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 47/196 (23%), Positives = 93/196 (47%), Gaps = 26/196 (13%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
PS+L+ VF+ L F + NYP +I ++ +N + + + ++
Sbjct: 29 PSILLPVFVRNKEHALPYFFGGLERQNYPKSRIRLWFVTDHNADNSLEVIKAWKEAWEME 88
Query: 349 FKNVKYIAHN------STVNSK------------EARNLAVENSLHKGVDFYFYVDSDSH 390
+ ++K + S +++ + R A+ ++ + VD+ F +D+D+
Sbjct: 89 YMDIKIEIRDPRKGFWSDADTELSWSPNRYDHILKLRQQALNHARNMLVDYLFMIDADNI 148
Query: 391 LDNPDVLKYLVNRNESLIAPLLVR--PFKAWSNFWGALNAD-GFYARSFDYMNIINGDQG 447
L P +++ LV R++ ++ P+L PF SN+W NA+ G+Y R DY +I +Q
Sbjct: 149 LVQPSLIRKLVLRDKPIVGPMLETGVPF---SNYWTNQNAETGYYERGDDYYDIRYYEQD 205
Query: 448 GKGIWNVPYITNCYLM 463
+ VP + +CYL+
Sbjct: 206 FLNVHKVPMLHSCYLI 221
>gi|281349465|gb|EFB25049.1| hypothetical protein PANDA_003209 [Ailuropoda melanoleuca]
Length = 569
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 49/245 (20%), Positives = 104/245 (42%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + +E+ A + D
Sbjct: 4 LPAVVLTILARNAEHSLPHYLGALERLDYPRARLALWCATDHNTDNTTQMLREWLAAVGD 63
Query: 340 DYIHNF------KTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
DY F + + + H + + E + A+ + G D+ + D+D+
Sbjct: 64 DYAAVFWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 123
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ + ++AP+L +SNFW + G+Y R+ DY N + +G
Sbjct: 124 LTNNQTLRLLIEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 180
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ A + YT D + F + G+ + + +
Sbjct: 181 CFRVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVTVHVCNEH 239
Query: 506 EYGHL 510
YG++
Sbjct: 240 RYGYM 244
>gi|307166664|gb|EFN60661.1| Glycosyltransferase 25 family member [Camponotus floridanus]
Length = 357
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 92/194 (47%), Gaps = 26/194 (13%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN------------------QE 332
P+VLI++ + L FL+ + +YP K+I +++ ++ ++
Sbjct: 28 PTVLIAILVRNKAHTLPYFLSLLERQDYPKKRICLWIRSDHNVDRSIEILNKWIGLEGKK 87
Query: 333 YHAPLFDDYIHNFKTMFKNVKYIAHNST---VNSKEARNLAVENSLHKGVDFYFYVDSDS 389
YH + ++ T F++ + A S + + R A+ + DF F +D+D
Sbjct: 88 YHC--LNIQLNATSTRFEDERTFADWSPRRFAHVIDLREQALNYAREIWADFIFMLDADV 145
Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
L N ++ LV + ++++APLL R +SNFW + A+ +Y R+ Y I+ ++
Sbjct: 146 FLTNSSTMRDLVLKGQTVVAPLL-RSDGMYSNFWAGITAEYYYVRTDLYEPILFREK--T 202
Query: 450 GIWNVPYITNCYLM 463
G NVP + + L+
Sbjct: 203 GCHNVPMVHSAVLI 216
>gi|73968112|ref|XP_851283.1| PREDICTED: glycosyltransferase 25 family member 3 [Canis lupus
familiaris]
Length = 595
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/244 (19%), Positives = 105/244 (43%), Gaps = 26/244 (10%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 30 LPAVVLTILARNAEHSLPHYLGALERLDYPRARLALWCATDHNTDNTTEMLQEWLAAVGD 89
Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
DY + + + H + + E + A+ + G D+ + D+D+
Sbjct: 90 DYATVVWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+++ ++AP+L +SNFW + G+Y R+ DY N + +G
Sbjct: 150 LTNNQTLRLLIDQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 206
Query: 451 IWNVPYITNCYLMKTSVIKATNIK--TIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQE 506
+ VP + + +L+ A + + S +D + F + G+ + + +
Sbjct: 207 CFQVPMVHSTFLVSLRTEGAAQLAFYPPHPNYSWPFDDIIVFAYACQAVGVTIHVCNEHR 266
Query: 507 YGHL 510
YG++
Sbjct: 267 YGYM 270
>gi|338979866|ref|ZP_08631205.1| Glycosyl transferase family protein [Acidiphilium sp. PM]
gi|338209221|gb|EGO97001.1| Glycosyl transferase family protein [Acidiphilium sp. PM]
Length = 658
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/270 (22%), Positives = 114/270 (42%), Gaps = 40/270 (14%)
Query: 284 SLKPDQF--------PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEY 333
+LKP++ P VLI++ + FL +L+ I L+YP I +++ NN +
Sbjct: 376 ALKPERLLRSGTTTAPRVLIAILAKQKEEFLPLYLDCIEALDYPKSSIVLYIRTNNNTDR 435
Query: 334 HAPLFDDYIHNFKTMFKNVKYIAHNSTVNSK------------------EARNLAVENSL 375
+ ++I + V++ S V+ + RN ++ +
Sbjct: 436 TEEILREWIARVGHSYAAVEF--DPSDVDERVEQFGAHEWNAIRFRVLGRIRNESLRKTR 493
Query: 376 HKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYA 433
G D+YF D D+ + L+ LV ++APLL P +SN ++ +G++
Sbjct: 494 EHGCDWYFVADIDNFIRRC-TLRELVATGLPIVAPLLRDAEPSSYYSNLHAEIDDNGYFR 552
Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD-MAFCTNL 492
Y I++ + +G+ VP + Y ++ VI+ N Y S ++ + +
Sbjct: 553 DCAQYELIMS--RRIQGLIEVPLVHCTYAVRADVIEHLN----YDDGSGRHEYVVLSDSA 606
Query: 493 RNKGIHLKIDSTQEYGHLVDSENFDPQKTN 522
R I D+ Q YG++ S+N D N
Sbjct: 607 RKASIPQYFDNRQVYGYITFSKNPDQDDEN 636
>gi|255072887|ref|XP_002500118.1| predicted protein [Micromonas sp. RCC299]
gi|226515380|gb|ACO61376.1| predicted protein [Micromonas sp. RCC299]
Length = 505
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 59/129 (45%), Gaps = 12/129 (9%)
Query: 33 KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELD 92
K LV+ + + D K F+ S + L G+ W + +G K +LL+ D
Sbjct: 201 KDLVVATHTTDKDASKLFMASIHKHGLAASVSGVGTWWHSHEDKEIG--LKASLLRLPAD 258
Query: 93 EMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG 152
E D ++++ DS D + ++L RF DA+IV G E CWP + + G
Sbjct: 259 E-----DPLVILADSDDSMFTCDAEEMLSRFEELDADIVVGTETRCWPPEASH-----CG 308
Query: 153 SGYRYLNSG 161
GY++L G
Sbjct: 309 DGYKHLEEG 317
>gi|114626942|ref|XP_001157210.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 1 [Pan
troglodytes]
gi|410224368|gb|JAA09403.1| cerebral endothelial cell adhesion molecule [Pan troglodytes]
gi|410257424|gb|JAA16679.1| cerebral endothelial cell adhesion molecule [Pan troglodytes]
gi|410333099|gb|JAA35496.1| cerebral endothelial cell adhesion molecule [Pan troglodytes]
Length = 595
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/245 (20%), Positives = 103/245 (42%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 30 LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89
Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
DY F ++ K+ E + A+ + + G D+ + D+D+
Sbjct: 90 DYAAVVWRPEGEPRFYPDEESPKHWTKERHQFLMELKQEALTFARNWGADYILFADTDNI 149
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ + ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 150 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ A + YT D + F + G+ + + +
Sbjct: 207 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265
Query: 506 EYGHL 510
YG++
Sbjct: 266 RYGYM 270
>gi|355678476|gb|AER96128.1| cerebral endothelial cell adhesion molecule [Mustela putorius furo]
Length = 597
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/245 (20%), Positives = 103/245 (42%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 30 LPAVVLTILARNAEHSLPHYLGALERLDYPRARLALWCATDHNTDNSTQMLQEWLAAVGD 89
Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
DY + + + H + + E + A+ + G D+ + D+D+
Sbjct: 90 DYAAVVWRPEGDPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ + ++AP+L +SNFW + G+Y R+ DY N + +G
Sbjct: 150 LTNNQTLRLLIEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 206
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ A + YT D + F + G+ + + +
Sbjct: 207 CFRVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVTVHVCNEH 265
Query: 506 EYGHL 510
YG++
Sbjct: 266 RYGYM 270
>gi|307102945|gb|EFN51210.1| expressed protein [Chlorella variabilis]
Length = 666
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 85/194 (43%), Gaps = 22/194 (11%)
Query: 72 GGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDAN-- 129
G+ S + G ++ L++ + D I+L+ D+ D +I +L+ +N
Sbjct: 447 AGEFSQVAWGMRLKALRDFAARLTRRD--IVLMADARDALIGASPEALLDTYNDTVGGQR 504
Query: 130 -IVFGAERLCWP----DTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEED 184
++FGAE CW + + YP G+ YR+LN+G +G A I+ L+ + SI D
Sbjct: 505 LVLFGAEPHCWQHDLCPPEVVEGYPETGTPYRFLNAGTVMGPADVIRRLL-DASIDWAAD 563
Query: 185 ----DQLYYALLFLDETLRT---KHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTN 237
++ FL L+ + + +D+ +F + D+ N
Sbjct: 564 TAHRQPGFHDQGFLHGLLKAGPQRRLMAVDSRCRVFCAFFSRQHDLACTRR-----GWLN 618
Query: 238 TKYNTNPVIIHGNG 251
T T P+I+HG+G
Sbjct: 619 TYTGTYPLILHGSG 632
>gi|403299720|ref|XP_003940624.1| PREDICTED: glycosyltransferase 25 family member 3 [Saimiri
boliviensis boliviensis]
Length = 595
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/245 (20%), Positives = 104/245 (42%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 30 LPAVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTEMLQEWLAAVGD 89
Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
DY + + + H + + E + A+ + G D+ + D+D+
Sbjct: 90 DYATVVWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFAREWGADYILFADTDNI 149
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ LV + ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 150 LTNNQTLRLLVGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
++VP + + +L+ A + YT D + F + G+ + + +
Sbjct: 207 CFHVPMVHSTFLVSLRAEGADQLAFYPPHRNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265
Query: 506 EYGHL 510
YG++
Sbjct: 266 RYGYM 270
>gi|363740426|ref|XP_003642326.1| PREDICTED: glycosyltransferase 25 family member 3 [Gallus gallus]
Length = 596
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/243 (23%), Positives = 111/243 (45%), Gaps = 33/243 (13%)
Query: 296 SVFIDKPTAF---LEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTMFK 350
SV I +P L L + +L++PA I+++ +N + + +++ + +
Sbjct: 40 SVPIYRPIPIPHSLPHCLGALESLDFPAGNIALWCATDHNSDNTTAMLQEWLQAVGSNYH 99
Query: 351 NVKYIAHNST------VNSKEARNLAVEN---------SLHKGV--DFYFYVDSDSHLDN 393
+V + A + K + EN S +G+ D+ +VD+DS L N
Sbjct: 100 SVAWKAEEGPSSYPDELGPKHWSDKRYENLMRLKQEALSYARGLRADYILFVDTDSILTN 159
Query: 394 PDVLKYLVNRNESLIAPLL-VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIW 452
L +L+ +N+S++AP+L + F +SNFW + GFY R+ DY N + +G +
Sbjct: 160 NQTLTFLMAQNKSVVAPMLDSQTF--YSNFWCGITPQGFYRRTADYFPTKNRQR--RGCF 215
Query: 453 NVPYITNCYLM-----KTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
VP + +L+ +T+ + YT + D + F + + G + + + Q +
Sbjct: 216 AVPMVYATFLIDLRKEETAQLAFYPPHPNYTW-AFDDIIVFAYSCQEAGAEVHVCNQQRF 274
Query: 508 GHL 510
G++
Sbjct: 275 GYI 277
>gi|440894671|gb|ELR47071.1| Glycosyltransferase 25 family member 3, partial [Bos grunniens
mutus]
Length = 579
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/245 (20%), Positives = 103/245 (42%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + +E+ A + D
Sbjct: 14 LPTVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTAMLREWLAAVGD 73
Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
DY + + + H + + E + A+ + G D+ + D+D+
Sbjct: 74 DYAAVVWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 133
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ ++AP+L +SNFW + G+Y R+ DY N + +G
Sbjct: 134 LTNNQTLRLLIEPGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 190
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ + T Y + D + F + G+ + + + Q
Sbjct: 191 CFRVPMVHSTFLVSLRA-EGTAQLAFYPPHPNYTWPFDDIIVFAYACQAAGVAVHVCNEQ 249
Query: 506 EYGHL 510
YG+L
Sbjct: 250 RYGYL 254
>gi|334311907|ref|XP_001367449.2| PREDICTED: glycosyltransferase 25 family member 3 [Monodelphis
domestica]
Length = 705
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/193 (22%), Positives = 86/193 (44%), Gaps = 22/193 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V+I+V L +L + L+YP +++++ + QE+ A +
Sbjct: 141 LPTVVIAVLARNAGYSLPHYLGALERLDYPRARLALWCATDHNVDNTTEILQEWLAAMGK 200
Query: 340 DYIHN-FKTMFKNVKYIAHNSTVNSKEARNL--------AVENSLHKGVDFYFYVDSDSH 390
+Y ++ + Y S + R+ A++ + G D+ + D+D+
Sbjct: 201 EYAEVVWRPEGEPRLYPDEESPKQWTKERHQFLMELKQEALDFARAWGADYILFADTDNI 260
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N LK+L+ ++AP+L +SNFW + G+Y R+ DY N + +G
Sbjct: 261 LTNNQTLKFLIGEGLPVVAPMLDS-QTYYSNFWCGITPQGYYRRTSDYFPTKNRQR--QG 317
Query: 451 IWNVPYITNCYLM 463
+ VP + + +L+
Sbjct: 318 CFRVPMVHSTFLL 330
>gi|46411176|ref|NP_997181.1| probable inactive glycosyltransferase 25 family member 3 precursor
[Mus musculus]
gi|160395523|sp|A3KGW5.1|GT253_MOUSE RecName: Full=Probable inactive glycosyltransferase 25 family
member 3; AltName: Full=Cerebral endothelial cell
adhesion molecule; Flags: Precursor
gi|148676479|gb|EDL08426.1| cerebral endothelial cell adhesion molecule 1 [Mus musculus]
gi|187953029|gb|AAI38848.1| Cerebral endothelial cell adhesion molecule [Mus musculus]
Length = 592
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/266 (19%), Positives = 113/266 (42%), Gaps = 30/266 (11%)
Query: 284 SLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEY 333
S+ P+V++++ L +L + L+YP +++++ + +E+
Sbjct: 21 SVTEPTLPTVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNMDNTTGMLREW 80
Query: 334 HAPLFDDYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFY 384
A + DY + + + H + + E R A+ + G D+ +
Sbjct: 81 LAAVGRDYATVVWKPEEEARSYPDEQGPKHWTKERHQFLMELRQEALAFARDWGADYILF 140
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
D+D+ L N LK L++R ++AP+L +SNFW + G+Y R+ +Y N
Sbjct: 141 ADTDNILTNNQTLKLLIDRQLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNR 199
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIK--TIYTLNSMDYD--MAFCTNLRNKGIHLK 500
+ +G + VP + + +L+ + + + S +D + F + G+ +
Sbjct: 200 QR--QGCFRVPMVHSTFLLSLQTEETARLAFYPPHPNYSWPFDDIIVFAYACQAAGVSMH 257
Query: 501 IDSTQEYGHL----VDSENFDPQKTN 522
+ + YG++ ++ + +KTN
Sbjct: 258 VCNDHRYGYMNVVVKPHQSLEEEKTN 283
>gi|193788560|ref|NP_057258.3| probable inactive glycosyltransferase 25 family member 3 precursor
[Homo sapiens]
gi|74744901|sp|Q5T4B2.1|GT253_HUMAN RecName: Full=Probable inactive glycosyltransferase 25 family
member 3; AltName: Full=Cerebral endothelial cell
adhesion molecule; Flags: Precursor
gi|119608193|gb|EAW87787.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_a [Homo
sapiens]
Length = 595
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 30 LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89
Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
DY F + K+ E + A+ + + G D+ + D+D+
Sbjct: 90 DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARNWGADYILFADTDNI 149
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ + ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 150 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L A + YT D + F + G+ + + +
Sbjct: 207 CFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265
Query: 506 EYGHL 510
YG++
Sbjct: 266 RYGYM 270
>gi|297270127|ref|XP_001111820.2| PREDICTED: glycosyltransferase 25 family member 3-like [Macaca
mulatta]
Length = 714
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 149 LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNMDNTTEMLQEWLAAVGD 208
Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
DY F + K+ E + A+ + G D+ + D+D+
Sbjct: 209 DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 268
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ + ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 269 LTNNQTLRLLMGQGLPVVAPMLDS-QTYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 325
Query: 451 IWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ A + YT D + F + G+ + + +
Sbjct: 326 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVAVHVCNEH 384
Query: 506 EYGHL 510
YG++
Sbjct: 385 RYGYI 389
>gi|148259400|ref|YP_001233527.1| glycosyl transferase family protein [Acidiphilium cryptum JF-5]
gi|146401081|gb|ABQ29608.1| glycosyl transferase, family 2 [Acidiphilium cryptum JF-5]
Length = 667
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 61/270 (22%), Positives = 113/270 (41%), Gaps = 40/270 (14%)
Query: 284 SLKPDQF--------PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEY 333
+LKP + P VLI++ + FL +L+ I L+YP I +++ NN +
Sbjct: 385 ALKPKRLLRSGTTTAPRVLIAILAKQKEEFLPLYLDCIEALDYPKSSIVLYIRTNNNTDR 444
Query: 334 HAPLFDDYIHNFKTMFKNVKYIAHNSTVNSK------------------EARNLAVENSL 375
+ ++I + V++ S V+ + RN ++ +
Sbjct: 445 TEEILREWIARVGHSYAAVEF--DPSDVDERVEQFGAHEWNAIRFRVLGRIRNESLRKTR 502
Query: 376 HKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYA 433
G D+YF D D+ + L+ LV ++APLL P +SN ++ +G++
Sbjct: 503 EHGCDWYFVADIDNFIRRC-TLRELVATGLPIVAPLLRDAEPSSYYSNLHAEIDDNGYFR 561
Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD-MAFCTNL 492
Y I++ + +G+ VP + Y ++ VI+ N Y S ++ + +
Sbjct: 562 DCAQYELIMS--RRIQGLIEVPLVHCTYAVRADVIEHLN----YDDGSGRHEYVVLSDSA 615
Query: 493 RNKGIHLKIDSTQEYGHLVDSENFDPQKTN 522
R I D+ Q YG++ S+N D N
Sbjct: 616 RKASIPQYFDNRQVYGYITFSKNPDQYDEN 645
>gi|83318248|gb|AAI08699.1| CERCAM protein [Homo sapiens]
Length = 558
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 30 LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89
Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
DY F + K+ E + A+ + + G D+ + D+D+
Sbjct: 90 DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARNWGADYILFADTDNI 149
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ + ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 150 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L A + YT D + F + G+ + + +
Sbjct: 207 CFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265
Query: 506 EYGHL 510
YG++
Sbjct: 266 RYGYM 270
>gi|395741044|ref|XP_002820323.2| PREDICTED: glycosyltransferase 25 family member 3-like [Pongo
abelii]
Length = 543
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 30 LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89
Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
DY F + K+ E + A+ + G D+ + D+D+
Sbjct: 90 DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ + ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 150 LTNNQTLRLLMGQELPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ A + YT D + F + G+ + + +
Sbjct: 207 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265
Query: 506 EYGHL 510
YG++
Sbjct: 266 RYGYM 270
>gi|402896403|ref|XP_003911291.1| PREDICTED: glycosyltransferase 25 family member 3, partial [Papio
anubis]
Length = 548
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 30 LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89
Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
DY F + K+ E + A+ + G D+ + D+D+
Sbjct: 90 DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ + ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 150 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ A + YT D + F + G+ + + +
Sbjct: 207 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVAVHVCNEH 265
Query: 506 EYGHL 510
YG++
Sbjct: 266 RYGYI 270
>gi|387542892|gb|AFJ72073.1| glycosyltransferase 25 family member 3 precursor [Macaca mulatta]
Length = 595
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 30 LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89
Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
DY F + K+ E + A+ + G D+ + D+D+
Sbjct: 90 DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ + ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 150 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ A + YT D + F + G+ + + +
Sbjct: 207 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVAVHVCNEH 265
Query: 506 EYGHL 510
YG++
Sbjct: 266 RYGYI 270
>gi|426363201|ref|XP_004048734.1| PREDICTED: glycosyltransferase 25 family member 3 [Gorilla gorilla
gorilla]
Length = 595
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/245 (19%), Positives = 106/245 (43%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 30 LPAVVLAIQARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89
Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
DY ++ + + H + + E + A+ + + G D+ + D+D+
Sbjct: 90 DYAAVVWRPEGEPRVYPDEEGPKHWTKERHQFLMELKQEALTFARNWGADYILFADTDNI 149
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N +L+ L+ + ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 150 LTNNQILRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ A + YT D + F + G+ + + +
Sbjct: 207 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265
Query: 506 EYGHL 510
YG++
Sbjct: 266 RYGYM 270
>gi|380796385|gb|AFE70068.1| glycosyltransferase 25 family member 3 precursor, partial [Macaca
mulatta]
Length = 576
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 11 LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 70
Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
DY F + K+ E + A+ + G D+ + D+D+
Sbjct: 71 DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 130
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ + ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 131 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 187
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ A + YT D + F + G+ + + +
Sbjct: 188 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVAVHVCNEH 246
Query: 506 EYGHL 510
YG++
Sbjct: 247 RYGYI 251
>gi|328697541|ref|XP_001943906.2| PREDICTED: glycosyltransferase 25 family member-like [Acyrthosiphon
pisum]
Length = 374
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/201 (21%), Positives = 97/201 (48%), Gaps = 25/201 (12%)
Query: 282 LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFD 339
+D KP+ +V +++ I L F + + +L+YP ++ +++ +N + + +
Sbjct: 24 VDDRKPN---TVFVAILIRNKAHTLPYFFSALESLDYPKDRMHLWIRCDHNIDNSTQILN 80
Query: 340 DYIHNFKTMFKNVKY-IAHNSTVNSKEA----------------RNLAVENSLHKGVDFY 382
++ ++ +V I ++ST E+ R A++ + D+
Sbjct: 81 KWLKTSGAVYHSVNVKIDNDSTKYDDESGPAHWPHSRFQHIVQLRESALQTARDSWADYI 140
Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
+++D D+ + N L++L+ +N ++AP+L + +SNFW + + +Y R+ DY I+
Sbjct: 141 WFLDCDAFIINKSTLRHLIKKNYPVVAPML-KSDGLYSNFWCGMTDNYYYKRTSDYAPIV 199
Query: 443 NGDQGGKGIWNVPYITNCYLM 463
+ KG + VP I + L+
Sbjct: 200 --EWKTKGCYQVPMIHSSVLI 218
>gi|296190930|ref|XP_002743398.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 1
[Callithrix jacchus]
Length = 595
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 49/245 (20%), Positives = 103/245 (42%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A + D
Sbjct: 30 LPAVVLAILARNAEHSLPHYLGALERLDYPRARLALWYATDHNVDNTTEMLQEWLAAVGD 89
Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
DY + + + H + + E + A+ + G D+ + D+D+
Sbjct: 90 DYATVVWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ + ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 150 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ A I YT D + F + G+ + + +
Sbjct: 207 CFRVPMVHSTFLVSLRAEGADQIAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265
Query: 506 EYGHL 510
YG++
Sbjct: 266 RYGYM 270
>gi|47188856|emb|CAG14621.1| unnamed protein product [Tetraodon nigroviridis]
Length = 52
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 32/48 (66%)
Query: 591 NDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYH 638
DKR+ GYE VPT DIHMKQ+G W F+R+++ P+ + F GY+
Sbjct: 2 QDKRIAGGYETVPTDDIHMKQIGFNKEWLHFIREFISPVTLKVFSGYY 49
>gi|156120717|ref|NP_001095505.1| probable inactive glycosyltransferase 25 family member 3 precursor
[Bos taurus]
gi|160395522|sp|A7MB73.1|GT253_BOVIN RecName: Full=Probable inactive glycosyltransferase 25 family
member 3; AltName: Full=Cerebral endothelial cell
adhesion molecule; Flags: Precursor
gi|154425666|gb|AAI51374.1| CERCAM protein [Bos taurus]
Length = 595
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/245 (19%), Positives = 103/245 (42%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + +E+ A + D
Sbjct: 30 LPTVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTAMLREWLAAVGD 89
Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
+Y + + + H + + E + A+ + G D+ + D+D+
Sbjct: 90 NYAAVVWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ ++AP+L +SNFW + G+Y R+ DY N + +G
Sbjct: 150 LTNNQTLRLLIEPGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 206
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ + T Y + D + F + G+ + + + Q
Sbjct: 207 CFRVPMVHSTFLVSLRA-EGTGQLAFYPPHPNYTWPFDDIIVFAYACQAAGVAVHVCNEQ 265
Query: 506 EYGHL 510
YG+L
Sbjct: 266 RYGYL 270
>gi|395510081|ref|XP_003759312.1| PREDICTED: glycosyltransferase 25 family member 3, partial
[Sarcophilus harrisii]
Length = 564
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 22/192 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFDD 340
P+V+I+V L +L + L+YP +++++ + QE+ + D
Sbjct: 1 PAVVIAVLARNAGYSLPYYLGALERLDYPRARLALWCATDHNVDNTTEILQEWLTAVGKD 60
Query: 341 YIHN-FKTMFKNVKYIAHNSTVNSKEARNL--------AVENSLHKGVDFYFYVDSDSHL 391
Y ++ + Y S + R+ A++ + G D+ + D+D+ L
Sbjct: 61 YAEVVWRPEGEPRLYPDEESPKQWTKERHQFLMELKQEALDFARAWGADYILFADTDNIL 120
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
N LK+L+ ++AP+L +SNFW + G+Y R+ DY N + +G
Sbjct: 121 TNNQTLKFLIGEGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTTDYFPTKNRQR--QGC 177
Query: 452 WNVPYITNCYLM 463
+ VP + + +L+
Sbjct: 178 FQVPMVHSAFLL 189
>gi|326930289|ref|XP_003211280.1| PREDICTED: glycosyltransferase 25 family member 3-like [Meleagris
gallopavo]
Length = 541
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 71/138 (51%), Gaps = 11/138 (7%)
Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL-VRPFKAWSNFWGALNADGFYARSFD 437
D+ +VD+DS L N L +L+ +N+S++AP+L + F +SNFW + GFY R+ D
Sbjct: 90 ADYILFVDTDSILTNNQTLTFLMAQNKSVVAPMLDSQTF--YSNFWCGITPQGFYRRTAD 147
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLM-----KTSVIKATNIKTIYTLNSMDYDMAFCTNL 492
Y N + +G + VP + +L+ +T+ + YT + D + F +
Sbjct: 148 YFPTKNRQR--RGCFAVPMVYATFLIDLQKEETAQLAFYPPHPNYTW-AFDDIIVFAYSC 204
Query: 493 RNKGIHLKIDSTQEYGHL 510
+ G + + + Q +G++
Sbjct: 205 QEAGAEVHVCNQQRFGYI 222
>gi|296482048|tpg|DAA24163.1| TPA: glycosyltransferase 25 family member 3 precursor [Bos taurus]
Length = 531
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 48/245 (19%), Positives = 103/245 (42%), Gaps = 28/245 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + +E+ A + D
Sbjct: 30 LPTVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTAMLREWLAAVGD 89
Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
+Y + + + H + + E + A+ + G D+ + D+D+
Sbjct: 90 NYAAVVWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L+ ++AP+L +SNFW + G+Y R+ DY N + +G
Sbjct: 150 LTNNQTLRLLIEPGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 206
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ + T Y + D + F + G+ + + + Q
Sbjct: 207 CFRVPMVHSTFLVSLRA-EGTGQLAFYPPHPNYTWPFDDIIVFAYACQAAGVAVHVCNEQ 265
Query: 506 EYGHL 510
YG+L
Sbjct: 266 RYGYL 270
>gi|149039147|gb|EDL93367.1| rCG45647, isoform CRA_a [Rattus norvegicus]
Length = 596
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/261 (19%), Positives = 110/261 (42%), Gaps = 32/261 (12%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A +
Sbjct: 31 LPTVVLAILARNAEHSLPHYLGALERLDYPRARLALWGATDHNVDNTTGMLQEWLAAVGR 90
Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
DY + + + + H + + E + A+ + G D+ + D+D+
Sbjct: 91 DYATVVWKSEDEARSYPDEQGPKHWTRERHQFLMELKQEALAFARDWGADYILFADTDNI 150
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L++R ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 151 LTNNQTLRLLIDRQLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--QG 207
Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
+ VP + + +L+ + + YT D + F + G+ + + +
Sbjct: 208 CFRVPMVHSTFLVSLQTEETARLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNDH 266
Query: 506 EYGHL----VDSENFDPQKTN 522
YG++ + + +KTN
Sbjct: 267 RYGYMNVGVKPHQGLEEEKTN 287
>gi|162951747|gb|ABY21735.1| LD07116p [Drosophila melanogaster]
Length = 639
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 107/235 (45%), Gaps = 38/235 (16%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P+VLI++ + L FL+ + +YP ++I++++ ++ + L ++ N +
Sbjct: 57 PTVLIALLVRNKAHILPMFLSYLEQQDYPKERIAIWLRCDHSNDDSIELLRQWLDNSGDL 116
Query: 349 FKNVKY---IAHNSTVN---------SKEARNLAV-ENSLHKG----VDFYFYVDSDSHL 391
+ +V Y S VN S+ +A+ E + G D+ F++D+D L
Sbjct: 117 YHSVSYEFKPEEQSFVNGTSPYEWPASRFKHLIALKEEAFQYGRDIWADYVFFLDADVLL 176
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
+ D LK L ++AP+L+ +SNFW + D +Y R+ +Y I + + +G
Sbjct: 177 TSKDSLKVLTRLQLPIVAPMLISE-SLYSNFWCGMTEDYYYRRTDEYKEIYHVKK--QGS 233
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
+ VP ++ T+V+ N + + L T RNK + L+ QE
Sbjct: 234 FPVP------MVHTAVLVNMNHRAVRNL----------TFDRNKLVELQKSRQQE 272
>gi|24581946|ref|NP_723087.1| CG31915 [Drosophila melanogaster]
gi|74864910|sp|Q8IPK4.1|GLT25_DROME RecName: Full=Glycosyltransferase 25 family member; Flags:
Precursor
gi|22945672|gb|AAN10543.1| CG31915 [Drosophila melanogaster]
Length = 612
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 107/235 (45%), Gaps = 38/235 (16%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P+VLI++ + L FL+ + +YP ++I++++ ++ + L ++ N +
Sbjct: 30 PTVLIALLVRNKAHILPMFLSYLEQQDYPKERIAIWLRCDHSNDDSIELLRQWLDNSGDL 89
Query: 349 FKNVKY---IAHNSTVN---------SKEARNLAV-ENSLHKG----VDFYFYVDSDSHL 391
+ +V Y S VN S+ +A+ E + G D+ F++D+D L
Sbjct: 90 YHSVSYEFKPEEQSFVNGTSPYEWPASRFKHLIALKEEAFQYGRDIWADYVFFLDADVLL 149
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
+ D LK L ++AP+L+ +SNFW + D +Y R+ +Y I + + +G
Sbjct: 150 TSKDSLKVLTRLQLPIVAPMLISE-SLYSNFWCGMTEDYYYRRTDEYKEIYHVKK--QGS 206
Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
+ VP ++ T+V+ N + + L T RNK + L+ QE
Sbjct: 207 FPVP------MVHTAVLVNMNHRAVRNL----------TFDRNKLVELQKSRQQE 245
>gi|219127596|ref|XP_002184018.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404741|gb|EEC44687.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 487
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 58/120 (48%), Gaps = 15/120 (12%)
Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGV 680
L + + P R F G +RA FVVRY ++ L H D +INI LN
Sbjct: 215 LLDRRLAPQLARIF-GIPVTSIRANDMFVVRYDAGKRAHLTNHTDDGDISINILLND--- 270
Query: 681 DYEGGGCRFIRYN-------CNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
++ GGG RF +N +V TR+G +L H + HEG ++QG R I++ F+
Sbjct: 271 EFRGGGTRF--WNRILKTPFAHVQPTRVGQLLTHSALIN--HEGYHISQGLRMILVGFLS 326
>gi|449277697|gb|EMC85780.1| Glycosyltransferase 25 family member 2, partial [Columba livia]
Length = 176
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/175 (25%), Positives = 79/175 (45%), Gaps = 30/175 (17%)
Query: 299 IDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFD--DYIHNFKTMFKNVKYIA 356
+D TA L E+L + NL Y++ E+ P+ + Y F K+
Sbjct: 6 VDNTTAILREWLKNVQNL-----------YHDVEWR-PMEEPQSYPEEF-----GPKHWP 48
Query: 357 HNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPF 416
+ + + R A+ + K D+ ++D+D+ L NP+ L L+ N++L+AP+L
Sbjct: 49 SSRFTHVMKLRQAALRAAREKWSDYILFIDTDNLLTNPETLNLLIAENKTLVAPML-ESR 107
Query: 417 KAWSNFWGALNA--------DGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
+SNFW + G+Y R+ DY I + G + VP I + +L+
Sbjct: 108 SLYSNFWCGITPQATLSFCLQGYYKRTLDYPLI--REWKRTGCFAVPMIHSTFLI 160
>gi|354499487|ref|XP_003511840.1| PREDICTED: glycosyltransferase 25 family member 3 [Cricetulus
griseus]
gi|344244074|gb|EGW00178.1| Glycosyltransferase 25 family member 3 [Cricetulus griseus]
Length = 592
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 85/193 (44%), Gaps = 22/193 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
P+V++++ L +L + L+YP +++++ + QE+ A +
Sbjct: 27 LPTVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTEMLQEWLAAVGR 86
Query: 340 DYIHN-FKTMFKNVKYIAHNSTVNSKEARNL--------AVENSLHKGVDFYFYVDSDSH 390
DY +K + Y S + + R+ A+ + G D+ + D+D+
Sbjct: 87 DYAAVVWKPEEEARPYPDEQSPKHWTKERHQFLMELKQEALTFARAWGADYILFSDTDNI 146
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L R ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 147 LTNNQTLRLLTERQLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--QG 203
Query: 451 IWNVPYITNCYLM 463
+ VP + + +L+
Sbjct: 204 CFRVPMVHSTFLV 216
>gi|294931519|ref|XP_002779915.1| hypothetical protein Pmar_PMAR002313 [Perkinsus marinus ATCC 50983]
gi|239889633|gb|EER11710.1| hypothetical protein Pmar_PMAR002313 [Perkinsus marinus ATCC 50983]
Length = 339
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 79/193 (40%), Gaps = 33/193 (17%)
Query: 86 LLKNELDEMDITDDMIILVTDSYDVII------DGGVNDILERFNTFDANIVFGAERLCW 139
+L N L M D +++ D+ DV + V + I+ AER CW
Sbjct: 140 VLLNRLKSMPT--DALMVFNDALDVWFTPHASEEAFVKAFERELQIPEDTILVSAERNCW 197
Query: 140 PDTSLYD---KYPAVGSG--YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFL 194
P YPA G G Y+Y N+GG++G K I + +D+Q +
Sbjct: 198 PPPERMPYCRDYPASGHGTTYKYANTGGWMGRVK----TTWTACIMDGKDEQGCVQWFYR 253
Query: 195 DETLRTKH-------KIVLDTLANLFQNLYGS---------LEDIKLNFDLDEFVHLTNT 238
D ++ +I LD ++Q L+G+ LE + F ++ L N
Sbjct: 254 DAKESRQYRENVGAFRIALDDTQMIWQTLWGTKFANAERAFLEVDRAGFGEEDAGKLVNP 313
Query: 239 KYNTNPVIIHGNG 251
+ +T P+++H NG
Sbjct: 314 ETSTTPLVVHFNG 326
>gi|357607512|gb|EHJ65551.1| hypothetical protein KGM_15156 [Danaus plexippus]
Length = 516
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/223 (21%), Positives = 100/223 (44%), Gaps = 27/223 (12%)
Query: 315 NLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFKTMFKNV-----------------KYI 355
NL+YP +I ++ + N ++ + D+++ F T++ V +
Sbjct: 2 NLDYPKDRIFLWFRSDYNSDHSVDVLRDFVNKFGTLYNRVHLSYNTSKQKFDDELSPTHW 61
Query: 356 AHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRP 415
+H+ ++ + R + ++ + + D+ F +D+D L NP L++L+ + ++AP+LV
Sbjct: 62 SHSRFMHLIKWREMGIKFAKRQWADYVFMLDADVFLTNPQTLRHLIQKQLRVVAPMLVSD 121
Query: 416 FKAWSNFWGALNADGFYARSF--DYMNIINGDQGGKGIWNVPYITNCYLM-----KTSVI 468
+ +SNFW +++ D Y + ++ + ++ G VP I LM K+ I
Sbjct: 122 -RYYSNFWLSVDDDFNYRLNHEDEFYPLYEYNELYMGCHIVPVIYGAVLMDLRSKKSDYI 180
Query: 469 KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLV 511
K + L + + F N I L I + +G++
Sbjct: 181 TYDPYKIVDYLGPLQDHIIFAVNAMRNNISLHICNDDFFGYIT 223
>gi|308800012|ref|XP_003074787.1| SmkH (IC) [Ostreococcus tauri]
gi|116061327|emb|CAL52045.1| SmkH (IC) [Ostreococcus tauri]
Length = 637
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 76/172 (44%), Gaps = 28/172 (16%)
Query: 572 CHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV-GLAGVWAEFLRKYVVPLQ 630
C +V+ E+ + G + + ++AVPT D+ + ++ G+ W + P
Sbjct: 480 CPSWVEAAESVARSRGGWDTAR-----HKAVPTTDLPIHEIPGVMEQWNRLFSVVISPFI 534
Query: 631 EREFIGYHHEPVRAPMSF---------VVRYRPDE-QPSLRPHHDSSTYTINIALNQVGV 680
F R P SF VV+Y +E Q L H D +++ +AL+
Sbjct: 535 RDRF--------RLPTSFGTLYVHDAFVVKYNANEGQRELPVHTDQGQFSLTLALHDTQ- 585
Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
DY GGG F + C + R G + LTH G+ +T G RYI+++F+
Sbjct: 586 DYSGGGTIFPEHEC-IVRPRCGDFVAFRSSLTH--GGVPITAGVRYIVVAFL 634
>gi|323456551|gb|EGB12418.1| hypothetical protein AURANDRAFT_61110 [Aureococcus anophagefferens]
Length = 794
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 9/119 (7%)
Query: 621 FLRKYVVPLQERE---FIG-YHHEPVRAPMSFVVRYRPDEQ-PSLRPHHDSSTYTINIAL 675
++R V L R F G + P R F VRY + +R H D S ++++AL
Sbjct: 500 YVRALVASLAARATLLFPGTFAGAPARVLDCFFVRYDAERCFAEMRDHVDESAVSVSLAL 559
Query: 676 NQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
N G DY+GGG NV G + PG +TH G+ VT+GTR I+ F+ P
Sbjct: 560 NDAG-DYDGGGLHVAAAG-NVLNGPAGSVFCFPGAITH--GGVAVTRGTRRILSLFLVP 614
>gi|431898872|gb|ELK07242.1| Glycosyltransferase 25 family member 3, partial [Pteropus alecto]
Length = 600
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/243 (18%), Positives = 103/243 (42%), Gaps = 28/243 (11%)
Query: 292 SVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFDDY 341
+V++++ L +L + L+YP +++++ + QE+ A + +DY
Sbjct: 6 AVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTEMLQEWLAAVGNDY 65
Query: 342 I------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSHLD 392
+ + + H + + E + A+ + G D+ + D+D+ L
Sbjct: 66 AAVVWRPEGEPRSYPDEESPKHWTKERYQFLMELKQEALTFARGWGADYILFADTDNILT 125
Query: 393 NPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIW 452
N L+ L+ + ++AP+L +SNFW + G+Y R+ +Y N + +G +
Sbjct: 126 NNQTLRLLIEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RGCF 182
Query: 453 NVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
VP + + +L+ A + YT D + F + + G+ + + + Y
Sbjct: 183 RVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYSCQAAGVSVHVCNEHRY 241
Query: 508 GHL 510
G++
Sbjct: 242 GYM 244
>gi|83954578|ref|ZP_00963289.1| hypothetical protein NAS141_15193 [Sulfitobacter sp. NAS-14.1]
gi|83840862|gb|EAP80033.1| hypothetical protein NAS141_15193 [Sulfitobacter sp. NAS-14.1]
Length = 380
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 61/145 (42%), Gaps = 22/145 (15%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREF---IGYHHEPVRA 644
G D R E GY A P G G + E + Y+ P+ F +GY +
Sbjct: 114 GAMLDPRSE-GYLAAP---------GFQGFYREMMDAYMRPVSRLLFPDVVGYDTQT--- 160
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI---RYNCNVTATRM 701
F +R++ + SLRPH D+S T+NI LN G Y G FI
Sbjct: 161 -FGFSIRWQASKDTSLRPHSDASAVTLNINLNLPGEGYSGSAVSFIDPVSRRVEKLTFEP 219
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRY 726
G L+H G + H E +T+G RY
Sbjct: 220 GTALIHHGSVPHASE--PITEGERY 242
>gi|380807617|gb|AFE75684.1| procollagen galactosyltransferase 1 precursor, partial [Macaca
mulatta]
Length = 151
Score = 53.1 bits (126), Expect = 5e-04, Method: Composition-based stats.
Identities = 23/69 (33%), Positives = 43/69 (62%), Gaps = 1/69 (1%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++++ D+ +VD+D+ + NPD L L+ N++++AP+L A+SNFW +
Sbjct: 82 RQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 140
Query: 427 NADGFYARS 435
+ G+Y R+
Sbjct: 141 TSQGYYKRT 149
>gi|156347859|ref|XP_001621780.1| predicted protein [Nematostella vectensis]
gi|156208037|gb|EDO29680.1| predicted protein [Nematostella vectensis]
Length = 248
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/179 (25%), Positives = 81/179 (45%), Gaps = 21/179 (11%)
Query: 560 DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
+V+ P+ TE FC +F++ +E + + SD Y + + +G +
Sbjct: 38 EVYRLPVFTESFCEQFIEELEHF-ESSDVPRGRPNTMNNY------GVLLSDLGFDEHFI 90
Query: 620 EFLRK-YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
LR+ Y+ P+ F + + + + +F V Y P + L H+D++ T+++ L
Sbjct: 91 NPLRREYLQPITALLFPQWGGDGLDSHKAFTVHYMPGKDTELSYHYDNAEVTLSVCL--- 147
Query: 679 GVDYEGGGCRF--IRY------NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
G ++ GG F +R C R + L+H G+ H H L TQG+RY +I
Sbjct: 148 GREFSGGDLYFGDMRQVLLEDTQCTEVENRPTYGLLHRGQ--HMHGALPTTQGSRYNLI 204
>gi|328771198|gb|EGF81238.1| hypothetical protein BATDEDRAFT_87864 [Batrachochytrium
dendrobatidis JAM81]
Length = 324
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 42/130 (32%), Positives = 65/130 (50%), Gaps = 17/130 (13%)
Query: 81 GYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGG--VNDILERFNTF-----DANIVFG 133
G ++ +L + L + +D +I+ +DS DVII G V++++ R+N+ + F
Sbjct: 81 GLRIRILHDYL--LTQPEDRLIVWSDSDDVIITPGTTVSELISRYNSLVDLYNGPRVFFA 138
Query: 134 AERLCWPDTSLYDKY--PAVGSG------YRYLNSGGFIGYAKDIKELISNRSIKNEEDD 185
AE C+P L+ Y P G +RYLN+G IG A I+ LI + DD
Sbjct: 139 AEIACYPRGDLWSNYTDPEHIQGKKTYTPFRYLNAGIMIGPAGLIRRLIQVVYQHDCYDD 198
Query: 186 QLYYALLFLD 195
QL + L LD
Sbjct: 199 QLLFTLALLD 208
>gi|344338865|ref|ZP_08769796.1| hypothetical protein ThimaDRAFT_1534 [Thiocapsa marina 5811]
gi|343801447|gb|EGV19390.1| hypothetical protein ThimaDRAFT_1534 [Thiocapsa marina 5811]
Length = 276
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 66/134 (49%), Gaps = 15/134 (11%)
Query: 94 MDITDDMIILVTDSYDVIIDGGVNDILERFNT-FDANIVFGAERLCWPDTSLYDKYP-AV 151
+D + +L DS D +I G +++RF F+ +IVFGA+RL WP + ++ A+
Sbjct: 125 LDTIETPYVLYADSRDALILGNPEILVDRFEGHFETDIVFGADRLSWPPLPRFKRFERAM 184
Query: 152 GSG----YRYLNSGGFIGYAKDIKELISNR-----SIKNEEDDQLYYALLFLDETLRTKH 202
+G + YLN G +IG ++L + + + + +Q L+++
Sbjct: 185 AAGQPGDFHYLNGGTWIGRTAFCRDLFAAALEIPPTPEAPDSEQGILRTLWMER----PS 240
Query: 203 KIVLDTLANLFQNL 216
+I LD +FQN+
Sbjct: 241 EIALDYRCRMFQNI 254
>gi|395824283|ref|XP_003785400.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 2
[Otolemur garnettii]
Length = 547
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/138 (24%), Positives = 66/138 (47%), Gaps = 9/138 (6%)
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
G D+ + D+D+ L N L+ LV++ ++AP+L +SNFW + G+Y R+ +
Sbjct: 91 GADYILFADTDNILTNNQTLRLLVDQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 149
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNL 492
Y N + +G ++VP + + +L+ A + YT D + F
Sbjct: 150 YFPTKNRQR--RGCFSVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYAC 206
Query: 493 RNKGIHLKIDSTQEYGHL 510
+ G+ + + + Q YG++
Sbjct: 207 QAAGVSVHVCNDQRYGYM 224
>gi|294868172|ref|XP_002765417.1| hypothetical protein Pmar_PMAR002413 [Perkinsus marinus ATCC 50983]
gi|239865436|gb|EEQ98134.1| hypothetical protein Pmar_PMAR002413 [Perkinsus marinus ATCC 50983]
Length = 624
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 79/193 (40%), Gaps = 33/193 (17%)
Query: 86 LLKNELDEMDITDDMIILVTDSYDVII------DGGVNDILERFNTFDANIVFGAERLCW 139
+L N L M D +++ D+ DV + V + I+ AER CW
Sbjct: 425 VLLNRLKSMP--SDALMIFNDALDVWFTPHASEEAFVKAFERELQIPEDTILVSAERNCW 482
Query: 140 PDTSLYD---KYPAV--GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFL 194
P YPA G+ Y+Y N+GG++G K I + +D+Q +
Sbjct: 483 PPPERMPYCRDYPASEHGTTYKYANTGGWMGRVK----TTWTACIMDGKDEQGCVQWFYR 538
Query: 195 DETLRTKH-------KIVLDTLANLFQNLYGS---------LEDIKLNFDLDEFVHLTNT 238
D ++ +I LD ++Q L+G+ LE + F ++ L N
Sbjct: 539 DAKESRQYRENVGAFRIALDDTQMIWQTLWGTKFANVERAFLEVDRAGFGEEDAGKLVNP 598
Query: 239 KYNTNPVIIHGNG 251
+ +T P+++H NG
Sbjct: 599 ETSTTPLVVHFNG 611
>gi|224006610|ref|XP_002292265.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971907|gb|EED90240.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 288
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 58/115 (50%), Gaps = 8/115 (6%)
Query: 622 LRKYVVPLQEREFIGY---HHEPVRAPMSFVVRYRPDE-QPSLRPHHDSSTYTINIALNQ 677
L + + PL ++F Y + +R FVV+Y + Q L+PH D S + NIALN
Sbjct: 159 LVERIYPLLRQQFGMYLPDGGKSLRVADGFVVKYDAEGGQAELKPHRDGSVLSFNIALNP 218
Query: 678 VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
+++GGG F + V + G ++ H L H G +T G RYIM+ FV
Sbjct: 219 AD-EFDGGGTWFQSLDGAVKIDQ-GEVVSHSSSLLHGGHG--ITSGKRYIMVCFV 269
>gi|344271834|ref|XP_003407742.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase 25 family
member 3-like [Loxodonta africana]
Length = 596
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 47/244 (19%), Positives = 102/244 (41%), Gaps = 28/244 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFDD 340
P+V++ + L +L + L+YP +++++ + QE+ A + +D
Sbjct: 32 PAVVLVILARNAEHSLPHYLGALERLDYPRARLALWCATDHNIDNTKEMLQEWLAAVGND 91
Query: 341 YI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSHL 391
Y + + + H + + E + A+ + G D+ + D+D+ L
Sbjct: 92 YAAVVWRPEGEPRSYPDEEGPKHWTKERYQFLMELKQEALTFARDWGADYILFADTDNIL 151
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
N L+ L+ + ++AP+L +SNFW + G+Y R+ +Y N + +G
Sbjct: 152 TNNQTLQLLMEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RGC 208
Query: 452 WNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
+ VP + + +L+ A + YT D + F + G+ + + +
Sbjct: 209 FRVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNQHR 267
Query: 507 YGHL 510
YG++
Sbjct: 268 YGYM 271
>gi|387219649|gb|AFJ69533.1| hypothetical protein NGATSA_3030300 [Nannochloropsis gaditana
CCMP526]
Length = 324
Score = 52.4 bits (124), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 67/136 (49%), Gaps = 8/136 (5%)
Query: 601 AVPTRDIHMKQVGLAGVW-AEFLRKYVVPLQEREFIGYHHEPVRAPM--SFVVRYRPDE- 656
A PT D+ ++++ + W L++ + P F + + + +F+V+Y D
Sbjct: 174 AYPTTDVPLQELPRSLAWFNRQLQEKIYPCLATNFASALPDSSKLKVVDAFIVKYDADGG 233
Query: 657 QPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHE 716
Q L+PH D S + NIALN ++EGGG F + + G ++ H + H
Sbjct: 234 QTQLKPHRDGSVVSFNIALNP-SSEFEGGGTYFAGLDQGLR-IEQGHIVTHASNV--LHG 289
Query: 717 GLQVTQGTRYIMISFV 732
G ++ G RYI++SFV
Sbjct: 290 GHPISAGKRYILVSFV 305
>gi|348569847|ref|XP_003470709.1| PREDICTED: glycosyltransferase 25 family member 3-like [Cavia
porcellus]
Length = 591
Score = 52.4 bits (124), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 39/203 (19%), Positives = 87/203 (42%), Gaps = 22/203 (10%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYI----H 343
PSV++++ L +L + L+YP +++++ +N + + +++ H
Sbjct: 27 LPSVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNIDNTTAMLREWLAAVGH 86
Query: 344 NFKTMF-------------KNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
++ + + K+ E + A+ + G D+ + D+D+
Sbjct: 87 HYAAVIWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARAWGADYILFADTDNI 146
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L++L + ++AP+L +SNFW + G+Y R+ DY N + +G
Sbjct: 147 LTNNQTLRFLTEQALPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTTDYFPTKNRQR--QG 203
Query: 451 IWNVPYITNCYLMKTSVIKATNI 473
+ VP + + +L+ A +
Sbjct: 204 CFRVPMVHSTFLVSLRAEGADQL 226
>gi|156355246|ref|XP_001623582.1| predicted protein [Nematostella vectensis]
gi|156210297|gb|EDO31482.1| predicted protein [Nematostella vectensis]
Length = 344
Score = 52.4 bits (124), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 45/179 (25%), Positives = 81/179 (45%), Gaps = 21/179 (11%)
Query: 560 DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
+V+ P+ TE FC +F++ +E + + SD Y + + +G +
Sbjct: 134 EVYRLPVFTESFCEQFIEELEHF-ESSDVPRGRPNTMNNY------GVLLSDLGFDEHFI 186
Query: 620 EFLRK-YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
LR+ Y+ P+ F + + + + +F V Y P + L H+D++ T+++ L
Sbjct: 187 NPLRREYLQPITALLFPQWGGDGLDSHKAFTVHYMPGKDTELSYHYDNAEVTLSVCL--- 243
Query: 679 GVDYEGGGCRF--IRY------NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
G ++ GG F +R C R + L+H G+ H H L TQG+RY +I
Sbjct: 244 GREFSGGDLYFGDMRQVLLEDTQCTEVENRPTYGLLHRGQ--HMHGALPTTQGSRYNLI 300
>gi|395824281|ref|XP_003785399.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 1
[Otolemur garnettii]
Length = 515
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/138 (24%), Positives = 66/138 (47%), Gaps = 9/138 (6%)
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
G D+ + D+D+ L N L+ LV++ ++AP+L +SNFW + G+Y R+ +
Sbjct: 59 GADYILFADTDNILTNNQTLRLLVDQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 117
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNL 492
Y N + +G ++VP + + +L+ A + YT D + F
Sbjct: 118 YFPTKNRQR--RGCFSVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYAC 174
Query: 493 RNKGIHLKIDSTQEYGHL 510
+ G+ + + + Q YG++
Sbjct: 175 QAAGVSVHVCNDQRYGYM 192
>gi|303272359|ref|XP_003055541.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463515|gb|EEH60793.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 896
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 14/180 (7%)
Query: 556 QPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV-GL 614
Q PD P+++E+ C E++ EA+ + G R + AVPT D+ + V L
Sbjct: 725 QTAPDA---PLLSERECLEWIAAAEAHAAKTRGGWTTSR----HYAVPTTDLPVHAVEAL 777
Query: 615 AGVWAEFLRKYVVPLQEREF--IGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
W + +R+ + PL + VR FVVRY Q L H D S ++
Sbjct: 778 VPRWNDLMREKLSPLLAAACADVVARASSVRVHDVFVVRYDASAQHHLPIHVDQSAVSLT 837
Query: 673 IALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
+ALN G ++GGG F + G + G L H G VT+G RY++ +F+
Sbjct: 838 LALNG-GDAFDGGGTTFADLGVTCS-PETGHAAVFRGDL--RHGGAPVTRGVRYVVAAFL 893
>gi|397575536|gb|EJK49747.1| hypothetical protein THAOC_31344 [Thalassiosira oceanica]
Length = 517
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 46/80 (57%), Gaps = 4/80 (5%)
Query: 656 EQPSLRPHHDSSTYTINIALNQ-VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHY 714
E+ L H D ST+T IAL++ G DY GGG F N V R G ML+ G+L
Sbjct: 435 ERQKLELHTDKSTWTFLIALSEGRGTDYSGGGTFFQALNSTVHLQR-GQMLIFRGKL--R 491
Query: 715 HEGLQVTQGTRYIMISFVDP 734
H G++++ G RY+++ F+ P
Sbjct: 492 HAGVRISWGCRYLLVGFLVP 511
>gi|426226143|ref|XP_004007209.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase 25 family
member 3 [Ovis aries]
Length = 652
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/138 (24%), Positives = 64/138 (46%), Gaps = 9/138 (6%)
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW + G+Y R+ D
Sbjct: 224 GADYILFADTDNILTNNQTLQLLIEQGLPVVAPMLDS-QTYYSNFWCGITPQGYYRRTAD 282
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNL 492
Y N + +G + VP + + +L+ + T Y + D + F
Sbjct: 283 YFPTKNRQR--RGCFRVPMVHSTFLVSLRA-EGTAQLAFYPPHPNYTWPFDDIIVFAYAC 339
Query: 493 RNKGIHLKIDSTQEYGHL 510
+ G+ + + + Q YG+L
Sbjct: 340 QAAGVSVHVCNEQRYGYL 357
>gi|195576767|ref|XP_002078245.1| GD23349 [Drosophila simulans]
gi|194190254|gb|EDX03830.1| GD23349 [Drosophila simulans]
Length = 803
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 90/192 (46%), Gaps = 22/192 (11%)
Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
P+VLI++ + L FL+ + +Y ++I++++ ++ + L ++ N +
Sbjct: 30 PTVLIALLVRNKAHILPMFLSYLERQDYSKERIAIWLRCDHSNDDSIDLLRQWLDNSGDL 89
Query: 349 FKNVKY---IAHNSTVN---------SKEARNLAV-ENSLHKG----VDFYFYVDSDSHL 391
+ +V Y S VN S+ +A+ E + G D+ F++D+D L
Sbjct: 90 YHSVSYEFKPEEQSFVNETSPYEWPASRFKHLIALKEEAFQYGRDIWADYVFFLDADVLL 149
Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
+ D LK L ++AP+L+ +SNFW + D +Y R+ +Y I + + +G
Sbjct: 150 TSKDSLKVLTRLQLPIVAPMLISE-SLYSNFWCGMTEDYYYRRTDEYKEIYHAKK--QGS 206
Query: 452 WNVPYITNCYLM 463
+ VP + L+
Sbjct: 207 FPVPMVHTAVLV 218
>gi|170038076|ref|XP_001846879.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167881499|gb|EDS44882.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 496
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 52/90 (57%), Gaps = 3/90 (3%)
Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
++D+D L NP L LV+ + ++AP+L+ +SNFW + D +Y R+ +Y I+N
Sbjct: 27 FLDADVFLTNPKTLTKLVSLSLPIVAPMLLSD-GLYSNFWCGMTPDYYYERTEEYKEILN 85
Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKATNI 473
G G + VP + + ++ ++++A N+
Sbjct: 86 --YGKTGEFTVPMVHSAVMVNINLLEAKNL 113
>gi|47223918|emb|CAG06095.1| unnamed protein product [Tetraodon nigroviridis]
Length = 660
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 54/97 (55%), Gaps = 3/97 (3%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A++ + D+ VD D+ L N ++L L+ N++++AP+L A+SNFW +
Sbjct: 165 RQAALDTAREIWADYLLVVDCDNLLTNRELLWKLMRENKTVVAPML-ESRAAYSNFWCGM 223
Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
+ G+Y R+ Y+ I ++ +G + VP + + L+
Sbjct: 224 TSQGYYKRTPAYVPIRKRER--RGCFAVPMVHSTLLV 258
>gi|349804117|gb|AEQ17531.1| putative procollagen-lysine 2-oxoglutarate 5-dioxygenase 3
[Hymenochirus curtipes]
Length = 111
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/92 (26%), Positives = 50/92 (54%), Gaps = 1/92 (1%)
Query: 312 KIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAV 371
++ L+YP ++S++++N++ YH + K F ++K + ++ EAR++ +
Sbjct: 1 RLVLLDYPRNRLSLYIHNSEVYHEKHIQAFWEKHKEDFSSLKIVGPEEALSQGEARDMGM 60
Query: 372 ENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVN 402
+ + D+Y+ VD+D L NPD L L+
Sbjct: 61 DLCRQDETCDYYYSVDADVVLTNPDTLYILIQ 92
>gi|242007889|ref|XP_002424750.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212508253|gb|EEB12012.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 327
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 56/101 (55%), Gaps = 8/101 (7%)
Query: 363 SKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNF 422
++A N+A EN DF F V+ D L + + KYLV +N ++ P+L + +SNF
Sbjct: 94 KEKALNVAREN----WADFIF-VNCDVFLTDNETFKYLVRQNHTVTGPML-KSIGLYSNF 147
Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
W + + +Y R+ DY I+ ++ KG +NVP I + ++
Sbjct: 148 WCGMTSKYYYMRTDDYKPILKREK--KGCFNVPMIHSALII 186
>gi|384498735|gb|EIE89226.1| hypothetical protein RO3G_13937 [Rhizopus delemar RA 99-880]
Length = 239
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 4/86 (4%)
Query: 648 FVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMH 707
F+V+Y +EQ L H D ++I + ++ D+EGGG F + V G H
Sbjct: 142 FLVKYSAEEQRGLGLHADGCLFSITLLISHPD-DFEGGGTYFASID-QVVHLGQGDCAYH 199
Query: 708 PGRLTHYHEGLQVTQGTRYIMISFVD 733
R+ H G+++T+G RY+++ F+D
Sbjct: 200 DARV--MHSGMEITKGERYVLVGFID 223
>gi|323451040|gb|EGB06918.1| hypothetical protein AURANDRAFT_14444, partial [Aureococcus
anophagefferens]
Length = 172
Score = 50.8 bits (120), Expect = 0.003, Method: Composition-based stats.
Identities = 46/180 (25%), Positives = 73/180 (40%), Gaps = 34/180 (18%)
Query: 562 FWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEF 621
+ FP+ TE FC + ++A+ + + G + D M+++G +
Sbjct: 1 YAFPLFTEAFCARLLADLDAWERSPLPRRRPNSMNAG--GLVVNDCGMERLG-----DDL 53
Query: 622 LRKYVVPLQEREF--------IGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINI 673
L + V PL F + +HH F VRY E +L HHD+S T+N+
Sbjct: 54 LARVVGPLASTLFGDEVFARSLDHHH-------LFAVRYAVGEDETLAMHHDASEVTLNV 106
Query: 674 ALNQVGVDYEGGGCRFI--------RYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
L G +EGG +F R R+G ++H GR H H ++ G R
Sbjct: 107 CLGTAG--FEGGALQFCGRVGDGDHRAASGAFDHRVGTAVLHLGR--HRHGVARLASGER 162
>gi|351697039|gb|EHA99957.1| Glycosyltransferase 25 family member 3 [Heterocephalus glaber]
Length = 644
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/194 (19%), Positives = 83/194 (42%), Gaps = 22/194 (11%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYI----H 343
PSV++++ L +L + L+YP +++++ +N + + +++ H
Sbjct: 79 LPSVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTAMLREWLAAVGH 138
Query: 344 NFKTMF-------------KNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
++ + K+ E + A+ + G D+ + D+D+
Sbjct: 139 HYAAVIWRPEGEPRSYPDEGGPKHWTRERHQFLMELKQEALTFARDWGADYILFADTDNI 198
Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
L N L+ L + ++AP+L +SNFW + G+Y R+ DY N + +G
Sbjct: 199 LTNNRTLRLLTEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 255
Query: 451 IWNVPYITNCYLMK 464
+ VP + + +L+
Sbjct: 256 CFRVPMVHSTFLVS 269
>gi|323451068|gb|EGB06946.1| putative 2OG-Fe(II) oxidoreductase like-protein [Aureococcus
anophagefferens]
Length = 312
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/245 (24%), Positives = 101/245 (41%), Gaps = 54/245 (22%)
Query: 510 LVDSENFDPQKTNPEVYE-LIRNPLDWDLRYIHPEYQKSLLPDTVN-----------NQP 557
L E + P + PE++E L+R+ E+ L D V +
Sbjct: 26 LSPEEAYAPLRRTPELFESLLRD-----------EWLAPTLLDVVQAARRGQCHRDLREE 74
Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTR--------DIHM 609
P VF F + ++ FC +F+Q ++ Y +++G +P R + +
Sbjct: 75 APGVFSFAMFSDAFCRDFLQEVDGY------------MDSG---LPIRRPNSMNNYGLIV 119
Query: 610 KQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY 669
++G+ V +E R+ + P+ R A SF+V+YR E P L H D S
Sbjct: 120 NEIGMLDVISELQREVLWPIA-RSLWPKEGSAFHAHHSFMVQYRKTEDPGLDMHTDDSDV 178
Query: 670 TINIALNQV----GVDYEGGGCRFIRYNCNVTATRM-GWMLMHPGRLTHYHEGLQVTQGT 724
T N+ L +V G+ + GG R R+ + G ++H G + H ++ GT
Sbjct: 179 TFNVCLGEVFAGAGLTFCGGMRRETRHRFAFQYEHVKGRAVVHLG--SKRHGADDISSGT 236
Query: 725 RYIMI 729
R +I
Sbjct: 237 RRNLI 241
>gi|156355248|ref|XP_001623583.1| predicted protein [Nematostella vectensis]
gi|156210298|gb|EDO31483.1| predicted protein [Nematostella vectensis]
Length = 285
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/179 (24%), Positives = 80/179 (44%), Gaps = 21/179 (11%)
Query: 560 DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
+V+ P+ TE FC +F++ +E + + SD Y + + +G +
Sbjct: 75 EVYRLPVFTESFCEQFIEELEHF-ESSDVPRGRPNTMNNY------GVLLSDLGFDEHFI 127
Query: 620 EFLRK-YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
LR+ Y+ P+ F + + + + +F V Y P + L H+D++ T+++ L
Sbjct: 128 NPLRREYLQPITALLFPQWGGDGLDSHKAFTVHYMPGKDTELSYHYDNAEVTLSVCL--- 184
Query: 679 GVDYEGGGCRF--IRY------NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
G ++ GG F +R C R + L+H G+ H L TQG+RY +I
Sbjct: 185 GREFSGGDLYFGDMRQVLLEDTQCTEVENRPTYGLLHRGQ--QMHGALPTTQGSRYNLI 241
>gi|160395571|sp|Q5U309.2|GT253_RAT RecName: Full=Probable inactive glycosyltransferase 25 family
member 3; AltName: Full=Cerebral endothelial cell
adhesion molecule
Length = 572
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/167 (22%), Positives = 75/167 (44%), Gaps = 13/167 (7%)
Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
E + A+ + G D+ + D+D+ L N L+ L++R ++AP+L +SNFW
Sbjct: 101 ELKQEALAFARDWGADYILFADTDNILTNNQTLRLLIDRQLPVVAPMLDSQ-TYYSNFWC 159
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
+ G+Y R+ +Y N + +G + VP + + +L+ + + YT
Sbjct: 160 GITPQGYYRRTAEYFPTKNRQR--QGCFRVPMVHSTFLVSLQTEETARLAFYPPHPNYTW 217
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL----VDSENFDPQKTN 522
D + F + G+ + + + YG++ + + +KTN
Sbjct: 218 -PFDDIIVFAYACQAAGVSVHVCNDHRYGYMNVGVKPHQGLEEEKTN 263
>gi|58865502|ref|NP_001011962.1| probable inactive glycosyltransferase 25 family member 3 [Rattus
norvegicus]
gi|55249709|gb|AAH85782.1| Cerebral endothelial cell adhesion molecule [Rattus norvegicus]
Length = 517
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/167 (22%), Positives = 75/167 (44%), Gaps = 13/167 (7%)
Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
E + A+ + G D+ + D+D+ L N L+ L++R ++AP+L +SNFW
Sbjct: 46 ELKQEALAFARDWGADYILFADTDNILTNNQTLRLLIDRQLPVVAPMLDSQ-TYYSNFWC 104
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
+ G+Y R+ +Y N + +G + VP + + +L+ + + YT
Sbjct: 105 GITPQGYYRRTAEYFPTKNRQR--QGCFRVPMVHSTFLVSLQTEETARLAFYPPHPNYTW 162
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL----VDSENFDPQKTN 522
D + F + G+ + + + YG++ + + +KTN
Sbjct: 163 -PFDDIIVFAYACQAAGVSVHVCNDHRYGYMNVGVKPHQGLEEEKTN 208
>gi|83944028|ref|ZP_00956485.1| hypothetical protein EE36_10295 [Sulfitobacter sp. EE-36]
gi|83845275|gb|EAP83155.1| hypothetical protein EE36_10295 [Sulfitobacter sp. EE-36]
Length = 303
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 56/130 (43%), Gaps = 12/130 (9%)
Query: 603 PTRDIHMKQVGLAGVWAEFLRKYVVPLQEREF---IGYHHEPVRAPMSFVVRYRPDEQPS 659
P + ++ G G + E + Y+ P+ F +GY + F +R++ + S
Sbjct: 138 PRSEGYLAAPGFQGFYREMMDAYMRPVSRLLFPDVVGYDTQT----FGFSIRWQASKDTS 193
Query: 660 LRPHHDSSTYTINIALNQVGVDYEGGGCRFI---RYNCNVTATRMGWMLMHPGRLTHYHE 716
LRPH D+S T+NI LN Y G FI G L+H G + H E
Sbjct: 194 LRPHSDASAVTLNINLNLPDEWYSGSAVSFIDPVSRRVEKLTFEPGTALIHHGSVPHASE 253
Query: 717 GLQVTQGTRY 726
+T+G RY
Sbjct: 254 --PITEGERY 261
>gi|432095371|gb|ELK26570.1| Glycosyltransferase 25 family member 3 [Myotis davidii]
Length = 559
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/137 (23%), Positives = 63/137 (45%), Gaps = 9/137 (6%)
Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDY 438
D+ + DSD+ L N L+ L+ + ++AP+L +SNFW + G+Y R+ +Y
Sbjct: 102 ADYILFADSDNILTNSQTLRLLIEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEY 160
Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLR 493
N + +G + VP + + +L+ A + YT D + F + +
Sbjct: 161 FPTKNRQR--RGCFQVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYSCQ 217
Query: 494 NKGIHLKIDSTQEYGHL 510
G+ + + + YG++
Sbjct: 218 AAGVSVHVCNEHRYGYM 234
>gi|296190932|ref|XP_002743399.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 2
[Callithrix jacchus]
Length = 548
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/138 (23%), Positives = 63/138 (45%), Gaps = 9/138 (6%)
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW + G+Y R+ +
Sbjct: 90 GADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 148
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNL 492
Y N + +G + VP + + +L+ A I YT D + F
Sbjct: 149 YFPTKNRQR--RGCFRVPMVHSTFLVSLRAEGADQIAFYPPHPNYTW-PFDDIIVFAYAC 205
Query: 493 RNKGIHLKIDSTQEYGHL 510
+ G+ + + + YG++
Sbjct: 206 QAAGVSVHVCNEHRYGYM 223
>gi|335281050|ref|XP_003353725.1| PREDICTED: glycosyltransferase 25 family member 3-like isoform 2
[Sus scrofa]
Length = 517
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/138 (22%), Positives = 64/138 (46%), Gaps = 9/138 (6%)
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW + G+Y R+ +
Sbjct: 59 GADYILFADTDNILTNNQTLRLLIEQQLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 117
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNL 492
Y N + +G + VP + + +L+ + T + Y + D + F
Sbjct: 118 YFPTKNRQR--RGCFRVPMVHSTFLISLRA-EGTGQLSFYPPHPNYTWPFDDIIVFAYAC 174
Query: 493 RNKGIHLKIDSTQEYGHL 510
+ G+ + + + YG++
Sbjct: 175 QAAGVSVHVCNEHRYGYM 192
>gi|357132260|ref|XP_003567749.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
[Brachypodium distachyon]
Length = 393
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 76/190 (40%), Gaps = 27/190 (14%)
Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIH 608
++ +P P VF FP++ KFC + ++ + W R T Y AV
Sbjct: 158 SIMAEPIPGVFSFPMLQPKFCDMLFEEVDNFESWVHAMKFKIMRPNTMNKYGAV------ 211
Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
+ GL + +F+ K++ P+ + + + + +FVV Y D L H D S
Sbjct: 212 LDDFGLETMLNDFMEKFITPISKVFYPEVGGGTLDSHHAFVVEYGKDRDVELGFHVDDSE 271
Query: 669 YTINIALNQVGVDYEGGGCRFIRYNC----NVTATRM---------GWMLMHPGRLTHYH 715
T+N+ L G + GG F C N A + GW ++H GR H H
Sbjct: 272 VTLNVCL---GKQFSGGQLYFRGVRCENHVNSEAQQEEIYDYPHVPGWAVLHRGR--HRH 326
Query: 716 EGLQVTQGTR 725
+ G R
Sbjct: 327 GARPTSSGLR 336
>gi|335281052|ref|XP_001925614.3| PREDICTED: glycosyltransferase 25 family member 3-like isoform 1
[Sus scrofa]
Length = 555
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 31/138 (22%), Positives = 64/138 (46%), Gaps = 9/138 (6%)
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW + G+Y R+ +
Sbjct: 97 GADYILFADTDNILTNNQTLRLLIEQQLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 155
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNL 492
Y N + +G + VP + + +L+ + T + Y + D + F
Sbjct: 156 YFPTKNRQR--RGCFRVPMVHSTFLISLRA-EGTGQLSFYPPHPNYTWPFDDIIVFAYAC 212
Query: 493 RNKGIHLKIDSTQEYGHL 510
+ G+ + + + YG++
Sbjct: 213 QAAGVSVHVCNEHRYGYM 230
>gi|121583693|ref|NP_001073538.1| 2-oxoglutarate and iron-dependent oxygenase domain-containing
protein 2 [Danio rerio]
gi|118764167|gb|AAI28873.1| Zgc:158437 [Danio rerio]
Length = 345
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 81/186 (43%), Gaps = 21/186 (11%)
Query: 553 VNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV 612
+ + P VF F + ++FC + ++ +E + Q SD Y V + ++
Sbjct: 123 IQTEAAPRVFRFQVFRKEFCKDLLEELEHFEQ-SDAPKGRPNTMNNYGIV------LNEL 175
Query: 613 GLAGVWAEFLRK-YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
G + LR+ Y+ PL + + + +FVV+Y E +L H+D+S T+
Sbjct: 176 GFDEGFITPLREVYLRPLTALLYSDCGGNCLDSHKAFVVKYDMHEDLNLSYHYDNSEVTL 235
Query: 672 NIALNQVGVDYEGGGCRF--------IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
N++L G D+ G F C R+ L+H G+ H H L ++ G
Sbjct: 236 NVSL---GKDFTEGNLFFGDMRQVPLSETECVEVEHRVTEGLLHRGQ--HMHGALSISSG 290
Query: 724 TRYIMI 729
TR+ +I
Sbjct: 291 TRWNLI 296
>gi|158302599|ref|XP_561137.5| Anopheles gambiae str. PEST AGAP012933-PA [Anopheles gambiae str.
PEST]
gi|157021089|gb|EAL42272.3| AGAP012933-PA [Anopheles gambiae str. PEST]
Length = 330
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 395 DVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNV 454
+ L L++R ++AP+LV +SNFW + +D +Y R+ DY I+N DQ G+ W V
Sbjct: 1 NTLGKLIDRKLPIVAPMLVSD-GLYSNFWCGMTSDYYYQRTDDYKKILNYDQIGQ--WPV 57
Query: 455 PYITNCYLMKTSVIKATNI 473
P + L+ ++ + +
Sbjct: 58 PMVHTAVLVSLNIAQTRQL 76
>gi|308801166|ref|XP_003075362.1| Lysyl hydroxylase (ISS) [Ostreococcus tauri]
gi|116061918|emb|CAL52636.1| Lysyl hydroxylase (ISS), partial [Ostreococcus tauri]
Length = 233
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 68/157 (43%), Gaps = 34/157 (21%)
Query: 128 ANIVFGAERLCWP----DTSLYD-------KY--PAVGSGYRYLNSGGFIGYAKDIKELI 174
A I+F AE CWP D L D K+ A GS +YLNSGG IG + E+
Sbjct: 26 ALILFSAEGNCWPHMAGDQELIDGGREYCAKFHDKAKGSSNKYLNSGGVIGPVSALAEMY 85
Query: 175 SN-RSIKNEEDDQ------LYYALLFLDETLRTKHK---IVLDTLANLFQN-------LY 217
RS+ DD+ YA DE T K I LD A +FQ +
Sbjct: 86 QEIRSLMKTVDDEDQMITASVYAKQIDDERSGTHSKRYVIALDHEARVFQTGWHTHLEIT 145
Query: 218 GSLEDIKLN---FDLDEFVHLTNTKYNTNPVIIHGNG 251
G + ++N FD V NT++N+ P I H NG
Sbjct: 146 GKYAEPQVNGAYFDTSLGV-FVNTEHNSTPPIAHFNG 181
>gi|412987619|emb|CCO20454.1| Lysyl hydroxylase (ISS) [Bathycoccus prasinos]
Length = 403
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 53/207 (25%), Positives = 89/207 (42%), Gaps = 52/207 (25%)
Query: 97 TDDMIILVTDSYDVII--DGGVNDILERFNTF--DANI---------VFGAERLCWPDT- 142
+ D I+ + D+ DV+ DG I+E++ DA + + GAER CWP
Sbjct: 186 SGDTIVNIADASDVLYFQDGAT--IMEKYKQIVRDAPVDESRKHTIVLIGAERNCWPSMD 243
Query: 143 ----------SLYDKYPAVG--SGYRYLNSGGFIGYAKDIKELISN-RSI----KNEEDD 185
+++ AV S Y +LNSG +G +K L+ S+ K +DD
Sbjct: 244 GEKELIPGGRKYCEQFKAVSGNSSYHFLNSGSLMGRVDAVKALLKRVESVMDGGKQNDDD 303
Query: 186 QLYYALLFLDETLRTKHK----------IVLDTLANLFQNLYGS--------LEDIKLNF 227
Q + + + ++ K + I+LD A++FQ +GS D +
Sbjct: 304 QQLLQMQY-ERQIKQKSEGGGKEEDAFTILLDHKASIFQTGWGSHLANGRYAARDPNGAY 362
Query: 228 DLDEFVHLTNTKYNTNPVIIHGNGKSK 254
+ + NT++N+ P IIH NG +
Sbjct: 363 YNEAKCAVENTEHNSEPSIIHFNGGKR 389
>gi|223992787|ref|XP_002286077.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977392|gb|EED95718.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 566
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 74/186 (39%), Gaps = 33/186 (17%)
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV-----GLAGVWA 619
P++++ C + I E + T T + AVPT D+ + Q+ +W
Sbjct: 345 PMISQTECQNVINIAEQHAARLGWTT------TRHYAVPTTDVPLHQLIELRPWFYKLWT 398
Query: 620 EFLRKYVVPLQEREFI----GYHHEPVRAPMS---------FVVRYRPDE-QPSLRPHHD 665
LR P R+F V +P S FVVRY Q L PH+D
Sbjct: 399 SRLR----PTLRRQFRISTNTNETATVPSPTSHRDIFIHDVFVVRYDAQGGQRGLPPHYD 454
Query: 666 SSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
ST++ I LN +Y+GGG + G ML G H G V +G R
Sbjct: 455 ESTHSFVIGLN---TEYQGGGTFIHALGRPLKPKVEGGMLSFSGG-EFLHSGDPVVEGIR 510
Query: 726 YIMISF 731
YI++ F
Sbjct: 511 YIIVGF 516
>gi|323455517|gb|EGB11385.1| hypothetical protein AURANDRAFT_14407, partial [Aureococcus
anophagefferens]
Length = 171
Score = 48.5 bits (114), Expect = 0.013, Method: Composition-based stats.
Identities = 41/133 (30%), Positives = 55/133 (41%), Gaps = 17/133 (12%)
Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPM----SFVVRYRPDEQPSLRPHH 664
+ VGL K PL + F + H A SF+V+YR DE P L H
Sbjct: 42 VNDVGLEPFVKALQDKVCGPLAQALFKAHPHGHPAADFDSTHSFIVKYRGDEDPHLDVHT 101
Query: 665 DSSTYTINIALNQVGVDYEGGGCRFI--------RYNCNVTATRMGWMLMHPGRLTHYHE 716
D S T N+ L G D+EG G F R +C R+G + H G + H
Sbjct: 102 DDSDVTFNVCL---GRDFEGCGLVFCGMIGAKDHRQHCKTYEHRVGTCVCHLG--SKRHG 156
Query: 717 GLQVTQGTRYIMI 729
+T+G R +I
Sbjct: 157 ADDITRGERLNLI 169
>gi|7959265|dbj|BAA96026.1| KIAA1502 protein [Homo sapiens]
Length = 560
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 34/151 (22%), Positives = 68/151 (45%), Gaps = 9/151 (5%)
Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
E + A+ + + G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW
Sbjct: 123 ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 181
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
+ G+Y R+ +Y N + +G + VP + + +L A + YT
Sbjct: 182 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 239
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + G+ + + + YG++
Sbjct: 240 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 269
>gi|219114901|ref|XP_002178246.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409981|gb|EEC49911.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 449
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 44/78 (56%), Gaps = 4/78 (5%)
Query: 656 EQPSLRPHHDSSTYTINIAL-NQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHY 714
E+ L H D S +T IAL N G+DYEGGG F + V R G L+ PG+L H
Sbjct: 351 ERQKLDMHTDKSEWTFLIALSNGSGLDYEGGGTFFECLDSTVHVQR-GHALIFPGKLRHC 409
Query: 715 HEGLQVTQGTRYIMISFV 732
G ++T G R++++ F+
Sbjct: 410 --GQRITSGLRFLLVGFL 425
>gi|166240093|ref|XP_001732961.1| hypothetical protein DDB_G0270778 [Dictyostelium discoideum AX4]
gi|165988739|gb|EDR41119.1| hypothetical protein DDB_G0270778 [Dictyostelium discoideum AX4]
Length = 417
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 52/221 (23%), Positives = 93/221 (42%), Gaps = 28/221 (12%)
Query: 523 PEVYELIRNPLDWDLRYIHP--EYQKSLLPD----TVNNQPCPDVFWFPIVTEKFCHEFV 576
PE++EL D D +I P +Y+K+ D + ++ F I T +FC + +
Sbjct: 193 PEIFELREEYFDKD--FIEPIKQYKKTKNQDDLLKALTKLTETRIYSFRIFTMEFCTKLL 250
Query: 577 QIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIG 636
+ +E + T + Y AV + ++G + + Y+ +
Sbjct: 251 EEIENFKNTGLPTARPNSM-NNYGAV------LDEMGFTEFFKQLREDYLSLFTSILYKD 303
Query: 637 YHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI------ 690
Y+ E + + +F V+Y+ D++ L H+D S T+N+ L G ++ GG F
Sbjct: 304 YNGEKLNSHHAFAVQYKMDKEKELGFHYDESDITVNLCL---GSEFTGGSLYFKGILDKP 360
Query: 691 -RYNCNVTATRM-GWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
+N + G L+H G H H L +T G R +I
Sbjct: 361 ETHNEYFEFKHIPGVALIHIG--VHRHGALGLTSGERTNLI 399
>gi|452824176|gb|EME31181.1| hypothetical protein Gasu_16740 [Galdieria sulphuraria]
Length = 291
Score = 48.1 bits (113), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 79/205 (38%), Gaps = 46/205 (22%)
Query: 96 ITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWP--------------- 140
+ D+ I++ D D + + + F D ++ GAE+ CWP
Sbjct: 46 LEDNDIVVFLDGRDAFYMRETSGLRKDFANTDKDLFLGAEKNCWPFSYNPFSNISLNLYD 105
Query: 141 ------------------DTSLYDKYPAVGSG-YRYLNSGGFIGYAKDIKELIS------ 175
DT + + G G Y + N GGFIG K +KE +
Sbjct: 106 PKTQERPWVWSFKAEKLCDTLKLESFKRSGEGPYAFPNGGGFIGRWKKVKEFVDLNWEVF 165
Query: 176 NRSIKNEE-DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVH 234
+ +K E+ DDQ ++ FL L IVLD A+ Q LE + N+ L +
Sbjct: 166 YKVLKPEQRDDQASTSIAFL---LSIDKSIVLDNKAHFIQ-CTDRLEGVFDNYCLSNSTY 221
Query: 235 LTNTKYNTNPVIIHGNGKSKIELNS 259
+ N T P H NG K+ L +
Sbjct: 222 I-NRDTKTFPYFHHHNGGGKVYLET 245
>gi|145343554|ref|XP_001416384.1| Protein Lysyl hydroxylase fusion protein, putative [Ostreococcus
lucimarinus CCE9901]
gi|144576609|gb|ABO94677.1| Protein Lysyl hydroxylase fusion protein, putative [Ostreococcus
lucimarinus CCE9901]
Length = 618
Score = 48.1 bits (113), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 39/168 (23%), Positives = 78/168 (46%), Gaps = 11/168 (6%)
Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV-GLAGVWAEFLRKY 625
++ C +++ EA+ G + D+ +++V T D+ + ++ + W +
Sbjct: 457 ISPSACSSWIKTAEAHATNRGGWDTDR-----HKSVATTDLPIHEIPSVLREWNLIFGQI 511
Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRY-RPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
+ P + F +R +F+V+Y D Q L H D ++I ++LN + Y+G
Sbjct: 512 IGPFIQERFRVDGDTNLRVHDAFIVKYDASDGQCQLPVHTDQGHFSITLSLND-PIQYKG 570
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
GG F + + + G + LTH G+ +T G RYI+++F+
Sbjct: 571 GGTIFPEHE-FIVRPKCGDFVAFRSYLTH--GGVPITSGVRYIVVAFL 615
>gi|412993664|emb|CCO14175.1| predicted protein [Bathycoccus prasinos]
Length = 289
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 21/104 (20%)
Query: 647 SFVVRY--RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI-------------- 690
+FV+RY + ++ LR H D + ++L+ +YEGGG F
Sbjct: 142 AFVIRYDGKSEKDSHLRMHQDDGPISFQVSLSDAD-EYEGGGTNFYEAKRRRTQFEEKSA 200
Query: 691 --RYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
R NV ++G +L+H G++ H EG +VT G RY ++ F+
Sbjct: 201 KERAKTNVKLEKIGDVLVHGGQIDH--EGAKVTSGLRYTLVYFL 242
>gi|297460155|ref|XP_001255106.3| PREDICTED: glycosyltransferase 25 family member 3-like, partial
[Bos taurus]
Length = 282
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 37/177 (20%), Positives = 78/177 (44%), Gaps = 22/177 (12%)
Query: 306 LEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFDDYI------HNFKTMF 349
L +L + L+YP +++++ + +E+ A + D+Y +
Sbjct: 71 LPHYLGALERLDYPRARLALWCATDHNVDNTTAMLREWLAAVGDNYAAVVWRPEGEPRSY 130
Query: 350 KNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNES 406
+ + H + + E + A+ + G D+ + D+D+ L N L+ L+
Sbjct: 131 PDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNILTNNQTLRLLIEPGLP 190
Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
++AP+L +SNFW + G+Y R+ DY N + +G + VP + + +L+
Sbjct: 191 VVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RGCFRVPMVHSTFLV 244
>gi|111185604|gb|AAI19700.1| Cerebral endothelial cell adhesion molecule [Homo sapiens]
Length = 517
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 34/151 (22%), Positives = 69/151 (45%), Gaps = 9/151 (5%)
Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
E + A+ + + G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW
Sbjct: 46 ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 104
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
+ G+Y R+ +Y N + +G ++VP + + +L A + YT
Sbjct: 105 GITPQGYYRRTAEYFPTKNRQR--RGCFHVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 162
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + G+ + + + YG++
Sbjct: 163 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 192
>gi|5764665|gb|AAD51367.1|AF177203_1 cerebral cell adhesion molecule [Homo sapiens]
Length = 517
Score = 47.8 bits (112), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 34/151 (22%), Positives = 68/151 (45%), Gaps = 9/151 (5%)
Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
E + A+ + + G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW
Sbjct: 46 ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 104
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
+ G+Y R+ +Y N + +G + VP + + +L A + YT
Sbjct: 105 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 162
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + G+ + + + YG++
Sbjct: 163 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 192
>gi|397503528|ref|XP_003822374.1| PREDICTED: glycosyltransferase 25 family member 3 [Pan paniscus]
Length = 533
Score = 47.8 bits (112), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 34/151 (22%), Positives = 69/151 (45%), Gaps = 9/151 (5%)
Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
E + A+ + + G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW
Sbjct: 62 ELKQEALTFARNWGADYILFADTDNILTNNQTLQLLMGQGLPVVAPMLDSQ-TYYSNFWC 120
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
+ G+Y R+ +Y N + +G + VP + + +L+ A + YT
Sbjct: 121 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW 178
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + G+ + + + YG++
Sbjct: 179 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 208
>gi|119608196|gb|EAW87790.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_d [Homo
sapiens]
Length = 534
Score = 47.8 bits (112), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 34/151 (22%), Positives = 68/151 (45%), Gaps = 9/151 (5%)
Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
E + A+ + + G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW
Sbjct: 63 ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 121
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
+ G+Y R+ +Y N + +G + VP + + +L A + YT
Sbjct: 122 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 179
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + G+ + + + YG++
Sbjct: 180 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 209
>gi|22760015|dbj|BAC11036.1| unnamed protein product [Homo sapiens]
gi|22760023|dbj|BAC11040.1| unnamed protein product [Homo sapiens]
gi|111185706|gb|AAI19699.1| Cerebral endothelial cell adhesion molecule [Homo sapiens]
gi|119608195|gb|EAW87789.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_c [Homo
sapiens]
gi|127802779|gb|AAH98432.2| Cerebral endothelial cell adhesion molecule [Homo sapiens]
Length = 517
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 34/151 (22%), Positives = 68/151 (45%), Gaps = 9/151 (5%)
Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
E + A+ + + G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW
Sbjct: 46 ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 104
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
+ G+Y R+ +Y N + +G + VP + + +L A + YT
Sbjct: 105 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 162
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + G+ + + + YG++
Sbjct: 163 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 192
>gi|194388556|dbj|BAG60246.1| unnamed protein product [Homo sapiens]
Length = 548
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 34/151 (22%), Positives = 68/151 (45%), Gaps = 9/151 (5%)
Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
E + A+ + + G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW
Sbjct: 77 ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 135
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
+ G+Y R+ +Y N + +G + VP + + +L A + YT
Sbjct: 136 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 193
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + G+ + + + YG++
Sbjct: 194 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 223
>gi|119608194|gb|EAW87788.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_b [Homo
sapiens]
Length = 539
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 34/151 (22%), Positives = 68/151 (45%), Gaps = 9/151 (5%)
Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
E + A+ + + G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW
Sbjct: 68 ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 126
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
+ G+Y R+ +Y N + +G + VP + + +L A + YT
Sbjct: 127 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 184
Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
D + F + G+ + + + YG++
Sbjct: 185 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 214
>gi|281202578|gb|EFA76780.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
pallidum PN500]
Length = 461
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 57/130 (43%), Gaps = 15/130 (11%)
Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
+ ++G G + E Y+ P + Y+ + + +FVV+Y+ D++ L H+D S
Sbjct: 325 LDEMGFTGFFTELRENYLKPFTSVLYADYNGAQLDSHHAFVVQYKIDKEKELGFHYDESD 384
Query: 669 YTINIALNQVGVDYEGGGCRFIRYNCNVTATRM---------GWMLMHPGRLTHYHEGLQ 719
T+N+ L G + GG F R + T G L+H G H H L
Sbjct: 385 VTLNLCL---GKQFTGGSLYF-RGILDKPETHQEYFEVKHTPGTALLHIG--VHRHGALG 438
Query: 720 VTQGTRYIMI 729
+T G R +I
Sbjct: 439 ITSGERTNLI 448
>gi|338720573|ref|XP_001499943.3| PREDICTED: glycosyltransferase 25 family member 3-like [Equus
caballus]
Length = 517
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 31/137 (22%), Positives = 62/137 (45%), Gaps = 9/137 (6%)
Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDY 438
D+ + D+D+ L N L+ L+ + ++AP+L +SNFW + G+Y R+ +Y
Sbjct: 60 ADYILFADTDNILTNNQTLRLLIEKGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEY 118
Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLR 493
N + +G + VP + + +L+ A + YT D + F +
Sbjct: 119 FPTKNRQR--RGCFRVPMVHSTFLISLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYACQ 175
Query: 494 NKGIHLKIDSTQEYGHL 510
G+ + + + YG++
Sbjct: 176 AAGVSVHVCNQHRYGYM 192
>gi|355753026|gb|EHH57072.1| hypothetical protein EGM_06633 [Macaca fascicularis]
Length = 536
Score = 47.4 bits (111), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 32/138 (23%), Positives = 63/138 (45%), Gaps = 9/138 (6%)
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW + G+Y R+ +
Sbjct: 81 GADYILFADTDNILTNNQTLRLLLGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 139
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNL 492
Y N + +G + VP + + +L+ A + YT D + F
Sbjct: 140 YFPTKNRQR--RGCFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYAC 196
Query: 493 RNKGIHLKIDSTQEYGHL 510
+ G+ + + + YG++
Sbjct: 197 QAAGVAVHVCNEHRYGYI 214
>gi|76154956|gb|AAX26343.2| SJCHGC08516 protein [Schistosoma japonicum]
Length = 264
Score = 47.4 bits (111), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 47/219 (21%), Positives = 92/219 (42%), Gaps = 47/219 (21%)
Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY------------------NNQ 331
P++ I V I L FLN + YP K+I + Y N +
Sbjct: 19 MPTLCIGVLIRNKAHTLPYFLNGLEQQQYPTKRIILIFYVDNTIDTSELILNAWIQCNQK 78
Query: 332 EYHAPLFD-DYIHNFKTMFKNV-KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
+YH + + D + + ++NV K N + + R ++ + +FY +D+D
Sbjct: 79 KYHKIILEVDKSNTSQLEYENVNKMWTVNHYQHVIKLRQKLLDKARDLWANFYLSIDADV 138
Query: 390 HLDNPDVLKYLVNRNES------------------------LIAPLL-VRPFKAWSNFWG 424
L N +++L+N ++APL+ + +SNFWG
Sbjct: 139 ILMNSLTIEHLINAMHPSQSNNNNNNNNDNNNNNTMNKNIIILAPLINCTTSEYYSNFWG 198
Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
A++ +G+Y RS Y +++ + +G++ V + + +L+
Sbjct: 199 AMSEEGYYVRSEHYFDLL--KRHIQGVYPVAMVHSIFLV 235
>gi|323452702|gb|EGB08575.1| hypothetical protein AURANDRAFT_63938 [Aureococcus anophagefferens]
Length = 640
Score = 47.4 bits (111), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 6/88 (6%)
Query: 648 FVVRY---RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWM 704
F VRY P Q L H D+S + ++A++ D+ GGG RF+ + G +
Sbjct: 516 FYVRYDADAPGAQTELEAHRDASLLSFSVAMSSPD-DFVGGGTRFVGSGRVLRPEAAGDL 574
Query: 705 LMHPGRLTHYHEGLQVTQGTRYIMISFV 732
+ H G++ H G VT G R I++ FV
Sbjct: 575 VAHSGKV--LHAGEAVTAGVRDILVGFV 600
>gi|47181445|emb|CAG13372.1| unnamed protein product [Tetraodon nigroviridis]
Length = 47
Score = 47.4 bits (111), Expect = 0.028, Method: Composition-based stats.
Identities = 20/45 (44%), Positives = 29/45 (64%), Gaps = 1/45 (2%)
Query: 450 GIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLR 493
G+WN+PY+ + YL+K S +K N + + L +D DMAFC N R
Sbjct: 1 GVWNIPYMAHVYLIKGSALKKELNERNYFVLEKLDPDMAFCRNAR 45
>gi|397639611|gb|EJK73670.1| hypothetical protein THAOC_04695, partial [Thalassiosira oceanica]
Length = 543
Score = 47.4 bits (111), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 34/151 (22%), Positives = 66/151 (43%), Gaps = 6/151 (3%)
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN---ADGFYAR 434
G VDSD + ++ L + S++ RP K W+N+W ++ +Y R
Sbjct: 41 GAGQALMVDSDVIIARSSAVQDLAGWSRSVVVGHAQRPGKYWANYWTDMDHTAGSQWYKR 100
Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLR 493
FD ++I N + +G++ VP+ L++ K ++ + + D C
Sbjct: 101 GFDTLDIYN--RARQGLFQVPFGRGLVLVQRGEFKRLADLFSKLKGHGSDTMRRLCLKST 158
Query: 494 NKGIHLKIDSTQEYGHLVDSENFDPQKTNPE 524
+ G+ L ID+ + YG + D + + + E
Sbjct: 159 DVGLPLYIDNQRNYGRIYDPDAKESDANDDE 189
>gi|440791367|gb|ELR12605.1| ankyrin repeat-containing protein [Acanthamoeba castellanii str.
Neff]
Length = 463
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 39/178 (21%), Positives = 74/178 (41%), Gaps = 26/178 (14%)
Query: 559 PDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW 618
P ++ FP++ FC ++ ++ Q + Y + + VG +
Sbjct: 285 PGIYSFPMLKLSFCDRLLEELDHLEQSGLSLKRPNSM-NAYGVI------LSDVGFKEMM 337
Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
+ +R+YV PL + + + SFVV+Y+ E L+ H D S T+N++L
Sbjct: 338 HQLMRRYVCPLATLLYAEQGGDSLDRLHSFVVKYKIGEDLDLKEHVDDSEVTLNVSL--- 394
Query: 679 GVDYEGGGCRF-----------IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
G + GG F + C + G ++H G +H+H L++++ R
Sbjct: 395 GKSFAGGDLDFNGVANTPTSKNDHFTCGHSP---GVAVLHLG--SHWHSALKISECRR 447
>gi|356536266|ref|XP_003536660.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
[Glycine max]
Length = 344
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 40/194 (20%), Positives = 82/194 (42%), Gaps = 25/194 (12%)
Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIH--- 608
++ +PC V+ F ++ +FC + + ++ + +W GT +L+ ++ H
Sbjct: 111 SIMAEPCKGVYTFEMLQPQFCKKLMSEVDHFERWVHGT----KLKIMRPNAMNKNKHGVI 166
Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
+ + F+ ++ P+ + + + + FVV Y ++ L H D +
Sbjct: 167 LDDFAFEAMLDRFMCDFIRPISQVFYPELGGSSLDSHHGFVVEYGINKDVELGLHEDEAE 226
Query: 669 YTINIALNQVGVDYEGGGCRF--IRYNCNVTATRM-----------GWMLMHPGRLTHYH 715
T+N+ L G ++ GG F +R + +VT+ G ++HPGR + H
Sbjct: 227 VTLNVCL---GKEFSGGELFFQGVRCDAHVTSNAQPEEAFNYSHVPGHAILHPGR--NRH 281
Query: 716 EGLQVTQGTRYIMI 729
T G R +I
Sbjct: 282 GARPTTSGNRMNLI 295
>gi|428169371|gb|EKX38306.1| hypothetical protein GUITHDRAFT_144412 [Guillardia theta CCMP2712]
Length = 233
Score = 47.0 bits (110), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 48/104 (46%), Gaps = 10/104 (9%)
Query: 83 KVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
K+ LK+ LD + +D II+ D+YDV+ + + +L+ F +V+ AE C
Sbjct: 39 KIYALKDFLDMVPFEEDNIIVFVDAYDVLFNRTIKYLLKEFLKMKHRVVYSAEVGCSAGR 98
Query: 143 SLYDK----------YPAVGSGYRYLNSGGFIGYAKDIKELISN 176
+ YP V + YLNSG + Y + +K + +
Sbjct: 99 EALSRRSTACDRGWPYPGVNTVAPYLNSGATMAYQRQLKLFLES 142
>gi|355567431|gb|EHH23772.1| hypothetical protein EGK_07313, partial [Macaca mulatta]
Length = 595
Score = 47.0 bits (110), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 32/138 (23%), Positives = 62/138 (44%), Gaps = 9/138 (6%)
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
G D+ + D+D+ L N L+ L+ + ++AP+L +SNFW + G+Y R+ +
Sbjct: 137 GADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 195
Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNL 492
Y N + +G VP + + +L+ A + YT D + F
Sbjct: 196 YFPTKNRQR--RGCLRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYAC 252
Query: 493 RNKGIHLKIDSTQEYGHL 510
+ G+ + + + YG++
Sbjct: 253 QAAGVAVHVCNEHRYGYI 270
>gi|359451740|ref|ZP_09241134.1| hypothetical protein P20480_3882 [Pseudoalteromonas sp. BSi20480]
gi|358042468|dbj|GAA77383.1| hypothetical protein P20480_3882 [Pseudoalteromonas sp. BSi20480]
Length = 304
Score = 47.0 bits (110), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 62/152 (40%), Gaps = 20/152 (13%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
G DKR E GY A P+ + E L Y+ P+ E GY +
Sbjct: 134 GVMLDKRSE-GYLAAPS---------FQTFYNEMLNTYMRPIARLLFPEITGYDTQT--- 180
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF---IRYNCNVTATRM 701
F + Y P S+RPH D+S T+NI LN G ++ G F + + +
Sbjct: 181 -FGFSIYYDPSTDASIRPHTDASAVTLNINLNLPGEEFTGSELDFYDLVTGKVTQLSFKP 239
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
G ++H G + H + + T +++ F D
Sbjct: 240 GIAMIHRGSVAHAAKPITSGDRTNFVLWLFGD 271
>gi|351702454|gb|EHB05373.1| Glycosyltransferase 25 family member 1, partial [Heterocephalus
glaber]
Length = 517
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 64/130 (49%), Gaps = 12/130 (9%)
Query: 388 DSHLDNPD-VLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
+S L++P V L+ N +++AP+L A+SNFW + G+Y R+ Y+ I ++
Sbjct: 82 ESALNSPGFVSSLLIAENRTVVAPML-DSRAAYSNFWCGMTPQGYYRRTPAYIPIRKRER 140
Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA------FCTNLRNKGIHLK 500
+G + VP + + +L+ + KA + + DY A F + + G+ +
Sbjct: 141 --QGCFAVPMVHSTFLL--DLRKAASRSLAFYPPHPDYTWAFDDIIVFAFSCKQAGVQMY 196
Query: 501 IDSTQEYGHL 510
+ + Q YG L
Sbjct: 197 VCNKQVYGFL 206
>gi|299473298|emb|CBN77697.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 713
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 74/170 (43%), Gaps = 13/170 (7%)
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW-AEFLRK 624
+++ + C +Q E Y Q + G + + AVPT D+ + + W +R+
Sbjct: 453 VLSPEECRGVIQAAEDYSQANGGWTTSR-----HYAVPTTDLPVHALKSTLPWFRSLVRE 507
Query: 625 YVVPLQEREF-IGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVD-Y 682
+ P + F + V +FVVRY +Q L H D ST++ IALN G+D Y
Sbjct: 508 RLFPALAKRFNLAAGPRRVFVHDAFVVRYEEGKQRHLPLHRDQSTHSFTIALN--GLDQY 565
Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
GGG F ++ + G L H G + +G RYI+ F
Sbjct: 566 TGGGTFFPSLGRSLRPAEGHALSFRGGIL---HGGDPLLKGVRYIIACFC 612
>gi|119470273|ref|ZP_01613032.1| hypothetical protein ATW7_12953 [Alteromonadales bacterium TW-7]
gi|119446445|gb|EAW27720.1| hypothetical protein ATW7_12953 [Alteromonadales bacterium TW-7]
Length = 304
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 62/152 (40%), Gaps = 20/152 (13%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
G DKR E GY A P+ + E L Y+ P+ E GY +
Sbjct: 134 GVMLDKRSE-GYLAAPS---------FQTFYNEMLNTYMRPIARLLFPEITGYDTQT--- 180
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF---IRYNCNVTATRM 701
F + Y P S+RPH D+S T+NI LN G ++ G F + + +
Sbjct: 181 -FGFSIYYDPSTDASIRPHTDASAVTLNINLNLPGEEFTGSELDFYDLVTGKVTQLSFKP 239
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
G ++H G + H + + T +++ F D
Sbjct: 240 GIAMIHRGSVAHAAKPITSGDRTNFVLWLFGD 271
>gi|320101633|ref|YP_004177224.1| alkyl hydroperoxide reductase [Isosphaera pallida ATCC 43644]
gi|319748915|gb|ADV60675.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Isosphaera pallida ATCC 43644]
Length = 367
Score = 46.6 bits (109), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 44/182 (24%), Positives = 70/182 (38%), Gaps = 27/182 (14%)
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPT--------RDIHMKQVGLAGV 617
I FC + ++I EA G + G E G + VP RD +K + +
Sbjct: 177 IFEPHFCRQLIEIYEADGGYESGFMR----EVGGKTVPVHDHSHKRRRDCEIKDLQVIQA 232
Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST-------YT 670
L++ ++P + F E R V Y R H D++T +
Sbjct: 233 CQMRLKRRLIPEIHKSF---QFEATRIERHIVACYDASTGGHFRAHRDNTTKGTAHRRFA 289
Query: 671 INIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMIS 730
I++ LN D++GG RF + +G ++ L HE VT G RY +
Sbjct: 290 ISLNLND---DFQGGDLRFAEFGPRTYRAPVGGAVVFSCSL--LHEATPVTAGKRYAFLP 344
Query: 731 FV 732
F+
Sbjct: 345 FL 346
>gi|410629220|ref|ZP_11339927.1| hypothetical protein GMES_4430 [Glaciecola mesophila KMM 241]
gi|410151244|dbj|GAC26696.1| hypothetical protein GMES_4430 [Glaciecola mesophila KMM 241]
Length = 303
Score = 46.6 bits (109), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 38/152 (25%), Positives = 65/152 (42%), Gaps = 20/152 (13%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
G D R E GY A P+ +Q+ L +Y+ P+ E +GY +
Sbjct: 133 GAMLDSRSE-GYLAAPSFQAFYRQI---------LDRYMRPIARLLFPEIVGYDTQT--- 179
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVT---ATRM 701
F + Y+P+ S+RPH D+S T+NI LN + G F + A +
Sbjct: 180 -FGFSIHYKPNTDTSIRPHTDASAVTLNINLNLPDESFTGSNVDFYDPSTGKMIGLAFKP 238
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
G ++H G + H + + + T +++ + D
Sbjct: 239 GSAMIHRGNVVHAAQPITSGERTNFVLWLYGD 270
>gi|374619955|ref|ZP_09692489.1| putative iron-regulated protein [gamma proteobacterium HIMB55]
gi|374303182|gb|EHQ57366.1| putative iron-regulated protein [gamma proteobacterium HIMB55]
Length = 341
Score = 46.6 bits (109), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 46/185 (24%), Positives = 73/185 (39%), Gaps = 27/185 (14%)
Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
N P P + + E + + ++ G+ G D + P RD+ +
Sbjct: 136 NLPAPVLVIPDAIDEALAEDLIHYLD--GREDHGFVADGDFKRRLHIHPDRDLEHR---- 189
Query: 615 AGVWAEFLRKYVVPLQEREFIG--YHHEPVRAPMSFVVRYRPDEQPSLRPHHDS------ 666
+ L K V+P E+ F H E + + RY H D+
Sbjct: 190 ---LDDKLCKSVLPEIEKVFYSEITHRETYK-----ICRYDGTNSGKFGKHRDTIAPHLH 241
Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
Y I + LN DYEGGG F YN V + ++ PG L +H+ ++ G+RY
Sbjct: 242 RRYAITLVLND---DYEGGGIAFPEYNSEVLSIPKYGAVVFPGSL--FHQVNNISSGSRY 296
Query: 727 IMISF 731
++ISF
Sbjct: 297 VIISF 301
>gi|333894574|ref|YP_004468449.1| 2OG-Fe(II) oxygenase [Alteromonas sp. SN2]
gi|332994592|gb|AEF04647.1| 2OG-Fe(II) oxygenase [Alteromonas sp. SN2]
Length = 303
Score = 46.2 bits (108), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 60/147 (40%), Gaps = 20/147 (13%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
G DKR E GY A P+ + L Y+ P+ E +GY +
Sbjct: 133 GAMLDKRSE-GYLAAPS---------FQAFYRTMLDTYMRPIARLLFPEIMGYDAQT--- 179
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNC---NVTATRM 701
F + Y+P+ S+RPH D+S T+NI LN G + G F N
Sbjct: 180 -FGFSIHYQPNTDTSIRPHTDASAVTLNINLNVPGETFTGSTVDFYDVKAGKVNPLTFTP 238
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIM 728
G ++H G + H + + + T +++
Sbjct: 239 GSAMLHRGNVPHAAQPITSGERTNFVL 265
>gi|390176657|ref|XP_003736142.1| GA16561 [Drosophila pseudoobscura pseudoobscura]
gi|160395573|sp|Q29NU5.2|GLT25_DROPS RecName: Full=Glycosyltransferase 25 family member; Flags:
Precursor
gi|388858689|gb|EIM52215.1| GA16561 [Drosophila pseudoobscura pseudoobscura]
Length = 626
Score = 46.2 bits (108), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 45/210 (21%), Positives = 86/210 (40%), Gaps = 33/210 (15%)
Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
Q P+VL+++ + L FL+ + +YP +I+ ++ + DD I K
Sbjct: 34 QPPTVLVALLVRNKAHILPMFLSYLEQQDYPKDRIAFWLRCDHSS-----DDSIDLLKQW 88
Query: 349 FKNVKYIAH--NSTVNSKEARNLAVENSLHKG-----------------------VDFYF 383
K+ + H N +S E+S + DF F
Sbjct: 89 LKHSGDLYHSVNYAFDSDGPHGYQNESSPYDWTVSRFKHVIALKEEAFTYARDIWADFVF 148
Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
++D+D L + L+ L ++AP+L+ +SNFW + + +Y R+ +Y I +
Sbjct: 149 FLDADVLLTSQQALRTLTALRLPIVAPMLLSE-SLYSNFWCGMTEEYYYQRTDEYKEIYH 207
Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKATNI 473
+ +G + VP + L+ + A N+
Sbjct: 208 VKK--QGSFPVPMVHTAVLVDMNHKGARNL 235
>gi|412990294|emb|CCO19612.1| predicted protein [Bathycoccus prasinos]
Length = 672
Score = 46.2 bits (108), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 49/226 (21%), Positives = 96/226 (42%), Gaps = 21/226 (9%)
Query: 513 SENFDPQKTNPEVY--ELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI--VT 568
S++ QK E Y L R+P + ++ P + K+ + ++ P ++
Sbjct: 454 SQSLKKQKRKDEAYLTRLSRSPSNLWHKF-EPSFSKTWITLAAGA-----IWLLPSHNIS 507
Query: 569 EKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV-GLAGVWAEFLRKYVV 627
++ C ++ + E + G ++ + +VPT D+ + + L W F+ ++
Sbjct: 508 KRTCEHWIVLAEKFASLQGGWCTNRHI-----SVPTTDLPVHLIPELVDEWNLFVFSDLI 562
Query: 628 PLQEREFI-GYHHEPVRAPMSFVVRYRPDEQPSLRP-HHDSSTYTINIALNQVGVDYEGG 685
PL ++ + + +F+V+Y + P H D S ++ IALN D+ GG
Sbjct: 563 PLAKQILTTSMFRKRLCVHDAFIVKYDASKGCDHLPIHRDQSEISVTIALNS-NSDFSGG 621
Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISF 731
G ++G +L+ G L H G + G RYI+ +F
Sbjct: 622 GGTMFPNLGITICPKIGEILLFRGDLE--HSGFPINGGIRYIVAAF 665
>gi|356531172|ref|XP_003534152.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
[Glycine max]
Length = 379
Score = 45.8 bits (107), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 48/207 (23%), Positives = 80/207 (38%), Gaps = 33/207 (15%)
Query: 537 LRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLE 596
LR I+ ++S+ ++ ++P P +F F I FC + +E + +W + E
Sbjct: 131 LRAINDNTEQSI--RSIVSEPSPGIFIFDIFQTHFCELLLSEIENFEKWVN--------E 180
Query: 597 TGYEAVPTRDIH-----MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVR 651
T + + ++ + GL + + + ++ PL F + + FVV
Sbjct: 181 TKFRIMRPNTMNKFGAVLDDFGLETMLDKLMEGFIRPLSRVFFAEVGGSTLDSHHGFVVE 240
Query: 652 YRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNV---TATR-------- 700
Y D L H D S T+N+ L G + GG F C T T
Sbjct: 241 YGKDRDVDLGFHVDDSEVTLNVCL---GKQFSGGELFFRGIRCEKHVNTGTHSEEIFDYS 297
Query: 701 --MGWMLMHPGRLTHYHEGLQVTQGTR 725
+G ++H GR H H T G R
Sbjct: 298 HVLGRAVLHRGR--HRHGARATTSGNR 322
>gi|169234704|ref|NP_001108473.1| chromosome associated protein D3 [Bombyx mori]
gi|18700451|dbj|BAB85193.1| hypothetical protein [Bombyx mori]
gi|22474509|dbj|BAC10614.1| hypothetical protein [Bombyx mori]
Length = 407
Score = 45.8 bits (107), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 62/121 (51%), Gaps = 10/121 (8%)
Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
+D+D L N + LK L+ ++ ++++P+L+ +SNFW + + +Y R+ DY I+N
Sbjct: 2 LDADVILTNIETLKVLIAKDFTVVSPMLMSD-GVYSNFWCGMTENYYYKRTDDYKPILN- 59
Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIH 498
+ G ++VP + + L+ ++ T Y +YD +AF N + G H
Sbjct: 60 -RKKTGCFDVPMVHSAVLISMRY-DVSDKLTYYPSKITNYDGPEDDIIAFALNSKALGEH 117
Query: 499 L 499
+
Sbjct: 118 V 118
>gi|410615971|ref|ZP_11326967.1| hypothetical protein GPLA_0186 [Glaciecola polaris LMG 21857]
gi|410164453|dbj|GAC31105.1| hypothetical protein GPLA_0186 [Glaciecola polaris LMG 21857]
Length = 304
Score = 45.8 bits (107), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 65/149 (43%), Gaps = 18/149 (12%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMS 647
G D R E GY A P+ +Q + ++ + + + P E IGY +
Sbjct: 134 GAMLDSRSE-GYLAAPSFQTFYRQ--MIDMYMRPIARMLFP----EIIGYDDQA----FG 182
Query: 648 FVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATR-----MG 702
F + YRP+ S+RPH D+S T+NI LN + G F Y+ + G
Sbjct: 183 FSIHYRPNTDNSIRPHTDASAVTLNINLNPPDALFTGSTVDF--YDSETGKMKGITFTPG 240
Query: 703 WMLMHPGRLTHYHEGLQVTQGTRYIMISF 731
++H G++ H + + + T +++ F
Sbjct: 241 SAILHRGKVVHAAQPITSGERTNFVLWLF 269
>gi|414070939|ref|ZP_11406917.1| hypothetical protein D172_2149 [Pseudoalteromonas sp. Bsw20308]
gi|410806688|gb|EKS12676.1| hypothetical protein D172_2149 [Pseudoalteromonas sp. Bsw20308]
Length = 304
Score = 45.8 bits (107), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 62/147 (42%), Gaps = 20/147 (13%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
G D R E GY A P+ + E L Y+ P+ E IGY +
Sbjct: 134 GAMLDSRSE-GYLAAPS---------FQAFYNEILNTYMRPISRLLFPEIIGYDTQT--- 180
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIR-YNCNVTAT--RM 701
F + Y P S+RPH D+S T+NI LN ++ G F Y ++ + +
Sbjct: 181 -FGFSIYYDPSTDASIRPHTDASAVTLNINLNLPSEEFTGSEVDFYHPYTGDIKSLTFKP 239
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIM 728
G ++H G + H + + + T +++
Sbjct: 240 GTAMLHRGNIAHAAKPITSGERTNFVL 266
>gi|158513401|sp|A3KGZ2.1|OGFD2_DANRE RecName: Full=2-oxoglutarate and iron-dependent oxygenase
domain-containing protein 2
Length = 345
Score = 45.8 bits (107), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 80/186 (43%), Gaps = 21/186 (11%)
Query: 553 VNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV 612
+ + VF F + ++FC + ++ +E + Q SD Y V + ++
Sbjct: 123 IQTEAASRVFRFQVFRKEFCKDLLEELEHFEQ-SDAPKGRPNTMNNYGIV------LNEL 175
Query: 613 GLAGVWAEFLRK-YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
G + LR+ Y+ PL + + + +FVV+Y E +L H+D+S T+
Sbjct: 176 GFDEGFITPLREVYLRPLTALLYSDCGGNCLDSHKAFVVKYDMHEDLNLSYHYDNSEVTL 235
Query: 672 NIALNQVGVDYEGGGCRF--------IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
N++L G D+ G F C R+ L+H G+ H H L ++ G
Sbjct: 236 NVSL---GKDFTEGNLFFGDMRQVPLSETECVEVEHRVTEGLLHRGQ--HMHGALSISSG 290
Query: 724 TRYIMI 729
TR+ +I
Sbjct: 291 TRWNLI 296
>gi|332535541|ref|ZP_08411316.1| hypothetical protein PH505_cy00110 [Pseudoalteromonas haloplanktis
ANT/505]
gi|332035040|gb|EGI71558.1| hypothetical protein PH505_cy00110 [Pseudoalteromonas haloplanktis
ANT/505]
Length = 304
Score = 45.4 bits (106), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 37/147 (25%), Positives = 64/147 (43%), Gaps = 20/147 (13%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQER---EFIGYHHEPVRA 644
G D R E GY A P+ + E L Y+ P+ E +G+ +
Sbjct: 134 GAMLDSRSE-GYLAAPS---------FQAFYNEILNTYMRPISRMLFPEVMGFDTQT--- 180
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIR-YNCNVTAT--RM 701
F + Y P+ S+RPH D+S T+NI LN G ++ G F Y ++ + +
Sbjct: 181 -FGFSIYYEPNTDSSIRPHTDASAVTLNINLNLPGEEFTGSQVGFYHPYTGDIKSLTFKP 239
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIM 728
G ++H G + H + + + T +++
Sbjct: 240 GTAMLHRGNIAHAAKPITSGERTNFVL 266
>gi|342876206|gb|EGU77862.1| hypothetical protein FOXB_11626 [Fusarium oxysporum Fo5176]
Length = 517
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 60/262 (22%), Positives = 100/262 (38%), Gaps = 63/262 (24%)
Query: 20 SVHCN---KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS 76
SVH K K I + + ASN D + + SA VN+ + W G
Sbjct: 58 SVHAPSKPKKKVIKTSQLHFLLPASNPNDMFCAIVTSALVNRYPAPYM---VGWKGEGKY 114
Query: 77 SLGGGYKVNL--LKNELDEMDIT--DDMIILVTDSYDVIIDGGVNDILERFNTFDAN--- 129
+ + L +K LDE+ DD ++ D YDV+ V ++ER+ A+
Sbjct: 115 NASAAHTAKLYSIKKYLDELPQGGDDDDLVFFGDGYDVMAQLPVEVVIERYFKVAADADR 174
Query: 130 ---------------------IVFGAERLCWPD------TSLYDKY-------PAVGSG- 154
+ +GA+++CWP T + + P G G
Sbjct: 175 RLADRFGITVEEAHKRGLKQTLFWGADKMCWPALNEAQCTKIPGSHLPRNVYGPKTGGGD 234
Query: 155 --YR---YLNSGGFIGYAKDIKELIS----------NRSIKNEEDDQLYYALLFLDETLR 199
YR Y NSG IG D++ I+ + + K + DQ+Y A L+ + L
Sbjct: 235 VTYRDAKYFNSGSVIGPVGDLRNFINAGIASLEETFDPNFKYKTSDQIYLARLYARQELS 294
Query: 200 TKHKIVLDTLANLFQNLYGSLE 221
+I +++ F + ++E
Sbjct: 295 RAEQIENESMMASFGDNATAVE 316
>gi|428178407|gb|EKX47282.1| hypothetical protein GUITHDRAFT_46957, partial [Guillardia theta
CCMP2712]
Length = 314
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 78/192 (40%), Gaps = 36/192 (18%)
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
V+ FP++ C + V + ++ ++LE +++ + M ++G W
Sbjct: 100 VYSFPLLQPSLCQQIVSCANDFAAFT----RQEKLEGKFDSERPAVLDMMKLG----WIN 151
Query: 621 --FLRKYVVPLQEREFIG-YHHEPVRAPMSFVVRYRPDEQ----PSLRP-------HHDS 666
LR+ V PL E F E + ++V Y P + RP H D
Sbjct: 152 DMLLRQVVSPLAEALFEDELEGETLDWRHGYIVGYAPKDPGQAGTEFRPRRNHLVSHTDD 211
Query: 667 STYTINIALNQVGVDYEGGGCRF---------IRYNCNVTATRMGWMLMHPGRLTHYHEG 717
S T+N+ L DY+GG F ++ + A GW ++H GR HE
Sbjct: 212 SEITLNVCLQS---DYQGGELVFHGRRGSGEELKTLGSFKAPAPGWAVLHVGR--QLHEV 266
Query: 718 LQVTQGTRYIMI 729
L VT G RY +I
Sbjct: 267 LPVTGGKRYGLI 278
>gi|428164686|gb|EKX33703.1| hypothetical protein GUITHDRAFT_147722 [Guillardia theta CCMP2712]
Length = 771
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 48/100 (48%), Gaps = 4/100 (4%)
Query: 633 EFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRY 692
E G+ V F+VRY Q L H D + T ++ LN+ D++GGG F
Sbjct: 342 ERFGFRAGEVTPVDVFLVRYTGAGQNQLSVHRDGALMTFSLLLNEAS-DFQGGGT-FFEE 399
Query: 693 NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
+ V G ++H G++ H G ++ G+RYI++ F
Sbjct: 400 DGLVFRHEQGVAVLHSGKIRHG--GYPISSGSRYILVGFC 437
>gi|392532697|ref|ZP_10279834.1| 2OG-Fe(II) oxygenase [Pseudoalteromonas arctica A 37-1-2]
Length = 304
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 37/147 (25%), Positives = 63/147 (42%), Gaps = 20/147 (13%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQER---EFIGYHHEPVRA 644
G D R E GY A P+ + E L Y+ P+ E G+ +
Sbjct: 134 GAMLDSRSE-GYLAAPS---------FQAFYNEILNTYMRPISRMLFPEVTGFDTQT--- 180
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIR-YNCNVTAT--RM 701
F + Y P+ S+RPH D+S T+NI LN G ++ G F Y ++ + +
Sbjct: 181 -FGFSIYYEPNTDSSIRPHTDASAVTLNINLNLPGEEFTGSEVDFYHPYTGDIKSLTFKP 239
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIM 728
G ++H G + H + + + T +++
Sbjct: 240 GTAMLHRGNIAHAAKPITSGERTNFVL 266
>gi|348522203|ref|XP_003448615.1| PREDICTED: 2-oxoglutarate and iron-dependent oxygenase
domain-containing protein 2-like [Oreochromis niloticus]
Length = 350
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 96/225 (42%), Gaps = 26/225 (11%)
Query: 519 QKTNPEVYELIRNPLDWDLRYIHPEYQKS-----LLPDTVNNQPCPDVFWFPIVTEKFCH 573
Q +P VY L + L + + I Q S L D + Q P V+ FP+ + FC
Sbjct: 91 QPLHPHVYHLQESYLASEFKQIVEYCQSSNATEEGLLDLLEEQAAPRVYRFPLFDKSFCE 150
Query: 574 EFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR-KYVVPLQER 632
+ ++ +E + Q S Y I + ++G + LR +Y++PL
Sbjct: 151 DLMEELEHFEQ-SGAPKGRPNTMNQY------GILLNELGFDEHFITPLREQYLLPLTSL 203
Query: 633 EFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF--- 689
+ + + +FVV+Y +E L H+D++ T+N++L G ++ G F
Sbjct: 204 LYPDCGGRCLDSHKAFVVKYDMNEDLDLSYHYDNAEVTLNVSL---GKEFTEGNLYFGDM 260
Query: 690 -----IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
C+ R+ L+H G+ H H L ++ G R+ +I
Sbjct: 261 RQVPVSETECSEVEHRVTEGLLHRGQ--HMHGALPISSGQRWNLI 303
>gi|444721252|gb|ELW61996.1| Outer dense fiber protein 2 [Tupaia chinensis]
Length = 1465
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 45/90 (50%), Gaps = 3/90 (3%)
Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
+ D+D+ L N L+ L+ R ++AP+L +SNFW + G+Y R+ +Y N
Sbjct: 122 FADTDNILTNNQTLRLLLERGLPVVAPML-DSQTYYSNFWCGITPQGYYRRTAEYFPTKN 180
Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKATNI 473
+ +G + VP + + +L+ A +
Sbjct: 181 RQR--QGCFRVPMVHSTFLVSLRAEGAAQL 208
>gi|404370072|ref|ZP_10975399.1| hypothetical protein CSBG_02623 [Clostridium sp. 7_2_43FAA]
gi|226913796|gb|EEH98997.1| hypothetical protein CSBG_02623 [Clostridium sp. 7_2_43FAA]
Length = 641
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 42/169 (24%), Positives = 71/169 (42%), Gaps = 23/169 (13%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS----NF 422
+N +E +L + D+ F VDSD + NP LK LV+ N+ +++ + +K S
Sbjct: 94 KNTIIEKALKEDYDYLFLVDSDLVM-NPKTLKRLVSLNKEIVSNIFWTRWKPNSYEQPQV 152
Query: 423 W-------------GALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM-KTSVI 468
W L R+ D++N++ G + V + C L+ K ++
Sbjct: 153 WLKDMYTLYDFEHGERLRESEVIKRTADFINMLRK----PGTYKVGGLGACTLISKEALS 208
Query: 469 KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFD 517
K N IY ++ D FC GI L +D+ H+ E+ D
Sbjct: 209 KGVNFNPIYNVSFWGEDRHFCIRAAVLGIQLYVDTYYPAHHIYRDEDLD 257
>gi|449678073|ref|XP_002156385.2| PREDICTED: procollagen galactosyltransferase 1-like, partial [Hydra
magnipapillata]
Length = 425
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 61/114 (53%), Gaps = 12/114 (10%)
Query: 407 LIAPLLVRPFKA---WSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
+++P+L R F A +SNFWGA++ G+Y R +Y ++ + G++ VP + + L+
Sbjct: 2 IVSPML-RSFDADGLYSNFWGAMDERGYYKRVPEYFTLLKRET--LGVYYVPMVHSTMLI 58
Query: 464 K-----TSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVD 512
T + I Y+ +D + F + ++ GI+L I +T+ +G L++
Sbjct: 59 DMRSNLTDTLMFYPIPLSYS-GVIDDILVFAQSAKHSGINLAICNTEVFGFLLN 111
>gi|449279299|gb|EMC86934.1| 2-oxoglutarate and iron-dependent oxygenase domain-containing
protein 2, partial [Columba livia]
Length = 289
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 42/176 (23%), Positives = 74/176 (42%), Gaps = 17/176 (9%)
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
++ P+ TE+FC FV +E + Q SD Y + + ++G+ +
Sbjct: 74 IYRLPVFTEEFCQAFVDELENFEQ-SDMPKGRPNSMNNYGVL------LNELGMDETFIT 126
Query: 621 FLR-KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
LR KY+ P+ + + + +FVV+Y E L H+D++ T+N++L G
Sbjct: 127 PLREKYLRPITALLYPDLGGSCLDSHKAFVVKYSLHEDLDLSSHYDNAEVTLNVSL---G 183
Query: 680 VDYEGGGCRFIRYNCNVTATRMGWMLMHPG------RLTHYHEGLQVTQGTRYIMI 729
D+ G F +N + + H G R H L + G R+ +I
Sbjct: 184 KDFTEGNLYFGDFNQDPAPVPKYIEIEHVGAHGLLHRGGQIHGALPIASGERWNLI 239
>gi|392537775|ref|ZP_10284912.1| 2OG-Fe(II) oxygenase [Pseudoalteromonas marina mano4]
Length = 304
Score = 44.7 bits (104), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 62/148 (41%), Gaps = 20/148 (13%)
Query: 592 DKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRAPMSF 648
DKR +GY A P + E L Y+ P+ E +GY + F
Sbjct: 138 DKR-SSGYLAAP---------NFQAFYNEILNNYMRPISRLLFPEIMGYDTQT----FGF 183
Query: 649 VVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI-RYNCNVT--ATRMGWML 705
+ Y P+ S+RPH D+S T+NI LN + G F + VT + + G +
Sbjct: 184 SIYYDPNTDASIRPHTDASAVTLNINLNLPEEKFTGSELDFYDQQTGKVTQLSFKPGCAM 243
Query: 706 MHPGRLTHYHEGLQVTQGTRYIMISFVD 733
+H G + H + + T ++M F D
Sbjct: 244 IHRGNVAHAAKPILTGDRTNFVMWLFGD 271
>gi|109898945|ref|YP_662200.1| 2OG-Fe(II) oxygenase [Pseudoalteromonas atlantica T6c]
gi|109701226|gb|ABG41146.1| 2OG-Fe(II) oxygenase [Pseudoalteromonas atlantica T6c]
Length = 306
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 38/152 (25%), Positives = 64/152 (42%), Gaps = 20/152 (13%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREF---IGYHHEPVRA 644
G D R E GY A P+ + + E L +Y+ P+ F +GY +
Sbjct: 136 GAMLDSRSE-GYLAAPSFQVFYR---------EMLDRYMRPIARLLFPDIVGYDTQT--- 182
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVT---ATRM 701
F + Y+P+ S+RPH D+S T+NI LN + G F A +
Sbjct: 183 -FGFSIHYKPNTDTSIRPHTDASAVTLNINLNLPDEVFTGSNVDFYDPTTGKMIGLAFKP 241
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
G ++H G + H + + + T +++ + D
Sbjct: 242 GSAMIHRGNVVHAAQPITSGERTNFVLWLYGD 273
>gi|359443514|ref|ZP_09233350.1| hypothetical protein P20429_3737 [Pseudoalteromonas sp. BSi20429]
gi|358034560|dbj|GAA69599.1| hypothetical protein P20429_3737 [Pseudoalteromonas sp. BSi20429]
Length = 304
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 37/147 (25%), Positives = 63/147 (42%), Gaps = 20/147 (13%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQER---EFIGYHHEPVRA 644
G D R E GY A P+ + E L Y+ P+ E G+ +
Sbjct: 134 GAMLDSRSE-GYLAAPS---------FQAFYNEILNTYMRPISRMLFPEVTGFDTQT--- 180
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIR-YNCNVTAT--RM 701
F + Y P+ S+RPH D+S T+NI LN G ++ G F Y ++ + +
Sbjct: 181 -FGFSIYYEPNTDSSIRPHTDASAVTLNINLNLPGEEFTGSEVDFYHPYTGDIKSLTFKP 239
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIM 728
G ++H G + H + + + T +++
Sbjct: 240 GTAMLHRGNIAHAAKLITSGERTNFVL 266
>gi|410643991|ref|ZP_11354476.1| hypothetical protein GAGA_0010 [Glaciecola agarilytica NO2]
gi|410136443|dbj|GAC02875.1| hypothetical protein GAGA_0010 [Glaciecola agarilytica NO2]
Length = 303
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 39/152 (25%), Positives = 62/152 (40%), Gaps = 20/152 (13%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
GT D R E GY A P+ + E L Y+ P+ E +GY +
Sbjct: 133 GTMLDSRSE-GYLAAPS---------FQAFYREILNTYMRPIARLLFPEIMGYDTQT--- 179
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVT---ATRM 701
F + Y+P+ S+RPH D+S T+NI +N + G F + A
Sbjct: 180 -FGFSIHYKPNTDTSIRPHTDASAVTLNININLPDEPFTGSTVDFYDPSAGKMIPLAFTS 238
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
G ++H G + H + + + T ++ F D
Sbjct: 239 GSAMIHRGNVVHAAQPITSGERTNLVLWLFGD 270
>gi|303291270|ref|XP_003064921.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226453592|gb|EEH50901.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 521
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 87/201 (43%), Gaps = 33/201 (16%)
Query: 34 FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDE 93
L +T ++ G I+S++++ + + LG W G K L L
Sbjct: 276 LLAVTYSNKLNPGLDLLIESSQLHDVPIHVLG----W-GEKHPQPAQKLKATL--KFLSR 328
Query: 94 MDITDDMIILVTDSYDVIIDGGVNDILERFNTFDAN-IVFGAERLCWPDTSLY------- 145
+D + +L D++D II +IL RF F+++ ++FGAE C+P + Y
Sbjct: 329 IDPS--TTVLFVDAFDSIIVKDSYEILRRFKEFNSSSLIFGAENNCFPLSYPYFNLGYDF 386
Query: 146 ---DKYPAVGSGY-RYLNSGGFIGYAKDIKELISNRSIKNEED--------DQLYYALLF 193
+ Y GY YLNSG +IG A + L S+ + ED DQ YA+
Sbjct: 387 CGDENYILKHKGYPSYLNSGQWIGKAGVARRLFSHYMLLVGEDLAETFTGTDQ--YAMEL 444
Query: 194 LDETLRTKHKIVLDTLANLFQ 214
+ T I +D A LFQ
Sbjct: 445 MRMT--KAWNIEVDHEARLFQ 463
>gi|345562733|gb|EGX45769.1| hypothetical protein AOL_s00140g85 [Arthrobotrys oligospora ATCC
24927]
Length = 441
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 34/152 (22%), Positives = 61/152 (40%), Gaps = 34/152 (22%)
Query: 108 YDVIIDGGVNDILERFNTFDANIVFGAERLCWPD------TSLYDKYPAVGSGY------ 155
+DV + +L RF +VFGA++ CWP+ + + P G +
Sbjct: 143 FDVWFQLPLQVLLSRFLKMGVPVVFGADKKCWPNDFKSVACTAIPQSPLPGDVFGDDTDR 202
Query: 156 ----------------RYLNSGGFIGYAKDIKELISNRSIKNEE------DDQLYYALLF 193
R++NSG IGYA ++ + K E+ DQ+ A ++
Sbjct: 203 QTILRSRREKYDNFRPRWVNSGTIIGYASHVRSIYDEAWKKVEQAGQEVDSDQMILAEVY 262
Query: 194 LDETLRTKHKIVLDTLANLFQNLYGSLEDIKL 225
+ ++ + + +D + LFQ + S DI
Sbjct: 263 GERVVKGDNSMSVDFYSTLFQTMTYSHNDIAF 294
>gi|356574250|ref|XP_003555263.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
[Glycine max]
Length = 380
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 40/194 (20%), Positives = 79/194 (40%), Gaps = 25/194 (12%)
Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIH--- 608
++ +PC V+ F ++ +FC + + ++ + +W GT +L ++ H
Sbjct: 147 SIMAEPCKGVYTFEMLQPQFCKKLMSEVDHFERWVHGT----KLRIMRPNAMNKNKHGVI 202
Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
+ + F+ ++ P+ + + + FVV Y ++ L H D +
Sbjct: 203 LDDFAFEAMLDRFMCDFIQPISRVFYPELGGSSLDSHHGFVVEYGINKDVELGLHEDEAE 262
Query: 669 YTINIALNQVGVDYEGGGCRF--IRYNCNVTATRM-----------GWMLMHPGRLTHYH 715
T+N+ L G ++ GG F +R + +VT G ++HPGR + H
Sbjct: 263 VTLNVCL---GKEFSGGDLFFQGVRCDAHVTTNTQPEEAFNYSHVPGHAILHPGR--NRH 317
Query: 716 EGLQVTQGTRYIMI 729
T G R +I
Sbjct: 318 GTRPTTSGNRMNLI 331
>gi|347738636|ref|ZP_08870086.1| alkyl hydroperoxide reductase/thiol specific antioxidant/Mal
allergen [Azospirillum amazonense Y2]
gi|346918273|gb|EGY00325.1| alkyl hydroperoxide reductase/thiol specific antioxidant/Mal
allergen [Azospirillum amazonense Y2]
Length = 366
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 72/182 (39%), Gaps = 15/182 (8%)
Query: 561 VFWFPIVTEK-FCHEFVQIMEAYGQWSDGTNN--DKRLETGYEAVPTR--DIHMKQVGLA 615
V P V E FC + + + G G N D RL Y+ R D M++ L
Sbjct: 165 VLVVPRVFEPDFCRLLIALHQDRGGLDSGVMNEVDGRLVGVYDYTRKRRRDYFMEEEALC 224
Query: 616 GVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY-----T 670
+ + + ++P Q R+ + E R V Y E RPH D+++
Sbjct: 225 QAATQRIARRLLP-QVRQAFAF--EATRMERHVVACYDAAEGGYFRPHRDNTSAGTAHRR 281
Query: 671 INIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMIS 730
+ LN DYEGG RF + G ++ L HE L VT+G RY +
Sbjct: 282 FAVTLNLNTEDYEGGELRFPEFGPRTYRAPTGGAVVFSCSL--LHEALPVTRGRRYAYLP 339
Query: 731 FV 732
F+
Sbjct: 340 FL 341
>gi|414877514|tpg|DAA54645.1| TPA: oxidoreductase, partial [Zea mays]
Length = 349
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 44/190 (23%), Positives = 77/190 (40%), Gaps = 27/190 (14%)
Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIH 608
++ +P P V+ F ++ FC ++ +E + +W R T Y AV
Sbjct: 161 SIMTEPIPGVYSFAMLQPTFCEMLLEEVENFEKWVHAMKFKIMRPNTMNKYGAV------ 214
Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
+ GL + +F+ +++ P+ + + + + +F+V Y D L H D S
Sbjct: 215 LDDFGLEAMLNQFMEQFIAPISKVLYPEVGGGTLDSHHAFIVEYGKDRDVELGFHVDDSE 274
Query: 669 YTINIALNQVGVDYEGGGCRF--IRYNCNVTATRM-----------GWMLMHPGRLTHYH 715
T+N+ L G + GG F IR +V + GW ++H GR H H
Sbjct: 275 VTLNVCL---GKQFFGGELYFRGIRCENHVNSETQHEEMYDYTHIPGWAVLHHGR--HRH 329
Query: 716 EGLQVTQGTR 725
+ G R
Sbjct: 330 GARATSSGLR 339
>gi|410635006|ref|ZP_11345628.1| hypothetical protein GLIP_0179 [Glaciecola lipolytica E3]
gi|410145432|dbj|GAC12833.1| hypothetical protein GLIP_0179 [Glaciecola lipolytica E3]
Length = 136
Score = 43.9 bits (102), Expect = 0.31, Method: Composition-based stats.
Identities = 27/96 (28%), Positives = 45/96 (46%), Gaps = 9/96 (9%)
Query: 633 EFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRY 692
E IGY + F + Y+P+ S+RPH D+S+ T+N+ LN + G F
Sbjct: 4 EIIGYDSQS----FGFSIHYQPNTDTSIRPHTDASSVTLNVNLNTPEELFSGSAVNFYDT 59
Query: 693 NCNVTATRM---GWMLMHPGRLTHYHEGLQVTQGTR 725
+T + G ++H G + H + +T G+R
Sbjct: 60 KQGLTKEHIFKSGTAVIHRGHVPHAAQ--HITSGSR 93
>gi|149185860|ref|ZP_01864175.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Erythrobacter
sp. SD-21]
gi|148830421|gb|EDL48857.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Erythrobacter
sp. SD-21]
Length = 363
Score = 43.9 bits (102), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 51/220 (23%), Positives = 86/220 (39%), Gaps = 28/220 (12%)
Query: 533 LDWDLRYI--HPEYQ-KSLLPDTVNNQPCPDVFWFPIVT------EKFCHEFVQIMEAYG 583
LD +LR + +P + ++ L + P + W P++T E C + + E G
Sbjct: 124 LDRELRIVGRYPLIEGEAALAELKRRLPQVEDSWAPVLTVPGVFDEALCKHLISLYENDG 183
Query: 584 QWSDGTNNDKR------LETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGY 637
G D L+ G++ RD + L + ++ + + V P ER F
Sbjct: 184 GTPSGFMRDVNGKTTHILDDGFKQ--RRDTTITDPKLIQLLSQRIARRVAPAIERAFA-- 239
Query: 638 HHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY-----TINIALNQVGVDYEGGGCRFIRY 692
+ R V Y + RPH D++T+ + +N +YEGG RF +
Sbjct: 240 -FKATRIERHIVACYEAGKG-HFRPHRDNTTFGTAHRRFAVTVNLNAEEYEGGNLRFPEF 297
Query: 693 NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
G ++ L HE VT+G RY + F+
Sbjct: 298 GQRTYRAPTGGAVVFSCSL--LHEATPVTRGERYAFLPFL 335
>gi|219121537|ref|XP_002181121.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407107|gb|EEC47044.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 427
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 77/194 (39%), Gaps = 39/194 (20%)
Query: 560 DVFWFPIVTEKFC---HEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAG 616
DV+ I++E FC EFV + GQ T L+ G R I + +GL
Sbjct: 221 DVYSLSILSESFCGRVREFVSEVSRLGQ----TEKYANLQMGR-----RPIDLDTIGLG- 270
Query: 617 VWAEFLRKYVV--PLQEREF--------IGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
W L Y++ P+ F + + V A + RP Q L H D
Sbjct: 271 -WINDLLFYLILRPISRHLFESSESFGDLNWRQGYVAAYSANPTEGRPRAQ--LITHTDD 327
Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTA--------TRMGWMLMHPGRLTHYHEGL 718
S T+NI L G ++ GG F A R+G L+H GR H+H+
Sbjct: 328 SEVTLNIGL---GENFTGGAIEFRGLRGTPEAGKLIGTIQPRVGVALIHAGR--HFHDVT 382
Query: 719 QVTQGTRYIMISFV 732
VT G R+ ++ +
Sbjct: 383 TVTSGDRFALVMWA 396
>gi|308801735|ref|XP_003078181.1| unnamed protein product [Ostreococcus tauri]
gi|116056632|emb|CAL52921.1| unnamed protein product [Ostreococcus tauri]
Length = 278
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 53/120 (44%), Gaps = 16/120 (13%)
Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
K ++PL + + H +VVRY E P H D T ++LN V +YE
Sbjct: 105 KTLIPLAREQCVIDDHLKFEDDDWYVVRYDAKEFPRASRHRDGGHMTFVVSLNNV-TEYE 163
Query: 684 GGGCRF--IRYNC-----------NVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMIS 730
GGG F + ++ ++ A +G +H +L H+ VT GTRY+++
Sbjct: 164 GGGSVFEGLAFSVPHGGVHKLEDHDIPAQPIGGTTVHGSQLMHWSNA--VTSGTRYVLVG 221
>gi|440798305|gb|ELR19373.1| hypothetical protein ACA1_265980 [Acanthamoeba castellanii str.
Neff]
Length = 329
Score = 43.5 bits (101), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 37/144 (25%), Positives = 67/144 (46%), Gaps = 24/144 (16%)
Query: 83 KVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
KV +++N L I D I++ DS+D I+ +++L + G +R
Sbjct: 108 KVLMVRNYL--ATIPGDQIVVFIDSFDSILYATPSELLASWRR-------GLDR----QN 154
Query: 143 SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKH 202
+ P+ G ++G A+D+ E ++ +I E DDQL + L+++D
Sbjct: 155 ARRRPTPSPIRGI-------YMGRARDLLEALTRAAIYEERDDQLAWELVYVD----NPG 203
Query: 203 KIVLDTLANLFQNLYGSLEDIKLN 226
+ LD A+L N+Y S +D+ L
Sbjct: 204 MVALDYHADLVANMYLSCDDLALR 227
>gi|357127527|ref|XP_003565431.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
[Brachypodium distachyon]
Length = 355
Score = 43.5 bits (101), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 45/184 (24%), Positives = 71/184 (38%), Gaps = 19/184 (10%)
Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGV 617
P V FP++ FC V ++ + W+ T K L T + + +G+ GV
Sbjct: 145 APVVVAFPMLRPGFCDMLVAEVQNFYMWA-CTTKQKILRTNALNTSPYGVVLSDMGMQGV 203
Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
+ ++++V P+ F + + +SFV Y D+ H D S T+++ L
Sbjct: 204 LDDLMKQFVSPISTVFFSEVGGGSLDSHVSFVNLYHGDDNNGTDWHVDDSEVTLSVCL-- 261
Query: 678 VGVDYEGGGCRFIRYNCNVTATRM-------------GWMLMHPGRLTHYHEGLQVTQGT 724
G ++ GG F C T M G L+H GR H H G
Sbjct: 262 -GKEFTGGEMYFNGRRCENHTTSMEKDEEKVIHPQVPGEALLHHGR--HRHSVFPTFSGF 318
Query: 725 RYIM 728
R M
Sbjct: 319 RADM 322
>gi|359453402|ref|ZP_09242720.1| hypothetical protein P20495_1464 [Pseudoalteromonas sp. BSi20495]
gi|358049553|dbj|GAA78969.1| hypothetical protein P20495_1464 [Pseudoalteromonas sp. BSi20495]
Length = 235
Score = 43.5 bits (101), Expect = 0.42, Method: Composition-based stats.
Identities = 31/96 (32%), Positives = 42/96 (43%), Gaps = 17/96 (17%)
Query: 592 DKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRAPMSF 648
D R E GY A P+ + E L Y+ P+ E IGY + F
Sbjct: 138 DSRSE-GYLAAPS---------FQAFYNEILNTYMRPISRLLFPEIIGYDTQT----FGF 183
Query: 649 VVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
+ Y P S+RPH D+S T+NI LN G ++ G
Sbjct: 184 SIYYDPSTDASIRPHTDASAVTLNINLNLPGEEFTG 219
>gi|219114765|ref|XP_002178178.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409913|gb|EEC49843.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 456
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 34/69 (49%), Gaps = 6/69 (8%)
Query: 663 HHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
H+D T N+ L + GGG F + G L+HPG L H+GL VT
Sbjct: 375 HYDRCDVTANLLLAHA---FRGGGT-FFPAALTTVHLQPGEFLLHPGSL--IHQGLDVTA 428
Query: 723 GTRYIMISF 731
GTRY+M+ F
Sbjct: 429 GTRYLMVMF 437
>gi|397642554|gb|EJK75306.1| hypothetical protein THAOC_02971 [Thalassiosira oceanica]
Length = 782
Score = 43.1 bits (100), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 44/168 (26%), Positives = 69/168 (41%), Gaps = 11/168 (6%)
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV-GLAGVWAEFLRK 624
+++ C+ +QI E + T + + AVPT D+ + + GL ++
Sbjct: 594 VLSHGECNRMIQIAEDHAVRLGWTTSR------HFAVPTTDMPIHDLPGLQAIFCRAWEN 647
Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
+ PL ++F +F+V+Y Q L PH D S + +ALN +EG
Sbjct: 648 KIRPLLRQQFRIPSDSECHIHDAFLVKYGASMQRYLPPHVDESNLSFVVALND--DSFEG 705
Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
GG +I +G ML G H G V G RYI+ F
Sbjct: 706 GGT-YIHTLGKTLKPPVGGMLSFCGGEI-LHSGDPVVSGIRYIVAGFC 751
>gi|332306013|ref|YP_004433864.1| 2OG-Fe(II) oxygenase [Glaciecola sp. 4H-3-7+YE-5]
gi|332173342|gb|AEE22596.1| 2OG-Fe(II) oxygenase [Glaciecola sp. 4H-3-7+YE-5]
Length = 303
Score = 43.1 bits (100), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 62/154 (40%), Gaps = 24/154 (15%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
G D R E GY A P+ + E L Y+ P+ E +GY +
Sbjct: 133 GAMLDSRSE-GYLAAPS---------FQAFYREILNTYMRPIARLLFPEIMGYDTQT--- 179
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNV-----TAT 699
F + Y+P+ S+RPH D+S T+NI +N + G F YN A
Sbjct: 180 -FGFSIHYKPNTDTSIRPHTDASAVTLNININLPDEPFTGSTVDF--YNPGAGKMIPLAF 236
Query: 700 RMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
G ++H G + H + + + T ++ F D
Sbjct: 237 TSGSAMIHRGNVVHAAQPITSGERTNLVLWLFGD 270
>gi|163795958|ref|ZP_02189921.1| hypothetical protein BAL199_28055 [alpha proteobacterium BAL199]
gi|159178713|gb|EDP63251.1| hypothetical protein BAL199_28055 [alpha proteobacterium BAL199]
Length = 383
Score = 43.1 bits (100), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 56/237 (23%), Positives = 92/237 (38%), Gaps = 24/237 (10%)
Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDW---DLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
G L+ EN Q + P+D LR EY + P + Q P +
Sbjct: 122 GALLVRENLRAQAI------VAAQPVDGFGERLRTAIAEYPVRMPPQAMQ-QHAPVLMIP 174
Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNND-KRLETGY---EAVPTRDIHMKQVGLAGVWAE 620
+V+ FC + + EA G + G D L G + +D ++ L
Sbjct: 175 DVVSPAFCRQLIDYYEARGGGASGFMRDVDGLTRGLLDPKMKRRKDCSIEDESLLKQLRR 234
Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYT-----INIAL 675
L V+P + F GY R + Y +Q + H D+++ ++L
Sbjct: 235 ALETRVIPEIGKAF-GYRVS--RVERYIIGCYDAADQGFFKAHRDNTSKATAHRKFAMSL 291
Query: 676 NQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
N +YEGG RF Y + +G ++ L +HE VT+G RY+++ F+
Sbjct: 292 NLNTDEYEGGALRFPEYGQHTYKPGVGCAVVFSCSL--FHEATPVTRGRRYVVLPFL 346
>gi|212724054|ref|NP_001131663.1| uncharacterized protein LOC100193023 [Zea mays]
gi|194692190|gb|ACF80179.1| unknown [Zea mays]
Length = 392
Score = 43.1 bits (100), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 44/190 (23%), Positives = 77/190 (40%), Gaps = 27/190 (14%)
Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIH 608
++ +P P V+ F ++ FC ++ +E + +W R T Y AV
Sbjct: 161 SIMTEPIPGVYSFAMLQPTFCEMLLEEVENFEKWVHAMKFKIMRPNTMNKYGAV------ 214
Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
+ GL + +F+ +++ P+ + + + + +F+V Y D L H D S
Sbjct: 215 LDDFGLEAMLNQFMEQFIAPISKVLYPEVGGGTLDSHHAFIVEYGKDRDVELGFHVDDSE 274
Query: 669 YTINIALNQVGVDYEGGGCRF--IRYNCNVTATRM-----------GWMLMHPGRLTHYH 715
T+N+ L G + GG F IR +V + GW ++H GR H H
Sbjct: 275 VTLNVCL---GKQFFGGELYFRGIRCENHVNSETQHEEMYDYTHIPGWAVLHHGR--HRH 329
Query: 716 EGLQVTQGTR 725
+ G R
Sbjct: 330 GARATSSGLR 339
>gi|224005507|ref|XP_002291714.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220972233|gb|EED90565.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 662
Score = 43.1 bits (100), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 37/71 (52%), Gaps = 4/71 (5%)
Query: 663 HHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
H+D T L + +YEGGG F R + G +L+HPG L YH+G+ +T
Sbjct: 570 HYDGCDVTWQAMLTDIN-EYEGGGTYF-RCLRQTIKLQQGQVLVHPGEL--YHKGIDITC 625
Query: 723 GTRYIMISFVD 733
G R +++ F D
Sbjct: 626 GVRTLLVCFTD 636
>gi|428216499|ref|YP_007100964.1| alkyl hydroperoxide reductase [Pseudanabaena sp. PCC 7367]
gi|427988281|gb|AFY68536.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Pseudanabaena sp. PCC 7367]
Length = 376
Score = 43.1 bits (100), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 45/189 (23%), Positives = 74/189 (39%), Gaps = 18/189 (9%)
Query: 553 VNNQPCPDVFWFP-IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET----GYEAVPTRDI 607
VN+ P V P +++ +FC E + + G G + +T YE RD
Sbjct: 168 VNHAP---VLLIPNVISPEFCQELIDVWHTRGNQDSGFMRSEGEKTVGYLDYEHKIRRDH 224
Query: 608 HMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSS 667
+++ L + + V P ++ F +E R + Y RPH D+
Sbjct: 225 FVREGQLRDRIDRIMNRRVFPEIKKAFC---YEVTRREAYKIACYNSASGGYFRPHRDNL 281
Query: 668 T-----YTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
T + LN +YEGG +F Y ++ G ++ L H E VT
Sbjct: 282 TGGTAHRKFAMTLNLNVEEYEGGYLKFAEYGPHLYKPTTGSAVIFSCSLLH--EATDVTA 339
Query: 723 GTRYIMISF 731
G R+ ++SF
Sbjct: 340 GIRFALLSF 348
>gi|348688347|gb|EGZ28161.1| hypothetical protein PHYSODRAFT_469968 [Phytophthora sojae]
Length = 464
Score = 42.7 bits (99), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 6/88 (6%)
Query: 648 FVVRY--RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWML 705
F V+Y R E+ L H D S + N+ LN D+ GGG F V T+ G
Sbjct: 375 FFVKYEARKGERSELALHRDGSVLSFNLLLNSAD-DFTGGGTYFDATKHTVHITQ-GDAA 432
Query: 706 MHPGRLTHYHEGLQVTQGTRYIMISFVD 733
+H G++ H G V G R I++ F+D
Sbjct: 433 VHSGKV--LHAGAPVVSGIRQILVGFLD 458
>gi|410643600|ref|ZP_11354096.1| hypothetical protein GCHA_4365 [Glaciecola chathamensis S18K6]
gi|410137010|dbj|GAC12283.1| hypothetical protein GCHA_4365 [Glaciecola chathamensis S18K6]
Length = 303
Score = 42.7 bits (99), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 38/152 (25%), Positives = 61/152 (40%), Gaps = 20/152 (13%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
G D R E GY A P+ + E L Y+ P+ E +GY +
Sbjct: 133 GAMLDSRSE-GYLAAPS---------FQAFYREILNTYMRPIARLLFPEIMGYDTQT--- 179
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVT---ATRM 701
F + Y+P+ S+RPH D+S T+NI +N + G F + A
Sbjct: 180 -FGFSIHYKPNTDTSIRPHTDASAVTLNININLPDEPFTGSTVDFYDPSAGKMIPLAFTS 238
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
G ++H G + H + + + T ++ F D
Sbjct: 239 GSAMIHRGNVVHAAQPITSGERTNLVLWLFGD 270
>gi|297834722|ref|XP_002885243.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297331083|gb|EFH61502.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 394
Score = 42.7 bits (99), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 74/191 (38%), Gaps = 27/191 (14%)
Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIHMKQ 611
++P P VF F ++ FC + ++ + +W T R T Y AV +
Sbjct: 163 SEPSPGVFVFDMLQPSFCEMMLSEIDNFERWVGETKFRIMRPNTMNKYGAV------LDD 216
Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
GL + + + ++ P+ + F + + FVV Y D L H D S T+
Sbjct: 217 FGLDTMLDKLMEGFIRPISKLFFSDVGGASLDSHHGFVVEYGKDRDVDLGFHVDDSEVTL 276
Query: 672 NIALNQVGVDYEGGGCRFIRYNCNV---TATRM----------GWMLMHPGRLTHYHEGL 718
N+ L G + GG F C TAT+ G ++H GR H H
Sbjct: 277 NVCL---GNQFVGGELFFRGTRCEKHVNTATKADLTFDYDHIPGQAVLHRGR--HRHGAR 331
Query: 719 QVTQGTRYIMI 729
T G R M+
Sbjct: 332 ATTSGHRVNML 342
>gi|409993565|ref|ZP_11276702.1| hypothetical protein APPUASWS_20677 [Arthrospira platensis str.
Paraca]
gi|409935585|gb|EKN77112.1| hypothetical protein APPUASWS_20677 [Arthrospira platensis str.
Paraca]
Length = 377
Score = 42.7 bits (99), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 63/263 (23%), Positives = 105/263 (39%), Gaps = 43/263 (16%)
Query: 499 LKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI-----HP--EYQKSLL-- 549
LK + + +YG + D + +N VY + LD +LR + HP E+ + L
Sbjct: 99 LKGEVSTKYGAYI----CDGKNSNTIVYNRVAFLLDRNLRILKIYPLHPLEEFTQQFLGE 154
Query: 550 -PDTVNNQP-------CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDG-TNNDKRLETGY- 599
D V +P P + ++ +FC E + I E G G + GY
Sbjct: 155 IQDLVAQEPPRLIKMQAPVLLIPKVLDLRFCRELIHIWETQGNDESGFMKREGEKTVGYV 214
Query: 600 -EAVPTRDIHMKQVGLAGVWAE-FLRKYVVP--LQEREFIGYHHEPVRAPMSFVVRYRPD 655
+ R H Q G + + +++ V P LQ +F + R + Y +
Sbjct: 215 DPSFKRRRDHFIQDGPVKNYIDSIMQRRVFPEILQAFQF-----QLTRRECYKIGCYDSE 269
Query: 656 EQPSLRPHHDSST-------YTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHP 708
RPH D++T + + I LN +YEGG RF + ++ G ++
Sbjct: 270 SGGFFRPHRDNTTGGTFHRRFAMTINLN--AEEYEGGCLRFPEHAPHLYKPATGDAIIFS 327
Query: 709 GRLTHYHEGLQVTQGTRYIMISF 731
+ HE VT G R+ ++SF
Sbjct: 328 --CSTMHEATDVTSGRRFALLSF 348
>gi|390338764|ref|XP_001180150.2| PREDICTED: 2-oxoglutarate and iron-dependent oxygenase
domain-containing protein 2-like [Strongylocentrotus
purpuratus]
Length = 361
Score = 42.4 bits (98), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 42/187 (22%), Positives = 82/187 (43%), Gaps = 15/187 (8%)
Query: 549 LPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIH 608
L + + VF FP+ T +FC FV+ + + N + + +
Sbjct: 130 LAKRLTRENASRVFSFPVFTAEFCDRFVEEITYF-------ENSPLPKGRPNTMNNYGVL 182
Query: 609 MKQVGLAGVWAEFLR-KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSS 667
+ ++G G + LR Y+ P+ + + + +F+V+Y+ E L H+D++
Sbjct: 183 LMELGFDGNFLNPLRMDYLAPIASLLYPDVGGNSLDSHRAFIVKYKLGEDVDLNYHYDNA 242
Query: 668 TYTINIALNQVGVDYE--GGGCRFIRYNCNVTAT---RMGWMLMHPGRLTHYHEGLQVTQ 722
TIN++L + D E G R + + + A + L+H G+ H H + +++
Sbjct: 243 EVTINVSLGKEFSDGELYFGDMRQMPRDETMYARFEHKKTIGLLHRGQ--HMHGAMPISE 300
Query: 723 GTRYIMI 729
G RY +I
Sbjct: 301 GERYNLI 307
>gi|301117344|ref|XP_002906400.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262107749|gb|EEY65801.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 462
Score = 42.4 bits (98), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 42/88 (47%), Gaps = 6/88 (6%)
Query: 648 FVVRY--RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWML 705
F V+Y R E+ L H D S + NI LN D+ GGG F V T+ G
Sbjct: 373 FFVKYEARKGERSELALHRDGSVLSFNILLNSAD-DFTGGGTYFDSTKRTVHITQ-GDAA 430
Query: 706 MHPGRLTHYHEGLQVTQGTRYIMISFVD 733
+H G++ H G V G R I++ F+D
Sbjct: 431 VHSGKV--LHGGAPVLTGIRQILVGFLD 456
>gi|392407556|ref|YP_006444164.1| glycosyltransferase [Anaerobaculum mobile DSM 13181]
gi|390620692|gb|AFM21839.1| putative glycosyltransferase [Anaerobaculum mobile DSM 13181]
Length = 277
Score = 42.4 bits (98), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 39/158 (24%), Positives = 72/158 (45%), Gaps = 28/158 (17%)
Query: 339 DDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLK 398
+D + F T+ ++YI ++S + A N+A+ S+ GV ++ ++SD H +N +V+K
Sbjct: 42 NDSLRKFSTLDDRIEYIFNDSNLGYGRAHNIAIRESIKAGVPYHVVLNSDVHFNN-EVIK 100
Query: 399 YL-----VNRNESLIAP--------------LLVRPFKAWSNF---WGALNADGFYARSF 436
L N + L+ P LL PF ++ WG Y
Sbjct: 101 VLYDFMNANPDVGLVMPKILYPNGELQYDCKLLPTPFDSFGRRFLNWGPFKK---YVEKR 157
Query: 437 DYMNIINGDQGGKGIWNVPYITNCYL-MKTSVIKATNI 473
+++ + K I NVPY+ C++ ++ SV+K +
Sbjct: 158 NHIYELRFADYDK-IMNVPYLCGCFIFLRVSVLKEIGL 194
>gi|195652137|gb|ACG45536.1| oxidoreductase [Zea mays]
Length = 392
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 43/190 (22%), Positives = 76/190 (40%), Gaps = 27/190 (14%)
Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIH 608
++ +P P V+ F ++ FC ++ +E + +W R T Y AV
Sbjct: 161 SIMTEPIPGVYSFAMLQPTFCEMLLEEVENFEKWVHAMKFKIMRPNTMNKYGAV------ 214
Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
+ GL + +F+ +++ P+ + + + + +F+V Y D L H D S
Sbjct: 215 LDDFGLEAMLNQFMEQFIAPISKVLYPEVGGGTLDSHHAFIVEYGKDRDVELGFHVDDSE 274
Query: 669 YTINIALNQVGVDYEGGGCRF--IRYNCNVTATRM-----------GWMLMHPGRLTHYH 715
T+N+ L G + GG F IR +V + W ++H GR H H
Sbjct: 275 VTLNVCL---GKQFSGGELYFRGIRCENHVNSETQHEEMYDYTHIPSWAVLHHGR--HRH 329
Query: 716 EGLQVTQGTR 725
+ G R
Sbjct: 330 GARATSSGLR 339
>gi|326929627|ref|XP_003210960.1| PREDICTED: 2-oxoglutarate and iron-dependent oxygenase
domain-containing protein 2-like [Meleagris gallopavo]
Length = 293
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 41/176 (23%), Positives = 75/176 (42%), Gaps = 17/176 (9%)
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
+F P+ TE+FC F++ +E + Q SD Y + + ++G+ +
Sbjct: 78 IFRLPVFTEEFCQAFIEELENFEQ-SDMPKGRPNSMNNY------GVLLNELGMDESFIT 130
Query: 621 FLR-KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
LR KY+ P+ + + + +FVV+Y E L H+D++ T+N++L G
Sbjct: 131 PLREKYLRPITALLYPDLGGACLDSHKAFVVKYSLHEDLDLSSHYDNAEVTLNVSL---G 187
Query: 680 VDYEGGGCRFIRYNCNVTATRMGWMLMHPG------RLTHYHEGLQVTQGTRYIMI 729
D+ G F + + + + H G R H L + G R+ +I
Sbjct: 188 KDFTEGNLYFGDFRQDPSPVPSYIEVEHVGTQGLLHRGGQIHGALPIASGERWNLI 243
>gi|432875843|ref|XP_004072935.1| PREDICTED: 2-oxoglutarate and iron-dependent oxygenase
domain-containing protein 2-like [Oryzias latipes]
Length = 290
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 42/179 (23%), Positives = 80/179 (44%), Gaps = 23/179 (12%)
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
V+ FP+ FC E V+ ++ + Q + I + ++G +
Sbjct: 78 VYRFPVFERDFCRELVEELDHFEQSPAPKGRPNTMNNS-------GILLDELGFDEAFVT 130
Query: 621 FLR-KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
LR +Y++PL + + + +FVV+Y +E L H+D++ T+N++ +G
Sbjct: 131 PLREQYLLPLTSLLYPDCGGRCLDSHKAFVVKYDMNEDLELSYHYDNAEVTLNVS---IG 187
Query: 680 VDYEGGGCRF--IRYNCNVTATRMGWM-------LMHPGRLTHYHEGLQVTQGTRYIMI 729
D+ G F +R + V+ TR+ L+H G+ H H L ++ G R+ +I
Sbjct: 188 KDFTEGNLYFGDMRQD-PVSETRLTEAEHRITEGLLHRGQ--HMHGALPISHGQRWNLI 243
>gi|345857005|ref|ZP_08809460.1| glycosyl transferase, group 2 family protein [Desulfosporosinus sp.
OT]
gi|344329850|gb|EGW41173.1| glycosyl transferase, group 2 family protein [Desulfosporosinus sp.
OT]
Length = 558
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 35/151 (23%), Positives = 72/151 (47%), Gaps = 23/151 (15%)
Query: 371 VENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIA--------PLLVRPFKAW--- 419
++ +L +G D+ F VDSD +LD P+ L +L++ N+ +++ P L+ + W
Sbjct: 95 IKIALDEGYDYLFLVDSDLYLD-PNTLPHLLSLNKDIVSEVYWTRWNPKLIPLPQVWIRD 153
Query: 420 ------SNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM-KTSVIKATN 472
S+ AL+ + R+ +++ +++ G + V + C L+ +T++ + +
Sbjct: 154 QYTLYVSSRGEALSEEEMNKRTKEFIKMLS----HPGTYKVGGLGACTLISRTALERGVS 209
Query: 473 IKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
+ IY L D FC G+ L D+
Sbjct: 210 FQEIYNLGFTGEDRHFCVRAAALGLELYADT 240
>gi|325286683|ref|YP_004262473.1| 2OG-Fe(II) oxygenase [Cellulophaga lytica DSM 7489]
gi|324322137|gb|ADY29602.1| 2OG-Fe(II) oxygenase [Cellulophaga lytica DSM 7489]
Length = 317
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/133 (22%), Positives = 58/133 (43%), Gaps = 7/133 (5%)
Query: 603 PTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRP 662
P + H+ + + + +Y+ P+ R +G + F +RY PD++ L+
Sbjct: 139 PRSEGHLGAPNFQAFYNDIMDRYMRPIS-RLLLGTQGYDSQT-FGFSIRYNPDKEKDLQA 196
Query: 663 HHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRM---GWMLMHPGRLTHYHEGLQ 719
H D+S+ T+NI +N +Y G F + T G ++H G + H
Sbjct: 197 HTDASSATLNININLPDEEYTGSEVDFYDKSTKQTVQTFFEPGKAILHRGNVPHATH--P 254
Query: 720 VTQGTRYIMISFV 732
+T G R ++ ++
Sbjct: 255 ITSGQRSNLVVWL 267
>gi|197104861|ref|YP_002130238.1| hypothetical protein PHZ_c1395 [Phenylobacterium zucineum HLK1]
gi|196478281|gb|ACG77809.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1]
Length = 345
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 46/179 (25%), Positives = 69/179 (38%), Gaps = 22/179 (12%)
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGY---EAVPTRDIHMKQVGLAGVWAEFL 622
I+ + C +++ E G G D T Y E RD+ ++ GL L
Sbjct: 154 ILEPELCRALIELHEGDGGAFTGVMRDAGDRTVYVMDELKRRRDVVVRDPGLVEALRTRL 213
Query: 623 RKYVVPLQERE--FIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST-------YTINI 673
+ + PL ER F H E V Y + RPH D++T + +I
Sbjct: 214 ERRLFPLIERALGFKATHIE-----RYLVSCYDEADGGVFRPHRDNTTLGTAHRAFACSI 268
Query: 674 ALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
LN +EGG RF + +G + + L HE L V +G RY + F+
Sbjct: 269 NLND---GFEGGDLRFPEFGPATYRPPVGGVCVFACGL--MHEALPVMEGRRYAFVPFL 322
>gi|408391417|gb|EKJ70794.1| hypothetical protein FPSE_09030 [Fusarium pseudograminearum CS3096]
Length = 528
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 53/232 (22%), Positives = 92/232 (39%), Gaps = 65/232 (28%)
Query: 40 ASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNL--LKNELDEM--D 95
ASN D + + SA VN+ + W G + + L +K LD++
Sbjct: 92 ASNPNDMFCAIVASALVNRYPAPYM---VGWKGEGKYNASAAHTAKLYSIKKYLDKLPNG 148
Query: 96 ITDDMIILVTDSYDVIIDGGVNDILERFNTFDAN------------------------IV 131
DD ++ D YDV+ V ++ER+ A+ +
Sbjct: 149 GDDDDLVFFGDGYDVMAQLPVEVVIERYFKVAADADQRLADRFGISVQEAHKRGLKQTLF 208
Query: 132 FGAERLCWP---------------DTSLYDKYPAVGSG---YR---YLNSGGFIGYAKDI 170
+GA+++CWP +++Y P G+G YR Y NSG IG D+
Sbjct: 209 WGADKMCWPAINEAQCTKIPGSHLASTVYG--PKTGNGDLNYRDAKYFNSGSVIGPIGDL 266
Query: 171 KELIS----------NRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANL 212
++ I+ + + K + DQ+Y A ++ + L ++ K + D L N
Sbjct: 267 RKFINAGVTALEETFDPNFKYKTSDQIYLARVYARQEL-SRAKQIEDELLNF 317
>gi|255570701|ref|XP_002526305.1| oxidoreductase, putative [Ricinus communis]
gi|223534386|gb|EEF36094.1| oxidoreductase, putative [Ricinus communis]
Length = 379
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 47/187 (25%), Positives = 76/187 (40%), Gaps = 21/187 (11%)
Query: 556 QPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIHMKQV 612
+P P V+ F ++ FC + +E + +W T R T Y AV +
Sbjct: 148 EPTPGVYVFEMLQPNFCEMLMSEVENFERWVHETKFRIMRPNTMNNYGAV------LDDF 201
Query: 613 GLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
GL + + + +Y+ P+ + F + + F+V Y D L H D S T+N
Sbjct: 202 GLETMLDKLMDEYIRPMSKLFFPEVGGSTLDSHHGFIVEYGVDRDVELGFHVDDSEVTLN 261
Query: 673 IALNQ--VGVDYEGGGCRFIRY-NCNVTATRM-------GWMLMHPGRLTHYHEGLQVTQ 722
+ L++ VG D G R ++ N A + G ++H GR H H T
Sbjct: 262 VCLSKQFVGGDLFFRGVRCDKHVNTETQAEEILDYVHVQGHAVLHHGR--HRHGARATTS 319
Query: 723 GTRYIMI 729
G R +I
Sbjct: 320 GRRVNLI 326
>gi|326495342|dbj|BAJ85767.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 392
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 62/147 (42%), Gaps = 12/147 (8%)
Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIH 608
++ +P P VF F ++ KFC ++ +E + +W R T Y AV
Sbjct: 158 SIMTEPTPGVFSFAMLQPKFCDMLLEEVENFEKWVHAMKFKIMRPNTMNKYGAV------ 211
Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
+ GL + +F+ +++ P+ + + + + +FVV Y D L H D S
Sbjct: 212 LDDFGLEAMLNQFMEEFIAPISKVFYPEVGGGTLDSHHAFVVEYGKDRDVELGFHVDDSE 271
Query: 669 YTINIALNQVGVDYEGGGCRFIRYNCN 695
T+N+ L G + GG F C
Sbjct: 272 VTLNVCL---GKQFSGGELYFRGIRCE 295
>gi|290978569|ref|XP_002672008.1| prolyl 4-hydroxylase alpha subunit family protein [Naegleria
gruberi]
gi|284085581|gb|EFC39264.1| prolyl 4-hydroxylase alpha subunit family protein [Naegleria
gruberi]
Length = 659
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 39/78 (50%), Gaps = 13/78 (16%)
Query: 661 RPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHP-----GRLT--- 712
R H+ S YT+ I LNQ D++GG RF + + L+H G+L
Sbjct: 168 RSEHERSIYTLLIYLNQ---DFKGGETRFYNDPTKTDSDFEEYSLLHTLKPSLGQLALFN 224
Query: 713 --HYHEGLQVTQGTRYIM 728
YHEG VT+GT+YI+
Sbjct: 225 QDFYHEGCPVTKGTKYIL 242
>gi|326523063|dbj|BAJ88572.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 392
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 62/147 (42%), Gaps = 12/147 (8%)
Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIH 608
++ +P P VF F ++ KFC ++ +E + +W R T Y AV
Sbjct: 158 SIMTEPTPGVFSFAMLQPKFCDMLLEEVENFEKWVHAMKFKIMRPNTMNKYGAV------ 211
Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
+ GL + +F+ +++ P+ + + + + +FVV Y D L H D S
Sbjct: 212 LDDFGLEAMLNQFMEEFIAPISKVFYPEVGGGTLDSHHAFVVEYGKDRDVELGFHVDDSE 271
Query: 669 YTINIALNQVGVDYEGGGCRFIRYNCN 695
T+N+ L G + GG F C
Sbjct: 272 VTLNVCL---GKQFSGGELYFRGIRCE 295
>gi|134299269|ref|YP_001112765.1| glycosyl transferase family protein [Desulfotomaculum reducens
MI-1]
gi|134051969|gb|ABO49940.1| glycosyl transferase, family 2 [Desulfotomaculum reducens MI-1]
Length = 826
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 76/379 (20%), Positives = 145/379 (38%), Gaps = 66/379 (17%)
Query: 173 LISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGS---LEDIKLNF-- 227
LI + + N+ +D L L+ R + L L GS ED+ LN
Sbjct: 189 LIITKELANKPNDPFLLYSLALEHYQRKNILEGVQCLKKALTQLRGSEGYFEDVILNTAI 248
Query: 228 ------DLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKH 281
L+E + N +++ K I + N K + + T L K
Sbjct: 249 GLLQLGRLEELMDFIN-----KSLLMLPEQKDLILMRRLANQGLKRYLKAADT---LEKS 300
Query: 282 LDSLKPDQF--PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFD 339
+DS + F V+++ + + L++FL + L ++ N+ H
Sbjct: 301 IDSRGKESFMKTRVMVASPVKQKEVILKQFLESLNKLEKSELELDFVFINDNNEH----- 355
Query: 340 DYIHNFKTMFKNVKYI---AHNSTVNSKEA--------------RNLAVENSLHKGVDFY 382
+ + F KNV+ I +++S + +E +N ++ +L +G D+
Sbjct: 356 NLLEKFSRGKKNVRIIKATSNDSYICDEETHRWSEELIWKVAAYKNSFIKMALEEGYDYL 415
Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVR----PFKAWSNFWG-------------A 425
F VDSD +L +P +K+L++ + +++ + FK WG A
Sbjct: 416 FLVDSDLYL-HPKTIKHLISLKKDIVSEVFWTRWGPEFKILPQVWGSDQYELYHVSRGQA 474
Query: 426 LNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM-KTSVIKATNIKTIYTLNSMDY 484
L+ + R +++ ++ G + V + C L+ + ++ K + IY L+
Sbjct: 475 LSEEEKIQRIEEFIEKLS----KPGTYKVGGLGACTLISQKALAKGVSFSEIYNLSFWGE 530
Query: 485 DMAFCTNLRNKGIHLKIDS 503
D FC G L D+
Sbjct: 531 DRHFCIRAVALGFELYADT 549
>gi|344297298|ref|XP_003420336.1| PREDICTED: 2-oxoglutarate and iron-dependent oxygenase
domain-containing protein 2 [Loxodonta africana]
Length = 350
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 33/129 (25%), Positives = 59/129 (45%), Gaps = 11/129 (8%)
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
++ P+ T FC ++ +E + Q SD Y + + ++GL
Sbjct: 139 IYRVPVFTASFCQALLEELEHFEQ-SDLPKGRPNTMNNY------GVLLHELGLDEPLVT 191
Query: 621 FLRK-YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV- 678
LR+ ++ PL + Y P+ + +FVV+Y P + L H+D++ T+N+AL +
Sbjct: 192 PLREHFLQPLMALLYPEYSGGPLDSHRAFVVKYAPGQDRELGCHYDNAELTLNVALGKAF 251
Query: 679 --GVDYEGG 685
G Y GG
Sbjct: 252 TGGALYFGG 260
>gi|376007582|ref|ZP_09784776.1| Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Arthrospira sp. PCC 8005]
gi|375324049|emb|CCE20529.1| Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Arthrospira sp. PCC 8005]
Length = 377
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 63/263 (23%), Positives = 104/263 (39%), Gaps = 43/263 (16%)
Query: 499 LKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI-----HP--EYQKSLL-- 549
LK + + YG + D + +N VY + LD LR + HP E+ + L
Sbjct: 99 LKGEVSTRYGAYI----CDGKNSNTIVYNRVAFLLDRSLRILKIYPLHPLEEFTQKFLGE 154
Query: 550 -PDTVNNQP-------CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDG-TNNDKRLETGY- 599
D V +P P + ++ +FC E ++I E G G + GY
Sbjct: 155 IQDLVAQEPPRLIEMQAPVLLIPKVLDLRFCRELIKIWETQGNDESGFMKREGEKTVGYV 214
Query: 600 -EAVPTRDIHMKQVGLAGVWAE-FLRKYVVP--LQEREFIGYHHEPVRAPMSFVVRYRPD 655
+ R H Q G + + +++ V P LQ +F + R + Y +
Sbjct: 215 DPSFKRRRDHFIQDGPVKNYIDSIMQRRVFPEILQAFQF-----QLTRRECYKIGCYDSE 269
Query: 656 EQPSLRPHHDSST-------YTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHP 708
RPH D++T + + I LN +YEGG RF + ++ G ++
Sbjct: 270 SGGFFRPHRDNTTGGTLHRRFAMTINLNTE--EYEGGCLRFPEHAPHLYKPATGDAIIFS 327
Query: 709 GRLTHYHEGLQVTQGTRYIMISF 731
+ HE VT G R+ ++SF
Sbjct: 328 --CSTMHEATDVTSGRRFALLSF 348
>gi|389878362|ref|YP_006371927.1| alkyl hydroperoxide reductase [Tistrella mobilis KA081020-065]
gi|388529146|gb|AFK54343.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
allergen [Tistrella mobilis KA081020-065]
Length = 404
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 53/204 (25%), Positives = 78/204 (38%), Gaps = 30/204 (14%)
Query: 547 SLLPDTVNNQPCPDVFWFPIVTE-KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTR 605
++ P QP V P V E FC ++M Y + G + E G V R
Sbjct: 181 TVAPADDAGQPWAPVLAVPRVFEPAFCR---RLMAEYDRLG-GEESGFMREVGGRTVEMR 236
Query: 606 DIHMKQVG---------LAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDE 656
D K+ AG+ A R+ + LQ+ + ++ R V Y D
Sbjct: 237 DYGHKRRADCLIEDETLRAGIRARIERRLLPELQK----AFQYKATRIERYIVACYDGDG 292
Query: 657 QPS-LRPHHDSST-------YTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHP 708
RPH D++T + + I LN DYEGG RF + G ++
Sbjct: 293 AGGYFRPHRDNTTRGTAHRRFAVTINLN--AEDYEGGELRFPEFGDRRYRAPTGGAVVFS 350
Query: 709 GRLTHYHEGLQVTQGTRYIMISFV 732
L HE L VT+G R+ + F+
Sbjct: 351 CSL--LHEALAVTRGRRFACLPFL 372
>gi|255567788|ref|XP_002524872.1| oxidoreductase, putative [Ricinus communis]
gi|223535835|gb|EEF37496.1| oxidoreductase, putative [Ricinus communis]
Length = 411
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 89/223 (39%), Gaps = 39/223 (17%)
Query: 531 NPLDWDLRYIHPE------YQKSLLPDT------VNNQPCPDVFWFPIVTEKFCHEFVQI 578
PL+ +L +HP + K++ +T + ++P P VF F ++ FC+ +
Sbjct: 143 QPLNRELYAMHPSSFFVPSFIKAINDNTEESFRHIMSEPSPGVFTFEMLQPHFCNLLLSE 202
Query: 579 MEAYGQW-SDGTNNDKRLET--GYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFI 635
+E + +W +D R T Y AV + GL + + + ++ P+ + F
Sbjct: 203 VENFEKWVNDSKFRIMRPNTMNKYGAV------LDDFGLETMLDKLMDGFIRPISKVFFP 256
Query: 636 GYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCN 695
+ + FVV Y D L H D S T+N+ L G + GG F C+
Sbjct: 257 EVGGSTLDSHHGFVVEYGKDRDVDLGFHVDDSEVTLNVCL---GKQFSGGDLFFRGIRCD 313
Query: 696 V---TATRM----------GWMLMHPGRLTHYHEGLQVTQGTR 725
T ++ G ++H GR H H T G R
Sbjct: 314 KHVNTGSQSEEIYDYKHEPGKAVLHRGR--HRHGARATTTGHR 354
>gi|219112429|ref|XP_002177966.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410851|gb|EEC50780.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 360
Score = 41.2 bits (95), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 48/233 (20%), Positives = 91/233 (39%), Gaps = 42/233 (18%)
Query: 529 IRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQI----MEAYGQ 584
IR+ L + P + + + D + P ++ I+++ ++I +A G+
Sbjct: 25 IRSAASTRLIFTLPRFYEDKVDDAIYPSPLHNIHVRTILSDDEAKACLRISSDFAKATGR 84
Query: 585 WSDGTNNDKR-----LETGYEAVPTRDIHMKQVGLAG-VWAEFLRKYVVPLQEREFIGYH 638
W D ++D+ + E + +++++G G ++ E Y V ++ F+
Sbjct: 85 W-DRPDSDRHASYATCDFAVEDCTILEDYLEKIGFTGRIFDELNEVYGVEQEDMSFLDL- 142
Query: 639 HEPVRAPMSFVVRYRPD------EQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF--- 689
F Y+ L PH D S + +I +N D+EGGG F
Sbjct: 143 ---------FCAHYQTKTDCNQGSMDRLEPHRDGSILSFSITINDPD-DFEGGGTLFDGL 192
Query: 690 ---------IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
++ V TR G + H G+ H +T G R +++ FVD
Sbjct: 193 RDVVSTSSVLKNGGVVRPTRAGDAVFHSGKALHGANA--ITSGKRTVLVGFVD 243
>gi|431915931|gb|ELK16185.1| Glycosyltransferase 25 family member 2 [Pteropus alecto]
Length = 166
Score = 41.2 bits (95), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 19/64 (29%), Positives = 33/64 (51%), Gaps = 1/64 (1%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A+ + K D+ +VD D+ L NP L ++ N++++AP+L +SNFW +
Sbjct: 93 RQTALRTAREKWSDYILFVDVDNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGI 151
Query: 427 NADG 430
Sbjct: 152 TPQA 155
>gi|412993564|emb|CCO14075.1| predicted protein [Bathycoccus prasinos]
Length = 486
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 55/122 (45%), Gaps = 11/122 (9%)
Query: 25 KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGG-GYK 83
K K +D +V A+N + SA+ N L++ + G+ ++ G K
Sbjct: 162 KSKGVDSAPVVVTAHATNLKSNGWVILDSAKKNGLEIV--------ISGNGTTFHGFADK 213
Query: 84 VNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
+ LK L I + II+ D+ DV++ G + +RF DA+ +FG E WP+
Sbjct: 214 MMGLKAAL--HSIPGNPIIVNADATDVLLQCGPEEFQKRFEQADADFIFGGETQLWPEIR 271
Query: 144 LY 145
Y
Sbjct: 272 KY 273
>gi|407068468|ref|ZP_11099306.1| 2OG-Fe(II) oxygenase [Vibrio cyclitrophicus ZF14]
Length = 303
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 62/148 (41%), Gaps = 22/148 (14%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
G D R E GY A P+ + + L Y+ P+ E +GY +
Sbjct: 133 GAMLDSRSE-GYLAAPS---------FQAFYRDLLDSYMRPIARLLFPEIMGYDTQT--- 179
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF---IRYNCNVTATRM 701
F ++Y+ ++ SLR H D+S+ T+NI +N ++ G F N T
Sbjct: 180 -FGFSIQYQANKDTSLRLHTDASSVTLNININMPDEEFSGSELNFYDPATGKMNETTFTP 238
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
G ++H G + H L +T G R ++
Sbjct: 239 GVAMIHRGNVA--HAALPITSGERSNLV 264
>gi|297719687|ref|NP_001172205.1| Os01g0180900 [Oryza sativa Japonica Group]
gi|255672938|dbj|BAH90935.1| Os01g0180900 [Oryza sativa Japonica Group]
Length = 433
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 44/192 (22%), Positives = 72/192 (37%), Gaps = 22/192 (11%)
Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQ 611
++ +P P VF FP++ FC + + + +W+ N T + R +
Sbjct: 194 SIMMEPAPGVFAFPMLKPSFCQMLMSEVNNFLRWAQSANQRIMRPTSLDRH-GRGAALSD 252
Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH-DSSTYT 670
GL + ++ ++ P+ F + + +FV+ Y E R H D S T
Sbjct: 253 FGLQEMLDNLMKDFISPMSTVLFPEVGGNTLDSHHTFVLEY--GEADGARGFHVDDSEVT 310
Query: 671 INIALNQVGVDYEGGGCRFIRYNCN-------------VTATRMGWMLMHPGRLTHYHEG 717
+NI L G + G F C V G +L+H G +H H
Sbjct: 311 LNICL---GKHFTGADMYFRGIRCGNHVNSGTHDEEYFVHPNVPGQVLLHHG--SHRHGV 365
Query: 718 LQVTQGTRYIMI 729
VT G R M+
Sbjct: 366 FSVTSGRRVNMV 377
>gi|421502614|ref|ZP_15949567.1| alkyl hydroperoxide reductase [Pseudomonas mendocina DLHK]
gi|400346598|gb|EJO94955.1| alkyl hydroperoxide reductase [Pseudomonas mendocina DLHK]
Length = 377
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 46/184 (25%), Positives = 71/184 (38%), Gaps = 19/184 (10%)
Query: 561 VFWFPIVTE-KFCHEFVQIMEAYGQWSDGTNNDKRLET----GYEAVPTRDIHMKQVGLA 615
V P V E C + A G G D +T G RD ++ L
Sbjct: 166 VLVLPRVFEPSLCQALMDYYAARGGEPSGYMQDIDGKTVQVIGQAHKSRRDCLVEDEALR 225
Query: 616 GVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST------- 668
+ + +VP ER F + R + Y EQ RPH D++T
Sbjct: 226 EACRLRIYQRLVPQIERAF---QFKVSRMERYLIGCYDATEQGHFRPHRDNTTKGTAHRR 282
Query: 669 YTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIM 728
+ +++ LN +YEGG RF + + G ++ L H E L VT+G R++
Sbjct: 283 FAVSLFLNSG--EYEGGWLRFPEFGSALYGAPTGGAVVFACSLLH--EALPVTRGRRFMF 338
Query: 729 ISFV 732
+ F+
Sbjct: 339 LPFL 342
>gi|46137635|ref|XP_390509.1| hypothetical protein FG10333.1 [Gibberella zeae PH-1]
Length = 507
Score = 40.8 bits (94), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 53/232 (22%), Positives = 89/232 (38%), Gaps = 65/232 (28%)
Query: 40 ASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNL--LKNELDEM--D 95
AS+ D + + SA VN+ + W G + + L +K LD++
Sbjct: 92 ASDPNDMFCAIVASALVNRYPAPYM---VGWKGEGKYNASAAHTAKLYSIKKYLDKLPNG 148
Query: 96 ITDDMIILVTDSYDVIIDGGVNDILERFNTFDAN------------------------IV 131
DD ++ D YDV+ V I+ER+ A+ +
Sbjct: 149 GDDDDLVFFGDGYDVMAQLPVEVIIERYFKVAADADQRLADRFGITVEEAHKRGLKQTLF 208
Query: 132 FGAERLCWP---------------DTSLYDKYPAVGSG------YRYLNSGGFIGYAKDI 170
+GA+++CWP +++Y P G+G +Y NSG IG D+
Sbjct: 209 WGADKMCWPALNEAQCTKIPSSHLPSTVYG--PKTGNGNTHNRDAKYFNSGSVIGPIGDL 266
Query: 171 KELISNRSIKNEE----------DDQLYYALLFLDETLRTKHKIVLDTLANL 212
++ I+ EE DQ+Y A F + L ++ K + D L N+
Sbjct: 267 RKFINAGVTALEETFDPNFKYKTSDQIYLARTFARQEL-SRAKQIEDELHNI 317
>gi|398803811|ref|ZP_10562825.1| Peroxiredoxin [Polaromonas sp. CF318]
gi|398095675|gb|EJL86010.1| Peroxiredoxin [Polaromonas sp. CF318]
Length = 373
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 44/182 (24%), Positives = 72/182 (39%), Gaps = 27/182 (14%)
Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET------GYEAVPTRDI-HMKQVGLAGVW 618
+ FC E + + E +G G ++ +T G++ DI + V A W
Sbjct: 178 VFPPGFCRELISLYETHGGKESGFMREENGKTVLAHDHGHKRREDYDITDLAVVKAARAW 237
Query: 619 AEFLRKYVVPLQEREFIGYHH-EPVRAPMSFVVRYRPDEQPSLRPHHDSST-------YT 670
+++ +VP E H + R + YR D+Q PH D++T +
Sbjct: 238 ---IQRRIVP----EIAKVHQFKATRMERYIIGCYRADQQAHFSPHRDNTTRGTAHRRFA 290
Query: 671 INIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMIS 730
++I LN D+EGG F Y G ++ L H VT+G RY +
Sbjct: 291 VSINLND---DFEGGEVSFPEYGPRSFKPPPGGAVVFSCSLLHAVS--TVTRGRRYAFLP 345
Query: 731 FV 732
F+
Sbjct: 346 FL 347
>gi|424513685|emb|CCO66307.1| predicted protein [Bathycoccus prasinos]
Length = 476
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 2/65 (3%)
Query: 83 KVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
KV LK L M + I++V D+ DV + ++ +RF +A++VFG E WP+
Sbjct: 212 KVIGLKAALHAM--PGNPIVVVADASDVFLQCSASEFKDRFTKAEADMVFGGETQLWPEV 269
Query: 143 SLYDK 147
S Y K
Sbjct: 270 SDYFK 274
>gi|18401806|ref|NP_566600.1| oxidoreductase [Arabidopsis thaliana]
gi|145332623|ref|NP_001078177.1| oxidoreductase [Arabidopsis thaliana]
gi|14423468|gb|AAK62416.1|AF386971_1 Unknown protein [Arabidopsis thaliana]
gi|9294077|dbj|BAB02034.1| unnamed protein product [Arabidopsis thaliana]
gi|30725644|gb|AAP37844.1| At3g18210 [Arabidopsis thaliana]
gi|332642543|gb|AEE76064.1| oxidoreductase [Arabidopsis thaliana]
gi|332642544|gb|AEE76065.1| oxidoreductase [Arabidopsis thaliana]
Length = 394
Score = 40.8 bits (94), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 74/191 (38%), Gaps = 27/191 (14%)
Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIHMKQ 611
++P P VF F ++ FC + ++ + +W T R T Y AV +
Sbjct: 163 SEPSPGVFVFDMLQPSFCEMMLAEIDNFERWVGETKFRIMRPNTMNKYGAV------LDD 216
Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
GL + + + ++ P+ + F + + FVV Y D L H D S T+
Sbjct: 217 FGLDTMLDKLMEGFIRPISKVFFSDVGGATLDSHHGFVVEYGKDRDVDLGFHVDDSEVTL 276
Query: 672 NIALNQVGVDYEGGGCRFIRYNCNV---TATRM----------GWMLMHPGRLTHYHEGL 718
N+ L G + GG F C TAT+ G ++H GR H H
Sbjct: 277 NVCL---GNQFVGGELFFRGTRCEKHVNTATKADETYDYCHIPGQAVLHRGR--HRHGAR 331
Query: 719 QVTQGTRYIMI 729
T G R M+
Sbjct: 332 ATTCGHRVNML 342
>gi|432089369|gb|ELK23320.1| Procollagen galactosyltransferase 2 [Myotis davidii]
Length = 162
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 36/70 (51%), Gaps = 2/70 (2%)
Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
R A+ + K D+ ++D D+ L NP L L+ N++++AP+L +SNFW +
Sbjct: 89 RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGI 147
Query: 427 NADG-FYARS 435
+ RS
Sbjct: 148 TPQASLWLRS 157
>gi|326433362|gb|EGD78932.1| hypothetical protein PTSG_01907 [Salpingoeca sp. ATCC 50818]
Length = 423
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 62/293 (21%), Positives = 103/293 (35%), Gaps = 61/293 (20%)
Query: 485 DMAFCT-NLR----------NKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
D AFC N R N+ + + D H + ++ E Y I
Sbjct: 126 DEAFCKKNARLLRSWTADELNEALAILRDERARREHAAERSKERRERIKAE-YTFITRCK 184
Query: 534 DWDLRYIHPEYQKSL------------LPDTVNNQP----------CPDVFWFPIVTEKF 571
+ L ++ PE + + LP T N Q P ++ P+ T K+
Sbjct: 185 ELQLHHLRPEIRSLMTAIEETWTMDGRLPPTTNAQALVGTARLLELSPGIYAVPMFTAKY 244
Query: 572 CHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV-PLQ 630
C E + + + + K L+ ++ + + ++G ++ + + PL
Sbjct: 245 CAELEKELSNFRHVAS-----KDLKQSINSMNKHGVSLHELGFTPTFSNVIMASIANPLV 299
Query: 631 ER----EFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
E EF H F V Y L H+D S T NI + + +EGG
Sbjct: 300 EALYGAEFATLDHHKC-----FTVEYGEKADTDLSLHYDHSLITFNICITSL---FEGGD 351
Query: 687 CRFI-------RYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
+F R R GW ++H GR +H L V G R +I ++
Sbjct: 352 LQFFGDSRAAPRDTPVTWRHRCGWAVIHRGR--GWHRALPVRYGHRTNIIMWL 402
>gi|356520629|ref|XP_003528963.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
[Glycine max]
Length = 370
Score = 40.4 bits (93), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 43/192 (22%), Positives = 70/192 (36%), Gaps = 31/192 (16%)
Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIH--- 608
++ ++P P +F F I FC + +E + +W ET + + ++
Sbjct: 135 SIVSEPFPGIFIFDIFQTHFCELLLSEIENFEKWVT--------ETKFRIMHPNTMNKFG 186
Query: 609 --MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
+ GL + + + ++ PL F + + FVV Y D L H D
Sbjct: 187 AVLDDFGLETMLDKLMEGFIRPLSRVFFAEVGGSTLDSHHGFVVEYGKDRDVDLGFHVDD 246
Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNV---TATRM----------GWMLMHPGRLTH 713
S T+N+ L G + GG F C T + G ++H GR H
Sbjct: 247 SEVTLNVCL---GKQFSGGELFFRGVRCEKHVNTGSHSEEIFDYSHVPGRAVLHRGR--H 301
Query: 714 YHEGLQVTQGTR 725
H T G R
Sbjct: 302 RHGARATTSGNR 313
>gi|163915483|gb|AAI57324.1| ogfod2 protein [Xenopus (Silurana) tropicalis]
Length = 288
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 40/178 (22%), Positives = 74/178 (41%), Gaps = 21/178 (11%)
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLA-GVWA 619
++ P+ +FC + V+ +E + + SD Y I + ++G + A
Sbjct: 74 IYRLPVFIPEFCAKLVEELENF-ERSDLPKGRPNTMNNY------GILLNELGFVDALTA 126
Query: 620 EFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
KY+ PL F + + + +FVV+Y E L H+D++ T+N++L G
Sbjct: 127 PLCEKYIEPLTSLLFPDWGGGCLDSHRAFVVKYALQEDLDLSCHYDNAEVTLNVSL---G 183
Query: 680 VDYEGGGCRFIRYNCNVTATR--------MGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
++ G F R G ++H G+ H H L ++ G R+ +I
Sbjct: 184 KEFTDGNLYFSDMKEVPVNERTYAEVEHITGQGILHRGQ--HVHGALPISSGERWNLI 239
>gi|84387462|ref|ZP_00990481.1| hypothetical protein V12B01_14976 [Vibrio splendidus 12B01]
gi|84377715|gb|EAP94579.1| hypothetical protein V12B01_14976 [Vibrio splendidus 12B01]
Length = 303
Score = 40.0 bits (92), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 62/148 (41%), Gaps = 22/148 (14%)
Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
G D R E GY A P+ + + L Y+ P+ E +GY +
Sbjct: 133 GAMLDSRSE-GYLAAPS---------FQAFYRDLLDSYMRPIARLLFPEIMGYDTQT--- 179
Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF---IRYNCNVTATRM 701
F ++Y+ ++ SLR H D+S+ T+NI +N ++ G F N T
Sbjct: 180 -FGFSIQYQANKDTSLRLHTDASSVTLNINVNMPDEEFSGSELNFYDPATGKMNETIFTP 238
Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
G ++H G + H L +T G R ++
Sbjct: 239 GVAMIHRGNVA--HAALPITSGERSNLV 264
>gi|300855078|ref|YP_003780062.1| glycosyltransferase [Clostridium ljungdahlii DSM 13528]
gi|300435193|gb|ADK14960.1| putative glycosyltransferase [Clostridium ljungdahlii DSM 13528]
Length = 648
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 42/185 (22%), Positives = 77/185 (41%), Gaps = 23/185 (12%)
Query: 351 NVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAP 410
N+ Y N E +N ++ + D+ F +DSD + +P L L+N N+ +I+
Sbjct: 81 NMHYWKENLIWKIAEYKNRIIDYTKKNHYDYLFLIDSDIMV-HPKTLLSLINSNKDIISE 139
Query: 411 LLVRPFKAWSNFWGALNADGFYARSFDYMN-----IINGDQGGK------------GIWN 453
+ + W G L + + ++N I++ D+ K G++
Sbjct: 140 IF---WTRWQKDSGEL-PQVWVCDEYSFVNKERNEILSQDEFNKKYTDFIEKLKKPGVYE 195
Query: 454 VPYITNCYLM-KTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVD 512
V + C L+ K ++ K N IY L+ D FC G+ L +D+T H+
Sbjct: 196 VGGLGACTLISKEAIEKGVNFNKIYNLSFWGEDRHFCIRAAALGLKLYVDTTYPAYHIYR 255
Query: 513 SENFD 517
+N +
Sbjct: 256 KDNLE 260
>gi|303279707|ref|XP_003059146.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458982|gb|EEH56278.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 382
Score = 40.0 bits (92), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 50/183 (27%), Positives = 66/183 (36%), Gaps = 58/183 (31%)
Query: 603 PTRDIHMKQVGLAGVWAEFLRKYV-VPLQEREFIGYHHEPVRAPMS-FVVRYRPDE--QP 658
PT D+ L G WA + + R G V P F+V+Y E Q
Sbjct: 91 PTTDLPWG--ALPGTWAVLNETWTRMEADVRARCGIKSNDVLTPNDIFLVKYDASEGGQK 148
Query: 659 SLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMG----W---MLMHPGRL 711
LR H D ST++ N+ L+ G DY GGG R +N T +R G W + PGR
Sbjct: 149 GLRRHRDGSTFSFNMMLSNPG-DYGGGGTRV--WNATDTESREGRERFWRAEVTKDPGRF 205
Query: 712 ------------------------------------------THYHEGLQVTQGTRYIMI 729
+ H+G+ VT GTRYI+
Sbjct: 206 PGVNLTRGDRMPRNFVPNIHMYPEDESTLHVLEKGQMLVGGGANVHQGVPVTTGTRYIVA 265
Query: 730 SFV 732
FV
Sbjct: 266 GFV 268
>gi|84494478|ref|ZP_00993597.1| hypothetical protein JNB_06769 [Janibacter sp. HTCC2649]
gi|84383971|gb|EAP99851.1| hypothetical protein JNB_06769 [Janibacter sp. HTCC2649]
Length = 711
Score = 39.7 bits (91), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 39/86 (45%), Gaps = 6/86 (6%)
Query: 648 FVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG--CRFIRYNCNVTATRMGWML 705
FV + +P + H D S +T+N+ L D GG + V R GW +
Sbjct: 612 FVRHFSERTRPFIPFHPDDSHWTVNVPLEDP--DQTSGGELVMLLDGGLRVVERRRGWAI 669
Query: 706 MHPGRLTHYHEGLQVTQGTRYIMISF 731
HPG L H +VT G R+ +I+F
Sbjct: 670 SHPGALIHGVR--RVTHGDRWSLIAF 693
>gi|449458771|ref|XP_004147120.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
[Cucumis sativus]
gi|449503401|ref|XP_004161984.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
[Cucumis sativus]
Length = 384
Score = 39.7 bits (91), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 45/189 (23%), Positives = 75/189 (39%), Gaps = 27/189 (14%)
Query: 553 VNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIHM 609
+ ++P P ++ F ++ +FC + + +E++ +W T R T Y AV +
Sbjct: 150 IMSEPSPGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYGAV------L 203
Query: 610 KQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY 669
GL + + + ++ P+ F + + FVV Y D L H D S
Sbjct: 204 DDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHHGFVVEYGIDRDVELGFHVDDSEV 263
Query: 670 TINIALNQVGVDYEGGGCRFIRYNCNV---TATRM----------GWMLMHPGRLTHYHE 716
T+N+ L G + GG F C+ T T+ G ++H GR H H
Sbjct: 264 TLNVCL---GKQFSGGELFFRGIRCDKHVNTETQSEEIFDYLHVPGHAVLHRGR--HRHG 318
Query: 717 GLQVTQGTR 725
T G R
Sbjct: 319 ARATTSGRR 327
>gi|390338649|ref|XP_786011.3| PREDICTED: 2-oxoglutarate and iron-dependent oxygenase
domain-containing protein 2-like [Strongylocentrotus
purpuratus]
Length = 318
Score = 39.7 bits (91), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 40/180 (22%), Positives = 76/180 (42%), Gaps = 25/180 (13%)
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
VF FP+ T +FC FV+ + + N + + + + ++G G +
Sbjct: 99 VFSFPVFTAEFCDRFVEEITHF-------ENSPLPKGRPNTMNNYGVLLMELGFDGNFLN 151
Query: 621 FLR-KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
LR Y+ P+ + + + +F+V+Y+ E L H D++ TIN++L +
Sbjct: 152 PLRMDYLAPIASLLYPDVGGNSLDSHRAFIVKYKLGEDVDLNYHFDNAEVTINVSLGKEF 211
Query: 680 VDYE----------GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
D E ++ R+ T L+H G+ H H + +++G RY +I
Sbjct: 212 SDGELYFGDMRQMPRDETKYARFEHKKTIG-----LLHRGQ--HMHGAMPISEGERYNLI 264
>gi|168027274|ref|XP_001766155.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682587|gb|EDQ69004.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 390
Score = 39.7 bits (91), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 43/189 (22%), Positives = 76/189 (40%), Gaps = 35/189 (18%)
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET-------GYEAVPTRDIHMKQVG 613
VF F ++ FC + ++ +E + +W+ + R++ Y AV + +G
Sbjct: 168 VFTFSMLKPSFCSKMLEEVEHFERWA----QEARVKVMRPNTMNNYGAV------LDDIG 217
Query: 614 LAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINI 673
+ + + +Y+ P+ F+ + FVV Y D L H D S T+N+
Sbjct: 218 MEVMLNHLMLRYLKPMAAVLFLNVGGSSLDTHHGFVVEYAMDRDLDLGFHVDDSEVTLNV 277
Query: 674 ALNQVGVDYEGGGC--RFIRYNCNVTATRM-----------GWMLMHPGRLTHYHEGLQV 720
L G ++GG R +R + +V G ++H GR H H +
Sbjct: 278 CL---GKKFDGGELFFRGVRCDKHVNGEARSEEVLEYSHVPGDAILHAGR--HRHGAKAI 332
Query: 721 TQGTRYIMI 729
T G R +I
Sbjct: 333 TSGQRTNLI 341
>gi|393227154|gb|EJD34846.1| HECT-domain-containing protein [Auricularia delicata TFB-10046 SS5]
Length = 997
Score = 39.7 bits (91), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 64/133 (48%), Gaps = 24/133 (18%)
Query: 112 IDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP---AVGSGYRYLNSGGFIGYAK 168
IDGG + + F T + VF A+R W TS ++ YP ++ + +LN F+G
Sbjct: 684 IDGG--GVFKEFLTSLSKEVFNADRGLWLTTSQHELYPNPMSIATEPHHLNWYRFVGR-- 739
Query: 169 DIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLY----------G 218
I +++ ++ +A FL + L + LD LA+L + LY G
Sbjct: 740 -----ILGKALYQGILVEVAFASFFLAKWLSKQS--FLDDLASLDRELYNGLIFLKHYQG 792
Query: 219 SLEDIKLNFDLDE 231
+LED+ LNF ++E
Sbjct: 793 NLEDLALNFTINE 805
>gi|168016296|ref|XP_001760685.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688045|gb|EDQ74424.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 387
Score = 39.7 bits (91), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 43/189 (22%), Positives = 77/189 (40%), Gaps = 35/189 (18%)
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET-------GYEAVPTRDIHMKQVG 613
VF F ++ FC + ++ +E + +W+ + R++ Y AV + +G
Sbjct: 169 VFTFSMLKPSFCVKMLEEVEHFERWA----QEARVKVMRPNTMNNYGAV------LDDIG 218
Query: 614 LAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINI 673
+ + + +Y+ P+ F+ + FVV Y D L H D S T+N+
Sbjct: 219 MESMLNHLMIRYLKPMAAVLFLNVGGCSLDTHHGFVVEYAMDRDLDLGFHVDDSEVTLNV 278
Query: 674 ALNQVGVDYEGGGC--RFIRYNCNVTATRM-----------GWMLMHPGRLTHYHEGLQV 720
L G +++GG R +R + +V G ++H GR H H +
Sbjct: 279 CL---GKEFDGGELFFRGVRCDKHVNGEARPEEVLEYSHVPGHAILHAGR--HRHGAKAI 333
Query: 721 TQGTRYIMI 729
T G R +I
Sbjct: 334 TSGQRTNLI 342
>gi|320166370|gb|EFW43269.1| Ogfod2 protein [Capsaspora owczarzaki ATCC 30864]
Length = 400
Score = 39.3 bits (90), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 42/94 (44%), Gaps = 13/94 (13%)
Query: 647 SFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF--------IRYNCNVTA 698
+FVV+YR E L+ H D + T+N+ L G ++ GG F
Sbjct: 276 TFVVQYRMAEDRELKFHFDDAEVTLNVCL---GTEFTGGALYFGGLFDAPETHDESLAVQ 332
Query: 699 TRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
++G +H G+ H H +T G RY MI ++
Sbjct: 333 HQLGRATLHLGK--HRHAAKPITSGERYNMIMWM 364
>gi|397630386|gb|EJK69753.1| hypothetical protein THAOC_08956, partial [Thalassiosira oceanica]
Length = 533
Score = 39.3 bits (90), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 30/53 (56%), Gaps = 3/53 (5%)
Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
+YEGGG F V + G +L+HPG L YH+G +T G R +++ F D
Sbjct: 458 EYEGGGTYFRSLRKTVI-LQQGQVLVHPGEL--YHKGNDITYGVRCLLVCFTD 507
>gi|303285766|ref|XP_003062173.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456584|gb|EEH53885.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 174
Score = 39.3 bits (90), Expect = 7.5, Method: Composition-based stats.
Identities = 43/171 (25%), Positives = 73/171 (42%), Gaps = 35/171 (20%)
Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
V+ F + E+FC + ++AY + + KR A + + ++G+ G+ +
Sbjct: 1 VYAFDLFEERFCAMLTEEVDAY----EVSGLPKRRPNTMNA---SGLIVNEIGMWGLMTD 53
Query: 621 FLRKYVVPLQEREF--------IGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
++ PL + + +HH SFVV Y D+ L HHD+S T+N
Sbjct: 54 VVKALASPLAAALYRDEIFADSLDHHH-------SFVVHYARDKDTRLDMHHDASEVTLN 106
Query: 673 IALNQVGVD-YEGGGCRFI--------RYNCNVTATRM-GWMLMHPGRLTH 713
+ +G D +EG G RF R + + + G +MH GR H
Sbjct: 107 VC---IGRDHFEGAGLRFCGRFGDANHRSGPSFAVSHVPGRAVMHLGRQRH 154
>gi|441623716|ref|XP_003264022.2| PREDICTED: glycosyltransferase 25 family member 3 [Nomascus
leucogenys]
Length = 789
Score = 39.3 bits (90), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 38/71 (53%), Gaps = 6/71 (8%)
Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN-----ADGFY 432
G D+ + D+D+ L N L+ LV + ++AP+L +SNFW + + G+Y
Sbjct: 358 GADYILFADTDNILTNNQTLRLLVGQGLPVVAPMLDS-QTYYSNFWCGITPQHSFSPGYY 416
Query: 433 ARSFDYMNIIN 443
R+ +Y ++N
Sbjct: 417 RRTAEYFPMLN 427
>gi|412993728|emb|CCO14239.1| predicted protein [Bathycoccus prasinos]
Length = 816
Score = 38.9 bits (89), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 62/144 (43%), Gaps = 10/144 (6%)
Query: 25 KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKV 84
K +D +V A+N + SA+ N LQV G G D G K+
Sbjct: 483 KTHGVDSADVVVTAHATNIKSTGWVIVDSAKRNGLQVVISGN-----GTDFH--GFADKM 535
Query: 85 NLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSL 144
LK L I + I++ TD+ DV++ + RFN +A+ +FG E WP+
Sbjct: 536 MGLKAAL--HSINGNPIVINTDANDVMLQCSGQEFKNRFNQANADFIFGGETQLWPEIHA 593
Query: 145 Y-DKYPAVGSGYRYLNSGGFIGYA 167
Y +K + + ++ G IG A
Sbjct: 594 YFEKTDEIAWKEKMSDTLGKIGAA 617
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.139 0.428
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,233,670,100
Number of Sequences: 23463169
Number of extensions: 552722575
Number of successful extensions: 1179237
Number of sequences better than 100.0: 742
Number of HSP's better than 100.0 without gapping: 348
Number of HSP's successfully gapped in prelim test: 394
Number of HSP's that attempted gapping in prelim test: 1176183
Number of HSP's gapped (non-prelim): 870
length of query: 734
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 584
effective length of database: 8,839,720,017
effective search space: 5162396489928
effective search space used: 5162396489928
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)