BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy14856
         (734 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|328713170|ref|XP_003245008.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           isoform 2 [Acyrthosiphon pisum]
          Length = 734

 Score =  794 bits (2051), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/732 (52%), Positives = 505/732 (68%), Gaps = 14/732 (1%)

Query: 11  ILSCVVFFISVHCNKVKNIDEDK---FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLH 67
           I  C +FFI    +  K++        LV+TVAS + DG+KRFI SA +N L+ K LG+ 
Sbjct: 9   IAICGLFFILDSASTKKDVSAKSDLNLLVLTVASEKNDGFKRFIDSANLNGLKTKVLGVD 68

Query: 68  QPWLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFD 127
           +PW GG+M+S+GGGYK+NL    L+     D++ +L+TD+YDV++    + IL  F  FD
Sbjct: 69  KPWQGGNMNSVGGGYKLNLYLEALEPYKNNDNLAVLLTDAYDVVLLANSSTILNAFTEFD 128

Query: 128 ANIVFGAERLCWPDTSLYDKYPAVG-SGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQ 186
           ++IV   E  CWPD  L DKYP V  +GYR++NSGG IGYA  + +L+S + IKN  DDQ
Sbjct: 129 SSIVISTENSCWPDRKLADKYPTVDLNGYRFINSGGIIGYASQLYKLLSEKPIKNLGDDQ 188

Query: 187 LYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVI 246
           L+   L+LD  LR K  I LD  A LFQN+Y + +DIKL      +V L N  +NT P +
Sbjct: 189 LHLTNLYLDTDLREKLNIKLDNYAKLFQNVYLAEDDIKLKLVNKSYV-LENINFNTQPAV 247

Query: 247 IHGNGKSKIELNSFGNYLAKSWK-TSGCTRC---NLIKHLDSLKPDQFPSVLISVFIDKP 302
           IHGNG SKI  NS+ NY+   W   SGC  C   NL   L +LK + +P VL+S+ +DKP
Sbjct: 248 IHGNGLSKITFNSYTNYIPNKWSPESGCKTCYDNNL--DLSTLKEENYPKVLLSIIVDKP 305

Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
           T F +EFL+KI N++YP  ++ + +    +YH    D +I      + N  ++ H +   
Sbjct: 306 TPFFDEFLDKIENIDYPKSRLCLSITTLVDYHKEHVDKFISKIGDKY-NASFVFHKTAEE 364

Query: 363 SKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNF 422
           S  AR+ +      K  DF FY+++++HLDNP  LK L+ RN+ +IAP+L RPFKAWSNF
Sbjct: 365 SIHARHFSFSLCTSKLCDFLFYIENEAHLDNPQTLKILIQRNKKIIAPMLTRPFKAWSNF 424

Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM 482
           WGAL+ +GFYARSFDYM+I+N ++   GIWNVPYI++CYLMK ++++    +  Y  +++
Sbjct: 425 WGALSKEGFYARSFDYMDIVNYNK--TGIWNVPYISSCYLMKGTILENKYTRPSYKEDNL 482

Query: 483 DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHP 542
           DYDMAF  +LR KG+ + ID+   YGHL+DSE+FD    NPEVY++  N  DW+ RYIHP
Sbjct: 483 DYDMAFSKSLREKGVFMYIDNQYTYGHLIDSESFDITLKNPEVYQIFENRYDWEQRYIHP 542

Query: 543 EYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAV 602
           EY ++  PD    +PCPDVFWFPI+TE+FC EF++IME +GQWSDGTNND RL TGYEAV
Sbjct: 543 EYMENFNPDKKPAEPCPDVFWFPILTEQFCQEFIEIMENFGQWSDGTNNDTRLRTGYEAV 602

Query: 603 PTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRP 662
           PTRDIHM QVGL   W EFLR YV P+Q++ FIGY H+P R+ M+FVV+Y P  Q SLRP
Sbjct: 603 PTRDIHMNQVGLEKHWLEFLRSYVQPIQKKAFIGYTHDPPRSLMNFVVKYNPLGQASLRP 662

Query: 663 HHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
           HHDSSTYTINIALN  G DY+GGGC F+RY C VT  ++GWMLMHPGRLTHYHEGL+VT 
Sbjct: 663 HHDSSTYTINIALNSPGKDYQGGGCHFLRYKCKVTDLKVGWMLMHPGRLTHYHEGLEVTN 722

Query: 723 GTRYIMISFVDP 734
           GTRYIMISFVDP
Sbjct: 723 GTRYIMISFVDP 734


>gi|350421678|ref|XP_003492921.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           isoform 1 [Bombus impatiens]
 gi|350421681|ref|XP_003492922.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           isoform 2 [Bombus impatiens]
          Length = 736

 Score =  781 bits (2017), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/734 (51%), Positives = 504/734 (68%), Gaps = 9/734 (1%)

Query: 7   LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTL 64
           + C +   +     V    + + D+D  LV T+ASNETDGYKR+++S  V   +  ++ L
Sbjct: 6   IGCCLFWSLFLTYHVFSETLPSTDKDDVLVFTIASNETDGYKRYLRSVNVYGFRDNLRVL 65

Query: 65  GLHQPWLGGD--MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILER 122
           GL +PWLGGD   +S GGGYKVNLLK  L+     D  I++ TDSYDVI    + +I+ +
Sbjct: 66  GLGEPWLGGDNIKTSAGGGYKVNLLKKALENYGDDDQKIVIFTDSYDVIYLSDLTEIINK 125

Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNE 182
           F + DA ++F AE  CWPD SL  KYP+   G R+LNSGGF+GYA D+  ++++  IKN+
Sbjct: 126 FKSMDARVLFSAEGSCWPDKSLASKYPSAALGKRFLNSGGFVGYASDVYAILTHAPIKNK 185

Query: 183 EDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNT 242
           +DDQL+Y L +LDE LR +HKI LD  + +FQNLYG++ D++L F+  +   L NT Y+T
Sbjct: 186 DDDQLFYTLAYLDEELRERHKIKLDHKSEIFQNLYGAVADVELKFEGGK-ASLLNTVYST 244

Query: 243 NPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFID 300
            P+I+HGNG SK+ LNS GNYLA +W    GC  C      LD   P+ +P +LI++FI+
Sbjct: 245 EPLILHGNGYSKLSLNSLGNYLAHAWSPEEGCVMCWEETIELDRTTPESYPIILIAIFIE 304

Query: 301 KPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNST 360
           +PT FL EFL+ I    YP  K+ + ++NN EYH  + D+++      + + K I+ N  
Sbjct: 305 RPTPFLTEFLSAIYQQAYPKSKLHLLIHNNVEYHQDVVDNFMKKVGREYNSSKQISVNDA 364

Query: 361 VNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
           +N  +ARNLA++  L K    YF +DS SHLDN   LK L+ +   +IAPLLVRP+K WS
Sbjct: 365 MNEVDARNLAMDYCLLKECSGYFSIDSVSHLDNEHTLKLLIEQQRDIIAPLLVRPYKMWS 424

Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN 480
           NFWGA+  DGFYARSFDY+ I+N ++  +G+WNVP+I+NCYL+  ++I     +  Y+  
Sbjct: 425 NFWGAIMDDGFYARSFDYIEIVNNER--RGLWNVPFISNCYLINATLISNKETRPSYSEG 482

Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
            +D +MAF    R + I + + +  ++GHLVD +N+D   T+P+ Y+++ N LDW+  YI
Sbjct: 483 DLDTEMAFAYANRERNIFMYVSNRVDFGHLVDPDNYDVTVTHPDFYQILNNKLDWEKTYI 542

Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYE 600
           H  Y ++  P+    Q CPDV+ FPIV E+F  E + IME +G+WSDG+N+D RL  GYE
Sbjct: 543 HENYSENFNPNKTPVQVCPDVYRFPIVNERFTKELIDIMETFGKWSDGSNHDPRLTGGYE 602

Query: 601 AVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSL 660
            VPTRDIHM QV     W  FL++YV PLQE  F GY+H+P RA M+FVVRYRPDEQPSL
Sbjct: 603 NVPTRDIHMNQVKYEPQWLYFLKEYVRPLQELVFAGYYHDPPRALMNFVVRYRPDEQPSL 662

Query: 661 RPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQV 720
           +PHHDSSTYTINIALN+ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+V
Sbjct: 663 KPHHDSSTYTINIALNRAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRV 722

Query: 721 TQGTRYIMISFVDP 734
           T GTRYIMISFVDP
Sbjct: 723 TSGTRYIMISFVDP 736


>gi|340726794|ref|XP_003401738.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           isoform 1 [Bombus terrestris]
          Length = 736

 Score =  776 bits (2005), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/735 (50%), Positives = 503/735 (68%), Gaps = 9/735 (1%)

Query: 6   HLNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKT 63
           ++ C +   +     V    + + D+D  LV T+ASNETDGYKR+++S  V      ++ 
Sbjct: 5   NIGCCLFWSLFLTYHVFSETLPSTDKDDVLVFTIASNETDGYKRYLRSVNVYGFHDNLRV 64

Query: 64  LGLHQPWLGGD--MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILE 121
           LGL +PWLGGD   +S GGGYKVNLLK  L+     D  I++ TDSYDVI    + +I+ 
Sbjct: 65  LGLGEPWLGGDNIKTSAGGGYKVNLLKKALENYGDDDQKIVIFTDSYDVIYLSDLTEIIN 124

Query: 122 RFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKN 181
           +F + DA ++F AE  CWPD SL  KYP    G R+LNSGGF+GYA D+  ++++  IKN
Sbjct: 125 KFKSMDARVLFSAEGSCWPDKSLASKYPPATLGKRFLNSGGFVGYASDVYAILTHAPIKN 184

Query: 182 EEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYN 241
           ++DDQL+Y L +LDE LR +HKI LD  + +FQNLYG++ D++L F+  +   L NT YN
Sbjct: 185 KDDDQLFYTLAYLDEELRERHKIKLDHKSEIFQNLYGAVADVELKFEGGK-ASLLNTVYN 243

Query: 242 TNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFI 299
           T P+I+HGNG SK+ LNS GNYLA++W    GC  C      LD +    +P +LI++FI
Sbjct: 244 TEPLILHGNGYSKLSLNSLGNYLARAWSPEEGCVMCWEETIELDRIISQSYPIILIAIFI 303

Query: 300 DKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNS 359
           ++PT FL EFL+ I    YP  K+ + ++NN EYH  + D+++   +  + + K I+ N 
Sbjct: 304 ERPTPFLSEFLSAIYQQAYPKSKLHLLIHNNVEYHQDVLDNFMKKVEKEYNSSKQISVND 363

Query: 360 TVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAW 419
            ++  +ARNLA++  L K    YF +DS SHLDN   LK LV +   +IAPLLVRP+K W
Sbjct: 364 AMSEVDARNLAMDYCLLKECSGYFSIDSVSHLDNEHTLKLLVEQQRDIIAPLLVRPYKMW 423

Query: 420 SNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTL 479
           SNFWGA+  DGFYARSFDY+ I+  ++  +G+WNVP+I+NCYL+  ++I     +  Y+ 
Sbjct: 424 SNFWGAIMDDGFYARSFDYIEIVKNER--RGLWNVPFISNCYLINATLISNKETRPSYSE 481

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRY 539
             +D +MAF    R + I + + +  ++GHLVD +N+D   T+P+ Y+++ N LDW+  Y
Sbjct: 482 GDLDTEMAFAYANRERNIFMYVSNRVDFGHLVDPDNYDVTVTHPDFYQILNNKLDWEKTY 541

Query: 540 IHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGY 599
           IH  Y ++  P+    Q CPDV+ FPIV E+F  E + IME +G+WSDG+N+D RL  GY
Sbjct: 542 IHENYSENFNPNKTPVQVCPDVYRFPIVNERFTKELIDIMETFGKWSDGSNHDPRLTGGY 601

Query: 600 EAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPS 659
           E VPTRDIHM QV     W  FL++YV PLQE  F GY+H+P RA M+FVVRYRPDEQPS
Sbjct: 602 ENVPTRDIHMNQVKYEPQWLYFLKEYVRPLQELVFAGYYHDPPRALMNFVVRYRPDEQPS 661

Query: 660 LRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQ 719
           L+PHHDSSTYTINIALN+ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+
Sbjct: 662 LKPHHDSSTYTINIALNRAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLR 721

Query: 720 VTQGTRYIMISFVDP 734
           VT GTRYIMISFVDP
Sbjct: 722 VTSGTRYIMISFVDP 736


>gi|157117949|ref|XP_001653115.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase [Aedes aegypti]
 gi|108875910|gb|EAT40135.1| AAEL008099-PA [Aedes aegypti]
          Length = 707

 Score =  775 bits (2001), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/709 (52%), Positives = 499/709 (70%), Gaps = 11/709 (1%)

Query: 28  NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLL 87
           NI +   LV TVASN T+GY R+I+SA+   ++V TLGL +PWLGGDM+ LGGGYK+NLL
Sbjct: 8   NISQKPPLVFTVASNATEGYLRYIRSAKYYGIEVSTLGLGKPWLGGDMTRLGGGYKINLL 67

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           ++ L      DD I+L TDSYDV+    +  I+E+F TFDA+I+FG+E  CWP+  L  K
Sbjct: 68  RDALKPYKADDDRIVLFTDSYDVLFLASMEKIIEKFRTFDASILFGSEGFCWPEEDLKSK 127

Query: 148 YPAV-GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
           YP + G G R+LNSG F+GYA  +  ++    +K+ +DDQLYY   +LDE  R + KI L
Sbjct: 128 YPVLEGRGTRFLNSGLFMGYASKVYRMLKT-PVKDTDDDQLYYTKAYLDEKQRNELKIKL 186

Query: 207 DTLANLFQNLYGSLEDIKLNFDLD-EFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLA 265
           D  A LFQNL G  E + L  D + +   L NT+Y+T P I+HGNG SK+ LN + NYLA
Sbjct: 187 DHTAVLFQNLNGVEEQVVLALDENGKEAFLKNTEYSTVPYIVHGNGPSKLVLNGYANYLA 246

Query: 266 KSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
            ++    C   N  + L  L  +  P+V++++FI+K T F+EE+   IA +NYP+KK+ +
Sbjct: 247 GAFVDGECKTIN--EDLIQLDEENLPTVMLALFIEKATPFIEEWFEGIAKINYPSKKMDL 304

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
           F++NN +YH P  DD+I  + + +++ + + +         R+LAV+  L K  D+ F V
Sbjct: 305 FIHNNVDYHKPTIDDFIEKYSSSYRSFRMVDYTDDYEELAGRSLAVDQCLKKQCDYLFVV 364

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D H+D+ D+++ L+ +N+S+I+P+L RP K WSNFWGAL++ GFYARS DYM+I+   
Sbjct: 365 DADGHIDDSDIIRKLIVQNKSIISPMLNRPEKVWSNFWGALSSQGFYARSSDYMDIVGRK 424

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
             G+  WNVPYI+  YL+K SV+   +    Y L   D DMA C ++R KGI + + + +
Sbjct: 425 ILGQ--WNVPYISTIYLVKASVLPLVS----YELQGTDPDMALCWHMRAKGIFMHVINAE 478

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
           +YGHL+DS+ +D  KT+P+ Y+L  N  DW+ +YI PEY K L  D V  QPCPDV+WF 
Sbjct: 479 QYGHLIDSDYYDTTKTHPDFYQLFNNKHDWEQKYISPEYYKQLEKDYVQIQPCPDVYWFA 538

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I +E FC    +I+EA+G+WSDGT+ DKRL+ GYEAVPTRDIHM QVGL  VW +FL+ Y
Sbjct: 539 IASELFCDHLKEIVEAFGKWSDGTHTDKRLQGGYEAVPTRDIHMNQVGLEQVWLKFLQLY 598

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           V PLQE+ FIGY+H+P R+ M+FVVRYRPDEQPSLRPHHDSSTYTINIALN+ G+DYEGG
Sbjct: 599 VKPLQEKVFIGYYHDPPRSLMNFVVRYRPDEQPSLRPHHDSSTYTINIALNRAGIDYEGG 658

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GC F+RYNC+VT TR GWMLMHPGRLTH+HEGL+   GTRYIMISFVDP
Sbjct: 659 GCHFLRYNCSVTDTRKGWMLMHPGRLTHFHEGLRTNSGTRYIMISFVDP 707


>gi|328784759|ref|XP_003250492.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           [Apis mellifera]
          Length = 785

 Score =  773 bits (1995), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 388/783 (49%), Positives = 515/783 (65%), Gaps = 58/783 (7%)

Query: 7   LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTL 64
           + C +   +     V  +   +ID+D  LV TVA+ ETDGYKR+++S +V   +  ++ L
Sbjct: 6   VGCYLFWSLFLAYHVVSDTPPSIDKDDVLVFTVATKETDGYKRYLRSIDVYGFRDNLRVL 65

Query: 65  GLHQPWLGGD--MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILER 122
           G+  PWLGGD   +S+GGGYKVNLLK  L+E    DD II+ TDSYDVI    + +I+++
Sbjct: 66  GMGTPWLGGDHVKTSVGGGYKVNLLKKALEEYQNDDDRIIIFTDSYDVIFLSDLTEIIDK 125

Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNE 182
           F   +A ++F AE  CWPD SL  KYP+V  G R+LNSGGFIGYA DI  +++   IKN+
Sbjct: 126 FKNTNARVLFSAEGACWPDRSLASKYPSVTRGKRFLNSGGFIGYASDIYAILTYAPIKNK 185

Query: 183 EDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNT 242
           +DDQL+Y L +LDE LR  HKI LD  + +FQNLY ++ D+KL F+  +   L NT YNT
Sbjct: 186 DDDQLFYTLAYLDEKLREHHKIKLDHKSVIFQNLYLAVGDVKLKFENGK-ASLLNTVYNT 244

Query: 243 NPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFID 300
            P+I+HGNG SK  LNS GNYLA++W    GC  C      L+   P+ +P +LI+VFI+
Sbjct: 245 EPLILHGNGYSKESLNSLGNYLARAWSPEEGCIMCWEGTIELNKTIPESYPIILIAVFIE 304

Query: 301 KPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNST 360
           +PT FL EFL  I   +YP  K+ +FV+NN EYH  + + ++ N    +   K ++ N  
Sbjct: 305 RPTPFLNEFLATIYQQDYPKSKLHLFVHNNVEYHQDVINSFMKNVGYEYNTSKLVSVNDA 364

Query: 361 VNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
           +N  +ARNLA++  L K    YF +DS SHLDN   LK LV +   +IAPLLVRP+K WS
Sbjct: 365 MNEVDARNLAMDYCLLKECSGYFSIDSISHLDNKYTLKLLVEQQREIIAPLLVRPYKMWS 424

Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN 480
           NFWGA+  DGFYARSFDYM+I+  ++  +G+WNVP+I+NCYL+ +++I+    +  Y+  
Sbjct: 425 NFWGAIMDDGFYARSFDYMDIVKNER--RGLWNVPFISNCYLINSTLIRNKETRPSYSEG 482

Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
            +D DMAF    R + I + + +  ++GHLV+ +++D   T+P++Y++I N LDW+ RYI
Sbjct: 483 DLDTDMAFAYANRERSIFMYVSNRLDFGHLVNPDSYDITLTHPDLYQIIDNKLDWERRYI 542

Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL----- 595
           H  Y ++   +    QPCPDV+WFPIV E+F  E + +ME +G+WSDG+N+D RL     
Sbjct: 543 HENYSENFNSNQTPLQPCPDVYWFPIVNERFTKELIDVMENFGKWSDGSNHDPRLTGGYE 602

Query: 596 --------------------------------------------ETGYEAVPTRDIHMKQ 611
                                                       E+GYEAVPTRDIHMKQ
Sbjct: 603 NVPTRDIHMNQVKNEPQWLYFLKEYVRPLQELVFTGYYHDDPRIESGYEAVPTRDIHMKQ 662

Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
           +GL   W  FL +YV PLQE  FIGY+  P RA M+FVVRYRPDEQPSL+PHHDSSTYTI
Sbjct: 663 IGLHESWLNFLDQYVSPLQEHVFIGYNTSPPRALMNFVVRYRPDEQPSLKPHHDSSTYTI 722

Query: 672 NIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISF 731
           NIALN+VGVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT GTRYIMISF
Sbjct: 723 NIALNRVGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRVTSGTRYIMISF 782

Query: 732 VDP 734
           VDP
Sbjct: 783 VDP 785


>gi|328713172|ref|XP_001943472.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           isoform 1 [Acyrthosiphon pisum]
          Length = 784

 Score =  771 bits (1990), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/782 (49%), Positives = 505/782 (64%), Gaps = 64/782 (8%)

Query: 11  ILSCVVFFISVHCNKVKNIDEDK---FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLH 67
           I  C +FFI    +  K++        LV+TVAS + DG+KRFI SA +N L+ K LG+ 
Sbjct: 9   IAICGLFFILDSASTKKDVSAKSDLNLLVLTVASEKNDGFKRFIDSANLNGLKTKVLGVD 68

Query: 68  QPWLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFD 127
           +PW GG+M+S+GGGYK+NL    L+     D++ +L+TD+YDV++    + IL  F  FD
Sbjct: 69  KPWQGGNMNSVGGGYKLNLYLEALEPYKNNDNLAVLLTDAYDVVLLANSSTILNAFTEFD 128

Query: 128 ANIVFGAERLCWPDTSLYDKYPAVG-SGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQ 186
           ++IV   E  CWPD  L DKYP V  +GYR++NSGG IGYA  + +L+S + IKN  DDQ
Sbjct: 129 SSIVISTENSCWPDRKLADKYPTVDLNGYRFINSGGIIGYASQLYKLLSEKPIKNLGDDQ 188

Query: 187 LYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVI 246
           L+   L+LD  LR K  I LD  A LFQN+Y + +DIKL      +V L N  +NT P +
Sbjct: 189 LHLTNLYLDTDLREKLNIKLDNYAKLFQNVYLAEDDIKLKLVNKSYV-LENINFNTQPAV 247

Query: 247 IHGNGKSKIELNSFGNYLAKSWK-TSGCTRC---NLIKHLDSLKPDQFPSVLISVFIDKP 302
           IHGNG SKI  NS+ NY+   W   SGC  C   NL   L +LK + +P VL+S+ +DKP
Sbjct: 248 IHGNGLSKITFNSYTNYIPNKWSPESGCKTCYDNNL--DLSTLKEENYPKVLLSIIVDKP 305

Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
           T F +EFL+KI N++YP  ++ + +    +YH    D +I      + N  ++ H +   
Sbjct: 306 TPFFDEFLDKIENIDYPKSRLCLSITTLVDYHKEHVDKFISKIGDKY-NASFVFHKTAEE 364

Query: 363 SKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNF 422
           S  AR+ +      K  DF FY+++++HLDNP  LK L+ RN+ +IAP+L RPFKAWSNF
Sbjct: 365 SIHARHFSFSLCTSKLCDFLFYIENEAHLDNPQTLKILIQRNKKIIAPMLTRPFKAWSNF 424

Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM 482
           WGAL+ +GFYARSFDYM+I+N ++   GIWNVPYI++CYLMK ++++    +  Y  +++
Sbjct: 425 WGALSKEGFYARSFDYMDIVNYNK--TGIWNVPYISSCYLMKGTILENKYTRPSYKEDNL 482

Query: 483 DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHP 542
           DYDMAF  +LR KG+ + ID+   YGHL+DSE+FD    NPEVY++  N  DW+ RYIHP
Sbjct: 483 DYDMAFSKSLREKGVFMYIDNQYTYGHLIDSESFDITLKNPEVYQIFENRYDWEQRYIHP 542

Query: 543 EYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAV 602
           EY ++  PD    +PCPDVFWFPI+TE+FC EF++IME +GQWSDGTNND RL TGYEAV
Sbjct: 543 EYMENFNPDKKPAEPCPDVFWFPILTEQFCQEFIEIMENFGQWSDGTNNDTRLRTGYEAV 602

Query: 603 PTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHE---------------------- 640
           PTRDIHM QVGL   W EFLR YV P+Q++ FIGY H+                      
Sbjct: 603 PTRDIHMNQVGLEKHWLEFLRSYVQPIQKKAFIGYTHDDPRLDNGYEAVPTRDIHMKQVG 662

Query: 641 ----------------------------PVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
                                       P R+ M+FVV+Y P  Q SLRPHHDSSTYTIN
Sbjct: 663 LQNVWLEFLRLFVSRLQEHVYLGYYSDGPPRSLMNFVVKYNPLGQASLRPHHDSSTYTIN 722

Query: 673 IALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
           IALN  G DY+GGGC F+RY C VT  ++GWMLMHPGRLTHYHEGL+VT GTRYIMISFV
Sbjct: 723 IALNSPGKDYQGGGCHFLRYKCKVTDLKVGWMLMHPGRLTHYHEGLEVTNGTRYIMISFV 782

Query: 733 DP 734
           DP
Sbjct: 783 DP 784


>gi|383851266|ref|XP_003701155.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           [Megachile rotundata]
          Length = 784

 Score =  770 bits (1988), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/782 (48%), Positives = 506/782 (64%), Gaps = 57/782 (7%)

Query: 7   LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTL 64
           L C +L  +     V      + D    LV TVA+NETDGYKR+++S +V   +  ++ L
Sbjct: 6   LGCCLLWSLFLTYHVVSETPPSTDTKDVLVFTVATNETDGYKRYVRSVDVYGFRDNLRVL 65

Query: 65  GLHQPWLGGDM-SSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
           G   PWLGG + +S GGGYKVNLLK  L++    ++ I++ TDSYDVI   G+ +I+E+F
Sbjct: 66  GTGSPWLGGKVRTSAGGGYKVNLLKQALEKYKNDEERIVMFTDSYDVIFLSGLTEIIEKF 125

Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
              +A I+F AE  CWPD SL  KYP    G R+LNSGGFIGYA DI  +++   IKNE 
Sbjct: 126 KNTNARILFSAEGSCWPDKSLASKYPPATGGKRFLNSGGFIGYASDIYAILTYAPIKNEN 185

Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
           DDQL+Y + +LDE LR +HKI LD  + +FQNLYG++ D++L F+  +   L NT YNT 
Sbjct: 186 DDQLHYTIAYLDEKLREQHKIKLDHKSEIFQNLYGAVADVELKFEGGK-ASLLNTVYNTE 244

Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWKTS-GCTRC-NLIKHLDSLKPDQFPSVLISVFIDK 301
           P+I+HGNG SK+ LNS GNYLA +W    GC  C      LD   P+ +P +LI++FI++
Sbjct: 245 PLILHGNGYSKLSLNSLGNYLANAWSPEEGCVMCWEGTTELDKTLPETYPVILIAIFIER 304

Query: 302 PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTV 361
           PT FLEEFL  I    YP  K+ +F++N  EYH  + +D+I  F   +++ K +    ++
Sbjct: 305 PTPFLEEFLLTIYEQAYPKSKLDLFIHNTVEYHQDVVNDFIKKFGKEYRSNKQVLPKDSI 364

Query: 362 NSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
           N  +ARNLA++  L K    YF VDS +HLDN   LK LV +   ++APLLVRP+K WSN
Sbjct: 365 NEADARNLAMDYCLLKKCSGYFSVDSIAHLDNEYTLKLLVEQQRGIVAPLLVRPYKMWSN 424

Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS 481
           FWGA+  DGFYARSFDYM I+  ++  +G+WNVP+I+ CYL+  ++I     +  Y    
Sbjct: 425 FWGAIMDDGFYARSFDYMEIVKNER--RGLWNVPFISTCYLINATLISNKETRPSYVEGD 482

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIH 541
           +D DMAF    R + I + + +  ++GHLV+ +++D   T+P++Y+++ N LDW+ +YIH
Sbjct: 483 LDTDMAFAYANRERSIFMYVSNRVDFGHLVNPDSYDIALTHPDLYQILDNKLDWEKKYIH 542

Query: 542 PEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL------ 595
             Y ++  P+    QPCPDV+WFPIV EKF    + IMEA+G+WSDG+NND RL      
Sbjct: 543 VNYSENFNPERTPIQPCPDVYWFPIVNEKFTKSLIDIMEAFGKWSDGSNNDPRLTGGYEN 602

Query: 596 -------------------------------------------ETGYEAVPTRDIHMKQV 612
                                                      + GYEAVPTRDIHMKQV
Sbjct: 603 VPTRDIHMNQVNFEPQWLYFLKEYVRPLQEHVFIGYYHDDPRIDGGYEAVPTRDIHMKQV 662

Query: 613 GLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
           GL   W  FL +YV PLQE  FIGY+  P R+ M+FVVRYRPDEQPSL+PHHDSSTYT+N
Sbjct: 663 GLHETWLNFLYEYVSPLQEHVFIGYYTSPPRSLMNFVVRYRPDEQPSLKPHHDSSTYTVN 722

Query: 673 IALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
           IALN+ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTH+HEGL+VT GTRYIMISFV
Sbjct: 723 IALNKRGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHFHEGLRVTNGTRYIMISFV 782

Query: 733 DP 734
           DP
Sbjct: 783 DP 784


>gi|322786337|gb|EFZ12885.1| hypothetical protein SINV_01019 [Solenopsis invicta]
          Length = 742

 Score =  769 bits (1986), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 381/739 (51%), Positives = 496/739 (67%), Gaps = 42/739 (5%)

Query: 35  LVITVASNETDGYKRFIQSAEVNKLQVK--TLGLHQPWLGGDMSS-LGGGYKVNLLKNEL 91
           LV TVASNETDG++R+++S +V   + K   LGL +PW GG++    GGGYK+NLL+  L
Sbjct: 7   LVFTVASNETDGFRRYLRSTDVYGFRDKLNILGLGEPWKGGNVVKYAGGGYKINLLRKAL 66

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
            +    +  IIL TDSYDVI  G ++ I+ERF   +A ++F AE  CWPD SL  +YP V
Sbjct: 67  KDHQNDETKIILFTDSYDVIFLGDLSSIVERFLATNARVLFSAEAYCWPDKSLAAQYPPV 126

Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
             G RYLNSG FIGYA D+ +++    IKNE+DDQL+Y  ++L+E LR +HKI LD  + 
Sbjct: 127 SRGKRYLNSGSFIGYASDVYKILDTAPIKNEDDDQLFYTTVYLNEELRIRHKIKLDHKSE 186

Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT- 270
           +FQNL+G++ D++L F  +E  +L N  YNT P+++HGNG SK+ LNS GNYLA++W   
Sbjct: 187 IFQNLFGAVADVELRFKGEE-AYLQNIVYNTVPLVLHGNGYSKLVLNSLGNYLARAWTPD 245

Query: 271 SGCTRC-NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
            GC  C +    LD  KP+ +P +LI+VFI++PT FLEEF   I +  YP  K+ +FV+N
Sbjct: 246 EGCLACWDRTIELDKTKPETYPVILIAVFIERPTPFLEEFFRDIYHQFYPKTKLHLFVHN 305

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           N  YH  +  D+       + + K I  + +V+  +AR LA+E+ L K    Y  +DS +
Sbjct: 306 NVPYHEDVVGDFFEKVGQEYLSAKQILPSDSVSEVDARRLAMEHCLLKECSGYLSIDSVA 365

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
           HL N   LK LV +   +IAPLL+RPFKAWSNFWGA+  DGFYARSFDYM II  ++  +
Sbjct: 366 HLTNEFTLKLLVEQQRGIIAPLLIRPFKAWSNFWGAITDDGFYARSFDYMEIIKNER--R 423

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           G+WNVP+++NCYL+  ++I +   +  Y    +D +MAF    R +G+ + + +  E+GH
Sbjct: 424 GLWNVPFVSNCYLINATIIASKVTRPTYEHGDLDTEMAFAHGNRQRGLFMYVSNRLEFGH 483

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           LVD + ++ Q T P++Y++I N LDW+ RYIHP Y ++  PD    QPCPDV+WFPIV  
Sbjct: 484 LVDPDTYNIQLTYPDMYQIIDNKLDWERRYIHPNYSENFNPDKKPIQPCPDVYWFPIVNL 543

Query: 570 KFCHEFVQIMEAYGQWSDGTNN----------------------------------DKRL 595
           +F  E V I+E YGQWSDGTN                                   D RL
Sbjct: 544 RFTKELVGIVETYGQWSDGTNQDPRLSGGYENVPTRDIHMNQVQYEQQWLYFLKEFDSRL 603

Query: 596 ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPD 655
           +TGYEAVPTRDIHM QVGL   W +FL+ YV PLQE  F GY+  P R+ M+FVVRYRPD
Sbjct: 604 DTGYEAVPTRDIHMTQVGLHDAWLKFLKDYVNPLQEHVFTGYNDYPPRSLMNFVVRYRPD 663

Query: 656 EQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYH 715
           EQPSLRPHHDSSTYTINIALNQ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYH
Sbjct: 664 EQPSLRPHHDSSTYTINIALNQAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYH 723

Query: 716 EGLQVTQGTRYIMISFVDP 734
           EGL+VT GTRYIMISFVDP
Sbjct: 724 EGLRVTAGTRYIMISFVDP 742


>gi|380020387|ref|XP_003694068.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           [Apis florea]
          Length = 785

 Score =  769 bits (1985), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/775 (49%), Positives = 510/775 (65%), Gaps = 60/775 (7%)

Query: 17  FFISVHC--NKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTLGLHQPWLG 72
            F++ H     + +ID+D  L+ TVA+ ETDGYKR+++S +V   +  ++ LG+  PWLG
Sbjct: 14  LFLAYHVVSETLPSIDKDDVLIFTVATKETDGYKRYLRSIDVYGFRDNLRVLGMGTPWLG 73

Query: 73  GD--MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANI 130
           GD   +S+GGGYKVNLLK  L+E    D+ II+ TDSYDVI    + +I+++F   +A +
Sbjct: 74  GDHVKTSVGGGYKVNLLKKALEEYQNDDERIIIFTDSYDVIFLSDLTEIIDKFKNMNARV 133

Query: 131 VFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYA 190
           +F AE  CWPD SL  KYP V  G R+LNSGGF+GYA DI  +++   IKN++DDQL+Y 
Sbjct: 134 LFSAEGACWPDRSLASKYPPVTRGKRFLNSGGFMGYASDIYAILTYAPIKNKDDDQLFYT 193

Query: 191 LLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGN 250
           L +LDE LR  HKI LD  + +FQNLY ++ D+KL F+  +   L NT YNT P+I+HGN
Sbjct: 194 LAYLDEKLREHHKIKLDHKSVIFQNLYLAVGDVKLKFEGGK-ASLLNTVYNTEPLILHGN 252

Query: 251 GKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEE 308
           G SK  LNS GNYLA +W    GC  C      L+   P  +P +LI++FI++PT FL E
Sbjct: 253 GYSKESLNSLGNYLANAWSPEEGCIMCWEGTIELNKTIPKSYPIILIAIFIERPTPFLNE 312

Query: 309 FLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARN 368
           FL  I   +YP  K+ +FV+NN EYH  + + ++ NF   +   K ++ N  +N  +ARN
Sbjct: 313 FLTTIYQQDYPKSKLHLFVHNNVEYHQDVVNSFMKNFGYEYNTSKLVSVNDAMNEVDARN 372

Query: 369 LAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNA 428
           LA++  L K    YF +DS SHLDN   LK LV +   +IAPLLVRP+K WSNFWGA+  
Sbjct: 373 LAMDYCLLKECSGYFSIDSVSHLDNKYTLKLLVEQQREIIAPLLVRPYKMWSNFWGAIMD 432

Query: 429 DGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAF 488
           DGFYARSFDYM+I+  ++  +G+WNVP+I+NCYL+ +++I     +  Y+   +D DMAF
Sbjct: 433 DGFYARSFDYMDIVKNER--RGLWNVPFISNCYLINSTLISNKETRPSYSEGDLDTDMAF 490

Query: 489 CTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSL 548
               R + I + + +  ++GHLV+ +++D   T+P++Y++I N LDW+ RYIH  Y ++ 
Sbjct: 491 AYANRERSIFMYVSNRLDFGHLVNPDSYDITMTHPDLYQIIDNKLDWERRYIHENYSENF 550

Query: 549 LPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL------------- 595
             +    QPCPDV+WFPIV E+F  E + +ME +G+WSDG+N+D RL             
Sbjct: 551 NSNQTPLQPCPDVYWFPIVNERFTKELIDVMENFGKWSDGSNHDPRLTGGYENVPTRDIH 610

Query: 596 ------------------------------------ETGYEAVPTRDIHMKQVGLAGVWA 619
                                               E GYEAVPTRDIHMKQVGL   W 
Sbjct: 611 MNQIKNEPQWLYFLKEYVRPLQELVFTGYYHDDPRIEGGYEAVPTRDIHMKQVGLHESWL 670

Query: 620 EFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
            FL +YV PLQE+ FIGY   P RA M+FVVRYRPDEQPSL+PHHDSSTYTINIALN+VG
Sbjct: 671 NFLDQYVSPLQEQVFIGYSTSPPRALMNFVVRYRPDEQPSLKPHHDSSTYTINIALNRVG 730

Query: 680 VDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           VDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT GTRYIMISFVDP
Sbjct: 731 VDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRVTSGTRYIMISFVDP 785


>gi|307183477|gb|EFN70276.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Camponotus
           floridanus]
          Length = 787

 Score =  768 bits (1983), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/782 (47%), Positives = 511/782 (65%), Gaps = 57/782 (7%)

Query: 7   LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKL--QVKTL 64
           + C +  C VF      ++    D +  LV TVASNETDG++R+++S EV K   +++ L
Sbjct: 9   IGCYLAWCCVFLTYHVVSEAPAADANDVLVFTVASNETDGFQRYLRSVEVYKFRDKLRIL 68

Query: 65  GLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
           GL +PW GG+ M+  GGGYK+NLLK  L++    +  I+L TDSYDVI  GG++ I+ERF
Sbjct: 69  GLGEPWRGGNVMTYAGGGYKINLLKKALEDYQNDEKKIVLFTDSYDVIFLGGLSAIVERF 128

Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
              DA ++F AE  CWPD SL   YP V  G RYLNSGGFIGYA D+ E++    IK+E+
Sbjct: 129 LDTDARVLFSAEVYCWPDRSLAIHYPTVSGGKRYLNSGGFIGYASDVYEILDKADIKDED 188

Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
           DDQL+Y  ++L + LRT+HKI LD  + +FQNL+G++ D++L F  +E  ++ N  YNT 
Sbjct: 189 DDQLFYTTVYLQDELRTRHKIKLDHKSEIFQNLFGAVADVELRFKGEE-AYVQNIVYNTV 247

Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWKTS-GCTRC-NLIKHLDSLKPDQFPSVLISVFIDK 301
           P+I+HGNG SK+ LNS GNYLA++W  + GC  C +    LD  KP+ +P +LI++FI++
Sbjct: 248 PLILHGNGFSKLVLNSLGNYLARAWTANEGCLACWDRTIELDKTKPETYPIILIAIFIEQ 307

Query: 302 PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTV 361
           PT FLEEF   I    YP  ++ +F++NN  YH  +  ++       + + K I  +  +
Sbjct: 308 PTPFLEEFFQAIHRQAYPKSRLHLFIHNNVPYHESVIYNFFEKTSREYLSGKQILPSDEI 367

Query: 362 NSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
           +  +AR LA+E+ L K    Y  VD+ +HLDN   LK LV +   ++APLL+RP+KAWSN
Sbjct: 368 SEVDARKLALEHCLLKECSGYLSVDAVAHLDNEHTLKLLVEQQRGIVAPLLIRPYKAWSN 427

Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS 481
           FWGA+  DGFYARSFDYM II  ++  +G+WNVP+++NCYL+  ++I     +  Y    
Sbjct: 428 FWGAITDDGFYARSFDYMEIIKNER--RGLWNVPFVSNCYLINATIIANKATRPSYEDAE 485

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIH 541
           +D +MAF    R +G+ + +++  ++GHLV+ +++D + T P++Y+++ N LDW+ RYIH
Sbjct: 486 LDTEMAFARTNRQRGLFMYLNNRLDFGHLVNPDSYDIRLTYPDMYQIMDNKLDWEKRYIH 545

Query: 542 PEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN----------- 590
           P Y ++  PD    QPCPDV+WFPI T +F  E + I+E +GQWSDG+N           
Sbjct: 546 PNYSENFNPDKKPIQPCPDVYWFPIATLRFTSELIGIVETFGQWSDGSNHDPRLTGGYEN 605

Query: 591 --------------------------------------NDKRLETGYEAVPTRDIHMKQV 612
                                                 +D RLE+GYEAVPTRDIHM QV
Sbjct: 606 VPTRDIHMNQIQYEQQWLYFLKEYVRPLQERVFTGYYHDDSRLESGYEAVPTRDIHMNQV 665

Query: 613 GLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
           GL   W +FL+ Y+ PLQ+  F GY   P R+ M+FVVRYRPDEQP LRPHHDSSTYTIN
Sbjct: 666 GLEDAWLKFLKDYISPLQQHVFTGYEDYPPRSLMNFVVRYRPDEQPFLRPHHDSSTYTIN 725

Query: 673 IALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
           IALNQ GVDYEGGGC+FIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT GTRYIMISFV
Sbjct: 726 IALNQAGVDYEGGGCKFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRVTAGTRYIMISFV 785

Query: 733 DP 734
           DP
Sbjct: 786 DP 787


>gi|332027746|gb|EGI67813.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Acromyrmex
           echinatior]
          Length = 786

 Score =  768 bits (1982), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/782 (48%), Positives = 506/782 (64%), Gaps = 58/782 (7%)

Query: 7   LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVK--TL 64
           + C ++ C +F ++ H       D +  LV TVASNETDG+KR+++S E++    K   L
Sbjct: 9   IGCWLMWCYIF-LTYHVVSETPADVNDVLVFTVASNETDGFKRYLRSTEIHGFHDKLNVL 67

Query: 65  GLHQPWLGGDMSS-LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
           GL +PW GG++    GGGYK+NLLK  L++    +  IIL TDSYDVI  G ++ I+ERF
Sbjct: 68  GLGEPWKGGNVVRYAGGGYKINLLKKALEDYQNDEKKIILFTDSYDVIFLGDLSIIVERF 127

Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
              DA ++F AE  CWPD SL  +YP V  G RYLNSGGFIGYA D+ +++    IK+E+
Sbjct: 128 LDTDARVLFSAEAYCWPDKSLATQYPPVSRGKRYLNSGGFIGYASDVYKILETAVIKDED 187

Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
           DDQL+Y  ++L + LR ++KI LD  + +FQNLYG++ D++L F  +E  +L N  YNT 
Sbjct: 188 DDQLFYTTVYLQDELRLRYKIKLDHKSEIFQNLYGAVADVELRFKGEE-AYLQNIVYNTV 246

Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWKT-SGCTRC-NLIKHLDSLKPDQFPSVLISVFIDK 301
           P+++HGNG SK+ LNS GNYLA++W    GC  C +    LD +K   +P +LI++FI++
Sbjct: 247 PLVLHGNGPSKLVLNSLGNYLARAWTPDEGCLACWDQTIELDKIKSKTYPVILIAIFIER 306

Query: 302 PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTV 361
           PT FLEEF   I    YP  K+ +F++NN  YH  + DD+       + + K I  +  V
Sbjct: 307 PTPFLEEFFRAIYRQYYPKSKLHLFIHNNVPYHEDVVDDFFEKIGQEYLSAKRILPSDDV 366

Query: 362 NSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
           +  +AR LA+E+ L K    Y  +D+ +HLDN   LK LV +   ++APLL+RPFKAWSN
Sbjct: 367 SEVDARKLAMEHCLLKECSGYLSIDAVAHLDNEHTLKLLVEQQRGIVAPLLIRPFKAWSN 426

Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS 481
           FWGA+  DGFYARSFDYM II  ++  +G+WNVP+++NCYL+  ++I     +  Y    
Sbjct: 427 FWGAITDDGFYARSFDYMEIIKNER--RGLWNVPFVSNCYLINATIIANKATRPTYEAGD 484

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIH 541
           +D +MAF    R +G+ + +++  E+GHLVD + +D + T P++Y++I N LDW+ RYIH
Sbjct: 485 LDTEMAFAHGNRQRGLFMYVNNRLEFGHLVDPDTYDIRLTYPDIYQIIENKLDWEKRYIH 544

Query: 542 PEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN----------- 590
             Y ++  PD    QPCPDV+WFPIV  +F  E V I+E +GQWSDGTN           
Sbjct: 545 SNYSENFNPDNKPIQPCPDVYWFPIVNLRFTKELVGIVETFGQWSDGTNHDPRLSGGYEN 604

Query: 591 --------------------------------------NDKRLETGYEAVPTRDIHMKQV 612
                                                 +D RLE+GYEAVPTRDIHM QV
Sbjct: 605 VPTRDIHMNQVQYDQQWLYFLKEYVRPLQEFIFTGYFHDDPRLESGYEAVPTRDIHMNQV 664

Query: 613 GLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
           GL   W +FL+ YV PLQE  F GY+  P R+ M+FVVRYRPDEQ SLRPHHDSSTYTIN
Sbjct: 665 GLQDAWLKFLKDYVNPLQEHVFTGYNDYPPRSLMNFVVRYRPDEQSSLRPHHDSSTYTIN 724

Query: 673 IALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
           IALNQ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISFV
Sbjct: 725 IALNQAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLLVTAGTRYIMISFV 784

Query: 733 DP 734
           DP
Sbjct: 785 DP 786


>gi|350421684|ref|XP_003492923.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           isoform 3 [Bombus impatiens]
          Length = 785

 Score =  766 bits (1977), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/783 (48%), Positives = 507/783 (64%), Gaps = 58/783 (7%)

Query: 7   LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTL 64
           + C +   +     V    + + D+D  LV T+ASNETDGYKR+++S  V   +  ++ L
Sbjct: 6   IGCCLFWSLFLTYHVFSETLPSTDKDDVLVFTIASNETDGYKRYLRSVNVYGFRDNLRVL 65

Query: 65  GLHQPWLGGD--MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILER 122
           GL +PWLGGD   +S GGGYKVNLLK  L+     D  I++ TDSYDVI    + +I+ +
Sbjct: 66  GLGEPWLGGDNIKTSAGGGYKVNLLKKALENYGDDDQKIVIFTDSYDVIYLSDLTEIINK 125

Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNE 182
           F + DA ++F AE  CWPD SL  KYP+   G R+LNSGGF+GYA D+  ++++  IKN+
Sbjct: 126 FKSMDARVLFSAEGSCWPDKSLASKYPSAALGKRFLNSGGFVGYASDVYAILTHAPIKNK 185

Query: 183 EDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNT 242
           +DDQL+Y L +LDE LR +HKI LD  + +FQNLYG++ D++L F+  +   L NT Y+T
Sbjct: 186 DDDQLFYTLAYLDEELRERHKIKLDHKSEIFQNLYGAVADVELKFEGGK-ASLLNTVYST 244

Query: 243 NPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFID 300
            P+I+HGNG SK+ LNS GNYLA +W    GC  C      LD   P+ +P +LI++FI+
Sbjct: 245 EPLILHGNGYSKLSLNSLGNYLAHAWSPEEGCVMCWEETIELDRTTPESYPIILIAIFIE 304

Query: 301 KPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNST 360
           +PT FL EFL+ I    YP  K+ + ++NN EYH  + D+++      + + K I+ N  
Sbjct: 305 RPTPFLTEFLSAIYQQAYPKSKLHLLIHNNVEYHQDVVDNFMKKVGREYNSSKQISVNDA 364

Query: 361 VNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
           +N  +ARNLA++  L K    YF +DS SHLDN   LK L+ +   +IAPLLVRP+K WS
Sbjct: 365 MNEVDARNLAMDYCLLKECSGYFSIDSVSHLDNEHTLKLLIEQQRDIIAPLLVRPYKMWS 424

Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN 480
           NFWGA+  DGFYARSFDY+ I+N ++  +G+WNVP+I+NCYL+  ++I     +  Y+  
Sbjct: 425 NFWGAIMDDGFYARSFDYIEIVNNER--RGLWNVPFISNCYLINATLISNKETRPSYSEG 482

Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
            +D +MAF    R + I + + +  ++GHLVD +N+D   T+P+ Y+++ N LDW+  YI
Sbjct: 483 DLDTEMAFAYANRERNIFMYVSNRVDFGHLVDPDNYDVTVTHPDFYQILNNKLDWEKTYI 542

Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL----- 595
           H  Y ++  P+    Q CPDV+ FPIV E+F  E + IME +G+WSDG+N+D RL     
Sbjct: 543 HENYSENFNPNKTPVQVCPDVYRFPIVNERFTKELIDIMETFGKWSDGSNHDPRLTGGYE 602

Query: 596 --------------------------------------------ETGYEAVPTRDIHMKQ 611
                                                       E GYEAVPTRDIHMKQ
Sbjct: 603 NVPTRDIHMNQVKYEPQWLYFLKEYVRPLQELVFAGYYHDDPRIEGGYEAVPTRDIHMKQ 662

Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
           +GL   W  FL +YV PLQE  FIGY+  P RA M+FVVRYRPDEQPSL+PHHDSSTYTI
Sbjct: 663 IGLHESWLNFLYEYVSPLQEHVFIGYNTNPPRALMNFVVRYRPDEQPSLKPHHDSSTYTI 722

Query: 672 NIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISF 731
           NIALN+ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT GTRYIMISF
Sbjct: 723 NIALNRAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRVTSGTRYIMISF 782

Query: 732 VDP 734
           VDP
Sbjct: 783 VDP 785


>gi|170052410|ref|XP_001862209.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Culex
           quinquefasciatus]
 gi|167873364|gb|EDS36747.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Culex
           quinquefasciatus]
          Length = 723

 Score =  763 bits (1969), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/729 (50%), Positives = 505/729 (69%), Gaps = 22/729 (3%)

Query: 10  LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
           L+L+CV    S  C   + +     LV TVASNET+ Y R+I+SA+   ++V TLGL +P
Sbjct: 13  LLLACV----SHLCVGEEKLPGKAPLVFTVASNETEAYLRYIRSAKRYGIEVTTLGLGKP 68

Query: 70  WLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDAN 129
           W GGDM  LGGGYK+NLL++ L      DD I+L TDSYDV+    +  I+E+F TF+A+
Sbjct: 69  WQGGDMKKLGGGYKINLLRSALKPYKSDDDRIVLFTDSYDVLFLASLEKIVEKFETFEAS 128

Query: 130 IVFGAERLCWPDTSLYDKYPAV-GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
           I+FG+E  CWPD  L +KYP + G G R+LNSG F+GYA  + +++ N  +K+ +DDQLY
Sbjct: 129 ILFGSEGFCWPDPELKNKYPVLEGRGTRFLNSGLFMGYASKVYQMLKN-PVKDTDDDQLY 187

Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLD-EFVHLTNTKYNTNPVII 247
           Y  +++D+ LR +  + LD  A LFQN+ G  E I L  D D +   L NT+Y+TNP+I+
Sbjct: 188 YTKIYIDQQLREELNMKLDHTAALFQNMNGVEEQITLALDPDSKEAFLKNTEYSTNPLIV 247

Query: 248 HGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLE 307
           HGNG SKI LN + NYLA ++    C      ++L  L  +  P V++++F++K T F+E
Sbjct: 248 HGNGPSKITLNGYANYLAGAFVDGECQTVK--ENLIELDEENLPKVMVALFVEKATPFIE 305

Query: 308 EFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEAR 367
           E+   IA LNYP +K+ +F++NN ++H P  D +I  +   +++ + + ++        R
Sbjct: 306 EWFENIAKLNYPKQKMDVFIHNNVDHHKPTIDQFIKQYTEEYRSFRMVDYSEDFEELAGR 365

Query: 368 NLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN 427
           +LAV   L K  D+ F VD+D H+D+PD L+ L+  N  +I+P+L RP K WSNFWGAL+
Sbjct: 366 SLAVNQCLKKKCDYLFVVDADGHIDDPDTLRRLITLNRDIISPVLTRPEKVWSNFWGALS 425

Query: 428 ADGFYARSFDYMNIINGDQGGK--GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD 485
           + GFYARS DYM+I+    G K  G+WNVP+I+  YL+K+S    ++   I        D
Sbjct: 426 SQGFYARSSDYMDIV----GRKILGLWNVPFISTVYLVKSSSSVTSSPTPIP-------D 474

Query: 486 MAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQ 545
           MA C ++R KGI + + +T+++GHL+DS+ +D  +T+P+ Y+L  N  DW+ +YI  EY 
Sbjct: 475 MALCWHMRAKGIFMHVVNTEQFGHLIDSDYYDANRTHPDFYQLFNNKYDWERKYISAEYH 534

Query: 546 KSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTR 605
           K L  D V  QPCPDV+WF I TEKFC    +I+EA+G+WSDGT++DKRL+ GYEAVPTR
Sbjct: 535 KQLEKDFVPVQPCPDVYWFSIGTEKFCDHLREIVEAFGKWSDGTHSDKRLQGGYEAVPTR 594

Query: 606 DIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHD 665
           DIHM QVGL  VW +FL+ YV PLQE+ FIGY H+P R+ M+FVVRYRPDEQPSLRPHHD
Sbjct: 595 DIHMNQVGLEQVWLKFLQLYVKPLQEKVFIGYFHDPPRSLMNFVVRYRPDEQPSLRPHHD 654

Query: 666 SSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
           SSTYTIN+ALN  GVDYEGGGC+F+RYNC+VT TR GWMLMHPGRLTH+HEGL  T+GTR
Sbjct: 655 SSTYTINVALNTAGVDYEGGGCKFLRYNCSVTDTRKGWMLMHPGRLTHFHEGLLTTKGTR 714

Query: 726 YIMISFVDP 734
           YIMISFVDP
Sbjct: 715 YIMISFVDP 723


>gi|340726796|ref|XP_003401739.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           isoform 2 [Bombus terrestris]
          Length = 785

 Score =  762 bits (1967), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/783 (48%), Positives = 505/783 (64%), Gaps = 58/783 (7%)

Query: 7   LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTL 64
           + C +   +     V    + + D+D  LV T+ASNETDGYKR+++S  V      ++ L
Sbjct: 6   IGCCLFWSLFLTYHVFSETLPSTDKDDVLVFTIASNETDGYKRYLRSVNVYGFHDNLRVL 65

Query: 65  GLHQPWLGGD--MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILER 122
           GL +PWLGGD   +S GGGYKVNLLK  L+     D  I++ TDSYDVI    + +I+ +
Sbjct: 66  GLGEPWLGGDNIKTSAGGGYKVNLLKKALENYGDDDQKIVIFTDSYDVIYLSDLTEIINK 125

Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNE 182
           F + DA ++F AE  CWPD SL  KYP    G R+LNSGGF+GYA D+  ++++  IKN+
Sbjct: 126 FKSMDARVLFSAEGSCWPDKSLASKYPPATLGKRFLNSGGFVGYASDVYAILTHAPIKNK 185

Query: 183 EDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNT 242
           +DDQL+Y L +LDE LR +HKI LD  + +FQNLYG++ D++L F+  +   L NT YNT
Sbjct: 186 DDDQLFYTLAYLDEELRERHKIKLDHKSEIFQNLYGAVADVELKFEGGK-ASLLNTVYNT 244

Query: 243 NPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFID 300
            P+I+HGNG SK+ LNS GNYLA++W    GC  C      LD +    +P +LI++FI+
Sbjct: 245 EPLILHGNGYSKLSLNSLGNYLARAWSPEEGCVMCWEETIELDRIISQSYPIILIAIFIE 304

Query: 301 KPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNST 360
           +PT FL EFL+ I    YP  K+ + ++NN EYH  + D+++   +  + + K I+ N  
Sbjct: 305 RPTPFLSEFLSAIYQQAYPKSKLHLLIHNNVEYHQDVLDNFMKKVEKEYNSSKQISVNDA 364

Query: 361 VNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
           ++  +ARNLA++  L K    YF +DS SHLDN   LK LV +   +IAPLLVRP+K WS
Sbjct: 365 MSEVDARNLAMDYCLLKECSGYFSIDSVSHLDNEHTLKLLVEQQRDIIAPLLVRPYKMWS 424

Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN 480
           NFWGA+  DGFYARSFDY+ I+  ++  +G+WNVP+I+NCYL+  ++I     +  Y+  
Sbjct: 425 NFWGAIMDDGFYARSFDYIEIVKNER--RGLWNVPFISNCYLINATLISNKETRPSYSEG 482

Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
            +D +MAF    R + I + + +  ++GHLVD +N+D   T+P+ Y+++ N LDW+  YI
Sbjct: 483 DLDTEMAFAYANRERNIFMYVSNRVDFGHLVDPDNYDVTVTHPDFYQILNNKLDWEKTYI 542

Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL----- 595
           H  Y ++  P+    Q CPDV+ FPIV E+F  E + IME +G+WSDG+N+D RL     
Sbjct: 543 HENYSENFNPNKTPVQVCPDVYRFPIVNERFTKELIDIMETFGKWSDGSNHDPRLTGGYE 602

Query: 596 --------------------------------------------ETGYEAVPTRDIHMKQ 611
                                                       E GYEAVPTRDIHMKQ
Sbjct: 603 NVPTRDIHMNQVKYEPQWLYFLKEYVRPLQELVFAGYYHDDPRIEGGYEAVPTRDIHMKQ 662

Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
           +GL   W  FL +YV PLQE  FIGY+  P RA M+FVVRYRPDEQPSL+PHHDSSTYTI
Sbjct: 663 IGLHESWLNFLYEYVSPLQEHVFIGYNTNPPRALMNFVVRYRPDEQPSLKPHHDSSTYTI 722

Query: 672 NIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISF 731
           NIALN+ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT GTRYIMISF
Sbjct: 723 NIALNRAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRVTSGTRYIMISF 782

Query: 732 VDP 734
           VDP
Sbjct: 783 VDP 785


>gi|345484574|ref|XP_001601697.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           [Nasonia vitripennis]
          Length = 775

 Score =  760 bits (1963), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/777 (49%), Positives = 513/777 (66%), Gaps = 61/777 (7%)

Query: 13  SCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKL--QVKTLGLHQPW 70
           +CV+  + + C  V   + D  LV TVA+NET+G++R+++S EVN     V+ LGL Q W
Sbjct: 5   TCVLLAV-LAC--VAAEETDDALVFTVATNETEGFRRYLRSTEVNGFGDNVRVLGLGQAW 61

Query: 71  LGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFD-A 128
            GG++    GGG KVNLLK  ++E+    D I+L TDSYDVI    +  I  +F  +D A
Sbjct: 62  RGGEIKLYAGGGQKVNLLKEAIEEIKDDPDQIVLFTDSYDVIFLSSLEKISRKFKEWDDA 121

Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
            ++F AE  CWP  SL  +YP V  G R+LNSGGFIGYA DI  ++++  IK+++DDQL+
Sbjct: 122 RVIFSAEEYCWPLKSLASEYPQVKRGKRFLNSGGFIGYAPDIYAILTSAEIKDDDDDQLF 181

Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIH 248
           Y  ++L+  LR KHKI LD  + +FQNL G++ DI+L F  +E  ++ NT YNT P+IIH
Sbjct: 182 YTKVYLNSELREKHKIKLDHKSEIFQNLNGAIHDIELRFKGNE-AYVQNTAYNTVPLIIH 240

Query: 249 GNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFIDKPTAFL 306
           GNG SK+ LNS GNY+A++W    GC  C +    LD    + +P +LI++FI+KPT FL
Sbjct: 241 GNGFSKLLLNSLGNYVAQAWSPEEGCLSCWDRTIELDVKNAEAYPKILIAIFIEKPTPFL 300

Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
           EEFLNKI +  YP +K+  F+ NN  YH  L D+++      +++VK I     +    A
Sbjct: 301 EEFLNKIKDQRYPKEKLHFFIRNNVPYHEKLIDEFVEKHGDEYQSVKQIKPEDEIAEAAA 360

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           RNLA+ + L      YF +DS+SHLDN + L+ LV +   ++APLLVRPFKAWSNFWGA+
Sbjct: 361 RNLAMNHCLSVKCSGYFSIDSESHLDNVNTLELLVEQQRGIVAPLLVRPFKAWSNFWGAI 420

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDM 486
             DGFYARS DYM+II+ ++  +G+WNVP++++CYL+  ++++    +  Y    +D +M
Sbjct: 421 TDDGFYARSSDYMDIIHHER--RGLWNVPFVSSCYLINATLLENEATRPSYAEADLDAEM 478

Query: 487 AFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
           AF    R + I + +++  ++GHLV+ E F+   TNP++Y++  N LDW+ RYIH  Y  
Sbjct: 479 AFAYANRRRDIFMYVNNRLDFGHLVNPETFNISLTNPDMYQMFDNKLDWEKRYIHVNYSD 538

Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN---------------- 590
           + LP+    QPCPDV+WFPIVTE+F  +FV+IMEAYG+WSDG+N                
Sbjct: 539 NFLPENKPVQPCPDVYWFPIVTERFNKDFVEIMEAYGKWSDGSNYDPRLSNGYENVPTRD 598

Query: 591 ---------------------------------NDKRLETGYEAVPTRDIHMKQVGLAGV 617
                                            +D RLE GYEAVPTRDIHM QVGL   
Sbjct: 599 IHMNQVGLESQWLFFLRNYVKPLQELVFLGYFHDDPRLENGYEAVPTRDIHMTQVGLDES 658

Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
           W EFLR YV PLQ+  F GY+  P R+ M+FVVRYRPDEQPSL+PHHDSSTYTINIALN+
Sbjct: 659 WLEFLRVYVNPLQQAVFTGYYDYPPRSLMNFVVRYRPDEQPSLKPHHDSSTYTINIALNK 718

Query: 678 VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           VGVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT+GTRYIMISFVDP
Sbjct: 719 VGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLKVTKGTRYIMISFVDP 775


>gi|321459829|gb|EFX70878.1| hypothetical protein DAPPUDRAFT_217067 [Daphnia pulex]
          Length = 737

 Score =  756 bits (1951), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/705 (50%), Positives = 486/705 (68%), Gaps = 8/705 (1%)

Query: 33  KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNEL 91
           KFL++TVA+ ET GYKR+ +S  +N L VK LGL + W GGDM+ S+GGG KV +L+ E+
Sbjct: 38  KFLILTVATEETSGYKRYQRSVRINGLPVKVLGLGEEWKGGDMANSVGGGQKVLMLRKEV 97

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
           +      + II+ TDSYDV+ +     I+E+F  F+A ++F AE  CWPD +L  KYP V
Sbjct: 98  ELHKDDPEKIIMFTDSYDVLFNANEEKIVEQFLQFNARVLFSAEGFCWPDPTLASKYPEV 157

Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
             G R+LNSG F+GYA ++ +++++  I N++DDQL+Y  +FLDE  R +  I LD  + 
Sbjct: 158 ERGKRFLNSGLFMGYAPELHQILNSGEIANDDDDQLFYTKVFLDEKKRQELNIKLDHRSE 217

Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTS 271
           +FQNL G++ D++L F      HL NT YNT P++IH NG +K+ LN+ GNYL KSW + 
Sbjct: 218 IFQNLNGAVSDVELRFIES---HLQNTVYNTVPLVIHANGPTKLFLNTLGNYLPKSWNSE 274

Query: 272 -GCTRC-NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
            GC  C   +  L+  KP  FP V++ +FI+ PT F EEFL+K   L+YP  KI ++++N
Sbjct: 275 EGCLNCWEDMNSLEKKKPKDFPKVVVGMFIENPTPFFEEFLHKFLALSYPKDKIHLYIHN 334

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
              YH      ++ +    + +VK + H   V    ARN  +E  L K  ++YF VD+ +
Sbjct: 335 GVSYHGKQITGFVESHGAEYASVKLVNHEENVKEWHARNTGIEECLKKKCEYYFNVDALA 394

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
           H+DNP  LK L+ +N  ++AP+++RP++AWSNFWG+L  DGFYARS DYM I+ G++  +
Sbjct: 395 HIDNPHTLKLLIEQNRPVVAPMMIRPYQAWSNFWGSLTTDGFYARSIDYMEIVKGER--R 452

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           G+WNVP++T+ YL++  +I     K  Y  N +D DMAFCTN+RN  ++L + +  ++GH
Sbjct: 453 GLWNVPFVTSVYLVRGDIIHNPKTKPSYIHNLLDADMAFCTNMRNNDVYLFVTNRLDWGH 512

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           L+  +NF+    N E+YE+  N  DW+ RY+H  Y ++L  +   + PCPDV+WFP+ TE
Sbjct: 513 LITVDNFETTHLNNELYEIQNNRWDWEKRYLHVNYSQNLNMELNVSMPCPDVYWFPMTTE 572

Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
           +F  E V  ME +GQWSDGTN D RLE GYE VPTRDIHM+Q+G+   W  FLR YV PL
Sbjct: 573 RFADELVGEMENFGQWSDGTNTDPRLEGGYENVPTRDIHMRQIGMDRHWLAFLRDYVRPL 632

Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
           QER F+GY H P R+ M+FVVRYRPDEQP L+PHHDSSTYTIN+ALN+  +D+EGGGCRF
Sbjct: 633 QERVFVGYQHYPPRSVMNFVVRYRPDEQPFLKPHHDSSTYTINLALNRPQIDFEGGGCRF 692

Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           +RYNC+V  TR GWMLMHPGRLTHYHEGL  T+GTRYIMISFVDP
Sbjct: 693 VRYNCSVLDTRKGWMLMHPGRLTHYHEGLYTTKGTRYIMISFVDP 737


>gi|307195418|gb|EFN77304.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Harpegnathos
           saltator]
          Length = 793

 Score =  754 bits (1946), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/789 (47%), Positives = 516/789 (65%), Gaps = 65/789 (8%)

Query: 7   LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQ--VKTL 64
           + C ++ C VF ++ H       D +  LV TVAS+ETDG++R+++SAE+   +  +K L
Sbjct: 9   IGCWLVWCYVF-LTYHVVSEAPADTNDVLVATVASDETDGFRRYLRSAEIYGFRDNLKIL 67

Query: 65  GLHQPWLGGDMSS-LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
           GL + W GG++SS  GGGYKVNLL+  L++    ++ I+L TDSYDVI  GG++ I+ERF
Sbjct: 68  GLGESWKGGNVSSGPGGGYKVNLLRKALEDYRDDENKIVLFTDSYDVIFLGGLSAIVERF 127

Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
               A I+F AE  CWPD SL  +YPAV  G RYLNSG FIGYA D+  ++   SI++E+
Sbjct: 128 LDTGARILFSAEGYCWPDKSLASQYPAVSRGKRYLNSGSFIGYATDLLAILDTVSIEDED 187

Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
           DDQL Y  ++L++ LR +H+I LD  +++FQNL+G++ D++L F  +E  +L N  YNT 
Sbjct: 188 DDQLLYTNVYLNDELRARHRIKLDHKSDIFQNLFGAVADVELRFKGEE-AYLQNIVYNTV 246

Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWKT-SGCTRC-NLIKHLDSLKPDQFPSV-LISVFID 300
           P+++HGNG SK+ LNS GNY+A++W    GC  C +    LD  KP+ +P++ LI++FI+
Sbjct: 247 PLVLHGNGHSKLVLNSLGNYVARAWTPDEGCLACWDQTVELDKTKPEMYPAIILIALFIE 306

Query: 301 KPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNST 360
           +PT FLEEF   I   +YP  K+ +F++N   +H  +  D+    K  + +V YI+    
Sbjct: 307 RPTPFLEEFFEAIYRQSYPKSKLHLFIHNAVSHHDGVVTDFYERAKREYVDVNYISVKQG 366

Query: 361 VNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
           VN   AR LA+++        YF VD+ +HLDN   LK LV +   ++APLL+RP+KAWS
Sbjct: 367 VNEVHARKLAMKHCAFNKCSGYFSVDAVAHLDNEHTLKLLVEQQRRIVAPLLIRPYKAWS 426

Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIY--- 477
           NFWGA+  DGFYARSFDYM II  ++  +G+WNVP+++NCYL+  +++   + +  Y   
Sbjct: 427 NFWGAITDDGFYARSFDYMEIIKNER--RGLWNVPFVSNCYLINATILNDESTRPFYGNP 484

Query: 478 ---TLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLD 534
              +   MD +MAF    R+ G+ + + +  ++GHLV+ + +D + T PE+Y+++ N LD
Sbjct: 485 DGNSDADMDSEMAFAQRNRHAGVFMYVSNRLDFGHLVNPDTYDIKLTYPEMYQIMDNKLD 544

Query: 535 WDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN---- 590
           WD RYIH +Y +S  PD    QPCPDV+WFPIVT +F +E + I+EA+GQWSDG+N    
Sbjct: 545 WDRRYIHAKYSESFNPDNKPIQPCPDVYWFPIVTRRFTNELIGIVEAFGQWSDGSNHDPR 604

Query: 591 ---------------------------------------------NDKRLETGYEAVPTR 605
                                                        +D RLETGYEAVPTR
Sbjct: 605 LSGGYENVPTRDIHMNQVQYEQQWLYFLKEYVRPLQELVFTGYYHDDPRLETGYEAVPTR 664

Query: 606 DIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHD 665
           DIHM QV L   W +FL+ YV PLQ+  F GY   P R+ M+FVV+YRPDEQP LRPHHD
Sbjct: 665 DIHMNQVDLQDAWLKFLKDYVSPLQQLVFTGYDDYPPRSLMNFVVKYRPDEQPYLRPHHD 724

Query: 666 SSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
           SSTYTINIALNQ GVDYEGGGCRFIRYNC+VT T+ GWMLMHPGRLTHYHEGL+VT GTR
Sbjct: 725 SSTYTINIALNQAGVDYEGGGCRFIRYNCSVTDTKPGWMLMHPGRLTHYHEGLRVTAGTR 784

Query: 726 YIMISFVDP 734
           YIMISFVDP
Sbjct: 785 YIMISFVDP 793


>gi|347966056|ref|XP_321614.4| AGAP001507-PA [Anopheles gambiae str. PEST]
 gi|333470231|gb|EAA00870.4| AGAP001507-PA [Anopheles gambiae str. PEST]
          Length = 727

 Score =  752 bits (1942), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/702 (51%), Positives = 490/702 (69%), Gaps = 11/702 (1%)

Query: 35  LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDEM 94
           L+ TVASN T+GY R+++SA+   L V TLG+ +PWLGG+M S+GGGYK+NLL+  L   
Sbjct: 35  LIFTVASNATEGYVRYLRSAKHYDLTVTTLGMGKPWLGGNMKSVGGGYKINLLREALKPY 94

Query: 95  DITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV-GS 153
               D ++L TDSYDV+       I E+F +F+A+I+FGAE  CWPD SL   YP + G 
Sbjct: 95  RADKDRLVLFTDSYDVLFLAPWAKIQEKFASFEASILFGAEGFCWPDESLKSAYPPLEGR 154

Query: 154 GYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLF 213
           G RYLNSG F+GYA  + +L+    +K+ EDDQLYY   +LDE LR +  I LD +A LF
Sbjct: 155 GMRYLNSGLFMGYADKLYKLLKT-PVKDAEDDQLYYTKAYLDEELRQELNIKLDHMATLF 213

Query: 214 QNLYGSLEDIKLNFDLDEF-VHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSG 272
           QNL G  E + L+ +  E    L N++YNT P I+HGNG SK+ LNS+ NYLA ++    
Sbjct: 214 QNLNGVEEQVVLSLEPSEKEATLANSEYNTKPAIVHGNGPSKLTLNSYANYLAGAFVDGE 273

Query: 273 CTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQE 332
           C      +   +L   + P V +++F++KPT FLEE+   IA LNYPA ++ + V++N  
Sbjct: 274 CQTVKEGRL--TLSGGELPLVTMALFVEKPTPFLEEWFGTIAKLNYPADRLDVLVHSNVA 331

Query: 333 YHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLD 392
           YHA     ++   +  ++++K I H+       ARN A ++   +G D+ F VDS+ HLD
Sbjct: 332 YHAGTVKAFLDAQEGRYRSLKVIEHDGDFTETAARNFATKHCELRGCDYLFVVDSEGHLD 391

Query: 393 NPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIW 452
           +P+VL+ L+  N ++IAP+L RP K WSNFWGAL+  GFYARS DYM+I+   +   G+W
Sbjct: 392 DPNVLRALIEANRNVIAPVLTRPEKVWSNFWGALSGQGFYARSNDYMDIVG--RKLLGLW 449

Query: 453 NVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVD 512
           NVP+++  YL+K +V+   +    Y L   D DMA C + R+KGI + + + ++YGHL+D
Sbjct: 450 NVPFVSIVYLVKRAVLPEVS----YELQETDPDMALCWHFRSKGIFMHVINVEQYGHLID 505

Query: 513 SENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFC 572
           +E FD  +T+P+ Y+L  N  DW+ RY+ P Y++ L  D V  QPCPDV+WF I +++FC
Sbjct: 506 TEYFDMTRTHPDFYQLFNNRHDWEQRYLAPGYKQQLEADFVPQQPCPDVYWFAIGSDRFC 565

Query: 573 HEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQER 632
            +  +I+EA+G+WSDG+++DKRL+ GYEAVPTRDIHM QVGL  +W +FL+ YV PLQE+
Sbjct: 566 DDLREIVEAFGEWSDGSHSDKRLQGGYEAVPTRDIHMNQVGLEQLWLKFLQLYVRPLQEK 625

Query: 633 EFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRY 692
            FIGY H+P R+ M+FVVRYRPDEQPSLRPHHDSSTYTINIALN  GVDYEGGGCRF+RY
Sbjct: 626 VFIGYFHDPPRSLMNFVVRYRPDEQPSLRPHHDSSTYTINIALNTAGVDYEGGGCRFLRY 685

Query: 693 NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           NC+VT TR GWML+HPGRLTH+HEGL  T+GTRYIMISFVDP
Sbjct: 686 NCSVTDTRKGWMLLHPGRLTHFHEGLLTTKGTRYIMISFVDP 727


>gi|427783339|gb|JAA57121.1| Putative procollagen-lysine 2-oxoglutarate 5-dioxygenase
           [Rhipicephalus pulchellus]
          Length = 772

 Score =  749 bits (1935), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/704 (50%), Positives = 487/704 (69%), Gaps = 6/704 (0%)

Query: 33  KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNEL 91
           + ++ TVAS+ETDG+KRF +SA+V  L+ K LG+H+ WLGGDM+  +GGGYKV LLK  L
Sbjct: 73  RLVIFTVASDETDGFKRFARSAKVYGLEPKILGMHEEWLGGDMAKGMGGGYKVRLLKKAL 132

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
           ++       +I+  DSYDV+   G ++IL +F  F++N+VF AE  CWPD SL + YP  
Sbjct: 133 EDYKNDAATLIMFVDSYDVVFTAGEDEILRKFYKFNSNVVFSAEGFCWPDRSLAEAYPK- 191

Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
            +G R+LNSGGFIGYA  +  ++S+  ++++ DDQL+Y  ++L+E LR K  I LD  A 
Sbjct: 192 ANGERFLNSGGFIGYAPQLYSIVSSSDLEDDADDQLFYTKIYLNEDLRRKWGIRLDHKAE 251

Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
           +FQNL G++ D++L   LD   +L N+ Y T P++IHGNG SK+ LN+ GNYLAKSW   
Sbjct: 252 IFQNLNGAVGDVEL-LGLDSEPYLHNSAYGTTPLVIHGNGPSKVILNNLGNYLAKSWNDM 310

Query: 271 SGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN 330
           +GC  C     L      + P VLI +F++ PT FL+E L K+ NLNYP +KI +FV+N 
Sbjct: 311 AGCRVCYDTFSLSDKLDSELPKVLIGIFVEHPTPFLKEALQKVYNLNYPKEKIHLFVHNA 370

Query: 331 QEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           +E+H      ++  +   + +VKY+  +       ARNLA+E  L    D+ F+VDS++H
Sbjct: 371 EEFHDAEVTKFVEEYGPAYHSVKYLDVSEAKKEWHARNLALEQCLKINCDYAFFVDSEAH 430

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           LDNPD L+ L+  N +++APLL R    WSNFWG+L+ADG+YARS DY++++  ++  KG
Sbjct: 431 LDNPDTLRLLIETNRTIVAPLLSRHKSLWSNFWGSLSADGYYARSHDYVSLVKRER--KG 488

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
           IWNVP++   YL+  S++K+      +    +D DMAFC N+R++GI + + +   YGHL
Sbjct: 489 IWNVPFVNGAYLINGSLVKSREKFPSFINGLLDPDMAFCKNMRDRGIFMFMTNMDNYGHL 548

Query: 511 VDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEK 570
           +++E FD +  NP+ YE+  N  DW+ RY+H  Y K L P    + PCPDV+WFP+V+E 
Sbjct: 549 INAETFDTRHKNPDFYEIYSNQKDWERRYLHENYTKVLDPSYKVDMPCPDVYWFPVVSET 608

Query: 571 FCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQ 630
           FC   +QIME +G+WS GTN D+RL  GYE VPTRDIHM QVGL   W  FLR+Y+ P+Q
Sbjct: 609 FCEHLIQIMENFGKWSSGTNEDERLAGGYENVPTRDIHMNQVGLEQHWLYFLREYIRPVQ 668

Query: 631 EREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI 690
           E+ F+GY H+P +A M+FVVRY P+EQ  LRPHHDSSTYTINIALN+  +DYEGGGC F+
Sbjct: 669 EKVFLGYFHDPPKAIMNFVVRYHPEEQYFLRPHHDSSTYTINIALNRPHIDYEGGGCHFL 728

Query: 691 RYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RYNC+V   + GW LMHPGRLTHYHEGL VT+GTRYIM+SFVDP
Sbjct: 729 RYNCSVVDLKRGWSLMHPGRLTHYHEGLPVTKGTRYIMVSFVDP 772


>gi|242016159|ref|XP_002428703.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor,
           putative [Pediculus humanus corporis]
 gi|212513374|gb|EEB15965.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor,
           putative [Pediculus humanus corporis]
          Length = 661

 Score =  724 bits (1870), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/665 (51%), Positives = 472/665 (70%), Gaps = 9/665 (1%)

Query: 75  MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGA 134
           M S GGG K+NL + E+++     + II+ TDSYDVI   G+NDILE+F+     +VFGA
Sbjct: 1   MKSTGGGQKINLFREEVEKYKNDHEKIIIFTDSYDVIFLAGLNDILEQFDKIGGRVVFGA 60

Query: 135 ERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFL 194
           E  CWPD +L  +YP    G +YLNSGG IGYA ++ E++++RSI +++DDQL+Y   +L
Sbjct: 61  EPFCWPDKNLASQYPIQSRGKQYLNSGGIIGYAPELYEILTHRSIDDDDDDQLFYTQAYL 120

Query: 195 DETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKS- 253
           +ETLR   KI LD  + +F NL+G+++++ L F   E  +L N +  ++P+I+HGNG + 
Sbjct: 121 NETLRNNLKIKLDHKSQIFHNLHGAMDELSLKFKNHE-PYLENEQMKSHPLILHGNGPTV 179

Query: 254 -KIELNSFGNYLAKSWKTS-GCTRC--NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEF 309
            K+ LN+ GNYL   W T  GC  C  N+I  L        P V +++F+ KPT FLE+F
Sbjct: 180 VKVGLNNLGNYLPNCWNTRDGCVSCKENVIT-LSDEDTSNHPRVFVALFVSKPTPFLEDF 238

Query: 310 LNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNL 369
           L K+ +L YP  KI++FVYN  ++H    D ++  F+  +K+VK I  +  +    A+ L
Sbjct: 239 LQKVGDLKYPKNKINLFVYNFIKHHERDVDKFVGKFREKYKSVKEIKADDEIAESHAKTL 298

Query: 370 AVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNAD 429
           A+E+      DFYF +DS++HLDNP  LK LV +N +++AP+LVRPFKAWSNFWG +  D
Sbjct: 299 AIEHFKTSKADFYFNLDSEAHLDNPYTLKLLVEQNRTIVAPMLVRPFKAWSNFWGGIAED 358

Query: 430 GFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFC 489
           GFYARSFDYM+++N ++  +G+WNVPYI+ CYL+  +VI+    K  Y   ++D DMAFC
Sbjct: 359 GFYARSFDYMDLVNNEK--RGLWNVPYISGCYLINGTVIRNDETKPSYVEGALDPDMAFC 416

Query: 490 TNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLL 549
            ++R KG+ + + +  ++GHL++ + +D  +T+P+ Y++  N  DW+ RY+H  Y ++L 
Sbjct: 417 HHMREKGVFMYVSNRVDFGHLINPDTYDVTRTHPDFYQIFDNKWDWEQRYLHENYSENLN 476

Query: 550 PDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHM 609
           P+T    PCPDV+WFPI + +FC E ++I E YG+WSDG+N D RL+ GYE VPTRDIHM
Sbjct: 477 PETKPLMPCPDVYWFPIASPRFCQELIEICETYGKWSDGSNKDLRLDGGYENVPTRDIHM 536

Query: 610 KQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY 669
           KQ+GL   W  FL++YV PLQE  FIGY+H P RA M+FVVRY+PDEQPSLRPHHDSSTY
Sbjct: 537 KQIGLEYHWLYFLKEYVRPLQENVFIGYYHNPPRAIMNFVVRYKPDEQPSLRPHHDSSTY 596

Query: 670 TINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
           TIN+ALN   VDYEGGGCRF+RYNC+VT TR+GW+LMHPGRLTHYHEGL VT+GTRYIM+
Sbjct: 597 TINLALNTPKVDYEGGGCRFLRYNCSVTDTRLGWLLMHPGRLTHYHEGLLVTKGTRYIMV 656

Query: 730 SFVDP 734
           SFVDP
Sbjct: 657 SFVDP 661


>gi|91083241|ref|XP_973819.1| PREDICTED: similar to AGAP001507-PA [Tribolium castaneum]
          Length = 751

 Score =  723 bits (1867), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/713 (48%), Positives = 483/713 (67%), Gaps = 8/713 (1%)

Query: 27  KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD--MSSLGGGYKV 84
           K+  +   LV TVAS  TDG++R++ SA    +    LG  Q W GG    +  GGG+K+
Sbjct: 42  KSTTDADILVFTVASEPTDGFQRYLSSAHHYHIAPTVLGFGQEWKGGSDIKNRPGGGWKI 101

Query: 85  NLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSL 144
           NLLK  L+        IIL TD YDVI    ++ IL +F    A ++FGAE  CWPD  L
Sbjct: 102 NLLKTALEPHKDDPTKIILFTDGYDVIFTDTLDAILRKFKETKARVLFGAESSCWPDVQL 161

Query: 145 YDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKI 204
             KYP V  G R+LNSG ++GYA D+ ++++   I++ +DDQL++   +LDE LR K   
Sbjct: 162 APKYPQVTEGKRFLNSGLYMGYAPDLWQVLTFDVIEDTDDDQLFFTKAYLDEDLRKKVGF 221

Query: 205 VLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYL 264
            LD  + +FQNL G+L ++K     +E+  + N  Y+T P+I+HGNG SK+ LN  GNYL
Sbjct: 222 KLDHKSEIFQNLNGALFEVKAKEGPEEY-KIQNVLYHTVPLILHGNGPSKLSLNYLGNYL 280

Query: 265 AKSWKT-SGCTRCNLIK-HLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
           A SW +  GC RC   +  L + + ++   VL+++F++  T FLEE L+K+ +  YP  +
Sbjct: 281 ANSWNSVEGCVRCKEGQFDLKNKRANEMSLVLLAIFVEFNTPFLEEMLSKVYSQEYPKHR 340

Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFY 382
           I +F++N  ++H+    D+I    + +++VK I  +       AR+L++   L K  D Y
Sbjct: 341 IDLFIHNAMKFHSKHITDFIEKHGSEYRSVKDIKPDDGTTEWAARDLSLAQCLSKNCDIY 400

Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
           F VDS +HLDNP  L+ L+ +N +++APLL RP KAWSNFWG L  +GFYARS DYM+I+
Sbjct: 401 FSVDSVAHLDNPHTLRLLIEQNRTVVAPLLPRPGKAWSNFWGDLTKEGFYARSNDYMDIV 460

Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVIKATN-IKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
           + D+  +G+WNVP+I NCY +  +++K  +  K  +  ++ D DMAFC NLR+  + + +
Sbjct: 461 HNDK--RGLWNVPFIANCYAINATLLKKFDETKLNFDRDNWDADMAFCANLRDLDVFMYV 518

Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDV 561
            +  ++GHLV+ E FD  +  PE+Y++  N  DW+ R+IHPEY ++  P+  + QPCPDV
Sbjct: 519 SNRVDFGHLVNPETFDITRVEPEMYQIFDNEQDWEARFIHPEYPENFNPEKTSLQPCPDV 578

Query: 562 FWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEF 621
           +WFPIV+ +FC   + +ME +G+WSDG+N D RLE GYEAVPTRDIHM QVG    W EF
Sbjct: 579 YWFPIVSPRFCTSLINMMENFGKWSDGSNKDPRLEGGYEAVPTRDIHMNQVGWEKHWLEF 638

Query: 622 LRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVD 681
           LRKYV PLQE  F+GY H+P R+ M+FVVRY+PDEQPSLRPHHDSSTYTINIALNQ GVD
Sbjct: 639 LRKYVRPLQEHVFLGYFHDPPRSLMNFVVRYKPDEQPSLRPHHDSSTYTINIALNQRGVD 698

Query: 682 YEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           YEGGGCRFIRYNC+V  T++GW+L+HPGRLTHYHEGL+VT+G RYIMI+FVDP
Sbjct: 699 YEGGGCRFIRYNCSVVDTKLGWLLIHPGRLTHYHEGLKVTKGIRYIMIAFVDP 751


>gi|195441570|ref|XP_002068579.1| GK20548 [Drosophila willistoni]
 gi|194164664|gb|EDW79565.1| GK20548 [Drosophila willistoni]
          Length = 699

 Score =  723 bits (1867), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/701 (50%), Positives = 477/701 (68%), Gaps = 11/701 (1%)

Query: 36  VITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDEMD 95
           V TVAS  TDGY R+I+SA V  ++V TLG+   W GGDM   GGGYK+NLL+  +    
Sbjct: 8   VFTVASEPTDGYMRYIRSARVYDIEVTTLGMGDEWKGGDMQRAGGGYKLNLLREAIAPHK 67

Query: 96  ITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV-GSG 154
              D IIL TDSYDVII   V +I+E+F   +A I+F AE+ CWPD +L D+YP V G  
Sbjct: 68  EAQDKIILFTDSYDVIITANVEEIVEKFKESEAKILFSAEKFCWPDKTLADQYPEVEGKA 127

Query: 155 YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQ 214
            RYLNSG FIGYA  + EL+ +  I + +DDQLY+  +FLDET R K  I LDT + LFQ
Sbjct: 128 SRYLNSGAFIGYAPQVYELLEDTPIDDTDDDQLYFTKIFLDETKRGKLGIELDTQSRLFQ 187

Query: 215 NLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGC 273
           NL+G+  D+KL  DLD     L N  + T P IIHGNG SK+ELN++GNYLAK++ +  C
Sbjct: 188 NLHGAKNDVKLKVDLDSNQGILQNIDFMTTPAIIHGNGLSKVELNAYGNYLAKTF-SGIC 246

Query: 274 TRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEY 333
           T C  +++   L  ++ P + +SV +  P  F ++FL  I  LNYP K I +F+Y+N E 
Sbjct: 247 TFC--LENPLELNENELPIISLSVIVPHPVPFFDQFLKGIETLNYPKKSIHLFIYSNVEL 304

Query: 334 HAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDN 393
           H      +++  K  + + KY+     ++ + AR LA+E +     D+ F VD +SH+D+
Sbjct: 305 HDAAVKSFVNQNKDSYASAKYVLSTDELDERRARQLALEQAKRHHSDYIFNVDGESHIDD 364

Query: 394 PDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWN 453
            +VL+ L+  N+  +APL  +  + WSNFWGAL+  G+YARS DY++I+  D    G++N
Sbjct: 365 AEVLRELLRLNKQFVAPLFAKYHELWSNFWGALSDSGYYARSHDYVDIVKRDL--IGMFN 422

Query: 454 VPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDS 513
           VP++T+ YL+K S     N    +     D DMA   +LRN GI + I + + +GHL+++
Sbjct: 423 VPHVTSIYLIKHSAFDVIN----FNHKEYDPDMALSESLRNAGIFMYISNQRYFGHLINT 478

Query: 514 ENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCH 573
           +NF+     P+ + L  N  DW  +YIHP Y   L   T+  QPCPDVFWF IVT+ FC 
Sbjct: 479 DNFNSTLVRPDFHTLFTNRYDWTEKYIHPNYSLQLNESTIIPQPCPDVFWFQIVTDDFCD 538

Query: 574 EFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQERE 633
           + V IME++G WSDG+N+DKRLE GYEAVPTRDIHMKQVGL  ++ +FL+ +V PLQE+ 
Sbjct: 539 DLVAIMESHGGWSDGSNSDKRLEGGYEAVPTRDIHMKQVGLESLYLKFLQLFVRPLQEKV 598

Query: 634 FIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYN 693
           F+GY+H P R+ M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DYEGGGCRF+RYN
Sbjct: 599 FLGYYHNPPRSLMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNKAGIDYEGGGCRFLRYN 658

Query: 694 CNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           C+VT T+ GWMLMHPGRLTH+HEGL VT GTRYIMISF+DP
Sbjct: 659 CSVTDTKKGWMLMHPGRLTHFHEGLLVTNGTRYIMISFIDP 699


>gi|405960464|gb|EKC26389.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Crassostrea
           gigas]
          Length = 730

 Score =  723 bits (1867), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/735 (47%), Positives = 492/735 (66%), Gaps = 16/735 (2%)

Query: 4   NLHLNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKT 63
           N  ++ LI +C ++        V ++++++  +IT+ ++ TDG +R+++S     L  + 
Sbjct: 8   NFFIDVLIFTCGIY-------SVASLEDNELKLITIGTDVTDGLRRYLRSTNKYDLDAEV 60

Query: 64  LGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILER 122
            G+   W GGD++ S GGG+KVN+LK EL++    +++I++ TDSYDV++  G  DILE+
Sbjct: 61  FGIGMDWKGGDVANSAGGGHKVNILKKELEKYKDQENLILMFTDSYDVVLTAGKQDILEK 120

Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSG-YRYLNSGGFIGYAKDIKELISNRSIKN 181
           F  F+A +VF AE  CWPD SL   YP V S   R+LNSGG++GYAKD+ E+I++RSIK+
Sbjct: 121 FKKFNARVVFSAEGFCWPDPSLAASYPEVKSKEKRFLNSGGYVGYAKDLYEIITHRSIKD 180

Query: 182 EEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYN 241
            +DDQLY+  +FLDETLR K  + LD  + LFQN++G+  D+ + F  D   +  N    
Sbjct: 181 TDDDQLYFTNIFLDETLRKKWNMKLDVKSELFQNMHGAQGDVTIKFKSDH-SYAYNVITG 239

Query: 242 TNPVIIHGNGKSKIELNSFGNYLAKSWKT-SGCTRCNL-IKHLDSLKPDQFPSVLISVFI 299
           T PV++HGNG  K E N F NYLA  W T +GC  C      +  LK D+FP+VL+S+F 
Sbjct: 240 TTPVVVHGNGPIKPEFNRFANYLADGWTTQNGCQACKEETISIRELKDDEFPTVLVSLFF 299

Query: 300 DKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNS 359
           ++PT F E+FL +IANL YP  +I +F++N  E+H      ++  +  M+++   +  + 
Sbjct: 300 EQPTPFAEDFLERIANLKYPKSRIDLFIHNKVEFHNKDIASFLEKYNDMYRSATILMPSD 359

Query: 360 TVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAW 419
            +    ARN AVE    K   + F VD  + + +P+ L  L+ +N +++AP+L RP+K W
Sbjct: 360 GIYEAAARNWAVEVCKQKNDQYLFSVDVYAQITDPETLIDLIEQNRTVLAPILSRPYKLW 419

Query: 420 SNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTL 479
           SNFWGA+N DG+YARS DY++I+  ++   G+WNVPYIT  YL+  S+++   ++ IY+ 
Sbjct: 420 SNFWGAVNKDGWYARSEDYIDIV--EKKKIGLWNVPYITGAYLIHGSLME--ELRDIYSA 475

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRY 539
            +++ DMAFC  LR +GI +   + +  GHLVD +N D    + ++Y++++NP DW L+Y
Sbjct: 476 ENVEPDMAFCGGLRKRGIFMYATNRKILGHLVDYDNMDTSHLHNDLYQIVQNPYDWKLKY 535

Query: 540 IHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGY 599
           IH  Y +SL  +    QPCPDVFWFPIV+ KFC   V+ ME   QWS G + D RL  GY
Sbjct: 536 IHENYSQSLELNRTLVQPCPDVFWFPIVSTKFCDSLVEEMEHLNQWSGGRHEDPRLAGGY 595

Query: 600 EAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPS 659
           E VPT D HM+Q+G+   W  FL+ YV PLQER F GYH +P RA M+FVVRYRP+EQ  
Sbjct: 596 ENVPTVDTHMRQIGMEEHWLHFLKVYVSPLQERAFEGYHSDPPRAIMNFVVRYRPNEQDR 655

Query: 660 LRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQ 719
           LRPHHDSST+TINIALN    D+EGGGCRF+RYNC+VTATR GWMLMHPGRLTH+HEGL 
Sbjct: 656 LRPHHDSSTFTINIALNTPMKDFEGGGCRFLRYNCSVTATRKGWMLMHPGRLTHFHEGLV 715

Query: 720 VTQGTRYIMISFVDP 734
            T+GTRYIMISFVDP
Sbjct: 716 TTKGTRYIMISFVDP 730


>gi|195379566|ref|XP_002048549.1| GJ11296 [Drosophila virilis]
 gi|194155707|gb|EDW70891.1| GJ11296 [Drosophila virilis]
          Length = 741

 Score =  721 bits (1861), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/710 (48%), Positives = 486/710 (68%), Gaps = 13/710 (1%)

Query: 27  KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNL 86
           +N++E K  V TVA+  TDGY+R+++SA V  ++V TLG+ + W GGDM S GGG+K+NL
Sbjct: 43  QNLNE-KIKVFTVATEPTDGYRRYVRSANVYDIEVTTLGMGEEWQGGDMKSAGGGFKINL 101

Query: 87  LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
           L+  ++++   +D IIL TDSYDVI    +++ILE+F    A ++F AE+ CWPD SL D
Sbjct: 102 LRKAIEDLKDEEDTIILFTDSYDVIFTAALDEILEKFKESGAKLLFSAEKYCWPDKSLAD 161

Query: 147 KYPAV-GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIV 205
           +YP V G   R+LNSG FIGYA  +  L+   +I+N  DDQLY+  +FLDE  R K  + 
Sbjct: 162 QYPEVEGKASRFLNSGAFIGYAPQVYALLE-EAIENTGDDQLYFTKVFLDEAKRAKLGMK 220

Query: 206 LDTLANLFQNLYGSLEDIKLNFDLDEFV-HLTNTKYNTNPVIIHGNGKSKIELNSFGNYL 264
           LDT + LFQNL+G+  D+KL  DLD     L N  + T P+IIHGNG SK++LN++GNYL
Sbjct: 221 LDTQSRLFQNLHGAKNDVKLKVDLDSNQGTLQNIDFMTTPLIIHGNGLSKVDLNAYGNYL 280

Query: 265 AKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKIS 324
           AK++ +  CT C  +++   L     P + ++V + +   F + FL  I  LNYP + + 
Sbjct: 281 AKTF-SGVCTFC--LEYPLELDEQNLPIITLAVMVPQAVPFFDMFLASIEKLNYPKESLH 337

Query: 325 MFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFY 384
           +F+Y+N   H    + Y +N    + + K++     ++ ++ R LA++ +  +  D+ F+
Sbjct: 338 LFMYSNVALHDDAVESYANNQGKNYASAKFVLSVDELDERQGRQLALDKAKLQHSDYIFF 397

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           VD+D+H+D+ +VL+ L+  N+  +AP+  +  + WSNFWGAL+ +G+YARS DY++I+  
Sbjct: 398 VDADAHIDDSEVLRELLRMNKQFVAPVFSKYHELWSNFWGALSENGYYARSHDYVDIVKR 457

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
           D    G++NVP++T  YL+K S   A   +     N  D DMA C +LRN GI + + + 
Sbjct: 458 DL--IGMFNVPHVTTIYLIKHSAFDAIKFEH----NDFDPDMAMCESLRNAGIFMYVSNQ 511

Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
           + +GHL++++NF+     P+ Y L  N  DW L+YIH  Y   L    V  QPCPDVFWF
Sbjct: 512 RYHGHLINADNFNTTVVRPDFYTLFSNQYDWTLKYIHQNYSTQLNESMVIPQPCPDVFWF 571

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
            IV++ FC + V IMEAYG+WSDG+NND RLE GYEAVPTRDIHM+QVGL  ++ +FL+ 
Sbjct: 572 QIVSDAFCDDLVAIMEAYGKWSDGSNNDNRLEGGYEAVPTRDIHMRQVGLDTLYLKFLQI 631

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           +V PLQER F GY+H P R+ M+F+VRYRPDEQP LRPHHDSSTYTINIA+N VG+DYEG
Sbjct: 632 FVRPLQERVFTGYYHNPPRSLMNFMVRYRPDEQPFLRPHHDSSTYTINIAMNSVGIDYEG 691

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGC F+RYNC+VT T+ GWMLMHPGRLTH+HEGL VT+GTRYIMISF+DP
Sbjct: 692 GGCHFLRYNCSVTETKKGWMLMHPGRLTHFHEGLLVTKGTRYIMISFIDP 741


>gi|270006955|gb|EFA03403.1| hypothetical protein TcasGA2_TC013390 [Tribolium castaneum]
          Length = 756

 Score =  721 bits (1860), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/718 (48%), Positives = 485/718 (67%), Gaps = 13/718 (1%)

Query: 27  KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD--MSSLGGGYKV 84
           K+  +   LV TVAS  TDG++R++ SA    +    LG  Q W GG    +  GGG+K+
Sbjct: 42  KSTTDADILVFTVASEPTDGFQRYLSSAHHYHIAPTVLGFGQEWKGGSDIKNRPGGGWKI 101

Query: 85  NLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSL 144
           NLLK  L+        IIL TD YDVI    ++ IL +F    A ++FGAE  CWPD  L
Sbjct: 102 NLLKTALEPHKDDPTKIILFTDGYDVIFTDTLDAILRKFKETKARVLFGAESSCWPDVQL 161

Query: 145 YDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKI 204
             KYP V  G R+LNSG ++GYA D+ ++++   I++ +DDQL++   +LDE LR K   
Sbjct: 162 APKYPQVTEGKRFLNSGLYMGYAPDLWQVLTFDVIEDTDDDQLFFTKAYLDEDLRKKVGF 221

Query: 205 VLDTLANLFQNLYGSLEDIKLNFDLD-----EFVHLTNTKYNTNPVIIHGNGKSKIELNS 259
            LD  + +FQNL G++ +++L F++      E   + N  Y+T P+I+HGNG SK+ LN 
Sbjct: 222 KLDHKSEIFQNLNGAVSEVEL-FEVKAKEGPEEYKIQNVLYHTVPLILHGNGPSKLSLNY 280

Query: 260 FGNYLAKSWKT-SGCTRCNLIK-HLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLN 317
            GNYLA SW +  GC RC   +  L + + ++   VL+++F++  T FLEE L+K+ +  
Sbjct: 281 LGNYLANSWNSVEGCVRCKEGQFDLKNKRANEMSLVLLAIFVEFNTPFLEEMLSKVYSQE 340

Query: 318 YPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK 377
           YP  +I +F++N  ++H+    D+I    + +++VK I  +       AR+L++   L K
Sbjct: 341 YPKHRIDLFIHNAMKFHSKHITDFIEKHGSEYRSVKDIKPDDGTTEWAARDLSLAQCLSK 400

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
             D YF VDS +HLDNP  L+ L+ +N +++APLL RP KAWSNFWG L  +GFYARS D
Sbjct: 401 NCDIYFSVDSVAHLDNPHTLRLLIEQNRTVVAPLLPRPGKAWSNFWGDLTKEGFYARSND 460

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATN-IKTIYTLNSMDYDMAFCTNLRNKG 496
           YM+I++ D+  +G+WNVP+I NCY +  +++K  +  K  +  ++ D DMAFC NLR+  
Sbjct: 461 YMDIVHNDK--RGLWNVPFIANCYAINATLLKKFDETKLNFDRDNWDADMAFCANLRDLD 518

Query: 497 IHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQ 556
           + + + +  ++GHLV+ E FD  +  PE+Y++  N  DW+ R+IHPEY ++  P+  + Q
Sbjct: 519 VFMYVSNRVDFGHLVNPETFDITRVEPEMYQIFDNEQDWEARFIHPEYPENFNPEKTSLQ 578

Query: 557 PCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAG 616
           PCPDV+WFPIV+ +FC   + +ME +G+WSDG+N D RLE GYEAVPTRDIHM QVG   
Sbjct: 579 PCPDVYWFPIVSPRFCTSLINMMENFGKWSDGSNKDPRLEGGYEAVPTRDIHMNQVGWEK 638

Query: 617 VWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALN 676
            W EFLRKYV PLQE  F+GY H+P R+ M+FVVRY+PDEQPSLRPHHDSSTYTINIALN
Sbjct: 639 HWLEFLRKYVRPLQEHVFLGYFHDPPRSLMNFVVRYKPDEQPSLRPHHDSSTYTINIALN 698

Query: 677 QVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           Q GVDYEGGGCRFIRYNC+V  T++GW+L+HPGRLTHYHEGL+VT+G RYIMI+FVDP
Sbjct: 699 QRGVDYEGGGCRFIRYNCSVVDTKLGWLLIHPGRLTHYHEGLKVTKGIRYIMIAFVDP 756


>gi|195128689|ref|XP_002008794.1| GI11618 [Drosophila mojavensis]
 gi|193920403|gb|EDW19270.1| GI11618 [Drosophila mojavensis]
          Length = 744

 Score =  720 bits (1859), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/705 (50%), Positives = 484/705 (68%), Gaps = 12/705 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           DK  V TVA+ +TDGY+R+I+SA+V  ++V TLGL + W GGDM  LGGGYK+NLL+  +
Sbjct: 50  DKIKVFTVATEQTDGYRRYIRSAQVYDIEVTTLGLGEEWQGGDMKGLGGGYKINLLRKAV 109

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
           +E+   +D IIL TDSYDV+    + +ILE+F    A ++F AE+ CWPD SL D YPAV
Sbjct: 110 EELKDAEDTIILFTDSYDVVFTAPLTEILEKFKESGAKVLFSAEKYCWPDKSLADSYPAV 169

Query: 152 GSG-YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           G+   RYLNSG FIGYA  + EL+    I++  DDQLYY  +FLDE  R K  I LDT +
Sbjct: 170 GAKESRYLNSGAFIGYAPQVVELL-KEEIEDTGDDQLYYTKIFLDEAKRAKLNIKLDTQS 228

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
            LFQ+L G+  D+KL  DLD     L N  + T P IIHGNG SKI LN++ NYLAK++ 
Sbjct: 229 RLFQSLNGAQNDVKLEVDLDSNQGVLQNIDFLTTPAIIHGNGPSKINLNAYANYLAKTF- 287

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           +  CT C   ++   L   + P + ++V +++P  FL+ FL  I  LNYP K + +F+Y+
Sbjct: 288 SGVCTFCQ--EYPLELNEQELPIITLAVMVNQPVPFLDMFLAGIEKLNYPKKSMHLFMYS 345

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           N E H  L   Y+      + +VKYI     +   + R LA++ +  K  D+ FYVD D+
Sbjct: 346 NAELHDELVQSYVTKHGKSYASVKYILSTDGLTESQGRQLALDKAKQKHSDYIFYVDGDA 405

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
           H+++ +VL+ L+  N+  +AP+  +  + WSNFWGAL+  G+YARS DY++I+   +   
Sbjct: 406 HIEDSEVLRELLRMNKQFVAPVFSKYHELWSNFWGALSETGYYARSHDYVDIVK--RNLI 463

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           G++NVP++T  YL+K S   A   +       +D DMA   +LR+ G+ + + + + +GH
Sbjct: 464 GMFNVPHVTTIYLIKKSAFDAVKFEH----KELDPDMAMSDSLRDAGVFMYVSNERYFGH 519

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           L++++NF+     P+ Y L  N  DW L+YIHP Y   L    V  QPCPDV+WF IVT+
Sbjct: 520 LINADNFNTTVARPDFYTLFSNRYDWTLKYIHPNYSTQLNESVVIPQPCPDVYWFQIVTD 579

Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
            FC + V IMEAYG+WSDG+N+D RLE GYEAVPTRDIHM+QVGL  ++ +FL+ +V PL
Sbjct: 580 AFCDDLVAIMEAYGKWSDGSNSDTRLEGGYEAVPTRDIHMRQVGLDALYLKFLQMFVRPL 639

Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
           QER F+GY H+P R+ M+F+VRY+PDEQPSLRPHHDSSTYTINIA+N+VG+DYEGGGCRF
Sbjct: 640 QERVFMGYFHDPPRSLMNFMVRYKPDEQPSLRPHHDSSTYTINIAMNRVGIDYEGGGCRF 699

Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           +RYNC+VT T+ GWMLMHPGRLTH+HEGL VT+GTRYIMISF+DP
Sbjct: 700 LRYNCSVTETKKGWMLMHPGRLTHFHEGLLVTKGTRYIMISFIDP 744


>gi|391344649|ref|XP_003746608.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           [Metaseiulus occidentalis]
          Length = 756

 Score =  719 bits (1856), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/710 (49%), Positives = 480/710 (67%), Gaps = 14/710 (1%)

Query: 28  NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNL 86
           ++D+ + LVITVA++ T+GY+RF+ SAE   L V+TLGL + W GGD+  + GGG+KVNL
Sbjct: 58  SLDKFELLVITVATDRTEGYERFLASAEREDLTVETLGLDEEWRGGDVVHTTGGGHKVNL 117

Query: 87  LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
           L+  LD+     D++I+  DSYDVI  G   DILE+F   DA+ VF AE  CWPD SL +
Sbjct: 118 LRKALDKYKDRSDLLIMFVDSYDVIFTGNKQDILEKFFALDADAVFSAEGFCWPDASLEN 177

Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
           KYP    G +YLNSGGF+G+A  I ++ ++ +I++E+DDQL+Y  ++LD +LR   +I L
Sbjct: 178 KYPE-SDGKKYLNSGGFVGFAPAIHKIATHVAIQDEDDDQLFYTKIYLDPSLRESLRIRL 236

Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
           D  + +FQNL G++ D+ ++   D +  + NT Y T P++IHGNG SK+ LNS  NYLA 
Sbjct: 237 DNKSTIFQNLNGAVGDVSIS--EDAYPKVKNTAYGTEPIVIHGNGPSKVALNSLANYLAG 294

Query: 267 SWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           +WK   GC  C   +   +L  D  P V + +FI++ T F +EFL+    L+YP +KIS+
Sbjct: 295 AWKNGEGCLVC---EDRITLGTDTMPQVTVGIFIEEATPFFDEFLDHFIELDYPKEKISL 351

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
           F++   +YH      ++ N    +  ++  + +  +  K AR  A+E  L    DFY  +
Sbjct: 352 FIHRGVDYHNERLRQFVENGAASYAKLEMTSTDDLLEWK-ARERALEVCLLDACDFYLNL 410

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           DS  HL N  VL++L+ ++ + IAPL++R  +AWSNFWGAL ++GFYARS DYM I+ G+
Sbjct: 411 DSRVHLTNRKVLQHLIAKDRNFIAPLVMRTGQAWSNFWGALTSEGFYARSHDYMEIVKGE 470

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
           +  KGIWNVPYI   YL+K SV     +   Y   ++D DMA C NLR +GI + +D+ +
Sbjct: 471 K--KGIWNVPYIGEVYLIKASVFSKKPLS--YVNGALDPDMALCKNLRERGIFMYVDNME 526

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDT-VNNQPCPDVFWF 564
           ++G L++SE+FD  K +P+ YE+  N   W LRYIH EY+      T V  QPC DVFWF
Sbjct: 527 DFGFLINSEHFDTSKKHPDFYEIYNNQFAWALRYIHKEYKDIFSNHTGVLRQPCHDVFWF 586

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           P+ +  FC   ++IME +G WSDGTN+D RL  GYE VPTRDIHMKQVGL   W  FLR+
Sbjct: 587 PLASPTFCTHLIEIMENHGGWSDGTNSDPRLAGGYENVPTRDIHMKQVGLEPQWLFFLRE 646

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           YV P+QE  F GY+H+P +A M+FVVRYRPDEQPSL+PHHD+STYT+N+ALNQ G D+ G
Sbjct: 647 YVRPVQEHVFTGYYHDPPKAIMNFVVRYRPDEQPSLKPHHDASTYTLNLALNQAGKDFTG 706

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GG  FIR NC+VT++  GW L+HPGRLTHYHEGL  T GTRYIM+SFVDP
Sbjct: 707 GGSHFIRQNCSVTSSPSGWGLLHPGRLTHYHEGLTTTSGTRYIMVSFVDP 756


>gi|195166461|ref|XP_002024053.1| GL22837 [Drosophila persimilis]
 gi|194107408|gb|EDW29451.1| GL22837 [Drosophila persimilis]
          Length = 1367

 Score =  718 bits (1854), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/705 (48%), Positives = 471/705 (66%), Gaps = 12/705 (1%)

Query: 32   DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
            DK  V TVA+  TDGY R+I+SA +  ++V TLGL + W GGDM   GGG+KVNLL+  +
Sbjct: 673  DKVEVFTVATEPTDGYARYIRSARIYDVKVTTLGLGEHWKGGDMQHPGGGFKVNLLRKAV 732

Query: 92   DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
              +    D I+L TDSYDVII   + +I+E F    A ++F AE+ CWPD+SL D YP V
Sbjct: 733  APLKDEQDTIVLFTDSYDVIITAKLEEIVELFKESKAKLLFSAEKFCWPDSSLTDAYPEV 792

Query: 152  -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
             G+  R+LNSG FIGYA  +  L+   +I + +DDQLYY  +FLDE  R K  + LDT +
Sbjct: 793  EGNASRFLNSGAFIGYAPQVNALL-EEAIDDMDDDQLYYTKVFLDEARRAKLGMKLDTQS 851

Query: 211  NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
             LFQNL+G+  D+KL  D++     L N  + T P I+HGNG SK++LN++GNYLAK++ 
Sbjct: 852  RLFQNLHGAKNDVKLKVDIESNQGILQNVNFLTTPAIVHGNGLSKVDLNAYGNYLAKTF- 910

Query: 270  TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
               CT C   ++L  L     P + +SV +     F ++FL  I  +NYP + + + +Y+
Sbjct: 911  NGICTVCQ--EYLLELDEQHLPVISLSVIVPMAVPFFDQFLEGIEKINYPKQNLHLLIYS 968

Query: 330  NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
            N E H      +++     + + KY      ++ ++ R LA + +  +  D+ F++D D+
Sbjct: 969  NVELHDADIKSFVNKHGEKYASAKYTLSTDNLDERQGRQLAFDQAKLRKSDYIFFIDGDA 1028

Query: 390  HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            H+D+ +VL+ L+  N+  +APL  +  + WSNFWGAL+  GFYARS DY++I+  D    
Sbjct: 1029 HIDDGEVLRELLKLNKQFVAPLFAKYHELWSNFWGALSEGGFYARSHDYVDIVKRDL--I 1086

Query: 450  GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
            GI+NVP++T+ YL+++S     + +     +  D DMA C +LR  G+ + I + + +GH
Sbjct: 1087 GIFNVPHVTSIYLVRSSAFDVLSFQH----SEYDADMAMCESLRKAGVFMFISNQRYFGH 1142

Query: 510  LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
            LV+++NFD +   P+ Y L  N  DW  +YIHP Y + L   TV  QPCPDV+W  IVT+
Sbjct: 1143 LVNADNFDTKVARPDFYTLFSNRYDWTEKYIHPNYSEQLNASTVIEQPCPDVYWMAIVTD 1202

Query: 570  KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
             FC + V IME +G WSDG+NND RLE GYEAVPTRDIHMKQVGL  ++ +FL  +V PL
Sbjct: 1203 AFCDDLVAIMENHGTWSDGSNNDNRLEGGYEAVPTRDIHMKQVGLEVLYLKFLELFVRPL 1262

Query: 630  QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
            QER F GY+H P RA M+F+VRYRPDEQPSLRPHHD+STYTINIA+NQV  DYEGGGCRF
Sbjct: 1263 QERVFTGYYHNPPRALMNFMVRYRPDEQPSLRPHHDASTYTINIAMNQVDTDYEGGGCRF 1322

Query: 690  IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
            +RYNC+VT T+ GWMLMHPGRLTHYHEGL VT+GTRYIMISF+DP
Sbjct: 1323 LRYNCSVTETKKGWMLMHPGRLTHYHEGLLVTKGTRYIMISFIDP 1367


>gi|195018422|ref|XP_001984779.1| GH16659 [Drosophila grimshawi]
 gi|193898261|gb|EDV97127.1| GH16659 [Drosophila grimshawi]
          Length = 696

 Score =  716 bits (1848), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/705 (49%), Positives = 481/705 (68%), Gaps = 12/705 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           DK  V TVAS  TDGY+R+I+SA+V  ++V TLG+ + W GGDM S GGG+K+NLL+  +
Sbjct: 2   DKIKVFTVASEPTDGYRRYIRSAKVYDIEVTTLGMGEEWKGGDMKSAGGGFKINLLRKAI 61

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
           + +   +D IIL TDSYDVII   + +IL++F   DA ++F AE+ CWPD SL ++YP V
Sbjct: 62  EPLKDAEDTIILFTDSYDVIITSTLEEILQKFKESDAKLLFSAEKYCWPDKSLANQYPEV 121

Query: 152 GSG-YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           G    RYLNSG FIGYA  +  L+    I++  DDQLYY  +FLDET R K  + LDT +
Sbjct: 122 GGKESRYLNSGAFIGYAPQVNALLEEL-IEDTGDDQLYYTKVFLDETKRAKLGMKLDTQS 180

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
            LFQNL+G+ +D+KL  DLD     L N  + T P IIHGNG SK++LN++GNYLAK++ 
Sbjct: 181 KLFQNLHGAKDDVKLRVDLDSNQGILENVNFLTKPNIIHGNGLSKVDLNAYGNYLAKTF- 239

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           +  CT C  +++L  L     P + ++V + +P  F + FL  I  LNYP K + +F+Y+
Sbjct: 240 SGICTVC--MEYLLDLDEQNLPIITLAVMVPQPVPFFDLFLAGIEKLNYPKKNLHLFIYS 297

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
               H  L   Y++     + +VK++     ++ ++ R LA++ +  +  D+ FYVD D+
Sbjct: 298 GAALHDDLITSYVNKQGKSYASVKFVLSTDQLDERQGRQLALDKAKLQRSDYIFYVDGDA 357

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
           H+D+ ++L+ L+  N+   AP+  +  + WSNFWGAL+ +G+YARS DY++I+  D    
Sbjct: 358 HIDDRELLRALLRLNKQFAAPVFSKYHELWSNFWGALSENGYYARSHDYVDIVKRDL--I 415

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           GI+NVP++T  YL+K S   A      +  N  D DMA   +LR+ GI + + + +  GH
Sbjct: 416 GIFNVPHVTTIYLIKRSAFDAIK----FDHNEFDPDMALSKSLRDAGIFMYVSNQRYLGH 471

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           LV++ENF+     P+ + L  N  DW ++YI P Y   L    V  QPCPDV+W  IVT+
Sbjct: 472 LVNAENFNSTVVRPDFHTLFSNRYDWTIKYIQPNYSAQLNESMVIPQPCPDVYWLHIVTD 531

Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
            FC + V IMEA+G+WS+G N DKRLE GYEAVPTRDIHM+QVGL  V+ +FL+ +V PL
Sbjct: 532 AFCDDLVAIMEAFGKWSEGKNQDKRLEGGYEAVPTRDIHMRQVGLDQVYLKFLQMFVRPL 591

Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
           QER F GY+H P R+ M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N VG+DYEGGGCRF
Sbjct: 592 QERIFTGYYHNPPRSLMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNNVGIDYEGGGCRF 651

Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           +RYNC+VT T+ GWMLMHPGRLTH+HEGL VT+GTRYIMISF+DP
Sbjct: 652 LRYNCSVTETKKGWMLMHPGRLTHFHEGLLVTRGTRYIMISFIDP 696


>gi|198466220|ref|XP_001353930.2| GA19434, partial [Drosophila pseudoobscura pseudoobscura]
 gi|198150500|gb|EAL29666.2| GA19434, partial [Drosophila pseudoobscura pseudoobscura]
          Length = 698

 Score =  716 bits (1848), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/705 (49%), Positives = 472/705 (66%), Gaps = 12/705 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           DK  V TVA+  TDGY R+I+SA +  ++V TLGL + W GGDM   GGG+KVNLL+  +
Sbjct: 4   DKVEVFTVATEPTDGYARYIRSARIYDVKVTTLGLGEHWKGGDMQHPGGGFKVNLLRKAV 63

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
             +    D I+L TDSYDVII   + +I+E F    A ++F AE+ CWPD+SL D YP V
Sbjct: 64  APLKDEQDTIVLFTDSYDVIITAKLEEIVELFKESKAKLLFSAEKFCWPDSSLTDAYPEV 123

Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
            G+  R+LNSG FIGYA  +  L+   +I + +DDQLYY  +FLDE  R K  + LDT +
Sbjct: 124 EGNASRFLNSGAFIGYAPQVNALLE-EAIDDMDDDQLYYTKVFLDEARRAKLGMKLDTQS 182

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
            LFQNL+G+  D+KL  D++     L N  + T P I+HGNG SK++LN++GNYLAK++ 
Sbjct: 183 RLFQNLHGAKNDVKLKVDIESNQGILQNVNFLTTPAIVHGNGLSKVDLNAYGNYLAKTF- 241

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
              CT C   ++L  L     P + +SV +     F ++FL  I  +NYP + + + +Y+
Sbjct: 242 NGICTVCQ--EYLLELDEQHLPVISLSVIVPMAVPFFDQFLEGIEKINYPKQNLHLLIYS 299

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           N E H      +++     + + KY      ++ ++ R LA + +  +  D+ F++D D+
Sbjct: 300 NVELHDADIKSFVNKHGEKYASAKYTLSTDNLDERQGRQLAFDQAKLRKSDYIFFIDGDA 359

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
           H+D+ +VL+ L+  N+  +APL  +  + WSNFWGAL+  GFYARS DY++I+  D    
Sbjct: 360 HIDDGEVLRELLKLNKQFVAPLFAKYHELWSNFWGALSEGGFYARSHDYVDIVKRDL--I 417

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           GI+NVP++T+ YL+++SV    + +     +  D DMA C +LR  G+ + I + + +GH
Sbjct: 418 GIFNVPHVTSIYLVRSSVFDVLSFQH----SEYDADMAMCESLRKAGVFMFISNQRYFGH 473

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           LV+++NFD +   P+ Y L  N  DW  +YIHP Y + L   TV  QPCPDV+W  IVT+
Sbjct: 474 LVNADNFDTKVARPDFYTLFSNRYDWTEKYIHPNYSEQLNASTVIEQPCPDVYWMAIVTD 533

Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
            FC + V IME +G WSDG+NND RLE GYEAVPTRDIHMKQVGL  ++ +FL  +V PL
Sbjct: 534 AFCDDLVAIMENHGTWSDGSNNDNRLEGGYEAVPTRDIHMKQVGLEVLYLKFLELFVRPL 593

Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
           QER F GY+H P RA M+F+VRYRPDEQPSLRPHHD+STYTINIA+NQV  DYEGGGCRF
Sbjct: 594 QERVFTGYYHNPPRALMNFMVRYRPDEQPSLRPHHDASTYTINIAMNQVDTDYEGGGCRF 653

Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           +RYNC+VT T+ GWMLMHPGRLTHYHEGL VT+GTRYIMISF+DP
Sbjct: 654 LRYNCSVTETKKGWMLMHPGRLTHYHEGLLVTKGTRYIMISFIDP 698


>gi|128485638|ref|NP_835202.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Rattus
           norvegicus]
 gi|81883555|sp|Q5U367.1|PLOD3_RAT RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3;
           AltName: Full=Lysyl hydroxylase 3; Short=LH3; Flags:
           Precursor
 gi|55250563|gb|AAH85683.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Rattus
           norvegicus]
 gi|149062975|gb|EDM13298.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Rattus
           norvegicus]
          Length = 741

 Score =  712 bits (1837), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/711 (47%), Positives = 484/711 (68%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ DK LVITVA+ ET+GY+RF+QSAE     V+TLGL Q W GGD++ ++GGG KV  L
Sbjct: 37  VNPDKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGQEWRGGDVARTVGGGQKVRWL 96

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+     ++L++F    ++++F AE  CWPD  L ++
Sbjct: 97  KKEMEKYASQEDMIIMFVDSYDVILASSPTELLKKFVQSGSHLLFSAESFCWPDWGLAEQ 156

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG G R+LNSGGFIG+A  I  ++     K+++DDQL+Y  L+LD  LR K K+ LD
Sbjct: 157 YPEVGVGKRFLNSGGFIGFAPTIHRIVRQWKYKDDDDDQLFYTQLYLDPGLREKLKLSLD 216

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 217 HKSRIFQNLNGALDEVVLKFDQNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 275

Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CNL +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  +IS+
Sbjct: 276 WTPQGGCGFCNLNRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRISL 333

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFY 384
           F++NN+ YH P   D     +  F  VK +     ++S EAR++A+++       +FYF 
Sbjct: 334 FLHNNEVYHEPHIADAWPQLQDHFSAVKLVGPEEALSSGEARDMAMDSCRQNPECEFYFS 393

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L NP+ L+ L+ +N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 394 LDADAVLTNPETLRILIEQNRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 453

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     K +++ +  D DMAFC ++R+KGI L + +
Sbjct: 454 KR--VGLWNVPYISQAYVIRGETLRTELPEKEVFSSSDTDPDMAFCRSVRDKGIFLHLSN 511

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ + ++D    +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 512 QHEFGRLLSTSHYDTDHLHPDLWQIFDNPVDWREQYIHENYSRALDGEGLVEQPCPDVYW 571

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP++TE+ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 572 FPLLTEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 631

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  GVDYE
Sbjct: 632 TYVGPMTEHLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGVDYE 690

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 691 GGGCRFLRYDCRVSSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741


>gi|28400779|emb|CAD23628.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Rattus
           norvegicus]
          Length = 741

 Score =  711 bits (1836), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/711 (47%), Positives = 484/711 (68%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ DK LVITVA+ ET+GY+RF+QSAE     V+TLGL Q W GGD++ ++GGG KV  L
Sbjct: 37  VNPDKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGQEWRGGDVARTVGGGQKVRWL 96

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+     ++L++F    ++++F AE  CWPD  L ++
Sbjct: 97  KKEMEKYASQEDMIIMFVDSYDVILASSPTELLKKFVQSGSHLLFSAESFCWPDWGLAEQ 156

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG G R+LNSGGFIG+A  I  ++     K+++DDQL+Y  L+LD  LR K K+ LD
Sbjct: 157 YPEVGVGKRFLNSGGFIGFAPTIHRIVRQWKYKDDDDDQLFYTQLYLDPGLREKLKLSLD 216

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 217 HKSRIFQNLNGALDEVVLKFDQNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 275

Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CNL +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  +IS+
Sbjct: 276 WTPQGGCGFCNLNRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRISL 333

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFY 384
           F++NN+ YH P   D     +  F  VK +     ++S EAR++A+++       +FYF 
Sbjct: 334 FLHNNEVYHEPHIADAWPQLQDHFSAVKLVGPEEALSSGEARDMAMDSCRQNPECEFYFS 393

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L NP+ L+ L+ +N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 394 LDADAVLTNPETLRILIEQNRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 453

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     K +++ +  D DMAFC ++R+KGI L + +
Sbjct: 454 KR--VGLWNVPYISQAYVIRGETLRTELPEKEVFSSSDTDPDMAFCRSVRDKGIFLHLSN 511

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ + ++D    +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 512 QHEFGRLLSTSHYDTDHLHPDLWQIFDNPVDWREQYIHENYSRALDGEGLVEQPCPDVYW 571

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP++TE+ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 572 FPLLTEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 631

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  GVDYE
Sbjct: 632 TYVGPMTEHLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGVDYE 690

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 691 GGGCRFLRYDCRVSSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741


>gi|195326740|ref|XP_002030083.1| GM24765 [Drosophila sechellia]
 gi|194119026|gb|EDW41069.1| GM24765 [Drosophila sechellia]
          Length = 721

 Score =  705 bits (1820), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/705 (48%), Positives = 473/705 (67%), Gaps = 12/705 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           DK  V TVA+  TDGY R+I+SA V  ++V TLGL + W GGDM   GGG+K+NLL+  +
Sbjct: 27  DKIKVFTVATEPTDGYTRYIRSARVYDIEVTTLGLGEEWKGGDMQKPGGGFKLNLLREAI 86

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
                  D IIL TDSYDVII   +++I E+F    A I+F AE+ CWPD SL + YP V
Sbjct: 87  APYKNEPDTIILFTDSYDVIITTTLDEIFEKFKEAGARILFSAEKYCWPDKSLANDYPEV 146

Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
            G   R+LNSG FIGYA  +  L+ +  I++  DDQLY+  +FLDET RTK  + LD  +
Sbjct: 147 EGKASRFLNSGAFIGYAPQVFALLED-PIEDTADDQLYFTKIFLDETKRTKLGLKLDVQS 205

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
            LFQNL+G+  D+KL  DL+     L N  + T P IIHGNG SK++LN++GNYLA+++ 
Sbjct: 206 RLFQNLHGAKNDVKLKVDLESNQGVLQNVDFMTTPAIIHGNGLSKVDLNAYGNYLARTF- 264

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
              C  C   ++L  L+    P + +++ + +P  F ++FL  I +LNYP KK+ + +Y+
Sbjct: 265 NGVCLLCQ--ENLLDLEETNLPVISLALMVTQPVPFFDQFLEGIESLNYPKKKLHLLIYS 322

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           N  +H      +++ +   +   K+      ++ ++ R LA++ +     D+ F+VD+D+
Sbjct: 323 NIAFHDDDIKSFVNKYGKEYATAKFALSTDELDERQGRQLALDKARLHQSDYIFFVDADA 382

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
           H+D+ +VL+ L+  N+  +AP+  +  + WSNFWGAL+  G+YARS DY++I+  +    
Sbjct: 383 HIDDSEVLRELLRLNKQFVAPIFSKHKELWSNFWGALSEGGYYARSHDYVDIVKREL--I 440

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           G++NVP++T+ YL+K +   A + K        D DMA C +LRN GI +   + + +GH
Sbjct: 441 GMFNVPHVTSIYLVKKTAFDAISFKH----KEFDPDMAMCESLRNAGIFMYASNLRIFGH 496

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           LV++++F+   T P+ Y L  N +DW  +YIHP Y   L       QPCPDV+WF IV++
Sbjct: 497 LVNADDFNTTVTRPDFYTLFSNEVDWTEKYIHPNYSLQLNESNKIQQPCPDVYWFQIVSD 556

Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
            FC + V IMEA+  WSDG+N+D RLE GYEAVPTRDIHMKQVGL  ++ +FL+ +V PL
Sbjct: 557 AFCDDLVAIMEAHNGWSDGSNSDNRLEGGYEAVPTRDIHMKQVGLERLYLKFLQLFVRPL 616

Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
           QER F GY H P RA M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DY+GGGCRF
Sbjct: 617 QERAFTGYFHNPPRALMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRF 676

Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           IRYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISF+DP
Sbjct: 677 IRYNCSVTDTKKGWMLMHPGRLTHYHEGLLVTNGTRYIMISFIDP 721


>gi|195589463|ref|XP_002084471.1| GD12814 [Drosophila simulans]
 gi|194196480|gb|EDX10056.1| GD12814 [Drosophila simulans]
          Length = 721

 Score =  705 bits (1820), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/705 (48%), Positives = 474/705 (67%), Gaps = 12/705 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           DK  V TVA+  TDGY R+I+SA V  ++V TLGL + W GGDM   GGG+K+NLL+  +
Sbjct: 27  DKIKVFTVATEPTDGYTRYIRSARVYDIEVTTLGLGEEWKGGDMQKPGGGFKLNLLREAI 86

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
                  + IIL TDSYDVII   +++I E+F    A I+F AE+ CWPD SL + YP V
Sbjct: 87  APYKNEPETIILFTDSYDVIITTTLDEIFEKFKEAGAKILFSAEKYCWPDKSLANDYPEV 146

Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
            G   R+LNSG FIGYA  +  L+ +  I++  DDQLY+  +FLDET RTK  + LD  +
Sbjct: 147 EGKASRFLNSGAFIGYAPQVFALLED-PIEDTADDQLYFTKIFLDETKRTKLGLKLDVQS 205

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
            LFQNL+G+  D+KL  DL+     L N  + T P IIHGNG SK++LN++GNYLA+++ 
Sbjct: 206 RLFQNLHGAKNDVKLKVDLESNQGVLQNVDFMTTPAIIHGNGLSKVDLNAYGNYLARTF- 264

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           +  C  C   ++L  L+    P + +++ + +P  F ++FL  I +LNYP KK+ + +Y+
Sbjct: 265 SGVCLLCQ--ENLLDLEETNLPVISLALMVTQPVPFFDQFLEGIESLNYPKKKLHLLIYS 322

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           N  +H      +++ +   +   K+      ++ ++ R LA++ +     D+ F+VD+D+
Sbjct: 323 NVAFHDDDIKSFVNKYDKEYATAKFALSTDELDERQGRQLALDKARLHQSDYIFFVDADA 382

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
           H+D+ +VL+ L+  N+  +AP+  +  + WSNFWGAL+  G+YARS DY++I+  +    
Sbjct: 383 HIDDSEVLRELLRLNKQFVAPIFSKHKELWSNFWGALSEGGYYARSHDYVDIVKREL--I 440

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           G++NVP++T+ YL+K +   A + K        D DMA C +LRN GI +   + + +GH
Sbjct: 441 GMFNVPHVTSIYLVKKTAFDAISFKH----KEFDPDMAMCESLRNAGIFMYASNLRIFGH 496

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           LV++++F+   T P+ Y L  N +DW  +YIHP Y   L       QPCPDV+WF IV++
Sbjct: 497 LVNADDFNTTVTRPDFYTLFSNEIDWTEKYIHPNYSLQLNESNKIQQPCPDVYWFQIVSD 556

Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
            FC + V IMEA+  WSDG+N+D RLE GYEAVPTRDIHMKQVGL  ++ +FL+ +V PL
Sbjct: 557 AFCDDLVAIMEAHNGWSDGSNSDNRLEGGYEAVPTRDIHMKQVGLERLYLKFLQLFVRPL 616

Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
           QER F GY H P RA M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DY+GGGCRF
Sbjct: 617 QERAFTGYFHNPPRALMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRF 676

Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           IRYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISF+DP
Sbjct: 677 IRYNCSVTDTKKGWMLMHPGRLTHYHEGLLVTNGTRYIMISFIDP 721


>gi|442759307|gb|JAA71812.1| Putative procollagen-lysine2-oxoglutarate 5-dioxygenase [Ixodes
           ricinus]
          Length = 667

 Score =  704 bits (1818), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/673 (50%), Positives = 458/673 (68%), Gaps = 10/673 (1%)

Query: 66  LHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDD--MIILVTDSYDVIIDGGVNDILER 122
           +++ WLGGDM+  +GGGYKV LL+     +D  DD  +I++  DSYDV+   G  +IL++
Sbjct: 1   MNEEWLGGDMARGMGGGYKVRLLRKA--AVDYKDDTSVILMFVDSYDVLFAAGAKEILKK 58

Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNE 182
           F  F+ N++F AE  CWPD SL   YP    G R+LNSGG IGYA  I E++++  +++E
Sbjct: 59  FYKFNTNVLFSAEGFCWPDQSLASSYPT-AKGNRFLNSGGIIGYAXXIYEIVTSAELEDE 117

Query: 183 EDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNT 242
            DDQL+Y  ++L+E LR K  I LD  A +FQNL G++ D++L   LD   +L N+ + T
Sbjct: 118 ADDQLFYTKIYLNEDLRKKWGIKLDHRAEIFQNLNGAVGDVEL-LGLDSEPYLHNSAFGT 176

Query: 243 NPVIIHGNGKSKIELNSFGNYLAKSWKT-SGCTRCNLIKHLDSLKPDQFPSVLISVFIDK 301
            P++IHGNG SK+ LNSFGNYLAKSW + +GC  C         +P + P VLI +FI+ 
Sbjct: 177 VPLVIHGNGPSKVVLNSFGNYLAKSWNSLAGCRVCYDAFSPADKEPSELPRVLIGIFIEH 236

Query: 302 PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTV 361
           PT FL E L+K+ NLNYP ++I +FV+N  E+H    D ++  +   +++VK++ +    
Sbjct: 237 PTPFLWEALSKVYNLNYPRERIDLFVHNAVEFHEEEVDKFVEQYGQSYRSVKHMRNEDGR 296

Query: 362 NSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
               ARNLA+E  +    D+YF VDSD+HLDN D L+ L+  N +++APLL R    WSN
Sbjct: 297 KEWHARNLALEECMKIKCDYYFSVDSDAHLDNGDTLRALIEMNRTVVAPLLSRHKNLWSN 356

Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS 481
           FWGAL+ DG+YARS DY+ ++ G++  KG+WNVP+I   YL+  +++ +      +    
Sbjct: 357 FWGALSTDGYYARSHDYVQLVKGER--KGLWNVPFINTVYLINGTLLHSKEKFPSFISGL 414

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIH 541
           +D DMAFC N+R KGI + + +   YGHLV+ E FD +  NP+ YE+  N +DW+ RYIH
Sbjct: 415 LDPDMAFCKNMREKGIFMYVTNMDTYGHLVNPETFDLKLKNPDFYEIYSNQMDWERRYIH 474

Query: 542 PEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEA 601
             Y K L PD   + PCPDV+WFP+VT+ FC   ++IME +GQWS G N D+RL  GYE 
Sbjct: 475 ENYSKVLEPDFKVDMPCPDVYWFPVVTDIFCRHMIEIMENFGQWSSGKNEDERLAGGYEN 534

Query: 602 VPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLR 661
           VPTRDIHM QV     W  FLR+Y+ P+QE+ F+GY H+P RA M+FVVRY PDEQ  LR
Sbjct: 535 VPTRDIHMNQVNFEQHWLFFLREYIKPVQEKVFLGYFHDPPRAIMNFVVRYHPDEQYFLR 594

Query: 662 PHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVT 721
           PHHDSSTYTINIALN+  +DYEGGGC F+RYNC+V   + GW LMHPGRLTHYHEGL VT
Sbjct: 595 PHHDSSTYTINIALNRPKIDYEGGGCNFLRYNCSVVDLKQGWSLMHPGRLTHYHEGLPVT 654

Query: 722 QGTRYIMISFVDP 734
           +GTRYIM+SFVDP
Sbjct: 655 KGTRYIMVSFVDP 667


>gi|260786918|ref|XP_002588503.1| hypothetical protein BRAFLDRAFT_280606 [Branchiostoma floridae]
 gi|229273666|gb|EEN44514.1| hypothetical protein BRAFLDRAFT_280606 [Branchiostoma floridae]
          Length = 679

 Score =  704 bits (1816), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 337/685 (49%), Positives = 465/685 (67%), Gaps = 7/685 (1%)

Query: 51  IQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
           ++SA+   +QV+ LG+HQ WLGGD+ +++GGG KV LLK  L +     D++I+ +DSYD
Sbjct: 1   MRSADKYNIQVQVLGMHQEWLGGDVQNNIGGGQKVLLLKEALKKYKDDKDLVIMFSDSYD 60

Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
           VII     +IL +F+ F+A +VFGAE  CWPD +L D YP V  G  YLNSGGFIGYA +
Sbjct: 61  VIITAEKEEILRKFDDFNARVVFGAEGFCWPDRTLADLYPEVRLGKPYLNSGGFIGYASE 120

Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
           + +++S+ SI+N+ DDQLYY  +FL+  LR K K+ LD  + +FQN+ G+  D+ L FD 
Sbjct: 121 LYQIVSHTSIQNQHDDQLYYTRIFLNPELREKFKMKLDHTSEIFQNMNGAGADLTLKFD- 179

Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQ 289
           D+   L N  YNT P IIHGNG  K+ LN  GNY+A SW    C  C   ++  SLK D 
Sbjct: 180 DDKTRLRNRVYNTEPCIIHGNGPQKLVLNHIGNYVADSWSFDECHSCK--ENTFSLKTDD 237

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMF 349
           +P V+I +FI++PT F+ EFLNKI NL+YP  KI +F++N++E+HA    +++ N+   +
Sbjct: 238 YPVVVIGLFIEQPTPFVPEFLNKIYNLDYPKNKIVLFIHNHEEHHAGDVQEFVKNYGGDY 297

Query: 350 KNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIA 409
           K V+ +  +  +N   AR   +   +    D+Y  VD+D  + NP  L+ L+ +N S+IA
Sbjct: 298 KAVREVTPSMNMNQWYARKQGLSECIGVKCDYYLSVDADVQITNPKTLQILIQQNRSVIA 357

Query: 410 PLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK 469
           P+  +  K WSNFWGA+  DGFYARS DY++I+ G +  KG+WNVPYI N YL+  S+++
Sbjct: 358 PMATKYGKLWSNFWGAIGDDGFYARSDDYIDIVQGTK--KGVWNVPYINNVYLIHGSLLQ 415

Query: 470 ATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELI 529
                  + +  +D DMAFC +LR KGI + + +   +G L  + ++  +  +P+++++ 
Sbjct: 416 QPKTMPNFIVGQLDADMAFCASLREKGIFMYVTNMDTFGRLTTTTSYSTEHLHPDMWQMY 475

Query: 530 RNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGT 589
            N  DW+ +YIH ++ K L P T    PCPDV+WFPIVTE FC   V+ ME YG+WS G 
Sbjct: 476 DNRPDWEEKYIHADFYKMLDPKTEVEMPCPDVYWFPIVTETFCKHLVEEMENYGEWSAGK 535

Query: 590 NNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFV 649
           N D RL +GYE VPT DIHM Q+G    W  FL+++V  LQE+ + GY+ E  +A M+FV
Sbjct: 536 NEDLRLSSGYENVPTVDIHMNQIGFEREWLHFLKEFVTKLQEKVYPGYYSE-AQAIMNFV 594

Query: 650 VRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPG 709
           VRY P EQP LRPHHDSST+TIN+ALN+ GVD+EGGGCRF+RYNC+VT T+MGW+LMHPG
Sbjct: 595 VRYHPQEQPFLRPHHDSSTFTINLALNKAGVDFEGGGCRFLRYNCSVTNTKMGWLLMHPG 654

Query: 710 RLTHYHEGLQVTQGTRYIMISFVDP 734
           RLTHYHEGL  T GTRYIMISFVDP
Sbjct: 655 RLTHYHEGLPTTNGTRYIMISFVDP 679


>gi|6755110|ref|NP_036092.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Mus
           musculus]
 gi|25008937|sp|Q9R0E1.1|PLOD3_MOUSE RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3;
           AltName: Full=Lysyl hydroxylase 3; Short=LH3; Flags:
           Precursor
 gi|5880317|gb|AAD54618.1|AF046783_1 lysyl hydroxylase 3 [Mus musculus]
 gi|15145782|gb|AAK00576.1| lysyl hydroxylase 3 [Mus musculus]
 gi|26329015|dbj|BAC28246.1| unnamed protein product [Mus musculus]
 gi|26354078|dbj|BAC40669.1| unnamed protein product [Mus musculus]
 gi|28175483|gb|AAH43047.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Mus musculus]
 gi|32493408|gb|AAH54734.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Mus musculus]
 gi|148687344|gb|EDL19291.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Mus musculus]
          Length = 741

 Score =  704 bits (1816), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/711 (46%), Positives = 484/711 (68%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ DK LVITVA+ ET+GY+RF+QSAE     V+TLGL Q W GGD++ ++GGG KV  L
Sbjct: 37  VNPDKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGQEWRGGDVARTVGGGQKVRWL 96

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++     DMII+  DSYDVI+     ++L++F    ++++F AE  CWP+  L ++
Sbjct: 97  KKEMEKYADQKDMIIMFVDSYDVILASSPTELLKKFVQSGSHLLFSAESFCWPEWGLAEQ 156

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG G R+LNSGGFIG+A  I +++   + K+++DDQL+Y  L+LD  LR K K+ LD
Sbjct: 157 YPEVGMGKRFLNSGGFIGFAPTIHQIVRQWNYKDDDDDQLFYTQLYLDPGLREKLKLSLD 216

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 217 HKSRIFQNLNGALDEVILKFDQNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 275

Query: 268 W-KTSGCTRCN-LIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  ++ L   +P   P VL++VF+++PT FL  FL ++  L+YP  +IS+
Sbjct: 276 WTPQGGCGFCNQTLRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRISL 333

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFY 384
           F++N++ YH P   D     +  F  VK +     +++ EAR++A+++       +FYF 
Sbjct: 334 FLHNSEVYHEPHIADAWPQLQDHFSAVKLVGPEEALSAGEARDMAMDSCRQNPECEFYFS 393

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L NP+ L+ L+ +N  +IAP+L R  K WSNFWGAL+ + +YARS DY+ ++  
Sbjct: 394 LDADAVLTNPETLRVLIEQNRKVIAPMLSRHGKLWSNFWGALSPNEYYARSEDYVELVQR 453

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     K +++ +  D DMAFC ++R+KGI L + +
Sbjct: 454 KR--VGVWNVPYISQAYVIRGETLRTELPQKEVFSSSDTDPDMAFCKSVRDKGIFLHLSN 511

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D    +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 512 QHEFGRLLATSRYDTDHLHPDLWQIFDNPVDWREQYIHENYSRALDGEGLVEQPCPDVYW 571

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP++TE+ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 572 FPLLTEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 631

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  GVDYE
Sbjct: 632 TYVGPMTEYLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGVDYE 690

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 691 GGGCRFLRYDCRISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741


>gi|12850403|dbj|BAB28704.1| unnamed protein product [Mus musculus]
          Length = 741

 Score =  704 bits (1816), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/711 (46%), Positives = 484/711 (68%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ DK LVITVA+ ET+GY+RF+QSAE     V+TLGL Q W GGD++ ++GGG KV  L
Sbjct: 37  VNPDKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGQEWRGGDVARTVGGGQKVRWL 96

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++     DMII+  DSYDVI+     ++L++F    ++++F AE  CWP+  L ++
Sbjct: 97  KKEMEKYADQKDMIIMFVDSYDVILASSPTELLKKFVQSGSHLLFSAESFCWPEWGLAEQ 156

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG G R+LNSGGFIG+A  I +++   + K+++DDQL+Y  L+LD  LR K K+ LD
Sbjct: 157 YPEVGMGKRFLNSGGFIGFAPTIHQIVRQWNYKDDDDDQLFYTQLYLDPGLREKLKLSLD 216

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 217 HKSRIFQNLNGALDEVILKFDQNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 275

Query: 268 W-KTSGCTRCN-LIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  ++ L   +P   P VL++VF+++PT FL  FL ++  L+YP  +IS+
Sbjct: 276 WTPQGGCGFCNQTLRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRISL 333

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFY 384
           F++N++ YH P   D     +  F  VK +     +++ EAR++A+++       +FYF 
Sbjct: 334 FLHNSEVYHEPHIADAWPQLQDHFSAVKLVGPEEALSAGEARDMAMDSCRQNPECEFYFS 393

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L NP+ L+ L+ +N  +IAP+L R  K WSNFWGAL+ + +YARS DY+ ++  
Sbjct: 394 LDADAVLTNPETLRVLIEQNRKVIAPMLSRHGKLWSNFWGALSPNEYYARSEDYVELVQR 453

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     K +++ +  D DMAFC ++R+KGI L + +
Sbjct: 454 KR--VGVWNVPYISQAYVIRGETLRTELPQKEVFSSSDTDPDMAFCKSVRDKGIFLHLSN 511

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D    +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 512 QHEFGRLLATSRYDTDHLHPDLWQIFDNPVDWREQYIHENYSRALDGEGLVEQPCPDVYW 571

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP++TE+ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 572 FPLLTEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 631

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  GVDYE
Sbjct: 632 TYVGPMTEYLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGVDYE 690

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 691 GGGCRFLRYDCRISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741


>gi|24662591|ref|NP_648451.1| procollagen lysyl hydroxylase, isoform A [Drosophila melanogaster]
 gi|24662595|ref|NP_729687.1| procollagen lysyl hydroxylase, isoform B [Drosophila melanogaster]
 gi|7294743|gb|AAF50079.1| procollagen lysyl hydroxylase, isoform A [Drosophila melanogaster]
 gi|23093644|gb|AAN11883.1| procollagen lysyl hydroxylase, isoform B [Drosophila melanogaster]
          Length = 721

 Score =  700 bits (1806), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/705 (48%), Positives = 471/705 (66%), Gaps = 12/705 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           DK  V TVA+  TDGY R+I+SA V  ++V TLGL + W GGDM   GGG+K+NLL+  +
Sbjct: 27  DKIKVFTVATEPTDGYTRYIRSARVYDIEVTTLGLGEEWKGGDMQKPGGGFKLNLLREAI 86

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
                  + IIL TDSYDVII   +++I E+F    A I+F AE+ CWPD SL + YP V
Sbjct: 87  APYKNEPETIILFTDSYDVIITTTLDEIFEKFKESGAKILFSAEKYCWPDKSLANDYPEV 146

Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
            G   R+LNSG FIGYA  +  L+ +  I++  DDQLY+  +FLDET R K  + LD  +
Sbjct: 147 EGKASRFLNSGAFIGYAPQVFALLVD-PIEDTADDQLYFTKIFLDETKRAKLGLKLDVQS 205

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
            LFQNL+G+  D+KL  DL+     L N  + T P IIHGNG SK++LN++GNYLA+++ 
Sbjct: 206 RLFQNLHGAKNDVKLKVDLESNQGVLQNVDFMTTPSIIHGNGLSKVDLNAYGNYLARTF- 264

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
              C  C   ++L  L+    P + +++ + +P  F ++FL  I +LNYP +K+ + +Y+
Sbjct: 265 NGVCLLCQ--ENLLDLEETNLPVISLALMVTQPVPFFDQFLEGIESLNYPKEKLHLLIYS 322

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           N  +H      +++     +   K+      ++ ++ R LA++ +     D+ F+VD+D+
Sbjct: 323 NVAFHDDDIKSFVNKHAKEYATAKFALSTDELDERQGRQLALDKARLHQSDYIFFVDADA 382

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
           H+D+ +VL+ L+  N+  +AP+  +  + WSNFWGAL+  G+YARS DY++I+  +    
Sbjct: 383 HIDDGEVLRELLRLNKQFVAPIFSKHKELWSNFWGALSEGGYYARSHDYVDIVKREL--I 440

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           G++NVP++T+ YL+K +   A + K        D DMA C +LRN GI +   + + +GH
Sbjct: 441 GMFNVPHVTSIYLVKKTAFDAISFKH----KEFDPDMAMCESLRNAGIFMYASNLRIFGH 496

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           LV++++F+   T P+ Y L  N +DW  +YIHP Y   L       QPCPDV+WF IV++
Sbjct: 497 LVNADDFNTTVTRPDFYTLFSNEIDWTEKYIHPNYSLQLNESNKIQQPCPDVYWFQIVSD 556

Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
            FC + V IMEA+  WSDG+NND RLE GYEAVPTRDIHMKQVGL  ++ +FL+ +V PL
Sbjct: 557 AFCDDLVAIMEAHNGWSDGSNNDNRLEGGYEAVPTRDIHMKQVGLERLYLKFLQMFVRPL 616

Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
           QER F GY H P RA M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DY+GGGCRF
Sbjct: 617 QERAFTGYFHNPPRALMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRF 676

Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           IRYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISF+DP
Sbjct: 677 IRYNCSVTDTKKGWMLMHPGRLTHYHEGLLVTNGTRYIMISFIDP 721


>gi|338712640|ref|XP_001504506.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Equus
           caballus]
          Length = 829

 Score =  699 bits (1803), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 328/711 (46%), Positives = 481/711 (67%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY+RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 125 VNPEKLLVITVATAETEGYRRFLRSAEFFNYTVRTLGLGEDWRGGDVARTVGGGQKVRWL 184

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DM+I+  DSYDVI+ G  +++L++F    + ++F AE  CWP+  L ++
Sbjct: 185 KKEMEKYADREDMVIMFVDSYDVILAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 244

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 245 YPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 304

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+ K 
Sbjct: 305 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPKG 363

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  C+L +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 364 WTPEGGCGYCDLDRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVAL 421

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ YH P   D     +  F  VK +     +   EAR++A+++       +FYF 
Sbjct: 422 FLHNNEVYHEPHIADSWPQLQDHFSAVKLVGPEEALTPGEARDMAMDSCRQDPKCEFYFS 481

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ + NP  L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 482 LDADAVITNPQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 541

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     K +++ +  D DMAFC +LR++GI L + +
Sbjct: 542 KR--VGVWNVPYISQAYVIRGETLRTELPQKEVFSSSDTDPDMAFCKSLRDQGIFLHLSN 599

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L    +  QPCPDV+W
Sbjct: 600 RHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGKGLVEQPCPDVYW 659

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++++ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 660 FPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGFEDQWLQLLR 719

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 720 TYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 778

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 779 GGGCRFLRYDCVVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 829


>gi|195493375|ref|XP_002094389.1| GE20228 [Drosophila yakuba]
 gi|194180490|gb|EDW94101.1| GE20228 [Drosophila yakuba]
          Length = 727

 Score =  698 bits (1802), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 337/705 (47%), Positives = 470/705 (66%), Gaps = 12/705 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           +K  V TVA+  TDGY R+ +SA V  ++V TLGL + W GGDM   GGG+K+NLL+  +
Sbjct: 33  EKIKVFTVATEPTDGYNRYARSARVYDIEVTTLGLGEEWKGGDMQRPGGGFKLNLLREAI 92

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
                  + IIL TDSYDVII   +++I E+F    A I+F AE+ CWPD SL + YP V
Sbjct: 93  APYKNDPETIILFTDSYDVIITTTLDEIFEKFKEAGAKILFSAEKYCWPDKSLANDYPEV 152

Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
            G   R+LNSG F+GYA  +  L+ +  I++  DDQLY+  +FLDE  RTK  + LD  +
Sbjct: 153 EGKASRFLNSGAFMGYAPQVYALLED-PIEDTADDQLYFTKIFLDEAKRTKLGLKLDVKS 211

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
            LFQNL+G+  D+KL  DLD     L N  + T P IIHGNG SK++LN++GNYLA+++ 
Sbjct: 212 RLFQNLHGAKNDVKLKVDLDSNQGVLQNVDFMTTPAIIHGNGLSKVDLNAYGNYLARTF- 270

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           +  C  C   ++L  L+  + P + +++ + +   F ++FL  I  LNYP  K+ + +Y+
Sbjct: 271 SGVCLLCQ--ENLLDLEETKLPVISLALMVTQAVPFFDQFLEGIETLNYPKDKLHLLIYS 328

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           N  +H      +++     +   K+      ++ ++ R LAV+ +     D+ F+VD+D+
Sbjct: 329 NVAFHDDDIKSFVNKHAKEYATAKFALSTDELDERQGRQLAVDKARLHQSDYIFFVDADA 388

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
           H+D+ +VL+ L+  N+  +AP+  +P + WSNFWGAL+  G+YARS DY++I+  +    
Sbjct: 389 HIDDSEVLRELLRLNKQFVAPIFSKPKELWSNFWGALSEGGYYARSHDYVDIVKREL--I 446

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           G++NVP++T+ YL+K++   A + K        D DMA C +LRN GI +   + + +GH
Sbjct: 447 GMFNVPHVTSIYLVKSTAFDAISFKH----KEFDPDMAMCESLRNAGIFMYASNLRIFGH 502

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           LV++++F+   T P+ Y L  N +DW  +YIHP Y   L       QPCPDV+WF IV++
Sbjct: 503 LVNADDFNTTVTRPDFYTLFSNEIDWTEKYIHPNYSLQLNESYKLQQPCPDVYWFQIVSD 562

Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
            FC + V IMEA+  WSDG+N+D RLE GYEAVPTRDIH KQVGL  ++ +FL+ +V PL
Sbjct: 563 AFCDDLVAIMEAHNGWSDGSNSDSRLEGGYEAVPTRDIHTKQVGLERLYLKFLQLFVRPL 622

Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
           QER F GY H P RA M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DY+GGGCRF
Sbjct: 623 QERAFTGYFHNPPRALMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRF 682

Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           IRYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISF+DP
Sbjct: 683 IRYNCSVTETKKGWMLMHPGRLTHYHEGLLVTNGTRYIMISFIDP 727


>gi|357619307|gb|EHJ71931.1| putative procollagen-lysine,2-oxoglutarate 5-dioxygenase [Danaus
           plexippus]
          Length = 660

 Score =  697 bits (1800), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/665 (50%), Positives = 461/665 (69%), Gaps = 10/665 (1%)

Query: 75  MSSLGGGYKVNLLKNELDEMDITDD--MIILVTDSYDVIIDGGVNDILERFNTFDANIVF 132
           M   GGG+KVNLLK++L  M I +D   IIL TDSYDV+  G +++I+++F      ++F
Sbjct: 1   MKHEGGGHKVNLLKDKLSSMKIPEDRDQIILFTDSYDVMFLGSLDEIVQKFLAMSVRVLF 60

Query: 133 GAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALL 192
            AE  CWPD+SL  +YP       +LNSGGFIGY  ++ ++++  ++ N++DDQL+Y  +
Sbjct: 61  SAEPFCWPDSSLASQYPDSQQLNPFLNSGGFIGYLPELLKILNYETVGNKDDDQLFYTKV 120

Query: 193 FLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFD-LDEFVHLTNTKYNTNPVIIHGNG 251
           +LDE  R   +I LD  + +FQNL+G+L D++L  +  DE+ +L N      P+I+HGNG
Sbjct: 121 YLDEDYRESLRISLDHKSAIFQNLHGALSDVQLVANSTDEWPYLVNVVTKQRPLIVHGNG 180

Query: 252 KSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFL 310
            +K+ LN+  NYLAKSW  S GC  C+  + +  L  D+ P V++SVFI+  T F+EEF 
Sbjct: 181 PAKLTLNNLSNYLAKSWSVSEGCVLCDEKRIV--LDEDKLPKVMLSVFIEVATPFIEEFF 238

Query: 311 NKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLA 370
             I  ++YP +KI +F+ N  EYH    +++     + +   K I     V   EARN+A
Sbjct: 239 QSILAIDYPKQKIHLFIRNGVEYHESEVENFYQAHSSEYFTAKRIKSTDLVGEAEARNIA 298

Query: 371 VENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADG 430
            +  +    D+ F +DS + ++ PD L YL++    ++APLLVR  +AWSNFWGA+N+ G
Sbjct: 299 KDRCIGSDCDYLFCLDSHARVE-PDTLHYLLSTGYDVVAPLLVRSGQAWSNFWGAINSVG 357

Query: 431 FYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-YTLNSMDYDMAFC 489
           FY+RS DYM+I+N  +  +GIWNVP+I NCYLM  S+ +  + K + Y     D DMAFC
Sbjct: 358 FYSRSADYMDIVN--RSIEGIWNVPFINNCYLMNISLFRKPSAKHVSYLKEDTDPDMAFC 415

Query: 490 TNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLL 549
            +LR+ GI + + + +E+GHLV+SE FD  +TNP++Y++I N LDW+ RY+HP+Y +   
Sbjct: 416 ASLRSAGIMMYVSNEKEFGHLVNSETFDVSRTNPDIYQVIDNKLDWEQRYLHPKYHEIFA 475

Query: 550 PDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHM 609
                  PCPDV+WFP+++ +FC E++++MEA+GQWSDG+NNDKRLE+GYEAVPTRDIHM
Sbjct: 476 NKEKQLMPCPDVYWFPLMSMRFCKEWIEVMEAFGQWSDGSNNDKRLESGYEAVPTRDIHM 535

Query: 610 KQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY 669
            QVGL   W   L+ YV PLQE  F GY+H P  + M+FVVRYRPDEQPSLRPHHDSSTY
Sbjct: 536 NQVGLDIQWLRILKDYVRPLQELVFTGYYHNPPVSVMNFVVRYRPDEQPSLRPHHDSSTY 595

Query: 670 TINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
           TIN+ALN   +DYEGGGCRFIRYNC+V  T+ GW+LMHPGRLTH+HEGL VT+GTRYIMI
Sbjct: 596 TINLALNTPHLDYEGGGCRFIRYNCSVKDTKPGWLLMHPGRLTHFHEGLLVTKGTRYIMI 655

Query: 730 SFVDP 734
           SFVDP
Sbjct: 656 SFVDP 660


>gi|223647994|gb|ACN10755.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Salmo
           salar]
          Length = 735

 Score =  695 bits (1793), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/736 (45%), Positives = 488/736 (66%), Gaps = 13/736 (1%)

Query: 8   NCLILSCVVFFISVHCN---KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTL 64
           +C+ + CV+    +  +   + + I  D  LVITVA+ +TDG+ RF+++++     VK L
Sbjct: 4   SCIAVVCVLLLGWMQSSLGAEQRVISPDNLLVITVATEDTDGFTRFMRTSKEFNYTVKVL 63

Query: 65  GLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
           GL + W GGD++ ++GGG KV  LK EL +     D++IL  DSYDVI+  G  ++L +F
Sbjct: 64  GLGEQWKGGDVARTVGGGQKVRWLKTELLKHSDKKDLVILFVDSYDVILASGPEELLWKF 123

Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
           +     +VF AE  CWPD  L  KYPAV +G RYLNSGGFIG+A ++ E++     K+ +
Sbjct: 124 SRLGHRMVFSAEGFCWPDQKLAPKYPAVHTGKRYLNSGGFIGFAPELSEIVQQWKHKDND 183

Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
           DDQL+Y  ++LD+  RTK+ + LD  + +FQNL G++E++ L F+    V   N  Y+T 
Sbjct: 184 DDQLFYTKIYLDKVQRTKYNMTLDHRSRIFQNLNGAIEEVVLKFEKAR-VRARNVAYDTL 242

Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDK 301
           PVIIHGNG +K++LN  GNY+  +W   +GC  C+  + +L+    +  P V +SVFI +
Sbjct: 243 PVIIHGNGPTKLQLNYLGNYVPTAWTHETGCGICDDDLVYLNDTPDEDMPLVYLSVFIVQ 302

Query: 302 PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTV 361
           PT FLEEFL ++ +LNYP  +I +F++NN  YH      +    + +F   + +     +
Sbjct: 303 PTPFLEEFLERLTSLNYPTSRIRLFIHNNVVYHEQHIQRFWEKHRVLFPEARLVGPEENL 362

Query: 362 NSKEARNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
              +AR +AVE        ++YF +D+D  + NPDVL+ L+  N+S+IAP+L R  K WS
Sbjct: 363 QQDQARTMAVEACQKDHQCEYYFSIDADVVIVNPDVLRVLIEENKSVIAPMLSRHGKLWS 422

Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTL 479
           NFWGAL+ +GFY+RS DY++I+ G +   G+WNVPYIT  Y++K SV++   +  ++Y+ 
Sbjct: 423 NFWGALSPEGFYSRSEDYIDIVQGKR--IGLWNVPYITQVYMIKGSVLRGRLSQVSLYSQ 480

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRY 539
             MD DM FC  +R++G+ + + +  E+G LV S N++  + +P+++++  NPLDW  +Y
Sbjct: 481 EGMDPDMVFCRAVRDQGVFMFVSNRDEFGRLVSSSNYNTSRLHPDMWQIFDNPLDWKDKY 540

Query: 540 IHPEYQKSLLPD-TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETG 598
           IH  Y +    + TV  QPCPDV+WFP  T++ C + V+ ME +G+WS G + D+RL  G
Sbjct: 541 IHENYSQIFEDNQTVVEQPCPDVYWFPSFTDRMCDDLVETMEDFGEWSGGRHTDERLAGG 600

Query: 599 YEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQP 658
           YE VPT DIHM Q+G    W +FL++Y+ P+ E+ + GY+ +  +A M+FVVRYRPDEQP
Sbjct: 601 YENVPTVDIHMNQIGFEKEWLKFLKEYISPVTEKLYPGYYPK-AQAVMNFVVRYRPDEQP 659

Query: 659 SLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGL 718
           SLRPHHDSST+TIN+ALN  G+DY+GGGCRF+RY+CNV A R GW  MHPGRLTHYHEGL
Sbjct: 660 SLRPHHDSSTFTINVALNNKGLDYQGGGCRFLRYDCNVEAPRKGWSFMHPGRLTHYHEGL 719

Query: 719 QVTQGTRYIMISFVDP 734
             T GTRYIM+SFVDP
Sbjct: 720 PTTSGTRYIMVSFVDP 735


>gi|300795072|ref|NP_001180184.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Bos
           taurus]
 gi|296473078|tpg|DAA15193.1| TPA: procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Bos
           taurus]
          Length = 751

 Score =  694 bits (1790), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 326/711 (45%), Positives = 479/711 (67%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY+RF+QSAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 47  VNPEKMLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 106

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DM+I+  DSYDV++ G  +++L++F    + ++F AE  CWP+  L ++
Sbjct: 107 KKEMEKYAEREDMVIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 166

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 167 YPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLGLSLD 226

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L F  +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 227 HKSRIFQNLNGALDEVVLKFGRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 285

Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 286 WTPEGGCGFCNQGRRPLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 343

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ YH P  D+     +  F  VK +     +   EAR++A++        +FYF 
Sbjct: 344 FLHNNEVYHEPHIDESWPQLQDHFSAVKLVGPEEALTPGEARDMAMDICRQDPKCEFYFS 403

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ + NP  L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 404 LDADTVITNPQTLRILIEANRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 463

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC +LR+KGI L + +
Sbjct: 464 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 521

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 522 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYTRALEGEGLVEQPCPDVYW 581

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 582 FPLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 641

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 642 TYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 700

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 701 GGGCRFLRYDCVISSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 751


>gi|440908420|gb|ELR58434.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Bos grunniens
           mutus]
          Length = 751

 Score =  694 bits (1790), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 326/711 (45%), Positives = 479/711 (67%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY+RF+QSAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 47  VNPEKMLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 106

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DM+I+  DSYDV++ G  +++L++F    + ++F AE  CWP+  L ++
Sbjct: 107 KKEMEKYAEREDMVIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 166

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 167 YPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLGLSLD 226

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L F  +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 227 HKSRIFQNLNGALDEVVLKFGRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 285

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 286 WTPEGGCGFCNHGRRPLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 343

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ YH P  D+     +  F  VK +     +   EAR++A++        +FYF 
Sbjct: 344 FLHNNEVYHEPHIDESWPQLQDHFSAVKLVGPEEALTPGEARDMAMDICRQDPKCEFYFS 403

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ + NP  L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 404 LDADTVITNPQTLRILIEANRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 463

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC +LR+KGI L + +
Sbjct: 464 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 521

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 522 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYTRALEGEGLVEQPCPDVYW 581

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 582 FPLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 641

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 642 SYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 700

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 701 GGGCRFLRYDCVISSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 751


>gi|194868950|ref|XP_001972362.1| GG13929 [Drosophila erecta]
 gi|190654145|gb|EDV51388.1| GG13929 [Drosophila erecta]
          Length = 727

 Score =  693 bits (1789), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/705 (47%), Positives = 470/705 (66%), Gaps = 12/705 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           +K  V TVA+  TDGY R+ +SA V  ++V TLGL + W GGDM   GGG+K+NLL+  +
Sbjct: 33  EKIKVFTVATEPTDGYTRYFRSARVYDIEVTTLGLGEEWKGGDMQRPGGGFKLNLLREAI 92

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
                  + IIL TDSYDVII   +++I E+F    A I+F AE+ CWPD SL + YP V
Sbjct: 93  APYKNDPETIILFTDSYDVIITTTLDEIFEKFKEAGAKILFSAEKFCWPDKSLANDYPEV 152

Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
            G   R+LNSG F+GYA  +  L+ +  I++  DDQLY+  +FLDE  RTK  + LD  +
Sbjct: 153 EGKASRFLNSGAFMGYAPQVFALLED-PIEDTADDQLYFTKIFLDEAKRTKLGLKLDVKS 211

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
            LFQNL+G+  D+KL  DL+     L N  + T P IIHGNG SK++LN++ NYLA+++ 
Sbjct: 212 RLFQNLHGAKNDVKLKVDLESNQGVLQNVDFMTTPAIIHGNGLSKVDLNAYSNYLARTF- 270

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           +  C  C   ++L  L+    P + +++ + +   F ++FL  I +LNYP +K+ + +Y+
Sbjct: 271 SGVCLLCQ--ENLLDLEETNLPVISVALMVTQAVPFFDQFLKGIESLNYPKEKLHLLIYS 328

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           N  +H      +++ +   +   K+      ++ ++ R LA++ +     D+ F+VD+D+
Sbjct: 329 NVAFHDDDIKSFVNKYAKEYATAKFALSTDELDERQGRQLALDKARLHQSDYIFFVDADA 388

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
           H+D+ +VL+ L+  N+  +AP+  +  + WSNFWGAL+  G+YARS DY++I+  +    
Sbjct: 389 HIDDSEVLRELLRLNKQFVAPIFSKHNELWSNFWGALSEGGYYARSHDYVDIVKREL--I 446

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           G++NVP++T+ YL+K +   A +    Y     D DMA C +LRN GI +   + + +GH
Sbjct: 447 GMFNVPHVTSIYLVKNTAFDAIS----YKHKEFDPDMAMCESLRNAGIFMYASNLRIFGH 502

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           LV++++F+   T P+ Y L  N +DW  +YIHP Y   L       QPCPDV+WF IV++
Sbjct: 503 LVNADDFNTTVTRPDFYTLFSNEIDWTEKYIHPNYSLQLNESNKLQQPCPDVYWFQIVSD 562

Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
            FC + V IMEA+  WSDG+N+D RLE GYEAVPTRDIHMKQVGL  ++ +FL+ +V PL
Sbjct: 563 AFCDDLVAIMEAHNGWSDGSNSDNRLEGGYEAVPTRDIHMKQVGLERLYLKFLQLFVRPL 622

Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
           QER F GY H P RA M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DY+GGGCRF
Sbjct: 623 QERAFTGYFHNPPRALMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRF 682

Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           IRYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISF+DP
Sbjct: 683 IRYNCSVTETKKGWMLMHPGRLTHYHEGLLVTNGTRYIMISFIDP 727


>gi|403285799|ref|XP_003934198.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
           [Saimiri boliviensis boliviensis]
          Length = 736

 Score =  693 bits (1789), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/711 (45%), Positives = 478/711 (67%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 32  VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 91

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+ GG  ++L++F    + ++F AE  CWP+  L ++
Sbjct: 92  KKEMEKYADREDMIIMFVDSYDVILAGGPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 151

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  +  ++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 152 YPEVGTGKRFLNSGGFIGFATTVHHIVRQWRYKDDDDDQLFYTRLYLDPGLREKLGLSLD 211

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 212 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 270

Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++  L+YP ++I++
Sbjct: 271 WTPEGGCGFCNRDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPHERITL 328

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ +H P  +D     +  F   K +     ++  EAR++A++        +FYF 
Sbjct: 329 FLHNNEVFHEPHIEDSWPQLQDHFAATKLVGPEEALSPGEARDMAMDMCRQDPECEFYFS 388

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L NP  L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 389 LDADAVLTNPQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 448

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI L + +
Sbjct: 449 KR--VGVWNVPYISQAYVIRGETLRMELPQREVFSGSDTDPDMAFCKSFRDKGIFLHLSN 506

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D +  +P+++++  NP+DW  +Y+H  Y ++L  + +  QPCPDV+W
Sbjct: 507 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWQEQYVHENYSRALEGEGIVEQPCPDVYW 566

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 567 FPLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 626

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 627 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 685

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RYNC +++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 686 GGGCRFLRYNCVISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 736


>gi|417404299|gb|JAA48909.1| Putative procollagen-lysine2-oxoglutarate 5-dioxygenase 3 [Desmodus
           rotundus]
          Length = 741

 Score =  691 bits (1783), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/711 (45%), Positives = 479/711 (67%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ +T+GY+RF+QSAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 37  VNPEKLLVITVATAKTEGYRRFLQSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 96

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DM+IL  DSYDVI+ G  +++L++F    + ++F AE  CWP+  L ++
Sbjct: 97  KKEMEKYADQEDMVILFVDSYDVILAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 156

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 157 YPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 216

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 217 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 275

Query: 268 WKTSG-CTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W   G C  C+  +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  +I++
Sbjct: 276 WTPGGGCGFCDRDRRILPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLMLLDYPPDRITL 333

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ YH P   D     +  F  VK +     +   EAR++A+++       +FYF 
Sbjct: 334 FLHNNEVYHEPHIADSWPQLQDHFSAVKLVGPEEALTPGEARDMAMDSCRQDPECEFYFS 393

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ + N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 394 LDADAVITNLQALRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 453

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC +LR KGI L + +
Sbjct: 454 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLREKGIFLHLSN 511

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 512 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHKNYSRALEGEGLVEQPCPDVYW 571

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++++ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 572 FPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 631

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E+ F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 632 TYVGPMTEKLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 690

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C ++A R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 691 GGGCRFLRYDCVISAPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741


>gi|344289632|ref|XP_003416546.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           [Loxodonta africana]
          Length = 741

 Score =  689 bits (1779), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/712 (45%), Positives = 480/712 (67%), Gaps = 11/712 (1%)

Query: 28  NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNL 86
            ++ ++ LVITVA+ ET+GY+RF++SAE     V+TLGL + W GGD++ ++GGG KV  
Sbjct: 36  RVNPERLLVITVATAETEGYRRFLRSAEFFNYTVRTLGLGKEWRGGDVARTVGGGQKVRW 95

Query: 87  LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
           LK E+++    +DM+I+  DSYDVI+ G   ++L++F    ++++F AE  CWP+  L +
Sbjct: 96  LKKEMEKYRDQEDMVIMFVDSYDVILAGSPTELLKKFVQSGSHLLFSAESFCWPEWGLAE 155

Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
           +YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K ++ L
Sbjct: 156 QYPEVGTGKRFLNSGGFIGFAPIIHQIVRQWKYKDDDDDQLFYTQLYLDPGLREKLRLSL 215

Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
           D  + +FQNL G+L+++ L FD +  V + N  Y+T PV+IHGNG +K++LN  GNY+  
Sbjct: 216 DHKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVIHGNGPTKLQLNYLGNYVPS 274

Query: 267 SW-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKIS 324
            W    GC  CN  +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  +++
Sbjct: 275 GWTPEGGCGFCNKDQRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVT 332

Query: 325 MFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK-GVDFYF 383
           +F++NN+ YH P   D     +  F  VK +     +   EAR++A+++       +FYF
Sbjct: 333 LFLHNNEVYHEPHIADAWPQLQDHFSVVKLVGPEEALTPGEARDMAMDSCRQDLSCEFYF 392

Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
            +D+D+ + N   L+ L+  +  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++ 
Sbjct: 393 SLDADAVITNQQTLRILIEEDRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQ 452

Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKID 502
             +   G+WNVPYI+  Y+++   ++     K +++ +  D DMAFC +LR+KG+ L + 
Sbjct: 453 RKR--VGVWNVPYISQAYVIRGETLRTELPQKEVFSSSDTDPDMAFCKSLRDKGVFLHLS 510

Query: 503 STQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
           +  E+G L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L  + +  QPCPDV+
Sbjct: 511 NQHEFGRLLATSRYDIDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGMVEQPCPDVY 570

Query: 563 WFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFL 622
           WFP+++++ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + L
Sbjct: 571 WFPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLL 630

Query: 623 RKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDY 682
           R YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DY
Sbjct: 631 RTYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDY 689

Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           EGGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 690 EGGGCRFLRYDCVVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741


>gi|444715597|gb|ELW56462.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Tupaia
           chinensis]
          Length = 744

 Score =  689 bits (1778), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/712 (45%), Positives = 482/712 (67%), Gaps = 11/712 (1%)

Query: 28  NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNL 86
            ++ +K LVITVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  
Sbjct: 39  RVNPEKLLVITVATAETEGYHRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRW 98

Query: 87  LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
           LK E+++    +DM+I+  DSYDV++ G  +++L++F    ++++F AE  CWP+  L +
Sbjct: 99  LKKEMEKYADREDMVIMFVDSYDVVLAGSPSELLKKFVQSGSHLLFSAESFCWPEWGLAE 158

Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
           +YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  +R K ++ L
Sbjct: 159 QYPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGVREKLRLNL 218

Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
           D  + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+  
Sbjct: 219 DHKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPN 277

Query: 267 SW-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKIS 324
            W    GC  CN  +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  +++
Sbjct: 278 GWTPEGGCGFCNRDRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVT 335

Query: 325 MFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYF 383
           +F++NN+ YH P   D+    +  F +VK +     ++  EAR++A+++       +FYF
Sbjct: 336 LFLHNNEVYHEPHIADFWPELQDHFSDVKLVGPEEALSPGEARDMAMDSCRQDPECEFYF 395

Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
            +D+D+ L N   L+ L+  +  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++ 
Sbjct: 396 SLDADTVLTNQQTLRILIEEDRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQ 455

Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKID 502
             +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC  LR+KGI L + 
Sbjct: 456 RKR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKTLRDKGIFLHLS 513

Query: 503 STQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
           +  E+G L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L  + +  QPCPDV+
Sbjct: 514 NQHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGIVEQPCPDVY 573

Query: 563 WFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFL 622
           WFP+++++ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + L
Sbjct: 574 WFPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLL 633

Query: 623 RKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDY 682
           R YV P+ E  F GYH +  RA M+FVVRYRPDEQPSL+PHHDSST+T+N+ALN  G+DY
Sbjct: 634 RTYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLQPHHDSSTFTLNVALNHKGLDY 692

Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           EGGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 693 EGGGCRFLRYDCVVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 744


>gi|431898207|gb|ELK06902.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Pteropus alecto]
          Length = 743

 Score =  689 bits (1778), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 328/711 (46%), Positives = 478/711 (67%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ +T+GY+RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 39  VNLEKLLVITVATAQTEGYRRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 98

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DM+I+  DSYDVI+ G  +++L++F    + ++F AE  CWP+  L ++
Sbjct: 99  KKEMEKYADREDMVIMFVDSYDVILAGSPSELLKKFIQSGSRLLFSAEGFCWPEWGLAEQ 158

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 159 YPEVGTGKRFLNSGGFIGFAPTIHQIVHQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 218

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 219 HKSRIFQNLNGALDEVILKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 277

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  C+L +  L   KP   P VL++VF+++PT FL  FL ++  L+YP  +I++
Sbjct: 278 WTPEGGCGFCDLDRRTLPGGKPP--PRVLLAVFVEQPTPFLPRFLQRLMLLDYPPNRITL 335

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ YH P   D     +  F  VK +     +   EAR++A+++       +FYF 
Sbjct: 336 FLHNNEVYHEPHIADSWPQLQNHFSAVKLVGPEEALTPGEARDMAMDSCRQDPECEFYFS 395

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ + NP  L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 396 LDADAVITNPKTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 455

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC   R+KGI L + +
Sbjct: 456 KR--VGVWNVPYISQAYVIQGETLRTELPQREVFSGSDTDPDMAFCKTWRDKGIFLHLSN 513

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D    +P+++++  NPLDW  +YIH  Y  +L  + +  QPCPDV+W
Sbjct: 514 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSLALEGEGLVEQPCPDVYW 573

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++++ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 574 FPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 633

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 634 TYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 692

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C V+A R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 693 GGGCRFLRYDCVVSAPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 743


>gi|327285113|ref|XP_003227279.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           [Anolis carolinensis]
          Length = 741

 Score =  689 bits (1778), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/716 (45%), Positives = 471/716 (65%), Gaps = 10/716 (1%)

Query: 24  NKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGY 82
           N V  +   K LV+T A++ET+GY+RF+++A+     VKTLGL + W GGD++ ++GGG 
Sbjct: 31  NPVDPVSPGKLLVLTAATDETEGYQRFLRTAKFFNYTVKTLGLGEDWKGGDVARTVGGGQ 90

Query: 83  KVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
           KV  LKNE+ +    +D+I++  DSYDVI+ G   ++L +F  F + +VF AE  CWP+ 
Sbjct: 91  KVRWLKNEMKKYANEEDLIVMFVDSYDVILAGSPIELLWKFRHFKSKLVFSAESFCWPEW 150

Query: 143 SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKH 202
           SL +KYPAV  G R+LNSGGFIGYA  I  ++     K+ +DDQL+Y  ++LD  LR KH
Sbjct: 151 SLAEKYPAVAVGKRFLNSGGFIGYAPTINRIVQMWKYKDNDDDQLFYTRIYLDPGLREKH 210

Query: 203 KIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGN 262
            I LD  + +FQNL G++E++ L F+    V   N  Y+T PV+IHGNG +K++LN  GN
Sbjct: 211 GITLDHKSKIFQNLNGAIEEVVLKFEPTR-VRARNVAYDTLPVVIHGNGPTKLQLNYLGN 269

Query: 263 YLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPA 320
           Y+  +W    GC  C+  +  L  L  + +P VL+ VFI++PT F  +FL ++   +YP 
Sbjct: 270 YVPNAWTYEGGCGTCDQGLLDLSDLPDESYPRVLVGVFIEQPTPFFPQFLQRLLTFDYPY 329

Query: 321 KKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGV- 379
             +S+F++N   YH           +  F ++K +     ++  EAR++A++        
Sbjct: 330 SHLSLFIHNRVVYHEQHIQAEWEQLREAFDSIKLVGPEEDISEGEARDIAMDLCRQDTTC 389

Query: 380 DFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYM 439
           D+YF +D+D  + NP++L+ L+  N+ +IAP++ R  K WSNFWGAL+ D +YARS DY+
Sbjct: 390 DYYFSLDADVVVTNPEILQILIQENKKVIAPMMSRHGKLWSNFWGALSPDEYYARSEDYV 449

Query: 440 NIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIH 498
            ++ G +   G+WNVPYI+  YL++   ++     + I+TL+  D DMAFC ++R KGI 
Sbjct: 450 ELVQGKR--IGMWNVPYISQAYLLRGETLRQELPQRNIFTLDDTDPDMAFCKSVREKGIF 507

Query: 499 LKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPC 558
           L I +  E+G L+ +  ++  + +P+++++  NPLDW  +YIH  Y + +L    + QPC
Sbjct: 508 LHISNRDEFGRLLSTSRYNTSRLHPDLWQISENPLDWQDKYIHENYSR-VLEGEYHEQPC 566

Query: 559 PDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW 618
           PDV+WFP+ T++ C E V+  E +GQWS G + D RL  GYE VPT DIHM Q+     W
Sbjct: 567 PDVYWFPVFTDQMCDELVEEAENFGQWSGGKHEDTRLAGGYENVPTVDIHMNQISFEKEW 626

Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
            +FLR Y+ P+ E+ + GY+ +  RA M+F+VRYRPDEQPSLRPHHDSST+TIN+ALN  
Sbjct: 627 LQFLRDYIAPVTEKLYPGYYTK-ARAIMNFMVRYRPDEQPSLRPHHDSSTFTINVALNHK 685

Query: 679 GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           G+DYEGGGCRFIRYNC V + R GW LMHPGRLTHYHEGL  T GTRYIM+SFVDP
Sbjct: 686 GIDYEGGGCRFIRYNCQVESPRKGWSLMHPGRLTHYHEGLPTTSGTRYIMVSFVDP 741


>gi|194748144|ref|XP_001956509.1| GF24561 [Drosophila ananassae]
 gi|190623791|gb|EDV39315.1| GF24561 [Drosophila ananassae]
          Length = 730

 Score =  689 bits (1778), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/707 (48%), Positives = 465/707 (65%), Gaps = 16/707 (2%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           DK  V TVA+  TDGY R+ +SA V  ++V TLGL + W GGDM   GGG+K+NLL+  +
Sbjct: 36  DKVKVFTVATEPTDGYNRYYRSARVYDIEVTTLGLGKEWKGGDMQHPGGGFKLNLLRKAI 95

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
                  + IIL TDSYDV+I   + +I+E+F    A ++  AE+ CWPD SL + YP V
Sbjct: 96  SPFKNDPEKIILFTDSYDVLITAPLEEIVEKFKDSGAKVLISAEKYCWPDKSLANAYPEV 155

Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
            G   R+LNSG FIGYA  +  L+ +  I++  DDQLY   +FL++  R+K  + LDT +
Sbjct: 156 EGKASRFLNSGAFIGYAPQVYGLLED-PIEDTADDQLYLTKVFLNDAKRSKLGLKLDTQS 214

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
            LFQNL+G+  D+KL  DL+     L N  + T P I+HGNG SK++LN++ NYLAK++ 
Sbjct: 215 KLFQNLHGAKNDVKLKVDLESNQGVLQNVDFMTTPAILHGNGLSKVDLNAYANYLAKTF- 273

Query: 270 TSGCTRCNLIKHLDSLKPDQ--FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
              C  C    H + L+ D    P + +++   +P  F + FL  I  +NYP K + +F+
Sbjct: 274 NGVCLLC----HENRLELDNTNLPVISLALIAPQPVPFYDRFLEGIRKINYPKKSLHLFI 329

Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
           Y+N   H      Y+      + + KYI     ++ ++ R LA++ +     D+ FYVD+
Sbjct: 330 YSNAALHDDDTKSYVEKHGEEYASAKYILSTDELDERQGRQLALDKARLHNSDYMFYVDA 389

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ +D+ +VL+ L+  N+  + PL  +  + WSNFWGAL+  G+YARS DY++I+  D  
Sbjct: 390 DALIDDGEVLRELLALNKQFVGPLFTKHHELWSNFWGALSDGGYYARSHDYVDIVKRDL- 448

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
             GI+NVP++T+ YL+K+S   A + +        D DMA C +LRN GI + I + + +
Sbjct: 449 -LGIFNVPHVTSIYLVKSSAFDAMSFQH----KEFDPDMALCESLRNAGIFMYISNQRYF 503

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
           GHLV+++NF+   T P+ Y L  N  DW  +YIHP Y + L       QPCPDVFWF IV
Sbjct: 504 GHLVNTDNFNSTVTRPDFYTLFSNRYDWTEKYIHPNYSQQLNATYPIPQPCPDVFWFQIV 563

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           T+ FC + V IMEA+G WSDG+N+D RLE GYEAVPTRDIHMKQVGL  ++ +FL+ +V 
Sbjct: 564 TDAFCDDLVAIMEAHGSWSDGSNSDARLEGGYEAVPTRDIHMKQVGLEPLYLKFLQMFVR 623

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           PLQER F GY H P R+ M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N  G+DYEGGGC
Sbjct: 624 PLQERVFTGYFHNPPRSLMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNHAGIDYEGGGC 683

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RF+RYNC+V  T+ GWMLMHPGRLTHYHEGL VT+GTRYIMISF+DP
Sbjct: 684 RFLRYNCSVVDTKKGWMLMHPGRLTHYHEGLLVTEGTRYIMISFIDP 730


>gi|73957734|ref|XP_536856.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
           isoform 1 [Canis lupus familiaris]
          Length = 740

 Score =  689 bits (1777), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 326/711 (45%), Positives = 476/711 (66%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY+RF+ SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 36  VNPEKLLVITVATAETEGYRRFLWSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 95

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DM+I+  DSYDVI+ G   ++L++F    + ++F AE  CWP+  L ++
Sbjct: 96  KKEMEKYADREDMVIMFVDSYDVILAGSPAELLKKFVQSGSRLLFSAEGFCWPEWGLAEQ 155

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 156 YPEVGTGKRFLNSGGFIGFAPTIHKVVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 215

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K+ LN  GNY+   
Sbjct: 216 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLHLNYLGNYVPNG 274

Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  C   +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 275 WTPQGGCGFCGRDRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 332

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ YH P   D     +  F  VK +     +   EAR++A+++       +FYF 
Sbjct: 333 FLHNNEVYHEPHIADSWPQLQDHFSAVKLVGPEEALTPGEARDMAMDSCRQDPECEFYFS 392

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ + NP  L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 393 LDADAVITNPQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 452

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC +LR+KGI L + +
Sbjct: 453 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 510

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 511 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGLVEQPCPDVYW 570

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++++ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 571 FPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 630

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 631 TYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 689

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 690 GGGCRFLRYDCIVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 740


>gi|395842838|ref|XP_003794215.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
           [Otolemur garnettii]
          Length = 744

 Score =  688 bits (1776), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/711 (45%), Positives = 477/711 (67%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY+RF+QSAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 40  VNPEKLLVITVATAETEGYRRFLQSAEFFNYSVRTLGLGEEWRGGDVARTVGGGQKVRWL 99

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DM+I+  DSYDVI+ G  +++L++F    + ++F AE  CWP   L ++
Sbjct: 100 KREMEKYADQEDMVIMFVDSYDVILAGSPSELLKKFVQSGSRLLFSAEGFCWPQWGLAEQ 159

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I  ++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 160 YPEVGTGKRFLNSGGFIGFAPTIHHIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 219

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L++I L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 220 HKSRIFQNLNGALDEIVLKFDRNH-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 278

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  C+  +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 279 WSPEGGCGFCSRDRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 336

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ +H P   D     +  F  VK +     ++  EAR++A+++       +FYF 
Sbjct: 337 FLHNNEVFHEPHIADAWPQLQDHFSAVKLVGPEEALSPGEARDMAMDSCRQDPKCEFYFS 396

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 397 LDADAVLTNRQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 456

Query: 445 DQGGKGIWNVPYITNCYLMKTSVI-KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   + K    K +++ +  D DMAFC +LR+KGI L + +
Sbjct: 457 KR--VGVWNVPYISQAYVIRGETLRKELPQKEVFSGSDTDPDMAFCKSLRDKGIFLHLSN 514

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L    +  QPCPDV+W
Sbjct: 515 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGKEIVEQPCPDVYW 574

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++++ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 575 FPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 634

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQP+LRPHHDSST+T+N+ALN  G+DYE
Sbjct: 635 TYVGPMTEHLFPGYHTK-TRAVMNFVVRYRPDEQPALRPHHDSSTFTLNVALNHKGLDYE 693

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 694 GGGCRFLRYDCIISSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 744


>gi|113931556|ref|NP_001039229.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 precursor
           [Xenopus (Silurana) tropicalis]
 gi|89272476|emb|CAJ83048.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Xenopus
           (Silurana) tropicalis]
          Length = 733

 Score =  687 bits (1774), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/711 (45%), Positives = 478/711 (67%), Gaps = 10/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           +  DK LV+TVA++ T+GY+RF+++A      V+TLGL   W GGD++ ++GGG KV  L
Sbjct: 28  VRPDKLLVVTVATDTTEGYERFLRTARHFNYTVRTLGLGHEWKGGDVARTVGGGQKVRWL 87

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K EL++    DD++I+  DSYDV+I G   ++L +F+ F+  +VF AE  CWP+ SL + 
Sbjct: 88  KEELEKHSEQDDLVIMFVDSYDVVIAGTPTELLWKFHQFEHKVVFSAEGFCWPEWSLAES 147

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP + +G R+LNSGGFIG+A  +  ++     K+++DDQL+Y  ++LDE+LR K  I LD
Sbjct: 148 YPPISNGKRFLNSGGFIGFAPQLYRMVQLWKYKDDDDDQLFYTKVYLDESLREKFDIALD 207

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+++++ L F+ ++ V   N  Y+T PV+IHGNG +K++LN  GNY+  S
Sbjct: 208 HKSKIFQNLNGAIDEVVLKFERNK-VRARNVAYDTIPVVIHGNGPTKLQLNYLGNYVPNS 266

Query: 268 WK-TSGCTRCNLIKHLDSLKPD-QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  C+      S+  D   P VL+ VFI++PT FL +FL ++  L+YP  ++S+
Sbjct: 267 WTHEGGCEVCDDDLLDLSMLEDDALPQVLLGVFIEQPTPFLPQFLERLVQLDYPRNRLSL 326

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           +++N++ +H      +  N K  F ++K +     ++  EAR++ ++     +  D+YF 
Sbjct: 327 YIHNSEVFHEKHIQAFWENNKDSFTSIKIVGPEEALSQGEARDMGMDLCRQDETCDYYFS 386

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           VD+D  L NPD L  L+  N+ +IAP++ R  K WSNFWGAL+ +G+YARS DY++I+ G
Sbjct: 387 VDADVVLTNPDTLYILIQENKKVIAPMVSRSGKLWSNFWGALSPEGYYARSEDYVDIVQG 446

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            + G  +WNVPYI + YL+K   ++   + K I+TL  MD DMAFC ++R+K + L + +
Sbjct: 447 KRAG--VWNVPYIAHVYLIKGETLRNELSNKNIFTLPQMDSDMAFCKSIRDKSVFLHLSN 504

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  ++  + + +++++  NP+DW  +YIH  Y K    D    QPCPDV+W
Sbjct: 505 RDEFGRLISTSKYNTSRLHNDLWQIFENPVDWREKYIHENYTKIFEEDYFE-QPCPDVYW 563

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+  E  C EFV+ ME +GQWS G N D+RL  GYE VPT DIHM QVG    W +FL+
Sbjct: 564 FPVFKEVMCDEFVEEMENFGQWSGGKNTDQRLAGGYENVPTVDIHMTQVGYQEEWLKFLQ 623

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
           +Y+ P+ E+ F GY+ +  +A ++F+VRYRPDEQPSLRPHHDSST+TINIALN  G+DYE
Sbjct: 624 EYIAPVTEKLFPGYYTK-AKALLNFIVRYRPDEQPSLRPHHDSSTFTINIALNNKGIDYE 682

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RYNC V + R GW  MHPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 683 GGGCRFLRYNCRVESPRKGWSFMHPGRLTHYHEGLPTTRGTRYIMVSFVDP 733


>gi|147899260|ref|NP_001080446.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 precursor
           [Xenopus laevis]
 gi|27696396|gb|AAH43893.1| Plod-prov protein [Xenopus laevis]
          Length = 733

 Score =  686 bits (1770), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/708 (45%), Positives = 478/708 (67%), Gaps = 10/708 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
           DK LV+TVA+  T+GY RF+++A      V+TLGL   W GGD++ ++GGG KV  LK+E
Sbjct: 31  DKLLVVTVATEATEGYLRFLRTARHFNYTVRTLGLGHEWKGGDVARTVGGGQKVRWLKHE 90

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    DD+II+  DSYDV+I G   ++L +F  F+  +VF AE  CWP+ SL + YP 
Sbjct: 91  LEQHKDQDDLIIMFVDSYDVVISGSPTELLWKFQRFEHKVVFSAEGFCWPEWSLAESYPP 150

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           + +G R+LNSGGFIG+A  + +++     K+ +DDQL+Y  ++LDE++R K  I LD  +
Sbjct: 151 ITNGKRFLNSGGFIGFAPQLYQMVQLWKYKDNDDDQLFYTKIYLDESMREKFDITLDHKS 210

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
           N+FQNL G+++++ L F+ ++ V   N  Y+T PV+IHGNG +K++LN  GNY+  SW  
Sbjct: 211 NIFQNLNGAIDEVVLKFESNK-VRARNVAYDTIPVVIHGNGPTKLQLNYLGNYVPNSWTH 269

Query: 270 TSGCTRCNLIKHLDSLKPD-QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
             GC  C+      S+  D   P VL+ VFI++PT F+ +FL ++  L+YP  ++S++++
Sbjct: 270 EGGCEVCDDDLLDLSMLEDDALPHVLLGVFIEQPTPFIPQFLQRLVQLDYPRNRLSLYIH 329

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++ YH    + +   +K  F ++K +     ++  EAR++ ++     +  D+YF VD+
Sbjct: 330 NSEVYHERHIEVFYKKYKDSFTSIKIVGPEEAMSQGEARDMGMDLCRQDQTCDYYFSVDA 389

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L NPD L  L+  N+ +IAP++ R  K WSNFWGAL+ +G+YARS DY++I+   + 
Sbjct: 390 DVALTNPDTLYILIQENKKVIAPMVSRSGKLWSNFWGALSPEGYYARSEDYVDIVQAKRA 449

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
           G  +WNVPYI + YL+K   ++A  + K I+TL  MD DM+ C ++R+K + L I +  E
Sbjct: 450 G--VWNVPYIAHVYLIKGETLRAELSNKNIFTLPQMDPDMSVCKSIRDKNVFLHISNRDE 507

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +G L+ +  ++  + + +++++  NP+DW  +YIH  Y K +  +    QPCPDV+WFP+
Sbjct: 508 FGRLLSTSKYNTSRLHNDLWQIFENPVDWKEKYIHENYSK-IFEEDYYQQPCPDVYWFPV 566

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            +E  C EFV+ ME +GQWS G N D+RL  GYE VPT DIHM Q+G    W +FL++Y+
Sbjct: 567 FSEVMCDEFVEEMENFGQWSGGKNQDQRLAGGYENVPTVDIHMTQIGYQEEWLKFLQEYI 626

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ F GY+ +  +A ++F+VRYRPDEQPSLRPHHDSST+T+NIALN  G+DYEGGG
Sbjct: 627 APVTEKLFPGYYTK-AKALLNFIVRYRPDEQPSLRPHHDSSTFTVNIALNNKGIDYEGGG 685

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC V + R GW  MHPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 686 CRFLRYNCRVESPRKGWSFMHPGRLTHYHEGLPTTRGTRYIMVSFVDP 733


>gi|207080302|ref|NP_001128871.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Pongo
           abelii]
 gi|62900717|sp|Q5R6K5.1|PLOD3_PONAB RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3;
           AltName: Full=Lysyl hydroxylase 3; Short=LH3; Flags:
           Precursor
 gi|55731802|emb|CAH92605.1| hypothetical protein [Pongo abelii]
          Length = 738

 Score =  685 bits (1768), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/711 (45%), Positives = 475/711 (66%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 34  VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+ G   ++L++F    + ++F AE  CWP+  L ++
Sbjct: 94  KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+ K 
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPKG 272

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 273 WTPEGGCGFCNQDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ +H P   D     +  F  VK +     ++  EAR++A++        +FYF 
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 391 LDADTVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 450

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D +  +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 509 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 569 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 628

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738


>gi|395533681|ref|XP_003768883.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
           [Sarcophilus harrisii]
          Length = 845

 Score =  685 bits (1768), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/707 (45%), Positives = 471/707 (66%), Gaps = 9/707 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
           +K LVIT A+ ET+GY RF+QSA+     V+TLGL + W GGD++ ++GGG KV  LK E
Sbjct: 144 NKLLVITAATEETEGYLRFLQSAKFFNYSVQTLGLGEEWRGGDVARTVGGGQKVRWLKKE 203

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           +++     DM+I+  DSYDV++ G   ++L +F    + ++F AE  CWP+  L ++YP 
Sbjct: 204 MEKYAERKDMVIMFVDSYDVLLAGSPKELLWKFLQSGSRLLFSAESFCWPEWGLAERYPT 263

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           VG+G R+LNSGGFIG+A  I  ++     K+++DDQL+Y  L+LD  LR K  + LD  +
Sbjct: 264 VGNGKRFLNSGGFIGFAPTIHHIVRQWKYKDDDDDQLFYTRLYLDSKLREKLGLALDHKS 323

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT 270
            +FQNL G+++++ L FD ++ V + N  Y+T PV+IHGNG +K++LN  GNY+   W  
Sbjct: 324 RIFQNLNGAIDEVVLKFDRNQ-VRIRNVAYDTLPVVIHGNGPTKLQLNYLGNYIPNGWSP 382

Query: 271 -SGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
             GC  C+  + +D  +   FP V +SVF+++PT FL  FL ++  ++YP +KI++F++N
Sbjct: 383 EGGCGFCDRDR-IDLQEQQVFPKVFLSVFVEQPTPFLPRFLQRLLLIDYPPEKITLFLHN 441

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFYVDSD 388
           N+ +H P         +  F  VK +     +   +AR++A++N       +FYF +D+D
Sbjct: 442 NEVHHEPHIAAAWPQLQDHFSAVKLVGPEEALTPAQARDMAMDNCRQDSECEFYFSLDAD 501

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
           + + NP  L+ L+  N  +IAP+L R  K WSNFWGAL+ + +YARS DY+ ++   +  
Sbjct: 502 AVITNPQTLRNLIEENRKVIAPMLSRHGKLWSNFWGALSPEEYYARSEDYVELVQRKR-- 559

Query: 449 KGIWNVPYITNCYLMKTSVI-KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            G+WNVPY++  YL+K   + K    + +++ +  D DMAFC  +R+KGI L + + +E+
Sbjct: 560 VGVWNVPYVSQAYLIKGETLRKELPQREMFSQSESDPDMAFCKTVRDKGIFLHLSNQEEF 619

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
           G L+ +  +      P+++++  NPLDW  +YIH  Y  +L  D +  QPCPDV+WFP++
Sbjct: 620 GRLLSTARYRTDHLYPDLWQIFDNPLDWQEQYIHENYSWALDGDGIVEQPCPDVYWFPLL 679

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           +E+ C E V+ ME +GQWS G + D RL  GYE VPT DIHMKQ+G    W +FLR YV 
Sbjct: 680 SEQMCDELVEEMENFGQWSGGKHEDSRLAGGYENVPTVDIHMKQLGYEDEWLQFLRTYVG 739

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+NIALN  G+DYEGGGC
Sbjct: 740 PMTENLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNIALNNKGLDYEGGGC 798

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RF+RY+C V++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 799 RFLRYDCIVSSPRKGWGLLHPGRLTHYHEGLPTTKGTRYIMVSFVDP 845


>gi|193786792|dbj|BAG52115.1| unnamed protein product [Homo sapiens]
          Length = 738

 Score =  685 bits (1767), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/711 (45%), Positives = 474/711 (66%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 34  VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+ G   ++L++F    + ++F AE  CWP+  L ++
Sbjct: 94  KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+YA L+LD  LR K  + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYARLYLDPGLREKLSLNLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T P+++HGNG +K++LN  GNY+   
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNG 272

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 273 WTPEGGCGFCNQDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ +H P   D     +  F  VK +     ++  EAR++A++        +FYF 
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 391 LDADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 450

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D +  +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 509 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 569 FPLLSEQMCDELVAEMERYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYGDQWLQLLR 628

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY C +++ R GW L+HPGRLTHYHEGL  T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYGCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738


>gi|292627353|ref|XP_002666609.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Danio
           rerio]
 gi|126631809|gb|AAI33840.1| Plod3 protein [Danio rerio]
          Length = 730

 Score =  685 bits (1767), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 330/732 (45%), Positives = 483/732 (65%), Gaps = 14/732 (1%)

Query: 10  LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
           +IL+ ++  I +   + +  +E   LVIT A+  TDGY RF+++       ++ LGL + 
Sbjct: 6   VILTVILAVIQLSRTEPRKPNE--LLVITAATEVTDGYLRFMRTIRQFNYTIQVLGLGEQ 63

Query: 70  WLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
           W GGD++ ++GGG KV  LK EL++     + +I+  DSYDVI+  G  ++L +F+ F  
Sbjct: 64  WRGGDVARTVGGGQKVRWLKTELEKHKDKQNTVIMFVDSYDVILASGPVELLRKFSRFSH 123

Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
            +VF AE  CWPD  L  KYPAV  G RYLNSGGFIG+A +I  ++     K+++DDQL+
Sbjct: 124 RVVFSAEGFCWPDQRLASKYPAVHHGKRYLNSGGFIGFAPEIHAIVQQWKYKDDDDDQLF 183

Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIH 248
           Y  ++LD+  R K  + LD  + +FQNL G++E++ L F+    V + N  Y+T PV+IH
Sbjct: 184 YTRIYLDKEKRRKFNMTLDHRSQIFQNLNGAIEEVVLKFEKSR-VRVRNVAYDTLPVVIH 242

Query: 249 GNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFL 306
           GNG +K++LN  GNY+  +W   +GC  C   +  L  L  ++ P V ++VFI++P  FL
Sbjct: 243 GNGPTKLQLNYLGNYVPTAWTYENGCGICEEDLLDLSHLSDEEMPLVHVAVFIEQPMPFL 302

Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
           EEFL ++A LNYP  +I +F++NN  YH    + +    +++F   + +     +   +A
Sbjct: 303 EEFLERLATLNYPHTRIRLFLHNNVVYHEQHVERFWTRHRSLFTGARIVGPEENLKHDQA 362

Query: 367 RNLAVENSLHKGV--DFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           R +AVE +  K V  D++F +D+D  L NPDVL+ L+  N+S+IAP+L R  K WSNFWG
Sbjct: 363 RTMAVE-ACKKDVSCDYFFSLDADVALTNPDVLRILIEENKSVIAPMLSRHGKLWSNFWG 421

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMD 483
           AL+ +GFY+R+ DY++I+   +   G+WNVPYIT  YL++   +++     ++Y    MD
Sbjct: 422 ALSPEGFYSRAEDYIDIVQSKR--VGLWNVPYITQVYLIRGETLRSRLAAVSLYQQEGMD 479

Query: 484 YDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
            DM+FC ++R +GI + + +  E+G LV S N++  + +P+++++  NP+DW  +YIH  
Sbjct: 480 PDMSFCKSVREQGIFMFVSNRDEFGRLVSSANYNISRLHPDMWQIFDNPVDWREKYIHEN 539

Query: 544 YQKSLLPD-TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAV 602
           Y +    D +V  QPCPDV+WFP  +E+ C + V+ ME +GQWS G + D+RL  GYE V
Sbjct: 540 YSRIFEDDESVVEQPCPDVYWFPAFSERMCDDLVETMEEFGQWSGGGHKDERLSGGYENV 599

Query: 603 PTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRP 662
           PT DIHM Q+     W +FL++Y+VP+ E+ + GY+ +  +A M+FVVRYRPDEQPSLRP
Sbjct: 600 PTVDIHMNQIQFEKEWLKFLKEYIVPVTEKLYPGYYPK-AQAVMNFVVRYRPDEQPSLRP 658

Query: 663 HHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
           HHDSST+TINIALN  GVDYEGGGCRF+RYNC V + R GW  MHPGRLTHYHEGL  T+
Sbjct: 659 HHDSSTFTINIALNSKGVDYEGGGCRFLRYNCKVESPRKGWSFMHPGRLTHYHEGLPTTR 718

Query: 723 GTRYIMISFVDP 734
           GTRYIM+SFVDP
Sbjct: 719 GTRYIMVSFVDP 730


>gi|113195568|ref|NP_001037808.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Danio
           rerio]
 gi|67973229|gb|AAY84150.1| procollagen lysine 2-oxoglutarate 5-dioxygenase 3 [Danio rerio]
 gi|190337538|gb|AAI63451.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Danio rerio]
          Length = 730

 Score =  684 bits (1765), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/710 (46%), Positives = 473/710 (66%), Gaps = 12/710 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
           ++ LVIT A+  TDGY RF+++       ++ LGL + W GGD++ ++GGG KV  LK E
Sbjct: 26  NELLVITAATEVTDGYLRFMRTIRQFNYTIQVLGLGEQWRGGDVARTVGGGQKVRWLKTE 85

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++     + +I+  DSYDVI+  G  ++L +F+ F   +VF AE  CWPD  L  KYPA
Sbjct: 86  LEKHKDKQNTVIMFVDSYDVILASGPVELLRKFSRFSHRVVFSAEGFCWPDQRLASKYPA 145

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G RYLNSGGFIG+A +I  ++     K+++DDQL+Y  ++LD+  R K  + LD  +
Sbjct: 146 VHHGKRYLNSGGFIGFAPEIHAIVQQWKYKDDDDDQLFYTRIYLDKEKRRKFNMTLDHRS 205

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G++E++ L F+    V + N  Y+T PV+IHGNG +K++LN  GNY+  +W  
Sbjct: 206 QIFQNLNGAIEEVVLKFEKSR-VRVRNVAYDTLPVVIHGNGPTKLQLNYLGNYVPTAWTY 264

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GC  C   +  L  L  ++ P V ++VFI++P  FLEEFL ++A LNYP  +I +F++
Sbjct: 265 ENGCGICEEDLLDLSHLSDEEMPLVHVAVFIEQPMPFLEEFLERLATLNYPHTRIRLFLH 324

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGV--DFYFYVD 386
           NN  YH    + +    +++F   + +     +   +AR +AVE +  K V  D++F +D
Sbjct: 325 NNVVYHEQHVERFWTRHRSLFTGARIVGPEENLKHDQARTMAVE-ACKKDVSCDYFFSLD 383

Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           +D  L NPDVL+ L+  N+S+IAP+L R  K WSNFWGAL+ +GFY+R+ DY++I+   +
Sbjct: 384 ADVALTNPDVLRILIEENKSVIAPMLSRHGKLWSNFWGALSPEGFYSRAEDYIDIVQSKR 443

Query: 447 GGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
              G+WNVPYIT  YL++   +++     ++Y    MD DM+FC ++R +GI + + +  
Sbjct: 444 --VGLWNVPYITQVYLIRGETLRSRLAAVSLYQQEGMDPDMSFCKSVREQGIFMFVSNRD 501

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPD-TVNNQPCPDVFWF 564
           E+G LV S N++  + +P+++++  NP+DW  +YIH  Y +    D +V  QPCPDV+WF
Sbjct: 502 EFGRLVSSANYNISRLHPDMWQIFDNPVDWREKYIHENYSRIFEDDESVVEQPCPDVYWF 561

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           P  +E+ C + V+ ME +GQWS G + D+RL  GYE VPT DIHM Q+     W +FL++
Sbjct: 562 PAFSERMCDDLVETMEEFGQWSGGGHKDERLSGGYENVPTVDIHMNQIQFEKEWLKFLKE 621

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           Y+VP+ E+ + GY+ +  +A M+FVVRYRPDEQPSLRPHHDSST+TINIALN  GVDYEG
Sbjct: 622 YIVPVTEKLYPGYYPK-AQAVMNFVVRYRPDEQPSLRPHHDSSTFTINIALNSKGVDYEG 680

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGCRF+RYNC V + R GW  MHPGRLTHYHEGL  TQGTRYIM+SFVDP
Sbjct: 681 GGCRFLRYNCKVESPRKGWSFMHPGRLTHYHEGLPTTQGTRYIMVSFVDP 730


>gi|148225280|ref|NP_001088601.1| uncharacterized protein LOC495489 precursor [Xenopus laevis]
 gi|54648179|gb|AAH85074.1| LOC495489 protein [Xenopus laevis]
          Length = 729

 Score =  684 bits (1765), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/708 (46%), Positives = 472/708 (66%), Gaps = 10/708 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
           DK LV+TVA+  T+GY RF+++A      V+TLGL   W GGD++ ++GGG KV  LK+E
Sbjct: 27  DKLLVVTVATEATEGYLRFLRTARHYNYTVRTLGLGHEWKGGDVARTVGGGQKVRWLKHE 86

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    D +II+  DSYDV+I G   ++L +F   +  +VF AE  CWP+ SL + YP 
Sbjct: 87  LEQHKDQDQLIIMFVDSYDVVIAGTPTELLWKFQQLEHKVVFSAEGFCWPEWSLAESYPP 146

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V +G R+LNSGGFIG+A  I  ++   + K+ +DDQL+Y  ++LDE+LR +  I LD  +
Sbjct: 147 VSNGKRFLNSGGFIGFAPQIYGMVQLWNYKDNDDDQLFYTKIYLDESLRERFNIALDHKS 206

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
           N+FQNL G+++++ L F+ ++ V   N  Y+T PV+IHGNG +K++LN  GNY+  SW  
Sbjct: 207 NIFQNLNGAIDEVVLKFERNK-VRARNVAYDTIPVVIHGNGPTKLQLNYLGNYVPNSWTH 265

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
             GC  C+  +  L  L+ D  P VL+ VFI++PT F+ +FL ++  L+YP  ++S++++
Sbjct: 266 EGGCEVCDDDLFDLSMLEDDALPHVLLGVFIEQPTPFMSQFLERLVQLDYPQNRLSLYIH 325

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++ YH      +    K  F  +K +     ++  EAR++ ++     +  D+YF VDS
Sbjct: 326 NSEPYHERHIQAFYERHKDRFTTIKIVGPEEAMSQGEARDMGMDLCRQDETCDYYFSVDS 385

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           +  L NPD L  L+  N+ +IAP++ R  K WSNFWGAL+ +G+YARS DY +I+   + 
Sbjct: 386 NVALTNPDTLYILIQENKKVIAPMVSRSGKLWSNFWGALSPEGYYARSEDYADIVQAKR- 444

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI N YL+K   ++A  + K I+TL  MD DM+ C ++R+K + L I +  E
Sbjct: 445 -VGVWNVPYIANVYLIKGETLRAELSNKNIFTLPQMDPDMSVCKSIRDKNVFLHISNRDE 503

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +G L+ +  ++  + + +++++  NPLDW  +YIH  Y K +  +    QPCPDV+WFP+
Sbjct: 504 FGRLLSTSKYNTSRLHNDLWQIFENPLDWKEKYIHENYSK-IFEEDYYEQPCPDVYWFPV 562

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
             E  C EFV+ ME +GQWS G N D+RL  GYE VPT DIHM QVG    W +FL++Y+
Sbjct: 563 FKEIMCDEFVEEMENFGQWSGGKNQDQRLAGGYENVPTVDIHMTQVGYQEEWLKFLQEYI 622

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ F GY+ +  +A ++F+VRYRPDEQPSLRPHHDSST+TINIALN  G+DYEGGG
Sbjct: 623 GPVTEKLFPGYYTK-AKALLNFIVRYRPDEQPSLRPHHDSSTFTINIALNNKGIDYEGGG 681

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           C F+RYNC V + R GW  MHPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 682 CHFLRYNCRVESPRKGWSFMHPGRLTHYHEGLPTTRGTRYIMVSFVDP 729


>gi|301791363|ref|XP_002930648.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           [Ailuropoda melanoleuca]
 gi|281349526|gb|EFB25110.1| hypothetical protein PANDA_021154 [Ailuropoda melanoleuca]
          Length = 740

 Score =  684 bits (1765), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/711 (45%), Positives = 477/711 (67%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ +T+GY+RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 36  VNPEKLLVITVATAKTEGYRRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 95

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DM+I+  DSYDVI+ G  +++L++F    ++++F AE  CWP+  L ++
Sbjct: 96  KKEMEKYADREDMVIMFVDSYDVILAGSPSELLKKFVQSGSHLLFSAEGFCWPEWGLAEQ 155

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  + +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 156 YPEVGTGKRFLNSGGFIGFAPTVHQVVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLGLD 215

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 216 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 274

Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  C   +  L   +P   P VL++VF+++ T FL  FL ++  L+YP  ++++
Sbjct: 275 WTPQGGCGFCGRDRRTLPGGQPP--PRVLLAVFVEQATPFLPRFLQRLLLLDYPPDRVTL 332

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ YH P   D     +  F  VK +     +   EAR++A++        +FYF 
Sbjct: 333 FLHNNEVYHEPHIADSWSQLQDHFSAVKLVGPEEALTPGEARDMAMDTCRQDPECEFYFS 392

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ + N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 393 LDADAVITNQQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 452

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC +LR+KGI L + +
Sbjct: 453 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 510

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 511 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGLVEQPCPDVYW 570

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++++ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 571 FPLLSDQMCDELVEEMELYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 630

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 631 TYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 689

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 690 GGGCRFLRYDCIVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 740


>gi|197097730|ref|NP_001126103.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Pongo
           abelii]
 gi|55730366|emb|CAH91905.1| hypothetical protein [Pongo abelii]
          Length = 738

 Score =  684 bits (1765), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/711 (45%), Positives = 474/711 (66%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY  F++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 34  VNPEKLLVITVATAETEGYLHFLRSAEFFNYTVRTLGLGEQWRGGDVARTVGGGQKVRWL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+ G   ++L++F    + ++F AE  CWP+  L ++
Sbjct: 94  KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+ K 
Sbjct: 214 HKSRIFQNLNGALDEVALKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPKG 272

Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 273 WTPEGGCGFCNQDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ +H P   D     +  F  VK +     ++  EAR++A++        +FYF 
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 391 LDADTVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSVDYVELVQR 450

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D +  +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 509 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 569 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 628

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738


>gi|348517144|ref|XP_003446095.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           [Oreochromis niloticus]
          Length = 734

 Score =  684 bits (1765), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/716 (45%), Positives = 471/716 (65%), Gaps = 13/716 (1%)

Query: 27  KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVN 85
           + +  +  LVITVA+ ETDGY RF+++A      VK LGL + W GGD++ ++GGG KV 
Sbjct: 24  RKLSPENLLVITVATEETDGYLRFMRTAREFNYTVKVLGLGEEWKGGDVARTVGGGQKVR 83

Query: 86  LLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLY 145
            LK E+ +     +++I+  DSYDVI   G  ++L +F+     ++F AE  CWPD  L 
Sbjct: 84  WLKKEVQKHSEKTELVIMFVDSYDVIFASGPEELLSKFSRMGHKVIFSAEGFCWPDQRLA 143

Query: 146 DKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIV 205
            KYP V +G RYLNSGGFIGYA +I  ++     K+ +DDQL+Y  ++LD+T RTK  + 
Sbjct: 144 SKYPEVRTGKRYLNSGGFIGYAPEISAIVQQWKYKDSDDDQLFYTRIYLDKTHRTKFNMT 203

Query: 206 LDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLA 265
           LD  + +FQNL G+++++ L F+  + V   N  Y+T PV+IHGNG +K++LN  GNY+ 
Sbjct: 204 LDHRSRIFQNLNGAVDEVVLKFERAK-VRARNVAYDTLPVVIHGNGPTKLQLNYLGNYVP 262

Query: 266 KSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKI 323
            +W   +GC  C+  +   + +  +Q P V ++VFI+  T F+EEFL +++ LNYP  +I
Sbjct: 263 TAWTYETGCGICDDDVLFFNEVPDEQMPLVYVAVFIEHATPFMEEFLERLSTLNYPKTRI 322

Query: 324 SMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE-NSLHKGVDFY 382
            +F++NN  YH      +    +++F + + +     +   EAR +AVE        D+Y
Sbjct: 323 RLFIHNNVVYHERHIQKFWERHRSLFPDARVVGPEENLKEDEARTMAVEVCKKDPECDYY 382

Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
           F +DSD  L NPD+L+ L+  N+S+IAP+L +  K WSNFWGAL+ +G+Y+RS DY+ I+
Sbjct: 383 FSIDSDVALTNPDILRILIEENKSVIAPMLSKHGKLWSNFWGALSPEGYYSRSEDYIEIV 442

Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVI--KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLK 500
            G +   G+WNVPYIT  YL+K S++  K + +        MD DM FC ++R++G+ + 
Sbjct: 443 QGKR--VGLWNVPYITQAYLLKGSMLRTKLSQVSLYMDEGGMDADMVFCRSIRDQGVFMY 500

Query: 501 IDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN--NQPC 558
           + +  E+G LV S NF+  + +P+++++  NP+DW  +Y+H  Y K +  D      QPC
Sbjct: 501 VSNRDEFGRLVASSNFNTSRLHPDMWQIFDNPVDWKEKYVHENYSK-IFEDEKKYVEQPC 559

Query: 559 PDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW 618
           PDV+WFP  ++K C   V+ ME +G+WS GT+ D+RL  GYE VPT DIHM Q+G    W
Sbjct: 560 PDVYWFPAFSDKMCDHMVETMEDHGEWSGGTHKDERLAGGYENVPTVDIHMNQIGFEKEW 619

Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
            +FL+ Y+ P+ E+ + GY+    +A M+FVVRYRPDEQP LRPHHDSST+TINIALN+ 
Sbjct: 620 LKFLKDYISPVTEKLYPGYYPR-AQAIMNFVVRYRPDEQPLLRPHHDSSTFTINIALNRK 678

Query: 679 GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
            +DYEGGGCRF+RY+C V + R GW  MHPGRLTHYHEGL+VT+GTRYIM+SFVDP
Sbjct: 679 DIDYEGGGCRFLRYDCKVESPRKGWSFMHPGRLTHYHEGLRVTKGTRYIMVSFVDP 734


>gi|4505891|ref|NP_001075.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Homo
           sapiens]
 gi|6093731|sp|O60568.1|PLOD3_HUMAN RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3;
           AltName: Full=Lysyl hydroxylase 3; Short=LH3; Flags:
           Precursor
 gi|5630086|gb|AAD45831.1|AC004876_4 lysyl hydroxylase 3 [Homo sapiens]
 gi|3153235|gb|AAC39753.1| lysyl hydroxylase isoform 3 [Homo sapiens]
 gi|3551836|gb|AAC34808.1| lysyl hydroxylase 3 [Homo sapiens]
 gi|7546824|gb|AAF63701.1| lysyl hydroxylase 3 [Homo sapiens]
 gi|15079714|gb|AAH11674.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Homo sapiens]
 gi|28975434|gb|AAO61775.1| lysyl hydroxylase 3 [Homo sapiens]
 gi|119570590|gb|EAW50205.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Homo sapiens]
 gi|189053447|dbj|BAG35613.1| unnamed protein product [Homo sapiens]
          Length = 738

 Score =  682 bits (1760), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/711 (45%), Positives = 474/711 (66%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 34  VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+ G   ++L++F    + ++F AE  CWP+  L ++
Sbjct: 94  KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T P+++HGNG +K++LN  GNY+   
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNG 272

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 273 WTPEGGCGFCNQDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ +H P   D     +  F  VK +     ++  EAR++A++        +FYF 
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 391 LDADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 450

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D +  +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 509 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 569 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 628

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738


>gi|426254759|ref|XP_004021044.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Ovis
           aries]
          Length = 752

 Score =  682 bits (1760), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/712 (45%), Positives = 474/712 (66%), Gaps = 12/712 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY+RF+QSAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 47  VNPEKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 106

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DM+I+  DSYDV++ GG +++L++F    + ++F AE  CWP+  L ++
Sbjct: 107 KKEMEKYAEREDMVIMFVDSYDVVLAGGPSELLKKFIQSGSRLLFSAESFCWPEWGLAEQ 166

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 167 YPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLGLSLD 226

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L F  +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 227 HKSRIFQNLNGALDEVVLKFGRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 285

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P VL++VF+++PT FL  FL ++  L+Y      +
Sbjct: 286 WTPEGGCGFCNQDRRPLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYRGGGKGL 343

Query: 326 FVYNNQE-YHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYF 383
           F  + QE YH P  DD     +  F  VK +     +   EAR++A++        +FYF
Sbjct: 344 FSPHLQEVYHEPHIDDSWPQLQDHFSAVKLVGPEEALTPGEARDMAMDICRQDPKCEFYF 403

Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
            +D+D+ + NP  L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++ 
Sbjct: 404 SLDADTVITNPQTLRILIEANRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQ 463

Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKID 502
             +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC +LR+KGI L + 
Sbjct: 464 RKR--VGVWNVPYISQAYVIRGETLRMELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLS 521

Query: 503 STQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
           +  E+G L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L  + +  QPCPDV+
Sbjct: 522 NQHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGLVEQPCPDVY 581

Query: 563 WFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFL 622
           WFP+++E+ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + L
Sbjct: 582 WFPLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLL 641

Query: 623 RKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDY 682
           R YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DY
Sbjct: 642 RTYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDY 700

Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           EGGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 701 EGGGCRFLRYDCVISSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 752


>gi|426357331|ref|XP_004045997.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
           [Gorilla gorilla gorilla]
          Length = 738

 Score =  682 bits (1760), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/711 (45%), Positives = 474/711 (66%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 34  VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+ G   ++L++F    + ++F AE  CWP+  L ++
Sbjct: 94  KKEMEKYTDREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T P++IHGNG +K++LN  GNY+   
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPIVIHGNGPTKLQLNYLGNYVPNG 272

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 273 WTPEGGCGFCNQDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ +H P   D     +  F  VK +     ++  EAR++A++        +FYF 
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 391 LDADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 450

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D +  +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 509 QYEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 569 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 628

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738


>gi|114615112|ref|XP_001153684.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
           isoform 1 [Pan troglodytes]
 gi|397471318|ref|XP_003807243.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Pan
           paniscus]
 gi|410222176|gb|JAA08307.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Pan
           troglodytes]
 gi|410306108|gb|JAA31654.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Pan
           troglodytes]
 gi|410349965|gb|JAA41586.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Pan
           troglodytes]
          Length = 738

 Score =  682 bits (1759), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/711 (45%), Positives = 474/711 (66%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 34  VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVQTLGLGEEWRGGDVARTVGGGQKVRWL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+ G   ++L++F    + ++F AE  CWP+  L ++
Sbjct: 94  KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T P+++HGNG +K++LN  GNY+   
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNG 272

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 273 WTPEGGCGFCNQDRRALPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ +H P   D     +  F  VK +     ++  EAR++A++        +FYF 
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 391 LDADTVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 450

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D +  +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 509 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 569 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 628

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738


>gi|355712274|gb|AES04295.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Mustela
           putorius furo]
          Length = 749

 Score =  681 bits (1758), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/718 (45%), Positives = 476/718 (66%), Gaps = 18/718 (2%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY+RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 38  VNPEKLLVITVATAETEGYRRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 97

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  +++    +DM+I+  DSYDVI+ G  +++L++F    + ++F AE  CWP+  L ++
Sbjct: 98  KKAMEKYADREDMVIMFVDSYDVILAGSPSELLKKFVQTGSRLLFSAEGFCWPEWGLAEQ 157

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 158 YPEVGTGKRFLNSGGFIGFAPTIHQVVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 217

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 218 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 276

Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  C   +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  +I++
Sbjct: 277 WTPQGGCGFCGRDRRTLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRITL 334

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ YH P   D     +  F  VK +     +   EAR++A++        +FYF 
Sbjct: 335 FLHNNEVYHEPHIADSWPQLQDHFSAVKLVGPEEALTPGEARDIAMDTCRQDPECEFYFS 394

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 395 LDADAVLTNQQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 454

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC +LR+KGI L + +
Sbjct: 455 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 512

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 513 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGLVEQPCPDVYW 572

Query: 564 FPI-------VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAG 616
           FP+       ++E+ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG   
Sbjct: 573 FPLDVYWFPLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYED 632

Query: 617 VWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALN 676
            W + LR YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN
Sbjct: 633 EWLQLLRTYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALN 691

Query: 677 QVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
             G+DYEGGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 692 HKGLDYEGGGCRFLRYDCIVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 749


>gi|384950168|gb|AFI38689.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Macaca
           mulatta]
          Length = 737

 Score =  681 bits (1756), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/711 (45%), Positives = 475/711 (66%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LV+TVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 33  VNPEKLLVMTVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 92

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +D II+  DSYDV++ G  +++L++F    + ++F AE  CWP+  L ++
Sbjct: 93  KKEMEKYADREDTIIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 152

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 153 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 212

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 213 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 271

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++  L+YP+ ++++
Sbjct: 272 WTPEGGCGFCNRDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPSDRVTL 329

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ +H P   D     +  F  VK +     ++  EAR++A++        +FYF 
Sbjct: 330 FLHNNEVFHEPHIADSWPQLQDHFAVVKLVGPEEALSPGEARDMAMDMCRQDPECEFYFS 389

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 390 LDADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 449

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI L + +
Sbjct: 450 KR--VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 507

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D +  +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 508 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 567

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 568 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 627

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 628 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 686

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C +++ R GW L HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 687 GGGCRFLRYDCVISSPRKGWALRHPGRLTHYHEGLPTTRGTRYIMVSFVDP 737


>gi|380817708|gb|AFE80728.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Macaca
           mulatta]
          Length = 737

 Score =  681 bits (1756), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/711 (45%), Positives = 475/711 (66%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LV+TVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 33  VNPEKLLVMTVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 92

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +D II+  DSYDV++ G  +++L++F    + ++F AE  CWP+  L ++
Sbjct: 93  KKEMEKYADREDTIIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 152

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 153 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 212

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 213 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 271

Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 272 WTPEGGCGFCNRDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 329

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ +H P   D     +  F  VK +     ++  EAR++A++        +FYF 
Sbjct: 330 FLHNNEVFHEPHIADSWPQLQDHFAVVKLVGPEEALSPGEARDMAMDMCRQDPECEFYFS 389

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 390 LDADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 449

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI L + +
Sbjct: 450 KR--VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 507

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D +  +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 508 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 567

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 568 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 627

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 628 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 686

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 687 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 737


>gi|151301193|ref|NP_001093075.1| lysyl hydroxylase 3 precursor [Takifugu rubripes]
 gi|146325990|dbj|BAF61137.1| lysyl hydroxylase 3 [Takifugu rubripes]
          Length = 731

 Score =  679 bits (1751), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/714 (46%), Positives = 466/714 (65%), Gaps = 12/714 (1%)

Query: 27  KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVN 85
           +++  +  LVIT A+ ETDG+ RF+++A      VK LGL + W GGD++ ++GGG KV 
Sbjct: 24  QSLSPENLLVITAATEETDGFNRFMRTAREFNYTVKVLGLGEEWRGGDVARTVGGGQKVR 83

Query: 86  LLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLY 145
            LK EL +    ++M+I+  DSYDVI+  G  + L +F+     +VF AE  CWPD  L 
Sbjct: 84  WLKKELSKHSDKENMVIMFVDSYDVILAAGPEEPLYKFSRLGHKVVFSAEGFCWPDQRLA 143

Query: 146 DKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIV 205
            KYP V SG RYLNSGGFIG A ++  ++     K+ +DDQL+Y  ++LD+  RTK  + 
Sbjct: 144 SKYPEVHSGKRYLNSGGFIGLASELSAIVQQWKYKDNDDDQLFYTRIYLDKVQRTKFNMT 203

Query: 206 LDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLA 265
           LD  + +FQNL G+++++ L F+  + V   N  Y+T PV+IHGNG +K++LN  GNY+ 
Sbjct: 204 LDHRSRIFQNLNGAVDEVVLKFERSK-VRARNVAYDTLPVVIHGNGPTKLQLNYLGNYVP 262

Query: 266 KSWK-TSGCTRCNLIKHLDSLKPDQ-FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKI 323
            +W    GC  C+    L  L  D+  P V + VFI+K T FLEEFL ++  ++YP  ++
Sbjct: 263 TAWTFAGGCGICD--DELRLLNEDEEMPLVHVGVFIEKATPFLEEFLERLTAMSYPTARL 320

Query: 324 SMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFY 382
            +F++NN  YH      +    + +F + + +     +   +ARN+A E        +FY
Sbjct: 321 RLFIHNNVFYHERHIHRFWERHRALFLDAQLVGPEENLPESKARNMAAEACKKDPRCEFY 380

Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
           F +DSD  L NPD L+ L+  N+S+IAP+L +  K WSNFWGAL+ +G+Y+RS DY+ I+
Sbjct: 381 FSIDSDVALTNPDTLRILIEENKSVIAPMLSQHGKLWSNFWGALSPEGYYSRSEDYIEIV 440

Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-TIYTLNSMDYDMAFCTNLRNKGIHLKI 501
            G +   G+WNVPYIT  YL+K SV+++   + +++    MD DM FC N+R++GI L +
Sbjct: 441 QGKR--IGLWNVPYITQVYLIKGSVLRSKLSQLSLFVDEEMDSDMVFCRNIRDQGIFLFV 498

Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLP-DTVNNQPCPD 560
            +  E+G LV S NF+  + +P+++++  NPLDW  +YIH  Y K     ++   QPCPD
Sbjct: 499 SNRDEFGRLVTSTNFNTSRLHPDMWQIFDNPLDWKEKYIHENYSKVFEEQESFVEQPCPD 558

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
           V+WFP  +EK C   V+ ME  GQWS G + D+RL  GYE VPT DIHM Q+G    W +
Sbjct: 559 VYWFPAFSEKMCDHLVETMEDNGQWSSGGHRDERLSGGYENVPTVDIHMNQIGFEKEWLK 618

Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGV 680
           FL++Y+ P+ ER + GY+ +  +A M+FVVRY PDEQP LRPHHDSST+TINIALN+  +
Sbjct: 619 FLKEYIAPVTERLYPGYYPK-AQAIMNFVVRYHPDEQPFLRPHHDSSTFTINIALNRKNI 677

Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           DYEGGGCRF+RYNCNV + R GW  MHPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 678 DYEGGGCRFLRYNCNVESPRKGWSFMHPGRLTHYHEGLPTTKGTRYIMVSFVDP 731


>gi|410267362|gb|JAA21647.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Pan
           troglodytes]
          Length = 738

 Score =  679 bits (1751), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/711 (45%), Positives = 473/711 (66%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG K   L
Sbjct: 34  VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVQTLGLGEEWRGGDVARTVGGGQKGRGL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+ G   ++L++F    + ++F AE  CWP+  L ++
Sbjct: 94  KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T P+++HGNG +K++LN  GNY+   
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNG 272

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 273 WTPEGGCGFCNQDRRALPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 330

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ +H P   D     +  F  VK +     ++  EAR++A++        +FYF 
Sbjct: 331 FLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFS 390

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 391 LDADTVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 450

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI L + +
Sbjct: 451 KR--VGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN 508

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D +  +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 509 QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYW 568

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 569 FPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 628

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYE
Sbjct: 629 TYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYE 687

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T GTRYIM+SFVDP
Sbjct: 688 GGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 738


>gi|355747555|gb|EHH52052.1| hypothetical protein EGM_12420 [Macaca fascicularis]
          Length = 738

 Score =  678 bits (1750), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/708 (45%), Positives = 472/708 (66%), Gaps = 11/708 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
           +K LV+TVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  LK E
Sbjct: 37  EKLLVMTVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWLKKE 96

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           +++    +D II+  DSYDV++ G  +++L++F    + ++F AE  CWP+  L ++YP 
Sbjct: 97  MEKYADREDTIIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQYPE 156

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD  +
Sbjct: 157 VGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLDHKS 216

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT 270
            +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   W  
Sbjct: 217 RIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNGWTP 275

Query: 271 -SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
             GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++  L+YP  ++++F++
Sbjct: 276 EGGCGFCNRDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTLFLH 333

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           NN+ +H P   D     +  F  VK +     ++  EAR++A++        +FYF +D+
Sbjct: 334 NNEVFHEPHIADSWPQLQDHFAVVKLVGPEEALSPGEARDMAMDMCRQDPECEFYFSLDA 393

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++   + 
Sbjct: 394 DTVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQRKR- 452

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI L + +  E
Sbjct: 453 -VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSNQHE 511

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +G L+ +  +D +  +P+++++  NP+DW  +YIH  Y ++L    +  QPCPDV+WFP+
Sbjct: 512 FGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGAGIVEQPCPDVYWFPL 571

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
           ++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR YV
Sbjct: 572 LSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLRTYV 631

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYEGGG
Sbjct: 632 GPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYEGGG 690

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RY+C +++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 691 CRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 738


>gi|334323439|ref|XP_001371229.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
           [Monodelphis domestica]
          Length = 785

 Score =  675 bits (1742), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/703 (45%), Positives = 470/703 (66%), Gaps = 9/703 (1%)

Query: 36  VITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDEM 94
           VIT A+ ET+GY RF+Q+A+     V+TLGL + W GGD++ ++GGG KV  LK E+++ 
Sbjct: 88  VITAATEETEGYLRFLQTAKFFNYTVQTLGLGEEWRGGDVARTVGGGQKVRWLKKEMEKY 147

Query: 95  DITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSG 154
              +DM+I+  DSYDV++ G   ++L +F    + ++F AE  CWP+  L ++YP+VG+G
Sbjct: 148 AERNDMVIMFVDSYDVLLAGSPKELLWKFLQSGSRLLFSAESFCWPEWGLAERYPSVGNG 207

Query: 155 YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQ 214
            R+LNSGGFIG+A  I  ++     K+++DDQL+Y  L+LD  LR K  + LD  + +FQ
Sbjct: 208 KRFLNSGGFIGFAPTIHHIVRQWKYKDDDDDQLFYTRLYLDSKLREKLGLALDHKSRVFQ 267

Query: 215 NLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT-SGC 273
           NL G+L+++ L FD ++ V + N  Y+T PV+IHGNG +K++LN  GNY+   W    GC
Sbjct: 268 NLNGALDEVVLKFDRNQ-VRIRNVAYDTLPVVIHGNGPTKLQLNYLGNYIPNGWTPEGGC 326

Query: 274 TRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEY 333
             C+  + +D  +   FP VL+SVF+++PT FL  FL ++  ++YP ++IS+F++NN+ +
Sbjct: 327 GFCDRDR-IDLQEGQPFPRVLLSVFVEQPTPFLPRFLQRLLLIDYPPEQISLFLHNNEVH 385

Query: 334 HAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFYVDSDSHLD 392
           H P         +  F  VK +     +   +AR++A+++       +FYF +D+D+ + 
Sbjct: 386 HEPHIAAAWPQLQDHFFAVKLVGPEEALTPAQARDMAMDSCRQDSECEFYFSLDADAIIT 445

Query: 393 NPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIW 452
           N   L+ L+  N  +IAP+L R  K WSNFWGAL+ + +YARS DY+ ++   +   G+W
Sbjct: 446 NSQTLRNLIEENRKVIAPMLSRHGKLWSNFWGALSPEEYYARSEDYVELVQRKR--VGVW 503

Query: 453 NVPYITNCYLMKTSVI-KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLV 511
           NVPYI+  YL+K   + K    + +++ +  D DMAFC  +R+KGI L + + +E+G L+
Sbjct: 504 NVPYISQAYLIKGETLRKELPQREVFSRSESDPDMAFCKTIRDKGIFLHLSNQEEFGRLL 563

Query: 512 DSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKF 571
            +  +      P+++++  NPLDW  +YIH  Y  +L  + +  QPCPDV+WFP+++E+ 
Sbjct: 564 STARYKTDHLYPDLWQIFDNPLDWQEQYIHENYTWALDGEGMVEQPCPDVYWFPLLSEQM 623

Query: 572 CHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE 631
           C E V+ ME +GQWS G + D RL  GYE VPT DIHMKQ+G    W +FLR YV P+ E
Sbjct: 624 CDELVEEMENFGQWSGGKHEDSRLAGGYENVPTVDIHMKQLGYEDEWLQFLRTYVGPMTE 683

Query: 632 REFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIR 691
             F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+NIALN  G+DYEGGGCRF+R
Sbjct: 684 NLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNIALNSKGLDYEGGGCRFLR 742

Query: 692 YNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           Y+C V++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 743 YDCIVSSPRKGWGLLHPGRLTHYHEGLPTTKGTRYIMVSFVDP 785


>gi|395521882|ref|XP_003765043.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
           [Sarcophilus harrisii]
          Length = 733

 Score =  671 bits (1732), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/724 (45%), Positives = 479/724 (66%), Gaps = 14/724 (1%)

Query: 20  SVHCNKVKNIDED----KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM 75
           SV  +KV N  E       LV+TVA+ ET+G++RF +SA+    +V+ LGL + W GG+ 
Sbjct: 15  SVILSKVLNPPESHLPYNLLVLTVATKETEGFRRFKRSAQFFNYKVQVLGLGEDWQGGEK 74

Query: 76  S-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGA 134
             +LGGG KV LLK  L++    +D +IL TDSYDV+   G  ++L++F    + +VF A
Sbjct: 75  EITLGGGQKVRLLKTALEKYADKEDQVILFTDSYDVVFASGPRELLKKFRQAKSRVVFSA 134

Query: 135 ERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFL 194
           E L +PD  L  KYP V  G R+L SGGFIGYA ++ +++++   ++ + DQL+Y  +FL
Sbjct: 135 EELIYPDRRLEVKYPQVHDGKRFLGSGGFIGYAPNLSKMVASWDGQDSDSDQLFYTKIFL 194

Query: 195 DETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSK 254
           D   R K  I LD    +FQNL G+L+++ L F+  + V   N +Y+T PV+IHGNG +K
Sbjct: 195 DPEQRAKINITLDHRCRIFQNLDGALDEVVLKFETAQ-VRARNLEYDTLPVLIHGNGPTK 253

Query: 255 IELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNK 312
           ++LN  GNY+ + W   +GCT C+  ++ L ++  +  PSVLI +FI++PT FL  F  +
Sbjct: 254 LQLNYLGNYIPRFWTFETGCTVCDEGLRSLKAIGDEALPSVLIGIFIEQPTPFLSLFFKR 313

Query: 313 IANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE 372
           + NL YP K++ +F++N++E+H    + +I +  + +  VK +     V   +ARN+  +
Sbjct: 314 LLNLRYPRKRLRLFIHNHEEHHEDQVEQFIADHGSEYHMVKLVGPEQRVRGADARNMGAD 373

Query: 373 NSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGF 431
                +   +Y  +D++  L NPD L+ L+ +N+++IAPL++R  + WSNFWGAL+ADGF
Sbjct: 374 LCRQDRDCTYYLSMDAEVALTNPDALRLLIEQNKAVIAPLVIRAGRLWSNFWGALSADGF 433

Query: 432 YARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCT 490
           YARS DY++I+ G +   G+WNVPYI+N YL+K S ++     K ++  + +D DMAFC+
Sbjct: 434 YARSEDYVDIVQGRR--VGVWNVPYISNIYLIKGSTLRGDLQQKDLFHSSKLDADMAFCS 491

Query: 491 NLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLP 550
           N+R + I L+I++   +G L+  +N+     + +++E+  NP DW  +YIH  Y ++L  
Sbjct: 492 NVREQVIFLRINNRHSFGRLLSVDNYQTTHLHNDLWEIFNNPEDWKEKYIHENYTEALKG 551

Query: 551 DTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMK 610
             V   PCPDV+WFPI TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM 
Sbjct: 552 KLVET-PCPDVYWFPIFTETACDELVEEMEHFGQWSAGDNKDTRIQGGYENVPTIDIHMN 610

Query: 611 QVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYT 670
           Q+     W +FL +Y+ P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+T
Sbjct: 611 QIKFEREWHKFLVEYIAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFT 669

Query: 671 INIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMIS 730
           INIALN+VGVDYEGGGCRFIRYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +S
Sbjct: 670 INIALNRVGVDYEGGGCRFIRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVS 729

Query: 731 FVDP 734
           FVDP
Sbjct: 730 FVDP 733


>gi|432098106|gb|ELK27993.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Myotis davidii]
          Length = 737

 Score =  671 bits (1730), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 319/707 (45%), Positives = 469/707 (66%), Gaps = 9/707 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           D  LV+TVA+ ET+G++RF +SA+    +++ LGL + W G   +S GGG KV LLK  L
Sbjct: 36  DNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWHGEKATSAGGGLKVRLLKKAL 95

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
           ++    ++++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP V
Sbjct: 96  EKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPVV 155

Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
             G R+L SGGFIGYA  + +L++    ++ + DQL+Y  +FLD+  R +  I LD    
Sbjct: 156 SDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLFYTKIFLDKEKRERINITLDHRCR 215

Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
           +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W   
Sbjct: 216 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFE 274

Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           +GC  C+  ++ L  +  +  PSVL+ VFI++PT FL  F  ++  L+YP K++ +F++N
Sbjct: 275 TGCVVCDEGLRSLKGIGDEALPSVLVGVFIEQPTPFLSLFFKRLLRLHYPQKRMRLFIHN 334

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
           ++++H    + ++      +++VK +     V + +ARN+ V+      G  +YF VD+D
Sbjct: 335 HEQHHKVQVEQFLAEHGGEYQSVKLVGPEVQVANADARNMGVDLCRQDHGCTYYFSVDAD 394

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L  P +L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +  
Sbjct: 395 VALTEPQILRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 452

Query: 449 KGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            G+WNVPYI+N YLMK S ++A   +T ++  + +D DMAFC N+R +G+ + + +   +
Sbjct: 453 VGVWNVPYISNIYLMKGSALRAELQQTDLFHHSKLDPDMAFCANVRQQGVFMFLTNRHTF 512

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
           GHL+  +N+     + +++E+  NP DW  +YIH  Y K L    V   PCPDV+WFPI 
Sbjct: 513 GHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKVLAGKLV-EMPCPDVYWFPIF 571

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+ 
Sbjct: 572 TETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYIA 631

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 632 PVTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGC 690

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 691 RFLRYNCSIRAPRKGWALMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 737


>gi|348533600|ref|XP_003454293.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
           [Oreochromis niloticus]
          Length = 730

 Score =  669 bits (1727), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/737 (44%), Positives = 483/737 (65%), Gaps = 13/737 (1%)

Query: 2   LSNLHLNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQV 61
           LS+L L   I+ C +   ++  ++V+ I EDK LV+TVA+ ETDGY+RF+++A+     V
Sbjct: 3   LSSLLLFSGIVVCAL--SALVNSEVEGIPEDKLLVVTVATKETDGYRRFLRTAKHFNYTV 60

Query: 62  KTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDIL 120
           K LG  Q W GGD MS+ GGG KV LL   L EM   D  IIL  DSYDV+   G  ++L
Sbjct: 61  KVLGRGQKWKGGDYMSAPGGGQKVRLLNEGLKEMK-DDHQIILFIDSYDVVFASGPKELL 119

Query: 121 ERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIK 180
           ++F      +VF +E L WPD  L DKYP V  G R+L SGGFIGY  +IKEL++N +  
Sbjct: 120 KKFQQAKHRVVFSSETLIWPDRHLEDKYPHVREGNRFLGSGGFIGYLPNIKELVANWTGD 179

Query: 181 NEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKY 240
           + + DQL++  ++ D++ R    I LD    LFQNL+G+L+D+ L F+ D  V + N  Y
Sbjct: 180 DGDSDQLFFTKIYTDQSKRKSINITLDNKCRLFQNLHGALDDVVLKFE-DHQVRVRNVLY 238

Query: 241 NTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVF 298
           +T PVIIHGNG +K+++N  GNY+  +W   SGCT C   ++ L +L+ +++P V+I +F
Sbjct: 239 DTLPVIIHGNGPTKLQINYLGNYIPNTWTFESGCTVCREDLRSLSALQENEYPLVVIGIF 298

Query: 299 IDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHN 358
           I +PT F+  F  ++  L YP  K+ +F++N + +H      ++ ++ ++++ V  I   
Sbjct: 299 IQQPTPFVTVFFERLLKLQYPKNKLKLFIFNKEAHHQRQVQSFLKDYGSLYEKVTVIEPE 358

Query: 359 STVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFK 417
             ++   +RNL ++     +  D++F +D +  L N D LK L+ +N  ++AP++ R  +
Sbjct: 359 EEMDGAASRNLGLDLCRRDQDCDYFFSLDIEVVLKNKDTLKILIEQNLPIVAPMITRAGR 418

Query: 418 AWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIY 477
            WSNFWGAL+ DG+YARS DY++I+ G +   G+WNVPY++N YL+K  +++   +K   
Sbjct: 419 LWSNFWGALSGDGYYARSEDYVDIVQGRR--VGVWNVPYVSNVYLVKAGLLQ-RELKDYE 475

Query: 478 TLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDL 537
             +S D DMAFC N+RNKGI + + +   +G ++ +EN+     + +++++  NP+DW+ 
Sbjct: 476 LFSSSDPDMAFCHNIRNKGIFMYVTNMHTFGRILSTENYQTGHLHNDLWQIFENPVDWEE 535

Query: 538 RYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET 597
           RYIH  Y + ++ D +   PCPDV+WFP+ +   C+  ++ ME YG+WS G N D R+  
Sbjct: 536 RYIHENYTR-IMKDKLIENPCPDVYWFPVFSSVACNHMIEEMEHYGKWSGGANVDNRIHG 594

Query: 598 GYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQ 657
           GYE VPT DIHM Q+     W +FL +YVVP+ E+ F GY+ +     ++FVVRY+PDEQ
Sbjct: 595 GYENVPTIDIHMTQINFEKDWQKFLVEYVVPITEKMFPGYYTK-AHFELAFVVRYKPDEQ 653

Query: 658 PSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEG 717
           PSLRPHHD+ST+T+NIALNQVG+DY+GGGCRF+RY+C++ A R GW L+HPGRLTHYHEG
Sbjct: 654 PSLRPHHDASTFTVNIALNQVGLDYQGGGCRFLRYDCSIQAPRKGWALLHPGRLTHYHEG 713

Query: 718 LQVTQGTRYIMISFVDP 734
           L  T G RYI +SFVDP
Sbjct: 714 LPTTAGVRYIAVSFVDP 730


>gi|73950912|ref|XP_544565.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
           isoform 1 [Canis lupus familiaris]
          Length = 727

 Score =  668 bits (1723), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 316/708 (44%), Positives = 469/708 (66%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +S +    +++ LGL + W G   +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATTETEGFRRFKRSGQFFNYKIQALGLGEDWTGEKGTSAGGGLKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F      +VF AE L +PD  L  KYPA
Sbjct: 85  LEKHADKEDLVILFTDSYDVVFASGPRELLKKFRQARGQVVFSAEELIYPDRRLEAKYPA 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA  + +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLFYTQIFLDPEKRERINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GC+ C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++ +L+YP K++ +F++
Sbjct: 264 ETGCSVCDEGLRSLRGIGEEALPTVLVGVFIEQPTPFLSLFFRRLLHLHYPRKQMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    + ++    + +++VK +     V + +ARN+  +     +G  +YF VD+
Sbjct: 324 NHEQHHKAQVEQFLAEHGSEYQSVKLVGPEVRVANADARNMGADLCRQDRGCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P  L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++A    T ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRAELQHTDLFHHSRLDPDMAFCANIRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 FGHLLSLDSYQTSHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLV-EMPCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+
Sbjct: 561 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|410966044|ref|XP_003989548.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Felis
           catus]
          Length = 727

 Score =  665 bits (1716), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/708 (44%), Positives = 470/708 (66%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +S +    +++ LGL + W G   +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSGQFFNYKIQALGLGEDWNGEKGASSGGGLKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    ++++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYPA
Sbjct: 85  LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPA 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA  + +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPSLSKLVAEWEGQDGDSDQLFYTKIFLDPEKRERINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GC  C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L+YP K++ +F++
Sbjct: 264 ETGCAVCDEGLRSLRGIGDEALPTVLVGVFIEQPTPFLSLFFQRLLRLHYPQKQMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    + ++    + +++VK +     V + +ARN+  +     +G  +YF VD+
Sbjct: 324 NHEQHHKAQVEQFLAEHGSEYQSVKLVGPEVRVANADARNVGADLCRQDRGCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P  L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++A  ++T ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRAELLQTDLFHHSKLDPDMAFCANIRQQDVFMYLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 FGHLLSLDSYQTSHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLV-EMPCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+
Sbjct: 561 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|334328372|ref|XP_001371352.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
           [Monodelphis domestica]
          Length = 725

 Score =  665 bits (1715), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 317/706 (44%), Positives = 471/706 (66%), Gaps = 10/706 (1%)

Query: 34  FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNLLKNELD 92
            LV+TVA+ ET+G++RF +SA+     ++ LGL + W GG+  ++LGGG KV LLK  L+
Sbjct: 25  LLVLTVATKETEGFRRFKRSAQFFNYNIQVLGLGEDWHGGEKETTLGGGQKVRLLKAALE 84

Query: 93  EMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG 152
           +    +D IIL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP V 
Sbjct: 85  KYAEKEDQIILFTDSYDVLFASGPKELLKKFRQTKSRVVFSAEELIYPDRRLEAKYPQVH 144

Query: 153 SGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANL 212
            G R+L SGGFIGYA ++ +L+++   ++ + DQL+Y  +FLD   R K  I LD    +
Sbjct: 145 DGKRFLGSGGFIGYAPNLSKLVASWQGQDSDSDQLFYTKIFLDPEQREKINITLDHRCRI 204

Query: 213 FQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TS 271
           FQNL G+L+++ L F+  + V   N +Y+T PV+IHGNG +K++LN  GNY+ + W   +
Sbjct: 205 FQNLDGALDEVVLKFETAQ-VRARNLEYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFET 263

Query: 272 GCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN 330
           GCT C+  ++ L  L     P+VLI +FI++PT FL  F  ++ +L YP K++ +F++N+
Sbjct: 264 GCTVCDEGLRSLKGLGDKALPTVLIGIFIEQPTPFLSLFFKRLLSLRYPRKQLRLFIHNH 323

Query: 331 QEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDS 389
           +E+H    + ++ +  + +  VK +  +  + + +ARN+  +     +   +Y  +D++ 
Sbjct: 324 EEHHEAQVEQFLEDHGSEYHTVKLVGPDQRMKNADARNMGADLCRQDRDCTYYLSMDAEV 383

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NPD L+ L+ +N+++IAPL++R  + WSNFWGAL+ADGFYARS DY++I+ G +   
Sbjct: 384 ALTNPDALRILIEQNKAVIAPLVIRAGRLWSNFWGALSADGFYARSEDYVDIVQGRR--V 441

Query: 450 GIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYG 508
           G+WNVPYI+N YL+K S +++    K ++  + +D DMAFC+N+R + + L + +   +G
Sbjct: 442 GVWNVPYISNIYLIKGSTLRSDLRQKDLFHSSKLDADMAFCSNVREQNVFLFVTNQHSFG 501

Query: 509 HLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVT 568
            L+  +N+     + +++E+  NP DW  +YIH  Y ++L    V   PCPDV+WFPI T
Sbjct: 502 RLLSVDNYQTTHLHNDLWEIFNNPEDWKEKYIHENYTEALKGKLVET-PCPDVYWFPIFT 560

Query: 569 EKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVP 628
           E  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+ P
Sbjct: 561 ETACDELVEEMEHFGQWSAGDNKDTRIQGGYENVPTIDIHMNQIKFEREWHKFLVEYIAP 620

Query: 629 LQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCR 688
           + E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGCR
Sbjct: 621 MTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGCR 679

Query: 689 FIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           FIRYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 FIRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 725


>gi|149695386|ref|XP_001491381.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
           [Equus caballus]
          Length = 727

 Score =  665 bits (1715), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/708 (44%), Positives = 469/708 (66%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ E++G++RF +SA+    +++ LGL + W G   +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKESEGFRRFKRSAQFFNYKIQALGLGEDWDGDKETSAGGGLKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    ++++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKRERINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLVGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P VL+ VFI++PT FL  F  ++  L+YP K++ +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPVVLVGVFIEQPTPFLSLFFQRLLRLHYPRKQLRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    + ++      +K+VK +     V + +ARN+  +     +G  +YF VD+
Sbjct: 324 NHEQHHKAQVEQFLAEHGGEYKSVKLVGPEVRVANADARNMGADLCRQDRGCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P  L+ L+ +N+++IAPL+ R  + WSNFWGA++ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGAMSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++A   +T ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRAELQQTDLFHHSKLDADMAFCANIRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL+  +++     + +++E+  NP DW  +YIH  Y ++L    V   PCPDV+WFPI
Sbjct: 502 FGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTRALAGKLV-EMPCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+
Sbjct: 561 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 727


>gi|296192333|ref|XP_002744029.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
           [Callithrix jacchus]
          Length = 753

 Score =  663 bits (1711), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/679 (45%), Positives = 455/679 (67%), Gaps = 11/679 (1%)

Query: 61  VKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDI 119
           V+TLGL + W GGD++ ++GGG KV  LK E+++    +DMII+  DSYDV++ GG +++
Sbjct: 81  VRTLGLGEEWRGGDVARTVGGGQKVRWLKKEMEKYADREDMIIMFVDSYDVVLAGGPSEL 140

Query: 120 LERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSI 179
           L++F    + ++F AE  CWP+  L ++YP VG+G R+LNSGGF+G+A  I +++     
Sbjct: 141 LKKFVQSGSRLLFSAESFCWPEWGLAEQYPEVGTGKRFLNSGGFVGFATTIHQIVRQWKY 200

Query: 180 KNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTK 239
           K+++DDQL+Y  L+LD  LR K  + LD  + +FQNL G+L+++ L FD +  V + N  
Sbjct: 201 KDDDDDQLFYTRLYLDPGLREKLGLNLDHKSRIFQNLNGALDEVVLKFDRNR-VRIRNVA 259

Query: 240 YNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKH-LDSLKPDQFPSVLISV 297
           Y+T PV++HGNG +K++LN  GNY+   W    GC  CN  +  L   +P   P V ++V
Sbjct: 260 YDTLPVVVHGNGPTKLQLNYLGNYVPNGWTPEGGCGFCNRDRRTLPGGQPP--PRVFLAV 317

Query: 298 FIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAH 357
           F+++PT FL  FL ++  L+YP  +I++F++NN+ +H P   D     +  F   K +  
Sbjct: 318 FVEQPTPFLPSFLQRLLLLDYPHDRITLFLHNNEVFHEPHIADSWPQLQEHFAATKLVGP 377

Query: 358 NSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPF 416
              ++  EAR++A++        +FYF +D+D+ L NP  L+ L+  N  +IAP+L R  
Sbjct: 378 EEALSPGEARDMAMDMCRQDPECEFYFSLDADAVLTNPQTLRILIEENRKVIAPMLSRHG 437

Query: 417 KAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKT 475
           K WSNFWGAL+ D +YARS DY+ ++   +   G+WNVPYI+  Y+++   ++     + 
Sbjct: 438 KLWSNFWGALSPDEYYARSEDYVELVQRKR--VGVWNVPYISQAYVIQGETLRMELPQRE 495

Query: 476 IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDW 535
           +++ +  D DMAFC + R+KGI L + +  E+G L+ +  +D +  +P+++++  NP+DW
Sbjct: 496 VFSGSDTDPDMAFCKSFRDKGIFLHLSNQHEFGRLLATSRYDTEHLHPDLWQIFDNPVDW 555

Query: 536 DLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL 595
             +YIH  Y ++L  + +  QPCPDV+WFP+++E+ C E V+ ME YGQWS G + D RL
Sbjct: 556 QEQYIHENYSRALEGEGIVEQPCPDVYWFPLLSEQMCDELVEEMEHYGQWSGGRHEDSRL 615

Query: 596 ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPD 655
             GYE VPT DIHMKQVG    W + LR YV P+ E  F GYH +  RA M+FVVRYRPD
Sbjct: 616 AGGYENVPTVDIHMKQVGYEDQWLQLLRTYVGPMTESLFPGYHTK-ARAVMNFVVRYRPD 674

Query: 656 EQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYH 715
           EQPSLRPHHDSST+T+N+ALN  G+DYEGGGCRF+RYNC +++ R GW L+HPGRLTHYH
Sbjct: 675 EQPSLRPHHDSSTFTLNVALNHKGLDYEGGGCRFLRYNCVISSPRKGWALLHPGRLTHYH 734

Query: 716 EGLQVTQGTRYIMISFVDP 734
           EGL  TQGTRYIM+SFVDP
Sbjct: 735 EGLPTTQGTRYIMVSFVDP 753


>gi|225690536|ref|NP_001071210.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Danio
           rerio]
          Length = 730

 Score =  662 bits (1709), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/734 (43%), Positives = 480/734 (65%), Gaps = 17/734 (2%)

Query: 7   LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGL 66
           L CL  S    F  +HCN+  +I E   LV+TVA+ ETDG++RF++SA+     +K LG 
Sbjct: 8   LACLFAS----FPPLHCNQQGSIPEGDLLVLTVATQETDGFRRFLRSAKHFNYTIKVLGR 63

Query: 67  HQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNT 125
            + W GGD M++ GGG KV LLK+ L+++   +  +IL  DSYDVI   G  ++L++F  
Sbjct: 64  GETWRGGDYMTAPGGGQKVRLLKSALEDIQ-EEKKVILFVDSYDVIFSSGPKELLKKFQQ 122

Query: 126 FDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDD 185
               +VF AE L WPD  L DK+P V  G R+L +GGFIGYA ++K+++S+ S  + + D
Sbjct: 123 AKHKVVFSAETLIWPDRHLEDKHPHVREGKRFLGAGGFIGYAANLKKMLSDWSGADGDSD 182

Query: 186 QLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPV 245
           QL+Y  +++++  R    I LD+   LFQNL+G+L+++ L F+ D  V   N  Y+T PV
Sbjct: 183 QLFYTKIYINKEKRKSINITLDSKCRLFQNLHGALDEVVLKFE-DGRVRARNVLYDTLPV 241

Query: 246 IIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNLIKHLDS-LKPDQFPSVLISVFIDKPT 303
           IIHGNG +K+++N  GNY+   W   +GCT CN  + L S L+  ++P V+I +FI +PT
Sbjct: 242 IIHGNGPTKLQINYLGNYIPNLWTFETGCTMCNQDRRLLSGLQESEYPVVVIGIFIQQPT 301

Query: 304 AFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNS 363
            F+  F  ++ NL YP  ++ +F+YN + +H      ++ + ++ ++ VK I     ++ 
Sbjct: 302 PFVTVFFERLFNLKYPKNRLKLFIYNQETHHEQHIHAFLDSHESEYQGVKLIGPEEDIDP 361

Query: 364 KEARNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNF 422
             +RNL  +        +++F +D D  L N D L+ L+  N+  IAP+L +P + W+NF
Sbjct: 362 VSSRNLGFDMCREDIDCEYFFSIDVDVVLKNEDTLRILIEHNKPFIAPMLTKPGRLWTNF 421

Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKT--IYTLN 480
           WGAL+ADGFYARS DY++I+ G +   G+WNVPY+++ +L+K   ++ T++K   ++   
Sbjct: 422 WGALSADGFYARSEDYVDIVQGHR--VGLWNVPYVSHIFLIKADTLR-TDLKDPDLFKST 478

Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
           ++D DMAFC  +RNKG+ + + +   +G ++ ++N+     + +++++  NP++W+ RYI
Sbjct: 479 TLDPDMAFCEKIRNKGVFMFVTNMDTFGRVLSTDNYQTNHLHNDLWQIFENPVEWEERYI 538

Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYE 600
           HP Y + +L D     PCPDV+WFPI +E  C   V+ ME +GQWS G N D R++ GYE
Sbjct: 539 HPNYSR-VLKDEFIETPCPDVYWFPIFSEVACDHLVEEMENFGQWSGGANVDNRIQGGYE 597

Query: 601 AVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSL 660
            VPT DIHM QVG    W +FL  Y+ P+ E+ F GY+    +  ++FVVRY+PDEQPSL
Sbjct: 598 NVPTIDIHMNQVGYEKEWQKFLLDYIAPVTEKMFPGYYTR-AQFDLAFVVRYKPDEQPSL 656

Query: 661 RPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQV 720
           RPHHD+ST+TINIALN VG+D++GGGCRF+RY+C++ + R GW  MHPGRLTHYHEGL  
Sbjct: 657 RPHHDASTFTINIALNHVGIDFQGGGCRFLRYDCSIRSPRKGWAFMHPGRLTHYHEGLPT 716

Query: 721 TQGTRYIMISFVDP 734
           T+G RYI +SFVDP
Sbjct: 717 TEGVRYIAVSFVDP 730


>gi|402863091|ref|XP_003895867.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
           5-dioxygenase 3 [Papio anubis]
          Length = 736

 Score =  662 bits (1708), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 319/714 (44%), Positives = 469/714 (65%), Gaps = 18/714 (2%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LV+TVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 33  VNPEKLLVMTVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 92

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+ G  +++L++F    + ++F AE  CWP+  L ++
Sbjct: 93  KKEMEKYADREDMIIMFVDSYDVILAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 152

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 153 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 212

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 213 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 271

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 272 WTPEGGCGFCNRDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 329

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDF---- 381
           F++NN+ +H P   D     +  F  VK +     ++  EAR++A+  +  +        
Sbjct: 330 FLHNNEVFHEPHIADSWPQLQDHFAVVKLVGPEEALSPGEARDMAIRRAPQQARTHFCLG 389

Query: 382 YFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
           Y    + S L +P     L  R   +IAP+L R  K WSNFWGAL+ D +YARS DY+ +
Sbjct: 390 YGAPQACSRLPSPH--PRLSXRK--VIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVEL 445

Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLK 500
           +   +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI L 
Sbjct: 446 VQRKR--VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIFLH 503

Query: 501 IDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPD 560
           + +  E+G L+ +  +D +  +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPD
Sbjct: 504 LSNQHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPD 563

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
           V+WFP+++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    W +
Sbjct: 564 VYWFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQ 623

Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGV 680
            LR YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+
Sbjct: 624 LLRTYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGL 682

Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           DYEGGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 683 DYEGGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 736


>gi|110825733|sp|O77588.2|PLOD1_BOVIN RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1;
           AltName: Full=Lysyl hydroxylase 1; Short=LH1; Flags:
           Precursor
 gi|95767528|gb|ABF57309.1| lysyl hydroxylase precursor [Bos taurus]
          Length = 726

 Score =  661 bits (1705), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/708 (44%), Positives = 470/708 (66%), Gaps = 10/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W G  M + GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWPGEAMLA-GGGLKVRLLKKA 83

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    ++++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 84  LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 143

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 144 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 203

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++ + V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 204 RIFQNLDGALDEVVLKFEMGQ-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 262

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GC  C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L+YP K++ +F++
Sbjct: 263 ETGCAVCDEGLRSLKGIGDEALPAVLVGVFIEQPTPFLSLFFQRLLRLHYPQKRLRLFIH 322

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    + ++      +++VK +     V + +ARN+  +     +G  +YF VD+
Sbjct: 323 NHEQHHKAQVEQFLAEHGDEYQSVKLVGPEVRVANADARNMGADLCRQDRGCTYYFSVDA 382

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P  L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 383 DVALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 441

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++A   +T ++  + +D DMAFC N+R + + + + +   
Sbjct: 442 -VGVWNVPYISNIYLIKGSALRAELQETDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHS 500

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 501 FGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKMVE-MPCPDVYWFPI 559

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+
Sbjct: 560 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQINFEREWHKFLVEYI 619

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 620 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLVPHHDASTFTINIALNRVGVDYEGGG 678

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 679 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 726


>gi|344283497|ref|XP_003413508.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
           [Loxodonta africana]
          Length = 727

 Score =  660 bits (1703), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/708 (43%), Positives = 471/708 (66%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W  G  +  GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNSGQETPAGGGQKVRLLKRA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    ++++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L+S    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVSEWEGQDSDSDQLFYTKIFLDPEKRERINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V + N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRVRNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  ++ +  P+VL+ +FI++PT FL  F  ++  L+YP K++ +F++
Sbjct: 264 ETGCTVCDEGLRLLKGIRDEALPTVLVGIFIEQPTPFLVLFFQRLLRLHYPWKQMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    + ++    + +++VK +     + + +ARNL  +     +   +YF +D+
Sbjct: 324 NHEQHHKAQVEQFLAEHGSKYQSVKLVGPEIRMANADARNLGADLCRKDQSCTYYFSMDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  PD L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALKEPDTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S +++    K ++  + +D DM+FC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRSDLQQKDLFHHSKLDPDMSFCANVRQQAVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 FGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALEGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+
Sbjct: 561 FTEAACDELVEEMEHYGQWSRGDNKDNRIQGGYENVPTIDIHMNQINFEREWHKFLVEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW L+HPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLLHPGRLTHYHEGLPTTKGTRYIAVSFVDP 727


>gi|70779497|gb|AAZ08241.1| procollagen lysine 2-oxoglutarate 5-dioxygenase 1 [Danio rerio]
 gi|116487779|gb|AAI25831.1| Zgc:152876 [Danio rerio]
          Length = 730

 Score =  660 bits (1702), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 317/734 (43%), Positives = 480/734 (65%), Gaps = 17/734 (2%)

Query: 7   LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGL 66
           L CL  S    F  ++CN+  +I E   LV+TVA+ ETDG++RF++SA+     +K LG 
Sbjct: 8   LACLFAS----FPPLYCNQQGSIPEGDLLVLTVATQETDGFRRFLRSAKHFNYTIKVLGR 63

Query: 67  HQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNT 125
            + W GGD M++ GGG KV LLK+ L+++   +  +IL  DSYDVI   G  ++L++F  
Sbjct: 64  GETWRGGDYMTAPGGGQKVRLLKSALEDIQ-EEKKVILFVDSYDVIFSSGPKELLKKFQQ 122

Query: 126 FDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDD 185
               +VF AE L WPD  L DK+P V  G R+L +GGFIGYA ++K+++S+ S  + + D
Sbjct: 123 AKHKVVFSAETLIWPDRHLEDKHPHVREGKRFLGAGGFIGYAANLKKMLSDWSGADGDSD 182

Query: 186 QLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPV 245
           QL+Y  +++++  R    I LD+   LFQNL+G+L+++ L F+ D  V   N  Y+T PV
Sbjct: 183 QLFYTKIYINKEKRKSINITLDSKCRLFQNLHGALDEVVLKFE-DGRVRARNVLYDTLPV 241

Query: 246 IIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNLIKHLDS-LKPDQFPSVLISVFIDKPT 303
           IIHGNG +K+++N  GNY+   W   +GCT CN  + L S L+  ++P V+I +FI +PT
Sbjct: 242 IIHGNGPTKLQINYLGNYIPNLWTFETGCTMCNQDRRLLSGLQESEYPVVVIGIFIQQPT 301

Query: 304 AFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNS 363
            F+  F  ++ NL YP  ++ +F+YN + +H      ++ + ++ ++ VK I     ++ 
Sbjct: 302 PFVTVFFERLFNLKYPKNRLKLFIYNQETHHEQHIHAFLDSHESEYQGVKLIGPEEDIDP 361

Query: 364 KEARNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNF 422
             +RNL  +        +++F +D D  L N D L+ L+  N+  IAP+L +P + W+NF
Sbjct: 362 VSSRNLGFDMCREDIDCEYFFSIDVDVVLKNEDTLRILIEHNKPFIAPMLTKPGRLWTNF 421

Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKT--IYTLN 480
           WGAL+ADGFYARS DY++I+ G +   G+WNVPY+++ +L+K   ++ T++K   ++   
Sbjct: 422 WGALSADGFYARSEDYVDIVQGHR--VGLWNVPYVSHIFLIKADTLR-TDLKDPDLFKST 478

Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
           ++D DMAFC  +RNKG+ + + +   +G ++ ++N+     + +++++  NP++W+ RYI
Sbjct: 479 TLDPDMAFCEKIRNKGVFMFVTNMDTFGRVLSTDNYQTNHLHNDLWQIFENPVEWEERYI 538

Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYE 600
           HP Y + +L D     PCPDV+WFPI +E  C   V+ ME +GQWS G N D R++ GYE
Sbjct: 539 HPNYSR-VLKDEFIETPCPDVYWFPIFSEVACDHLVEEMENFGQWSGGANVDNRIQGGYE 597

Query: 601 AVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSL 660
            VPT DIHM QVG    W +FL  Y+ P+ E+ F GY+    +  ++FVVRY+PDEQPSL
Sbjct: 598 NVPTIDIHMNQVGYEKEWQKFLLDYIAPVTEKMFPGYYTR-AQFDLAFVVRYKPDEQPSL 656

Query: 661 RPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQV 720
           RPHHD+ST+TINIALN VG+D++GGGCRF+RY+C++ + R GW  MHPGRLTHYHEGL  
Sbjct: 657 RPHHDASTFTINIALNHVGIDFQGGGCRFLRYDCSIRSPRKGWAFMHPGRLTHYHEGLPT 716

Query: 721 TQGTRYIMISFVDP 734
           T+G RYI +SFVDP
Sbjct: 717 TEGVRYIAVSFVDP 730


>gi|291410569|ref|XP_002721561.1| PREDICTED: lysyl hydroxylase 1 [Oryctolagus cuniculus]
          Length = 727

 Score =  660 bits (1702), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/708 (44%), Positives = 467/708 (65%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W      S GGG KV LL+  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWPVEKGLSAGGGQKVRLLRKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D+++L TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVVLFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA  +++L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPSLRKLVAEWEGQDSDSDQLFYTKIFLDPEKRERINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GC  C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L+YP K++ +FV+
Sbjct: 264 ETGCAVCDEGLRSLKGIGEEALPTVLVGVFIEQPTPFLSLFFQRLLRLHYPRKQMRLFVH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    + ++      +++VK +   + + + +ARNL  +     +   +YF +D+
Sbjct: 324 NHEQHHKAQVEQFLLEHGDEYQSVKLVGPEARMANADARNLGADLCRQDRACTYYFSMDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  PD L+ L+ +N++++APL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPDSLRLLIEQNKNVLAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++A  +   +Y  + +D DMAFC NLR + + + + +   
Sbjct: 443 -VGVWNVPYISNVYLIKGSALRAELHSPDLYRYSKLDPDMAFCANLRKQEVFMFLTNRHS 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL+  +N+     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 FGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHEGYGKALAGKLV-EMPCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWALMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 727


>gi|355712268|gb|AES04293.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1 [Mustela
           putorius furo]
          Length = 727

 Score =  660 bits (1702), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/708 (44%), Positives = 465/708 (65%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +S +    +++ LGL + W G   SS GGG KV LLK  
Sbjct: 25  EDNLLVLTVATRETEGFRRFKRSGQFFNYKIQALGLGEDWSGEKGSSAGGGLKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    ++++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYPA
Sbjct: 85  LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPA 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKRERINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCN-LIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GC  C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L+YP K++ +F++
Sbjct: 264 ETGCAVCDESLRSLRGIGDEALPTVLVGVFIEQPTPFLSLFFQRLLRLHYPRKQMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++ +H    + ++      + +VK +     + + +ARN+  +     +   +YF VD+
Sbjct: 324 NHEPHHKVQVEQFLAEHGDEYPSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P  L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++A   +T ++    +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRAELRQTDLFHHRKLDPDMAFCANIRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL+  +N+     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 FGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+
Sbjct: 561 FTEAACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQINFEREWHKFLVEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|311258478|ref|XP_003127625.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
           [Sus scrofa]
          Length = 725

 Score =  659 bits (1700), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/708 (44%), Positives = 470/708 (66%), Gaps = 11/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W   + SS GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNEKEASS-GGGLKVRLLKKA 83

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L E    ++++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYPA
Sbjct: 84  L-EKHADENLVILFTDSYDVVFASGPRELLKKFRQAKSQVVFSAEELIYPDRRLEAKYPA 142

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 143 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKRERINITLDHRC 202

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++ + V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 203 RIFQNLDGALDEVVLKFEMGQ-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 261

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GC+ C+  ++ L  +  +  P+VL+ +FI++PT FL  F  ++  L YP K++ +F++
Sbjct: 262 ETGCSVCDEGLRSLKGIGDEALPTVLVGLFIEQPTPFLSLFFQRLLRLQYPRKRMRLFIH 321

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H  L + ++      +++VK +     V + +ARN+  +     +   +YF VD+
Sbjct: 322 NHEQHHKALVEQFLAEHGDEYQSVKLVGPEVRVANADARNMGADLCRQDRSCTYYFSVDA 381

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P  L+ L+ +N++++APL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 382 DVALTEPKTLRLLIEQNKNVLAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 440

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++A   +T ++  + +D DMAFC N+R + + + + +   
Sbjct: 441 -VGVWNVPYISNVYLIKGSALRAELQQTDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHA 499

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 500 FGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWFPI 558

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+
Sbjct: 559 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 618

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 619 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 677

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 678 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 725


>gi|426240317|ref|XP_004014056.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Ovis
           aries]
          Length = 703

 Score =  659 bits (1699), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/705 (44%), Positives = 468/705 (66%), Gaps = 10/705 (1%)

Query: 34  FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDE 93
            LV+TVA+ ET+G++RF +SA+    +++ LGL + W G  MS+ GGG KV LLK  L++
Sbjct: 5   LLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWPGEAMSA-GGGLKVRLLKKALEK 63

Query: 94  MDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGS 153
               ++++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP V  
Sbjct: 64  HADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPVVSD 123

Query: 154 GYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLF 213
           G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD    +F
Sbjct: 124 GKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCRIF 183

Query: 214 QNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSG 272
           QNL G+L+++ L F++ + V   N  Y+T PV+IHGNG +K++LN  GNY+ + W   +G
Sbjct: 184 QNLDGALDEVVLKFEMGQ-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFETG 242

Query: 273 CTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQ 331
           C  C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L+YP K++ +F++N++
Sbjct: 243 CAVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFQRLLRLHYPRKRLRLFIHNHE 302

Query: 332 EYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSH 390
           ++H    + ++      +++VK +     + S +ARN+  +     +G  +YF VD+D  
Sbjct: 303 QHHKAQVEQFLAEHGDEYQSVKLVGPEVRMASADARNMGADLCRQDRGCTYYFSVDADVA 362

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L  P  L+ L+ +N+++I PL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +   G
Sbjct: 363 LTEPRTLRLLIEQNKNVITPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR--VG 420

Query: 451 IWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           +WNVPYI+N YL+K S ++A   +T ++  + +D DMAFC N+R + + + + +   +GH
Sbjct: 421 VWNVPYISNIYLIKGSALRAELQETDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHSFGH 480

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           L+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI TE
Sbjct: 481 LLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKMVE-MPCPDVYWFPIFTE 539

Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
             C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+ P+
Sbjct: 540 TACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQINFEREWHKFLVEYIAPM 599

Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
            E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGCRF
Sbjct: 600 TEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLVPHHDASTFTINIALNRVGVDYEGGGCRF 658

Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           +RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 659 LRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 703


>gi|444728170|gb|ELW68634.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Tupaia
           chinensis]
          Length = 727

 Score =  658 bits (1697), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 317/709 (44%), Positives = 465/709 (65%), Gaps = 11/709 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W      S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGMSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHANKEDLVILFTDSYDVVFASGPRELLKKFRQAKSQVVFSAEELIYPDRRLEAKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++K+L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLKKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINISLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +KI+LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKIQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GC  C+  +  L  +  +  P VL+ VFI++PT FL  F  ++  L YP K++ +F++
Sbjct: 264 ETGCAVCDEGLGSLKGIGDEALPIVLVGVFIEQPTPFLSLFFQRLRRLRYPQKRMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    + ++    + +++VK +     V + +ARN+  +     +   +YF VD+
Sbjct: 324 NHEQHHKAQVERFLAEHGSEYQSVKLVGPEVRVATADARNMGADLCRQDRSCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  PD L+ L+ +N+++IAPLL R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPDSLRLLIEQNKNVIAPLLTRQGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT--IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
             G+WNVPYI+N YL+K S ++ T ++T  ++    +D DMAFC NLR +   + + +  
Sbjct: 443 -VGVWNVPYISNIYLIKGSALR-TELQTTDLFHHRKLDPDMAFCANLRQQDAFMFLTNRH 500

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
            +GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFP
Sbjct: 501 TFGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWFP 559

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I T+  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y
Sbjct: 560 IFTDAACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEY 619

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGG
Sbjct: 620 IAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGG 678

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GCRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 679 GCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 727


>gi|281349272|gb|EFB24856.1| hypothetical protein PANDA_011823 [Ailuropoda melanoleuca]
          Length = 702

 Score =  658 bits (1697), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/707 (44%), Positives = 464/707 (65%), Gaps = 9/707 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           D  LV+TVA+ ET+G++RF +S +    +++ LGL + W G   SS GGG KV LLK  L
Sbjct: 1   DNLLVLTVATRETEGFRRFKRSGQFFNYKIQALGLGEDWSGEKGSSAGGGLKVRLLKKAL 60

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
           ++    ++++IL  DSYDV+   G  ++L++F    + +VF AE L +PD  L  KYPAV
Sbjct: 61  EKHADKENLVILFIDSYDVLFASGPRELLKKFRQAKSQVVFSAEELIYPDRRLEAKYPAV 120

Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
             G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD    
Sbjct: 121 SDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCR 180

Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
           +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W   
Sbjct: 181 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFE 239

Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           +GC  C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L YP K++ +F++N
Sbjct: 240 TGCAVCDEGLRSLRGIGDEALPTVLVGVFIEQPTPFLSLFFQRLLRLRYPRKQMRLFIHN 299

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
           ++++H    + ++      +++VK +     V + +ARN+  +     +   +YF VD+D
Sbjct: 300 HEQHHKAQVEQFLAEHGGEYQSVKLVGPEVRVANADARNMGADLCRQDRSCTYYFSVDAD 359

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L  P  L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +  
Sbjct: 360 VALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 417

Query: 449 KGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            G+WNVPYI+N YL+K S ++    +T ++  + +D DMAFC N+R + + + + +   +
Sbjct: 418 VGVWNVPYISNVYLIKGSALRGELQQTDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHTF 477

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
           GHL+  +N+     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI 
Sbjct: 478 GHLLSLDNYQTSHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLV-EMPCPDVYWFPIF 536

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           TE  C E V+ ME YG+WS G N D R++ GYE VPT DIHM Q+     W +FL +Y+ 
Sbjct: 537 TEAACDELVEEMEHYGRWSLGNNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYIA 596

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 597 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGC 655

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 656 RFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 702


>gi|301774781|ref|XP_002922812.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
           [Ailuropoda melanoleuca]
          Length = 737

 Score =  657 bits (1696), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/707 (44%), Positives = 464/707 (65%), Gaps = 9/707 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           D  LV+TVA+ ET+G++RF +S +    +++ LGL + W G   SS GGG KV LLK  L
Sbjct: 36  DNLLVLTVATRETEGFRRFKRSGQFFNYKIQALGLGEDWSGEKGSSAGGGLKVRLLKKAL 95

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
           ++    ++++IL  DSYDV+   G  ++L++F    + +VF AE L +PD  L  KYPAV
Sbjct: 96  EKHADKENLVILFIDSYDVLFASGPRELLKKFRQAKSQVVFSAEELIYPDRRLEAKYPAV 155

Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
             G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD    
Sbjct: 156 SDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCR 215

Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
           +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W   
Sbjct: 216 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFE 274

Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           +GC  C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L YP K++ +F++N
Sbjct: 275 TGCAVCDEGLRSLRGIGDEALPTVLVGVFIEQPTPFLSLFFQRLLRLRYPRKQMRLFIHN 334

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
           ++++H    + ++      +++VK +     V + +ARN+  +     +   +YF VD+D
Sbjct: 335 HEQHHKAQVEQFLAEHGGEYQSVKLVGPEVRVANADARNMGADLCRQDRSCTYYFSVDAD 394

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L  P  L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +  
Sbjct: 395 VALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 452

Query: 449 KGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            G+WNVPYI+N YL+K S ++    +T ++  + +D DMAFC N+R + + + + +   +
Sbjct: 453 VGVWNVPYISNVYLIKGSALRGELQQTDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHTF 512

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
           GHL+  +N+     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI 
Sbjct: 513 GHLLSLDNYQTSHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLV-EMPCPDVYWFPIF 571

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           TE  C E V+ ME YG+WS G N D R++ GYE VPT DIHM Q+     W +FL +Y+ 
Sbjct: 572 TEAACDELVEEMEHYGRWSLGNNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYIA 631

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 632 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGC 690

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 691 RFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 737


>gi|443689849|gb|ELT92140.1| hypothetical protein CAPTEDRAFT_182861 [Capitella teleta]
          Length = 701

 Score =  657 bits (1696), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/707 (47%), Positives = 451/707 (63%), Gaps = 15/707 (2%)

Query: 37  ITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSS-LGGGYKVNLLKNELDEMD 95
           +TVA++ TDG++RFI+S E   L VK LG+ Q W GGD+    GGG KVNLLK  L+E+ 
Sbjct: 1   MTVATDNTDGFQRFIRSTETFNLDVKVLGMGQKWEGGDIVKYAGGGQKVNLLKEGLEELK 60

Query: 96  ITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG-SG 154
              D+I++  DSYDVI+D G + IL  F  FDA +VF AE  CWPD SL  +YP V  S 
Sbjct: 61  EKKDLIVMFVDSYDVIMDAGADAILAAFKKFDARVVFSAEGFCWPDASLAHEYPEVKMSE 120

Query: 155 YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQ 214
            RYLNSGGFIGYA DI +LI   S+++++DDQL+Y   FLD+TLR K  I LD+   +FQ
Sbjct: 121 KRYLNSGGFIGYATDIYKLIGGSSLRSDDDDQLFYTKSFLDKTLREKLGIKLDSKGEIFQ 180

Query: 215 NLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGC 273
           NL G+L+D+K+ F      +L N K    P++IHGNG  K   N+  NYL   W  T GC
Sbjct: 181 NLNGALDDVKVKFKGSS-SYLYNMKTGVTPLVIHGNGPIKHHFNALTNYLGGHWTPTGGC 239

Query: 274 TRC-NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQE 332
             C     HLD+ K + +P V++++FI++PTAFL+EF   I NL+YP  KI ++++ + E
Sbjct: 240 NGCKQRTIHLDATKTENYPQVMMAIFIEQPTAFLQEFFYNIGNLSYPKSKIDLYLHYSDE 299

Query: 333 YHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLD 392
                 D+++      F + +     S +N   ARN A+E    K  ++ F VD D  L+
Sbjct: 300 SSKKYVDEFLERNGDEFGSKQIETPVSELNDWTARNKALEKCNSKKCEYLFTVDGDVQLE 359

Query: 393 NPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIW 452
           + + L  L+  N S+IAPLL RP K WSNFWG+L+ DGFY RS DY  I+ G Q  KG W
Sbjct: 360 DHNTLVDLIQYNRSVIAPLLSRPGKLWSNFWGSLSPDGFYKRSDDYAEIVTGRQ--KGQW 417

Query: 453 NVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVD 512
           NVPYI+   L+   ++ +  +   YT + +D DMA C  +R KGI + +D+ ++YG LVD
Sbjct: 418 NVPYISQSLLIHGYLVPS--LLGGYTDSDLDSDMAICKRMREKGIFMYVDNQKKYGLLVD 475

Query: 513 SENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSL--LPDTVNNQPCPDVFWFPIVTEK 570
           SE FDP K + ++Y +  N   W+ +Y+HPE+ + L  +P +   QPCPDVFW PIVT K
Sbjct: 476 SEQFDPSKAHGDLYMIFDNREMWEKKYLHPEFNRYLNTVPFSELEQPCPDVFWLPIVTTK 535

Query: 571 FCHEFVQIMEAYGQWSDGTNN---DKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           FC + +  ME YG+WS G +    D RL   YE VPT DIH  Q+     W E L+ Y+ 
Sbjct: 536 FCWDLIDEMEHYGKWSGGGHQPAVDDRLGGSYENVPTVDIHTNQIDWEPQWLEILKSYIG 595

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           P   + F GY+ E  RA M+FVVRY P EQ  L+PH DSS+YTINIALN+ G+D+ GGG 
Sbjct: 596 PYSGKVFEGYYTE-ARAHMNFVVRYTPGEQDRLKPHSDSSSYTINIALNRPGIDFTGGGT 654

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RFIR NC+VT  R GW+LMH GRLTH+HEGL  T GTRYIM+SF+DP
Sbjct: 655 RFIRQNCSVTNARQGWLLMHAGRLTHFHEGLPTTGGTRYIMVSFIDP 701


>gi|390465346|ref|XP_003733390.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
           [Callithrix jacchus]
          Length = 727

 Score =  657 bits (1696), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/708 (44%), Positives = 468/708 (66%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LVITVA+ ET+G++RF +SA+    +++ LGL + W     +  GGG KV LLK  
Sbjct: 25  EDNLLVITVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKRTLAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYPA
Sbjct: 85  LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPA 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWDGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  ++ +  P+VL+ VFI++PT F+  F  ++  L+YP K I +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIEDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPRKHIRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+
Sbjct: 324 NHEQHHKTQVEEFLAEHGSEYQSVKLVGPEVRMVNADARNMGADLCRQDRSCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++A      ++    +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRAELQSPDLFHHRKLDPDMAFCANIRQQDVFMFLTNRHG 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
            GHL+  +N+     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 LGHLLSLDNYRTTHLHNDLWEVFSNPEDWKEKYIHVNYTKALAGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|118403473|ref|NP_001072340.1| lysyl hydroxylase 1 precursor [Xenopus (Silurana) tropicalis]
 gi|111305666|gb|AAI21423.1| lysyl hydroxylase [Xenopus (Silurana) tropicalis]
          Length = 722

 Score =  656 bits (1692), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/719 (44%), Positives = 472/719 (65%), Gaps = 19/719 (2%)

Query: 20  SVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLG 79
           S++C    N   D  LV+T+A+ ETDG KRF +SA     +VK LGL + WLG       
Sbjct: 19  SMNC---ANASADNLLVLTIATEETDGLKRFQRSAHSFNYKVKVLGLGEEWLGE------ 69

Query: 80  GGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCW 139
            G KV L+K  L+     +D+IIL T+SYDVI   G  ++L++F    + +VF AE + +
Sbjct: 70  -GQKVRLMKFALEPYADKEDLIILFTESYDVIFASGPGELLKKFRQAKSKVVFSAESVAY 128

Query: 140 PDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLR 199
           PD  L  KYPAVG G R+L SG FIGYA  + +++++   K+++ DQL+Y  LFLD   R
Sbjct: 129 PDRHLESKYPAVGEGKRFLGSGAFIGYATHLYKMVADWDGKDKDSDQLFYTKLFLDPVKR 188

Query: 200 TKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNS 259
            K  I LD    +FQNLYGS ED+ L F+ +  V      Y+T PV+IHGNG +K+ LN 
Sbjct: 189 GKINITLDHRCRIFQNLYGSAEDVALKFE-NGRVRARYLVYDTLPVLIHGNGPTKLHLNY 247

Query: 260 FGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLN 317
            GNY+ + W   SGC  C+  +++L+ L  D FP V+I +FI++PT F+ EF  ++ NLN
Sbjct: 248 LGNYIPRVWTFESGCNVCDEGVRNLEGLTVDTFPLVVIGIFIEQPTPFVSEFFKRLNNLN 307

Query: 318 YPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK 377
           YP K+I +++ N++ +H    ++++  + T +  VK +  +   +  ++RN  ++     
Sbjct: 308 YPKKRIQLYISNHEPHHQRRVENFLQAYGTQYSFVKTVGPDENSDFADSRNKGMDMCRQT 367

Query: 378 G-VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSF 436
              ++YF +D+   L N ++L+ L+ +N+S+IAPL+ R    WSNFWGAL++DG+YARS 
Sbjct: 368 PECEYYFSIDAPVVLKNINILRILIEQNKSVIAPLVSRTANLWSNFWGALSSDGYYARSE 427

Query: 437 DYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNK 495
           DY++I+   +   G+WNVPYI++ YL+K S++++  +   I+   + D+DMAFC N+R +
Sbjct: 428 DYIHIVQRQR--IGVWNVPYISSVYLVKGSILRSKLSQNDIFHSGTQDFDMAFCHNIRQQ 485

Query: 496 GIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNN 555
           GI + + + QE+GH++  EN+     + +++E+  N  DW  +YIHP + ++L    V  
Sbjct: 486 GIFMFVTNRQEFGHILSLENYKTTHLHNDLWEIFENTEDWKEKYIHPNHSEALKGKLVE- 544

Query: 556 QPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLA 615
            PCPDV+WFPI +E  C+E V+ ME +G+WS G+N D RL+ GYE VPT DIHM Q+G  
Sbjct: 545 MPCPDVYWFPIFSETTCNELVEEMENFGKWSSGSNKDNRLQGGYENVPTIDIHMNQIGYE 604

Query: 616 GVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIAL 675
             W + L  ++ PL E+ F GY+    +  ++FVVRY+PDEQP L PHHD+ST+TINIAL
Sbjct: 605 KEWHKILLDFIAPLTEKLFPGYYTR-AQFDLAFVVRYKPDEQPLLEPHHDASTFTINIAL 663

Query: 676 NQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           N VG DYEGGGCRF+RYNC+V A R GW LMHPGRLTHYHEGL+VT+GTRYI++SFVDP
Sbjct: 664 NSVGQDYEGGGCRFLRYNCSVRAPRKGWALMHPGRLTHYHEGLRVTKGTRYIVVSFVDP 722


>gi|397502970|ref|XP_003822109.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
           isoform 1 [Pan paniscus]
          Length = 727

 Score =  656 bits (1692), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/708 (44%), Positives = 467/708 (65%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVDKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
            GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC+V A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|190074|gb|AAA60116.1| lysyl hydroxylase [Homo sapiens]
          Length = 727

 Score =  656 bits (1692), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/708 (43%), Positives = 467/708 (65%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
            GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|351713694|gb|EHB16613.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Heterocephalus
           glaber]
          Length = 764

 Score =  656 bits (1692), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/710 (44%), Positives = 467/710 (65%), Gaps = 9/710 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLK 88
           +  D  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG K+ LLK
Sbjct: 60  LGSDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWGVERGTSAGGGQKIRLLK 119

Query: 89  NELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKY 148
             L++ +  +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KY
Sbjct: 120 KALEKHEDKEDLVILFTDSYDVVFASGPRELLKKFRQARSRVVFSAEELIYPDRRLEAKY 179

Query: 149 PAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDT 208
           P V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD 
Sbjct: 180 PMVSDGKRFLGSGGFIGYAPNLSKLVAKWEGQDSDSDQLFYTKIFLDPEKREQINISLDH 239

Query: 209 LANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW 268
              +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+   W
Sbjct: 240 RCRIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPHFW 298

Query: 269 K-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
              +GCT C+  ++ L  +  +  P VL+ VFI++PT FL  F  ++  L+YP  ++ +F
Sbjct: 299 TFETGCTVCDEGLRSLKGIGDEALPMVLVGVFIEQPTPFLSLFFQRLLRLHYPRSRMRLF 358

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE-NSLHKGVDFYFYV 385
           +++++++H    + ++      +++VK +     +++  ARN+  +     +   +YF V
Sbjct: 359 IHSHEQHHKAQVEQFLAEHGGEYQSVKLVGPEVRMSNANARNMGADLCRQEQTCTYYFSV 418

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L  PD L+ L+ +N+++IAPL++RP + WSNFWGAL+ADG+YARS DY++I++G 
Sbjct: 419 DADVALTEPDSLRLLIEQNKNVIAPLMMRPGRLWSNFWGALSADGYYARSEDYVDIVHGR 478

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
           +   GIWNVPYI+N YL+K S ++A    T ++    +D DMAFC N+R + + + + + 
Sbjct: 479 R--VGIWNVPYISNIYLIKGSALRAELQHTDLFHHRKLDPDMAFCANIRQQEVFMFLTNR 536

Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
             +GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WF
Sbjct: 537 HSFGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWF 595

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           PI TE  C E VQ ME +GQWS G N D R++ GYE VPT DIHM Q+     W +FL +
Sbjct: 596 PIFTEVACDELVQEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVE 655

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           Y+ PL E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VG DYEG
Sbjct: 656 YIAPLTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRVGEDYEG 714

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGCRF+RYNC+V A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 715 GGCRFLRYNCSVQAPRKGWALMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 764


>gi|16741721|gb|AAH16657.1| Procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1 [Homo sapiens]
          Length = 727

 Score =  655 bits (1691), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/708 (43%), Positives = 467/708 (65%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQSRSQVVFSAEELIYPDRRLETKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
            GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|410340317|gb|JAA39105.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1 [Pan
           troglodytes]
          Length = 727

 Score =  655 bits (1690), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/708 (44%), Positives = 467/708 (65%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVDKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
            GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC+V A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|114554002|ref|XP_001142788.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
           isoform 7 [Pan troglodytes]
          Length = 727

 Score =  655 bits (1690), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/708 (44%), Positives = 467/708 (65%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVDKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPNKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
            GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC+V A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|307748828|gb|ADN91862.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1 [Canis lupus
           familiaris]
          Length = 727

 Score =  655 bits (1689), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 316/708 (44%), Positives = 468/708 (66%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +S +    +++ LGL + W G   +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATTETEGFRRFKRSGQFFNYKIQALGLGEDWTGEKGTSAGGGLKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F      +VF AE L +PD  L  KYPA
Sbjct: 85  LEKHADKEDLVILFTDSYDVVFASGPRELLKKFRQARGQVVFSAEELIYPDRRLEAKYPA 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA  + +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLFYTQIFLDPEKRERINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GC+ C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L+YP K++ +F++
Sbjct: 264 ETGCSVCDEGLRSLRGIGEEALPTVLVGVFIEQPTPFLSLFFLRLLRLHYPRKQMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    + ++    + +++VK +     V + +ARN+  +     +G  +YF VD+
Sbjct: 324 NHEQHHKAQVEQFLAEHGSEYQSVKLVGPEVRVANADARNMGADLCRQDRGCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P  L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPKTLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++A    T ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRAELQHTDLFHHSRLDPDMAFCANIRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 FGHLLSLDSYQTSHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLV-EMPCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+
Sbjct: 561 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|380813818|gb|AFE78783.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Macaca
           mulatta]
 gi|384943152|gb|AFI35181.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Macaca
           mulatta]
          Length = 727

 Score =  654 bits (1688), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/708 (43%), Positives = 467/708 (65%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P+VL+ +FI++PT F+  F  ++  L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGMFIEQPTPFVSLFFQRLLQLHYPRKHMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -IGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
            GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTAHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+
Sbjct: 561 FTEAACDELVEEMEHFGQWSLGDNKDSRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|410262758|gb|JAA19345.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1 [Pan
           troglodytes]
 gi|410302672|gb|JAA29936.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1 [Pan
           troglodytes]
          Length = 727

 Score =  654 bits (1688), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/708 (44%), Positives = 466/708 (65%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVDKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P VL+ VFI++PT F+  F  ++  L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPMVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
            GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC+V A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|32307144|ref|NP_000293.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Homo
           sapiens]
 gi|78099790|sp|Q02809.2|PLOD1_HUMAN RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1;
           AltName: Full=Lysyl hydroxylase 1; Short=LH1; Flags:
           Precursor
 gi|20149013|gb|AAM12752.1| lysyl hydroxylase [Homo sapiens]
 gi|119592130|gb|EAW71724.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1, isoform CRA_a
           [Homo sapiens]
 gi|119592131|gb|EAW71725.1| procollagen-lysine 1, 2-oxoglutarate 5-dioxygenase 1, isoform CRA_a
           [Homo sapiens]
 gi|168277976|dbj|BAG10966.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor
           [synthetic construct]
          Length = 727

 Score =  654 bits (1688), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/708 (43%), Positives = 466/708 (65%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL  DSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFADSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
            GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|197102026|ref|NP_001127428.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Pongo
           abelii]
 gi|62900718|sp|Q5R9N3.1|PLOD1_PONAB RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1;
           AltName: Full=Lysyl hydroxylase 1; Short=LH1; Flags:
           Precursor
 gi|55729596|emb|CAH91527.1| hypothetical protein [Pongo abelii]
          Length = 727

 Score =  654 bits (1688), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/708 (43%), Positives = 466/708 (65%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTRIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P  L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPSSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
            GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|395528052|ref|XP_003766147.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           [Sarcophilus harrisii]
          Length = 737

 Score =  654 bits (1688), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/712 (44%), Positives = 464/712 (65%), Gaps = 12/712 (1%)

Query: 28  NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNL 86
           NI  DK LVITVA+ ETDGY RF+QSA+     VK LG  + W GGD ++++GGG KV L
Sbjct: 33  NIPTDKLLVITVATKETDGYHRFMQSAKYFNYTVKVLGKGEEWKGGDKVNAIGGGQKVRL 92

Query: 87  LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
           LK  +      +D+++  T+ YDVI  GG  ++L++F   +  +VF A+ + WPD  L D
Sbjct: 93  LKEAMGSYADQEDLVVFFTECYDVIFAGGPEELLKKFQKINHKVVFSADGILWPDKRLAD 152

Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
           KYP V  G R+LNSGGF+GYA  I  ++   ++++ +DDQL+Y  +++D   R    I L
Sbjct: 153 KYPIVHIGKRFLNSGGFVGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREALNITL 212

Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
           D    +FQ L G+++++ L F+  +     NT Y T PV+I+GNG +KI+LN FGNY+  
Sbjct: 213 DHKCRIFQALNGAIDEVLLKFENGK-ARAKNTFYETLPVVINGNGPTKIQLNYFGNYIPN 271

Query: 267 SW-KTSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKIS 324
           +W + +GCT C+L +  L +LK   +P V + VFI++PT FL  FL+ +  L YP + + 
Sbjct: 272 AWTQENGCTLCDLDVIDLSTLK--DYPRVTVGVFIEQPTPFLPRFLDLLLTLTYPKEALK 329

Query: 325 MFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK-GVDFYF 383
           +F++N++ YH     ++    K + KN+K +     ++  EARN+ ++        D+YF
Sbjct: 330 LFIHNSEVYHEKHIKEFWEKAKDVIKNIKIVGPEENLSQAEARNMGMDLCRQDDKCDYYF 389

Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
            +D+D  L NP  L+ L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ 
Sbjct: 390 SLDADVVLTNPKTLEILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQ 449

Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKID 502
           G +   G+WN+PY+ N YL+K   +++  N +  +  + +D DMA C N R  G+ + I 
Sbjct: 450 GSR--VGVWNIPYMANVYLIKGQTLRSEMNERNYFVRDKLDPDMALCRNAREMGVFMYIS 507

Query: 503 STQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
           +  E+G L+ + N++    N +++++  NP+DW  +YI+P Y K +  + +  QPCPDVF
Sbjct: 508 NRHEFGRLLSTANYNISHYNNDLWQIFENPVDWKEKYINPNYSK-IFTENLVEQPCPDVF 566

Query: 563 WFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFL 622
           WFPI +EK C E V+ ME YGQWS G ++D R+  GYE VPT DIHMKQ+GL   W  F+
Sbjct: 567 WFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIGLENEWLHFI 626

Query: 623 RKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDY 682
           R+++ P+  + F GY+ +   A ++FVV+Y PD Q SLRPHHDSST+TINIALN VG D+
Sbjct: 627 REFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRSLRPHHDSSTFTINIALNNVGQDF 685

Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           +GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL +  GTRYI +SF+DP
Sbjct: 686 QGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPIINGTRYIAVSFIDP 737


>gi|417404209|gb|JAA48874.1| Putative procollagen-lysine2-oxoglutarate 5-dioxygenase 1 [Desmodus
           rotundus]
          Length = 728

 Score =  654 bits (1687), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/709 (44%), Positives = 468/709 (66%), Gaps = 10/709 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W G   +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWPGEKETSAGGGLKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    ++++IL TDSYDV+   G  ++L++F    + +VF A+ L +PD  L  KYP 
Sbjct: 85  LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAQELIYPDRRLEAKYPM 144

Query: 151 VGS-GYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
           V   G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD  
Sbjct: 145 VSDDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPAKREQINITLDHR 204

Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
             +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W 
Sbjct: 205 CRIFQNLDGALDEVVLKFEMGH-VRARNLVYDTLPVLIHGNGPTKLQLNYLGNYIPRFWT 263

Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
             +GC  C+  ++ L  +  +  P VL+ VFI++PT FL  F  ++  L+YP +++ +F+
Sbjct: 264 FETGCVVCDEGLRSLKGIGDEALPVVLVGVFIEQPTPFLSLFFQRLLRLHYPRRRMRLFI 323

Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
           +N++++H    + ++    + +++VK +     V + +ARN+  +     +   +YF VD
Sbjct: 324 HNHEKHHKTQVEQFLAEHGSEYQSVKLVGPEVRVANADARNMGADLCRQDRSCTYYFSVD 383

Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           +   L  P +L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 AAVALTEPKILRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR 443

Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
              G+WNVPYI+N YL+K S ++A   +T ++  + +D DMAFC N+R + + + + +  
Sbjct: 444 --VGVWNVPYISNIYLIKGSALRAELQQTDLFHHSKLDPDMAFCANIRQQDVFMFLTNRH 501

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
            +GHL+  +N+     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFP
Sbjct: 502 TFGHLLSLDNYQTTHLHNDLWEVFNNPEDWKEKYIHKNYTKALAGKLVE-MPCPDVYWFP 560

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y
Sbjct: 561 IFTETACDELVEEMEHYGQWSLGDNKDSRIQGGYENVPTIDIHMNQISFEREWHKFLVEY 620

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGG
Sbjct: 621 IAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLVPHHDASTFTINIALNRVGVDYEGG 679

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GCRF+RYNC+V A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 GCRFLRYNCSVRAPRKGWALMHPGRLTHYHEGLPTTKGTRYISVSFVDP 728


>gi|189053347|dbj|BAG35176.1| unnamed protein product [Homo sapiens]
          Length = 727

 Score =  654 bits (1686), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/708 (43%), Positives = 465/708 (65%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LVITVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVITVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL  DSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFADSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K + +F++
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+
Sbjct: 324 NHEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSALRGVLQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHT 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
            GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+
Sbjct: 561 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFPRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 727


>gi|397502972|ref|XP_003822110.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
           isoform 2 [Pan paniscus]
          Length = 774

 Score =  653 bits (1685), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/707 (43%), Positives = 466/707 (65%), Gaps = 9/707 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           D  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  L
Sbjct: 73  DNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVDKGTSAGGGQKVRLLKKAL 132

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
           ++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP V
Sbjct: 133 EKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPVV 192

Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
             G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD    
Sbjct: 193 SDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCR 252

Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
           +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W   
Sbjct: 253 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFE 311

Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           +GCT C+  ++ L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K + +F++N
Sbjct: 312 TGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIHN 371

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
           ++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+D
Sbjct: 372 HEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDAD 431

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +  
Sbjct: 432 VALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 489

Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +    
Sbjct: 490 VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTL 549

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
           GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI 
Sbjct: 550 GHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIF 608

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+ 
Sbjct: 609 TEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYIA 668

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 669 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGC 727

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RF+RYNC+V A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 728 RFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 774


>gi|431906315|gb|ELK10512.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Pteropus alecto]
          Length = 727

 Score =  652 bits (1683), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/708 (43%), Positives = 467/708 (65%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W    ++S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKVTSAGGGLKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    ++++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQAKSQVVFSAEELIYPDRRLEVKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWKGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GC  C+  ++ L  +  +  P VL+ VFI++PT FL  F  ++   +YP K++ +F++
Sbjct: 264 ETGCVVCDEGLRSLKGIGDEALPIVLVGVFIEQPTPFLSLFFQRLLRFHYPRKRMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    + ++      ++++K +     V S +ARN+  +     +G  +YF VD+
Sbjct: 324 NHEQHHKAQVEQFLAEHGDEYQSMKLVGPEVRVASADARNMGADLCRQDRGCTYYFSVDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P +L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+   + 
Sbjct: 384 DVALTEPKILRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQRRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI++ YL+K S ++A   +T ++  + +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISSVYLIKGSALRAELQQTDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHI 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL+  +N+     + +++E+  NP +W  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 502 FGHLLSLDNYQTTHLHNDLWEVFNNPEEWKEKYIHENYTKALAGKLV-EMPCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+
Sbjct: 561 FTETACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 621 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 727


>gi|326932538|ref|XP_003212372.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
           [Meleagris gallopavo]
          Length = 730

 Score =  652 bits (1683), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/709 (44%), Positives = 473/709 (66%), Gaps = 10/709 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKN 89
           E+  LV+TVA+ +T+G++RF +SA+    +++ LGL + W GGD     GGG KV LLK+
Sbjct: 27  EENLLVLTVATKQTEGFRRFRRSAQFFNYKIQVLGLDEEWKGGDDKKPAGGGQKVRLLKS 86

Query: 90  ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
            L +    +D+IIL T+SYDV+   G  ++L++F    + +VF AE   +PD  L  KYP
Sbjct: 87  ALKQHADKEDLIILFTESYDVLFASGPTELLKKFKQAKSKVVFSAENYIYPDRKLEAKYP 146

Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
            V  G R+L SGGFIGYA ++K+L+     K+++ DQL+Y  +FLD   R    I LD  
Sbjct: 147 PVRDGKRFLGSGGFIGYAPNLKKLVEEWKGKDDDSDQLFYTKIFLDPEKRENINISLDHR 206

Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
           + +FQNL G+L+++ L F+ +  V   N  Y+T PVIIHGNG +K++LN  GNY+ + W 
Sbjct: 207 SRIFQNLNGALDEVVLKFE-NARVRARNLLYDTLPVIIHGNGPTKLQLNYLGNYIPQIWT 265

Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
             +GCT C+  ++ L  +K +  P +LI +FI++PT FL +F  ++ NL+YP ++I +F+
Sbjct: 266 FETGCTVCDEGLRSLTGIKDEALPMILIGIFIEQPTPFLSQFFLRLRNLHYPKQRIQIFI 325

Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
           +N++++H+   D +++     +  +K I  +  + + EARNL ++        D+YF +D
Sbjct: 326 HNHEQHHSMQVDSFVNEHSKEYLAMKVIGPDDEMENAEARNLGMDLCRKDPDCDYYFSLD 385

Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           ++  L N + L+ L+ +N+S+IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+   +
Sbjct: 386 AEVVLKNTETLRILIEQNKSVIAPLVSRHEKLWSNFWGALSPDGYYARSEDYVDIVQRRR 445

Query: 447 GGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
              G+WNVPYI++ Y++K  V+++  +   ++    +D DMAFC N+RN+G+ + + +  
Sbjct: 446 --VGLWNVPYISSVYMVKGKVLRSELDEGDLFHGGKLDADMAFCHNVRNQGVFMYLTNRH 503

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
           ++GH++  EN+     + +++++  NP DW  +YIH  Y  +L    V   PCPDV+WFP
Sbjct: 504 QFGHILSLENYQTSHLHNDLWQIFSNPEDWREKYIHENYTAALKGKLVE-MPCPDVYWFP 562

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I T+  C E V+ ME YG+WS G N D R++ GYE VPT DIHM Q+G    W +FL  Y
Sbjct: 563 IFTDTACDELVEEMEHYGKWSTGDNTDSRIQGGYENVPTIDIHMNQIGFEREWYKFLLDY 622

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + P+ E+ + GY+ +  +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG+DYEGG
Sbjct: 623 IAPITEKLYPGYYTK-TQFELAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGIDYEGG 681

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GCRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SF+DP
Sbjct: 682 GCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFIDP 730


>gi|114553998|ref|XP_514394.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
           isoform 12 [Pan troglodytes]
          Length = 774

 Score =  652 bits (1683), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/707 (43%), Positives = 466/707 (65%), Gaps = 9/707 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           D  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  L
Sbjct: 73  DNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVDKGTSAGGGQKVRLLKKAL 132

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
           ++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP V
Sbjct: 133 EKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPVV 192

Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
             G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD    
Sbjct: 193 SDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCR 252

Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
           +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W   
Sbjct: 253 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPNKLQLNYLGNYIPRFWTFE 311

Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           +GCT C+  ++ L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K + +F++N
Sbjct: 312 TGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIHN 371

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
           ++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+D
Sbjct: 372 HEQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDAD 431

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +  
Sbjct: 432 VALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 489

Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +    
Sbjct: 490 VGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTL 549

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
           GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI 
Sbjct: 550 GHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIF 608

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+ 
Sbjct: 609 TEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYIA 668

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 669 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGC 727

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RF+RYNC+V A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 728 RFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 774


>gi|194386778|dbj|BAG61199.1| unnamed protein product [Homo sapiens]
 gi|221045958|dbj|BAH14656.1| unnamed protein product [Homo sapiens]
          Length = 774

 Score =  652 bits (1681), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 309/707 (43%), Positives = 465/707 (65%), Gaps = 9/707 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           D  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  L
Sbjct: 73  DNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKAL 132

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
           ++    +D++IL  DSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP V
Sbjct: 133 EKHADKEDLVILFADSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPVV 192

Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
             G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD    
Sbjct: 193 SDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCR 252

Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
           +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W   
Sbjct: 253 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFE 311

Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           +GCT C+  ++ L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K + +F++N
Sbjct: 312 TGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIHN 371

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
           ++++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+D
Sbjct: 372 HEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDAD 431

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +  
Sbjct: 432 VALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 489

Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +    
Sbjct: 490 VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHTL 549

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
           GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI 
Sbjct: 550 GHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIF 608

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+ 
Sbjct: 609 TEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYIA 668

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 669 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGC 727

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 728 RFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 774


>gi|54111425|ref|NP_001005618.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Gallus
           gallus]
 gi|126651|sp|P24802.1|PLOD1_CHICK RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1;
           AltName: Full=Lysyl hydroxylase 1; Short=LH1; Flags:
           Precursor
 gi|212282|gb|AAA48945.1| lysyl hydroxylase [Gallus gallus]
          Length = 730

 Score =  652 bits (1681), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/709 (44%), Positives = 471/709 (66%), Gaps = 10/709 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKN 89
           E+  LV+TVA+ +T+G++RF +SA+    +++ LGL + W GGD     GGG KV LLK+
Sbjct: 27  EENLLVLTVATKQTEGFRRFRRSAQFFNYKIQVLGLDEEWKGGDDKKPAGGGQKVRLLKS 86

Query: 90  ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
            L +    +D++IL  +SYDV+   G  ++L++F    + +VF AE   +PD  L  KYP
Sbjct: 87  ALKQHADKEDLVILFIESYDVLFASGPTELLKKFKQAKSKVVFSAENYIYPDRKLEAKYP 146

Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
            V  G R+L SGGFIGYA ++K+L+     K+++ DQL+Y  +FLD   R    I LD  
Sbjct: 147 PVRDGKRFLGSGGFIGYAPNLKKLVEEWKGKDDDSDQLFYTKIFLDPEKRENINISLDHR 206

Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
           + +FQNL G+L+++ L F+ +  V   N  Y+T PVIIHGNG +K++LN  GNY+ + W 
Sbjct: 207 SRIFQNLNGALDEVVLKFE-NARVRARNLLYDTLPVIIHGNGPTKLQLNYLGNYIPQIWT 265

Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
             +GCT C+  ++ L  +K +  P +LI +FI++PT FL +F  ++ NL+YP ++I +F+
Sbjct: 266 FETGCTVCDEGLRSLTGIKDEALPMILIGIFIEQPTPFLSQFFLRLRNLHYPKQRIQIFI 325

Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
           +N++E+H+   D ++      +  +K I  +  V + EARNL ++        D+YF +D
Sbjct: 326 HNHEEHHSMQVDSFVKEHSKEYLAMKVIGPDDEVENAEARNLGMDLCRKDPDCDYYFSLD 385

Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           ++  L N + L+ L+ +N+S+IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+   +
Sbjct: 386 AEVVLKNTETLRILIEQNKSVIAPLVSRHEKLWSNFWGALSPDGYYARSEDYVDIVQRRR 445

Query: 447 GGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
              G+WNVPYI++ Y++K  V+++  +   ++    +D DMAFC N+RN+G+ + + +  
Sbjct: 446 --VGLWNVPYISSVYMVKGKVLRSELDEGDLFHGGKLDADMAFCHNVRNQGVFMYLTNRH 503

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
           ++GH++  EN+     + +++++  NP DW  +YIH  Y  +L    V   PCPDV+WFP
Sbjct: 504 QFGHILSLENYQTTHLHNDLWQIFSNPEDWREKYIHENYTAALKGKLVE-MPCPDVYWFP 562

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I T+  C E V+ ME YG+WS G N D R++ GYE VPT DIHM Q+G    W +FL  Y
Sbjct: 563 IFTDTACDELVEEMEHYGKWSTGDNTDSRIQGGYENVPTIDIHMNQIGFEREWYKFLLDY 622

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + P+ E+ + GY+ +  +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG+DYEGG
Sbjct: 623 IAPITEKLYPGYYTK-TQFELAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGIDYEGG 681

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GCRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SF+DP
Sbjct: 682 GCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFIDP 730


>gi|27806477|ref|NP_776573.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Bos
           taurus]
 gi|3283055|gb|AAC25107.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase precursor [Bos
           taurus]
          Length = 726

 Score =  651 bits (1680), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/708 (43%), Positives = 466/708 (65%), Gaps = 10/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W G  M + GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWPGEAMLA-GGGLKVRLLKKA 83

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    ++++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L   YP 
Sbjct: 84  LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEANYPV 143

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 144 VSDGKRFLGSGGFIGYAPNLIKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 203

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQN +G+L+++ L F++ + V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 204 RIFQNFHGALDEVVLKFEMGQ-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 262

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GC  C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L+YP K+  +F++
Sbjct: 263 ETGCAVCDEGLRSLKGIGDEALPAVLVGVFIEQPTPFLSLFFQRLLLLHYPQKRFRLFIH 322

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    + ++      +++VK +     V + +ARN+  +     +G  +YF VD+
Sbjct: 323 NHEQHHKAQVEQFLAEHGDEYQSVKLVGPEVRVANADARNMGADLCRQDRGCTYYFSVDA 382

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P  L+ L+ +N+++I PL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 383 DVALTEPKTLRLLIEQNKNVITPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 441

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++A   +T ++  + +D DMAFC N+R + + + + +   
Sbjct: 442 -VGVWNVPYISNIYLIKGSALRAELQETDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHS 500

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 501 FGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKMVE-MPCPDVYWFPI 559

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+
Sbjct: 560 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQINYEREWHKFLVEYI 619

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINI LN+VGVDYEGGG
Sbjct: 620 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLVPHHDASTFTINIGLNRVGVDYEGGG 678

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEG+  T+GTRYI +SFVDP
Sbjct: 679 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGVPTTKGTRYIAVSFVDP 726


>gi|151301032|ref|NP_001093074.1| lysyl hydroxylase 1 precursor [Takifugu rubripes]
 gi|146325988|dbj|BAF61136.1| lysyl hydroxylase 1 [Takifugu rubripes]
          Length = 729

 Score =  650 bits (1677), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/730 (42%), Positives = 475/730 (65%), Gaps = 12/730 (1%)

Query: 10  LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
           L +S    FI   C + + I E+K LV+TVA+ +TDG++RF++SA+     VK +G  + 
Sbjct: 7   LWISVCALFILTSCEE-QRIPEEKLLVVTVATKDTDGFRRFLRSAKHFNYTVKVVGRDEK 65

Query: 70  WLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
           W+GG+ M + GGG KV LLK+ L+EM    D IIL TDSYDV+   G  ++L++F     
Sbjct: 66  WIGGNYMGAPGGGQKVRLLKSALEEMK-NQDKIILFTDSYDVVFASGPKELLKKFQQARH 124

Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
            +VF +E L WPD  L DKYP V  G R+L SGGFIGY  +++E+++  S ++++ DQL+
Sbjct: 125 KVVFSSESLIWPDRHLEDKYPHVREGNRFLGSGGFIGYLANVREMVAEWSGEDDDSDQLF 184

Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIH 248
           +  +++D   R    I LD+   LFQNL GSL+++ L F+ D  V   N  ++T PVIIH
Sbjct: 185 FTRIYIDAAKRKSINITLDSKCRLFQNLLGSLDEVVLKFE-DGRVRARNLLHDTLPVIIH 243

Query: 249 GNGKSKIELNSFGNYLAKSWK-TSGCTRCN-LIKHLDSLKPDQFPSVLISVFIDKPTAFL 306
           GNG +K+++N  GNY+  +W   +GCT C+   + L +L+  ++P V+I +FI++PT F+
Sbjct: 244 GNGPTKLQVNYLGNYIPNAWTFETGCTVCHEEFQPLTALQESEYPLVVIGIFIERPTPFV 303

Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
             F  ++  L YP +   +  +N + +H    + ++   + +++ V+ +     ++   +
Sbjct: 304 SVFFERLLKLQYPKEHAQVVDFNKEAHHEQHVNSFLQEHRNLYRAVELLGPEEAMDGVTS 363

Query: 367 RNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGA 425
           RNLA +     +  D++F VD D  L N + LK L+     ++AP++ R  + WS+FWGA
Sbjct: 364 RNLAFDMCRQDQNCDYFFSVDIDVVLKNENALKILIEHTLPIVAPMITRTGRLWSSFWGA 423

Query: 426 LNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-TIYTLNSMDY 484
           L+ DG+YARS DY++I+   +   G+WNVPY++N YL+K  ++++      ++  + +D 
Sbjct: 424 LSPDGYYARSEDYVDIVQRRR--VGVWNVPYVSNVYLLKGGLLRSELTDFELFNSHILDP 481

Query: 485 DMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY 544
           DMAFC N+R+KGI + + +   +GH++ +EN+     + +++++  NPLDW  RYIHP Y
Sbjct: 482 DMAFCHNIRSKGIFMYVTNLHTFGHILSTENYQTGHLHNDLWQIFENPLDWQERYIHPNY 541

Query: 545 QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPT 604
              ++ D +   PCPDV+WFPI T++ C   V+ ME +G+WS G N D R++ GYE VPT
Sbjct: 542 -THIMKDHLIETPCPDVYWFPIFTDEACDHIVEEMENFGRWSGGANTDPRIQGGYENVPT 600

Query: 605 RDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH 664
            DIHM QV     W +FL +Y+ P+ E+ + GY+ +  +  ++FVVRY+PDEQP LRPHH
Sbjct: 601 IDIHMNQVNFEKEWHKFLLEYIAPITEKMYPGYYTK-AQFDLAFVVRYKPDEQPFLRPHH 659

Query: 665 DSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGT 724
           D+ST+TINIALNQVG+DY+GGGCRF+RYNC+V A R GW LMHPGRLTHYHEGL  T G 
Sbjct: 660 DASTFTINIALNQVGLDYQGGGCRFLRYNCSVEAPRKGWALMHPGRLTHYHEGLPTTAGV 719

Query: 725 RYIMISFVDP 734
           RYI +SF+DP
Sbjct: 720 RYISVSFIDP 729


>gi|190338002|gb|AAI62509.1| Plod2 protein [Danio rerio]
          Length = 733

 Score =  650 bits (1676), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 316/730 (43%), Positives = 471/730 (64%), Gaps = 14/730 (1%)

Query: 10  LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
           ++++CV   + +  NK  +I  +K LV+TVA+ ETDG+ RF+QSA      VK LG+ + 
Sbjct: 13  MLVTCVHCTLGMETNK--DIPTEKLLVLTVATQETDGFLRFMQSANYFNFNVKVLGMGEE 70

Query: 70  WLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
           W GGD+  S+GGG KV LLK  ++ +D  +D+++L  DSYD+I  GG  +IL +F   + 
Sbjct: 71  WKGGDVGRSIGGGQKVRLLKEAMESLDQQEDLVVLFVDSYDLIFAGGAEEILRKFQQSNH 130

Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
            +VF AE + WPD+ L +KYP+V SG R+LNSGG IGYA  I++L+S   + + +DDQL+
Sbjct: 131 KVVFAAEGIIWPDSQLAEKYPSVRSGKRFLNSGGIIGYAPYIQKLVSQWDLHDNDDDQLF 190

Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIH 248
           Y  +++D   R K  + LD    +FQNL G+L+++ L F   E V + NT YN+ P +IH
Sbjct: 191 YTKIYVDPIQREKLNMTLDHKCEIFQNLNGALDEVLLKFG-TERVRVRNTIYNSLPAVIH 249

Query: 249 GNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFL 306
           GN  +K+  N   NY+  +W    GCT C+  +  L  LK  +FP V + V+I++PT FL
Sbjct: 250 GNVNTKVYFNYLANYIPNAWNYERGCTICDQDMVDLSQLK--EFPQVTVGVYIEQPTPFL 307

Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
            EFL ++ +L+YP  K+++F++N++ YH      +    K +F + K +     +   EA
Sbjct: 308 PEFLERLLSLDYPKDKLNIFIHNSEVYHEKHIQKFWEENKDVFGSFKAVGPEENLTQGEA 367

Query: 367 RNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGA 425
           RN+ ++        D++F +D+D  L N   LK L+ +N  +IAPL+ R  K WSNFWGA
Sbjct: 368 RNMGMDVCRRDPSCDYFFNIDADVMLTNRQTLKLLIEQNRKIIAPLVTRHGKLWSNFWGA 427

Query: 426 LNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDY 484
           L+ DG+YARS DY++I+ G +   G+WN+P++ + YL+K   ++     + ++ L  +D 
Sbjct: 428 LSLDGYYARSEDYIDIVQGKR--VGVWNIPFLAHVYLIKGQTLRNELKERNVFVLEKLDP 485

Query: 485 DMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY 544
           DMA C N R+ G+ + + +  E+G L+ + N++    N +++++  NPLDW  +YIH  Y
Sbjct: 486 DMAMCRNARDLGLFMYLTNRHEFGRLISTANYNTSHYNNDLWQIFENPLDWREKYIHANY 545

Query: 545 QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPT 604
            + +  + +  QPCPDVFWFP+++EK C+E V+ ME +G WS G + DKR+  GYE+VPT
Sbjct: 546 TR-IFTENLLEQPCPDVFWFPVLSEKACNELVEEMENHGTWSGGKHEDKRITGGYESVPT 604

Query: 605 RDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH 664
            DIHMKQ+     W  F+R+++ P+  + F GY+ +   A M+FVV+Y PD Q  LRPHH
Sbjct: 605 DDIHMKQINYDQEWLHFIREFISPVTLKVFSGYYTKGY-AIMNFVVKYTPDRQAYLRPHH 663

Query: 665 DSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGT 724
           DSST+TINIALN  G+D+ GGGCRF RYNC++ + R GW  MHPGRLTH HEGL VT GT
Sbjct: 664 DSSTFTINIALNNKGLDFLGGGCRFHRYNCSIESPRKGWSFMHPGRLTHLHEGLPVTNGT 723

Query: 725 RYIMISFVDP 734
           RYI +SFVDP
Sbjct: 724 RYIAVSFVDP 733


>gi|194440678|ref|NP_001007378.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 1
           precursor [Danio rerio]
 gi|70779499|gb|AAZ08242.1| procollagen lysine 2-oxoglutarate 5-dioxygenase 2a isoform [Danio
           rerio]
          Length = 733

 Score =  650 bits (1676), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 316/730 (43%), Positives = 471/730 (64%), Gaps = 14/730 (1%)

Query: 10  LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
           ++++CV   + +  NK  +I  +K LV+TVA+ ETDG+ RF+QSA      VK LG+ + 
Sbjct: 13  MLVTCVHCTLGMETNK--DIPTEKLLVLTVATQETDGFLRFMQSANYFNFNVKVLGMGEE 70

Query: 70  WLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
           W GGD+  S+GGG KV LLK  ++ +D  +D+++L  DSYD+I  GG  +IL +F   + 
Sbjct: 71  WKGGDVGHSIGGGQKVRLLKEAMESLDQQEDLVVLFVDSYDLIFAGGAEEILRKFQQSNH 130

Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
            +VF AE + WPD+ L +KYP+V SG R+LNSGG IGYA  I++L+S   + + +DDQL+
Sbjct: 131 KVVFAAEGIIWPDSQLAEKYPSVRSGKRFLNSGGIIGYAPYIQKLVSQWDLHDNDDDQLF 190

Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIH 248
           Y  +++D   R K  + LD    +FQNL G+L+++ L F   E V + NT YN+ P +IH
Sbjct: 191 YTKIYVDPIQREKLNMTLDHKCEIFQNLNGALDEVLLKFG-TERVRVRNTIYNSLPAVIH 249

Query: 249 GNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFL 306
           GN  +K+  N   NY+  +W    GCT C+  +  L  LK  +FP V + V+I++PT FL
Sbjct: 250 GNVNTKVYFNYLANYIPNAWNYERGCTICDQDMVDLSQLK--EFPQVTVGVYIEQPTPFL 307

Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
            EFL ++ +L+YP  K+++F++N++ YH      +    K +F + K +     +   EA
Sbjct: 308 PEFLERLLSLDYPKDKLNIFIHNSEVYHEKHIQKFWEENKDVFGSFKAVGPEENLTQGEA 367

Query: 367 RNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGA 425
           RN+ ++        D++F +D+D  L N   LK L+ +N  +IAPL+ R  K WSNFWGA
Sbjct: 368 RNMGMDVCRRDPSCDYFFNIDADVMLTNRQTLKLLIEQNRKIIAPLVTRHGKLWSNFWGA 427

Query: 426 LNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDY 484
           L+ DG+YARS DY++I+ G +   G+WN+P++ + YL+K   ++     + ++ L  +D 
Sbjct: 428 LSLDGYYARSEDYIDIVQGKR--VGVWNIPFLAHVYLIKGQTLRNELKERNVFVLEKLDP 485

Query: 485 DMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY 544
           DMA C N R+ G+ + + +  E+G L+ + N++    N +++++  NPLDW  +YIH  Y
Sbjct: 486 DMAMCRNARDLGLFMYLTNRHEFGRLISTANYNTSHYNNDLWQIFENPLDWREKYIHANY 545

Query: 545 QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPT 604
            + +  + +  QPCPDVFWFP+++EK C+E V+ ME +G WS G + DKR+  GYE+VPT
Sbjct: 546 TR-IFTENLLEQPCPDVFWFPVLSEKACNELVEEMENHGTWSGGKHEDKRITGGYESVPT 604

Query: 605 RDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH 664
            DIHMKQ+     W  F+R+++ P+  + F GY+ +   A M+FVV+Y PD Q  LRPHH
Sbjct: 605 DDIHMKQINYDQEWLHFIREFISPVTLKVFSGYYTKGY-AIMNFVVKYTPDRQAYLRPHH 663

Query: 665 DSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGT 724
           DSST+TINIALN  G+D+ GGGCRF RYNC++ + R GW  MHPGRLTH HEGL VT GT
Sbjct: 664 DSSTFTINIALNNKGLDFLGGGCRFHRYNCSIESPRKGWSFMHPGRLTHLHEGLPVTNGT 723

Query: 725 RYIMISFVDP 734
           RYI +SFVDP
Sbjct: 724 RYIAVSFVDP 733


>gi|196007006|ref|XP_002113369.1| hypothetical protein TRIADDRAFT_50405 [Trichoplax adhaerens]
 gi|190583773|gb|EDV23843.1| hypothetical protein TRIADDRAFT_50405 [Trichoplax adhaerens]
          Length = 702

 Score =  650 bits (1676), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/708 (45%), Positives = 464/708 (65%), Gaps = 14/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ++  L +TVAS+ TDG++RF +S  +  L  K LG+++ W GG M   GGGYK+NLL+ E
Sbjct: 5   KETLLTLTVASDCTDGFQRFNRSCRIYDLNCKILGMNKIWKGGSMEFPGGGYKINLLRRE 64

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L+ +   DD II+VTDSYDVI   G  +ILE+F+ F+AN+VFGAE  CWP+  L   YP 
Sbjct: 65  LERLKDKDD-IIIVTDSYDVIYTAGTQEILEKFHQFNANVVFGAEPYCWPNQELASHYPV 123

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V SG R+LNSGGFIG+A+ I E+I+ RSI++ +DDQLYY  ++LD  LR K  I LD  +
Sbjct: 124 VSSGKRFLNSGGFIGHARTIYEIITYRSIEDSDDDQLYYTEIYLDSKLRDKWNIKLDHKS 183

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSK-IELNSFGNYLAKSWK 269
            LF NL G+ E++ L  D      L N  Y T P+ +HGNG +K + LN FGNYLA  W 
Sbjct: 184 VLFHNLNGAQEEVNLIPDNGGKYRLFNEVYQTLPIAVHGNGPTKEVSLNYFGNYLANYWS 243

Query: 270 -TSGCTRC--NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
              GC  C  N I+H   LK D +P + +++FI K   F + FL +++N  YP  KI +F
Sbjct: 244 FNDGCIACKENTIEH---LKKDHYPRLSLAIFIHKSAPFTDVFLQRLSNQQYPKDKIDLF 300

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVD 386
           ++ + ++H      +   + +++ + + I  +  +N  +ARNLA+E       +++F ++
Sbjct: 301 LHISIDHHLKDTLVWWKKYSSLYASQELIVPSDKINPSKARNLAMEQCQSSNCEYFFSIE 360

Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           +D  L N +  K L++ N ++++PLL    + WSNFWGA++ +G+YARS DY++I+ G +
Sbjct: 361 NDCMLTNNETFKLLMHYNSTIVSPLLFISGRLWSNFWGAIDQNGYYARSKDYIDIVEGRK 420

Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             +GIWNVPYI   Y++  S +K  ++         D+DM +C  +R  G  + +++ Q 
Sbjct: 421 --RGIWNVPYIRGAYMINKSHLKMPDLAFD---EEGDFDMKWCAKMRKSGTFMYVNNMQI 475

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL++ +++     + ++Y++I N  DW+ +YIH  Y  +L  D     PC DV+WFP+
Sbjct: 476 FGHLLNLKSYSIDHLHNDLYQIIDNQPDWEAKYIHENYSINLRDDHEIQMPCSDVYWFPV 535

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
           V+E FC   V+ ME +GQWS G + D RL+ GYE VPTRDIH+KQ+ L   W  FL+KY+
Sbjct: 536 VSEIFCKHLVEEMENFGQWSAGGHKDSRLDGGYENVPTRDIHLKQINLEQQWLYFLQKYI 595

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
           VP+Q + + G++ +   A M+FVVRY P  QPSLRPHHD+STYTINIAL + G+D+EGGG
Sbjct: 596 VPIQAKVYPGFYSKG-HAFMNFVVRYHPTGQPSLRPHHDASTYTINIALTRAGIDHEGGG 654

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+R NC+V  T +GW LMHPGRLTHYHEGL  T+GTRYIM+SF+DP
Sbjct: 655 CRFLRQNCSVVNTMLGWSLMHPGRLTHYHEGLPTTKGTRYIMVSFIDP 702


>gi|348571365|ref|XP_003471466.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
           [Cavia porcellus]
          Length = 727

 Score =  649 bits (1675), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/708 (44%), Positives = 460/708 (64%), Gaps = 9/708 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ E++G++RF +SA+    +V+ LGL + W     +  GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKESEGFRRFKRSAQFFNYKVQALGLGEDWDVERGTMTGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYPA
Sbjct: 85  LEKHADKEDLVILFTDSYDVVFASGPRELLKKFRQARSRVVFSAEDLIYPDRRLEAKYPA 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGF+GYA  + +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSEGKRFLGSGGFVGYAPSLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINISLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C   ++ L  ++    P+VL+ VFI++PT FL  F  ++  L YP  ++ +F++
Sbjct: 264 ETGCTVCEEGLRSLKGMEDRALPTVLVGVFIEQPTPFLSLFFQRLLRLRYPRSQMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    + ++      +++VK +     + S  ARNL  +         +YF +D+
Sbjct: 324 NHEQHHKAQVEQFLAEHGGEYQSVKLVGPEVRMESANARNLGADLCRQDHTCTYYFSMDA 383

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  PD L+ L+ +N+++IAPLL R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 384 DVALTEPDSLRLLIEQNKNVIAPLLTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 442

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++    +T ++    +D DMAFC N+R + + + + +   
Sbjct: 443 -VGVWNVPYISNIYLIKGSSLRTELQRTDLFHHRKLDPDMAFCANIRQQEVFMFLTNRHS 501

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL+  +++     + +++E+  NP DW  +YIH  Y ++L    V   PCPDV+WFPI
Sbjct: 502 FGHLLSLDSYQTTHLHNDLWEIFSNPEDWKEKYIHENYTEALAGKLVET-PCPDVYWFPI 560

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E VQ ME +GQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+
Sbjct: 561 FTETACDELVQEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYI 620

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHDSST+T+NIALN+VG DYEGGG
Sbjct: 621 APVTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDSSTFTVNIALNRVGEDYEGGG 679

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 CRFLRYNCSIRAPRKGWALMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 727


>gi|449268427|gb|EMC79291.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Columba livia]
          Length = 730

 Score =  649 bits (1675), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/709 (44%), Positives = 468/709 (66%), Gaps = 10/709 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKN 89
           E+  LV+TVA+ +T+G++RF +SA+    +++ LGL + W GGD     GGG KV LLK+
Sbjct: 27  EENLLVLTVATKQTEGFQRFRRSAQFFNYKIQVLGLDEEWQGGDDKKPAGGGQKVRLLKS 86

Query: 90  ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
            L +    +D+IIL  DSYDV+   G  ++L++F    + +VF AE   +PD  L  KYP
Sbjct: 87  ALKQYADKEDLIILFIDSYDVLFASGPTELLKKFKQAKSKVVFSAENYIYPDRKLEAKYP 146

Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
            V  G R+L SGGFIGYA ++K+L+     K+++ DQL+Y  +FLD   R    I LD  
Sbjct: 147 QVRDGKRFLGSGGFIGYAPNLKKLVEEWKGKDDDSDQLFYTNVFLDPEKRESINISLDQR 206

Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
           + +FQNL G+L+++ + F+ +  V   N  Y+T PV+IHGNG +K++LN  GNY+ + W 
Sbjct: 207 SRIFQNLNGALDEVVMKFE-NSRVRARNLLYDTLPVVIHGNGPTKLQLNYLGNYIPQIWT 265

Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
             +GCT C+  ++ L   K +  P +LI +FI++PT FL +F  ++ NL+YP ++I +F+
Sbjct: 266 FETGCTVCDEGLRSLTGFKDEALPVILIGIFIEQPTPFLSQFFLRLRNLHYPKQRIQLFI 325

Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
           +N++E+H+   D ++      +  VK I  +  V +  ARNL ++        D+YF +D
Sbjct: 326 HNHEEHHSMQVDSFVEEHGKEYLAVKVIGPDDEVENAVARNLGMDLCRKDPDCDYYFSLD 385

Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           ++  L N + L+ L+ +N+ +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+   +
Sbjct: 386 AEIVLKNTETLRILIEQNKMVIAPLVSRHEKLWSNFWGALSPDGYYARSEDYVDIVQRRR 445

Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
              G+WNVPYI++ Y++K   +++   +T ++    +D DMAFC N+RN+G+ + + +  
Sbjct: 446 --VGLWNVPYISSVYMIKAKALRSELDQTDLFHSGKLDADMAFCHNVRNQGVFMYLTNRH 503

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
           ++GH++  EN+     + +++++  NP DW  +YIH  Y  +L    V   PCPDV+WFP
Sbjct: 504 QFGHILSLENYQTTHLHNDLWQIFSNPEDWREKYIHENYTAALKGKLVE-VPCPDVYWFP 562

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I T+  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+G    W +FL  Y
Sbjct: 563 IFTDTACDELVEEMEHYGQWSTGDNTDSRIQGGYENVPTIDIHMNQIGFEREWYKFLLDY 622

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + P+ E+ + GY+ +  +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG+DYEGG
Sbjct: 623 IAPITEKLYPGYYTK-AQFELAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGIDYEGG 681

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GCRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SF+DP
Sbjct: 682 GCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFLDP 730


>gi|218931165|ref|NP_036091.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 2
           precursor [Mus musculus]
 gi|341941297|sp|Q9R0B9.2|PLOD2_MOUSE RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2;
           AltName: Full=Lysyl hydroxylase 2; Short=LH2; Flags:
           Precursor
          Length = 737

 Score =  648 bits (1672), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 316/710 (44%), Positives = 459/710 (64%), Gaps = 10/710 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
           I  DK LVITVA+ E DG+ RF+ SA+     VK LG  Q W GGD M+S+GGG KV LL
Sbjct: 34  IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  ++     +D++IL T+ +DV+  GG  ++L++F   +  IVF A+ L WPD  L DK
Sbjct: 94  KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G RYLNSGGFIGYA  I  L+   ++++ +DDQL+Y  +++D   R    I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQ L G+ +++ L F+  +   + NT Y T PV I+GNG +KI LN FGNY+  S
Sbjct: 214 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           W + +GC  C+ +  +D    D  P V + VFI++PT FL  FLN +  L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
           ++N + YH      ++   K    ++K +     ++  EARN+ ++     +  D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L NP  LK+L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKFLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
           +   GIWNVPY+ N YL++   +++  N +  +  + +D DMA C N R+ G+ + I + 
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMGVFMYISNR 509

Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
            E+G L+ + N++    N + +++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWF
Sbjct: 510 HEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWF 568

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           PI +E+ C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQ+GL  VW  F+R+
Sbjct: 569 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQIGLENVWLHFIRE 628

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           ++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 629 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 687

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 688 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737


>gi|363737318|ref|XP_422695.3| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           [Gallus gallus]
          Length = 881

 Score =  647 bits (1670), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 308/707 (43%), Positives = 459/707 (64%), Gaps = 10/707 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
           D  LV TVA+ ETDG+ RF+Q+A+     VK LG  + W GG+++ S+GGG KV LLK  
Sbjct: 181 DNLLVFTVATKETDGFHRFMQTAKHFNYTVKVLGKGEEWKGGELANSIGGGQKVRLLKEG 240

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           +      +D+I++  + YDVI  GG  ++L++F   +  +VF A+ L WPD  L DKYP 
Sbjct: 241 IQSYADQEDLIVMFVECYDVIFAGGPEELLKKFQETNHKVVFAADGLIWPDKRLADKYPV 300

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V SG R+LNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D   R +  I LD   
Sbjct: 301 VRSGKRFLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYVDPLARERLNITLDHKC 360

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
            +FQ L G+++++ LNF+  + V   N+ Y T P+ + GNG +KI LN  GNY+  +W +
Sbjct: 361 AIFQTLNGAVDEVHLNFEEGK-VRARNSVYETLPITVLGNGPTKIYLNYLGNYIPNAWTR 419

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
            +GC  C+L   LD     ++P V I VFI++PT FL +FL+++  L+YP + +S+F++N
Sbjct: 420 ETGCNICDL-DMLDLSTVTEYPRVKIGVFIEQPTPFLPKFLDRLLTLDYPKEALSVFIHN 478

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
           N+ YH      +    K + +N+K +     ++  EARN+ ++     +  ++YF +D+D
Sbjct: 479 NEVYHEKHIKKFWEKAKNIIRNIKIVGPEENLSQAEARNMGMDLCRQDEACEYYFSIDAD 538

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++  
Sbjct: 539 VVLTNPKTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYIDIVQGNR-- 596

Query: 449 KGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            G+WN+PY+ N YL+K   +++    K  +  + +D DMA C N R  G+ + I +  E+
Sbjct: 597 VGVWNIPYMANIYLIKGQTLRSEMKEKNYFMRDKLDPDMALCRNAREMGVFMYITNRHEF 656

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
           G L+ + N++    N +++++  NP+DW   YI+P Y K +  D +  QPCPDVFWFPI 
Sbjct: 657 GRLISTANYNTSHYNNDLWQIFENPVDWKETYINPNYSK-IFTDNIVEQPCPDVFWFPIF 715

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           ++  C E V+ ME +GQWS G + D R+  GYE VPT DIHMKQ+GL   W  F+R+++ 
Sbjct: 716 SDTACDELVEEMEHFGQWSGGKHQDSRISGGYENVPTDDIHMKQIGLDNEWLHFIREFIA 775

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           P+  + F GY+ +   A ++FVV+Y PD Q SLRPHHDSST+TINIALN+VG D++GGGC
Sbjct: 776 PVTLKVFAGYYTKGY-ALLNFVVKYSPDRQRSLRPHHDSSTFTINIALNKVGEDFQGGGC 834

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           +F+RYNC++ + R GW  MHPGRLTH HEGL +  GTRYI +SF+DP
Sbjct: 835 KFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPILNGTRYIAVSFIDP 881


>gi|348568796|ref|XP_003470184.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
           5-dioxygenase 3-like [Cavia porcellus]
          Length = 737

 Score =  647 bits (1669), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/711 (44%), Positives = 464/711 (65%), Gaps = 11/711 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY+RF+QSAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 33  VNPEKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 92

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+ G  +++L++F    + ++F AE  CWPD  L ++
Sbjct: 93  KKEMEKYADQEDMIIMFVDSYDVILAGSPSELLKKFVQSGSRLLFSAEGFCWPDWGLAEQ 152

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+ +DDQL+Y  L+LD  +R K  + LD
Sbjct: 153 YPEVGTGKRFLNSGGFIGFAPTIHQIVHQWKYKDNDDDQLFYTRLYLDPGVREKFSLNLD 212

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 213 HKSRIFQNLNGALDEVVLKFDQNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 271

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFP-SVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +   +L   Q P       F+++PT FL   L  +  L   A+++S+
Sbjct: 272 WTPQGGCGFCN--RDRRTLPGGQLPPGCCWPCFVEQPTPFLPCVLAALLLLRLTARQVSL 329

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F+++++ YH P   D     +  F +VK +     ++  EAR++A++        +FYF 
Sbjct: 330 FLHDSEVYHEPHIADAWPQLQDHFASVKLLGPEEALSPGEARDMAMDICRQDPECEFYFS 389

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L N   L+ L+ +N  +IAP+L R  K WSNFWGAL+ + +YARS DY+ ++  
Sbjct: 390 LDADAVLTNQQTLRILIEQNRKVIAPMLSRHGKLWSNFWGALSPEEYYARSEDYVELVQR 449

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI   Y+++   ++   + + +++ + MD DMAFC NLR++GI L + +
Sbjct: 450 KR--LGVWNVPYIAQAYVIRGETLRTELSQREVFSGSDMDPDMAFCMNLRDRGIFLHLSN 507

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D    +P+++++  NP+DW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 508 QHEFGRLLATSRYDTDHLHPDLWQIFNNPVDWKEQYIHENYSRALHGEGLVEQPCPDVYW 567

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 568 FPLLSEQMCDELVEEMENYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 627

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+D  
Sbjct: 628 TYVGPMTEHLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDMR 686

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
                 +RY+C +++ R GW L+HPGRLTHYHEGL  T GTRYIM+SFVDP
Sbjct: 687 XAAAALLRYDCIISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 737


>gi|148688975|gb|EDL20922.1| procollagen lysine, 2-oxoglutarate 5-dioxygenase 2, isoform CRA_a
           [Mus musculus]
          Length = 737

 Score =  647 bits (1668), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 316/710 (44%), Positives = 458/710 (64%), Gaps = 10/710 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
           I  DK LVITVA+ E DG+ RF+ SA+     VK LG  Q W GGD M+S+GGG KV LL
Sbjct: 34  IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  ++     +D++IL T+ +DV+  GG  ++L++F   +  IVF A+ L WPD  L DK
Sbjct: 94  KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G RYLNSGGFIGYA  I  L+   ++++ +DDQL+Y  +++D   R    I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQ L G+ +++ L F+  +   + NT Y T PV I+GNG +KI LN FGNY+  S
Sbjct: 214 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           W + +GC  C+ +  +D    D  P V + VFI++PT FL  FLN +  L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
           ++N + YH      ++   K    ++K +     ++  EARN+ ++     +  D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
           +   GIWNVPY+ N YL++   +++  N +  +  + +D DMA C N R+ G+ + I + 
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMGVFMYISNR 509

Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
            E+G L+ + N++    N + +++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWF
Sbjct: 510 HEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWF 568

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           PI +E+ C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQ+GL  VW  F+R+
Sbjct: 569 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQIGLENVWLHFIRE 628

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           ++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 629 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 687

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 688 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737


>gi|126338250|ref|XP_001371794.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           [Monodelphis domestica]
          Length = 758

 Score =  646 bits (1667), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/755 (42%), Positives = 470/755 (62%), Gaps = 37/755 (4%)

Query: 10  LILSCVVFFI----SVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLG 65
           L+L+  + F+    +    K   I  DK LV+TVA+ ETDGY RF+QSA+     VK LG
Sbjct: 11  LLLALSLHFVKACAAAEAQKPSIIPTDKLLVLTVATQETDGYHRFMQSAKYFNYTVKVLG 70

Query: 66  LHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFN 124
             + W GGD + ++GGG KV LLK  +      +D+I+  T  YDVI  GG  ++L++F 
Sbjct: 71  KGEEWKGGDKANTIGGGQKVRLLKEAMGSYADQEDLIVFFTQCYDVIFAGGPEELLKKFQ 130

Query: 125 TFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEED 184
             +  +VF A+ + WPD  L DKYP V  G R+LNSGGF+GYA  I  ++   ++++ +D
Sbjct: 131 KINHKVVFSADGILWPDKKLADKYPIVHIGKRFLNSGGFVGYAPYINHIVQQWNLQDNDD 190

Query: 185 DQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNP 244
           DQL+Y  +++D   R    I LD    +FQ L G+++++ L F+  +     NT Y T P
Sbjct: 191 DQLFYTKIYIDPLKREALNITLDHKCRIFQALNGAIDEVLLKFENGK-ARAKNTFYETLP 249

Query: 245 VIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKP 302
           VII+GNG +KI+LN FGNY+  +W + +GCT C+L +  L +L  + +P V I VFI++P
Sbjct: 250 VIINGNGPTKIQLNYFGNYVPNAWTQENGCTLCDLDVIDLSTL--EDYPRVTIGVFIEQP 307

Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
           T FL  FL  +  L+YP + I +F++N + YH     ++    K + KN+K +     ++
Sbjct: 308 TPFLPRFLELLLTLDYPKEAIKLFIHNKEVYHEKHIKEFWEKAKDVIKNIKIVGPEENLS 367

Query: 363 SKEARNLAVENSLHKG-VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
             EARN+ ++     G  D+YF +D+D  L NP  LK L+ +N  +IAPL+ R  K WSN
Sbjct: 368 QAEARNMGMDLCRQDGQCDYYFSLDADVVLTNPKTLKILIEQNRKIIAPLVTRHGKLWSN 427

Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLN 480
           FWGAL+ DG+YARS DY++I+ G +   G+WNVPY+ N YL+K   +++  N +  +  +
Sbjct: 428 FWGALSPDGYYARSEDYVDIVQGSR--VGVWNVPYMANVYLIKGQTLRSEMNERNYFVRD 485

Query: 481 SMDYDMAFCTNLRN---------------------KGIHLKIDSTQEYGHLVDSENFDPQ 519
            +D DMA C N R                      KG+ + I +  E+G L+ + N++  
Sbjct: 486 KLDPDMALCRNAREMTIQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTS 545

Query: 520 KTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIM 579
             N +++++  NP+DW  RYI+  Y K +  + +  QPCPDVFWFPI +EK C E V+ M
Sbjct: 546 HYNNDLWQIFENPVDWKERYINHNYSK-IFTENLVEQPCPDVFWFPIFSEKACDELVEEM 604

Query: 580 EAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHH 639
           E YGQWS G ++D R+  GYE VPT DIHMKQ+GL   W  F+R+++ P+  + F GY+ 
Sbjct: 605 EHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIGLENEWLHFIREFIAPVTLKVFAGYYT 664

Query: 640 EPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTAT 699
           +   A ++FVV+Y PD Q SLRPHHDSST+TINIALN VG D++GGGC+F+RYNC++ + 
Sbjct: 665 KGF-ALLNFVVKYSPDRQRSLRPHHDSSTFTINIALNNVGEDFQGGGCKFLRYNCSIESP 723

Query: 700 RMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           R GW  MHPGRLTH HEGL +  GTRYI +SF+DP
Sbjct: 724 RKGWSFMHPGRLTHLHEGLPIINGTRYIAVSFIDP 758


>gi|148230120|ref|NP_001088279.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 1 precursor
           [Xenopus laevis]
 gi|54038674|gb|AAH84287.1| LOC495112 protein [Xenopus laevis]
          Length = 725

 Score =  646 bits (1667), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/711 (45%), Positives = 463/711 (65%), Gaps = 16/711 (2%)

Query: 28  NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLL 87
           N D+DK LV+TVA+ ETDG +RF +SA     +VK LGL   WLG        G KV L+
Sbjct: 27  NPDDDKLLVLTVATEETDGLRRFQRSAHSFNYKVKVLGLGGQWLGE-------GQKVQLM 79

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  L+     +D+IIL T+SYDVI   G  ++L++F    + +VF AE + +PD  L  K
Sbjct: 80  KLALEPYADKEDLIILFTESYDVIFAAGPGELLKKFRQAKSKVVFSAESVAYPDRHLESK 139

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G R+L SGGFIGYA  + +++++    +++ DQL+Y  LFLD   R K  I LD
Sbjct: 140 YPVVPEGKRFLGSGGFIGYAAYLYKMVADWDGTDKDSDQLFYTKLFLDPVKRGKVNITLD 199

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQNLYGS ED+ L F+    V   N  Y+T PV+IHGNG +K+ LN  GNY+   
Sbjct: 200 HRCRIFQNLYGSAEDVVLKFEHGR-VRARNLVYDTLPVLIHGNGPTKLHLNYLGNYIPHV 258

Query: 268 WK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W   SGC  C+  +++L+SL  D  P V+I +FI++PT F+ EF  ++ NLNYP  +I +
Sbjct: 259 WTFESGCNVCDEGLRNLESLSVDTLPLVVIGIFIEQPTPFVSEFFKRLNNLNYPKNRIQL 318

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKG-VDFYFY 384
           ++ N++ +H    ++++ +  T +  VK +      +  +ARN  ++        ++YF 
Sbjct: 319 YISNHEPHHQRRVENFLQDHGTQYNFVKTVGPEENSDFADARNKGMDMCRQTPECEYYFS 378

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+   L NP+VL+ L+ +N+S+IAPL+ R    WSNFWGAL++DG+YARS DY++I+  
Sbjct: 379 IDAPVVLKNPNVLRILIEQNKSVIAPLVSRNANLWSNFWGALSSDGYYARSEDYIDIVQR 438

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI++ YL+K S++++  +   ++   ++D DM FC N+R +GI + + +
Sbjct: 439 QR--IGVWNVPYISSVYLVKGSILRSKLSQNDMFHSGTLDSDMVFCDNVRQQGIFMFVTN 496

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
            QE+GH++  EN+     + +++E+  N  DW  +YIHP Y ++L    V   PCPDV+W
Sbjct: 497 RQEFGHILSLENYKTTHLHNDLWEIFENTEDWKEKYIHPNYSEALKGKLVE-MPCPDVYW 555

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+ TE  C+E V+ ME +G+WS G N D+RL+ GYE VPT DIHM Q+     W + L 
Sbjct: 556 FPLFTETTCNEIVEEMENFGKWSGGGNKDERLQGGYENVPTIDIHMNQIDYEKEWHKILL 615

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            ++ PL ++ F GY+    +  ++FVVRY+PDEQP L PHHD+ST+TINIALN VG DYE
Sbjct: 616 DFIAPLTQKMFPGYYTS-AQFDLAFVVRYKPDEQPLLEPHHDASTFTINIALNSVGQDYE 674

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RYNC+V A R GW LMHPGRLTHYHEGL+VT+GTRYI +SFVDP
Sbjct: 675 GGGCRFLRYNCSVRAPRKGWALMHPGRLTHYHEGLRVTKGTRYIAVSFVDP 725


>gi|5852295|gb|AAD53987.1|AF080572_1 lysyl hydroxylase isoform 2 [Mus musculus]
          Length = 737

 Score =  646 bits (1666), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 316/710 (44%), Positives = 458/710 (64%), Gaps = 10/710 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
           I  DK LVITVA+ E DG+ RF+ SA+     VK LG  Q W GGD M+S+GGG KV LL
Sbjct: 34  IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  ++     +D++IL T+ +DV+  GG  ++L++F   +  IVF A+ L WPD  L DK
Sbjct: 94  KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G RYLNSGGFIGYA  I  L+   ++++ +DDQL+Y  +++D   R    I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQ L G+ +++ L F+  +   + NT Y T PV I+GNG +KI LN FGNY+  S
Sbjct: 214 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           W + +GC  C+ +  +D    D  P V + VFI++PT FL  FLN +  L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
           ++N + YH      ++   K    ++K +     ++  EARN+ ++     +  D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
           +   GIWNVPY+ N YL++   +++  N +  +  + +D DMA C N R+ G+ + I + 
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMGVFMYISNR 509

Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
            E+G L+ + N++    N + +++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWF
Sbjct: 510 HEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWF 568

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           PI +E+ C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQ+GL  VW  F+R+
Sbjct: 569 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQIGLENVWLHFIRE 628

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           ++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 629 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 687

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 688 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737


>gi|224079514|ref|XP_002194070.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
           [Taeniopygia guttata]
          Length = 730

 Score =  646 bits (1666), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/709 (44%), Positives = 465/709 (65%), Gaps = 10/709 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKN 89
           E+  LV+TVA+ +T+G++RF +SA+    +V+ LGL + W GGD     GGG KV LLK+
Sbjct: 27  EENLLVLTVATKQTEGFQRFRRSAQFFNYKVQVLGLDEEWQGGDDQQPAGGGQKVRLLKS 86

Query: 90  ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
            L +    +D+IIL  +SYDV+   G  ++L++F    + +VF AE   +PD  +  KYP
Sbjct: 87  ALQQYVDKEDLIILFVESYDVLFASGPTELLKKFKQAKSKVVFSAENYIYPDRKVEAKYP 146

Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
            V  G R+L SGGFIGYA  +K+L+     ++++ DQL+Y  +FLD   R    I LD  
Sbjct: 147 QVRDGKRFLGSGGFIGYAPYLKKLVEEWKGQDDDSDQLFYTNIFLDPEKRESINISLDHR 206

Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
           + +FQNL G+L++I L F+ +  V   N  Y+T PV+IHGNG +K++LN  GNY+ + W 
Sbjct: 207 SRIFQNLNGALDEIVLKFE-NSRVRARNLLYDTLPVVIHGNGPTKLQLNYLGNYIPQIWT 265

Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
             +GCT C+  ++ L   K +  P +LI +FI++PT FL +F  ++ NL+YP ++I +F+
Sbjct: 266 FETGCTVCDEGLRSLSGFKDEALPMILIGIFIEQPTPFLSQFFLRLRNLHYPKQRIQLFI 325

Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
           +N++E+H    D ++      +  V+ I  +  V + EARNL ++        D+YF +D
Sbjct: 326 HNHEEHHLMEVDSFVEEHGREYLTVQVIGPDDEVENAEARNLGMDLCRKDPDCDYYFSLD 385

Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           ++  L N + L+ L+ +N+ +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+   +
Sbjct: 386 AEVVLKNTETLRILIEQNKLVIAPLVSRHEKLWSNFWGALSPDGYYARSEDYVDIVQRRR 445

Query: 447 GGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
              G+WNVPYI++ YL+K   +++      ++    +D DMAFC N+RN+G+ + + +  
Sbjct: 446 --VGLWNVPYISSVYLVKGKALRSELEQGDLFHSGKLDADMAFCHNIRNQGVFMYLTNQH 503

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
           ++GH++  EN+     + +++++  NP DW  +YIH  Y  +L    V   PCPDV+WFP
Sbjct: 504 QFGHILSLENYQTSHLHNDLWQIFSNPEDWREKYIHENYTAALKGKLVE-MPCPDVYWFP 562

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I T+  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+G    W +FL  Y
Sbjct: 563 IFTDTACDELVEEMEHYGQWSTGDNTDSRIQGGYENVPTIDIHMNQIGFEREWYKFLLDY 622

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + P+ E+ + GY+ +  +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG+DYEGG
Sbjct: 623 IAPITEKLYPGYYTK-TQFELAFVVRYKPDEQPSLVPHHDASTFTINIALNRVGIDYEGG 681

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GCRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SF+DP
Sbjct: 682 GCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFLDP 730


>gi|426218182|ref|XP_004003328.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 1 [Ovis aries]
          Length = 739

 Score =  645 bits (1665), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/720 (43%), Positives = 463/720 (64%), Gaps = 10/720 (1%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD +++
Sbjct: 26  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINT 85

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     +D+++L T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 86  IGGGQKVRLMKEVMEHYANQEDLVVLFTECFDVIFAGGPEEVLKKFQKSNHKVVFAADGI 145

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D  
Sbjct: 146 LWPDKRLADKYPIVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 205

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     N  Y T PV+I+GNG +KI L
Sbjct: 206 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVMINGNGPTKILL 264

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  +W + +GCT C  +  +D  + D +P+V I VFI++PT FL  FLN +  L
Sbjct: 265 NYFGNYIPNAWTQDNGCTLCE-VDTIDLSEVDVYPNVTIGVFIEQPTPFLPRFLNTLLTL 323

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP K +  F++N + YH      +    K     +K +     ++  EARN+ ++    
Sbjct: 324 DYPKKALKFFIHNKEVYHEKDIKVFFDKAKHEITTIKIVGPEENLSQAEARNMGMDFCRQ 383

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  ++YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 384 DENCEYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 443

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   GIWNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 444 EDYVDIVQGNR--VGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 501

Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
            G+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + + 
Sbjct: 502 MGVFMYISNRHEFGRLLSTANYNISHFNNDLWQIFENPVDWKEKYINRDYAK-IFTENIV 560

Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
            QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+  GYE VPT DIHMKQ+GL
Sbjct: 561 EQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIGL 620

Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
             VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 621 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 679

Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           LN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 680 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 739


>gi|18204027|gb|AAH21352.1| Procollagen lysine, 2-oxoglutarate 5-dioxygenase 2 [Mus musculus]
          Length = 737

 Score =  645 bits (1663), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/710 (44%), Positives = 457/710 (64%), Gaps = 10/710 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
           I  DK LVITVA+ E DG+ RF+ SA+     VK LG  Q W GGD M+S+GGG KV LL
Sbjct: 34  IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  ++     +D++IL T+ +DV+  GG  ++L++F   +  IVF A+ L WPD  L DK
Sbjct: 94  KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G RYLNSGGFIGYA  I  L+   ++++ +DDQL+Y  +++D   R    I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQ L G+ +++ L F+  +   + NT Y T PV I+GNG +KI LN FGNY+  S
Sbjct: 214 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           W + +GC  C+ +  +D    D  P V + VFI++PT FL  FLN +  L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
           ++N + YH      ++   K    ++K +     ++  EARN+ ++     +  D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
           +   GIWNVPY+ N YL++   +++  N +  +  + +D DMA C N R+ G+ + I + 
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMGVFMYISNR 509

Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
            E+G L+ + N++    N + +++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWF
Sbjct: 510 HEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWF 568

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           PI +E+ C E V+ ME YG+WS G ++D R+  GYE VPT D HMKQ+GL  VW  F+R+
Sbjct: 569 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDTHMKQIGLENVWLHFIRE 628

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           ++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 629 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 687

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 688 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737


>gi|402861310|ref|XP_003895041.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 2 [Papio anubis]
          Length = 737

 Score =  644 bits (1660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/721 (43%), Positives = 461/721 (63%), Gaps = 10/721 (1%)

Query: 18  FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MS 76
           ++     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++
Sbjct: 23  YLGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGIN 82

Query: 77  SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAER 136
           S+GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ 
Sbjct: 83  SIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADG 142

Query: 137 LCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDE 196
           + WPD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D 
Sbjct: 143 ILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDP 202

Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE 256
             R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI 
Sbjct: 203 LKREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKIL 261

Query: 257 LNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIAN 315
           LN FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  
Sbjct: 262 LNYFGNYVPNSWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLT 320

Query: 316 LNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSL 375
           L+YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++   
Sbjct: 321 LDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCR 380

Query: 376 H-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYAR 434
             +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YAR
Sbjct: 381 QDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYAR 440

Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLR 493
           S DY++I+ G++   G+WNVPY+ N YL+K   ++   N +  +  + +D DMA C N R
Sbjct: 441 SEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAR 498

Query: 494 NKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTV 553
             G+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +
Sbjct: 499 EMGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENI 557

Query: 554 NNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVG 613
             QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV 
Sbjct: 558 VEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVD 617

Query: 614 LAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINI 673
           L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINI
Sbjct: 618 LENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINI 676

Query: 674 ALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           ALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+D
Sbjct: 677 ALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFID 736

Query: 734 P 734
           P
Sbjct: 737 P 737


>gi|17535123|ref|NP_496170.1| Protein LET-268 [Caenorhabditis elegans]
 gi|6093732|sp|Q20679.1|PLOD_CAEEL RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase;
           AltName: Full=Lethal protein 268; AltName: Full=Lysyl
           hydroxylase; Short=LH; Flags: Precursor
 gi|3877389|emb|CAA91321.1| Protein LET-268 [Caenorhabditis elegans]
          Length = 730

 Score =  644 bits (1660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/718 (43%), Positives = 470/718 (65%), Gaps = 21/718 (2%)

Query: 30  DEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLK 88
           D  + +V+TVA+  TDG KR ++SA+   + ++ LGL + W GGD     GGG K+ +L 
Sbjct: 21  DLPELVVVTVATENTDGLKRLLESAKAFDINIEVLGLGEKWNGGDTRIEQGGGQKIRILS 80

Query: 89  NELDEMDITDDMIILVTDSYDVIIDGGVNDILERF--NTFDANIVFGAERLCWPDTSLYD 146
           + +++     D +I+  D+YDV+ +     IL +F  +  +  ++FGAE  CWPD SL  
Sbjct: 81  DWIEKYKDASDTMIMFVDAYDVVFNADSTTILRKFFEHYSEKRLLFGAEPFCWPDQSLAP 140

Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
           +YP V  G R+LNSG F+GY  ++ +++  +S+++++DDQLYY +++LDE LR +  + L
Sbjct: 141 EYPIVEFGKRFLNSGLFMGYGPEMHKILKLKSVEDKDDDQLYYTMIYLDEKLRKELNMDL 200

Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
           D+++ +FQNL G +ED++L F  D      N  YNT P+I+HGNG SK  LN  GNYL  
Sbjct: 201 DSMSKIFQNLNGVIEDVELQFKEDGTPEAYNAAYNTKPLIVHGNGPSKSHLNYLGNYLGN 260

Query: 267 SWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
            W +  GC  C L    +  + ++ P + +++FI KP  F+EE L KIA  +YP +KI++
Sbjct: 261 RWNSQLGCRTCGL----EVKESEEVPLIALNLFISKPIPFIEEVLQKIAEFDYPKEKIAL 316

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
           ++YNNQ +      D++      +   + I   + +  +EARN A+E +  + V+F F +
Sbjct: 317 YIYNNQPFSIKNIQDFLQKHGKSYYTKRVINGVTEIGDREARNEAIEWNKARNVEFAFLM 376

Query: 386 DSDSHLDNPDVLKYLVNRNES----LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
           D D++   P V+K L+  +++    +IAP++ +P K ++NFWGA+ A+G+YARS DYM I
Sbjct: 377 DGDAYFSEPKVIKDLIQYSKTYDVGIIAPMIGQPGKLFTNFWGAIAANGYYARSEDYMAI 436

Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN-SMDYDMAFCTNLRNKGIHLK 500
           + G++   G WNVP+IT+  L     ++A  +K  Y+ N ++D DM+ C   R+ G  L 
Sbjct: 437 VKGNR--VGYWNVPFITSAVLFNKEKLEA--MKDAYSYNKNLDPDMSMCKFARDNGHFLY 492

Query: 501 IDSTQEYGHLVDSENFDPQKT----NPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQ 556
           ID+ + YG L+ S+ +    T    +PE++++  N   W+ RYIHP Y K + P+ V +Q
Sbjct: 493 IDNEKYYGFLIVSDEYAETVTEGKWHPEMWQIFENRELWEARYIHPGYHKIMEPEHVVDQ 552

Query: 557 PCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAG 616
            CPDV+ FP+++E+FC E ++ ME +G+WSDG+NNDKRL  GYE VPTRDIHM QVG   
Sbjct: 553 ACPDVYDFPLMSERFCEELIEEMEGFGRWSDGSNNDKRLAGGYENVPTRDIHMNQVGFER 612

Query: 617 VWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALN 676
            W  F+  YV P+QE+ FIGY+H+PV + M FVVRY+P+EQPSLRPHHD+ST++I+IALN
Sbjct: 613 QWLYFMDTYVRPVQEKTFIGYYHQPVESNMMFVVRYKPEEQPSLRPHHDASTFSIDIALN 672

Query: 677 QVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           + G DYEGGG R+IRYNC V A  +G+ +M PGRLTH HEGL  T+GTRYIM+SF++P
Sbjct: 673 KKGRDYEGGGVRYIRYNCTVPADEVGYAMMFPGRLTHLHEGLATTKGTRYIMVSFINP 730


>gi|380813822|gb|AFE78785.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 2
           precursor [Macaca mulatta]
          Length = 737

 Score =  643 bits (1659), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/721 (43%), Positives = 461/721 (63%), Gaps = 10/721 (1%)

Query: 18  FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MS 76
           ++     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++
Sbjct: 23  YLGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGIN 82

Query: 77  SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAER 136
           S+GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ 
Sbjct: 83  SIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADG 142

Query: 137 LCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDE 196
           + WPD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D 
Sbjct: 143 ILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDP 202

Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE 256
             R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI 
Sbjct: 203 LKREAINITLDHKCKVFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKIL 261

Query: 257 LNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIAN 315
           LN FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  
Sbjct: 262 LNYFGNYVPNSWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLT 320

Query: 316 LNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSL 375
           L+YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++   
Sbjct: 321 LDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCR 380

Query: 376 H-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYAR 434
             +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YAR
Sbjct: 381 QDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYAR 440

Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLR 493
           S DY++I+ G++   G+WNVPY+ N YL+K   ++   N +  +  + +D DMA C N R
Sbjct: 441 SEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAR 498

Query: 494 NKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTV 553
             G+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +
Sbjct: 499 EMGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENI 557

Query: 554 NNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVG 613
             QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV 
Sbjct: 558 VEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVD 617

Query: 614 LAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINI 673
           L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINI
Sbjct: 618 LENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINI 676

Query: 674 ALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           ALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+D
Sbjct: 677 ALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFID 736

Query: 734 P 734
           P
Sbjct: 737 P 737


>gi|297288059|ref|XP_002808395.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
           5-dioxygenase 3-like [Macaca mulatta]
          Length = 741

 Score =  643 bits (1659), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/716 (43%), Positives = 459/716 (64%), Gaps = 16/716 (2%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LV+TVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 32  VNPEKLLVMTVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 91

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +D II+  DSYDV++ G  +++L++F    + ++F AE  CWP+  L ++
Sbjct: 92  KKEMEKYADREDTIIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 151

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 152 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 211

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 212 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 270

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 271 WTPEGGCGFCNRDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 328

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAV------ENSLHKGV 379
           F++NN+ +H P   D     +  F  VK +     ++  EAR++A+      E  +  G 
Sbjct: 329 FLHNNEVFHEPHIADSWPQLQDHFAVVKLVGPEEALSPGEARDMAMXAGACGEEGVGXGC 388

Query: 380 DFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYM 439
                      L +P           S     L R  K WS FWGAL+ D +YARS DY+
Sbjct: 389 RVGVAATXCLALTSPVTRPAPPEPPHSXXXXXLSRHGKLWSXFWGALSPDEYYARSEDYV 448

Query: 440 NIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIH 498
            ++   +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI 
Sbjct: 449 ELVQRKR--VGVWNVPYISQAYVIRGDTLRTELPQRDVFSGSDTDPDMAFCKSFRDKGIF 506

Query: 499 LKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPC 558
           L + +  E+G L+ +  +D +  +P+++++  NP+DW  +YIH  Y ++L  + +  QPC
Sbjct: 507 LHLSNQHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPC 566

Query: 559 PDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW 618
           PDV+WFP+++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    W
Sbjct: 567 PDVYWFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQW 626

Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
            + LR YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  
Sbjct: 627 LQLLRTYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHK 685

Query: 679 GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           G+DYEGGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 686 GLDYEGGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 741


>gi|2138314|gb|AAB58363.1| lysyl hydroxylase isoform 2 [Homo sapiens]
          Length = 737

 Score =  642 bits (1657), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/720 (43%), Positives = 462/720 (64%), Gaps = 10/720 (1%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKVYIDPL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499

Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
            G+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + + 
Sbjct: 500 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 558

Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
            QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV L
Sbjct: 559 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 618

Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
             VW +F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 619 ENVWLDFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 677

Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           LN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 678 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737


>gi|397512440|ref|XP_003826553.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 1 [Pan paniscus]
          Length = 737

 Score =  642 bits (1657), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/720 (43%), Positives = 461/720 (64%), Gaps = 10/720 (1%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLMKEVMERYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKIFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499

Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
            G+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + + 
Sbjct: 500 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 558

Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
            QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV L
Sbjct: 559 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 618

Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
             VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 619 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 677

Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           LN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 678 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737


>gi|403278815|ref|XP_003930980.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 2 [Saimiri boliviensis boliviensis]
          Length = 737

 Score =  642 bits (1657), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/720 (43%), Positives = 461/720 (64%), Gaps = 10/720 (1%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +  +  K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGANSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  +      DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLMKEIMGHYADQDDLVVMFTECFDVIFAGGPEELLKKFQKANHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTL 321

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 382 DENCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499

Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
            G+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + + 
Sbjct: 500 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 558

Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
            QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV L
Sbjct: 559 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 618

Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
             VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 619 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 677

Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           LN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 678 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737


>gi|332818216|ref|XP_516801.3| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Pan
           troglodytes]
          Length = 737

 Score =  642 bits (1656), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/720 (43%), Positives = 461/720 (64%), Gaps = 10/720 (1%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKIFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499

Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
            G+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + + 
Sbjct: 500 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 558

Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
            QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV L
Sbjct: 559 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 618

Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
             VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 619 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 677

Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           LN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 678 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737


>gi|410254494|gb|JAA15214.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [Pan
           troglodytes]
          Length = 737

 Score =  642 bits (1656), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/720 (43%), Positives = 461/720 (64%), Gaps = 10/720 (1%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKIFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499

Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
            G+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + + 
Sbjct: 500 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 558

Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
            QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV L
Sbjct: 559 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 618

Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
             VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 619 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 677

Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           LN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 678 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKSGTRYIAVSFIDP 737


>gi|62089344|dbj|BAD93116.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 isoform b
           variant [Homo sapiens]
          Length = 796

 Score =  642 bits (1656), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/720 (43%), Positives = 461/720 (64%), Gaps = 10/720 (1%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 83  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 142

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 143 IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 202

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DDQL+Y  +++D  
Sbjct: 203 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKVYIDPL 262

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 263 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 321

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 322 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 380

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 381 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 440

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 441 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 500

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 501 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 558

Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
            G+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + + 
Sbjct: 559 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 617

Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
            QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV L
Sbjct: 618 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 677

Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
             VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 678 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 736

Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           LN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 737 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 796


>gi|62739166|ref|NP_000926.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 2
           precursor [Homo sapiens]
 gi|62906878|sp|O00469.2|PLOD2_HUMAN RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2;
           AltName: Full=Lysyl hydroxylase 2; Short=LH2; Flags:
           Precursor
 gi|119599347|gb|EAW78941.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2, isoform CRA_c
           [Homo sapiens]
 gi|261858130|dbj|BAI45587.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [synthetic
           construct]
          Length = 737

 Score =  642 bits (1655), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/720 (43%), Positives = 461/720 (64%), Gaps = 10/720 (1%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKVYIDPL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499

Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
            G+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + + 
Sbjct: 500 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 558

Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
            QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV L
Sbjct: 559 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 618

Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
             VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 619 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 677

Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           LN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 678 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737


>gi|27924281|gb|AAH45041.1| LOC398437 protein, partial [Xenopus laevis]
          Length = 727

 Score =  641 bits (1654), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/711 (44%), Positives = 462/711 (64%), Gaps = 16/711 (2%)

Query: 28  NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLL 87
           N D+D  LV+TVA+ ET+G +RF +SA     +VK LGL + WLG        G K+ L+
Sbjct: 29  NPDDDNLLVLTVATEETEGLRRFQRSAHSFNYKVKVLGLGEEWLGD-------GQKIQLM 81

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  L+     +D+IIL T+SYDVI   G  ++L++F    + +VF AE + +PD  L  K
Sbjct: 82  KLALEPYSDKEDLIILFTESYDVIFASGHGELLKKFRQAKSKVVFSAESVAYPDRHLESK 141

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G RYL SG FIGYA  + +++++    ++  DQL+Y  LFLD   R K  I LD
Sbjct: 142 YPVVREGKRYLGSGAFIGYAAHLYKMVADWDGTDKSSDQLFYTKLFLDPVKRGKINITLD 201

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQNLYGS ED+ L F+    V   N  Y+T PV+IHGNG +K+ LN   NY+ + 
Sbjct: 202 HRCRIFQNLYGSAEDVVLKFEYGR-VRARNLVYDTLPVLIHGNGPTKLHLNYLSNYIPRV 260

Query: 268 WK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W   SGC  C+  +++LD L  D  P V+I ++I++PT F+ EF  ++ NLNYP  +I +
Sbjct: 261 WTFESGCNVCDEGLRNLDGLTVDTLPLVVIGIYIEQPTPFVSEFFKRLNNLNYPKNRIQL 320

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK-GVDFYFY 384
           ++ N++ +H    + ++ +  T +  VK +      +  +ARN  ++        ++YF 
Sbjct: 321 YISNHEPHHQKRVEHFLQDHGTQYNFVKTVGPEENSDFADARNKGMDMCRQTPECEYYFS 380

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+   L N +VL+ L+ +N+S+IAPL+ R    WSNFWGALN+DG+YARS DY++++  
Sbjct: 381 IDAPVVLKNTNVLRSLIEQNKSVIAPLVSRNANLWSNFWGALNSDGYYARSEDYIDVVQR 440

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI++ YL+K S++++  N   ++   ++D DMAFC N+R +G+ + + +
Sbjct: 441 QR--NGVWNVPYISSVYLVKGSILRSKLNQNDLFHSGTLDSDMAFCHNVRQQGVFMFVTN 498

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
            QE+GH++  +N+     + +++E+  N  DW  +YIH  + ++L    V   PCPDV+W
Sbjct: 499 RQEFGHILSLKNYKTTHLHNDLWEIFENTEDWKEKYIHHNHSEALKGKLVE-MPCPDVYW 557

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+ TE  C+E V+ ME++G+WS G+N D+RL+ GYE VPT DIHM Q+G    W + L 
Sbjct: 558 FPVFTETTCNEIVEEMESFGKWSTGSNTDQRLQGGYENVPTIDIHMNQIGYEKEWQKILL 617

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            ++ PL ++ F GY+    +  ++FVVRY+PDEQP L PHHD+ST+T+NIALN VG DYE
Sbjct: 618 DFIAPLTQKMFPGYY-TMAQFDLAFVVRYKPDEQPLLEPHHDASTFTVNIALNSVGQDYE 676

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RYNC+V A R GW LMHPGRLTHYHEGL+VT+GTRYI++SFVDP
Sbjct: 677 GGGCRFLRYNCSVRALRKGWALMHPGRLTHYHEGLRVTKGTRYIVVSFVDP 727


>gi|301778999|ref|XP_002924916.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like
           isoform 2 [Ailuropoda melanoleuca]
          Length = 736

 Score =  640 bits (1652), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/714 (43%), Positives = 462/714 (64%), Gaps = 10/714 (1%)

Query: 25  KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYK 83
           K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S+GGG K
Sbjct: 29  KPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQK 88

Query: 84  VNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
           V L+K  ++     +D++IL T+ ++VI  GG  ++L++F   +  +VF A+ + WPD  
Sbjct: 89  VRLMKEVMEHYANQEDLVILFTECFNVIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKR 148

Query: 144 LYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHK 203
           L DKYP V  G RYLNSGGFIGYA +I +++   ++++ +DDQL+Y  +++D   R    
Sbjct: 149 LADKYPIVHIGKRYLNSGGFIGYAPNINQIVQQWNLQDNDDDQLFYTKIYIDPLKREAIN 208

Query: 204 IVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNY 263
           I LD    +FQ L G+++++ L F+  +     N  Y T PV ++GNG +KI LN FGNY
Sbjct: 209 ITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVAVNGNGPTKILLNYFGNY 267

Query: 264 LAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
           +  +W + +GCT C+L   +D    D  P+V I VFI++PT FL  FL+ +  L+YP + 
Sbjct: 268 VPNAWTQDNGCTLCDL-DTIDLSTVDVHPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEA 326

Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDF 381
           + +F++N + YH      +    K     +K +     ++  EARN+ ++     +  D+
Sbjct: 327 LKLFIHNKEVYHEKDIKVFFDKAKREISTIKIVGPEENLSQAEARNMGMDFCRQDENCDY 386

Query: 382 YFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
           YF +D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I
Sbjct: 387 YFSMDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDI 446

Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLK 500
           + G++   GIWNVPY+ N YL+K   +++  N +  +  + +D DMA C N R  G+ + 
Sbjct: 447 VQGNR--VGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMGVFMY 504

Query: 501 IDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPD 560
           I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +  QPCPD
Sbjct: 505 ISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPD 563

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
           VFWFPI +EK C E V+ ME YGQWS G ++D R+  GYE VPT DIHMKQ+ L  VW  
Sbjct: 564 VFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLH 623

Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGV 680
           F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG 
Sbjct: 624 FIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGE 682

Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 683 DFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 736


>gi|62900635|sp|Q811A3.1|PLOD2_RAT RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2;
           AltName: Full=Lysyl hydroxylase 2; Short=LH2; Flags:
           Precursor
 gi|28400783|emb|CAD23630.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2, short variant
           [Rattus norvegicus]
          Length = 737

 Score =  640 bits (1652), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/710 (44%), Positives = 456/710 (64%), Gaps = 10/710 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
           I  DK LVITVA+ E DG+ RF+ SA+     VK LG  Q W GGD M+S+GGG KV L+
Sbjct: 34  IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLM 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  ++     DD++IL T+ +DVI  GG  ++L++F   +  IVF A+ L WPD  L DK
Sbjct: 94  KEAMEHYAGQDDLVILFTECFDVIFAGGPEELLKKFQKTNHKIVFAADALLWPDKRLADK 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G RYLNSGGFIGYA  I  L+   ++++ +DDQL+Y  +++D   R    I LD
Sbjct: 154 YPGVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREALNITLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQ L G+ +++ L F+  +   + NT Y T PV I+GNG SKI LN FGNY+  S
Sbjct: 214 HRCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPSKILLNYFGNYVPNS 272

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           W + +GC  C+     D    D +P V + VFI++PT F   FL+ +  L+YP + + +F
Sbjct: 273 WTQENGCALCDFDAS-DLSTVDVYPKVTLGVFIEQPTPFQPRFLDLLLTLDYPKEALRLF 331

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
           V+N + YH      ++   K    ++K +     ++  EARN+ ++     +  D+YF V
Sbjct: 332 VHNKEVYHEKDIKAFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
           +   GIWNVPY+ N YL++   +++  + +  +  + +D DM+ C N R+ G+ + I + 
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMSERNYFVRDKLDPDMSLCRNARDMGVFMYISNR 509

Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
            E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWF
Sbjct: 510 HEFGRLISTANYNTSHLNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWF 568

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           PI +E+ C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQ+ L  VW  F+R+
Sbjct: 569 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIRE 628

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           ++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 629 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 687

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 688 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737


>gi|308509144|ref|XP_003116755.1| CRE-LET-268 protein [Caenorhabditis remanei]
 gi|308241669|gb|EFO85621.1| CRE-LET-268 protein [Caenorhabditis remanei]
          Length = 734

 Score =  640 bits (1651), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/736 (42%), Positives = 471/736 (63%), Gaps = 16/736 (2%)

Query: 11  ILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW 70
           +L  + FF+          D  + +V+TVA+  TDG KR ++SA+   ++++ L L + W
Sbjct: 3   VLPLLPFFLIPVILATTITDLPELVVVTVATENTDGLKRLLESAKAFDIKIEVLALGEKW 62

Query: 71  LGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF--NTFD 127
            GGD     GGG K+ +L   +++     D II+  D+YDV+ +     IL +F  +  +
Sbjct: 63  NGGDTRVEQGGGQKIRILSEWIEKYKDASDTIIMFVDAYDVVFNADATTILRKFFEHYSE 122

Query: 128 ANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQL 187
             ++FGAE  CWPD +L   YP V  G R+LNSG F+GY  ++ +++  + +++++DDQL
Sbjct: 123 KRLLFGAEPFCWPDQTLAPDYPIVEFGKRFLNSGLFMGYGPEVYKILKLKPVEDKDDDQL 182

Query: 188 YYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVII 247
           YY +++LD+ LR + K+ LD+++ +FQNL G +ED++L F  D      N  YNT P+II
Sbjct: 183 YYTMIYLDDKLRKELKMDLDSMSKIFQNLNGVIEDVELQFKDDGTPEAYNAAYNTKPLII 242

Query: 248 HGNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFL 306
           HGNG SK  LN  GNYL   W +  GC  C   +  ++   D  P + +++FI KP  F+
Sbjct: 243 HGNGPSKSHLNYLGNYLGNRWNSELGCRNCGQEEEKETADED-LPLIALNLFISKPIPFI 301

Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
           EE L K++  +YP  KI++++YNNQ +      D++      +   + I   + +  +EA
Sbjct: 302 EEVLQKVSEFDYPKNKIALYIYNNQPFSIKNIQDFLKEHGKSYYTKRVINGVTEIGEREA 361

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNES----LIAPLLVRPFKAWSNF 422
           RN A+E    + V++ F++D+D++  +P ++K LV+ +E+    +IAP++ +P K ++NF
Sbjct: 362 RNEAIEWDKQRNVEYGFFMDADAYFTDPKIVKDLVHHSETYDVGIIAPMVGQPGKLFTNF 421

Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM 482
           WGA+ A+G+YARS DYM I+ G++   G WNVP+IT+  L+    + A      Y  N +
Sbjct: 422 WGAIAANGYYARSEDYMAIVKGNR--VGYWNVPFITSAVLLNKEKLVAMKDSFSYNKN-L 478

Query: 483 DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKT----NPEVYELIRNPLDWDLR 538
           D DM+ C   R+ G  + ID+ + YG+L+ S+ F    T    +PE++++  N   W+ R
Sbjct: 479 DPDMSMCQFARDHGHFMYIDNEKSYGYLIVSDEFSETVTQGKWHPEMWQIFENRELWEAR 538

Query: 539 YIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETG 598
           YIHP Y K + PD + +Q CPDV+ +P+++E+FC E ++ ME +G+WSDG+NNDKRL  G
Sbjct: 539 YIHPGYHKIMEPDHIVDQACPDVYDYPLMSERFCAELIEEMEGFGRWSDGSNNDKRLAGG 598

Query: 599 YEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQP 658
           YE VPTRDIHM QVG    W  FL  YV P+QE+ FIGY+H+PV + M FVVRY+P+EQ 
Sbjct: 599 YENVPTRDIHMNQVGFERQWLYFLDTYVRPVQEKTFIGYYHQPVESNMMFVVRYKPEEQA 658

Query: 659 SLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGL 718
           SLRPHHD+ST++I+IALN+ G DYEGGG R+IRYNC V A  +G+ +M PGRLTH HEGL
Sbjct: 659 SLRPHHDASTFSIDIALNKKGRDYEGGGVRYIRYNCTVQADEVGYAMMFPGRLTHMHEGL 718

Query: 719 QVTQGTRYIMISFVDP 734
             T+GTRYIM+SF++P
Sbjct: 719 ATTKGTRYIMVSFINP 734


>gi|432928329|ref|XP_004081145.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like
           [Oryzias latipes]
          Length = 737

 Score =  640 bits (1651), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 309/713 (43%), Positives = 461/713 (64%), Gaps = 10/713 (1%)

Query: 27  KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVN 85
           ++I ++K L++TVA+ ETDG+ RF++SA      VK LG+ + W GGD+  S+GGG KV 
Sbjct: 30  ESIPKEKLLILTVATEETDGFLRFMRSANYFNYTVKVLGMGEKWKGGDVGHSIGGGQKVR 89

Query: 86  LLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLY 145
           LLK  ++ +   +D++IL  DSYD+I  GG  +IL++F   +  ++F AE L WPD  L 
Sbjct: 90  LLKKAMEALADQEDLVILSVDSYDLIFAGGPEEILKKFKQANHKVLFAAEGLIWPDKRLT 149

Query: 146 DKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIV 205
           DKYP+V SG R+LNSGG IGYA  I  ++S  ++ + +DDQL+Y  ++LD   R    + 
Sbjct: 150 DKYPSVRSGKRFLNSGGIIGYAPYINRIVSEWNLHDNDDDQLFYTKIYLDPLKRETLNMT 209

Query: 206 LDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLA 265
           LD    +FQNL G+++++ L F     V + NT Y++ P+++HGNGK+K+ LN  GNY+ 
Sbjct: 210 LDHKCQIFQNLNGAVDEVLLKFGTGR-VRVRNTMYDSLPIVVHGNGKTKMYLNYLGNYVP 268

Query: 266 KSWK-TSGCTRCNLIKHLDS-LKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKI 323
            +W   +GC+ C+      + L+  ++P+VL+ VFI++PT FL EFL+++  L+YP  K+
Sbjct: 269 NAWNYENGCSGCDDDLLDLTQLEVCEYPNVLVGVFIEQPTPFLPEFLHRLLTLDYPKDKL 328

Query: 324 SMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGV-DFY 382
            +FV+NN+ YH      +    K +F + K +     ++  EARN+ ++        D+Y
Sbjct: 329 QVFVHNNEVYHEKHIQTFWEESKNVFGSFKVVGPEENLSQGEARNMGMDLCRKDATCDYY 388

Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
           F +DSD  L N   LK L+ +N  +I PL+ R  K WSNFWGAL+ DG+YARS DY++I+
Sbjct: 389 FSIDSDVMLTNRQTLKLLIEQNRKIIGPLVTRHGKLWSNFWGALSLDGYYARSEDYVDIV 448

Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVI-KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
              +   G+WN+PY+ + YL+K +V+ K    +  + L  +D DMA C N R  G+ + I
Sbjct: 449 QRKR--VGVWNIPYMAHVYLIKGAVLRKELKERNYFVLEKLDPDMALCRNAREMGVFMFI 506

Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDV 561
            +  ++G L+ + +++    N +++++  NP+DW  +YIH  Y +    + +  QPCPDV
Sbjct: 507 TNRHDFGRLISTASYNTSHYNNDLWQIFENPVDWKEKYIHQNYTQIFTHNYLE-QPCPDV 565

Query: 562 FWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEF 621
           +WFP+++EK C E V+ ME YG WS G + DKR+  GYE VPT DIHMKQ+G    W  F
Sbjct: 566 YWFPVLSEKACDEIVEEMEHYGSWSGGKHEDKRISGGYETVPTDDIHMKQIGFDKEWLHF 625

Query: 622 LRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVD 681
           +R+++ P+  + F GY+ +   A M+FVV+Y P+ Q  LRPHHDSST+TINIALN  G D
Sbjct: 626 IREFISPVTLKVFSGYYTKGY-ALMNFVVKYTPERQAYLRPHHDSSTFTINIALNNKGSD 684

Query: 682 YEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           ++GGGCRF RYNC++ + R GW  MHPGRLTH HEGL  T GTRYI +SF+DP
Sbjct: 685 FQGGGCRFHRYNCSIESPRKGWSFMHPGRLTHLHEGLPTTNGTRYIAVSFIDP 737


>gi|119599346|gb|EAW78940.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2, isoform CRA_b
           [Homo sapiens]
          Length = 740

 Score =  640 bits (1650), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/715 (43%), Positives = 460/715 (64%), Gaps = 10/715 (1%)

Query: 24  NKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGY 82
           + +  +  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S+GGG 
Sbjct: 32  SSIPTVFADKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQ 91

Query: 83  KVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
           KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ + WPD 
Sbjct: 92  KVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGILWPDK 151

Query: 143 SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKH 202
            L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DDQL+Y  +++D   R   
Sbjct: 152 RLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKVYIDPLKREAI 211

Query: 203 KIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGN 262
            I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI LN FGN
Sbjct: 212 NITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILLNYFGN 270

Query: 263 YLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAK 321
           Y+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L+YP +
Sbjct: 271 YVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTLDYPKE 329

Query: 322 KISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVD 380
            + +F++N + YH      +    K   K +K +     ++  EARN+ ++     +  D
Sbjct: 330 ALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQDEKCD 389

Query: 381 FYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMN 440
           +YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++
Sbjct: 390 YYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVD 449

Query: 441 IINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHL 499
           I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R  G+ +
Sbjct: 450 IVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMGVFM 507

Query: 500 KIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCP 559
            I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +  QPCP
Sbjct: 508 YISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCP 566

Query: 560 DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
           DVFWFPI +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV L  VW 
Sbjct: 567 DVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWL 626

Query: 620 EFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
            F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG
Sbjct: 627 HFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVG 685

Query: 680 VDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
            D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 686 EDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 740


>gi|147898979|ref|NP_001082933.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 2
           precursor [Danio rerio]
 gi|70779501|gb|AAZ08243.1| procollagen lysine 2-oxoglutarate 5-dioxygenase 2b isoform [Danio
           rerio]
          Length = 754

 Score =  640 bits (1650), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 317/751 (42%), Positives = 472/751 (62%), Gaps = 35/751 (4%)

Query: 10  LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
           ++++CV   + +  NK  +I  +K LV+TVA+ ETDG+ RF+QSA      VK LG+ + 
Sbjct: 13  MLVTCVHCTLGMETNK--DIPTEKLLVLTVATQETDGFLRFMQSANYFNFNVKVLGMGEE 70

Query: 70  WLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
           W GGD+  S+GGG KV LLK  ++ +D  +D+++L  DSYD+I  GG  +IL +F   + 
Sbjct: 71  WKGGDVGHSIGGGQKVRLLKEAMESLDQQEDLVVLFVDSYDLIFAGGAEEILRKFQQSNH 130

Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
            +VF AE + WPD+ L +KYP+V SG R+LNSGG IGYA  I++L+S   + + +DDQL+
Sbjct: 131 KVVFAAEGIIWPDSQLAEKYPSVRSGKRFLNSGGIIGYAPYIQKLVSQWDLHDNDDDQLF 190

Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIH 248
           Y  +++D   R K  + LD    +FQNL G+L+++ L F   E V + NT YN+ P +IH
Sbjct: 191 YTKIYVDPIQREKLNMTLDHKCEIFQNLNGALDEVLLKFG-TERVRVRNTIYNSLPAVIH 249

Query: 249 GNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFL 306
           GN  +K+  N   NY+  +W    GCT C+  +  L  LK  +FP V + V+I++PT FL
Sbjct: 250 GNVNTKVYFNYLANYIPNAWNYERGCTICDQDMVDLSQLK--EFPQVTVGVYIEQPTPFL 307

Query: 307 EEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA 366
            EFL ++ +L+YP  K+++F++N++ YH      +    K +F + K +     +   EA
Sbjct: 308 PEFLERLLSLDYPKDKLNIFIHNSEVYHEKHIQKFWEENKDVFGSFKAVGPEENLTQGEA 367

Query: 367 RNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGA 425
           RN+ ++        D++F +D+D  L N   LK L+ +N  +IAPL+ R  K WSNFWGA
Sbjct: 368 RNMGMDVCRRDPSCDYFFNIDADVMLTNRQTLKLLIEQNRKIIAPLVTRHGKLWSNFWGA 427

Query: 426 LNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDY 484
           L+ DG+YARS DY++I+ G +   G+WN+P++ + YL+K   ++     + ++ L  +D 
Sbjct: 428 LSLDGYYARSEDYIDIVQGKR--VGVWNIPFLAHVYLIKGQTLRNELKERNVFVLEKLDP 485

Query: 485 DMAFCTNLRN---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNP 523
           DMA C N R+                     KG+ + + +  E+G L+ + N++    N 
Sbjct: 486 DMAMCRNARDLTVHRERESPSPESFHMLRSPKGLFMYLTNRHEFGRLISTANYNTSHYNN 545

Query: 524 EVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYG 583
           +++++  NPLDW  +YIH  Y + +  + +  QPCPDVFWFP+++EK C+E V+ ME +G
Sbjct: 546 DLWQIFENPLDWREKYIHANYTR-IFTENLLEQPCPDVFWFPVLSEKACNELVEEMENHG 604

Query: 584 QWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVR 643
            WS G + DKR+  GYE+VPT DIHMKQ+     W  F+R+++ P+  + F GY+ +   
Sbjct: 605 TWSGGKHEDKRITGGYESVPTDDIHMKQINYDQEWLHFIREFISPVTLKVFSGYYTKGY- 663

Query: 644 APMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
           A M+FVV+Y PD Q  LRPHHDSST+TINIALN  G+D+ GGGCRF RYNC++ + R GW
Sbjct: 664 AIMNFVVKYTPDRQAYLRPHHDSSTFTINIALNNKGLDFLGGGCRFHRYNCSIESPRKGW 723

Query: 704 MLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
             MHPGRLTH HEGL VT GTRYI +SFVDP
Sbjct: 724 SFMHPGRLTHLHEGLPVTNGTRYIAVSFVDP 754


>gi|348503412|ref|XP_003439258.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           [Oreochromis niloticus]
          Length = 756

 Score =  639 bits (1649), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 319/755 (42%), Positives = 465/755 (61%), Gaps = 36/755 (4%)

Query: 10  LILSC-VVFFISVHCNKVKN----IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTL 64
           L+ SC + F  SV  + V      I ++K LV+TVA+ ETDG+ RF++SA+     VK L
Sbjct: 8   LVFSCWIKFAASVLSSDVPQAPVPIPKEKLLVLTVATEETDGFLRFMRSADYFNYTVKVL 67

Query: 65  GLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
           G+ + W GGD+  S+GGG KV LLKN ++ +   +D+++L  DSYD+I  GG  +IL +F
Sbjct: 68  GMGEAWKGGDVGRSIGGGQKVRLLKNAMEALADQEDLVVLSVDSYDLIFAGGPEEILRKF 127

Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
              +  ++F AE L WPD  L DKYP V +G RYLNSGG IGYA  I  ++S  ++ + +
Sbjct: 128 QQANHKVLFAAEGLVWPDKQLADKYPLVRTGKRYLNSGGIIGYAPYINRIVSQWNLHDND 187

Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
           DDQL+Y  ++LD   R    + LD    +FQNL G+++++ L F  D  V + NT Y++ 
Sbjct: 188 DDQLFYTKIYLDPLQRESLNMTLDHKCQIFQNLNGAVDEVLLKFGTDR-VRVRNTAYDSL 246

Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKP 302
           PV++HGNG +K+ LN   NY+  +W    GC+ C+    +D  +  ++P+VL+ VFI++P
Sbjct: 247 PVVVHGNGNTKMYLNYLANYVPNAWNYEHGCSHCD-DDVVDFSQLKEYPNVLVGVFIEQP 305

Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
           T FL EF  ++  L+YP  K+ +FV+NN+ YH      +    + +F + K +     ++
Sbjct: 306 TPFLPEFFQRLLTLDYPKDKLKLFVHNNEVYHEKHIQRFWEENRNVFNSFKVVGPEENLS 365

Query: 363 SKEARNLAVENSLHKGV-DFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
             EARN+A++        D+YF +DSD  L N   LK L+ +N  +I PL+ R  K WSN
Sbjct: 366 QGEARNMAMDLCRQDATCDYYFSIDSDVMLTNRQTLKLLIEQNRKIIGPLVTRHGKLWSN 425

Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLN 480
           FWGAL+ DG+YARS DY++I+   +   G+WN+PY+ + YL+K S ++     +  + L 
Sbjct: 426 FWGALSLDGYYARSEDYVDIVQRKR--VGVWNIPYMAHVYLVKGSALRNELKERNYFVLE 483

Query: 481 SMDYDMAFCTNLRN---------------------KGIHLKIDSTQEYGHLVDSENFDPQ 519
            +D DMA C N R                      KG+ + I +  E+G L+ + N++  
Sbjct: 484 KLDPDMALCRNAREMTSHREKDSPSPESFHMLRPPKGVFMYITNRHEFGRLISTANYNIS 543

Query: 520 KTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIM 579
             N +++++  NP+DW  +YIH  Y + +  +    QPCPDVFWFP+ +EK C E V+ M
Sbjct: 544 HYNNDLWQIFENPVDWKEKYIHSNYTR-IFTENYLEQPCPDVFWFPVFSEKACDELVEEM 602

Query: 580 EAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHH 639
           E YG WS G + DKR+  GYE VPT DIHMKQ+G    W  F+R+++ P+  + F GY+ 
Sbjct: 603 EHYGSWSGGKHQDKRIAGGYETVPTDDIHMKQIGFEKEWLHFIREFISPVTLKVFSGYYT 662

Query: 640 EPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTAT 699
           +   A M+FVV+Y P+ Q  LRPHHDSST+TINIALN  G+D++GGGCRF RYNC + + 
Sbjct: 663 KGY-AIMNFVVKYTPERQAYLRPHHDSSTFTINIALNNKGIDFQGGGCRFHRYNCTIESP 721

Query: 700 RMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           R GW  MHPGRLTH HEGL  T GTRYI +SF+DP
Sbjct: 722 RKGWSFMHPGRLTHLHEGLPTTGGTRYIAVSFIDP 756


>gi|354504767|ref|XP_003514445.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like,
           partial [Cricetulus griseus]
          Length = 709

 Score =  639 bits (1649), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 311/711 (43%), Positives = 466/711 (65%), Gaps = 9/711 (1%)

Query: 28  NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLL 87
           ++  D  LV+TVA+ ET+G++RF +SA+    +++ LGL + W      S GGG KV LL
Sbjct: 4   SLSTDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWSVDSGPSAGGGQKVRLL 63

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  K
Sbjct: 64  KKALEKHAHKEDLVILFTDSYDVVFASGPRELLKKFQQAKSRVVFSAEELIYPDRRLEAK 123

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I L 
Sbjct: 124 YPTVSDGKRFLGSGGFIGYAPNLNKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINISLG 183

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
              ++FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + 
Sbjct: 184 HSCSIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVIHGNGPTKLQLNYLGNYIPRF 242

Query: 268 WK-TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W   +GCT C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L YP K++ +
Sbjct: 243 WTFETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFLRLLRLRYPQKRMRL 302

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++N++++H    + ++    T +++VK +     + + +ARN+  +     +   +YF 
Sbjct: 303 FIHNHEQHHKLEVEKFLAEHGTEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFS 362

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           VD+D  L  PD L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G
Sbjct: 363 VDADVALTEPDSLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQG 422

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+N YL+K S ++A      ++  + +D DM+FC N+R + + + + +
Sbjct: 423 RR--VGVWNVPYISNIYLIKGSALRAELQHVDLFHYSKLDADMSFCANVRQQEVFMFLTN 480

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
              +GHL+  +N+     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+W
Sbjct: 481 RHTFGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALEGKLV-EMPCPDVYW 539

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FPI TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL 
Sbjct: 540 FPIFTEAACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREWHKFLV 599

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
           +Y+ P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VG DYE
Sbjct: 600 EYIAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRVGQDYE 658

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 659 GGGCRFLRYNCSIRAPRKGWALMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 709


>gi|74178654|dbj|BAE34000.1| unnamed protein product [Mus musculus]
          Length = 758

 Score =  639 bits (1648), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 317/731 (43%), Positives = 460/731 (62%), Gaps = 31/731 (4%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
           I  DK LVITVA+ E DG+ RF+ SA+     VK LG  Q W GGD M+S+GGG KV LL
Sbjct: 34  IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  ++     +D++IL T+ +DV+  GG  ++L++F   +  IVF A+ L WPD  L DK
Sbjct: 94  KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G RYLNSGGFIGYA  I  L+   ++++ +DDQL+Y  +++D   R    I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQ L G+ +++ L F+ +    + NT Y T PV I+GNG +KI LN FGNY+  S
Sbjct: 214 HKCKIFQALNGATDEVVLKFE-NGISRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           W + +GC  C+ +  +D    D  P V + VFI++PT FL  FLN +  L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
           ++N + YH      ++   K    ++K +     ++  EARN+ ++     +  D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L NP  LK+L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKFLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------- 494
           +   GIWNVPY+ N YL++   +++  N +  +  + +D DMA C N R+          
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMTLQREKDSP 509

Query: 495 -----------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
                      KG+ + I +  E+G L+ + N++    N + +++  NP+DW  +YI+ +
Sbjct: 510 TPETFQMLSPPKGVFMYISNRHEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRD 569

Query: 544 YQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVP 603
           Y K +  + +  QPCPDVFWFPI +E+ C E V+ ME YG+WS G ++D R+  GYE VP
Sbjct: 570 YSK-IFTENIVEQPCPDVFWFPIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVP 628

Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
           T DIHMKQ+GL  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPH
Sbjct: 629 TDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPH 687

Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
           HD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  G
Sbjct: 688 HDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNG 747

Query: 724 TRYIMISFVDP 734
           TRYI +SF+DP
Sbjct: 748 TRYIAVSFIDP 758


>gi|449509755|ref|XP_002186557.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           [Taeniopygia guttata]
          Length = 774

 Score =  638 bits (1646), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 309/729 (42%), Positives = 462/729 (63%), Gaps = 31/729 (4%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNLLKN 89
           +DK LV TVA+ ETDG+ RF+++A+     VK LG  + W GG++ +S+GGG KV LLK 
Sbjct: 52  KDKLLVFTVATKETDGFHRFMRTAKHFNYTVKVLGKGEEWKGGELPNSIGGGQKVRLLKE 111

Query: 90  ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
            ++     +D+++L  + YDVI  GG  ++L++F   +  +VF A+ L WPD  L DKYP
Sbjct: 112 GIESYADQEDLVVLFVECYDVIFAGGPEELLKKFQETNHKVVFAADGLIWPDKRLADKYP 171

Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
            V +G R+LNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D   R    I LD  
Sbjct: 172 VVQTGKRFLNSGGFIGYAPSINRIVQQWNLQDNDDDQLFYTKIYVDPLAREHINITLDHK 231

Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
             +FQ L G+++++ L F+  +     N+ Y+T PV IHGNG +KI+LN  GNY+  +W 
Sbjct: 232 CTIFQTLNGAVDEVLLKFEEGK-ARARNSVYDTLPVTIHGNGPTKIQLNYLGNYIPNAWT 290

Query: 270 -TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
             +GC+ C+L   LD     ++P V I VFI++PT FL +FL+++  L+YP + +S+FV+
Sbjct: 291 WETGCSVCDL-DLLDLSAVKEYPRVKIGVFIEQPTPFLTKFLDRLLTLDYPREALSIFVH 349

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           NN+ YH      +    K + +N+K +     ++  EARN+ ++     K  ++Y  +D+
Sbjct: 350 NNEVYHEKHIKKFWEKAKNIIRNIKIVGPEENLSQAEARNMGMDLCRQDKTCEYYLSIDA 409

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L NP  L+ L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++ 
Sbjct: 410 DVVLTNPKTLRLLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR- 468

Query: 448 GKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------------ 494
             G+WN+PY+ N YL+K   +++    K  +  + +D DMA C N R             
Sbjct: 469 -VGVWNIPYMANIYLIKGQTLRSEMKEKNYFMRDKLDPDMALCRNAREMTLQREKDSPSS 527

Query: 495 ---------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQ 545
                    KG+ + I +  E+G L+ + N++    N +++++  NP+DW   YI+P Y 
Sbjct: 528 ETFHMLRAPKGVFMYITNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKETYINPNYS 587

Query: 546 KSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTR 605
           K +  D +  QPCPDVFWFPI ++  C E V+ ME +GQWS G + D R+  GYE VPT 
Sbjct: 588 K-IFTDNIVEQPCPDVFWFPIFSDTACDELVEEMEHFGQWSGGKHQDSRISGGYENVPTD 646

Query: 606 DIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHD 665
           DIHMKQ+GL   W  F+R+++ P+  + F GY+ +   A ++FVV+Y PD Q SLRPHHD
Sbjct: 647 DIHMKQIGLDNEWLHFIREFIAPVTLKVFAGYYTKGY-ALLNFVVKYSPDRQRSLRPHHD 705

Query: 666 SSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
           SST+TINIALN+VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL +  GTR
Sbjct: 706 SSTFTINIALNKVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPILNGTR 765

Query: 726 YIMISFVDP 734
           YI +SF+DP
Sbjct: 766 YIAVSFIDP 774


>gi|354490579|ref|XP_003507434.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2,
           partial [Cricetulus griseus]
          Length = 725

 Score =  638 bits (1645), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 312/710 (43%), Positives = 455/710 (64%), Gaps = 10/710 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
           I  DK LVITVA+ E DG+ RF+ SA+     VK LG  Q W GGD ++S+GGG KV L+
Sbjct: 22  IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGINSIGGGQKVRLM 81

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  + +    +D++IL T+ +DV+  GG  ++L++F   +  IVF A+ + WPD  L +K
Sbjct: 82  KEAMAQYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGILWPDKRLAEK 141

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G RYLNSGGFIGYA  I  L+   ++++ +DDQL+Y  +++D   R    I LD
Sbjct: 142 YPVVHIGKRYLNSGGFIGYAPYISHLVQEWNLQDNDDDQLFYTKVYIDPVKREAFNITLD 201

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQ L G+ +++ L F+  +   + NT Y T PV I+GNG +KI LN FGNY+  S
Sbjct: 202 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 260

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           W +  GC  C+    +D    D  P V I VFI++PT FL  FLN + +L+YP + + +F
Sbjct: 261 WTQEHGCALCDF-DTIDLSAVDVHPKVTIGVFIEQPTPFLPRFLNLLLSLDYPKEALKLF 319

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
           ++N + YH      +    K     +K +     ++  EARN+ ++     +  D+YF V
Sbjct: 320 IHNKEVYHEKDIKVFFDKAKHEISTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 379

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G 
Sbjct: 380 DADVVLTNPRTLKNLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGK 439

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
           +   GIWNVPY+ N YL++   +++  + +  +  + +D DMA C N R  G+ + I + 
Sbjct: 440 R--VGIWNVPYMANVYLIQGKTLRSEMSERNYFVRDKLDPDMALCRNAREMGMFMYISNR 497

Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
            E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  +++  QPCPDVFWF
Sbjct: 498 HEFGRLLSTANYNTSHLNNDLWQIFENPVDWKEKYINRDYSK-IFTESIVEQPCPDVFWF 556

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           PI +E+ C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQ+GL  VW  F+R+
Sbjct: 557 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQIGLENVWLHFIRE 616

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           ++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 617 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 675

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 676 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 725


>gi|218931167|ref|NP_001136388.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 1
           precursor [Mus musculus]
          Length = 758

 Score =  637 bits (1644), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 317/731 (43%), Positives = 460/731 (62%), Gaps = 31/731 (4%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
           I  DK LVITVA+ E DG+ RF+ SA+     VK LG  Q W GGD M+S+GGG KV LL
Sbjct: 34  IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  ++     +D++IL T+ +DV+  GG  ++L++F   +  IVF A+ L WPD  L DK
Sbjct: 94  KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G RYLNSGGFIGYA  I  L+   ++++ +DDQL+Y  +++D   R    I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQ L G+ +++ L F+  +   + NT Y T PV I+GNG +KI LN FGNY+  S
Sbjct: 214 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           W + +GC  C+ +  +D    D  P V + VFI++PT FL  FLN +  L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
           ++N + YH      ++   K    ++K +     ++  EARN+ ++     +  D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L NP  LK+L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKFLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------- 494
           +   GIWNVPY+ N YL++   +++  N +  +  + +D DMA C N R+          
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMTLQREKDSP 509

Query: 495 -----------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
                      KG+ + I +  E+G L+ + N++    N + +++  NP+DW  +YI+ +
Sbjct: 510 TPETFQMLSPPKGVFMYISNRHEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRD 569

Query: 544 YQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVP 603
           Y K +  + +  QPCPDVFWFPI +E+ C E V+ ME YG+WS G ++D R+  GYE VP
Sbjct: 570 YSK-IFTENIVEQPCPDVFWFPIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVP 628

Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
           T DIHMKQ+GL  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPH
Sbjct: 629 TDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPH 687

Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
           HD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  G
Sbjct: 688 HDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNG 747

Query: 724 TRYIMISFVDP 734
           TRYI +SF+DP
Sbjct: 748 TRYIAVSFIDP 758


>gi|355557553|gb|EHH14333.1| hypothetical protein EGK_00241 [Macaca mulatta]
          Length = 882

 Score =  637 bits (1643), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 310/743 (41%), Positives = 467/743 (62%), Gaps = 44/743 (5%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 145 EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 204

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 205 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 264

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 265 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 324

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSK---------------- 254
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K                
Sbjct: 325 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKYPPGARNTYLGACYEL 383

Query: 255 -------------------IELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSV 293
                              ++LN  GNY+ + W   +GCT C+  ++ L  +  +  P+V
Sbjct: 384 TISVLTSELSVVPSLPAVLLQLNYLGNYIPRFWTFETGCTVCDEGLRSLKGIGDEALPTV 443

Query: 294 LISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVK 353
           L+ +FI++PT F+  F  ++  L+YP K + +F++N++++H    ++++    + +++VK
Sbjct: 444 LVGMFIEQPTPFVSLFFQRLLQLHYPRKHMRLFIHNHEQHHKAQVEEFLAEHGSEYQSVK 503

Query: 354 YIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL 412
            +     + + +ARN+  +     +   +YF VD+D  L  P+ L+ L+ +N+++IAPL+
Sbjct: 504 LVGPEVRMANADARNMGADLCRQDRSCTYYFSVDADVALTEPNSLRLLIQQNKNVIAPLM 563

Query: 413 VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT- 471
            R  + WSNFWGAL+ADG+YARS DY++I+ G +   G+WNVPYI+N YL+K S ++   
Sbjct: 564 TRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR--IGVWNVPYISNIYLIKGSALRGEL 621

Query: 472 NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRN 531
               ++  + +D DMAFC N+R + + + + +    GHL+  +++     + +++E+  N
Sbjct: 622 QSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTLGHLLSLDSYRTAHLHNDLWEVFSN 681

Query: 532 PLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNN 591
           P DW  +YIH  Y K+L    V   PCPDV+WFPI TE  C E V+ ME +GQWS G N 
Sbjct: 682 PEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIFTEAACDELVEEMEHFGQWSLGDNK 740

Query: 592 DKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVR 651
           D R++ GYE VPT DIHM Q+G    W +FL +Y+ P+ E+ + GY+    +  ++FVVR
Sbjct: 741 DSRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYIAPMTEKLYPGYYTR-AQFDLAFVVR 799

Query: 652 YRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRL 711
           Y+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGCRF+RYNC++ A R GW LMHPGRL
Sbjct: 800 YKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGCRFLRYNCSIRAPRKGWTLMHPGRL 859

Query: 712 THYHEGLQVTQGTRYIMISFVDP 734
           THYHEGL  T+GTRYI +SFVDP
Sbjct: 860 THYHEGLPTTRGTRYIAVSFVDP 882


>gi|221120650|ref|XP_002157097.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
           [Hydra magnipapillata]
          Length = 717

 Score =  637 bits (1643), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 314/707 (44%), Positives = 458/707 (64%), Gaps = 12/707 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNLLKN 89
           E  F ++TVA+ +TDG+KRF++SA V  L V+  GL++ W GGD+ +  GGG K+N+LK 
Sbjct: 20  EISFKLVTVATEQTDGFKRFMRSANVFGLDVEVYGLNEKWEGGDLENGPGGGQKINILKE 79

Query: 90  ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
            L +    ++++++ TDSYDV+I+ G ++IL+RF   +A I+  AE  CWPD SL  KYP
Sbjct: 80  ALRKYKNNENLVLMFTDSYDVVINAGSDEILKRFLKTEAKILISAEDYCWPDKSLAVKYP 139

Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
            V  GY+YL SGG IGYA  + E++S + + + +DDQLYY  ++L+   R K+ I LD  
Sbjct: 140 KVNVGYKYLCSGGIIGYANKVYEVLSAKPVNHTDDDQLYYTQIYLEH--REKYNIKLDNK 197

Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
           A LFQNL G+ +D++L FD D   HL N ++ T P++IHGNG SK  L+  GNYL   W 
Sbjct: 198 AELFQNLNGNQDDVELRFDGDN--HLWNKRFGTYPIVIHGNGPSKDYLSHLGNYLGDYWT 255

Query: 270 -TSGCTRCNLIKHL-DSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
              GC  C     L   ++   +P VLI +FI  PT F+  +L  I+NL YP +KI +F+
Sbjct: 256 YADGCKSCKENTFLLQDVEVTNWPKVLIGLFIPAPTPFVTSYLEHISNLEYPKEKIDIFI 315

Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
           ++   +H P  +D++  F+T + +V Y    + +  KE R+LA E+  H   D+Y  VDS
Sbjct: 316 HSVDPHHDPHVEDWLKRFETKYLSVTYKRPTAFLTEKETRHLAFEHCKHVKCDYYLSVDS 375

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
              L N   L+ L+ +N + I+P++ +P K +SNFWG +  DGFY RS DY++I+  ++ 
Sbjct: 376 IVTLSNTKTLQMLIEQNRTFISPMISKPGKLFSNFWGKVGQDGFYERSPDYIDIVKYNR- 434

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            +G+WNVP+++N YL+++  +K       ++ + +D +M+FC+N R  G+ + I +   +
Sbjct: 435 -RGVWNVPFVSNVYLIQSDTLKKFK-SNPFSSDELDQEMSFCSNARKLGMFMYITNLDYF 492

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
           GH+ + E++     + ++Y++  N +DW+ RY+HPE    L P +  N PCPDV+WFP+ 
Sbjct: 493 GHIKEDESYTTHHKHNDLYQIFDNRIDWEDRYLHPEMMSYLNPTSTPNMPCPDVYWFPLT 552

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           T+ F  E V+ ME +G+WS G N D R+  GYE VPT DIHM QVGL   W + L+ Y+ 
Sbjct: 553 TKNFTKELVEEMENFGKWSGGGNKDDRISGGYENVPTVDIHMNQVGLEKQWLKILKDYIA 612

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           P+  R F GY+ +  RA M+FVV+Y  + Q  LRPHHDSSTYTIN+ALN V  +YEGGG 
Sbjct: 613 PMSSRYFTGYNSD-ARAIMNFVVKYTTNGQYYLRPHHDSSTYTINMALNNVN-EYEGGGA 670

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RF RYNC+V  T+ GW LMHPGRLTH HEGL + +GTRYIM+SFVDP
Sbjct: 671 RFTRYNCSVAKTKEGWALMHPGRLTHQHEGLPILKGTRYIMVSFVDP 717


>gi|148688976|gb|EDL20923.1| procollagen lysine, 2-oxoglutarate 5-dioxygenase 2, isoform CRA_b
           [Mus musculus]
          Length = 758

 Score =  636 bits (1640), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 317/731 (43%), Positives = 459/731 (62%), Gaps = 31/731 (4%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
           I  DK LVITVA+ E DG+ RF+ SA+     VK LG  Q W GGD M+S+GGG KV LL
Sbjct: 34  IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  ++     +D++IL T+ +DV+  GG  ++L++F   +  IVF A+ L WPD  L DK
Sbjct: 94  KEAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADK 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G RYLNSGGFIGYA  I  L+   ++++ +DDQL+Y  +++D   R    I LD
Sbjct: 154 YPVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQ L G+ +++ L F+  +   + NT Y T PV I+GNG +KI LN FGNY+  S
Sbjct: 214 HKCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           W + +GC  C+ +  +D    D  P V + VFI++PT FL  FLN +  L+YP + + +F
Sbjct: 273 WTQENGCALCD-VDTIDLSTVDVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPKEALQLF 331

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
           ++N + YH      ++   K    ++K +     ++  EARN+ ++     +  D+YF V
Sbjct: 332 IHNKEVYHEKDIKVFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------- 494
           +   GIWNVPY+ N YL++   +++  N +  +  + +D DMA C N R+          
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMNERNYFVRDKLDPDMALCRNARDMTLQREKDSP 509

Query: 495 -----------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
                      KG+ + I +  E+G L+ + N++    N + +++  NP+DW  +YI+ +
Sbjct: 510 TPETFQMLSPPKGVFMYISNRHEFGRLISTANYNTSHLNNDFWQIFENPVDWKEKYINRD 569

Query: 544 YQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVP 603
           Y K +  + +  QPCPDVFWFPI +E+ C E V+ ME YG+WS G ++D R+  GYE VP
Sbjct: 570 YSK-IFTENIVEQPCPDVFWFPIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVP 628

Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
           T DIHMKQ+GL  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPH
Sbjct: 629 TDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPH 687

Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
           HD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  G
Sbjct: 688 HDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNG 747

Query: 724 TRYIMISFVDP 734
           TRYI +SF+DP
Sbjct: 748 TRYIAVSFIDP 758


>gi|426218184|ref|XP_004003329.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 2 [Ovis aries]
 gi|426218186|ref|XP_004003330.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 3 [Ovis aries]
          Length = 760

 Score =  636 bits (1640), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 315/741 (42%), Positives = 464/741 (62%), Gaps = 31/741 (4%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD +++
Sbjct: 26  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINT 85

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     +D+++L T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 86  IGGGQKVRLMKEVMEHYANQEDLVVLFTECFDVIFAGGPEEVLKKFQKSNHKVVFAADGI 145

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D  
Sbjct: 146 LWPDKRLADKYPIVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 205

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     N  Y T PV+I+GNG +KI L
Sbjct: 206 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVMINGNGPTKILL 264

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  +W + +GCT C  +  +D  + D +P+V I VFI++PT FL  FLN +  L
Sbjct: 265 NYFGNYIPNAWTQDNGCTLCE-VDTIDLSEVDVYPNVTIGVFIEQPTPFLPRFLNTLLTL 323

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP K +  F++N + YH      +    K     +K +     ++  EARN+ ++    
Sbjct: 324 DYPKKALKFFIHNKEVYHEKDIKVFFDKAKHEITTIKIVGPEENLSQAEARNMGMDFCRQ 383

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  ++YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 384 DENCEYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 443

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   GIWNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 444 EDYVDIVQGNR--VGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 501

Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
                                KG+ + I +  E+G L+ + N++    N +++++  NP+
Sbjct: 502 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNISHFNNDLWQIFENPV 561

Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
           DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D 
Sbjct: 562 DWKEKYINRDYAK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDS 620

Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
           R+  GYE VPT DIHMKQ+GL  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y 
Sbjct: 621 RISGGYENVPTDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 679

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
           P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH
Sbjct: 680 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 739

Query: 714 YHEGLQVTQGTRYIMISFVDP 734
            HEGL V  GTRYI +SF+DP
Sbjct: 740 LHEGLPVKNGTRYIAVSFIDP 760


>gi|6093729|sp|Q63321.1|PLOD1_RAT RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1;
           AltName: Full=Lysyl hydroxylase 1; Short=LH1; Flags:
           Precursor
 gi|409059|gb|AAA41550.1| lysyl hydroxylase [Rattus norvegicus]
 gi|1584463|prf||2123247A Lys hydroxylase
          Length = 728

 Score =  635 bits (1639), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 311/709 (43%), Positives = 462/709 (65%), Gaps = 10/709 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGY-KVNLLKN 89
           ED  LV+TVA+ ET+G++RF +SA+    ++++LGL + W      S  GG  KV LLK 
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWSAAGGPSAAGGGQKVRLLKK 84

Query: 90  ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
            L +    +D++IL  DSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP
Sbjct: 85  ALKKYADKEDLVILFVDSYDVVFASGPRELLKKFQQAKSRVVFSAEELIYPDRRLEAKYP 144

Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
            V  G R+L SGGFIGYA  + +L++    ++ + DQL+Y  +FLD   R +  I LD  
Sbjct: 145 TVPDGKRFLGSGGFIGYAPSLSKLVAEWEGQDNDSDQLFYTKIFLDPEKREQINISLDHR 204

Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
             +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K+++N  GNY+ + W 
Sbjct: 205 CRIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVIHGNGPTKLQVNYLGNYIPRFWT 263

Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
             +GCT C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++ +L YP K++ +F+
Sbjct: 264 FETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFRRLLHLRYPQKQMRLFI 323

Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
           +N +++H    + ++      +++VK +     + + +ARN+  +     +   +YF VD
Sbjct: 324 HNQEQHHKLQVEQFLAEHGGEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFSVD 383

Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           +D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 ADVALTEPNSLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR 443

Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
              G+WNVPYI+N YL+K S ++A      ++  + +D DM+FC N+R + + + + +  
Sbjct: 444 --VGVWNVPYISNIYLIKGSALRAELRHVDLFHYSKLDPDMSFCANVRQQEVFMFLTNRH 501

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
            +GHL+  +N+     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFP
Sbjct: 502 TFGHLLSLDNYQTTHLHNDLWEVFSNPQDWKEKYIHENYTKALAGKLVET-PCPDVYWFP 560

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y
Sbjct: 561 IFTEVACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREWHKFLVEY 620

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + PL E+ + GY+ +  +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG DYEGG
Sbjct: 621 IAPLTEKLYPGYYTK-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGEDYEGG 679

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GCRF+RYNC+V A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 GCRFLRYNCSVRAPRKGWALMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 728


>gi|400153797|ref|NP_446279.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Rattus
           norvegicus]
 gi|149024588|gb|EDL81085.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 1 [Rattus
           norvegicus]
          Length = 728

 Score =  635 bits (1638), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 311/709 (43%), Positives = 462/709 (65%), Gaps = 10/709 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGY-KVNLLKN 89
           ED  LV+TVA+ ET+G++RF +SA+    ++++LGL + W      S  GG  KV LLK 
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWSAAGGPSAAGGGQKVRLLKK 84

Query: 90  ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
            L +    +D++IL  DSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP
Sbjct: 85  ALKKYADKEDLVILFVDSYDVVFASGPRELLKKFQQAKSRVVFSAEELIYPDRRLEAKYP 144

Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
            V  G R+L SGGFIGYA  + +L++    ++ + DQL+Y  +FLD   R +  I LD  
Sbjct: 145 TVPDGKRFLGSGGFIGYAPSLSKLVAEWEGQDNDSDQLFYTKIFLDPEKREQINISLDHR 204

Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
             +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K+++N  GNY+ + W 
Sbjct: 205 CRIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVIHGNGPTKLQVNYLGNYIPRFWT 263

Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
             +GCT C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++ +L YP K++ +F+
Sbjct: 264 FETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFRRLLHLRYPQKQMRLFI 323

Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
           +N +++H    + ++      +++VK +     + + +ARN+  +     +   +YF VD
Sbjct: 324 HNQEQHHKLQVEQFLAEHGGEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFSVD 383

Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           +D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +
Sbjct: 384 ADVALTEPNSLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR 443

Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
              G+WNVPYI+N YL+K S ++A      ++  + +D DM+FC N+R + + + + +  
Sbjct: 444 --VGVWNVPYISNIYLIKGSALRAELRHVDLFHYSKLDPDMSFCANVRQQEVFMFLTNRH 501

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
            +GHL+  +N+     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFP
Sbjct: 502 TFGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWFP 560

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y
Sbjct: 561 IFTEVACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREWHKFLVEY 620

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + PL E+ + GY+ +  +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG DYEGG
Sbjct: 621 IAPLTEKLYPGYYTK-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGEDYEGG 679

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GCRF+RYNC+V A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 680 GCRFLRYNCSVRAPRKGWALMHPGRLTHYHEGLPTTKGTRYIAVSFVDP 728


>gi|355559970|gb|EHH16698.1| hypothetical protein EGK_12027, partial [Macaca mulatta]
          Length = 744

 Score =  635 bits (1638), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 316/742 (42%), Positives = 462/742 (62%), Gaps = 31/742 (4%)

Query: 18  FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MS 76
           ++     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++
Sbjct: 9   YLGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGIN 68

Query: 77  SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAER 136
           S+GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ 
Sbjct: 69  SIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADG 128

Query: 137 LCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDE 196
           + WPD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D 
Sbjct: 129 ILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDP 188

Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE 256
             R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI 
Sbjct: 189 LKREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKIL 247

Query: 257 LNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIAN 315
           LN FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  
Sbjct: 248 LNYFGNYVPNSWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLT 306

Query: 316 LNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSL 375
           L+YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++   
Sbjct: 307 LDYPKEALKLFIHNKEVYHEKDIKAFFEKAKHEIKTIKIVGPEENLSQAEARNMGMDFCR 366

Query: 376 H-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYAR 434
             +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YAR
Sbjct: 367 QDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYAR 426

Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLR 493
           S DY++I+ G++   G+WNVPY+ N YL+K   ++   N +  +  + +D DMA C N R
Sbjct: 427 SEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAR 484

Query: 494 N---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNP 532
                                 KG+ + I +  E+G L+ + N++    N +++++  NP
Sbjct: 485 EMTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENP 544

Query: 533 LDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNND 592
           +DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D
Sbjct: 545 VDWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHD 603

Query: 593 KRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRY 652
            R+  GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y
Sbjct: 604 SRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKY 662

Query: 653 RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLT 712
            P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLT
Sbjct: 663 SPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLT 722

Query: 713 HYHEGLQVTQGTRYIMISFVDP 734
           H HEGL V  GTRYI +SF+DP
Sbjct: 723 HLHEGLPVKNGTRYIAVSFIDP 744


>gi|153792754|ref|NP_001093153.1| lysyl hydroxylase 2 precursor [Takifugu rubripes]
 gi|146325992|dbj|BAF61138.1| lysyl hydroxylase 2 [Takifugu rubripes]
          Length = 756

 Score =  635 bits (1638), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 318/756 (42%), Positives = 456/756 (60%), Gaps = 36/756 (4%)

Query: 9   CLILSCVV-----FFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKT 63
           C + SC +        +        I ++K LV+TVA+ ETDG++RF+QSA      VK 
Sbjct: 7   CFVFSCCLKIAFSLLSTETAQAPAPIPKEKLLVLTVATEETDGFQRFLQSARYFNYSVKV 66

Query: 64  LGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILER 122
           LG+ + W GGD+  S+GGG KV LLK  ++ +   DD+++L  DSYD+I  GG  +IL +
Sbjct: 67  LGMGEAWKGGDVGHSIGGGQKVRLLKEAMEALADQDDLVVLFVDSYDLIFAGGPEEILRK 126

Query: 123 FNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNE 182
           F   +  ++F AE L WPD  L DKYP V SG RYLNSGGFIGYA  I  ++S RS+ + 
Sbjct: 127 FQQANHKVLFAAEGLIWPDKRLADKYPLVHSGKRYLNSGGFIGYASQINRIVSQRSLHDN 186

Query: 183 EDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNT 242
           +DDQL+YA ++LD   R    + LD    +F  L G+ +++ L F     V + NT +++
Sbjct: 187 DDDQLFYAKIYLDPLQRQTLNMTLDHKCQIFLTLNGAADEVLLKFGTGR-VRVRNTAHDS 245

Query: 243 NPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDK 301
            PV++HGN  +KI LN  GNY+   W    GC+ C+    LD  + ++FPSVL+ VFI+K
Sbjct: 246 LPVVVHGNRNTKIFLNYLGNYVPNMWNYEHGCSLCDK-DILDLSRLNEFPSVLVGVFIEK 304

Query: 302 PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTV 361
           PT FL EF  ++ +L+YP  ++ +FV+NN+ +H      +    +  F + K +     +
Sbjct: 305 PTPFLPEFFQRLLSLDYPKDRLKLFVHNNEVFHEKHIQKFWEEHRNTFSDFKIVGPEENL 364

Query: 362 NSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS 420
           +  EARN+ ++        DFYF VDSD  L N   LK LV +N  +I PL+ R  K WS
Sbjct: 365 SQGEARNMGMDLCRKDAACDFYFSVDSDVMLTNSQTLKLLVEQNRKIIGPLVTRHGKLWS 424

Query: 421 NFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTL 479
             WGAL+ DG+YARS DY++I+   +   G+WN+PY+ + YL+K S ++   N +  + L
Sbjct: 425 YLWGALSPDGYYARSEDYIDIVQRKR--VGVWNIPYMAHVYLVKGSALRNELNERNHFVL 482

Query: 480 NSMDYDMAFCTNLRN---------------------KGIHLKIDSTQEYGHLVDSENFDP 518
             +D DMAFC N R                      KG+ + I ++ E+G L+ + N++ 
Sbjct: 483 EKLDPDMAFCRNAREMTSQREKDSPSPESFHMLRPPKGVFMYITNSHEFGRLISTANYNI 542

Query: 519 QKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQI 578
              N +++++  NP+DW  +YIH  Y + +  +    +PCPDVFWFP+ T+K C E V+ 
Sbjct: 543 SHYNNDLWQIFENPVDWKEKYIHENYTR-IFTENYMEEPCPDVFWFPVFTQKACDEIVEE 601

Query: 579 MEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYH 638
           ME YG WS G + DKR+  GYE VPT DIHMKQ+G    W  F+R+++ P+  + F GY+
Sbjct: 602 MEHYGSWSGGKHEDKRITGGYETVPTDDIHMKQIGFDKEWLHFIREFISPVTLKVFSGYY 661

Query: 639 HEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTA 698
            +   A M+FVV+Y P+ Q  LRPHHDSST+TINIALN    D++GGGCRF  YNC++ +
Sbjct: 662 TKGY-AIMNFVVKYTPERQAYLRPHHDSSTFTINIALNNKDTDFQGGGCRFHGYNCSIES 720

Query: 699 TRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
            R GW  MHP RLTH HEGL  T GTRYI +SF+DP
Sbjct: 721 PRKGWSFMHPERLTHLHEGLPTTNGTRYIAVSFIDP 756


>gi|402861308|ref|XP_003895040.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 1 [Papio anubis]
          Length = 758

 Score =  635 bits (1637), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 316/742 (42%), Positives = 462/742 (62%), Gaps = 31/742 (4%)

Query: 18  FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MS 76
           ++     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++
Sbjct: 23  YLGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGIN 82

Query: 77  SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAER 136
           S+GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ 
Sbjct: 83  SIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADG 142

Query: 137 LCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDE 196
           + WPD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D 
Sbjct: 143 ILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDP 202

Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE 256
             R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI 
Sbjct: 203 LKREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKIL 261

Query: 257 LNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIAN 315
           LN FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  
Sbjct: 262 LNYFGNYVPNSWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLT 320

Query: 316 LNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSL 375
           L+YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++   
Sbjct: 321 LDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCR 380

Query: 376 H-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYAR 434
             +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YAR
Sbjct: 381 QDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYAR 440

Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLR 493
           S DY++I+ G++   G+WNVPY+ N YL+K   ++   N +  +  + +D DMA C N R
Sbjct: 441 SEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAR 498

Query: 494 N---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNP 532
                                 KG+ + I +  E+G L+ + N++    N +++++  NP
Sbjct: 499 EMTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENP 558

Query: 533 LDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNND 592
           +DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D
Sbjct: 559 VDWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHD 617

Query: 593 KRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRY 652
            R+  GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y
Sbjct: 618 SRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKY 676

Query: 653 RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLT 712
            P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLT
Sbjct: 677 SPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLT 736

Query: 713 HYHEGLQVTQGTRYIMISFVDP 734
           H HEGL V  GTRYI +SF+DP
Sbjct: 737 HLHEGLPVKNGTRYIAVSFIDP 758


>gi|380813820|gb|AFE78784.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 1
           precursor [Macaca mulatta]
          Length = 758

 Score =  634 bits (1636), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 316/742 (42%), Positives = 462/742 (62%), Gaps = 31/742 (4%)

Query: 18  FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MS 76
           ++     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++
Sbjct: 23  YLGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGIN 82

Query: 77  SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAER 136
           S+GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ 
Sbjct: 83  SIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADG 142

Query: 137 LCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDE 196
           + WPD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D 
Sbjct: 143 ILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDP 202

Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE 256
             R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI 
Sbjct: 203 LKREAINITLDHKCKVFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKIL 261

Query: 257 LNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIAN 315
           LN FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  
Sbjct: 262 LNYFGNYVPNSWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLT 320

Query: 316 LNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSL 375
           L+YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++   
Sbjct: 321 LDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCR 380

Query: 376 H-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYAR 434
             +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YAR
Sbjct: 381 QDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYAR 440

Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLR 493
           S DY++I+ G++   G+WNVPY+ N YL+K   ++   N +  +  + +D DMA C N R
Sbjct: 441 SEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAR 498

Query: 494 N---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNP 532
                                 KG+ + I +  E+G L+ + N++    N +++++  NP
Sbjct: 499 EMTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENP 558

Query: 533 LDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNND 592
           +DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D
Sbjct: 559 VDWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHD 617

Query: 593 KRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRY 652
            R+  GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y
Sbjct: 618 SRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKY 676

Query: 653 RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLT 712
            P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLT
Sbjct: 677 SPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLT 736

Query: 713 HYHEGLQVTQGTRYIMISFVDP 734
           H HEGL V  GTRYI +SF+DP
Sbjct: 737 HLHEGLPVKNGTRYIAVSFIDP 758


>gi|354477608|ref|XP_003501011.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
           [Cricetulus griseus]
          Length = 658

 Score =  634 bits (1635), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 315/708 (44%), Positives = 446/708 (62%), Gaps = 59/708 (8%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
           DK LVITVA+ ET+GY+RF+QSAE     V+TLGL   W GGD++ ++GGG KV  LK E
Sbjct: 5   DKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGHEWRGGDVARTVGGGQKVRWLKKE 64

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           +++    +DMII+  DSYDVI+     ++L++F    ++++F AE  CWP+  L ++YP 
Sbjct: 65  MEKYANREDMIIMFVDSYDVILASSPAELLKKFVQSGSHLLFSAEGFCWPEWGLAEQYPE 124

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           VG G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K K+ LD  +
Sbjct: 125 VGMGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLKLNLDHKS 184

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
            +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   W  
Sbjct: 185 RIFQNLNGALDEVVLKFDQNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNGWTP 243

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
             GC  CN   + L   +P   P VL++VF                              
Sbjct: 244 QGGCGFCNQNQRTLPGGQPP--PRVLLAVF------------------------------ 271

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
                               F   K +     ++  EAR++A+++       +FYF +D+
Sbjct: 272 ------------------AHFSAAKLVGPEEALSPGEARDMAMDSCRQDPKCEFYFSLDA 313

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP+ L+ L+ +N  +I P+L R  K WSNFWGAL+ D +YARS DY+ ++   + 
Sbjct: 314 DAVLTNPETLRILIEQNRKVICPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQRKR- 372

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+  Y+++   ++     K +++ +  D DMAFC +LR+KGI L + +  E
Sbjct: 373 -VGVWNVPYISQAYVIRGETLRTELPQKEVFSGSDTDPDMAFCKSLRDKGIFLHLSNQHE 431

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +G L+ +  +D    +P+++++  NP+DW  +YIH  Y ++L    +  QPCPDV+WFP+
Sbjct: 432 FGRLLATSRYDTDHLHPDLWQIFDNPVDWKEQYIHENYSRALDGQGLVEQPCPDVYWFPL 491

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
           +TE+ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR YV
Sbjct: 492 LTEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLRTYV 551

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  GVDYEGGG
Sbjct: 552 GPMTEYLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGVDYEGGG 610

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RY+C +++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 611 CRFLRYDCRISSPRKGWALLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 658


>gi|340378758|ref|XP_003387894.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like
           [Amphimedon queenslandica]
          Length = 718

 Score =  634 bits (1635), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 312/708 (44%), Positives = 453/708 (63%), Gaps = 14/708 (1%)

Query: 36  VITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSL-GGGYKVNLLKNELDEM 94
           VITVA+ ETDG+KRF++SA    + V+ +G+ + W GGD+    GGG+K+NLLK  L++ 
Sbjct: 16  VITVATEETDGFKRFMKSAAYYGISVEIVGMGEEWKGGDIQRYPGGGFKLNLLKPVLEKW 75

Query: 95  DITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSG 154
               D++++  DSYDVI       ILE+F  F  N+VF AE+ CWPD SL  +YP VG G
Sbjct: 76  RERKDLVVMFVDSYDVIFAANSEKILEKFKDFRTNLVFSAEQFCWPDQSLASRYPKVGLG 135

Query: 155 YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQ 214
            R+L SGG+IGYA  +  +I++  I + +DDQL+Y  ++LD   R K+ + LD  +++FQ
Sbjct: 136 KRFLCSGGYIGYASQMYSIITDSEISDTDDDQLFYTKIYLDPHKRDKYGMRLDHRSHIFQ 195

Query: 215 NLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT-SGC 273
           NL G+ ++I L+   +E  + TNT YNT   I+HGNG SK  LN  GNY+   +    GC
Sbjct: 196 NLNGAEDEIDLHVTSNE-SYATNTLYNTRAAILHGNGGSKNFLNFLGNYIPNQYNVDEGC 254

Query: 274 TRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEE-FLNKIANLNYPAKKISMFVYNNQE 332
             C+   H       ++P+V + +F+  PT FL E  L  ++ LNYP   I ++VYN   
Sbjct: 255 LHCSEGLHELPEDSSKWPTVFVGLFVMSPTPFLREAILKSLSELNYPKNLIHLWVYNKNS 314

Query: 333 YHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLD 392
           YH  L   +    K+ + +V+Y      +   EAR  A++ SL K  D+Y  +DS    D
Sbjct: 315 YHEDLLSKWSDEVKSEYASVQYTGSYRDITEIEARTTAMKESLSKKSDYYLMLDSTGVFD 374

Query: 393 NPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIW 452
           +PD L+ L+  N+ +IAP+L RP K W+NFWG++  DGFYARS DY  I+   +  KGIW
Sbjct: 375 DPDALRKLITLNKHVIAPILGRPDKYWTNFWGSIAKDGFYARSRDYFVIVESKR--KGIW 432

Query: 453 NVPYITNCYLMKTSVI--KATNIKTIYTLNSMDY--DMAFCTNLRNKGIHLKIDSTQEYG 508
           NVP+I+   L +   +  ++T+ K + +  S ++  DMA C  +RN G  + + + Q+YG
Sbjct: 433 NVPFISTAILFEGEWLLKRSTDAKGLPSFASEEFEPDMALCQWMRNNGHFMYVSNLQKYG 492

Query: 509 HLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSL--LPDTVNNQPCPDVFWFPI 566
           HL+ + N++    + ++Y +  N  +W+ +Y+H  Y  +L   PD ++ QPC DV+WFP+
Sbjct: 493 HLISTSNYEIHHLHNDIYNIFENRQEWEKKYLHENYSVALNAGPDDIS-QPCTDVYWFPL 551

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
           ++  +  E ++ +E +G+WS+G N+D RL+ GYE VPTRDIHM QVG    W + L KYV
Sbjct: 552 LSPAYTKEIIEELEKFGKWSNGENDDPRLDGGYENVPTRDIHMNQVGFEKQWLDILAKYV 611

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
           VP+Q + F GY+    RA ++FVV+Y P  QP LRPHHDSST+TIN+AL + G+D++GGG
Sbjct: 612 VPIQIKVFPGYYSR-ARADLNFVVKYHPQGQPDLRPHHDSSTFTINVALTRPGIDHQGGG 670

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+R NC+V  T++GW LMHPGRLTHYHEGL  T GTRYIM+SFVDP
Sbjct: 671 CRFVRQNCSVVDTKLGWALMHPGRLTHYHEGLPTTSGTRYIMVSFVDP 718


>gi|327284629|ref|XP_003227039.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like
           [Anolis carolinensis]
          Length = 759

 Score =  634 bits (1635), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 313/731 (42%), Positives = 465/731 (63%), Gaps = 31/731 (4%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGG-DMSSLGGGYKVNLL 87
           I  DK LV+T+A+ ETDG+ RF+QSA+     VK LG  + W GG  ++S+GGG KV LL
Sbjct: 35  IPTDKLLVLTIATKETDGFHRFMQSAKHFNYTVKILGEGEKWKGGKSLNSIGGGQKVRLL 94

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K+ LD     +D++++  D YDVI   G  ++L++F   +  +VF A+ L WPD  L DK
Sbjct: 95  KSALDIYADQEDLVVMYVDCYDVIFAAGPEELLKKFQQANHKVVFAADGLIWPDKRLSDK 154

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V SG R+LN+GGFIGY+  +  ++    +++ +DDQL+Y  +++D   R +  I LD
Sbjct: 155 YPVVRSGKRFLNAGGFIGYSPSVNSIVQQWDLQDNDDDQLFYTKIYIDPLKRERINITLD 214

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
              N+FQ L G+++++ L F+ +      N+ Y+T PV +HGNG +K+ LN FGNY+   
Sbjct: 215 HKCNIFQTLNGAVDEVLLKFE-EGRARARNSVYDTLPVTLHGNGPTKLNLNYFGNYIPNG 273

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           W + +GC  CN    LD     + P+V+I VFI++PT FL  FL+++  L+Y  +K+S F
Sbjct: 274 WTRETGCIACNK-DLLDLATLTETPTVIIGVFIEQPTPFLARFLDRLLTLDYAKEKLSFF 332

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE-NSLHKGVDFYFYV 385
           ++NN+ YH      +    K M K +K +     ++  +ARN+ +E    +K  D+YF +
Sbjct: 333 IHNNEVYHEKHIKKFWEKAKNMIKTIKIVGPEENLSQADARNMGMEICRQNKECDYYFSI 392

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L NPD LK L+ +N  +IAPL++R  K WSNFWGAL+ADG+YARS DY++I+ G+
Sbjct: 393 DADVVLTNPDTLKILIEQNRKIIAPLVMRHGKLWSNFWGALSADGYYARSEDYIDIVQGN 452

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------- 494
           +   G+WNVP++ N YL+K   +++    +  +    +D DMA C N R           
Sbjct: 453 R--VGLWNVPFVANIYLIKGQTLRSEMKERNYFARERLDSDMALCRNAREMTLQREKDSP 510

Query: 495 -----------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
                      KG+ + I +  E+G L+ + N++    N +++++  NP+DW   YI+P 
Sbjct: 511 SAETFHMLRPPKGVFMYITNRHEFGRLLSTANYNISHYNNDLWQIFENPVDWKEVYINPN 570

Query: 544 YQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVP 603
           Y K +  + +  QPCPDVFWFPI +E  C E V+ ME +GQWS G ++D R+  GYE VP
Sbjct: 571 YSK-IFTEKIVEQPCPDVFWFPIFSEAACDELVEEMEHFGQWSGGRHHDSRISGGYENVP 629

Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
           T DIHMKQ+GL  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y PD Q SLRPH
Sbjct: 630 TDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKG-HALLNFVVKYSPDRQRSLRPH 688

Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
           HD+ST+TINIALN+V  D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL + +G
Sbjct: 689 HDASTFTINIALNKVEEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPILKG 748

Query: 724 TRYIMISFVDP 734
           TRYI +SF+DP
Sbjct: 749 TRYIAVSFIDP 759


>gi|155372023|ref|NP_001094619.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 precursor [Bos
           taurus]
 gi|154425678|gb|AAI51391.1| PLOD2 protein [Bos taurus]
 gi|296491069|tpg|DAA33152.1| TPA: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Bos taurus]
          Length = 762

 Score =  633 bits (1633), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 315/741 (42%), Positives = 464/741 (62%), Gaps = 31/741 (4%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD +++
Sbjct: 28  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINT 87

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     +D+++L T+ ++VI  GG  ++L++F   +  +VF A+ +
Sbjct: 88  IGGGQKVRLMKEIMEHYANQEDLVVLFTECFNVIFAGGPEEVLKKFQKSNHKVVFAADGI 147

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D  
Sbjct: 148 LWPDKRLADKYPIVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 207

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     N  Y T PV+I+GNG +KI L
Sbjct: 208 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVMINGNGPTKILL 266

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  +W + +GCT C  +  +D    D +P+V I VFI++PT FL  FLN +  L
Sbjct: 267 NYFGNYIPNAWTQDNGCTFCE-VDTIDLSAVDVYPNVTIGVFIEQPTPFLPRFLNTLLTL 325

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + +  F++N + YH      +    K     +K +     ++  EARN+ ++    
Sbjct: 326 DYPKEALKFFIHNKEVYHEKDIKVFFDKAKHEITTIKIVGPEENLSQAEARNMGMDFCRQ 385

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            K  ++YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 386 DKNCEYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 445

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTL-NSMDYDMAFCTNLRN 494
            DY++I+ G++   GIWNVPY+ N YL+K   +++  I+  Y + + +D DMA C N R 
Sbjct: 446 EDYVDIVQGNR--VGIWNVPYMANVYLIKGKTLRSEMIERNYFVRDKLDPDMALCRNARE 503

Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
                                KG+ + I +  E+G L+ + N++    N +++++  NP+
Sbjct: 504 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNISHFNNDLWQIFENPV 563

Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
           DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D 
Sbjct: 564 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDS 622

Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
           R+  GYE VPT DIHMKQ+GL  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y 
Sbjct: 623 RISGGYENVPTDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 681

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
           P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH
Sbjct: 682 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 741

Query: 714 YHEGLQVTQGTRYIMISFVDP 734
            HEGL V  GTRYI +SF+DP
Sbjct: 742 LHEGLPVKNGTRYIAVSFIDP 762


>gi|332232414|ref|XP_003265401.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 1 [Nomascus leucogenys]
          Length = 758

 Score =  633 bits (1632), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 315/741 (42%), Positives = 462/741 (62%), Gaps = 31/741 (4%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDSL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLAL 321

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499

Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
                                KG+ + I +  E+G L+ + N++    N +++++  NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 559

Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
           DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D 
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618

Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
           R+  GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y 
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
           P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737

Query: 714 YHEGLQVTQGTRYIMISFVDP 734
            HEGL V  GTRYI +SF+DP
Sbjct: 738 LHEGLPVKNGTRYIAVSFIDP 758


>gi|403278813|ref|XP_003930979.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 1 [Saimiri boliviensis boliviensis]
          Length = 758

 Score =  632 bits (1631), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 316/741 (42%), Positives = 462/741 (62%), Gaps = 31/741 (4%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +  +  K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGANSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  +      DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLMKEIMGHYADQDDLVVMFTECFDVIFAGGPEELLKKFQKANHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTL 321

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 382 DENCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499

Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
                                KG+ + I +  E+G L+ + N++    N +++++  NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 559

Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
           DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D 
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618

Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
           R+  GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y 
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
           P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737

Query: 714 YHEGLQVTQGTRYIMISFVDP 734
            HEGL V  GTRYI +SF+DP
Sbjct: 738 LHEGLPVKNGTRYIAVSFIDP 758


>gi|410254496|gb|JAA15215.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [Pan
           troglodytes]
          Length = 758

 Score =  632 bits (1631), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 315/741 (42%), Positives = 462/741 (62%), Gaps = 31/741 (4%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKIFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499

Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
                                KG+ + I +  E+G L+ + N++    N +++++  NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 559

Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
           DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D 
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618

Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
           R+  GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y 
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
           P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737

Query: 714 YHEGLQVTQGTRYIMISFVDP 734
            HEGL V  GTRYI +SF+DP
Sbjct: 738 LHEGLPVKSGTRYIAVSFIDP 758


>gi|268529772|ref|XP_002630012.1| C. briggsae CBR-LET-268 protein [Caenorhabditis briggsae]
          Length = 733

 Score =  632 bits (1630), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 301/717 (41%), Positives = 466/717 (64%), Gaps = 18/717 (2%)

Query: 30  DEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLK 88
           D  + +V+TVA+  TDG KR ++SA+   + ++ LGL + W GGD     GGG K+ +L 
Sbjct: 23  DLPELVVVTVATENTDGLKRLLESAKAFDINIEVLGLGEKWNGGDTRVEKGGGQKIRILS 82

Query: 89  NELDEMDITDDMIILVTDSYDVIIDGGVNDILERF--NTFDANIVFGAERLCWPDTSLYD 146
             +++     D II+  D+YDV+ +    +IL++F  +     ++FGAE  CWPD +L  
Sbjct: 83  KWIEKYKDASDTIIMFVDAYDVVFNADSKNILQKFLEHYPGKQLLFGAEPFCWPDQTLAP 142

Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
            YP V  G R+LNSG F+GY   + ++++ +S+++++DDQLYY +++LDE LR +  + L
Sbjct: 143 DYPIVEFGKRFLNSGLFMGYGPQVHKILTLKSVEDKDDDQLYYTMIYLDEKLRKELNMDL 202

Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
           D+++ +FQNL G +ED++L F  D      N  YNT P+I+HGNG SK  LN  GNYL  
Sbjct: 203 DSMSKIFQNLNGVIEDVELQFKEDGTPEAYNAAYNTKPLIVHGNGPSKSHLNYLGNYLGN 262

Query: 267 SWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
            W +  GC  C+      + +  +FP + +++FI KP  F+EE L K++  +YP  +I++
Sbjct: 263 RWNSQLGCRTCD---QEGAKEQTEFPLIGLNLFISKPVPFIEEVLQKVSEFDYPKNRIAL 319

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
           ++YNNQ +      D++ +    +   + I   + +  ++ARN A++    +  +F F++
Sbjct: 320 YIYNNQPFSIKNIQDFLKDHGKSYYTKRIINGVTEIGERQARNEAIDWCKQRDTEFAFFM 379

Query: 386 DSDSHLDNPDVLKYLVNRNES----LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
           D D++   P V+K L++ ++S    +I+P++ +P K ++NFWGA+ A+G+YARS DYM I
Sbjct: 380 DGDAYFTEPTVIKDLIHYSKSYDVGIISPMVGQPGKLFTNFWGAIAANGYYARSEDYMAI 439

Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
           + G++   G WNVP++T+  LM    + A +    Y  N +D DM+ C   R+ G  + I
Sbjct: 440 VKGNR--VGYWNVPFVTSALLMSKEKLGAMSGAYTYNKN-LDPDMSLCQFARDNGHFMYI 496

Query: 502 DSTQEYGHLVDSENFDPQKT----NPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQP 557
           ++ + +G+L+ S+ F    T    +PE++++  N   W+ RYIHP Y K + PD + +Q 
Sbjct: 497 NNEKYFGYLIVSDEFSETVTEGKWHPEMWQIFENRELWEARYIHPGYHKIMEPDHIIDQA 556

Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGV 617
           CPDV+ +P+++E+FC E ++ ME +G+WSDG+NNDKRL  GYE VPTRDIHM QVG    
Sbjct: 557 CPDVYDYPLMSERFCEELIEEMEGFGRWSDGSNNDKRLAGGYENVPTRDIHMNQVGFERQ 616

Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
           W  FL  YV P+QE+ FIGY+H+PV + M FVVRY+P+EQ SLRPHHD+ST++I+IALN+
Sbjct: 617 WLYFLDTYVRPVQEKTFIGYYHQPVESNMMFVVRYKPEEQASLRPHHDASTFSIDIALNK 676

Query: 678 VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
            G DYEGGG R++RYNC V A  +G+ +M PGRLTH HEGL  T+GTRYIM+SF++P
Sbjct: 677 KGRDYEGGGVRYVRYNCTVEADEVGYAMMFPGRLTHLHEGLATTKGTRYIMVSFINP 733


>gi|397512442|ref|XP_003826554.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 2 [Pan paniscus]
          Length = 758

 Score =  632 bits (1630), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 315/741 (42%), Positives = 462/741 (62%), Gaps = 31/741 (4%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLMKEVMERYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKIFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499

Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
                                KG+ + I +  E+G L+ + N++    N +++++  NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 559

Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
           DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D 
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618

Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
           R+  GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y 
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
           P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737

Query: 714 YHEGLQVTQGTRYIMISFVDP 734
            HEGL V  GTRYI +SF+DP
Sbjct: 738 LHEGLPVKNGTRYIAVSFIDP 758


>gi|410307702|gb|JAA32451.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [Pan
           troglodytes]
          Length = 758

 Score =  632 bits (1630), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 315/741 (42%), Positives = 462/741 (62%), Gaps = 31/741 (4%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKIFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499

Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
                                KG+ + I +  E+G L+ + N++    N +++++  NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 559

Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
           DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D 
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618

Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
           R+  GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y 
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
           P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737

Query: 714 YHEGLQVTQGTRYIMISFVDP 734
            HEGL V  GTRYI +SF+DP
Sbjct: 738 LHEGLPVKNGTRYIAVSFIDP 758


>gi|33636742|ref|NP_891988.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 1
           precursor [Homo sapiens]
 gi|22713625|gb|AAH37169.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [Homo sapiens]
 gi|119599345|gb|EAW78939.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2, isoform CRA_a
           [Homo sapiens]
          Length = 758

 Score =  632 bits (1630), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 315/741 (42%), Positives = 462/741 (62%), Gaps = 31/741 (4%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKVYIDPL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 321

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 382 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499

Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
                                KG+ + I +  E+G L+ + N++    N +++++  NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 559

Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
           DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D 
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618

Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
           R+  GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y 
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
           P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737

Query: 714 YHEGLQVTQGTRYIMISFVDP 734
            HEGL V  GTRYI +SF+DP
Sbjct: 738 LHEGLPVKNGTRYIAVSFIDP 758


>gi|440899349|gb|ELR50661.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2, partial [Bos
           grunniens mutus]
          Length = 726

 Score =  632 bits (1629), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 313/728 (42%), Positives = 460/728 (63%), Gaps = 31/728 (4%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNE 90
           DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++++GGG KV L+K  
Sbjct: 5   DKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINTIGGGQKVRLMKEV 64

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           ++     +D+++L T+ ++VI  GG  ++L++F   +  +VF A+ + WPD  L DKYP 
Sbjct: 65  MEHYANQEDLVVLFTECFNVIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKRLADKYPI 124

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D   R    I LD   
Sbjct: 125 VHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKC 184

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
            +FQ L G+++++ L F+  +     N  Y T PV+I+GNG +KI LN FGNY+  +W +
Sbjct: 185 KIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVMINGNGPTKILLNYFGNYIPNAWTQ 243

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
            +GCT C  +  +D    D +P+V I VFI++PT FL  FLN +  L+YP + +  F++N
Sbjct: 244 DNGCTFCE-VDTIDLSAVDVYPNVTIGVFIEQPTPFLPRFLNTLLTLDYPKEALKFFIHN 302

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
            + YH      +    K     +K +     ++  EARN+ ++     K  ++YF VD+D
Sbjct: 303 KEVYHEKDIKVFFDKAKHEITTIKIVGPEENLSQAEARNMGMDFCRQDKNCEYYFSVDAD 362

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++  
Sbjct: 363 VVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR-- 420

Query: 449 KGIWNVPYITNCYLMKTSVIKATNIKTIYTL-NSMDYDMAFCTNLRN------------- 494
            GIWNVPY+ N YL+K   +++  I+  Y + + +D DMA C N R              
Sbjct: 421 VGIWNVPYMANVYLIKGKTLRSEMIERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPE 480

Query: 495 --------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
                   KG+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K
Sbjct: 481 TFQMLSPPKGVFMYISNRHEFGRLLSTANYNISHFNNDLWQIFENPVDWKEKYINRDYSK 540

Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRD 606
            +  + +  QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+  GYE VPT D
Sbjct: 541 -IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDD 599

Query: 607 IHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
           IHMKQ+GL  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+
Sbjct: 600 IHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDA 658

Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
           ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRY
Sbjct: 659 STFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRY 718

Query: 727 IMISFVDP 734
           I +SF+DP
Sbjct: 719 IAVSFIDP 726


>gi|431899781|gb|ELK07728.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Pteropus alecto]
          Length = 785

 Score =  632 bits (1629), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 314/728 (43%), Positives = 458/728 (62%), Gaps = 31/728 (4%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNE 90
           DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S+GGG KV L+K  
Sbjct: 64  DKLLVITVATKESDGFHRFMQSAQYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEV 123

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           ++     +D+++L T+ +DVI  GG  ++L++F   +  +VF A+ + WPD  L DKYP 
Sbjct: 124 MEHYASQEDLVVLFTECFDVIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKRLADKYPI 183

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D   R    I LD   
Sbjct: 184 VHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKC 243

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
            +FQ L G+++++ L F+  +     N  Y T PV I+GNG +KI LN FGNY+  +W +
Sbjct: 244 KIFQALNGAVDEVVLKFENGK-ARAKNVFYETLPVAINGNGPTKILLNYFGNYVPNAWTQ 302

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
            +GCT C L   +D    + +P+V I VFI++PT FL  FL+ +  L+YP + + +F++N
Sbjct: 303 DNGCTLCEL-DTIDLSAVNVYPNVTIGVFIEQPTPFLSRFLDVLLTLDYPKEALKVFIHN 361

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
            + YH      +    K     +K +     ++  EARN+ ++     K  D+YF VD+D
Sbjct: 362 KEVYHEKDIKVFFDKAKHEISTIKVVGPEENLSQAEARNMGMDFCRQDKNCDYYFSVDAD 421

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++  
Sbjct: 422 VVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR-- 479

Query: 449 KGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------------- 494
            GIWNVPY+ N YL+K   +++  N +  +  + +D DMA C N R              
Sbjct: 480 IGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPE 539

Query: 495 --------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
                   KG+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K
Sbjct: 540 TFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK 599

Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRD 606
            +  + +  QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+  GYE VPT D
Sbjct: 600 -IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDD 658

Query: 607 IHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
           IHMKQ+ L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y PD Q SLRPHHD+
Sbjct: 659 IHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRSLRPHHDA 717

Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
           ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRY
Sbjct: 718 STFTINIALNSVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRY 777

Query: 727 IMISFVDP 734
           I +SF+DP
Sbjct: 778 IAVSFIDP 785


>gi|296227892|ref|XP_002759562.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 1 [Callithrix jacchus]
          Length = 758

 Score =  631 bits (1628), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 316/741 (42%), Positives = 461/741 (62%), Gaps = 31/741 (4%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  +      DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLVKEVMGHYADQDDLVVMFTECFDVIFAGGPEELLKKFQKANHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 263 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTL 321

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 322 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 381

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 382 DENCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 441

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 442 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 499

Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
                                KG+ + I +  E+G L+ + N++    N +++++  NP+
Sbjct: 500 MTLQREKDSPTPETFQMLSPPKGVFMYISNRNEFGRLLSTANYNTSHYNNDLWQIFENPV 559

Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
           DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D 
Sbjct: 560 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDS 618

Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
           R+  GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y 
Sbjct: 619 RISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 677

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
           P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH
Sbjct: 678 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 737

Query: 714 YHEGLQVTQGTRYIMISFVDP 734
            HEGL V  GTRYI +SF+DP
Sbjct: 738 LHEGLPVKNGTRYIAVSFIDP 758


>gi|355746993|gb|EHH51607.1| hypothetical protein EGM_11017, partial [Macaca fascicularis]
          Length = 723

 Score =  631 bits (1628), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 314/728 (43%), Positives = 457/728 (62%), Gaps = 31/728 (4%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNE 90
           DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S+GGG KV L+K  
Sbjct: 2   DKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEV 61

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ + WPD  L DKYP 
Sbjct: 62  MEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGILWPDKRLADKYPV 121

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D   R    I LD   
Sbjct: 122 VHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKC 181

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
            +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI LN FGNY+  SW +
Sbjct: 182 KIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQ 240

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
            +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L+YP + + +F++N
Sbjct: 241 GNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHN 299

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
            + YH      +    K   K +K +     ++  EARN+ ++     +  D+YF VD+D
Sbjct: 300 KEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSVDAD 359

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++  
Sbjct: 360 VVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR-- 417

Query: 449 KGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLRN------------- 494
            G+WNVPY+ N YL+K   ++   N +  +  + +D DMA C N R              
Sbjct: 418 VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPE 477

Query: 495 --------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
                   KG+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K
Sbjct: 478 TFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK 537

Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRD 606
            +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+  GYE VPT D
Sbjct: 538 -IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDD 596

Query: 607 IHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
           IHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+
Sbjct: 597 IHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDA 655

Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
           ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRY
Sbjct: 656 STFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRY 715

Query: 727 IMISFVDP 734
           I +SF+DP
Sbjct: 716 IAVSFIDP 723


>gi|301778997|ref|XP_002924915.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like
           isoform 1 [Ailuropoda melanoleuca]
          Length = 757

 Score =  631 bits (1627), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 313/741 (42%), Positives = 464/741 (62%), Gaps = 31/741 (4%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 23  LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 82

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     +D++IL T+ ++VI  GG  ++L++F   +  +VF A+ +
Sbjct: 83  IGGGQKVRLMKEVMEHYANQEDLVILFTECFNVIFAGGPEEVLKKFQKSNHKVVFAADGI 142

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA +I +++   ++++ +DDQL+Y  +++D  
Sbjct: 143 LWPDKRLADKYPIVHIGKRYLNSGGFIGYAPNINQIVQQWNLQDNDDDQLFYTKIYIDPL 202

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     N  Y T PV ++GNG +KI L
Sbjct: 203 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVAVNGNGPTKILL 261

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  +W + +GCT C+L   +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 262 NYFGNYVPNAWTQDNGCTLCDL-DTIDLSTVDVHPNVTIGVFIEQPTPFLPRFLDILLTL 320

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K     +K +     ++  EARN+ ++    
Sbjct: 321 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKREISTIKIVGPEENLSQAEARNMGMDFCRQ 380

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF +D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 381 DENCDYYFSMDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 440

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   GIWNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 441 EDYVDIVQGNR--VGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 498

Query: 495 ---------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
                                KG+ + I +  E+G L+ + N++    N +++++  NP+
Sbjct: 499 MTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPV 558

Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
           DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D 
Sbjct: 559 DWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDS 617

Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
           R+  GYE VPT DIHMKQ+ L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y 
Sbjct: 618 RISGGYENVPTDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYS 676

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
           P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH
Sbjct: 677 PERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTH 736

Query: 714 YHEGLQVTQGTRYIMISFVDP 734
            HEGL V  GTRYI +SF+DP
Sbjct: 737 LHEGLPVKNGTRYIAVSFIDP 757


>gi|28400781|emb|CAD23629.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2, long variant
           [Rattus norvegicus]
          Length = 758

 Score =  630 bits (1626), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 315/731 (43%), Positives = 457/731 (62%), Gaps = 31/731 (4%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
           I  DK LVITVA+ E DG+ RF+ SA+     VK LG  Q W GGD M+S+GGG KV L+
Sbjct: 34  IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLM 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  ++     DD++IL T+ +DVI  GG  ++L++F   +  IVF A+ L WPD  L DK
Sbjct: 94  KEAMEHYAGQDDLVILFTECFDVIFAGGPEELLKKFQKTNHKIVFAADALLWPDKRLADK 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G RYLNSGGFIGYA  I  L+   ++++ +DDQL+Y  +++D   R    I LD
Sbjct: 154 YPGVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREALNITLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQ L G+ +++ L F+  +   + NT Y T PV I+GNG SKI LN FGNY+  S
Sbjct: 214 HRCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPSKILLNYFGNYVPNS 272

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           W + +GC  C+     D    D +P V + VFI++PT F   FL+ +  L+YP + + +F
Sbjct: 273 WTQENGCALCDFDAS-DLSTVDVYPKVTLGVFIEQPTPFQPRFLDLLLTLDYPKEALRLF 331

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
           V+N + YH      ++   K    ++K +     ++  EARN+ ++     +  D+YF V
Sbjct: 332 VHNKEVYHEKDIKAFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------- 494
           +   GIWNVPY+ N YL++   +++  + +  +  + +D DM+ C N R+          
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMSERNYFVRDKLDPDMSLCRNARDMTLQREKDSP 509

Query: 495 -----------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
                      KG+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +
Sbjct: 510 TPETFQMLSPPKGVFMYISNRHEFGRLISTANYNTSHLNNDLWQIFENPVDWKEKYINRD 569

Query: 544 YQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVP 603
           Y K +  + +  QPCPDVFWFPI +E+ C E V+ ME YG+WS G ++D R+  GYE VP
Sbjct: 570 YSK-IFTENIVEQPCPDVFWFPIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVP 628

Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
           T DIHMKQ+ L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPH
Sbjct: 629 TDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPH 687

Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
           HD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  G
Sbjct: 688 HDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNG 747

Query: 724 TRYIMISFVDP 734
           TRYI +SF+DP
Sbjct: 748 TRYIAVSFIDP 758


>gi|395840986|ref|XP_003793331.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
           5-dioxygenase 1 [Otolemur garnettii]
          Length = 721

 Score =  630 bits (1625), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 310/707 (43%), Positives = 454/707 (64%), Gaps = 13/707 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W      S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWKVEKGISAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEVKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA  + +L++     + + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPSLSKLVAEWEGHDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  L  D  P VL+ VFI++PT FL  F  ++ +L+YP  ++ +F++
Sbjct: 264 ETGCTVCDEGLRSLKGLGDDALPLVLVGVFIEQPTPFLSLFFRRLLHLHYPRNRMRLFIH 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSD 388
           N++++H    + ++    + +++VK +     V + +ARN+     +     + ++    
Sbjct: 324 NHEKHHKAQVEKFLAEHGSEYQSVKLVGPEVRVENADARNMGA-XVVGPAHPYXWWWAEG 382

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
                P    +L N    +IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +  
Sbjct: 383 PWCQEPYSAPFLRN----VIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 436

Query: 449 KGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            G+WNVPYI+N YL+K S ++A    T ++  + +D DMAFC N+R + + + + +   +
Sbjct: 437 VGVWNVPYISNIYLIKGSALRAELQSTDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTF 496

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
           GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI 
Sbjct: 497 GHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWFPIF 555

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           TE+ C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+ 
Sbjct: 556 TEEACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQISFEREWHKFLVEYIA 615

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGC
Sbjct: 616 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLVPHHDASTFTINIALNRVGVDYEGGGC 674

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RF+RYNC+V A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 675 RFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLATTKGTRYIAVSFVDP 721


>gi|344288962|ref|XP_003416215.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 1 [Loxodonta africana]
          Length = 737

 Score =  630 bits (1625), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 315/714 (44%), Positives = 461/714 (64%), Gaps = 10/714 (1%)

Query: 25  KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYK 83
           K  +I  DK LVITVA+ E+DG+ RF++SAE     VK LG  + W GGD ++S+GGG K
Sbjct: 30  KPSSIPTDKLLVITVATKESDGFHRFMKSAEYFNYTVKVLGQGEEWRGGDGINSIGGGQK 89

Query: 84  VNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
           V L+K  ++     +D+++L T+ +DVI  GG  ++L++F   +  +VF A+ + WPD  
Sbjct: 90  VRLMKEVMEHYANQEDLVVLFTECFDVIFAGGPEEVLKKFLKTNHKVVFAADGILWPDKR 149

Query: 144 LYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHK 203
           L DKYP V  G RYLNSGGFIGYA  I  ++    +++ +DDQL+Y  +++D   R    
Sbjct: 150 LADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWDLQDNDDDQLFYTKIYIDPLKREALN 209

Query: 204 IVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNY 263
           I LD    +FQ L G+++++ L F+  +     N  Y T PV I+GNG +KI LN FGNY
Sbjct: 210 ITLDHKCKIFQALNGAVDEVVLKFENGK-ARAKNVFYETLPVAINGNGPTKIVLNYFGNY 268

Query: 264 LAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
           +  SW + SGCT C+L   +D  + D +P+V I +FI++PT FL  FL+ +  L+YP   
Sbjct: 269 VPNSWTQDSGCTLCDL-NVIDLSQVDVYPNVTIGIFIEQPTPFLPRFLDTLLTLDYPKDA 327

Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDF 381
           + +FV+N + YH      +    K    ++K +     ++  EARN+ ++     +  ++
Sbjct: 328 LKLFVHNREVYHEKDIKAFFDKAKHEISSIKIVGPEEDLSQAEARNMGMDLCRQDEKCNY 387

Query: 382 YFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
           YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I
Sbjct: 388 YFSVDADVVLTNPRTLKLLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDI 447

Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLK 500
           + G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R  G+ + 
Sbjct: 448 VQGNR--VGVWNVPYMANVYLIKGDTLRSEMNERNYFVRDKLDPDMALCRNAREMGVFMY 505

Query: 501 IDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPD 560
           I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +  QPCPD
Sbjct: 506 ISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENLVEQPCPD 564

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
           VFWFPI +EK C E V+ ME YGQWS G ++D R+  GYE VPT DIHMKQ+ L  VW  
Sbjct: 565 VFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLH 624

Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGV 680
           F+R+++ P+  + F GY+ +   A ++FVV+Y PD Q SLRPHHD+ST+TINIALN VG 
Sbjct: 625 FIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRSLRPHHDASTFTINIALNNVGQ 683

Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 684 DFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737


>gi|198434968|ref|XP_002131164.1| PREDICTED: similar to Plod3 protein [Ciona intestinalis]
          Length = 729

 Score =  629 bits (1623), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 306/734 (41%), Positives = 466/734 (63%), Gaps = 12/734 (1%)

Query: 7   LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGL 66
           ++ L +S  +F + +     K+ +  + L++TVA++ETDG+ RF +S +   L V  +G+
Sbjct: 2   VSLLSVSVTLFCLFLSIEHAKSQETTELLIVTVATDETDGFVRFKESLDYFNLTVLVIGM 61

Query: 67  HQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNT 125
           H+ W+GGD+S  +GGG K+N+LK  L+      ++++  TDSYDV+  GG  +I+ +FN 
Sbjct: 62  HEEWVGGDLSRGMGGGQKINMLKRSLESYKDNTNLVLFFTDSYDVVFTGGKEEIMSKFNK 121

Query: 126 FDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDD 185
           F+A +VF AE   WPD SL D YP V  G R+L SGG IGYA    E I+ + I +  DD
Sbjct: 122 FNAKLVFSAESTIWPDASLKDLYPEVTVGKRFLCSGGIIGYAPTFWEAINMQDISDTFDD 181

Query: 186 QLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPV 245
           QLYY  ++L+ TLR K    LD  + L QN+  +  ++++     +   + NT Y T PV
Sbjct: 182 QLYYTKIYLNTTLRAKLNATLDHTSQLVQNINFAKSELEI-VQQGDLSRIQNTVYRTYPV 240

Query: 246 IIHGNGKSKIELNSFGNYLAKSWKTS-GCTRC--NLIKHLDSLKPDQFPSVLISVFIDKP 302
           +IHGNG SK+ELN   NY+   W ++ GC +C  NL++  ++   +  P+V +++FI+  
Sbjct: 241 VIHGNGPSKLELNYMANYIPDGWHSNFGCRKCEWNLLQLPEA--EENLPTVQLAIFIEPN 298

Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
           T F+ EFL++I  L+YP  KI++F++ N+E        ++   +  ++ V+ I+ +  V+
Sbjct: 299 TPFIPEFLSRIQQLDYPKSKITLFIHTNEENTERYVSQFLLRHRVKYQGVQVISPHDGVH 358

Query: 363 SKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNF 422
              ARN+A+++ + K  D+   +D +  + N  ++K+L+ +N+ ++ PL+    K WSNF
Sbjct: 359 EATARNMALDHCILKNCDYQLSIDGNVQITNSSLIKFLMTKNKQVVGPLVKLHEKLWSNF 418

Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK--ATNIKTIYTLN 480
           WGALNADG+YARS DY++I+N ++   GIWN+P+I++ YLMK+  I+   + +   Y   
Sbjct: 419 WGALNADGYYARSADYISIVNRER--TGIWNIPFISSVYLMKSETIRFLLSRVPQPYFYE 476

Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
            MD DMAFC ++R +GI L + +  E+G L+   N +P   +P+++++  N  DW+ +YI
Sbjct: 477 DMDADMAFCAHVRQEGIFLHVTNEAEFGRLLSKANVNPGPVHPDLWQIETNKKDWEEKYI 536

Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYE 600
           HP++    L +T  +QPCPDV+ FP+ TE+     V +ME +G+WS G N D RL  GYE
Sbjct: 537 HPDFWNLTLENTEVSQPCPDVYMFPLFTEEMADAIVDVMENHGEWSGGKNKDDRLAGGYE 596

Query: 601 AVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSL 660
            VPT DIHM QV     W   L  Y   + ++ + GY+ +   + M FVVRYRP EQ  L
Sbjct: 597 NVPTVDIHMNQVNYEKQWLHMLATYPTHIIQKVYPGYYTK-ASSIMMFVVRYRPSEQSFL 655

Query: 661 RPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQV 720
           RPHHDSST+T+N+ALN  G DYEGGGCRF+RY+C+VT    G+ L+HPGRLTHYHEGLQ 
Sbjct: 656 RPHHDSSTWTMNVALNTYGEDYEGGGCRFLRYDCSVTQIPKGYALVHPGRLTHYHEGLQT 715

Query: 721 TQGTRYIMISFVDP 734
            +GTRYI +SFVDP
Sbjct: 716 MEGTRYIAVSFVDP 729


>gi|218931163|ref|NP_001136387.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 2
           precursor [Rattus norvegicus]
 gi|149018892|gb|EDL77533.1| rCG25923, isoform CRA_b [Rattus norvegicus]
          Length = 737

 Score =  629 bits (1622), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 314/710 (44%), Positives = 457/710 (64%), Gaps = 10/710 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
           I  DK LVITVA+ E DG+ RF+ SA+     VK LG  Q W GGD M+S+GGG KV L+
Sbjct: 34  IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLM 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  ++     DD++IL T+ +DVI  GG  ++L++F   +  IVF A+ L WPD  L DK
Sbjct: 94  KEAMEHYAGQDDLVILFTECFDVIFAGGPEELLKKFQKTNHKIVFAADALLWPDKRLADK 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G RYLNSGGFIGYA  I  L+    +++ +DDQL+Y  +++D   R    I LD
Sbjct: 154 YPGVHIGKRYLNSGGFIGYAPYISRLVQQWDLQDNDDDQLFYTKVYIDPLKREALNITLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQ L G+ +++ L F+  +   + NT Y T PV I+GNG +KI LN FGNY+  S
Sbjct: 214 HRCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           W + +GC  C+    +D    D +P V + VFI++PT FL  FL+ +  L+YP + + +F
Sbjct: 273 WTQENGCALCDF-DTIDLSTVDVYPKVTLGVFIEQPTPFLPRFLDLLLTLDYPKEALRLF 331

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
           V+N + YH      ++   K    ++K +     ++  EARN+ ++     +  D+YF V
Sbjct: 332 VHNKEVYHEKDIKAFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
           +   GIWNVPY+ N YL++   +++  + +  +  + +D DM+ C N R+ G+ + I + 
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMSERNYFVRDKLDPDMSLCRNARDMGVFMYISNR 509

Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
            E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWF
Sbjct: 510 HEFGRLISTANYNTSHLNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWF 568

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           PI +E+ C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQ+ L  VW  F+R+
Sbjct: 569 PIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIRE 628

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           ++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++G
Sbjct: 629 FIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQG 687

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 688 GGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 737


>gi|281337516|gb|EFB13100.1| hypothetical protein PANDA_014326 [Ailuropoda melanoleuca]
          Length = 726

 Score =  629 bits (1622), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 311/728 (42%), Positives = 460/728 (63%), Gaps = 31/728 (4%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNE 90
           DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S+GGG KV L+K  
Sbjct: 5   DKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEV 64

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           ++     +D++IL T+ ++VI  GG  ++L++F   +  +VF A+ + WPD  L DKYP 
Sbjct: 65  MEHYANQEDLVILFTECFNVIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKRLADKYPI 124

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G RYLNSGGFIGYA +I +++   ++++ +DDQL+Y  +++D   R    I LD   
Sbjct: 125 VHIGKRYLNSGGFIGYAPNINQIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKC 184

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
            +FQ L G+++++ L F+  +     N  Y T PV ++GNG +KI LN FGNY+  +W +
Sbjct: 185 KIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVAVNGNGPTKILLNYFGNYVPNAWTQ 243

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
            +GCT C+L   +D    D  P+V I VFI++PT FL  FL+ +  L+YP + + +F++N
Sbjct: 244 DNGCTLCDL-DTIDLSTVDVHPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHN 302

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
            + YH      +    K     +K +     ++  EARN+ ++     +  D+YF +D+D
Sbjct: 303 KEVYHEKDIKVFFDKAKREISTIKIVGPEENLSQAEARNMGMDFCRQDENCDYYFSMDAD 362

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++  
Sbjct: 363 VVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR-- 420

Query: 449 KGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------------- 494
            GIWNVPY+ N YL+K   +++  N +  +  + +D DMA C N R              
Sbjct: 421 VGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPE 480

Query: 495 --------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
                   KG+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K
Sbjct: 481 TFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK 540

Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRD 606
            +  + +  QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+  GYE VPT D
Sbjct: 541 -IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDD 599

Query: 607 IHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
           IHMKQ+ L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+
Sbjct: 600 IHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDA 658

Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
           ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRY
Sbjct: 659 STFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRY 718

Query: 727 IMISFVDP 734
           I +SF+DP
Sbjct: 719 IAVSFIDP 726


>gi|403278817|ref|XP_003930981.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 3 [Saimiri boliviensis boliviensis]
          Length = 791

 Score =  629 bits (1622), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 315/739 (42%), Positives = 460/739 (62%), Gaps = 31/739 (4%)

Query: 21  VHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLG 79
           +H   +     DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S+G
Sbjct: 59  IHPPALPRGRPDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIG 118

Query: 80  GGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCW 139
           GG KV L+K  +      DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ + W
Sbjct: 119 GGQKVRLMKEIMGHYADQDDLVVMFTECFDVIFAGGPEELLKKFQKANHKVVFAADGILW 178

Query: 140 PDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLR 199
           PD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D   R
Sbjct: 179 PDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKR 238

Query: 200 TKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNS 259
               I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI LN 
Sbjct: 239 EAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILLNY 297

Query: 260 FGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNY 318
           FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L+Y
Sbjct: 298 FGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTLDY 356

Query: 319 PAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-K 377
           P + + +F++N + YH      +    K   K +K +     ++  EARN+ ++     +
Sbjct: 357 PKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQDE 416

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
             D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS D
Sbjct: 417 NCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSED 476

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN-- 494
           Y++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R   
Sbjct: 477 YVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMT 534

Query: 495 -------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDW 535
                              KG+ + I +  E+G L+ + N++    N +++++  NP+DW
Sbjct: 535 LQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDW 594

Query: 536 DLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL 595
             +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+
Sbjct: 595 KEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRI 653

Query: 596 ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPD 655
             GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+
Sbjct: 654 SGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPE 712

Query: 656 EQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYH 715
            Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH H
Sbjct: 713 RQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLH 772

Query: 716 EGLQVTQGTRYIMISFVDP 734
           EGL V  GTRYI +SF+DP
Sbjct: 773 EGLPVKNGTRYIAVSFIDP 791


>gi|351707541|gb|EHB10460.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Heterocephalus
           glaber]
          Length = 758

 Score =  628 bits (1620), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 315/728 (43%), Positives = 456/728 (62%), Gaps = 31/728 (4%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNE 90
           DK LVITVA+ E+DGY RF+QSA+     VK LG  + W GGD ++S+GGG KV L+K  
Sbjct: 37  DKLLVITVATKESDGYYRFMQSAKYFNYTVKVLGQGEEWRGGDGLNSIGGGQKVRLMKEV 96

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           ++     +DM+IL T+ +DVI  GG  ++L++F   +  +VF A+ + WPD  L DKYP 
Sbjct: 97  MEHYANQEDMVILFTECFDVIFAGGPEEVLKKFQKTNHKVVFAADGILWPDKRLADKYPI 156

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D   R    I LD   
Sbjct: 157 VHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLQRQALNITLDHKC 216

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
            +FQ L G+ +++ L F+  +     NT Y T PV I+GNG +KI LN FGNY+  SW +
Sbjct: 217 KIFQALNGATDEVVLKFENGK-TRAKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQ 275

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
            +GC  C     +D    D +P+V I VFI++PT FL  FL+ +  L+YP + + +F++N
Sbjct: 276 DNGCALCEF-DTIDLSAVDVYPNVTIGVFIEQPTPFLPRFLDILLALDYPKEALKLFIHN 334

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
            + YH      +    K     +K +     ++  EARN+ ++     +  D+YF +D+D
Sbjct: 335 KEVYHEKDIKVFFDKAKHEISTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSLDAD 394

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L N   LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++  
Sbjct: 395 VVLTNSRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR-- 452

Query: 449 KGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------------- 494
            GIWNVPYI N YL+K  ++++  N +  +  + +D DMA C N R              
Sbjct: 453 VGIWNVPYIANVYLIKGKMLRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPE 512

Query: 495 --------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
                   KG+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K
Sbjct: 513 TFQMLIPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK 572

Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRD 606
            +  + +  QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+  GYE VPT D
Sbjct: 573 -IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDD 631

Query: 607 IHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
           IHMKQ+ L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+
Sbjct: 632 IHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDA 690

Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
           ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRY
Sbjct: 691 STFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRY 750

Query: 727 IMISFVDP 734
           I +SF+DP
Sbjct: 751 IAVSFIDP 758


>gi|417412584|gb|JAA52670.1| Putative procollagen-lysine2-oxoglutarate 5-dioxygenase 2, partial
           [Desmodus rotundus]
          Length = 757

 Score =  626 bits (1614), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 313/735 (42%), Positives = 456/735 (62%), Gaps = 31/735 (4%)

Query: 25  KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYK 83
           K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S+GGG K
Sbjct: 29  KPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQK 88

Query: 84  VNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
           V L+K  +      +D+++L T+ +DVI  GG  ++L +F   +  +VF A+ + WPD  
Sbjct: 89  VRLMKEVMGHYADQEDLVVLFTECFDVIFAGGPEEVLRKFQKSNHKVVFAADGILWPDKR 148

Query: 144 LYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHK 203
           L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D   R    
Sbjct: 149 LADKYPVVHIGKRYLNSGGFIGYAPYIHHIVQQWNLQDNDDDQLFYTKIYIDPLKREALN 208

Query: 204 IVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNY 263
           I LD    +FQ L G+++++ L F+  +     N  Y T PV I+GNG +KI LN FGNY
Sbjct: 209 ITLDHKCKIFQALNGAVDEVVLKFENGK-ARAKNVFYETLPVAINGNGPTKILLNYFGNY 267

Query: 264 LAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
           +  +W +  GCT C L   +D    + +P+V I VFI++PT FL  FL+ +  L+YP + 
Sbjct: 268 VPNAWTQDKGCTFCEL-DTVDLSAVNVYPNVTIGVFIEQPTPFLPRFLDTLLTLDYPKEA 326

Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK-GVDF 381
           + +F++N + YH      +    K     +K +     ++  EARN+ ++         +
Sbjct: 327 LKIFIHNKEVYHEKDIKVFFDKAKHEISTIKIVGPEENLSQAEARNMGMDFCRQDDSCGY 386

Query: 382 YFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
           YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I
Sbjct: 387 YFSVDADVVLTNPKTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDI 446

Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------ 494
           + G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R       
Sbjct: 447 VQGNR--VGLWNVPYMANVYLIKGKTLRSEMNERNYFVRDRLDPDMALCRNAREMTVQRE 504

Query: 495 ---------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRY 539
                          KG+ + I +  E+G L+ + N++    N +++++  NP+DW  +Y
Sbjct: 505 KDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKY 564

Query: 540 IHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGY 599
           I+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+  GY
Sbjct: 565 INRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGY 623

Query: 600 EAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPS 659
           E VPT DIHMKQ+ L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y PD Q S
Sbjct: 624 ENVPTDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRS 682

Query: 660 LRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQ 719
           LRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL 
Sbjct: 683 LRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLP 742

Query: 720 VTQGTRYIMISFVDP 734
           V  GTRYI +SF+DP
Sbjct: 743 VKNGTRYIAVSFIDP 757


>gi|432090515|gb|ELK23936.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Myotis davidii]
          Length = 730

 Score =  624 bits (1610), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 312/732 (42%), Positives = 455/732 (62%), Gaps = 31/732 (4%)

Query: 28  NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNL 86
           +I  DK LVIT+A+ E DG+ RF+QSA+     VK LG  + W GGD ++S+GGG KV L
Sbjct: 5   SIFPDKLLVITIATKENDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRL 64

Query: 87  LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
           +K  ++     +DM++L T+ +DVI  G   ++L++F   +  +VF A+ + WPD  L D
Sbjct: 65  MKEVMEHYASHEDMVVLFTECFDVIFAGSPEEVLKKFQKSNHKVVFAADGILWPDKRLAD 124

Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
           KYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D   R    I L
Sbjct: 125 KYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREALNITL 184

Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
           D    +FQ L G+++++ L F+  +     N  Y T PV I+GNG +KI LN FGNY+  
Sbjct: 185 DHKCKIFQTLNGAVDEVVLKFENGK-ARAKNVFYETLPVAINGNGPTKILLNYFGNYVPN 243

Query: 267 SW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           +W + SGCT C L   +D    + +P+V I VFI++PT FL  FL+ +  L+YP + + +
Sbjct: 244 AWTQDSGCTLCEL-DTIDLSAVNVYPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEALKV 302

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK-GVDFYFY 384
           F++N + YH      +    K     +K +     ++  EARN+ ++        D+YF 
Sbjct: 303 FIHNKEVYHEKHIKVFFDKAKHEINTIKIVGPEENLSQAEARNMGMDFCRQDDSCDYYFS 362

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG++AR  DY++I+ G
Sbjct: 363 VDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYFARYEDYVDIVQG 422

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN--------- 494
           ++   GIWNVPY+ N YL+K   +++  N +  +  + +D DMA C N R          
Sbjct: 423 NR--VGIWNVPYMANVYLIKGKTLRSEMNERNYFVRDRLDPDMALCRNAREMTLQREKDS 480

Query: 495 ------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHP 542
                       KG+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI  
Sbjct: 481 PTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYISH 540

Query: 543 EYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAV 602
           +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+  GYE V
Sbjct: 541 DYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENV 599

Query: 603 PTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRP 662
           PT DIHMKQ+ L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y PD Q SLRP
Sbjct: 600 PTDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRSLRP 658

Query: 663 HHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
           HHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  
Sbjct: 659 HHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKN 718

Query: 723 GTRYIMISFVDP 734
           GTRYI +SF+DP
Sbjct: 719 GTRYIAVSFIDP 730


>gi|341900474|gb|EGT56409.1| CBN-LET-268 protein [Caenorhabditis brenneri]
          Length = 746

 Score =  621 bits (1601), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 313/756 (41%), Positives = 473/756 (62%), Gaps = 38/756 (5%)

Query: 7   LNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGL 66
           +  L L  +VF   +    + ++ E   +V+TVA+  TDG KR ++SA+   + V+ L L
Sbjct: 1   MRVLPLFPLVFIPVILATTITDLPE--LVVVTVATENTDGLKRLLESAKAFGINVEVLAL 58

Query: 67  HQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF-- 123
            + W GGD     GGG K+ +L   +++     D +I+  D+YDVI +     IL +F  
Sbjct: 59  GERWNGGDTRIEQGGGQKIRILSEWIEKYKDASDTMIMFVDAYDVIFNADSTTILRKFFE 118

Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
           +  D  ++FGAE  CWPD +L   YP V  G R+LNSG F+GY  ++ +++  + +++++
Sbjct: 119 HYSDKRLLFGAEPFCWPDQTLAPDYPIVEFGKRFLNSGLFMGYGPEVYKVLKLKPVEDKD 178

Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
           DDQLYY  ++LD  LR + K+ LD+++ +FQNL G +ED++L F  D      N  YNT 
Sbjct: 179 DDQLYYTRVYLDNKLRKELKMDLDSMSKIFQNLNGVIEDVELQFKEDGTPEAYNAAYNTK 238

Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKP 302
           P+I+HGNG SK  LN  GNYL   W +  GC  C   +  DS   D+ P + +++FI KP
Sbjct: 239 PLIVHGNGPSKSHLNYLGNYLGNRWNSQLGCRTCGQ-EMKDS---DELPLIGLNIFIAKP 294

Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
             F+EE L K+A  +YP  KI++++YNNQ +      D++      +   + I   + + 
Sbjct: 295 IPFIEEVLQKVAEFDYPKDKIALYIYNNQPFSIKNIQDFLKEHGKSYYTKRVINGVTEIG 354

Query: 363 SKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNES----LIAPLLVRPFKA 418
            +EARN A+E    + V+F F +D D++   P V+K LV+ +++    +IAP++ +  K 
Sbjct: 355 EREARNEAIEWDKQRNVEFAFLMDGDAYFTEPKVIKDLVHYSKTYDVGIIAPMVGQIGKL 414

Query: 419 WSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYT 478
           ++NFWGA+ A+G+YARS DYM I+ G++   G WNVP+IT+  L+    + A  +K  Y+
Sbjct: 415 FTNFWGAVAANGYYARSEDYMAIVKGNR--IGYWNVPFITSALLLNKEKLSA--LKDAYS 470

Query: 479 LN-SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKT----NPEVYELIRNPL 533
            N ++D DM+ C   R+ G  + ID+ ++YG+L+ S+ F    T    +PE++++  N  
Sbjct: 471 YNKNLDPDMSMCQFARDNGHFMYIDNEKQYGYLIVSDEFSETVTEGKWHPEMWQIFENRD 530

Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
            W+ RY+HP Y K + PD + +Q CPDV+ +P+++E+FC E ++ ME +G+WSDG+NNDK
Sbjct: 531 LWEARYVHPGYHKIMEPDHIIDQACPDVYDYPLMSERFCEELIEEMEGFGRWSDGSNNDK 590

Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHH-------------- 639
           RL  GYE VPTRDIHM QVG    W  FL  YV P+QE+ FIGY+H              
Sbjct: 591 RLAGGYENVPTRDIHMNQVGFERQWLYFLDTYVRPVQEKTFIGYYHQVEPISYFFIPTII 650

Query: 640 -EPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTA 698
            +PV + M FVVRY+P+EQ SLRPHHD+ST++I++ALN+ G DYEGGG R++RYNC V A
Sbjct: 651 FQPVESNMMFVVRYKPEEQASLRPHHDASTFSIDVALNKKGRDYEGGGVRYVRYNCTVEA 710

Query: 699 TRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
             +G+ +M PGRLTH HEGL  T+GTRYIM+SF++P
Sbjct: 711 DEVGYAMMFPGRLTHLHEGLATTKGTRYIMVSFINP 746


>gi|326925903|ref|XP_003209146.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like
           [Meleagris gallopavo]
          Length = 774

 Score =  620 bits (1599), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 310/735 (42%), Positives = 462/735 (62%), Gaps = 38/735 (5%)

Query: 32  DKFLVITVASNETDGYKRFIQSAE-------VNKLQVKTLGLHQPWLGGDMS-SLGGGYK 83
           D  LV TVA+ ETDG+ RF+Q+A+       V  + V + G  + W GG+++ S+GGG K
Sbjct: 46  DNLLVFTVATKETDGFHRFMQTAKHFNYTVKVPYVLVPSTGKGEEWKGGELANSIGGGQK 105

Query: 84  VNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
           V LLK  +      +D+I++  + YDVI  GG  ++L++F   +  +VF A+ L WPD  
Sbjct: 106 VRLLKEGIQGYADQEDLIVMFVECYDVIFAGGPEELLKKFQETNHKVVFAADGLIWPDKR 165

Query: 144 LYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHK 203
           L DKYPAV SG R+LNSGGFIGYA  I  ++    +++ +DDQL+Y  +++D   R    
Sbjct: 166 LADKYPAVRSGKRFLNSGGFIGYAPYINRIVQQWDLQDNDDDQLFYTKIYVDPLARESLN 225

Query: 204 IVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNY 263
           I LD    +FQ L G+++++ LNF+  + V   N+ Y T P+ + GNG +KI LN  GNY
Sbjct: 226 ITLDHKCAIFQTLNGAVDEVHLNFEEGK-VRARNSAYETLPITVLGNGPTKIYLNYLGNY 284

Query: 264 LAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
           +  +W + +GC+ C+L   LD     ++PSV I VFI++PT FL +FL+++  L+YP + 
Sbjct: 285 IPNAWTRETGCSICDL-DMLDLSTVTEYPSVKIGVFIEQPTPFLPKFLDRLLTLDYPKEA 343

Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDF 381
           +S+F++NN+ YH      +    K + +N+K +     ++  EARN+ ++     +  ++
Sbjct: 344 LSVFIHNNEVYHEKHIKKFWEKAKNIIRNIKIVGPEENLSQAEARNMGMDLCRQDEACEY 403

Query: 382 YFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
           YF +D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I
Sbjct: 404 YFSIDADVVLTNPKTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYIDI 463

Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------ 494
           + G++   G+WN+PY+ N YL+K   +++    K  +  + +D DMA C N R       
Sbjct: 464 VQGNR--VGVWNIPYMANIYLIKGQTLRSEMKEKNYFMRDKLDPDMALCRNAREMTLQRE 521

Query: 495 ---------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRY 539
                          KG+ + I +  E+G L+ + N++    N +++++  NP+DW   Y
Sbjct: 522 KDSPSSETFHMLRPPKGVFMYITNRHEFGRLISTANYNTSHYNNDLWQIFENPVDWKETY 581

Query: 540 IHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGY 599
           I+P Y K +  D +  QPCPDVFWFPI ++  C E V+ ME +GQWS G + D R+  GY
Sbjct: 582 INPNYSK-IFTDNIVEQPCPDVFWFPIFSDTACDELVEEMEHFGQWSGGKHQDSRISGGY 640

Query: 600 EAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPS 659
           E VPT DIHMKQ+GL   W  F+R+++ P+  + F GY+ +   A ++FVV+Y PD Q S
Sbjct: 641 ENVPTDDIHMKQIGLDNEWLHFIREFIAPVTLKVFAGYYTKGY-ALLNFVVKYSPDRQRS 699

Query: 660 LRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQ 719
           LRPHHDSST+TINIALN+VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL 
Sbjct: 700 LRPHHDSSTFTINIALNKVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLP 759

Query: 720 VTQGTRYIMISFVDP 734
           +  GTRYI +SF+DP
Sbjct: 760 ILNGTRYIAVSFIDP 774


>gi|344288964|ref|XP_003416216.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 2 [Loxodonta africana]
          Length = 758

 Score =  620 bits (1598), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 316/735 (42%), Positives = 462/735 (62%), Gaps = 31/735 (4%)

Query: 25  KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYK 83
           K  +I  DK LVITVA+ E+DG+ RF++SAE     VK LG  + W GGD ++S+GGG K
Sbjct: 30  KPSSIPTDKLLVITVATKESDGFHRFMKSAEYFNYTVKVLGQGEEWRGGDGINSIGGGQK 89

Query: 84  VNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
           V L+K  ++     +D+++L T+ +DVI  GG  ++L++F   +  +VF A+ + WPD  
Sbjct: 90  VRLMKEVMEHYANQEDLVVLFTECFDVIFAGGPEEVLKKFLKTNHKVVFAADGILWPDKR 149

Query: 144 LYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHK 203
           L DKYP V  G RYLNSGGFIGYA  I  ++    +++ +DDQL+Y  +++D   R    
Sbjct: 150 LADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWDLQDNDDDQLFYTKIYIDPLKREALN 209

Query: 204 IVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNY 263
           I LD    +FQ L G+++++ L F+  +     N  Y T PV I+GNG +KI LN FGNY
Sbjct: 210 ITLDHKCKIFQALNGAVDEVVLKFENGK-ARAKNVFYETLPVAINGNGPTKIVLNYFGNY 268

Query: 264 LAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
           +  SW + SGCT C+L   +D  + D +P+V I +FI++PT FL  FL+ +  L+YP   
Sbjct: 269 VPNSWTQDSGCTLCDL-NVIDLSQVDVYPNVTIGIFIEQPTPFLPRFLDTLLTLDYPKDA 327

Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDF 381
           + +FV+N + YH      +    K    ++K +     ++  EARN+ ++     +  ++
Sbjct: 328 LKLFVHNREVYHEKDIKAFFDKAKHEISSIKIVGPEEDLSQAEARNMGMDLCRQDEKCNY 387

Query: 382 YFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNI 441
           YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I
Sbjct: 388 YFSVDADVVLTNPRTLKLLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDI 447

Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN------ 494
           + G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R       
Sbjct: 448 VQGNR--VGVWNVPYMANVYLIKGDTLRSEMNERNYFVRDKLDPDMALCRNAREMTLQRE 505

Query: 495 ---------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRY 539
                          KG+ + I +  E+G L+ + N++    N +++++  NP+DW  +Y
Sbjct: 506 KDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKY 565

Query: 540 IHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGY 599
           I+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+  GY
Sbjct: 566 INRDYSK-IFTENLVEQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGY 624

Query: 600 EAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPS 659
           E VPT DIHMKQ+ L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y PD Q S
Sbjct: 625 ENVPTDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRS 683

Query: 660 LRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQ 719
           LRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL 
Sbjct: 684 LRPHHDASTFTINIALNNVGQDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLP 743

Query: 720 VTQGTRYIMISFVDP 734
           V  GTRYI +SF+DP
Sbjct: 744 VKNGTRYIAVSFIDP 758


>gi|218931161|ref|NP_787065.2| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 isoform 1
           precursor [Rattus norvegicus]
 gi|149018891|gb|EDL77532.1| rCG25923, isoform CRA_a [Rattus norvegicus]
          Length = 758

 Score =  619 bits (1597), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 315/731 (43%), Positives = 458/731 (62%), Gaps = 31/731 (4%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLL 87
           I  DK LVITVA+ E DG+ RF+ SA+     VK LG  Q W GGD M+S+GGG KV L+
Sbjct: 34  IPADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLM 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K  ++     DD++IL T+ +DVI  GG  ++L++F   +  IVF A+ L WPD  L DK
Sbjct: 94  KEAMEHYAGQDDLVILFTECFDVIFAGGPEELLKKFQKTNHKIVFAADALLWPDKRLADK 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP V  G RYLNSGGFIGYA  I  L+    +++ +DDQL+Y  +++D   R    I LD
Sbjct: 154 YPGVHIGKRYLNSGGFIGYAPYISRLVQQWDLQDNDDDQLFYTKVYIDPLKREALNITLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
               +FQ L G+ +++ L F+  +   + NT Y T PV I+GNG +KI LN FGNY+  S
Sbjct: 214 HRCKIFQALNGATDEVVLKFENGK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNS 272

Query: 268 W-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           W + +GC  C+    +D    D +P V + VFI++PT FL  FL+ +  L+YP + + +F
Sbjct: 273 WTQENGCALCDF-DTIDLSTVDVYPKVTLGVFIEQPTPFLPRFLDLLLTLDYPKEALRLF 331

Query: 327 VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYV 385
           V+N + YH      ++   K    ++K +     ++  EARN+ ++     +  D+YF V
Sbjct: 332 VHNKEVYHEKDIKAFVDKAKHDISSIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSV 391

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G+
Sbjct: 392 DADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGN 451

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------- 494
           +   GIWNVPY+ N YL++   +++  + +  +  + +D DM+ C N R+          
Sbjct: 452 R--VGIWNVPYMANVYLIQGKTLRSEMSERNYFVRDKLDPDMSLCRNARDMTLQREKDSP 509

Query: 495 -----------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPE 543
                      KG+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +
Sbjct: 510 TPETFQMLSPPKGVFMYISNRHEFGRLISTANYNTSHLNNDLWQIFENPVDWKEKYINRD 569

Query: 544 YQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVP 603
           Y K +  + +  QPCPDVFWFPI +E+ C E V+ ME YG+WS G ++D R+  GYE VP
Sbjct: 570 YSK-IFTENIVEQPCPDVFWFPIFSERACDELVEEMEHYGKWSGGKHHDSRISGGYENVP 628

Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
           T DIHMKQ+ L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPH
Sbjct: 629 TDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPH 687

Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
           HD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  G
Sbjct: 688 HDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNG 747

Query: 724 TRYIMISFVDP 734
           TRYI +SF+DP
Sbjct: 748 TRYIAVSFIDP 758


>gi|6755106|ref|NP_035252.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor [Mus
           musculus]
 gi|25008938|sp|Q9R0E2.1|PLOD1_MOUSE RecName: Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1;
           AltName: Full=Lysyl hydroxylase 1; Short=LH1; Flags:
           Precursor
 gi|5880315|gb|AAD54617.1|AF046782_1 lysyl hydroxylase 1 [Mus musculus]
 gi|13879264|gb|AAH06599.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 1 [Mus musculus]
 gi|148682841|gb|EDL14788.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 1 [Mus musculus]
          Length = 728

 Score =  618 bits (1594), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 307/710 (43%), Positives = 465/710 (65%), Gaps = 12/710 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW-LGGDMSSLGGGYKVNLLKN 89
           ED  LV+TVA+ ET+G++RF +SA+    ++++LGL + W + G  ++ GGG KV LLK 
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWSVDGGPAAAGGGQKVRLLKK 84

Query: 90  ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
            L++    +D++IL  DSYDV+   G  ++L++F    + +VF AE   +PD  L  KYP
Sbjct: 85  ALEKHADKEDLVILFVDSYDVVFASGPRELLKKFQQAKSQVVFSAEEHIYPDRRLEAKYP 144

Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
            V  G R+L SGGFIGYA  + +L++    ++ + DQL+Y  +FL+   R +  I LD  
Sbjct: 145 TVPDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLFYTKIFLNPEKREQINISLDHR 204

Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
             +FQNL G+L+++ L F++   V   N  Y+T PV++HGNG +K++LN  GNY+ + W 
Sbjct: 205 CRIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVVHGNGPTKLQLNYLGNYIPRFWT 263

Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
             +GCT C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L YP K++ +F+
Sbjct: 264 FETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFLRLLRLRYPQKQMRLFI 323

Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
           +N + +H    + ++    + +++VK +     + + +ARN+  +     +   +YF VD
Sbjct: 324 HNQERHHKLQVEQFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFSVD 383

Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           +D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWG L+ADG+YARS DY++I+ G +
Sbjct: 384 ADVALTEPNSLRLLIEQNKNVIAPLMTRHGRLWSNFWGGLSADGYYARSEDYVDIVQGRR 443

Query: 447 GGKGIWNVPYITNCYLMKTSVIKA--TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
              G+WNVPYI+N YL+K S ++A   N+  ++  + +D DM+FC N+R + + + + + 
Sbjct: 444 --VGVWNVPYISNIYLIKGSALRAELQNVD-LFHYSKLDSDMSFCANVRQQEVFMFLTNR 500

Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
             +GHL+  +N+     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WF
Sbjct: 501 HTFGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWF 559

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           PI TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +
Sbjct: 560 PIFTEAACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREWHKFLVE 619

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           Y+ P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VG DYEG
Sbjct: 620 YIAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRVGEDYEG 678

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGCRF+RYNC+V A R GW L+HPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 679 GGCRFLRYNCSVRAPRKGWALLHPGRLTHYHEGLPTTKGTRYIAVSFVDP 728


>gi|74207958|dbj|BAE29100.1| unnamed protein product [Mus musculus]
          Length = 728

 Score =  617 bits (1592), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 307/710 (43%), Positives = 464/710 (65%), Gaps = 12/710 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW-LGGDMSSLGGGYKVNLLKN 89
           ED  LV+TVA+ ET+G++RF +SA+    ++++LGL + W + G  ++ GGG KV LLK 
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWSVDGGPAAAGGGQKVRLLKK 84

Query: 90  ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
            L++    +D++IL  DSYDV+   G  ++L++F    + +VF AE   +PD  L  KYP
Sbjct: 85  ALEKHADKEDLVILFVDSYDVVFASGPRELLKKFQQAKSQVVFSAEEHIYPDRRLESKYP 144

Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
            V  G R+L SGGFIGYA  + +L++    ++ + DQL+Y  +FL+   R +  I LD  
Sbjct: 145 TVPDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLFYTKIFLNPEKREQINISLDHR 204

Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
             +FQNL G+L+++ L F++   V   N  Y+T PV++HGNG +K++LN  GNY+ + W 
Sbjct: 205 CRIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVVHGNGPTKLQLNYLGNYIPRFWT 263

Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
             +GCT C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L YP K++ +F+
Sbjct: 264 FETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFLRLLRLRYPQKQMRLFI 323

Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
           +N + +H    + ++    + +++VK +     + + +ARN+  +     +   +YF VD
Sbjct: 324 HNQERHHKLQVEQFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFSVD 383

Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           +D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWG L+ADG+YARS DY++I+ G +
Sbjct: 384 ADVALTEPNSLRLLIEQNKNVIAPLMTRHGRLWSNFWGGLSADGYYARSEDYVDIVQGRR 443

Query: 447 GGKGIWNVPYITNCYLMKTSVIKAT--NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
              G+WNVPYI+N YL+K S ++A   N+  ++  + +D DM+FC N+R + + + + + 
Sbjct: 444 --VGVWNVPYISNIYLIKGSALRAVLQNVD-LFHYSKLDSDMSFCANVRQQEVFMFLTNR 500

Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
             +GHL+  +N+     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WF
Sbjct: 501 HTFGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PCPDVYWF 559

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           PI TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +
Sbjct: 560 PIFTEAACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREWHKFLVE 619

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           Y+ P+ E+ + GY+    +  ++FVVRY PDEQPSL PHHD+ST+T+NIALN+VG DYEG
Sbjct: 620 YIAPMTEKLYPGYYTR-AQFDLAFVVRYNPDEQPSLMPHHDASTFTVNIALNRVGEDYEG 678

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGCRF+RYNC+V A R GW L+HPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 679 GGCRFLRYNCSVRAPRKGWALLHPGRLTHYHEGLPTTKGTRYIAVSFVDP 728


>gi|297286704|ref|XP_002808385.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
           5-dioxygenase 2-like [Macaca mulatta]
          Length = 946

 Score =  615 bits (1586), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 315/751 (41%), Positives = 456/751 (60%), Gaps = 49/751 (6%)

Query: 10  LILSCVVF-----FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTL 64
           L+L  +VF     ++     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK L
Sbjct: 219 LLLLALVFHPWNPYLGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVL 278

Query: 65  GLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
           G  + W GGD ++S+GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F
Sbjct: 279 GQGEEWRGGDGINSIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKF 338

Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
              +  +VF A+ + WPD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +
Sbjct: 339 QKANHKVVFAADGILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDND 398

Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
           DDQL+Y  +++D   R    I LD    +FQ L G+++++ L F+  +     NT Y T 
Sbjct: 399 DDQLFYTKIYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETL 457

Query: 244 PVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKP 302
           PV I+GNG +KI LN FGNY+  SW + +GCT C     +D    D  P+V I VFI++P
Sbjct: 458 PVAINGNGPTKILLNYFGNYVPNSWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQP 516

Query: 303 TAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVN 362
           T FL  FL+ +  L+YP + + +F++N + YH      +    K   K +K +     ++
Sbjct: 517 TPFLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLS 576

Query: 363 SKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
             EARN+ ++     +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSN
Sbjct: 577 QAEARNMGMDFCRQDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSN 636

Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLN 480
           FWGAL+ DG+YARS DY++I+ G++   G+WNVPY+ N YL+K   ++   N +  +  +
Sbjct: 637 FWGALSPDGYYARSEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRD 694

Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNP-------- 532
            +D DMA C N R   +  + DS                   PE ++++  P        
Sbjct: 695 KLDPDMALCRNAREMTLQREKDSP-----------------TPETFQMLSPPKVLFLLIL 737

Query: 533 ---------LDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYG 583
                    +DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG
Sbjct: 738 FIFVYLIFDIDWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYG 796

Query: 584 QWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVR 643
           +WS G ++D R+  GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   
Sbjct: 797 KWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF- 855

Query: 644 APMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
           A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW
Sbjct: 856 ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGW 915

Query: 704 MLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
             MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 916 SFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 946


>gi|402594065|gb|EJW87992.1| procollagen-lysine [Wuchereria bancrofti]
          Length = 733

 Score =  610 bits (1572), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 316/746 (42%), Positives = 475/746 (63%), Gaps = 26/746 (3%)

Query: 2   LSNLHLNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQV 61
           ++ + L  L LS V+ + +V   K   + E   LV+TVA+ ETDG +R  ++A++N +++
Sbjct: 1   MTGMILWVLTLSTVLMYGTVTMEKTSGMPE--LLVVTVATEETDGLRRLKRTADINDVRL 58

Query: 62  KTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDIL 120
           +  G+ + W GGD+    GGG K+ +L+  L++    +D+IIL  D+YDVI  G    IL
Sbjct: 59  EVFGMGEQWRGGDIRVDEGGGQKIRILRKSLEKYKDRNDLIILFVDAYDVIFLGNEEQIL 118

Query: 121 ERFNTF--DANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRS 178
            +F TF     +VF +E  CWP+ +L  KYP V  GYRYLNSG F+G+A +I +LIS + 
Sbjct: 119 RKFFTFFDGFRLVFSSEPFCWPNRNLAPKYPLVNFGYRYLNSGIFMGFAPEIWKLISYKD 178

Query: 179 IKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDE-----FV 233
           +++ +DDQLYY  L+LDE +R   K+ LD+++ LFQNL G+  D+KL    DE     F+
Sbjct: 179 VEDNDDDQLYYTHLYLDEQIRISLKMTLDSMSILFQNLNGASNDVKLEMS-DERSGAYFI 237

Query: 234 HLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSV 293
           +  N  YNT P++IHGNG SK+ LN FGNY+     T+  T+   +    +L+  + P +
Sbjct: 238 Y--NFIYNTYPLVIHGNGPSKLHLNYFGNYVDPLRITTAKTQHTTM----NLEKIELPRL 291

Query: 294 LISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVK 353
            +SV I KP  F+ EF   I +L Y  +KI ++VY NQ +       ++ + K  ++++ 
Sbjct: 292 FLSVVISKPIPFIREFFENIKSLAYADEKIDLYVYCNQNFLEKETSGFVEDVKGRYQSLL 351

Query: 354 YIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVN----RNESLIA 409
           Y    + +  +EAR  +++ SL  G D+   +D D HL+N + L  +++    +N  ++A
Sbjct: 352 YDDSTTELGEREARAFSLKQSLALGDDYLIMIDGDVHLNNSEALLLMIHTVKEKNSEILA 411

Query: 410 PLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIK 469
           PL+ +P K ++NFWGA++++G+YARS +Y++II  D    GIWNVP+I +  ++     K
Sbjct: 412 PLVGQPHKLFTNFWGAISSNGYYARSENYLDII--DHKEVGIWNVPFINSILIIAKE--K 467

Query: 470 ATNIKTIYTLN-SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYEL 528
             ++   Y  N  +D DM+FC+  R+KG  L +D++  YG LV SE+ +  K +P++YE+
Sbjct: 468 LASLSNAYYYNDKLDPDMSFCSFARDKGHFLYLDNSYHYGFLVVSEDVESSKVHPDMYEI 527

Query: 529 IRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDG 588
             N   W+ RYIHP Y  +L       + C DV+ FP+++E+FC E ++  E YG+WSDG
Sbjct: 528 FNNKELWEKRYIHPNYFAALNGSVQILEICQDVYDFPLMSERFCAELIEECEYYGKWSDG 587

Query: 589 TNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSF 648
            + D+RL  GYE VPTRDIHM Q+G    W   L +YV P+QE+ FIGY+ +PV + M F
Sbjct: 588 KHKDERLVGGYENVPTRDIHMNQIGFERHWLYMLDEYVRPIQEKLFIGYYKQPVESVMMF 647

Query: 649 VVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHP 708
           VVRY+P+EQ SLRPHHD+STY+I+IALN+ GVDYEGGG RF+RYNC   A  +G  ++ P
Sbjct: 648 VVRYKPEEQASLRPHHDASTYSIDIALNKRGVDYEGGGVRFLRYNCTFDADTVGHSMIFP 707

Query: 709 GRLTHYHEGLQVTQGTRYIMISFVDP 734
           GRLTH HEGL+ TQGTRYI +SF++P
Sbjct: 708 GRLTHLHEGLETTQGTRYIAVSFINP 733


>gi|291399931|ref|XP_002716643.1| PREDICTED: procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2
           [Oryctolagus cuniculus]
          Length = 877

 Score =  607 bits (1566), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 314/739 (42%), Positives = 458/739 (61%), Gaps = 31/739 (4%)

Query: 21  VHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW-LGGDMSSLG 79
           V  N +     +K LVITVA+ E DG+ RF+QSA+     VK LG  + W  GG ++S+G
Sbjct: 145 VDINYIVQFRHNKLLVITVATKENDGFHRFMQSAKYFNYTVKVLGQGEEWRGGGGINSIG 204

Query: 80  GGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCW 139
           GG KV L+K  ++     DD++IL T+ + VI  GG  ++L++F   +  +VF A+ L W
Sbjct: 205 GGQKVRLMKEVMEHYGNQDDLVILFTECFHVIFAGGPEEVLKKFQKTNHKVVFAADGLLW 264

Query: 140 PDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLR 199
           PD  L +KYP V SG  YLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D   R
Sbjct: 265 PDKRLAEKYPIVHSGKPYLNSGGFIGYAPYINRIVQQWTLQDNDDDQLFYTKIYIDPLKR 324

Query: 200 TKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNS 259
               I LD    +FQ L G+++++ L F+ ++   + NT Y T PV+I+GNG +KI LN 
Sbjct: 325 EAFNITLDHKCKIFQTLNGAVDEVVLKFENNK-TRVKNTFYETLPVVINGNGPTKIVLNY 383

Query: 260 FGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNY 318
           FGNY+   W + +GC  C     +D    D  P+V I VFI++PT FL  FL+ +  L+Y
Sbjct: 384 FGNYVPNLWTQNNGCLLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTLDY 442

Query: 319 PAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-K 377
           P + + +F++N + YH      +    K     +K +     ++  +ARN+ ++     +
Sbjct: 443 PKEALKLFIHNKEVYHEKDLKVFFDKAKHEISTIKIVGPEENLSQAKARNMGMDFCRQDE 502

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
             D+YF +D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS D
Sbjct: 503 KCDYYFSLDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSED 562

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN-- 494
           Y++I+ G +   GIWNVPYI N YL+K   +++  N +  +  + +D DMA C N R   
Sbjct: 563 YVDIVQGKR--VGIWNVPYIANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMT 620

Query: 495 -------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDW 535
                              KG+ + I +  E+G ++ + N++    N +++++  NP+DW
Sbjct: 621 LQREKDSPTPETFQMLSPPKGMFMYISNRHEFGRILSTANYNISHYNNDLWQIFENPVDW 680

Query: 536 DLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRL 595
             +YI+ +Y K +  D +  QPCPDVFWFPI +EK C E VQ ME YGQWS G ++D R+
Sbjct: 681 KEKYINRDYSK-IFTDNIVEQPCPDVFWFPIFSEKACDELVQEMEHYGQWSGGKHHDSRI 739

Query: 596 ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPD 655
             GYE VPT DIHMKQ+GL  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+
Sbjct: 740 SGGYENVPTDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPE 798

Query: 656 EQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYH 715
            Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH H
Sbjct: 799 RQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLH 858

Query: 716 EGLQVTQGTRYIMISFVDP 734
           EGL V  GTRYI +SF+DP
Sbjct: 859 EGLPVKNGTRYIAVSFIDP 877


>gi|317419977|emb|CBN82013.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Dicentrarchus
           labrax]
          Length = 682

 Score =  605 bits (1559), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 294/689 (42%), Positives = 432/689 (62%), Gaps = 12/689 (1%)

Query: 51  IQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
           +QSA      VK LG+ + W GGD+  S+GGG KV LLK  ++ +   +D+++L  DSYD
Sbjct: 1   MQSARYFNYTVKVLGMGEAWKGGDVGRSIGGGQKVRLLKEAMEALADQEDLVVLSVDSYD 60

Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
           +I  GG  +IL +F   +  ++F AE L WPD  L DKYP++ SG RYLNSGG IGYA  
Sbjct: 61  LIFAGGPEEILRKFQQANHKVLFAAEGLIWPDKRLADKYPSIRSGKRYLNSGGIIGYAPY 120

Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
           I  ++S  ++ + +DDQL+Y  ++LD   R    + LD    +FQNL G+++++ L F  
Sbjct: 121 INRVVSQWNLHDNDDDQLFYTKIYLDPLRRETLNMTLDHKCQIFQNLNGAVDEVLLKFGT 180

Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKP 287
           D  V + NT Y++ PV++HGNG +K+ LN   NY+  +W    GC+ C + +  L  LK 
Sbjct: 181 DR-VRVRNTVYDSLPVVVHGNGNTKMYLNYMANYVPNTWNYEHGCSHCDDDVVDLSQLK- 238

Query: 288 DQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKT 347
            ++P+VL+ VFI++PT FL EF  ++  L+YP  K+ +FV+NN+ YH      +    + 
Sbjct: 239 -EYPNVLVGVFIEQPTPFLPEFFQRLLTLDYPKDKLKVFVHNNEVYHEKHIQKFWEENRN 297

Query: 348 MFKNVKYIAHNSTVNSKEARNLAVENSLHKGV-DFYFYVDSDSHLDNPDVLKYLVNRNES 406
           +F + K +     ++  EARN+ ++         +YF +DSD  L N   LK L+ +N  
Sbjct: 298 VFNSFKVVGPEENLSQGEARNMGMDLCRKDTTCAYYFSIDSDVMLTNRQTLKLLIEQNRK 357

Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS 466
           +I PL+ R  K WSNFWGAL+ DG+YARS DY++I+   +   G+WN+PY+ + Y++K S
Sbjct: 358 IIGPLVTRHSKLWSNFWGALSLDGYYARSEDYVDIVQKKR--VGVWNIPYMAHVYMVKGS 415

Query: 467 VIK-ATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEV 525
            ++     +  + L  +D DM+ C N R  G+ + I +  ++G L+ + N++    N ++
Sbjct: 416 TLRNELKERNYFVLEKLDPDMSLCRNAREMGVFMYITNRHDFGRLISTANYNISHYNNDL 475

Query: 526 YELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQW 585
           +++  NP+DW  +YIH  Y K +  +    +PCPDVFWFP+ +EK C E V  ME YG W
Sbjct: 476 WQIYENPVDWKEKYIHKNYSK-IFTENYMEEPCPDVFWFPVFSEKACDEIVGEMEHYGTW 534

Query: 586 SDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAP 645
           S G + DKR+  GYE VPT DIHMKQ+G    W  F+R+++ P+  + F GY+ +   A 
Sbjct: 535 SGGRHMDKRIAGGYETVPTDDIHMKQIGFDKEWLHFIREFISPVTLKVFSGYYTKGY-AV 593

Query: 646 MSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWML 705
           M+FVV+Y P+ Q  LRPHHDSST+TINIALN    D++GGGCRF RYNC++ + R GW  
Sbjct: 594 MNFVVKYTPERQAYLRPHHDSSTFTINIALNNKDTDFQGGGCRFHRYNCSINSPRKGWSF 653

Query: 706 MHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           MHPGRLTH HEGL  T GTRYI +SF+DP
Sbjct: 654 MHPGRLTHLHEGLPTTNGTRYIAVSFIDP 682


>gi|390358384|ref|XP_781719.3| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
           [Strongylocentrotus purpuratus]
          Length = 715

 Score =  605 bits (1559), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 306/707 (43%), Positives = 444/707 (62%), Gaps = 33/707 (4%)

Query: 33  KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNEL 91
           K LV+TVA+ ETD ++R++ SAE   + VK +G+ Q W GGD+    GGG+K+NLL+  L
Sbjct: 37  KLLVVTVATKETDAFRRYMDSAEAFGINVKVVGMDQEWKGGDIERGPGGGFKINLLREAL 96

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
            +    +D++I+ TDSYDV+     +++L +F  +  N++F AE   WP+ SL +KYP V
Sbjct: 97  TQYKDDEDLVIMFTDSYDVLFLADADEMLRKFKAYQINLLFSAETYIWPEKSLANKYPKV 156

Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
            +GY YL SG ++GYA  I + +S + I++  DDQL++  L+L E               
Sbjct: 157 ENGYPYLCSGLYMGYAPYIYKALSYKPIEDIADDQLFFTELYLAE--------------- 201

Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
                   + DI L F+    + L NTKYNT P ++HGNG +K+ LN  GNYL   W   
Sbjct: 202 -------RVTDITLRFEGGNNL-LHNTKYNTVPCVLHGNGPTKVYLNHLGNYLPNKWTFD 253

Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
            GC  C+L    L  L  + +PSV+I++F+  PT F  EFL+ +  LNYP  KI +F++N
Sbjct: 254 GGCQNCDLDTFDLQGLPVEDYPSVVIAIFVGVPTPFFAEFLDLLTKLNYPKNKIDIFIHN 313

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
              +H  + + +      ++ ++K I     +   + RN  V++ +    D+YF VDSD 
Sbjct: 314 RAMFHYHMLEKFREEKGPLYNSIKIILPAEMLGDAKGRNRGVDHCMSMECDYYFSVDSDV 373

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NPDVL+ L+  N+ ++AP++ +  K WSNFWG LN+ G+YARS DY++++  ++  +
Sbjct: 374 QLTNPDVLRLLMETNKQIVAPVVSKQGKLWSNFWGDLNSQGYYARSEDYVDLVRRNR--R 431

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE-YG 508
           G+WNVPYI N YL K  ++K    K  + +  +D DMA C +LR+KGI L + + ++ YG
Sbjct: 432 GVWNVPYINNVYLAKGEMVKT--YKPNFEIEDLDTDMAICMDLRSKGIFLYVVNMEDSYG 489

Query: 509 HLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN-NQPCPDVFWFPIV 567
           H+V  +N++    + +++EL  N  DW+ +Y+ P+Y      D  N   PC DV+ FP++
Sbjct: 490 HIVTLDNYETTHLHNDMWELWNNKEDWEAKYLSPDYFVVKEMDRNNITMPCTDVYTFPLM 549

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           +  +  E ++ ME +G+WS G N DKRL  GYE VPTRDIHM Q+G    W  FLR+YVV
Sbjct: 550 SRTWAKELIEEMEHFGEWSGGGNQDKRLNGGYENVPTRDIHMNQIGFEQHWLYFLREYVV 609

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           P+ E  + GY+ +   A M+FVVRY+PDEQ SLRPHHDSSTYTIN+ALN+   DYEGGG 
Sbjct: 610 PICENVYPGYYSK-AYAIMNFVVRYKPDEQASLRPHHDSSTYTINVALNERETDYEGGGA 668

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RFIRYNC+V    +G+ +MHPGRLTHYHEGL  T GTRYIM+SF+DP
Sbjct: 669 RFIRYNCSVVGLPVGYSIMHPGRLTHYHEGLPTTNGTRYIMVSFIDP 715


>gi|397512444|ref|XP_003826555.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 3 [Pan paniscus]
          Length = 703

 Score =  604 bits (1557), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 300/709 (42%), Positives = 442/709 (62%), Gaps = 31/709 (4%)

Query: 51  IQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
           +QSA+     VK LG  + W GGD ++S+GGG KV L+K  ++     DD++++ T+ +D
Sbjct: 1   MQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEVMERYADQDDLVVMFTECFD 60

Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
           VI  GG  ++L++F   +  +VF A+ + WPD  L DKYP V  G RYLNSGGFIGYA  
Sbjct: 61  VIFAGGPEEVLKKFQKANHKVVFAADGILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPY 120

Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
           +  ++   ++++ +DDQL+Y  +++D   R    I LD    +FQ L G+++++ L F+ 
Sbjct: 121 VNRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFEN 180

Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
            +     NT Y T PV I+GNG +KI LN FGNY+  SW + +GCT C     +D    D
Sbjct: 181 GK-ARAKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVD 238

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
             P+V I VFI++PT FL  FL+ +  L+YP + + +F++N + YH      +    K  
Sbjct: 239 VHPNVSIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIKIFFDKAKHE 298

Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESL 407
            K +K +     ++  EARN+ ++     +  D+YF VD+D  L NP  LK L+ +N  +
Sbjct: 299 IKTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSVDADVVLTNPRTLKILIEQNRKI 358

Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
           IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++   G+WNVPY+ N YL+K   
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGVWNVPYMANVYLIKGKT 416

Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQ 505
           +++  N +  +  + +D DMA C N R                      KG+ + I +  
Sbjct: 417 LRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYISNRH 476

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
           E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWFP
Sbjct: 477 EFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFP 535

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV L  VW  F+R++
Sbjct: 536 IFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREF 595

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GG
Sbjct: 596 IAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGG 654

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 655 GCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 703


>gi|194378198|dbj|BAG57849.1| unnamed protein product [Homo sapiens]
          Length = 690

 Score =  604 bits (1557), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 295/708 (41%), Positives = 445/708 (62%), Gaps = 46/708 (6%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +++I                                     +PD  L  KYP 
Sbjct: 85  LEKHADKEELI-------------------------------------YPDRRLETKYPV 107

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 108 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 167

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 168 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 226

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K + +F++
Sbjct: 227 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIH 286

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE-NSLHKGVDFYFYVDS 387
           N++++H    ++++    + +++VK +     + + +ARN+  +    ++   +YF VD+
Sbjct: 287 NHEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQYRSCTYYFSVDA 346

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 347 DVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 405

Query: 448 GKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +   
Sbjct: 406 -VGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHT 464

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
            GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 465 LGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPI 523

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+
Sbjct: 524 FTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYI 583

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGG
Sbjct: 584 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGG 642

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 643 CRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 690


>gi|296227894|ref|XP_002759563.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           isoform 2 [Callithrix jacchus]
          Length = 703

 Score =  602 bits (1553), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 301/709 (42%), Positives = 441/709 (62%), Gaps = 31/709 (4%)

Query: 51  IQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
           +QSA+     VK LG  + W GGD ++S+GGG KV L+K  +      DD++++ T+ +D
Sbjct: 1   MQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLVKEVMGHYADQDDLVVMFTECFD 60

Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
           VI  GG  ++L++F   +  +VF A+ + WPD  L DKYP V  G RYLNSGGFIGYA  
Sbjct: 61  VIFAGGPEELLKKFQKANHKVVFAADGILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPY 120

Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
           I  ++   ++++ +DDQL+Y  +++D   R    I LD    +FQ L G+++++ L F+ 
Sbjct: 121 INRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFEN 180

Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
            +     NT Y T PV I+GNG +KI LN FGNY+  SW + +GCT C     +D    D
Sbjct: 181 GK-ARAKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVD 238

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
             P+V I VFI++PT FL  FL+ +  L+YP + + +F++N + YH      +    K  
Sbjct: 239 VHPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHE 298

Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESL 407
            K +K +     ++  EARN+ ++     +  D+YF VD+D  L NP  LK L+ +N  +
Sbjct: 299 IKTIKIVGPEENLSQAEARNMGMDFCRQDENCDYYFSVDADVVLTNPRTLKILIEQNRKI 358

Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
           IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++   G+WNVPY+ N YL+K   
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGVWNVPYMANVYLIKGKT 416

Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQ 505
           +++  N +  +  + +D DMA C N R                      KG+ + I +  
Sbjct: 417 LRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYISNRN 476

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
           E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWFP
Sbjct: 477 EFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFP 535

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV L  VW  F+R++
Sbjct: 536 IFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREF 595

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GG
Sbjct: 596 IAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGG 654

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 655 GCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 703


>gi|338715149|ref|XP_001493153.3| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Equus
           caballus]
          Length = 703

 Score =  602 bits (1551), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 301/709 (42%), Positives = 440/709 (62%), Gaps = 31/709 (4%)

Query: 51  IQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
           +QSA+     VK LG  + W GGD ++S+GGG KV L+K  ++     +D+++L T+ +D
Sbjct: 1   MQSAKYFNYTVKVLGQGEDWRGGDGINSIGGGQKVRLMKEVMEHYADQEDLVVLFTECFD 60

Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
           VI  GG  ++L++F   +  +VF A+ + WPD  L DKYP V  G RYLNSGGFIGYA  
Sbjct: 61  VIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKRLADKYPIVHIGKRYLNSGGFIGYAPY 120

Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
           I  ++   ++++ +DDQL+Y  ++++   R    I LD    +FQ L G+++++ L F+ 
Sbjct: 121 INRIVQEWNLQDNDDDQLFYTKIYVNPLKREAINITLDHKCKIFQTLNGAVDEVVLKFEN 180

Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
            +     N  Y T PV I+GNG +KI LN FGNY+  +W +  GCT C L   +D    +
Sbjct: 181 GK-ARAKNVFYETLPVAINGNGPTKILLNYFGNYVPNAWTQDKGCTLCEL-DTIDLSAVN 238

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
             P+V I VFI++PT FL  FLN +  L+YP + + +F++N + YH      +    K  
Sbjct: 239 VHPNVTIGVFIEQPTPFLPRFLNLLLTLDYPKEALKLFIHNKEVYHEKDIKIFFDKAKHE 298

Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESL 407
              +K +     ++  EARN+ ++     K  D+YF VD+D  L NP  LK L+ +N  +
Sbjct: 299 ISTIKIVGPEENLSQAEARNMGMDFCRQDKNCDYYFSVDADVVLTNPRTLKILIEQNRKI 358

Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
           IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++   GIWNVPY+ N YL+K   
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGIWNVPYMANVYLIKGKT 416

Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQ 505
           +++  N +  +  + +D DMA C N R                      KG+ + I +  
Sbjct: 417 LRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYISNRH 476

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
           E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWFP
Sbjct: 477 EFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFP 535

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I +EK C E V+ ME YGQWS G ++D R+  GYE VPT DIHMKQ+ L  VW  F+R++
Sbjct: 536 IFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIREF 595

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + P+  + F GY+ +   A ++FVV+Y PD Q SLRPHHD+ST+TINIALN VG D++GG
Sbjct: 596 IAPVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRSLRPHHDASTFTINIALNNVGEDFQGG 654

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 655 GCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 703


>gi|393910403|gb|EJD75866.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase [Loa loa]
          Length = 729

 Score =  602 bits (1551), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 306/712 (42%), Positives = 454/712 (63%), Gaps = 19/712 (2%)

Query: 33  KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNEL 91
           K LV+TVA+ ETDG +R  ++A  N  +++  G+ + W GG+     GGG K+ +L+  L
Sbjct: 27  KLLVVTVATEETDGLRRLKRTAHTNHFRLEVFGMGEEWRGGNTRVEQGGGQKIRILRKSL 86

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTF--DANIVFGAERLCWPDTSLYDKYP 149
            +    DD+IIL  D+YDVI+ G    IL +F TF     +VF +E  CWP+ +L  KYP
Sbjct: 87  GKYKDRDDLIILFVDAYDVILLGNEEQILRKFFTFFNGFRVVFSSEPFCWPNRNLAPKYP 146

Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
            V  G+RYLNSG F+G+A +I  LIS R +++ +DDQLYY  L+LD+ +R   K+ LD++
Sbjct: 147 LVNFGHRYLNSGVFMGFAPEIWSLISYRDVEDNDDDQLYYTRLYLDKQIRLSLKMTLDSM 206

Query: 210 ANLFQNLYGSLEDIKLNFDLDE--FVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             LFQNL G+  D+KL    +      + N  YNT+P++IHGNG SK+ LN  GNY+   
Sbjct: 207 TVLFQNLNGASNDVKLEMSGERSGMYFIYNFIYNTHPLVIHGNGPSKLYLNHLGNYIDPL 266

Query: 268 WKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
              +  T+   +      +  + P + +S+ I KP  F+ EF   I  L Y  +KI +FV
Sbjct: 267 RIATSKTQSITM----DFEKIELPKLFLSIIISKPIPFIREFFGNIKKLAYTDEKIDLFV 322

Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
           Y NQ++      D++ + K  ++++ Y   ++ +  +EAR+ +++ SL  G D+   VD 
Sbjct: 323 YCNQKFLTKEVSDFVEDVKKRYRSLLY-DDSTEMEEREARSFSLKQSLALGDDYLIMVDG 381

Query: 388 DSHLDNPDVLKYLVN----RNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
           D HL+N + L ++V+    +   ++APL+ +P K ++NFWGA++++G+YARS +Y++II 
Sbjct: 382 DVHLNNSEALLFMVHTMKEKEPEILAPLIRQPHKLFTNFWGAISSNGYYARSENYLDII- 440

Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-MDYDMAFCTNLRNKGIHLKID 502
            D    GIWNVP+I +  ++     K T++   Y  +  +D DM+FC+  R+KG  L +D
Sbjct: 441 -DHKEVGIWNVPFIGSILIIAKE--KLTSLSRAYHYDEKLDPDMSFCSFARDKGHFLYLD 497

Query: 503 STQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
           ++  YG LV SEN +  K +PE+YE+  N   W+ RYIHP Y  +L   T   + C DV+
Sbjct: 498 NSHHYGFLVVSENVESSKVHPEMYEIFNNKELWEKRYIHPNYFTALNGSTPIPEICQDVY 557

Query: 563 WFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFL 622
            FP+++E+FC E ++  E YG+WSDG + D+RL  GYE VPTRDIHMKQ+     W   L
Sbjct: 558 DFPLMSERFCAELIEECEYYGKWSDGKHKDERLVGGYENVPTRDIHMKQIDFERHWLYML 617

Query: 623 RKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDY 682
            +YV P+QE+ FIGY+ +PV + M FVVRY+P+EQ SLRPHHD+STY+I+IALN+ GVDY
Sbjct: 618 DEYVRPIQEKLFIGYYKQPVESVMMFVVRYKPEEQASLRPHHDASTYSIDIALNKRGVDY 677

Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           EGGG RF+RYNC   A  +G  ++ PGRLTH HEGL+ T+GTRYI +SF++P
Sbjct: 678 EGGGVRFLRYNCTFDADVVGHSMIFPGRLTHLHEGLETTRGTRYIAVSFINP 729


>gi|194379148|dbj|BAG58125.1| unnamed protein product [Homo sapiens]
          Length = 703

 Score =  601 bits (1550), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 299/709 (42%), Positives = 441/709 (62%), Gaps = 31/709 (4%)

Query: 51  IQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
           +QSA+     VK LG  + W GGD ++S+GGG KV L+K  ++     DD++++ T+ +D
Sbjct: 1   MQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEVMEHYADQDDLVVMFTECFD 60

Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
           VI  GG  ++L++F   +  +VF A+ + WPD  L DKYP V  G RYLNSGGFIGYA  
Sbjct: 61  VIFAGGPEEVLKKFQKANHKVVFAADGILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPY 120

Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
           +  ++   ++++ +DDQL+Y  +++D   R    I LD    +FQ L G+++++ L F+ 
Sbjct: 121 VNRIVQQWNLQDNDDDQLFYTKVYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFEN 180

Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
            +     NT Y T PV I+GNG +KI LN FGNY+  SW + +GCT C     +D    D
Sbjct: 181 GK-ARAKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVD 238

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
             P+V I VFI++PT FL  FL+ +  L+YP + + +F++N + YH      +    K  
Sbjct: 239 VHPNVSIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHE 298

Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESL 407
            K +K +     ++  EARN+ ++     +  D+YF VD+D  L NP  LK L+ +N  +
Sbjct: 299 IKTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSVDADVVLTNPRTLKILIEQNRKI 358

Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
           IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++   G+WNVPY+ N YL+K   
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGVWNVPYMANVYLIKGKT 416

Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQ 505
           +++  N +  +  + +D D A C N R                      KG+ + I +  
Sbjct: 417 LRSEMNERNYFVRDKLDPDTALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYISNRH 476

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
           E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWFP
Sbjct: 477 EFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFP 535

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV L  VW  F+R++
Sbjct: 536 IFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREF 595

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GG
Sbjct: 596 IAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGG 654

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 655 GCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 703


>gi|410984588|ref|XP_003998609.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Felis
           catus]
          Length = 698

 Score =  601 bits (1549), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 293/711 (41%), Positives = 439/711 (61%), Gaps = 53/711 (7%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ +T+GY+RF++SAE     V+TLGL Q W GGD++ ++GGG KV  L
Sbjct: 36  VNPEKLLVITVATAKTEGYRRFLRSAEFFNYTVRTLGLGQEWRGGDVARTVGGGQKVRWL 95

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DM+I+  DSYDVI+ G  +++L++F    + ++F AE  CWP+  L ++
Sbjct: 96  KKEMEKYADREDMVIMFVDSYDVILAGSPSELLKKFVQSGSRLLFSAEGFCWPEWGLAEQ 155

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I  ++     ++++DDQL+Y  L+LD  LR K  + LD
Sbjct: 156 YPEVGTGKRFLNSGGFIGFAPTIHRVVRQWKYEDDDDDQLFYTRLYLDPGLREKLSLNLD 215

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L++I L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 216 HKSRIFQNLNGALDEIVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 274

Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  C   +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 275 WTPQGGCGFCGRDRRILPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 332

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ YH P   D     +  F  VK +     +   EAR++A+++       +FYF 
Sbjct: 333 FLHNNEVYHEPHIADSWPQLQGHFSAVKLVGPEEALTPGEARDMAMDSCRQDPECEFYFS 392

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ + N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 393 LDADAVITNQQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 452

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC +LR+KGI L + +
Sbjct: 453 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 510

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
            QE+G L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L    +  QPCPDV+W
Sbjct: 511 QQEFGRLLATSRYDTDHLHPDLWQIFDNPLDWREQYIHENYSRALEGQGLVEQPCPDVYW 570

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++++ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 571 FPLLSDQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 630

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
            YV P+ E  F GYH                                            +
Sbjct: 631 TYVGPMTESLFPGYHT-------------------------------------------K 647

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGCRF+RY+C V++ R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 648 GGGCRFLRYDCIVSSPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 698


>gi|345789311|ref|XP_542822.3| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Canis
           lupus familiaris]
          Length = 703

 Score =  600 bits (1548), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 297/709 (41%), Positives = 443/709 (62%), Gaps = 31/709 (4%)

Query: 51  IQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
           +QSA      VK LG  + W GGD ++S+GGG KV L+K  ++     +D+++L T+ ++
Sbjct: 1   MQSARYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEVMEHYANQEDLVVLFTECFN 60

Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
           VI  GG  ++L++F   +  +VF A+ + WPD  L DKYP V  G RYLNSGGFIGYA +
Sbjct: 61  VIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKRLADKYPIVHIGKRYLNSGGFIGYAPN 120

Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
           I +++   ++++ +DDQL+Y  +++D   R    I LD    +FQ L G+++++ L F+ 
Sbjct: 121 INQIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFEN 180

Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
            +     N  Y T PV ++GNG +KI LN FGNY+  +W + +GCT C+L   +D    D
Sbjct: 181 GK-ARAKNVFYETLPVAVNGNGPTKILLNYFGNYVPNAWTQDNGCTLCDL-DTIDLSTVD 238

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
             P+V I VFI++PT FL  FL+ +  L+YP + + +F++N + YH      +    K  
Sbjct: 239 VHPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHE 298

Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESL 407
              +K +     ++  EARN+ ++     +  D+YF +D+D  L NP  LK L+ +N  +
Sbjct: 299 INTIKIVGPEENLSQAEARNMGMDFCRQDENCDYYFSMDADVVLTNPRTLKILIEQNRKI 358

Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
           IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++   GIWNVPY+ N YL+K   
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGIWNVPYMANVYLIKGKT 416

Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQ 505
           +++  N +  +  + +D DMA C N R                      KG+ + I +  
Sbjct: 417 LRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYISNRH 476

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
           E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWFP
Sbjct: 477 EFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFP 535

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I +EK C E V+ ME YGQWS G ++D R+  GYE VPT DIHMKQ+ L  VW  F+R++
Sbjct: 536 IFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIREF 595

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GG
Sbjct: 596 IAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGG 654

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 655 GCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 703


>gi|432960048|ref|XP_004086421.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
           [Oryzias latipes]
          Length = 695

 Score =  600 bits (1546), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 294/714 (41%), Positives = 442/714 (61%), Gaps = 46/714 (6%)

Query: 27  KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVN 85
           + I E+K LV+TVA+ ETDG++RF++SA      VK LG  Q W GGD MS+ GGG KV 
Sbjct: 22  QRIPEEKLLVLTVATKETDGFRRFLKSARNFNYTVKVLGRGQKWSGGDYMSAPGGGQKVR 81

Query: 86  LLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLY 145
           LLK  L E   ++D ++L TDSYD +   G  ++L +F      +VF +ERL WPD  L 
Sbjct: 82  LLKEALQETK-SEDQVLLFTDSYDAVFASGPKELLRKFQQAKHKVVFSSERLIWPDRHLE 140

Query: 146 DKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIV 205
           DK+P V  G R+L SGGF+GY  +I+E+++N + ++ + DQL++  +++D   R    I 
Sbjct: 141 DKHPHVREGNRFLGSGGFMGYLSNIREMVANWTGEDADSDQLFFTKIYVDPDKRKSINIT 200

Query: 206 LDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLA 265
           +D+   LFQNL+G+L+D+ L F+ D  V   N  Y+T PV+IHGNG +K+++N  GNY+ 
Sbjct: 201 VDSRCRLFQNLHGALDDVVLKFE-DGRVRARNVLYDTLPVLIHGNGPTKLQINYMGNYIP 259

Query: 266 KSWK-TSGCTRC-NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKI 323
           KSW   +GCT C + ++ L +LK  ++P V I +FI +PT F+  F  ++  L YP  ++
Sbjct: 260 KSWTFENGCTVCQDDLRSLAALKDSEYPLVSIGIFIQQPTPFVSVFFERLLKLEYPKDRL 319

Query: 324 SMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFY 382
            +F+YN + +H P    ++ +   ++++V+ I     ++S  AR+L ++        D++
Sbjct: 320 KLFIYNQEPHHEPQVSSFLRDHGGLYQDVRSITPKEDMDSAAARDLVLDICRKDTDCDYF 379

Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
           F +D +  L N   LK L+ +N  ++AP++ R  + WSNFWGA++ADG+YARS DY++I+
Sbjct: 380 FNLDIEVVLKNEKTLKILIEQNLPVVAPMITRAARLWSNFWGAVSADGYYARSEDYVDIV 439

Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRN--KGIHLK 500
            G +  K                                     +    L++  KG    
Sbjct: 440 QGRRVSK------------------------------------QSLRGELQDHLKGSFHV 463

Query: 501 IDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPD 560
            +  Q +G ++ +EN+     + +++++  NP+DW+ RYIH  Y + ++ D +   PCPD
Sbjct: 464 CNHMQTFGRILSTENYQSTHLHNDLWQIFENPVDWEERYIHQNYSR-IMRDKLIETPCPD 522

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
           V+WFP+ ++  C   V+ ME +G+WS G N D R++ GYE VPT DIHM Q+     W +
Sbjct: 523 VYWFPVFSDVACTHLVEEMEHFGKWSGGGNTDTRIQGGYENVPTIDIHMNQINFEKEWQK 582

Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGV 680
           FL +YV P+ E+ + GY+ +     ++FVVRY+PDEQPSLRPHHD+ST+TINIALNQ+G 
Sbjct: 583 FLVEYVAPITEKMYPGYYTK-AHFELAFVVRYKPDEQPSLRPHHDASTFTINIALNQLGQ 641

Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           DY+GGGCRF+RY C++ A R GW LMHPGRLTHYHEGL  T GTRYI +SFVDP
Sbjct: 642 DYQGGGCRFLRYGCSIQAPRKGWALMHPGRLTHYHEGLPTTAGTRYIAVSFVDP 695


>gi|410971254|ref|XP_003992085.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Felis
           catus]
          Length = 703

 Score =  597 bits (1540), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 296/709 (41%), Positives = 441/709 (62%), Gaps = 31/709 (4%)

Query: 51  IQSAEVNKLQVKTLGLHQPWLGG-DMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
           +QSA+     VK LG  + W GG  ++S+GGG KV L+K  ++     +D+++L T+ +D
Sbjct: 1   MQSAKYFNYTVKVLGQGEEWRGGAGINSIGGGQKVRLMKEVMEHYANQEDLVVLFTECFD 60

Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
           VI  GG  ++L++F   +  +VF A+ + WPD  L DKYP V  G RYLNSGGFIGYA  
Sbjct: 61  VIFAGGPEEVLKKFQKSNHKVVFAADGILWPDKRLADKYPIVHFGKRYLNSGGFIGYAPY 120

Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
           I  ++   ++++ +DDQL+Y  +++D   R    I LD    +FQ L G+++++ L F+ 
Sbjct: 121 INRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFEN 180

Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
            +     N  Y T PV ++GNG +KI LN FGNY+  +W + +GCT C+L   +D    D
Sbjct: 181 GK-ARAKNVFYETLPVALNGNGPTKILLNYFGNYVPNAWTQDNGCTLCDL-DTIDLSTVD 238

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
             P+V I +FI++PT FL  FL+ +  L+YP + + +F++N + YH      +    K  
Sbjct: 239 VHPNVTIGIFIEQPTPFLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHE 298

Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLHKGV-DFYFYVDSDSHLDNPDVLKYLVNRNESL 407
              +K +     ++  EARN+ ++      + D+YF +D+D  L NP  LK L+ +N  +
Sbjct: 299 ISTIKIVGPEENLSQAEARNMGMDFCRQDEICDYYFSIDADVVLTNPRTLKILIEQNRKI 358

Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
           IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++   GIWNVPY+ N YL+K   
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGIWNVPYMANVYLIKGKT 416

Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQ 505
           +++  N +  +  + +D DMA C N R                      KG+ + I +  
Sbjct: 417 LRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYISNRH 476

Query: 506 EYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFP 565
           E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWFP
Sbjct: 477 EFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFP 535

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKY 625
           I +EK C E V+ ME YGQWS G ++D R+  GYE VPT DIHMKQ+ L  VW  F+R++
Sbjct: 536 IFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIREF 595

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGG 685
           + P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GG
Sbjct: 596 IAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGG 654

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 655 GCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 703


>gi|317419976|emb|CBN82012.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Dicentrarchus
           labrax]
          Length = 703

 Score =  594 bits (1532), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 295/710 (41%), Positives = 433/710 (60%), Gaps = 33/710 (4%)

Query: 51  IQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
           +QSA      VK LG+ + W GGD+  S+GGG KV LLK  ++ +   +D+++L  DSYD
Sbjct: 1   MQSARYFNYTVKVLGMGEAWKGGDVGRSIGGGQKVRLLKEAMEALADQEDLVVLSVDSYD 60

Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
           +I  GG  +IL +F   +  ++F AE L WPD  L DKYP++ SG RYLNSGG IGYA  
Sbjct: 61  LIFAGGPEEILRKFQQANHKVLFAAEGLIWPDKRLADKYPSIRSGKRYLNSGGIIGYAPY 120

Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
           I  ++S  ++ + +DDQL+Y  ++LD   R    + LD    +FQNL G+++++ L F  
Sbjct: 121 INRVVSQWNLHDNDDDQLFYTKIYLDPLRRETLNMTLDHKCQIFQNLNGAVDEVLLKFGT 180

Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRC-NLIKHLDSLKP 287
           D  V + NT Y++ PV++HGNG +K+ LN   NY+  +W    GC+ C + +  L  LK 
Sbjct: 181 DR-VRVRNTVYDSLPVVVHGNGNTKMYLNYMANYVPNTWNYEHGCSHCDDDVVDLSQLK- 238

Query: 288 DQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKT 347
            ++P+VL+ VFI++PT FL EF  ++  L+YP  K+ +FV+NN+ YH      +    + 
Sbjct: 239 -EYPNVLVGVFIEQPTPFLPEFFQRLLTLDYPKDKLKVFVHNNEVYHEKHIQKFWEENRN 297

Query: 348 MFKNVKYIAHNSTVNSKEARNLAVENSLHKGV-DFYFYVDSDSHLDNPDVLKYLVNRNES 406
           +F + K +     ++  EARN+ ++         +YF +DSD  L N   LK L+ +N  
Sbjct: 298 VFNSFKVVGPEENLSQGEARNMGMDLCRKDTTCAYYFSIDSDVMLTNRQTLKLLIEQNRK 357

Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS 466
           +I PL+ R  K WSNFWGAL+ DG+YARS DY++I+   +   G+WN+PY+ + Y++K S
Sbjct: 358 IIGPLVTRHSKLWSNFWGALSLDGYYARSEDYVDIVQKKR--VGVWNIPYMAHVYMVKGS 415

Query: 467 VIK-ATNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDST 504
            ++     +  + L  +D DM+ C N R                      KG+ + I + 
Sbjct: 416 TLRNELKERNYFVLEKLDPDMSLCRNAREMTSHREKDSPSPESFHMLRPPKGVFMYITNR 475

Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
            ++G L+ + N++    N +++++  NP+DW  +YIH  Y K +  +    +PCPDVFWF
Sbjct: 476 HDFGRLISTANYNISHYNNDLWQIYENPVDWKEKYIHKNYSK-IFTENYMEEPCPDVFWF 534

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           P+ +EK C E V  ME YG WS G + DKR+  GYE VPT DIHMKQ+G    W  F+R+
Sbjct: 535 PVFSEKACDEIVGEMEHYGTWSGGRHMDKRIAGGYETVPTDDIHMKQIGFDKEWLHFIRE 594

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           ++ P+  + F GY+ +   A M+FVV+Y P+ Q  LRPHHDSST+TINIALN    D++G
Sbjct: 595 FISPVTLKVFSGYYTKGY-AVMNFVVKYTPERQAYLRPHHDSSTFTINIALNNKDTDFQG 653

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGCRF RYNC++ + R GW  MHPGRLTH HEGL  T GTRYI +SF+DP
Sbjct: 654 GGCRFHRYNCSINSPRKGWSFMHPGRLTHLHEGLPTTNGTRYIAVSFIDP 703


>gi|226490282|emb|CAX69383.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 [Schistosoma
           japonicum]
          Length = 721

 Score =  594 bits (1532), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 307/716 (42%), Positives = 456/716 (63%), Gaps = 37/716 (5%)

Query: 35  LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDE 93
           LV+TVA+ + D   RF++S  +N  +VK LG    W GG+++ S GGG KVN+LK+EL +
Sbjct: 27  LVLTVATEKNDALDRFLRSCSLNGFEVKVLGEGSYWKGGNVAKSTGGGQKVNILKDELAK 86

Query: 94  MDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGS 153
                D ++L  DSYDV+    V ++L+ +  F++ ++F AE  CWP  SL   YP V  
Sbjct: 87  STYRPDQLVLFVDSYDVVFMQNVANLLKEYERFESKVIFSAEEFCWPQPSLKSLYPEVKP 146

Query: 154 G-YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANL 212
           G  RYLNSGGFIG   ++ +++++  I +++DDQLYY  +FLD  LRT + I LD  + +
Sbjct: 147 GEKRYLNSGGFIGPVANLIKIVNHTPINDDDDDQLYYTNIFLDSKLRTLYDIELDKTSRI 206

Query: 213 FQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TS 271
           FQNL G+ +D++L+F+ DE  +L N  ++T P+I HGNG  K+E NS  NYL  SW  T 
Sbjct: 207 FQNLNGAFDDVELHFN-DETGYLFNKIFSTTPIIAHGNGPIKVEFNSLSNYLVHSWTPTH 265

Query: 272 GCTRCNLIK-HLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF--VY 328
           GC  CN     L+ L    +P V++ +F+++ T F+E+F  +IA L+YP  ++ +   + 
Sbjct: 266 GCQHCNEDNVELNDLS--DYPLVVMGIFVEQATPFIEKFFERIAALSYPKSRLHVVGHMA 323

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARN------LAVENSLHKGVDFY 382
            + ++     + +   F   + +V ++  N  +  + AR       LA+E+       + 
Sbjct: 324 ESTKFQLSASESFNQTFGHEYLSVSWLEEN--LEEEIARKKVFGYCLAIED-----CKYV 376

Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
           F VDS + L+NP+ L +L+  N S+IAPLL    KAWSNFWGAL  DG+YARS DYM+II
Sbjct: 377 FAVDSIAQLENPETLDHLIRMNRSIIAPLLTIRGKAWSNFWGALGTDGYYARSSDYMDII 436

Query: 443 NGDQGGKGIWNVPYITNCYLM-KTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
           + +    GIWNVP + + YL+ + +V+K  +I           +M F    RNK I + +
Sbjct: 437 SYNM--TGIWNVPLVRSAYLITRPAVLKLIDITNT--------EMNFAYEARNKNIFMFV 486

Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNN---QPC 558
           D+   +G+L++++N+   K + ++++++ NP DW+ +YIHP+Y  +  P+ +     QPC
Sbjct: 487 DNQVNFGYLINADNYTKGKLHNDLWQIMDNPQDWEEKYIHPQYFNTAKPEVMMTDVAQPC 546

Query: 559 PDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW 618
           PDVFWFP+++E FC   ++ +E YGQWS+G N D RLE GYE VPTRDIHM+Q+G    W
Sbjct: 547 PDVFWFPLLSETFCKHLIEEVENYGQWSNGDNYDPRLEGGYENVPTRDIHMRQIGWEEHW 606

Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
              L KYV  +Q++ F GY  +P  A M+FVVRY+PDEQPSLRPHHD+S+YT+NI LNQ 
Sbjct: 607 LHVLEKYVHKIQKKLFQGYDDKP-WARMNFVVRYKPDEQPSLRPHHDASSYTLNIGLNQP 665

Query: 679 GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
             DY+GGG  F RYNC++  TR+GW ++ PGR+TH HEGL  T+GTRYI ++FV+P
Sbjct: 666 EKDYKGGGVHFNRYNCSIIDTRVGWAVVSPGRVTHLHEGLPTTEGTRYIFVTFVNP 721


>gi|341902492|gb|EGT58427.1| hypothetical protein CAEBREN_29667, partial [Caenorhabditis
           brenneri]
          Length = 689

 Score =  585 bits (1509), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 286/684 (41%), Positives = 433/684 (63%), Gaps = 36/684 (5%)

Query: 79  GGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF--NTFDANIVFGAER 136
           GGG K+ +L   +++     D +I+  D+YDVI +     IL +F  +  D  ++FGAE 
Sbjct: 14  GGGQKIRILSEWIEKYKDASDTMIMFVDAYDVIFNADSTTILRKFFEHYSDKRLLFGAEP 73

Query: 137 LCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDE 196
            CWPD +L   YP V  G R+LNSG F+GY  ++ +++  + +++++DDQLYY  ++LD 
Sbjct: 74  FCWPDQTLAPDYPIVEFGKRFLNSGLFMGYGPEVYKVLKLKPVEDKDDDQLYYTRVYLDN 133

Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE 256
            LR + K+ LD+++ +FQNL G +ED++L F  D      N  YNT P+I+HGNG SK  
Sbjct: 134 KLRKELKMDLDSMSKIFQNLNGVIEDVELQFKEDGTPEAYNAAYNTKPLIVHGNGPSKSH 193

Query: 257 LNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIAN 315
           LN  GNYL   W +  GC  C      +    ++ P + +++FI KP  F+EE L K+A 
Sbjct: 194 LNYLGNYLGNRWNSQLGCRTCGQ----EMKDSEELPLIGLNIFIAKPIPFIEEVLQKVAE 249

Query: 316 LNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSL 375
            +YP  KI++++YNNQ +      D++      +   + I   + +  +EARN A+E   
Sbjct: 250 FDYPKDKIALYIYNNQPFSIKNIQDFLKEHGKSYYTKRVINGVTEIGEREARNEAIEWDK 309

Query: 376 HKGVDFYFYVDSDSHLDNPDVLKYLVNRNES----LIAPLLVRPFKAWSNFWGALNADGF 431
            + V+F F +D D++   P V+K L++ +++    +IAP++ +  K ++NFWGA+ A+G+
Sbjct: 310 QRNVEFAFLMDGDAYFTEPKVIKDLIHYSKTYDVGIIAPMVGQIGKLFTNFWGAIAANGY 369

Query: 432 YARSFDYMNIINGDQGGKGIWNV----------------PYITNCYLMKTSVIKATNIKT 475
           YARS DYM I+ G++   G WNV                P+IT+  L+    + A  +K 
Sbjct: 370 YARSEDYMAIVKGNR--IGYWNVRQKLRNVSNNNFLFQVPFITSALLLNKEKLSA--LKD 425

Query: 476 IYTLN-SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQ----KTNPEVYELIR 530
            Y+ N ++D DM+ C   R+ G  + ID+ ++YG+L+ S+ F       K +PE++++  
Sbjct: 426 AYSYNKNLDPDMSMCQFARDNGHFMYIDNEKQYGYLIVSDEFSETVTEGKWHPEMWQIFE 485

Query: 531 NPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN 590
           N   W+ RY+HP Y K + PD + +Q CPDV+ +P+++E+FC E ++ ME +G+WSDG+N
Sbjct: 486 NRDLWEARYVHPGYHKIMEPDHIIDQACPDVYDYPLMSERFCEELIEEMEGFGRWSDGSN 545

Query: 591 NDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVV 650
           NDKRL  GYE VPTRDIHM QVG    W  FL  YV P+QE+ FIGY+H+PV + M FVV
Sbjct: 546 NDKRLAGGYENVPTRDIHMNQVGFERQWLYFLDTYVRPVQEKTFIGYYHQPVESNMMFVV 605

Query: 651 RYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGR 710
           RY+P+EQ SLRPHHD+ST++I++ALN+ G DYEGGG R++RYNC V A  +G+ +M PGR
Sbjct: 606 RYKPEEQASLRPHHDASTFSIDVALNKKGRDYEGGGVRYVRYNCTVEADEVGYAMMFPGR 665

Query: 711 LTHYHEGLQVTQGTRYIMISFVDP 734
           LTH HEGL  T+GTRYIM+SF++P
Sbjct: 666 LTHLHEGLATTKGTRYIMVSFINP 689


>gi|156385144|ref|XP_001633491.1| predicted protein [Nematostella vectensis]
 gi|156220562|gb|EDO41428.1| predicted protein [Nematostella vectensis]
          Length = 729

 Score =  582 bits (1501), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 302/741 (40%), Positives = 448/741 (60%), Gaps = 23/741 (3%)

Query: 5   LHLNCLILSCVV------FFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNK 58
           + +  LI SCV       + ++      ++  E + LV+TVA+ ETDGY RF++S     
Sbjct: 1   MSVKALISSCVFLLASLSYLVNADNGFSRDPKELELLVLTVATEETDGYTRFMRSCSHYD 60

Query: 59  LQVKTLGLHQPWLGGDMSS-LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVN 117
           + V+ +G++  W GG++ +  GG +K+NLLK+ + E     +++++ +DSYD I      
Sbjct: 61  VPVRVIGMNTSWKGGNVRTDPGGAHKINLLKDAVAEYKDKKNLVLMFSDSYDAIFLARAE 120

Query: 118 DILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNR 177
             +++F  F A++VF AE  CWPD  L DKYP VG G RYL SGGFIGYA    ++I+ +
Sbjct: 121 AFIKKFLEFKAHVVFSAEGFCWPDRWLVDKYPEVGHGKRYLCSGGFIGYAPVFHQIINEK 180

Query: 178 SIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTN 237
            +K+E+DDQL+Y  ++LD+  R K  + LD  A +F NL G+ E+++L F+  E V L N
Sbjct: 181 PVKDEDDDQLFYTNIYLDKEKRDKFNMKLDHKAEIFMNLNGAEEEVQLKFE-GEKVWLYN 239

Query: 238 TKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLIS 296
             Y+T P+ +HGNG SK+ LN  GNYL   W K  GC  CN        K   +P V+++
Sbjct: 240 KVYSTTPLWVHGNGPSKVHLNYIGNYLPAMWNKEKGCLVCNEDTIKLPEKESDYPKVMMA 299

Query: 297 VFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYI-HNFKTMFKNVKYI 355
           +FI +PT F+ EF  +I  L+YP KKI+++++N  + H    ++++    + ++ +V Y 
Sbjct: 300 IFISRPTPFVPEFFKRIEALDYPKKKIALYIHNLMDGHTKEVNEWLTEEIRGLYHSVTY- 358

Query: 356 AHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRP 415
                     ARN AV    + G D+ F VD++    N   LK L+ +N  L+ P + + 
Sbjct: 359 -QGPGTFEAAARNKAV----YSGSDYLFVVDANVVYTNKKSLKLLIEQNRPLLVPKMSKH 413

Query: 416 FKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKT 475
            K WSNFWG +  DG+YAR+ DY++I+   +   GIWN  Y+T  YL++  V+    +K 
Sbjct: 414 AKLWSNFWGTIGDDGYYARAEDYIDIVEYRR--VGIWNSAYVTGSYLIQKDVL--PKLKH 469

Query: 476 IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDW 535
            Y+  +++ D++F   LR+ GI + + +   +G L +++       + +++++  N +DW
Sbjct: 470 AYSYGNLEPDLSFSKYLRDNGIFMYVTNMHYFGRLKETDTVTTNHLHNDLWQIFDNQIDW 529

Query: 536 DLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN--NDK 593
           + RY+HP Y ++L        PC DVFWFP+++E +    ++ ME YG+WS G +   D 
Sbjct: 530 EERYLHPNYSQNLNKSIPLKMPCNDVFWFPLMSETWATHMIEEMEHYGKWSGGKHEPQDA 589

Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
           RL  GYE VPT DIHM QVG    W   L+ Y+VP+  R F GY+ E  RA M+FVV+Y 
Sbjct: 590 RLNGGYENVPTVDIHMNQVGWEREWLHLLKTYIVPVNTRIFPGYYSEG-RAIMNFVVKYT 648

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
           P  Q  LRPHHDSSTYTINI LN+ G+ Y GGG RFIR +C VT T++GW LMHPGRLTH
Sbjct: 649 PSGQYYLRPHHDSSTYTINIGLNKPGIHYGGGGSRFIRQDCAVTDTQVGWALMHPGRLTH 708

Query: 714 YHEGLQVTQGTRYIMISFVDP 734
           YHEGL  T GTRYIM+ FVDP
Sbjct: 709 YHEGLPTTWGTRYIMVCFVDP 729


>gi|90079137|dbj|BAE89248.1| unnamed protein product [Macaca fascicularis]
          Length = 645

 Score =  580 bits (1495), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 281/651 (43%), Positives = 415/651 (63%), Gaps = 9/651 (1%)

Query: 87  LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
           +K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ + WPD  L D
Sbjct: 1   MKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGILWPDKRLAD 60

Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
           KYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D   R    I L
Sbjct: 61  KYPVVHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITL 120

Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
           D    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI LN FGNY+  
Sbjct: 121 DHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILLNYFGNYVPN 179

Query: 267 SW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L+YP + + +
Sbjct: 180 SWTQDNGCTLCEF-DTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDILLTLDYPKEALKL 238

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++N + YH      +    K   K +K +     ++  EARN+ ++     +  D+Y  
Sbjct: 239 FIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYSS 298

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G
Sbjct: 299 VDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQG 358

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIK-ATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
           ++   G+WNVPY+ N YL+K   ++   N +  +  + +D DMA C N R  G+ + I +
Sbjct: 359 NR--VGVWNVPYMANVYLIKGKTLRLEMNERNYFVRDKLDPDMALCRNAREMGVFMYISN 416

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + +  QPCPDVFW
Sbjct: 417 RHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFW 475

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FPI +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV L  VW  F+R
Sbjct: 476 FPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIR 535

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
           +++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++
Sbjct: 536 EFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQ 594

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 595 GGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 645


>gi|395833069|ref|XP_003789568.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           [Otolemur garnettii]
          Length = 942

 Score =  580 bits (1495), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 292/707 (41%), Positives = 427/707 (60%), Gaps = 52/707 (7%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNE 90
           DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++++GGG KV L+K  
Sbjct: 284 DKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINTIGGGQKVRLMKEV 343

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           +++    DD++IL T+ +DVI  GG  ++L++F   +  +VF A+ L WPD  L DKYP 
Sbjct: 344 MEQYANEDDLVILFTECFDVIFAGGPEEVLKKFQKTNHKVVFAADGLLWPDKRLADKYPI 403

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D   R    I LD   
Sbjct: 404 VHIGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREAINITLDHKC 463

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-K 269
            +FQ L G+++++ L F+  +   + NT Y T PV ++GNG +KI LN FGNY+  SW +
Sbjct: 464 KIFQTLNGAVDEVVLKFENGK-ARVKNTFYETLPVAVNGNGPTKILLNYFGNYIPNSWTQ 522

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
             GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L+YP + + +F++N
Sbjct: 523 DKGCTLCE-SDTIDLSAVDVHPNVTIGVFIEQPTPFLPRFLDLLLTLDYPKEALKVFIHN 581

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
            + YH      +    K     +K +     ++  EARN+ ++     +  D+YF VD+D
Sbjct: 582 KEVYHEKDIKVFFDKAKHEINTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSVDAD 641

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L NP  LK L+ +N                                            
Sbjct: 642 VVLTNPRTLKLLIEQN-------------------------------------------- 657

Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
           +G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R  G+ + I +  E+
Sbjct: 658 RGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMGVFMYISNRHEF 717

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
           G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  +++  QPCPDVFWFPI 
Sbjct: 718 GRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTESIVEQPCPDVFWFPIF 776

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           +EK C E V+ ME YGQWS G ++D R+  GYE VPT DIHMKQVGL  VW  F+R+++ 
Sbjct: 777 SEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQVGLESVWLHFIREFIA 836

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGC 687
           P+  + F GY+ +   A ++FVV+Y PD Q SLRPHHD+ST+TINIALN VG D++GGGC
Sbjct: 837 PVTLKVFAGYYTKGF-ALLNFVVKYSPDRQRSLRPHHDASTFTINIALNNVGEDFQGGGC 895

Query: 688 RFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           +F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 896 KFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 942


>gi|402852964|ref|XP_003891176.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Papio
           anubis]
          Length = 784

 Score =  580 bits (1494), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 298/764 (39%), Positives = 446/764 (58%), Gaps = 64/764 (8%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSK---------------- 254
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K                
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKYPPGARNTYLGACYEL 263

Query: 255 -------------------IELNSFGNYLAKSWK-TSGCTRCNL-IKHLDSLKPDQFPSV 293
                              ++LN  GNY+ + W   +GCT C+  ++ L  +  +  P+V
Sbjct: 264 TTSVLTSELSVVPSFPAVLLQLNYLGNYIPRFWTFETGCTVCDEGLRSLKGIGDEALPTV 323

Query: 294 LISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVK 353
           L+ +FI++PT F+  F  ++  L+YP K + +F++N++++H    ++++    + +++VK
Sbjct: 324 LVGMFIEQPTPFVSLFFQRLLQLHYPRKHMRLFIHNHEQHHKAQVEEFLAEHGSEYQSVK 383

Query: 354 YIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL 412
            +     + + +ARN+  +     +   +YF VD+D  L  P+ L+ L+ +N+++IAPL+
Sbjct: 384 LVGPEVRMANADARNMGADLCRQDRSCTYYFSVDADVALTEPNSLRLLIQQNKNVIAPLM 443

Query: 413 VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT- 471
            R  + WSNFWGAL+ADG+YARS DY++I+ G +   G+WNVPYI+N YL+K S ++   
Sbjct: 444 TRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR--IGVWNVPYISNIYLIKGSALRGEL 501

Query: 472 NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRN 531
               ++  + +D DMAFC N+R + + + + +    GHL+  +++     + +++E+  N
Sbjct: 502 QSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTLGHLLSLDSYRTTHLHNDLWEVFSN 561

Query: 532 PLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNN 591
           P DW  +YIH  Y K+L    V   PCPDV+WFPI TE  C E V+ ME +GQWS G N 
Sbjct: 562 PEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIFTEAACDELVEEMEHFGQWSLGDNK 620

Query: 592 DKRLETGYEAVPTR-DIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAP----- 645
              L  G      R  I       +  W  +      P         H  P R P     
Sbjct: 621 VGTLMPGLGPQGGRLSISATHCPGSPSWLLYQPTKKAPPPAVAGQRAHSLPSRKPVPHTC 680

Query: 646 ---------------MSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI 690
                          ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGCRF+
Sbjct: 681 ALLHPKQPSLDAQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGCRFL 740

Query: 691 RYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 741 RYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 784


>gi|301618077|ref|XP_002938453.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like
            [Xenopus (Silurana) tropicalis]
          Length = 1185

 Score =  575 bits (1482), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 290/720 (40%), Positives = 438/720 (60%), Gaps = 15/720 (2%)

Query: 24   NKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGY 82
            N + N+  DK LV+TVA+ ETDG+ RF+QSA      VK LG    W GGD++ ++GGG 
Sbjct: 472  NSIDNLPTDKLLVLTVATRETDGFHRFMQSARHFSYTVKVLGKGIEWKGGDVANTIGGGQ 531

Query: 83   KVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
            KV LLK  L+ ++  DD++IL TDSYDVI  GG  ++L +F   +  +VF AE L WPD 
Sbjct: 532  KVRLLKEALESLEDQDDLVILFTDSYDVIFAGGPEEVLLKFQQSNHKVVFAAEGLIWPDK 591

Query: 143  SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKH 202
            SL + YP +   + +L   GFIGY  ++K+++    +++ +DDQL+Y  +++D+  R   
Sbjct: 592  SLKETYPFITFLFLFLCVAGFIGYLPNVKQIVQQWDLQDNDDDQLFYTKIYIDQIQRESI 651

Query: 203  KIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGN 262
             I LD  + LFQN+ G+L+++ L F+ D    + N++Y++ PV+IHGNG +K++LN FGN
Sbjct: 652  SITLDHKSTLFQNINGALDEVILAFE-DGKARVKNSQYDSLPVLIHGNGPTKLQLNYFGN 710

Query: 263  YLAKSWK-TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAK 321
            Y+   W   +GC  C+L    D    + +P V + +FI++PT FL EF N++  L+YP +
Sbjct: 711  YIPNVWAPETGCGTCDL-DTTDLSTANAYPKVTVGIFIEQPTPFLPEFFNRLLALDYPKE 769

Query: 322  KISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVD 380
             ++ F++N++ YH      +    K +  N+K +     +   EARN+ +      K  D
Sbjct: 770  NMNFFIHNSEVYHEQHIVKFWEQAKNVIGNLKVVGPEEPIMQAEARNMGMNTCRQDKECD 829

Query: 381  FYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMN 440
            +YF +D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL  DG+YARS DY++
Sbjct: 830  YYFNIDADVMLTNPQTLKILIEQNRKIIAPLVTRHGKLWSNFWGALTPDGYYARSEDYVD 889

Query: 441  IINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHL 499
            I+ G +   G+WN+PY+ + YL+K   +++    + +++L+ +D DMA C N R  G+ +
Sbjct: 890  IVQGKR--VGLWNMPYVAHVYLIKGETLRSEMKERNLFSLDRLDPDMALCRNAREMGVFM 947

Query: 500  KIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCP 559
             I +  E+G L+ + N++    N +++++  NP+DW  +YI+  Y K +    +  QPCP
Sbjct: 948  YITNRHEFGRLLSTANYNTTHYNNDLWQIFENPVDWREKYINANYSK-IFTQNIVEQPCP 1006

Query: 560  DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
            DVFWFP+++EK C E V+ ME +GQWS   + D R+  GYE VPT DIHM Q+GL   W 
Sbjct: 1007 DVFWFPVLSEKACDELVEEMENFGQWSGSAHTDTRIAGGYENVPTDDIHMNQIGLDNEWL 1066

Query: 620  EFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQ----PSLRPHH-DSSTYTINIA 674
             F+R+Y+ P+  + F GY+ +   A ++F     P       P L     D     +   
Sbjct: 1067 HFIREYIAPITLKVFAGYYTKG-HALLNFXXXXXPCPDVFWFPVLSEKACDELVEEMENF 1125

Query: 675  LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
                G  + GGGCRF RYNC++ + R GW  MHPGRLTH HEGL VT GTRYI +SF+DP
Sbjct: 1126 GQWSGSAHTGGGCRFARYNCSIESPRKGWSFMHPGRLTHLHEGLPVTNGTRYIAVSFIDP 1185


>gi|358254467|dbj|GAA55391.1| lysyl hydroxylase/galactosyltransferase/ glucosyltransferase
           [Clonorchis sinensis]
          Length = 694

 Score =  570 bits (1469), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 304/710 (42%), Positives = 439/710 (61%), Gaps = 29/710 (4%)

Query: 33  KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELD 92
           + LV+TVA+   D  +RF++SA  N+  VK              S+GGG K+NLL++EL 
Sbjct: 6   ELLVLTVATETNDALERFLRSANNNQFNVK--------------SVGGGQKINLLRDELR 51

Query: 93  EMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG 152
                DD++IL  DSYDV+       ++E +   +  I+FGAE  CWPD SL   YP VG
Sbjct: 52  SHINLDDLLILFLDSYDVVFMDSKFRLVEEYENSNHTILFGAESFCWPDQSLEKMYPQVG 111

Query: 153 SGY-RYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
               R+LNSGGFIG A  +  +++   IK ++DDQLYY  ++L+E LR +  I LDT ++
Sbjct: 112 PKENRFLNSGGFIGPASSLYRMVTEMPIKEDDDDQLYYTKIYLNEALRRELNIGLDTRSS 171

Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
           +FQNL G+L+DI+L+F  D   +L NTK  T P++ HGNG  K E NS  NYL  +W  T
Sbjct: 172 IFQNLNGALQDIELHFTNDT-GYLVNTKTGTRPIVAHGNGPIKPEFNSLTNYLDHNWTPT 230

Query: 271 SGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM--FVY 328
            GC  C+  +++D  +  +FP++ +S+FI+ PT FL+ F ++IA L+YP   I +   V 
Sbjct: 231 QGCQHCSE-RNIDLDEQGEFPNIQLSIFIENPTPFLDVFFDRIAALSYPKSHIHLTGHVA 289

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
              E    L D +   +   + +V +       +   AR+ A  + L      F F VDS
Sbjct: 290 KKAEKQRALADTFNKTYGHEYLSVSWFDAEEVTDEAIARDYAYAHCLALDTCKFLFSVDS 349

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
              L NP  +++L+  N S+IAP+L R  K WSNFWGAL+ DG+Y RS DY+ I+  ++ 
Sbjct: 350 TVQLTNPRTIEHLIQMNRSMIAPMLSRRGKLWSNFWGALSRDGYYERSDDYIEIV--ERN 407

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
             GIWNVP+I + YL+   V++      +    S+D +M      R + + + +D+ + Y
Sbjct: 408 RTGIWNVPFIRDAYLLSRRVVRKFAEHKL--AGSIDVEMRIPAIARQENVFMTVDNLEPY 465

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPD---TVNNQPCPDVFWF 564
           G+LV  + +     N +++++  NPLDW+ +Y+HP+Y K   P+   T   QPCPDVF+F
Sbjct: 466 GYLVFPDTYTTDHLNNDLWQIFDNPLDWEEQYVHPDYFKISNPEVKMTDIEQPCPDVFYF 525

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           PIV+ KFC + +  +E +G WSDGTN D RLE GYE VPTRDIHM+Q+     W  FL K
Sbjct: 526 PIVSAKFCRQLIAEVEEFGLWSDGTNIDPRLEGGYENVPTRDIHMRQINWEDHWLHFLVK 585

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           YV P+Q++ F GY  +P  A M+FVVRYRPDEQ SLRPHHD+S+Y++ IALN+  V+++G
Sbjct: 586 YVHPIQKKVFAGYDDKPW-ARMNFVVRYRPDEQSSLRPHHDASSYSLTIALNEAEVEFQG 644

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GG RF+RYNC++  +++GW  M PGR+TH HEGL  T GTRYI ++FV+P
Sbjct: 645 GGTRFVRYNCSLVRSKLGWTSMFPGRVTHLHEGLITTSGTRYIFVTFVNP 694


>gi|170590254|ref|XP_001899887.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase precursor, putative
           [Brugia malayi]
 gi|158592519|gb|EDP31117.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase precursor, putative
           [Brugia malayi]
          Length = 688

 Score =  568 bits (1465), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 299/741 (40%), Positives = 448/741 (60%), Gaps = 61/741 (8%)

Query: 2   LSNLHLNCLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQV 61
           ++ + L  L LS V+ + +V   K+  + E   LV+TVA+ ETDG +R  ++A++N + +
Sbjct: 1   MTGMTLWVLTLSTVLMYGTVTMEKISGMPE--LLVVTVATEETDGLRRLKRTADINDVGL 58

Query: 62  KTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDIL 120
           +  G+ + W GGD+    GGG K+ +L+  L++    +D+IIL  D+YDVI+ G    IL
Sbjct: 59  EVFGMGEQWRGGDVRVDKGGGQKIRILRKSLEKYKDRNDLIILFVDAYDVILLGNEEQIL 118

Query: 121 ERFNTF--DANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRS 178
             F TF     +VF +E  CWP+ SL  KYP V  GYRYLNSG F+G+A +I  LIS + 
Sbjct: 119 RNFFTFFDGFRLVFSSEPFCWPNRSLAPKYPLVNFGYRYLNSGVFMGFAPEIWNLISYKD 178

Query: 179 IKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNT 238
           +++ +DDQLYY  L+LDE +R   K+ LD+++ LFQNL G+  D+KL    DE       
Sbjct: 179 VEDNDDDQLYYTRLYLDEQIRMSLKMTLDSMSILFQNLNGASNDVKLEMS-DE------- 230

Query: 239 KYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVF 298
                                 G Y                     L+  + P + +SV 
Sbjct: 231 --------------------RSGTYF-------------------DLEKIELPRLFLSVI 251

Query: 299 IDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHN 358
           I KP  F+ EF   I +L Y  +KI ++VY NQ +     + ++ + K  ++++ Y    
Sbjct: 252 ISKPIPFIREFFENIKSLVYADEKIDLYVYCNQNFLEKETNGFVEDVKGRYRSLLYDGST 311

Query: 359 STVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNR----NESLIAPLLVR 414
           + +  +EAR  +++ SL  G D+   +D D HL+N + L  +++R    +  ++APL+ +
Sbjct: 312 TELGEREARAFSLKQSLALGDDYLIMIDGDVHLNNSEALLLMIHRVKEKDSEILAPLVGQ 371

Query: 415 PFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK 474
           P K ++NFWGA++++G+YARS +Y++II  D    GIWNVP+I++  ++     K T++ 
Sbjct: 372 PHKLFTNFWGAISSNGYYARSENYLDII--DYKEVGIWNVPFISSILIIAKE--KLTSLS 427

Query: 475 TIYTLN-SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
             Y  N  +D DM+FC+  R+KG  L +D++  YG LV SE+ +  K +P++YE+  N  
Sbjct: 428 NAYYYNDKLDPDMSFCSFARDKGHFLYLDNSHYYGFLVVSEDVESSKVHPDMYEIFNNKE 487

Query: 534 DWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDK 593
            W+ RYIHP Y  +L       + C DV+ FP+++E+FC E ++  E YG+WSDG + D+
Sbjct: 488 LWEKRYIHPNYFAALNGSIQILEICQDVYDFPLMSERFCAELIEECEYYGKWSDGKHKDE 547

Query: 594 RLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYR 653
           RL  GYE VPTRDIHM Q+G    W   L +YV P+QE+ FIGY+ +PV + M FVVRY+
Sbjct: 548 RLVGGYENVPTRDIHMNQIGFERHWLYMLDEYVRPIQEKLFIGYYKQPVESVMMFVVRYK 607

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTH 713
           P+EQ SLRPHHD+STY+I+IALN+ GVDYEGGG RF+RYNC   A  +G  ++ PGRLTH
Sbjct: 608 PEEQASLRPHHDASTYSIDIALNKRGVDYEGGGVRFLRYNCTFDADTVGHSMIFPGRLTH 667

Query: 714 YHEGLQVTQGTRYIMISFVDP 734
            HEGL+ TQGTRYI +SF++P
Sbjct: 668 LHEGLETTQGTRYIAVSFINP 688


>gi|312373903|gb|EFR21571.1| hypothetical protein AND_16831 [Anopheles darlingi]
          Length = 902

 Score =  567 bits (1462), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 282/604 (46%), Positives = 403/604 (66%), Gaps = 16/604 (2%)

Query: 35  LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDEM 94
           LV TVASNET+GY R+I+SA    + VKTLGL +PWLGGDM S+GGGYK+NLL+  L   
Sbjct: 274 LVFTVASNETEGYLRYIRSANHYGISVKTLGLGKPWLGGDMKSVGGGYKINLLREALKPY 333

Query: 95  DITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV-GS 153
               + ++L TDSYDV+       I+++F TF+A++VFGAE  CWPD SL   YP + G 
Sbjct: 334 RKESERLVLFTDSYDVVFLMPWEKIVQKFLTFNASVVFGAEGFCWPDESLKSLYPPLEGR 393

Query: 154 GYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLF 213
           G R+LNSG F+GYA  +  ++   S K+ +DDQLYY  ++LD+ LR +  I LD +A+LF
Sbjct: 394 GMRFLNSGLFMGYADKLYLMLKTPS-KDTDDDQLYYTNVYLDKQLRNELNIKLDHMASLF 452

Query: 214 QNLYGSLEDIKLNFDLDEF-VHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSG 272
           QNL G  E + L+ +  E    L NT+Y + P ++HGNG SK+ LNS+ NYLA ++    
Sbjct: 453 QNLNGVEEQVILSLEPSEAEATLKNTEYTSKPAVVHGNGPSKLALNSYANYLAGAFLDGV 512

Query: 273 CTRCNLIK-HLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQ 331
           C      +  LD  K  + P V +++FI+KPT FLEE+ +KI  LNYP  ++ + V++  
Sbjct: 513 CKTVEENRIQLDDEK--ELPLVTMALFIEKPTPFLEEWFDKITKLNYPGDRLDVLVHSGV 570

Query: 332 EYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHL 391
            YH P+   ++   +  ++++K I+H+       AR  A ++   +G D+ F VDS+ HL
Sbjct: 571 AYHEPVVKAFLSQQEGRYRSLKSISHSDDHKEAVARAFATKHCRQRGCDYLFVVDSEGHL 630

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK-- 449
           DNP+VL+ L+  N ++I+P+L RP K WSNFWGAL+  GFYARS DYM+I+    G K  
Sbjct: 631 DNPNVLRALIEANRNVISPVLTRPEKVWSNFWGALSNQGFYARSNDYMDIV----GRKLL 686

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           G+WNVP+I+  YL+K S++   +    Y+L   D DMA C + R+KGI + + + ++YGH
Sbjct: 687 GLWNVPFISIVYLIKRSILPDVS----YSLKETDPDMAMCWHFRSKGIFMHVINMEQYGH 742

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           L+D+E FD  +T+P+ Y+L  N  DW+ RY+ PEYQ+ L    V  QPCPDV+WF + T+
Sbjct: 743 LIDTEYFDMDRTHPDFYQLFNNRHDWEQRYLSPEYQQQLETTFVPKQPCPDVYWFAVGTD 802

Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
           +FC +  +I+EA+G+WSDGT+ND RL+ GYEAVPTRDIHM QVGL  VW +FL+ Y+ PL
Sbjct: 803 RFCDDLKEIVEAFGKWSDGTHNDNRLQGGYEAVPTRDIHMNQVGLEQVWLKFLQLYIRPL 862

Query: 630 QERE 633
           QE++
Sbjct: 863 QEKD 866



 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 38/68 (55%), Positives = 47/68 (69%), Gaps = 5/68 (7%)

Query: 670 TINIALNQVGVDYEGGGCRFIRY---NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
           T +I +NQVG+  E    +F++          TR GW+LMHPGRLTH+HEGL  T+GTRY
Sbjct: 837 TRDIHMNQVGL--EQVWLKFLQLYIRPLQEKDTRKGWLLMHPGRLTHFHEGLLTTKGTRY 894

Query: 727 IMISFVDP 734
           IMISFVDP
Sbjct: 895 IMISFVDP 902


>gi|146231842|gb|ABQ12996.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3 precursor [Bos
           taurus]
          Length = 677

 Score =  566 bits (1460), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 272/637 (42%), Positives = 413/637 (64%), Gaps = 11/637 (1%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY+RF+QSAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 47  VNPEKMLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 106

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DM+I+  DSYDV++ G  +++L++F    + ++F AE  CWP+  L ++
Sbjct: 107 KKEMEKYAEREDMVIMFVDSYDVVLAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 166

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 167 YPEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLGLSLD 226

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L F  +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 227 HKSRIFQNLNGALDEVVLKFGRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 285

Query: 268 W-KTSGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  ++++
Sbjct: 286 WTPEGGCGFCNQGRRPLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTL 343

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFY 384
           F++NN+ YH P  D+     +  F  VK +     +   EAR++A++        +FYF 
Sbjct: 344 FLHNNEVYHEPHIDESWPQLQDHFSAVKLVGPEEALTPGEARDMAMDICRQDPKCEFYFS 403

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ + NP  L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++  
Sbjct: 404 LDADTVITNPQTLRILIEANRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR 463

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC +LR+KGI L + +
Sbjct: 464 KR--VGVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDMAFCKSLRDKGIFLHLSN 521

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L  + +  QPCPDV+W
Sbjct: 522 QHEFGRLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYTRALEGEGLVEQPCPDVYW 581

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FP+++E+ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR
Sbjct: 582 FPLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLR 641

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSL 660
            YV P+ E  F GYH +  RA M+FVVRYRPDEQPSL
Sbjct: 642 TYVGPMTESLFPGYHTK-TRAVMNFVVRYRPDEQPSL 677


>gi|441649934|ref|XP_003276658.2| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
           5-dioxygenase 3 [Nomascus leucogenys]
          Length = 682

 Score =  564 bits (1454), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 296/712 (41%), Positives = 420/712 (58%), Gaps = 69/712 (9%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 34  VNPEKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWL 93

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+ G   ++L++F    + ++F AE  CWP+  L ++
Sbjct: 94  KKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQ 153

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 154 YPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLD 213

Query: 208 TLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKS 267
             + +FQNL G+L+++ L FD +  V + N  Y+T PV++HGNG +K++LN  GNY+   
Sbjct: 214 HKSRIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNG 272

Query: 268 WKT-SGCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISM 325
           W    GC  CN  +  L   +P   P V ++VF+++PT FL  FL ++            
Sbjct: 273 WTPEGGCGFCNQDRRTLPGGQPP--PRVFLAVFVEQPTPFLPRFLQRL------------ 318

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
                      L  DY  +  T+F       HN+ V                        
Sbjct: 319 -----------LLLDYPPDRVTLF------LHNNEV------------------------ 337

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNA--DGFYARSFDYMNIIN 443
                   P +    +   +   A  LV P +A S       A  D +Y RS DY+ ++ 
Sbjct: 338 -----FHEPHIADSWLQLQDHFSAVKLVGPEEALSPGEARDMAIPDEYYXRSEDYVELVQ 392

Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKID 502
             +   G+WNVPYI+  Y+++   ++     K +++ +  D DMAFC + R+K I L + 
Sbjct: 393 RKR--VGVWNVPYISQAYVIRGDTLRTEVPQKDVFSGSDTDPDMAFCKSFRDKCIFLHLS 450

Query: 503 STQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
           +  E+G  + +  +D +  +P          DW  +YIH  Y ++L  + +  QPCPDV+
Sbjct: 451 NQYEFGWFLATSRYDTEHLHPXXXXXXXXXXDWKEQYIHENYSRALEGEGIVEQPCPDVY 510

Query: 563 WFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFL 622
           WFP+++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + L
Sbjct: 511 WFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLL 570

Query: 623 RKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDY 682
           R YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DY
Sbjct: 571 RTYVGPMTESLFPGYHTKGARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDY 630

Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           EGGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T GTRYIM+SFVDP
Sbjct: 631 EGGGCRFLRYDCMISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 682


>gi|348581630|ref|XP_003476580.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
           5-dioxygenase 2-like [Cavia porcellus]
          Length = 714

 Score =  560 bits (1443), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 299/740 (40%), Positives = 434/740 (58%), Gaps = 74/740 (10%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +  +  K  +I  DK LVITVA+ E DGY RF+QSA+     VK LG  + W GGD  +S
Sbjct: 25  VGENAEKPASIPTDKLLVITVATKENDGYHRFMQSAKYFNYTVKVLGQGEEWKGGDGFNS 84

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     +DM+IL T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 85  IGGGQKVRLMKEVMEHYANQEDMVILFTECFDVIFAGGPEEVLKKFQKTNHKVVFAADGI 144

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L D+YP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D  
Sbjct: 145 LWPDKRLADRYPVVHIGKRYLNSGGFIGYAPYINRIVQRWNLQDNDDDQLFYTKIYIDPL 204

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+ +++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 205 QREAFNITLDHKCKIFQALNGATDEVVLKFENGK-TRAKNTFYETLPVAINGNGPTKILL 263

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C            +F ++ +S  +D+               
Sbjct: 264 NYFGNYVPNSWTQDNGCTLC------------EFDTIDLSA-VDE--------------- 295

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
                     VY+ ++  A  FD   H   T    +K +     ++  +ARN+ ++    
Sbjct: 296 ----------VYHEKDIKA-FFDKAKHEIST----IKIVGPEEDLSQAKARNMGMDFCRQ 340

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF +D+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 341 DEKCDYYFSLDADVVLTNPRTLKLLIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 400

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRN- 494
            DY++I+ G +   G  NVPY+ N YL + ++   + +   Y +       A C N R  
Sbjct: 401 EDYVDIVQGKRVAYG--NVPYMANVYLXRETL--RSEMMKDYFVRDRWIXYALCRNAREC 456

Query: 495 --------------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLD 534
                               KG+ + I +  E+G L+ + N++    N +++++  NP+D
Sbjct: 457 SLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVD 516

Query: 535 WDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKR 594
           W  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R
Sbjct: 517 WKEQYINRDYSK-IFTENLVEQPCPDVFWFPIFSEKACDELVEEMENYGQWSGGKHHDSR 575

Query: 595 LETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRP 654
           +  GYE VPT DIHMKQ+GL  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P
Sbjct: 576 ISGGYENVPTDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSP 634

Query: 655 DEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHY 714
           + Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH 
Sbjct: 635 ERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHL 694

Query: 715 HEGLQVTQGTRYIMISFVDP 734
           HEGL V  GTRYI +SF+DP
Sbjct: 695 HEGLPVKNGTRYIAVSFIDP 714


>gi|324502700|gb|ADY41187.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase [Ascaris suum]
          Length = 731

 Score =  558 bits (1439), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 283/737 (38%), Positives = 453/737 (61%), Gaps = 22/737 (2%)

Query: 9   CLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQ 68
            + LS  +F + +  N   ++      V+TV     D  +R  +SA  +++Q+  L   Q
Sbjct: 6   VVALSLSLFILRLDVNAATSLH-----VVTVVIEHQDALERLQRSANAHEIQLNILRHDQ 60

Query: 69  PWLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF--NTF 126
                  S LGGG K+ +L++ L+      D+I+L  D+   II+G   +IL+RF  +  
Sbjct: 61  L---ASSSHLGGGEKLRILRDGLEIYKDRSDLILLYVDANKAIINGRGEEILKRFMDSYS 117

Query: 127 DANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQ 186
           ++ IVF ++  C+PD  L  +YP V  G R+LNS  FIGYA  I EL++++S++N  D+Q
Sbjct: 118 NSQIVFSSDNYCFPDEELTQRYPIVEKGKRFLNSAAFIGYANKIWELLNSQSLENINDEQ 177

Query: 187 LYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVI 246
           ++Y   FLDE LR + ++VLD+ + +F ++  S ++I L+F  +   ++TN  + T+P+I
Sbjct: 178 IFYTHRFLDERLRNRLQMVLDSTSQIFHSVDVSKDEITLDFSDNGDAYITNVIHKTHPLI 237

Query: 247 IHGNGKSKIELNSFGNYLAKSWKTS-GCTRCNL--IKHLDSLKPDQFPSVLISVFIDKPT 303
           IHG+  +K+ LN  GNY+ K+W    GC  C+   +  L      ++P + +++ + KP 
Sbjct: 238 IHGDESNKLMLNYLGNYIGKAWSADFGCRDCSAQRVNFLKDNAEQEWPKLTLAIMLAKPI 297

Query: 304 AFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNS 363
            F+EEFL K+  L YPA KI +++Y+NQ+Y+    ++++   +  +  V++ +    +  
Sbjct: 298 PFVEEFLTKVEKLEYPASKIDLYLYSNQKYNEREVNEFLRRVRGKYSWVEWDSGEVEIGE 357

Query: 364 KEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNR----NESLIAPLLVRPFKAW 419
           +EAR  A++ ++    DF F +D++ H  + +V+++++      N  ++AP++ +P K +
Sbjct: 358 REARRTAIDAAIKANNDFVFLLDANVHFVDLNVIRWIIESALTMNLGILAPMVGKPNKFF 417

Query: 420 SNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTL 479
           +NFWGA++  G+Y RS DY  I+N  +   G+WNVP+I++  L+    ++       Y  
Sbjct: 418 TNFWGAISPSGYYQRSEDYTEIVNYKR--VGVWNVPFISSAILINKQKMREIRDGFFYNT 475

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFD--PQKTNPEVYELIRNPLDWDL 537
           + +D D++FC   R+    L +D+ + YG L DSE FD   +  +PE+Y++  N   W+ 
Sbjct: 476 D-VDADLSFCQFARDNDHFLYVDNQRYYGFLADSETFDNGGKHLHPEMYQIFENRHLWES 534

Query: 538 RYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET 597
           RY+HP+Y  +L       QPCPDV+ +P+++E F  E ++ ME +GQWSDG N D+RL  
Sbjct: 535 RYVHPDYFGALDGSGEIAQPCPDVYHYPLMSEIFARELIEEMENFGQWSDGKNEDERLAG 594

Query: 598 GYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQ 657
           GYE VPT DIHM Q+     W  FL +YV P+QE+ FIGY+ +PV A M FVVRY+P EQ
Sbjct: 595 GYENVPTIDIHMNQIDFQREWLYFLDEYVRPMQEKLFIGYYQKPVEALMMFVVRYQPGEQ 654

Query: 658 PSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEG 717
           PSLR HHD+STYTI++ LN+ G DYEGGG R++RYNC V A ++G+  M PGRLTH HEG
Sbjct: 655 PSLRAHHDASTYTIDVPLNKRGRDYEGGGVRYVRYNCTVAADQVGYAAMFPGRLTHLHEG 714

Query: 718 LQVTQGTRYIMISFVDP 734
           L VT+G RYI +SF++P
Sbjct: 715 LPVTKGIRYIAVSFLNP 731


>gi|350591622|ref|XP_003132514.3| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
           5-dioxygenase 2, partial [Sus scrofa]
          Length = 645

 Score =  556 bits (1433), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 274/651 (42%), Positives = 406/651 (62%), Gaps = 30/651 (4%)

Query: 108 YDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYA 167
           +DVI  GG  ++L++F   +  +VF A+ + WPD  L DKYP V  G RYLNSGGFIGYA
Sbjct: 1   FDVIFAGGPEEVLKKFQKSNHKVVFSADGILWPDKRLADKYPIVHIGKRYLNSGGFIGYA 60

Query: 168 KDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNF 227
             I  ++   ++++ +DDQL+Y  +++D   R    I LD    +FQ L G+++++ L F
Sbjct: 61  PYINRIVQQWNLQDNDDDQLFYTKIYIDPLKREALNITLDHKCKIFQTLNGAVDEVVLKF 120

Query: 228 DLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLK 286
           +  +     N  Y T PV I+GNG +KI LN FGNY+  +W + +GCT C+ +  +D   
Sbjct: 121 ENGK-ARAKNVFYETLPVAINGNGPTKILLNYFGNYVPNAWTQDNGCTLCD-VDTIDLSA 178

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFK 346
            D  P+V I VFI++PT FL  FL+ +  L+YP + + +F++N + YH      +    K
Sbjct: 179 VDVHPNVTIGVFIEQPTPFLPRFLDTLLTLDYPKEALKLFIHNKEVYHEKNIKVFFDKAK 238

Query: 347 TMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNE 405
                +K +     ++  EARN+ ++     +  ++YF VD+D  L NP  LK L+ +N 
Sbjct: 239 HEITTIKIVGPEENLSQAEARNMGMDFCRQDENCNYYFSVDADVVLTNPRTLKILIEQNR 298

Query: 406 SLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKT 465
            +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G++   GIWNVPY+ N YL+K 
Sbjct: 299 KIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGIWNVPYMANVYLIKG 356

Query: 466 SVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDS 503
             +++  N +  +  + +D DMA C N R                      KG+ + + +
Sbjct: 357 KTLRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYVSN 416

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             E+G L+ + N++    + +++++  NP+DW  +YI+ +Y K +  + +  QPCPDVFW
Sbjct: 417 RHEFGRLLSTANYNISHYHNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFW 475

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           FPI +EK C E V+ ME YGQWS G ++D R+  GYE VPT DIHMKQ+ L  VW  F+R
Sbjct: 476 FPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIR 535

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
           +++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++
Sbjct: 536 EFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQ 594

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 595 GGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 645


>gi|395734248|ref|XP_002814197.2| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
           5-dioxygenase 2 [Pongo abelii]
          Length = 909

 Score =  552 bits (1423), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 279/674 (41%), Positives = 410/674 (60%), Gaps = 15/674 (2%)

Query: 66  LHQPWLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNT 125
           +HQP+ G    ++    K +    +  + +  +        S  VI  GG     ++F  
Sbjct: 246 IHQPFRGAGQVTV----KSDFTGEQFYDEECHESFHKWECLSLIVIFAGGPEKFXKKFLK 301

Query: 126 FDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDD 185
            +  +VF A+ + WPD  L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DD
Sbjct: 302 ANHKVVFAADGILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDD 361

Query: 186 QLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPV 245
           QL+Y  +++D   R    I LD    +FQ L G+++++ L F+  +     NT Y T PV
Sbjct: 362 QLFYTKIYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPV 420

Query: 246 IIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTA 304
            I+GNG +KI LN FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT 
Sbjct: 421 AINGNGPTKILLNYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTP 479

Query: 305 FLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSK 364
           FL  FL+ +  L+YP + + +F++N + YH      +    K   K +K       ++  
Sbjct: 480 FLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIQVFFDKAKHEIKTIKLGXPQKNLSQA 539

Query: 365 EARNLAVENSLHK---GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSN 421
           EA+         +     D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSN
Sbjct: 540 EAQKHGNGWDFCRQDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSN 599

Query: 422 FWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLN 480
           FWGAL+ DG+YARS DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  +
Sbjct: 600 FWGALSPDGYYARSEDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRD 657

Query: 481 SMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI 540
            +D DMA C N R  G+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI
Sbjct: 658 KLDPDMALCRNAREMGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYI 717

Query: 541 HPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYE 600
           + +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+  GYE
Sbjct: 718 NRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYE 776

Query: 601 AVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSL 660
            VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SL
Sbjct: 777 NVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSL 835

Query: 661 RPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQV 720
           RPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEGL V
Sbjct: 836 RPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPV 895

Query: 721 TQGTRYIMISFVDP 734
             GTRYI +SF+DP
Sbjct: 896 KNGTRYIAVSFIDP 909


>gi|77745212|gb|ABB02507.1| procollagen-lysine 2-oxoglutarate 5-dioxygenase 2 [Sus scrofa]
          Length = 640

 Score =  551 bits (1420), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 271/645 (42%), Positives = 402/645 (62%), Gaps = 30/645 (4%)

Query: 114 GGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKEL 173
           GG  ++L++F   +  +VF A+ + WPD  L DKYP V  G RYLNSGGFIGYA  I  +
Sbjct: 2   GGPEEVLKKFQKSNHKVVFSADGILWPDKRLADKYPIVHIGKRYLNSGGFIGYAPYINRI 61

Query: 174 ISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFV 233
           +   ++++ +DDQL+Y  +++D   R    I LD    +FQ L G+++++ L F+  +  
Sbjct: 62  VQQWNLQDNDDDQLFYTKIYIDPLKREALNITLDHKCKIFQTLNGAVDEVVLKFENGK-A 120

Query: 234 HLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPS 292
              N  Y T PV I+GNG +KI LN FGNY+  +W + +GCT C+ +  +D    D  P+
Sbjct: 121 RAKNVFYETLPVAINGNGPTKILLNYFGNYVPNAWTQDNGCTLCD-VDTIDLSAVDVHPN 179

Query: 293 VLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNV 352
           V I VFI++PT FL  FL+ +  L+YP + + +F++N + YH      +    K     +
Sbjct: 180 VTIGVFIEQPTPFLPRFLDTLLTLDYPKEALKLFIHNKEVYHEKNIKVFFDKAKHEITTI 239

Query: 353 KYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPL 411
           K +     ++  EARN+ ++     +  ++YF VD+D  L NP  LK L+ +N  +IAPL
Sbjct: 240 KIVGPEENLSQAEARNMGMDFCRQDENCNYYFSVDADVVLTNPRTLKILIEQNRKIIAPL 299

Query: 412 LVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA- 470
           + R  K WSNFWGAL+ DG+YARS DY++I+ G++   GIWNVPY+ N YL+K   +++ 
Sbjct: 300 VTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNR--VGIWNVPYMANVYLIKGKTLRSE 357

Query: 471 TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKIDSTQEYGH 509
            N +  +  + +D DMA C N R                      KG+ + + +  E+G 
Sbjct: 358 MNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYVSNRHEFGR 417

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           L+ + N++    + +++++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWFPI +E
Sbjct: 418 LLSTANYNISHYHNDLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSE 476

Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPL 629
           K C E V+ ME YGQWS G ++D R+  GYE VPT DIHMKQ+ L  VW  F+R+++ P+
Sbjct: 477 KACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDLENVWLHFIREFIAPV 536

Query: 630 QEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF 689
             + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F
Sbjct: 537 TLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKF 595

Query: 690 IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           +RYNC++ + R GW  MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 596 LRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 640


>gi|296479150|tpg|DAA21265.1| TPA: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 precursor
           [Bos taurus]
          Length = 667

 Score =  545 bits (1404), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 265/649 (40%), Positives = 414/649 (63%), Gaps = 10/649 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W G  M + GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWPGEAMLA-GGGLKVRLLKKA 83

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    ++++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L   YP 
Sbjct: 84  LEKHADKENLVILFTDSYDVVFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEANYPV 143

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 144 VSDGKRFLGSGGFIGYAPNLIKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 203

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQN +G+L+++ L F++ + V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 204 RIFQNFHGALDEVVLKFEMGQ-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 262

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GC  C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L+YP K+  +F++
Sbjct: 263 ETGCAVCDEGLRSLKGIGDEALPAVLVGVFIEQPTPFLSLFFQRLLLLHYPQKRFRLFIH 322

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDS 387
           N++++H    + ++      +++VK +     V + +ARN+  +     +G  +YF VD+
Sbjct: 323 NHEQHHKAQVEQFLAEHGDEYQSVKLVGPEVRVANADARNMGADLCRQDRGCTYYFSVDA 382

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D  L  P  L+ L+ +N+++I PL+ R  + WSNFWGAL+ADG+YARS DY++I+ G + 
Sbjct: 383 DVALTEPKTLRLLIEQNKNVITPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR- 441

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKT-IYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
             G+WNVPYI+N YL+K S ++A   +T ++  + +D DMAFC N+R + + + + +   
Sbjct: 442 -VGVWNVPYISNIYLIKGSALRAELQETDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHS 500

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GHL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI
Sbjct: 501 FGHLLSLDSYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKMV-EMPCPDVYWFPI 559

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYV 626
            TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+
Sbjct: 560 FTETACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQINYEREWHKFLVEYI 619

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIAL 675
            P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINI L
Sbjct: 620 APMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLVPHHDASTFTINIGL 667


>gi|358254466|dbj|GAA55390.1| lysyl hydroxylase/galactosyltransferase/ glucosyltransferase
           [Clonorchis sinensis]
          Length = 623

 Score =  542 bits (1396), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 281/628 (44%), Positives = 401/628 (63%), Gaps = 23/628 (3%)

Query: 119 ILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG-SGYRYLNSGGFIGYAKDIKELISNR 177
           +LE +   +  ++FGAE  CWPD +L D YP VG    R+LNSGGFIG A  +  +++  
Sbjct: 7   LLEEYEKSNYTVLFGAEGFCWPDKNLADMYPQVGPREKRFLNSGGFIGPASHLYRIVTET 66

Query: 178 SIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTN 237
            I ++ DDQLYY  ++L+  LR +  I LDT + +FQNL+G+  +++L+F  D   +L N
Sbjct: 67  EIADDRDDQLYYTNIYLNRALREQLNIGLDTKSLIFQNLHGAFTEVELHFTNDT-GYLVN 125

Query: 238 TKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTS-GCTRCNL-IKHLDSLKPDQFPSVLI 295
           TK NT P++ HGNG  K E NS  NYL  SW  S GC  CN  I  LD  K  +FP++ +
Sbjct: 126 TKTNTRPIVAHGNGPIKPEFNSLTNYLDHSWTPSMGCQHCNEGIIDLD--KQGEFPTIQL 183

Query: 296 SVFIDKPTAFLEEFLNKIANLNYPAKKISM--FVYNNQEYHAPLFDDYIHNFKTMFKNVK 353
           S+FI+ PT FL+ F ++IA L+YP   I +   +    E   PL +++I      + ++K
Sbjct: 184 SIFIEYPTPFLDVFFDRIAALSYPKTHIHLTGHIGRKAEQQTPLVNEFIKKHGHNYLSIK 243

Query: 354 YIAHNSTVNSKEARNLAVENSLHKGVD---FYFYVDSDSHLDNPDVLKYLVNRNESLIAP 410
           +   +  V+   AR+ A  + L   VD   F F VD+   L NP  L++L+  N S+IAP
Sbjct: 244 WFYPDELVDEGSARDHAYAHCL--AVDTCRFMFSVDAVVQLTNPHTLEHLIRMNRSMIAP 301

Query: 411 LLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA 470
           +L R  K WSNFWGAL+ DG+Y RS DY+ I+  ++   GIWNVP+I + YL+     +A
Sbjct: 302 MLSRREKLWSNFWGALSRDGYYERSDDYIEIV--ERKRVGIWNVPFIRDAYLLSR---RA 356

Query: 471 TNIKTIYTLNSMD-YDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELI 529
            ++     L+ +D  +M   +  R + I + +D+ + YG+LV +E +  +  N +++++ 
Sbjct: 357 VHVFAKNKLSGIDGLEMRIPSIARQENIFMTLDNMEPYGYLVQAETYTTEHVNNDLWQIF 416

Query: 530 RNPLDWDLRYIHPEYQKSLLPDTVNN---QPCPDVFWFPIVTEKFCHEFVQIMEAYGQWS 586
            NPLDW+ +Y+HP+Y K L P+   +   QPCPDVF+ PIVT KFC + +  +E +G WS
Sbjct: 417 DNPLDWEEQYVHPDYFKYLAPEVGMSDFKQPCPDVFYLPIVTTKFCRQLIAEVEEFGLWS 476

Query: 587 DGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPM 646
           DGTN D RLE GYE VPTRDIHM+Q+     W  FL KYV P+Q++ F GY  +P  A M
Sbjct: 477 DGTNIDPRLEGGYENVPTRDIHMRQINWEDHWLHFLVKYVHPIQKKLFAGYEDKPW-ARM 535

Query: 647 SFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLM 706
           +FVVRYRPDEQ SLRPHHD+S+YT+NIALN+ GVD+EGGG  F+RYNC+V   ++GW  +
Sbjct: 536 NFVVRYRPDEQASLRPHHDASSYTLNIALNEAGVDFEGGGTGFVRYNCSVVRAKVGWAAV 595

Query: 707 HPGRLTHYHEGLQVTQGTRYIMISFVDP 734
            PGR+TH HEGL  T GTRYI ++F++P
Sbjct: 596 FPGRVTHLHEGLTTTSGTRYIFVTFINP 623


>gi|426327853|ref|XP_004024724.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
           [Gorilla gorilla gorilla]
          Length = 710

 Score =  534 bits (1376), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 253/577 (43%), Positives = 379/577 (65%), Gaps = 9/577 (1%)

Query: 162 GFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLE 221
           GFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD    +FQNL G+L+
Sbjct: 139 GFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCRIFQNLDGALD 198

Query: 222 DIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNL-I 279
           ++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W   +GCT C+  +
Sbjct: 199 EVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFETGCTVCDEGL 257

Query: 280 KHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFD 339
           + L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K + +F++N++++H    +
Sbjct: 258 RSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIHNHEQHHKAQVE 317

Query: 340 DYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLK 398
           +++    + +++VK +     + + +ARN+  +     +   +YF VD+D  L  P+ L+
Sbjct: 318 EFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDADVALTEPNSLR 377

Query: 399 YLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYIT 458
            LV +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +   G+WNVPYI+
Sbjct: 378 LLVQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR--VGVWNVPYIS 435

Query: 459 NCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFD 517
           N YL+K S ++       ++  + +D DMAFC N+R + + + + +    GHL+  +++ 
Sbjct: 436 NIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTLGHLLSLDSYR 495

Query: 518 PQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQ 577
               + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI TE  C E V+
Sbjct: 496 TTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIFTEVACDELVE 554

Query: 578 IMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGY 637
            ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL +Y+ P+ E+ + GY
Sbjct: 555 EMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYIAPMTEKLYPGY 614

Query: 638 HHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVT 697
           +   V        RY+PDEQPSL PHHD+ST+TINIALN+VGVDYEGGGCRF+RYNC++ 
Sbjct: 615 YTR-VXXXXXXXXRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGCRFLRYNCSIR 673

Query: 698 ATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 674 APRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 710


>gi|403289984|ref|XP_003936115.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1
           [Saimiri boliviensis boliviensis]
          Length = 674

 Score =  529 bits (1363), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 285/715 (39%), Positives = 412/715 (57%), Gaps = 76/715 (10%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LVITVA+ ET+G++RF +SA+    ++++LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVITVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWNVEKRTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYPA
Sbjct: 85  LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPA 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGG     +    ++           + +     +   L+ +  I LD   
Sbjct: 145 VSDGKRFLGSGG----ERPGCRVLGPAEGPIHSSPRAFPYCSLVPSFLQDQINITLDHRC 200

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 201 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 259

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P+VL+ VFI++PT F+  F  ++  L+YP K I +   
Sbjct: 260 ETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPRKHIRL--- 316

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSD 388
                       +IHN         +++   +V S           L  G  F       
Sbjct: 317 ------------FIHN---------HVSSRHSVGSS--------CGLGPGPSF------- 340

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
                            +       R  + WSNFWGAL+ADG+YARS DY++I+ G +  
Sbjct: 341 -----------------TXXXXXXXRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 381

Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            G+WNVPYI+N YL+K S ++A      ++    +D DMAFC N+R + + + + +    
Sbjct: 382 VGVWNVPYISNIYLIKGSALRAELQSPDLFHHRKLDPDMAFCANIRQQDVFMFLTNRHGL 441

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY--------QKSLLPDTVNNQPCP 559
           GHL+  +N+     + +++E+  NP +  L + H ++        Q   LP      PCP
Sbjct: 442 GHLLSLDNYRTTHLHNDLWEVFSNP-EVRLGWAHSDWEQRGPGILQSEALPSQNLTIPCP 500

Query: 560 DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
           DV+WFPI TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W 
Sbjct: 501 DVYWFPIFTEAACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQIGFEREWH 560

Query: 620 EFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
           +FL +Y+ P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG
Sbjct: 561 KFLLEYIAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVG 619

Query: 680 VDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           VDYEGGGCRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 620 VDYEGGGCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 674


>gi|332250431|ref|XP_003274354.1| PREDICTED: LOW QUALITY PROTEIN: procollagen-lysine,2-oxoglutarate
           5-dioxygenase 1 [Nomascus leucogenys]
          Length = 681

 Score =  524 bits (1349), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 270/708 (38%), Positives = 404/708 (57%), Gaps = 81/708 (11%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LVITVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 51  EDNLLVITVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 110

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 111 LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 170

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 171 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 230

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT 270
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K E              
Sbjct: 231 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKDE-------------- 275

Query: 271 SGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN 330
                               P+VL+ VFI++PT F+  F  ++  L+YP K + +F++N+
Sbjct: 276 ------------------ALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIHNH 317

Query: 331 QEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDS 389
           +++H    ++++    + +++VK +     + + +ARN+  +     +   +YF VD+D 
Sbjct: 318 EQHHKAQVEEFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDADV 377

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L  P+ L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +   
Sbjct: 378 ALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR--V 435

Query: 450 GIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYG 508
           G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R + + + + +    G
Sbjct: 436 GVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQDVFMFLTNRHTLG 495

Query: 509 HLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVT 568
           HL+  +++     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI T
Sbjct: 496 HLLSLDSYRTTHLHNDLWEVFGNPEDWKEKYIHQNYTKALAGKLVET-PCPDVYWFPIFT 554

Query: 569 EKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR--KYV 626
           E  C E V+ ME +GQWS G N D R++ GYE VPT DIHM Q+G    W +FL     +
Sbjct: 555 EVACDELVEEMEHFGQWSLGDNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLAGTTSL 614

Query: 627 VPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
            P     F+     P+                                         GGG
Sbjct: 615 CPTPGPRFVRSSRLPI-----------------------------------------GGG 633

Query: 687 CRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           CRF+RYNC+V A R GW LMHPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 634 CRFLRYNCSVRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP 681


>gi|345308392|ref|XP_001516384.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
           [Ornithorhynchus anatinus]
          Length = 674

 Score =  523 bits (1346), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 250/622 (40%), Positives = 396/622 (63%), Gaps = 13/622 (2%)

Query: 27  KNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVN 85
           + ++ +K LV+T A+ ET+GYKRF+++A      V+TLGL + W GGD++ ++GGG KV 
Sbjct: 35  ERVNPEKLLVMTAATEETEGYKRFLRTARHFNYTVRTLGLGEEWRGGDVARTVGGGQKVR 94

Query: 86  LLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLY 145
            LK E+++    +D++IL  DSYDV++ G   ++L +F    + ++F AE  CWP+ SL 
Sbjct: 95  WLKQEMEKHADREDLVILFVDSYDVLLAGSPLELLWKFVQSGSRLLFSAEGFCWPEWSLA 154

Query: 146 DKYP--AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHK 203
           D YP  + G+G R+LNSGGFIG+A  +  L+     K+++DDQL+Y  L+LD  LR KH 
Sbjct: 155 DSYPPLSAGNGKRFLNSGGFIGFAPTVHRLVRQWKYKDDDDDQLFYTRLYLDPGLREKHG 214

Query: 204 IVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNY 263
           + LD  + +FQNL G+L+++ L F+ +  V + N  Y+T PV+IHGNG +K++LN  GNY
Sbjct: 215 LALDHKSRIFQNLNGALDEVVLKFEKNR-VRVRNVAYDTLPVVIHGNGPTKLQLNYLGNY 273

Query: 264 LAKSWK-TSGCTRCNLIKHLDSLKPD-QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAK 321
           +  +W    GC  C   +   +L  D + P VL+ +F+++PT FL +FL ++  L+YP+ 
Sbjct: 274 VPNAWTYEGGCGFCAQDRR--NLTGDSELPRVLLGLFVEQPTPFLPQFLQRLLLLDYPSS 331

Query: 322 KISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVD 380
           ++S+F++N++ YH    +      +T F  V+ +     +   EAR++A+++       D
Sbjct: 332 RLSLFLHNSEVYHEAHVEALWEQLRTRFSTVQLVGPEEALTQGEARDMAMDSCRQDPSCD 391

Query: 381 FYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMN 440
           FYF +D+D+ L NP  L  L+  +  ++AP+L R  K WSNFWGAL+ + +YARS DY+ 
Sbjct: 392 FYFSLDADAVLTNPRTLLSLIEEDRKVVAPMLSRHGKLWSNFWGALSPEEYYARSEDYVE 451

Query: 441 IINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHL 499
           ++   +   G+WNVPY+   YL++   +++    + ++TL   D DM+FC +LR+KGI L
Sbjct: 452 LVQRKR--VGLWNVPYVAQAYLVRGETLRSELPQRGVFTLEETDPDMSFCKSLRDKGIFL 509

Query: 500 KIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCP 559
            + + +E+G LV +  +D    +P+++++  NPLDW  +YIHP Y  +L  + V  QPCP
Sbjct: 510 HLSNQEEFGRLVSTARYDTDHLHPDLWQIFDNPLDWREKYIHPNYSLALEGEGVE-QPCP 568

Query: 560 DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
           DV+WFP+++++ C E V+ ME +GQWS G + D RL  GYE VPT DIHM QVG    W 
Sbjct: 569 DVYWFPVLSDRMCDELVEEMENFGQWSGGRHEDTRLAGGYENVPTVDIHMNQVGYEKEWL 628

Query: 620 EFLRKYVVPLQEREFIGYHHEP 641
           + L +Y+ P+ E  F GYH +P
Sbjct: 629 KVLSEYIAPMTESLFPGYHTKP 650


>gi|313237914|emb|CBY13041.1| unnamed protein product [Oikopleura dioica]
          Length = 747

 Score =  522 bits (1345), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 294/751 (39%), Positives = 438/751 (58%), Gaps = 45/751 (5%)

Query: 18  FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS- 76
            +S+  N V N  E   LVITVA+ +TDGY R+ +S   + L+ +T G+ + WLGGD++ 
Sbjct: 6   LVSLLSNSVLNARE--LLVITVATEKTDGYLRWEESVRYSGLKSRTFGIGEDWLGGDLTN 63

Query: 77  SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFN------TFDANI 130
             GGG+KVNLLK EL E     ++  L TD+YDVII+G   +I  RF+       +  N+
Sbjct: 64  GPGGGHKVNLLKKELAEYKGNSELYFLFTDAYDVIINGKEEEIFSRFDDIVSKVEYKTNV 123

Query: 131 VFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYA 190
           +  AE L WPD SL  KYP V  G R+L SG  +  A    +L+  R+I + +DDQL+Y 
Sbjct: 124 LISAEDLIWPDASLEPKYPLV-LGKRFLCSGAILARADVFLDLLEYRAIGDRDDDQLFYT 182

Query: 191 LLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLT--NTKYNTNPVIIH 248
             FL++ L+ K  I LD  A LF NL G+LE++ ++F        T  NTKY T P++IH
Sbjct: 183 EAFLNKELKEKFGIALDHKAELFFNLNGALEEVGIDFARSATGDNTVENTKYRTKPLVIH 242

Query: 249 GNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPS--VLISVFIDKPTAF 305
           GNG SK ELN   NY+ + W+   GC  C+ + + + +K D   S  ++I+  ID  T F
Sbjct: 243 GNGPSKNELNRISNYVPQGWRPDYGCPACSKVLN-EEIKEDIDTSKDIVIAFIIDGITPF 301

Query: 306 LEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKE 365
           +   L +IA+L+YPA+K  + +Y+N  +     D ++  F + +K+ K+I+    ++   
Sbjct: 302 VHNSLKRIASLDYPAEKTHLLIYSNTVWADERVDTFLEVFGSSYKSTKFISSKEKMSITM 361

Query: 366 ARNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           AR  A++ +  K   +F F+VD    L NP V+  L+  N  L+AP + R  K WSN+WG
Sbjct: 362 ARKFALQLTDEKFSAEFVFFVDGYVQLTNPAVIGELIKTNVELVAPGMSRYGKLWSNYWG 421

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-- 482
           A+ +DGFY+RS DY++I+ G +   GIWN+P++   YL+  ++  A ++  I+   S   
Sbjct: 422 AVASDGFYSRSDDYLDIVQGTR--VGIWNMPFVNGAYLVHKNL--AADLIDIFAGISQSP 477

Query: 483 ------DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWD 536
                 D D+ F +NLR  GI + + +   +G LVD E+    + +PE+++   N  DW+
Sbjct: 478 WQGKFNDPDLDFASNLRTLGIFMHVTNQAYWGRLVDREHMPVDRIHPELWQPEWNRPDWE 537

Query: 537 LRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDG--TNNDKR 594
             Y+  +Y + L P+T  ++PCPDV  FP ++ K   + ++ ME YG+WS G   + D+R
Sbjct: 538 EDYLDTDYWRVLEPETEMDEPCPDVVAFPFLSSKGGFDMIEEMEHYGKWSGGNEAHTDER 597

Query: 595 LETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAP-MSFVVRYR 653
           L  GYE VPT DIHM Q+GL   W   ++ Y  P+  + + GY+  P   P + FVVRY+
Sbjct: 598 LAGGYENVPTVDIHMNQIGLHDEWMYVVKTYAAPMVSKFYTGYN--PDNKPNLMFVVRYK 655

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTAT-----------RMG 702
           P EQ  LRPHHDSST+T  IALN+  +D+EGGG  F RY C+V  +           + G
Sbjct: 656 PGEQDRLRPHHDSSTWTFQIALNRPNIDFEGGGTYFTRYKCSVVGSATEQDSRSLEVKQG 715

Query: 703 WMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
                PGRLTH H GL  T+GTRYI+++F+D
Sbjct: 716 MGFAFPGRLTHQHAGLPTTKGTRYILVNFMD 746


>gi|313247226|emb|CBY36038.1| unnamed protein product [Oikopleura dioica]
          Length = 747

 Score =  521 bits (1342), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 293/751 (39%), Positives = 437/751 (58%), Gaps = 45/751 (5%)

Query: 18  FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS- 76
            +S+  N V N  E   LVITVA+ +TDGY R+ +S   + L+ +T G  + WLGGD++ 
Sbjct: 6   LVSLLSNSVLNARE--LLVITVATEKTDGYLRWEESVRYSGLKSRTFGTGEDWLGGDITN 63

Query: 77  SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFN------TFDANI 130
             GGG+KVNLLK EL       ++  L TD+YDVII+G   +I  RF+       +  N+
Sbjct: 64  GPGGGHKVNLLKKELAGYKGNSELYFLFTDAYDVIINGKEEEIFSRFDDIVSKVEYKTNV 123

Query: 131 VFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYA 190
           +  AE L WPD SL  KYP V  G R+L SG  +  A    +L+  R+I + +DDQL+Y 
Sbjct: 124 LISAEDLIWPDASLEPKYPLV-LGKRFLCSGAILARADVFLDLLEYRAIGDRDDDQLFYT 182

Query: 191 LLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLT--NTKYNTNPVIIH 248
             +L++ L+ K  I LD  A LF NL G+LE++ ++F        T  NTKY T P++IH
Sbjct: 183 EAYLNKELKEKFGIALDHKAELFFNLNGALEEVGIDFARSATGDNTVENTKYRTKPLVIH 242

Query: 249 GNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPS--VLISVFIDKPTAF 305
           GNG SK ELN   NY+ + W+   GC  C+ + + + +K D   S  ++I+  ID  T F
Sbjct: 243 GNGPSKNELNRISNYVPQGWRPDYGCPACSKVLN-EEIKEDIDTSKEIVIAFIIDGITPF 301

Query: 306 LEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKE 365
           +   L +IA+L+YPA+K  + +Y+N  +     D ++  F + +K+ K+I+    ++   
Sbjct: 302 VHNSLKRIASLDYPAEKTHLLIYSNTVWADERVDTFLEVFGSSYKSTKFISSKEKMSVTM 361

Query: 366 ARNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           AR  A++ +  K  V+F F+VD    L NP V+  L+  N  L+AP + R  K WSN+WG
Sbjct: 362 ARKFALQKTYEKFSVEFVFFVDGYVQLTNPAVIGELIKTNVELVAPGMSRYGKLWSNYWG 421

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-- 482
           A+ +DGFY+RS DY++I+ G +   GIWN+P++   YL+  ++  A ++  I+   S   
Sbjct: 422 AVASDGFYSRSDDYLDIVQGTR--VGIWNMPFVNGAYLVHKNL--AADLIDIFAGISQSP 477

Query: 483 ------DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWD 536
                 D D+ F +NLR  GI + + +   +G LVD E+    + +PE+++   N  DW+
Sbjct: 478 WQGKFNDPDLDFASNLRTLGIFMHVTNQAYWGRLVDREHMPVDRIHPELWQPEWNRPDWE 537

Query: 537 LRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDG--TNNDKR 594
             Y+  +Y + L P+T  ++PCPDV  FP ++ K   + ++ ME YG+WS G   + D+R
Sbjct: 538 EDYLDSDYWRVLEPETEMDEPCPDVVAFPFLSSKGGFDMIEEMEHYGKWSGGNEAHTDER 597

Query: 595 LETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAP-MSFVVRYR 653
           L  GYE VPT DIHM Q+GL   W   ++ Y  P+  + + GY+  P   P + FVVRY+
Sbjct: 598 LAGGYENVPTVDIHMNQIGLQDEWLYVVKTYAAPMVSKFYTGYN--PDNKPNLMFVVRYK 655

Query: 654 PDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTAT-----------RMG 702
           P EQ  LRPHHDSST+T  IALN+  +D+EGGG  F RY C+V  +           + G
Sbjct: 656 PGEQDRLRPHHDSSTWTFQIALNRPNIDFEGGGTYFTRYKCSVVGSATEQDSRSLEVKQG 715

Query: 703 WMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
                PGRLTH H GL  T+GTRYI+++F+D
Sbjct: 716 MGFAFPGRLTHQHAGLPTTKGTRYILVNFMD 746


>gi|426342442|ref|XP_004037854.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           [Gorilla gorilla gorilla]
          Length = 783

 Score =  509 bits (1311), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 254/624 (40%), Positives = 385/624 (61%), Gaps = 9/624 (1%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           +     K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 125 LGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 184

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  ++     DD++++ T+ +DVI  GG  ++L++F   +  +VF A+ +
Sbjct: 185 IGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGI 244

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  +  ++   ++++ +DDQL+Y  +++D  
Sbjct: 245 LWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKIYIDPL 304

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+++++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 305 KREAINITLDHKCKIFQTLNGAVDEVVLKFENGK-ARAKNTFYETLPVAINGNGPTKILL 363

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C     +D    D  P+V I VFI++PT FL  FL+ +  L
Sbjct: 364 NYFGNYVPNSWTQDNGCTLCEF-DTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTL 422

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
           +YP + + +F++N + YH      +    K   K +K +     ++  EARN+ ++    
Sbjct: 423 DYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQ 482

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 483 DEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 542

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN 494
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R 
Sbjct: 543 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNARE 600

Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
            G+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + + 
Sbjct: 601 MGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 659

Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
            QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+  GYE VPT DIHMKQV L
Sbjct: 660 EQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDL 719

Query: 615 AGVWAEFLRKYVVPLQEREFIGYH 638
             VW  F+R+++ P+  + F GY+
Sbjct: 720 ENVWLHFIREFIAPVTLKVFAGYY 743


>gi|351698763|gb|EHB01682.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Heterocephalus
           glaber]
          Length = 828

 Score =  498 bits (1282), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 241/549 (43%), Positives = 348/549 (63%), Gaps = 41/549 (7%)

Query: 221 EDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKI------------------------- 255
           +++ L FD +  V + N  Y+T PV++HGNG +K+                         
Sbjct: 286 DEVVLKFDRNR-VRIRNVAYDTLPVVVHGNGPTKVPPSSLCPCLPQALGTLSFLSSHYSP 344

Query: 256 ------ELNSFGNYLAKSW-KTSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLE 307
                 +LN  GNY+   W    GC  CN   + L   +P   P VL++VF+++PT FL 
Sbjct: 345 APATELQLNYLGNYVPNGWTPQGGCGFCNRDQRTLPGGQPP--PRVLLAVFVEQPTPFLP 402

Query: 308 EFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEAR 367
            FL ++  L+YP  ++++F++N++ YH P   D     +  F++VK +     ++  EAR
Sbjct: 403 RFLQRLLLLDYPPDRVTLFLHNSEVYHEPHIADSWPQLQDHFESVKLVGPEEDLSPGEAR 462

Query: 368 NLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           ++A++        +FYF +D+D+ L N   L+ L+ +N  +IAP+L R  K WSNFWGAL
Sbjct: 463 DMAMDTCRQDPECEFYFSLDADAVLTNQQTLRILIEQNRKVIAPMLSRHGKLWSNFWGAL 522

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYD 485
           + D +YARS DY+ ++   +   G+WNVPYI+  YL++   ++     + +++ + MD D
Sbjct: 523 SPDEYYARSEDYVELVQRKR--LGVWNVPYISQAYLIQGETLRTELPQREVFSSSDMDPD 580

Query: 486 MAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQ 545
           MAFC NLR++GI L + + QE+G L+ +  +D    +P+++++  NP+DW  +YIH  Y 
Sbjct: 581 MAFCMNLRDRGIFLHLSNQQEFGRLLATSRYDTDHLHPDLWQIFDNPVDWKEQYIHENYS 640

Query: 546 KSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTR 605
           ++L    +  QPCPDV+WFP+++E+ C E V+ ME YGQWS G + D RL  GYE VPT 
Sbjct: 641 QALDGKDLVEQPCPDVYWFPLLSEQMCDELVEEMENYGQWSGGRHEDSRLAGGYENVPTV 700

Query: 606 DIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHD 665
           DIHMKQVG    W + LR YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHD
Sbjct: 701 DIHMKQVGYEDQWLQLLRTYVGPMTEHLFPGYHTK-TRAVMNFVVRYRPDEQPSLRPHHD 759

Query: 666 SSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
           SST+T+N+ALN  G+DYEGGGCRF+RYNC +++ R GW L+HPGRLTHYHEGL  T+GTR
Sbjct: 760 SSTFTLNVALNHKGLDYEGGGCRFLRYNCIISSPRKGWGLLHPGRLTHYHEGLPTTRGTR 819

Query: 726 YIMISFVDP 734
           YIM+SFVDP
Sbjct: 820 YIMVSFVDP 828



 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 90/193 (46%), Positives = 134/193 (69%), Gaps = 1/193 (0%)

Query: 29  IDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLL 87
           ++ +K LVITVA+ ET+GY+RF+Q+AE     V+TLGL + W GGD++ ++GGG KV  L
Sbjct: 37  VNPEKLLVITVATAETEGYRRFLQTAEFFNYTVRTLGLGKEWRGGDVARTVGGGQKVRWL 96

Query: 88  KNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDK 147
           K E+++    +DMII+  DSYDVI+ G   ++L++F    + ++F AE  CWPD  L ++
Sbjct: 97  KKEMEKYADQEDMIIMFVDSYDVILAGSPTELLKKFVQSSSRLLFSAEGFCWPDWGLAEQ 156

Query: 148 YPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLD 207
           YP VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD
Sbjct: 157 YPEVGTGKRFLNSGGFIGFAPTIHQIVHQWKYKDDDDDQLFYTRLYLDPGLREKFSLNLD 216

Query: 208 TLANLFQNLYGSL 220
             + +FQNL G+L
Sbjct: 217 HKSRIFQNLNGAL 229


>gi|344256859|gb|EGW12963.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 1 [Cricetulus
           griseus]
          Length = 659

 Score =  494 bits (1273), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 257/656 (39%), Positives = 401/656 (61%), Gaps = 37/656 (5%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           D  LV+TVA+ ET+G++RF +SA+    +++ +    P L     S   G+         
Sbjct: 19  DNLLVLTVATKETEGFRRFKRSAQFFNYKIQWV----PSLDPASPSPRFGH--------- 65

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
                          SYDV+   G  ++L++F    + +VF AE L +PD  L  KYP V
Sbjct: 66  ---------------SYDVVFASGPRELLKKFQQAKSRVVFSAEELIYPDRRLEAKYPTV 110

Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
             G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I L    +
Sbjct: 111 SDGKRFLGSGGFIGYAPNLNKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINISLGHSCS 170

Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
           +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W   
Sbjct: 171 IFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVIHGNGPTKLQLNYLGNYIPRFWTFE 229

Query: 271 SGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           +GCT C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L YP K++ +F++N
Sbjct: 230 TGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFLRLLRLRYPQKRMRLFIHN 289

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
           ++++H    + ++    T +++VK +     + + +ARN+  +     +   +YF VD+D
Sbjct: 290 HEQHHKLEVEKFLAEHGTEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFSVDAD 349

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L  PD L+ L+ +N+++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +  
Sbjct: 350 VALTEPDSLRLLIEQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 407

Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            G+WNVPYI+N YL+K S ++A      ++  + +D DM+FC N+R + + + + +   +
Sbjct: 408 VGVWNVPYISNIYLIKGSALRAELQHVDLFHYSKLDADMSFCANVRQQEVFMFLTNRHTF 467

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIV 567
           GHL+  +N+     + +++E+  NP DW  +YIH  Y K+L    V   PCPDV+WFPI 
Sbjct: 468 GHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALEGKLV-EMPCPDVYWFPIF 526

Query: 568 TEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV 627
           TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W +FL +Y+ 
Sbjct: 527 TEAACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREWHKFLVEYIA 586

Query: 628 PLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
           P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VG DYE
Sbjct: 587 PMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRVGQDYE 641


>gi|312082545|ref|XP_003143488.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Loa loa]
          Length = 569

 Score =  491 bits (1265), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 248/578 (42%), Positives = 372/578 (64%), Gaps = 16/578 (2%)

Query: 164 IGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDI 223
           +G+A +I  LIS R +++ +DDQLYY  L+LD+ +R   K+ LD++  LFQNL G+  D+
Sbjct: 1   MGFAPEIWSLISYRDVEDNDDDQLYYTRLYLDKQIRLSLKMTLDSMTVLFQNLNGASNDV 60

Query: 224 KLNFDLDE--FVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKH 281
           KL    +      + N  YNT+P++IHGNG SK+ LN  GNY+      +  T+   +  
Sbjct: 61  KLEMSGERSGMYFIYNFIYNTHPLVIHGNGPSKLYLNHLGNYIDPLRIATSKTQSITM-- 118

Query: 282 LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDY 341
               +  + P + +S+ I KP  F+ EF   I  L Y  +KI +FVY NQ++      D+
Sbjct: 119 --DFEKIELPKLFLSIIISKPIPFIREFFGNIKKLAYTDEKIDLFVYCNQKFLTKEVSDF 176

Query: 342 IHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLV 401
           + + K  ++++ Y   ++ +  +EAR+ +++ SL  G D+   VD D HL+N + L ++V
Sbjct: 177 VEDVKKRYRSLLY-DDSTEMEEREARSFSLKQSLALGDDYLIMVDGDVHLNNSEALLFMV 235

Query: 402 N----RNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYI 457
           +    +   ++APL+ +P K ++NFWGA++++G+YARS +Y++II  D    GIWNVP+I
Sbjct: 236 HTMKEKEPEILAPLIRQPHKLFTNFWGAISSNGYYARSENYLDII--DHKEVGIWNVPFI 293

Query: 458 TNCYLMKTSVIKATNIKTIYTLNS-MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENF 516
            +  ++     K T++   Y  +  +D DM+FC+  R+KG  L +D++  YG LV SEN 
Sbjct: 294 GSILIIAKE--KLTSLSRAYHYDEKLDPDMSFCSFARDKGHFLYLDNSHHYGFLVVSENV 351

Query: 517 DPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFV 576
           +  K +PE+YE+  N   W+ RYIHP Y  +L   T   + C DV+ FP+++E+FC E +
Sbjct: 352 ESSKVHPEMYEIFNNKELWEKRYIHPNYFTALNGSTPIPEICQDVYDFPLMSERFCAELI 411

Query: 577 QIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIG 636
           +  E YG+WSDG + D+RL  GYE VPTRDIHMKQ+     W   L +YV P+QE+ FIG
Sbjct: 412 EECEYYGKWSDGKHKDERLVGGYENVPTRDIHMKQIDFERHWLYMLDEYVRPIQEKLFIG 471

Query: 637 YHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNV 696
           Y+ +PV + M FVVRY+P+EQ SLRPHHD+STY+I+IALN+ GVDYEGGG RF+RYNC  
Sbjct: 472 YYKQPVESVMMFVVRYKPEEQASLRPHHDASTYSIDIALNKRGVDYEGGGVRFLRYNCTF 531

Query: 697 TATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
            A  +G  ++ PGRLTH HEGL+ T+GTRYI +SF++P
Sbjct: 532 DADVVGHSMIFPGRLTHLHEGLETTRGTRYIAVSFINP 569


>gi|312082547|ref|XP_003143489.1| hypothetical protein LOAG_07909 [Loa loa]
          Length = 719

 Score =  483 bits (1242), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 273/711 (38%), Positives = 413/711 (58%), Gaps = 26/711 (3%)

Query: 33  KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELD 92
           K     +++   DG +R   SAE   +  K L L Q  +  +      G  + +L  EL 
Sbjct: 26  KLAAFALSTGSNDGLERLKCSAEHYNIDFKILDLGQNSIDHE-DKEDTGKLLRMLTTELG 84

Query: 93  EMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG 152
            + I++  I+L+ D ++ II    ++I+ +F   DA   + A  L  P T    +  + G
Sbjct: 85  VLRISNSTILLIIDGFNAIITSDESNIICQF--LDACGNYRA--LLTPKTVSAQRSSSFG 140

Query: 153 SGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANL 212
             +  + S   IG+  DI ++     I +++ + L Y  L+ + ++ T   +  D    L
Sbjct: 141 LLFSEVRSVALIGFVPDILDVFD--FIGSQDGNTLSYTSLYSNYSVDTL-GLTFDVKGIL 197

Query: 213 FQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTS- 271
           FQN+  +  +I L FD   + ++ N   NT P +I G+ K    LN  GNY+ K+W    
Sbjct: 198 FQNVDSANSEIMLLFDDSGYAYVNNFVQNTRPSVILGSTKGSQLLNHLGNYVGKAWSAED 257

Query: 272 GCTRCNLIKHLDSLKPDQ--FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           G  +C+      SLK  +  +PSV +++FI KP  F+ EFL  ++ ++YP  KI ++ YN
Sbjct: 258 GYLQCSTT----SLKTSENTWPSVTLALFITKPIPFIREFLATVSRISYPTSKIDIYFYN 313

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           NQ+Y+    + ++ N K +++ V+Y   ++ +  +EAR  A+  +     DF F +D D 
Sbjct: 314 NQKYNEEEIEKFLQNAKKLYQTVEYDNSDTELGEREARKAALTFAKEMLNDFIFMLDGDV 373

Query: 390 HLDNPDVLKYLVNRNES----LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGD 445
           HL  P+ L+ LV+   +    +IAPL+    K +SNFWGAL+++G+Y RS DY+ I++G 
Sbjct: 374 HLITPETLQLLVDTAIAGKFGIIAPLVTLHGKLFSNFWGALDSNGYYLRSEDYIEIVDGK 433

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-DYDMAFCTNLRNKGIHLKIDST 504
           +   GIWNVPYI+   L+    IK   ++  YT N M D DM+FC   R  G  + +D+ 
Sbjct: 434 R--TGIWNVPYISKAILISKEKIKV--LENSYTYNVMVDADMSFCEYAREMGYFMYVDNQ 489

Query: 505 QEYGHLVDSENF-DPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFW 563
             YG LVD+E+F   ++ +PE+YE+ +N   W+ RYIHP+Y ++L    +  QPCPDV+ 
Sbjct: 490 HYYGFLVDAEDFVSDERLHPEMYEIFKNRYVWEQRYIHPKYYEALNSRNIP-QPCPDVYN 548

Query: 564 FPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR 623
           +P+++E F  E ++ ME YG WS G N D RL  GYE VPT DIHMKQ+     W  FL 
Sbjct: 549 YPLMSENFTKELIEEMEHYGLWSSGKNEDNRLAGGYENVPTVDIHMKQISFEKEWLYFLD 608

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
           +YV P+QE+ FIGY+ +PV A M FVVRY+  EQ SL+ HHD+STYT++I LN+ G DYE
Sbjct: 609 EYVRPMQEKLFIGYYQQPVEAVMMFVVRYKQGEQSSLQAHHDASTYTVDIPLNKRGRDYE 668

Query: 684 GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGG R++RYNC V A ++G+  M PGRLTH HEGL VT G RYI +SF++P
Sbjct: 669 GGGIRYVRYNCTVPADQIGYAAMFPGRLTHLHEGLPVTSGIRYIAVSFLNP 719


>gi|297282211|ref|XP_002802231.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like
           [Macaca mulatta]
          Length = 640

 Score =  482 bits (1240), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 267/730 (36%), Positives = 391/730 (53%), Gaps = 140/730 (19%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLEAKYPV 144

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SGGFIGYA ++ +L++    ++ + DQL+Y  +FLD   R +  I LD   
Sbjct: 145 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRC 204

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F++   V   N  Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 205 RIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTF 263

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT C+  ++ L  +  +  P+VL+ +FI++PT F                 +S+F  
Sbjct: 264 ETGCTVCDEGLRSLKGIGDEALPTVLVGMFIEQPTPF-----------------VSLFFQ 306

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSD 388
              + H P             K+++   HN  V+S+ +                      
Sbjct: 307 RLLQLHYPR------------KHMRLFIHNH-VSSRHSEG-------------------- 333

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
                            ++IAPL+ R  + WSNFWGAL+ADG+YARS DY++I+ G +  
Sbjct: 334 -----------------NVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRR-- 374

Query: 449 KGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNK------------ 495
            G+WNVPYI+N YL+K S ++       ++  + +D DMAFC N+R +            
Sbjct: 375 IGVWNVPYISNIYLIKGSALRGELQSPDLFHHSKLDPDMAFCANVRQQVSQQWAAQDTPR 434

Query: 496 -----------GIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY 544
                       + + + +    GHL+  +++     + +++E+  NP DW  +YIH  Y
Sbjct: 435 PRLFHWACFPQDVFMFLTNRHTLGHLLSLDSYRTAHLHNDLWEVFSNPEDWKEKYIHQNY 494

Query: 545 QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPT 604
            K+L    V   PCPDV+WFPI TE  C E V+ ME +GQWS G N D R++ GYE VPT
Sbjct: 495 TKALAGKLVET-PCPDVYWFPIFTEAACDELVEEMEHFGQWSLGDNKDSRIQGGYENVPT 553

Query: 605 RDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH 664
            DIHM Q+G    W +FL +Y+ P+ E+ + GY                           
Sbjct: 554 IDIHMNQIGFEREWHKFLLEYIAPMTEKLYPGY--------------------------- 586

Query: 665 DSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGT 724
               YT             GGGCRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GT
Sbjct: 587 ----YT------------RGGGCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGT 630

Query: 725 RYIMISFVDP 734
           RYI +SFVDP
Sbjct: 631 RYIAVSFVDP 640


>gi|47210803|emb|CAF89795.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 607

 Score =  473 bits (1218), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 266/680 (39%), Positives = 372/680 (54%), Gaps = 108/680 (15%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
           +  LVIT A+ ETDG+ RF+++A      VK LGL + W GGD++ ++GGG KV  LK E
Sbjct: 1   ENLLVITAATEETDGFHRFMRTAREFNYTVKVLGLGEEWRGGDVARTVGGGQKVRWLKEE 60

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L +     D ++L  DSYDVI+  G  ++L +F+     +VF AE  CWPD  L  KYP 
Sbjct: 61  LRKHS-DQDTVVLFVDSYDVILASGPEELLSKFSRLAHRVVFSAEGFCWPDQRLAPKYPE 119

Query: 151 VGSGYRYLNSGG---------------------------FIGYAKDIKELISNRSIKNEE 183
           V SG RYLNSGG                           FIG+A ++  ++     ++++
Sbjct: 120 VPSGKRYLNSGGPRLPPVRVRRRWRLDQPVCVCVCVCSGFIGFASELSAIVQQWKYRDDD 179

Query: 184 DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTN 243
           DDQL+Y  ++LD+  RTK  + LD  + +FQNL G+++++ L F+  + V   N  Y+T 
Sbjct: 180 DDQLFYTRIYLDKVQRTKFNMTLDHRSRIFQNLNGAVDEVVLKFERSK-VRARNVAYDTL 238

Query: 244 PVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCN----LIKHLDSLKPDQ-FPSVLISV 297
           PV+IHGNG +K++LN   NY+  +W    GC  C+    L+ H+    PD+  P V + V
Sbjct: 239 PVVIHGNGPTKLQLNYLANYVPSAWTFQGGCGVCDDDLLLLNHV----PDEDMPLVHVGV 294

Query: 298 FIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAH 357
           FI+K T FLEEFL ++  +NYP  +                                 A 
Sbjct: 295 FIEKATPFLEEFLERLTLMNYPTAQS--------------------------------AS 322

Query: 358 NSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFK 417
           +ST      R+           D+YF +DSD  L NPD L+ L+  N+S+IAP+L +  K
Sbjct: 323 SSTTTEACLRD--------PECDYYFSLDSDVALTNPDTLRILMEENKSVIAPMLSKHGK 374

Query: 418 AWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIY 477
            WSNFWGAL+ +GFY+RS DY+ I+ G +   G+WNVPYIT  YL+K SV+++       
Sbjct: 375 LWSNFWGALSPEGFYSRSEDYIEIVQGKR--IGLWNVPYITQVYLIKGSVLRS------- 425

Query: 478 TLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDL 537
                          R   + L+  S          E  +P   +P   E   +  DW  
Sbjct: 426 ---------------RLSQLSLRWTSRHRSSAGTSREQDEPWSCDPPRREDAAD--DWKE 468

Query: 538 RYIHPEYQKSLLP-DTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLE 596
           +Y+H  Y +     ++   QPCPDV+WFP  +EK C   V+ MEA+GQWS G + D+RL 
Sbjct: 469 KYVHENYSRIFEEQESFVEQPCPDVYWFPAFSEKMCDHLVETMEAHGQWSSGGHKDERLS 528

Query: 597 TGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDE 656
            GYE VPT D HM Q+G    W  FLR Y+VP+ E+ + GY+    +A M+FVVRYRPDE
Sbjct: 529 GGYENVPTVDTHMNQIGFEKEWLRFLRDYIVPVTEKLYPGYYPR-AQAIMNFVVRYRPDE 587

Query: 657 QPSLRPHHDSSTYTINIALN 676
           QPSLRPHHDSST+TINIALN
Sbjct: 588 QPSLRPHHDSSTFTINIALN 607


>gi|393910404|gb|EFO20581.2| hypothetical protein LOAG_07909 [Loa loa]
          Length = 633

 Score =  471 bits (1213), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 261/658 (39%), Positives = 390/658 (59%), Gaps = 34/658 (5%)

Query: 86  LLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLY 145
           +L  EL  + I++  I+L+ D ++ II    ++I+ +F   DA         C    +L 
Sbjct: 1   MLTTELGVLRISNSTILLIIDGFNAIITSDESNIICQF--LDA---------CGNYRALL 49

Query: 146 DKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIV 205
              P   S  R + S   IG+  DI ++     I +++ + L Y  L+ + ++ T   + 
Sbjct: 50  T--PKTVSAQREVRSVALIGFVPDILDVFD--FIGSQDGNTLSYTSLYSNYSVDTL-GLT 104

Query: 206 LDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLA 265
            D    LFQN+  +  +I L FD   + ++ N   NT P +I G+ K    LN  GNY+ 
Sbjct: 105 FDVKGILFQNVDSANSEIMLLFDDSGYAYVNNFVQNTRPSVILGSTKGSQLLNHLGNYVG 164

Query: 266 KSWKTS-GCTRCNLIKHLDSLKPDQ--FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKK 322
           K+W    G  +C+      SLK  +  +PSV +++FI KP  F+ EFL  ++ ++YP  K
Sbjct: 165 KAWSAEDGYLQCSTT----SLKTSENTWPSVTLALFITKPIPFIREFLATVSRISYPTSK 220

Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFY 382
           I ++ YNNQ+Y+    + ++ N K +++ V+Y   ++ +  +EAR  A+  +     DF 
Sbjct: 221 IDIYFYNNQKYNEEEIEKFLQNAKKLYQTVEYDNSDTELGEREARKAALTFAKEMLNDFI 280

Query: 383 FYVDSDSHLDNPDVLKYLVNRNES----LIAPLLVRPFKAWSNFWGALNADGFYARSFDY 438
           F +D D HL  P+ L+ LV+   +    +IAPL+    K +SNFWGAL+++G+Y RS DY
Sbjct: 281 FMLDGDVHLITPETLQLLVDTAIAGKFGIIAPLVTLHGKLFSNFWGALDSNGYYLRSEDY 340

Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-DYDMAFCTNLRNKGI 497
           + I++G +   GIWNVPYI+   L+    IK   ++  YT N M D DM+FC   R  G 
Sbjct: 341 IEIVDGKR--TGIWNVPYISKAILISKEKIKV--LENSYTYNVMVDADMSFCEYAREMGY 396

Query: 498 HLKIDSTQEYGHLVDSENF-DPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQ 556
            + +D+   YG LVD+E+F   ++ +PE+YE+ +N   W+ RYIHP+Y ++L    +  Q
Sbjct: 397 FMYVDNQHYYGFLVDAEDFVSDERLHPEMYEIFKNRYVWEQRYIHPKYYEALNSRNIP-Q 455

Query: 557 PCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAG 616
           PCPDV+ +P+++E F  E ++ ME YG WS G N D RL  GYE VPT DIHMKQ+    
Sbjct: 456 PCPDVYNYPLMSENFTKELIEEMEHYGLWSSGKNEDNRLAGGYENVPTVDIHMKQISFEK 515

Query: 617 VWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALN 676
            W  FL +YV P+QE+ FIGY+ +PV A M FVVRY+  EQ SL+ HHD+STYT++I LN
Sbjct: 516 EWLYFLDEYVRPMQEKLFIGYYQQPVEAVMMFVVRYKQGEQSSLQAHHDASTYTVDIPLN 575

Query: 677 QVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           + G DYEGGG R++RYNC V A ++G+  M PGRLTH HEGL VT G RYI +SF++P
Sbjct: 576 KRGRDYEGGGIRYVRYNCTVPADQIGYAAMFPGRLTHLHEGLPVTSGIRYIAVSFLNP 633


>gi|393910405|gb|EJD75867.1| hypothetical protein, variant 1 [Loa loa]
 gi|393910406|gb|EJD75868.1| hypothetical protein, variant 2 [Loa loa]
          Length = 511

 Score =  443 bits (1139), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 227/519 (43%), Positives = 327/519 (63%), Gaps = 18/519 (3%)

Query: 225 LNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLD 283
           L FD   + ++ N   NT P +I G+ K    LN  GNY+ K+W    G  +C+      
Sbjct: 2   LLFDDSGYAYVNNFVQNTRPSVILGSTKGSQLLNHLGNYVGKAWSAEDGYLQCSTT---- 57

Query: 284 SLKPDQ--FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDY 341
           SLK  +  +PSV +++FI KP  F+ EFL  ++ ++YP  KI ++ YNNQ+Y+    + +
Sbjct: 58  SLKTSENTWPSVTLALFITKPIPFIREFLATVSRISYPTSKIDIYFYNNQKYNEEEIEKF 117

Query: 342 IHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLV 401
           + N K +++ V+Y   ++ +  +EAR  A+  +     DF F +D D HL  P+ L+ LV
Sbjct: 118 LQNAKKLYQTVEYDNSDTELGEREARKAALTFAKEMLNDFIFMLDGDVHLITPETLQLLV 177

Query: 402 NRNES----LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYI 457
           +   +    +IAPL+    K +SNFWGAL+++G+Y RS DY+ I++G +   GIWNVPYI
Sbjct: 178 DTAIAGKFGIIAPLVTLHGKLFSNFWGALDSNGYYLRSEDYIEIVDGKR--TGIWNVPYI 235

Query: 458 TNCYLMKTSVIKATNIKTIYTLNSM-DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENF 516
           +   L+    IK   ++  YT N M D DM+FC   R  G  + +D+   YG LVD+E+F
Sbjct: 236 SKAILISKEKIKV--LENSYTYNVMVDADMSFCEYAREMGYFMYVDNQHYYGFLVDAEDF 293

Query: 517 -DPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEF 575
              ++ +PE+YE+ +N   W+ RYIHP+Y ++L    +  QPCPDV+ +P+++E F  E 
Sbjct: 294 VSDERLHPEMYEIFKNRYVWEQRYIHPKYYEALNSRNIP-QPCPDVYNYPLMSENFTKEL 352

Query: 576 VQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFI 635
           ++ ME YG WS G N D RL  GYE VPT DIHMKQ+     W  FL +YV P+QE+ FI
Sbjct: 353 IEEMEHYGLWSSGKNEDNRLAGGYENVPTVDIHMKQISFEKEWLYFLDEYVRPMQEKLFI 412

Query: 636 GYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCN 695
           GY+ +PV A M FVVRY+  EQ SL+ HHD+STYT++I LN+ G DYEGGG R++RYNC 
Sbjct: 413 GYYQQPVEAVMMFVVRYKQGEQSSLQAHHDASTYTVDIPLNKRGRDYEGGGIRYVRYNCT 472

Query: 696 VTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           V A ++G+  M PGRLTH HEGL VT G RYI +SF++P
Sbjct: 473 VPADQIGYAAMFPGRLTHLHEGLPVTSGIRYIAVSFLNP 511


>gi|47205471|emb|CAF94612.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 559

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 216/565 (38%), Positives = 351/565 (62%), Gaps = 12/565 (2%)

Query: 33  KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW-LGGDMSSLGGGYKVNLLKNEL 91
           + LV+TVA+ +TDG++RF+ SA+     VK LG  + W  GG   + GGG KV LLK  +
Sbjct: 1   RLLVLTVATGDTDGFRRFLSSAQHFNYTVKVLGRDEAWSGGGYAGAPGGGQKVRLLKAAV 60

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
           +EM+   D I+L TDSYD +   G  ++L++F      +VF +E L WPD  L DK+P V
Sbjct: 61  EEME-NQDAILLFTDSYDAVFSSGPRELLKKFQQAGHQVVFSSEPLIWPDRHLEDKHPHV 119

Query: 152 GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLAN 211
             G R+L SGGFIGY  +IKEL+++ + ++++ DQL++  +++D   R    I LD+   
Sbjct: 120 REGNRFLGSGGFIGYLANIKELVADWTGEDDDSDQLFFTRIYIDAAKRKSINITLDSKCR 179

Query: 212 LFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-T 270
           LFQNL GSL+++ L F+  + V   N  ++T PV+IHGNG +K+++N  GNY+   W   
Sbjct: 180 LFQNLLGSLDEVVLKFEEGK-VRARNLVHDTLPVLIHGNGPTKLQINYLGNYIPNVWTFE 238

Query: 271 SGCTRCN-LIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
           +GC  C   ++ L +L+   +P VL+ VFI++PT F+  F  ++  L YP  ++ + +YN
Sbjct: 239 AGCRVCQEELRPLGALQESDYPLVLVGVFIEQPTPFVSAFFQRLLELQYPKTRLKVLIYN 298

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSD 388
            + +H      ++   ++++  V  +      ++++ARNLA++     +  D++F VD D
Sbjct: 299 KEAHHEQHVSAFLQKHQSLYAAVDLLRPEDPADARDARNLALDMCRQDQSCDYFFSVDVD 358

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L N   L+ L+  N  ++AP+L R  + WSNFWGAL+ DG+YARS DY++I+   +  
Sbjct: 359 VVLKNQSTLRTLIEHNLPIVAPMLTRAGRLWSNFWGALSPDGYYARSEDYVDIVQRRR-- 416

Query: 449 KGIWNVPYITNCYLMKTSVIKA--TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
            G+WNVPY++   L+K  ++++  T+ + ++  + +D DMAFC N+RNKGI + + +   
Sbjct: 417 VGVWNVPYVSKVVLLKGVLLRSELTDFE-LFDSHILDPDMAFCHNVRNKGIFMYVTNVHT 475

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +GH++ +EN+  Q  + +++++  NPLDW  RYIHP Y + +L D +   PCPDV+WFP+
Sbjct: 476 FGHILSTENYQTQHLHNDLWQIFENPLDWQERYIHPNYSR-ILRDQLIETPCPDVYWFPV 534

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNN 591
            TE+ C   V+ ME +G+WS G N 
Sbjct: 535 FTEEACDHMVEEMEHFGRWSGGANT 559


>gi|28380952|gb|AAO41443.1| RE30068p [Drosophila melanogaster]
          Length = 595

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 218/541 (40%), Positives = 331/541 (61%), Gaps = 12/541 (2%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNEL 91
           DK  V TVA+  TDGY R+I+SA V  ++V TLGL + W GGDM   GGG+K+NLL+  +
Sbjct: 27  DKIKVFTVATEPTDGYTRYIRSARVYDIEVTTLGLGEEWKGGDMQKPGGGFKLNLLREAI 86

Query: 92  DEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAV 151
                  + IIL TDSYDVII   +++I E+F    A I+F AE+ CWPD SL D YP V
Sbjct: 87  APYKNEPETIILFTDSYDVIITTTLDEIFEKFKESGAKILFSAEKYCWPDKSLADDYPEV 146

Query: 152 -GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
            G   R+LNSG FIGYA  +  L+ +  I++  DDQLY+  +FLDET R K  + LD  +
Sbjct: 147 EGKASRFLNSGAFIGYAPQVFALLVD-PIEDTADDQLYFTKIFLDETKRAKLGLKLDVQS 205

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVH-LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
            LFQNL+G+  D+KL  DL+     L N  + T P IIHGNG SK++LN++GNYLA+++ 
Sbjct: 206 RLFQNLHGAKNDVKLKVDLESNQGVLQNVDFMTTPSIIHGNGLSKVDLNAYGNYLARTF- 264

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
              C  C   ++L  L+    P + +++ + +P  F ++FL  I +LNYP +K+ + +Y+
Sbjct: 265 NGVCLLCQ--ENLLDLEETNLPVISLALMVTQPVPFFDQFLEGIESLNYPKEKLHLLIYS 322

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           N  +H      +++     +   K+      ++ ++ R LA++ +     D+ F+VD+D+
Sbjct: 323 NVAFHDDDIKSFVNKHAKEYATAKFALSTDELDERQGRQLALDKARLHQSDYIFFVDADA 382

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
           H+D+ +VL+ L+  N+  +AP+  +  + WSNFWGAL+  G+YARS DY++I+  +    
Sbjct: 383 HIDDGEVLRELLRLNKQFVAPIFSKHKELWSNFWGALSEGGYYARSHDYVDIVKREL--I 440

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           G++NVP++T+ YL+K +   A + K        D DMA C +LRN GI +   + + +GH
Sbjct: 441 GMFNVPHVTSIYLVKKTAFDAISFKH----KEFDPDMAMCESLRNAGIFMYASNLRIFGH 496

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           LV++++F+   T P+ Y L  N +DW  +YIHP Y   L       QPCPDV+WF IV+E
Sbjct: 497 LVNADDFNTTVTRPDFYTLFSNEIDWTEKYIHPNYSLQLNESNKIQQPCPDVYWFQIVSE 556

Query: 570 K 570
           +
Sbjct: 557 R 557


>gi|432957744|ref|XP_004085857.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like,
           partial [Oryzias latipes]
          Length = 363

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 190/357 (53%), Positives = 255/357 (71%), Gaps = 5/357 (1%)

Query: 380 DFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYM 439
           +F+F +DSD  L NPD L+ L+  N+S+IAP+L +  K WSNFWGAL+ +GFY+RS DY+
Sbjct: 10  EFFFSLDSDVALTNPDTLRILMEENKSVIAPMLSKHGKLWSNFWGALSPEGFYSRSEDYI 69

Query: 440 NIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIH 498
            I+   +   G+WNVPYI+  YL+K SV+++  +   ++    MD DM FC N+R++G+ 
Sbjct: 70  EIVQAKR--VGLWNVPYISQVYLVKGSVLRSKLSHLNLFVDQGMDPDMVFCKNVRDQGVF 127

Query: 499 LKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLL-PDTVNNQP 557
           + + +  E+G LV S NF+  + +P+++++  NPLDW  +YIH  Y K     D    QP
Sbjct: 128 MFVSNRDEFGRLVASSNFNTSRLHPDMWQIFDNPLDWREKYIHENYSKIFEDQDGFVEQP 187

Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGV 617
           CPDV+WFP  +EK C + V+ ME YG WS G++ D+RL  GYE VPT DIHM Q+G    
Sbjct: 188 CPDVYWFPAFSEKMCDQLVETMEDYGVWSGGSHKDERLSGGYENVPTVDIHMNQIGFEKE 247

Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
           W +FL+ Y+ P+ E+ + GY  +  +A M+FVVRYRPDEQPSLRPHHDSST+TINIALN+
Sbjct: 248 WLKFLKDYIAPVTEKLYPGYFPK-AQAIMNFVVRYRPDEQPSLRPHHDSSTFTINIALNR 306

Query: 678 VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
            GVDYEGGGCRF+RY+C V + R GW  MHPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 307 KGVDYEGGGCRFLRYDCKVESPRKGWSFMHPGRLTHYHEGLPTTRGTRYIMVSFVDP 363


>gi|149420843|ref|XP_001508185.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like,
           partial [Ornithorhynchus anatinus]
          Length = 402

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 189/406 (46%), Positives = 277/406 (68%), Gaps = 6/406 (1%)

Query: 331 QEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDS 389
           +++H    + ++      +  V+ +  +  V + +ARN+  +     +   +YF VD+D 
Sbjct: 1   EQHHKAQVERFVAEHGGEYHAVQLVGPDQRVENAQARNMGADLCRKDRDCTYYFSVDADV 60

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NP+ L+ L+ +N+++IAP++ RP + WSNFWGAL+ DGFYARS DY++I+ G +   
Sbjct: 61  ALKNPETLRLLIEQNKAVIAPMMSRPGRLWSNFWGALSVDGFYARSEDYVDIVQGRR--V 118

Query: 450 GIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYG 508
           G+WNVPYI++ YL+K S +++    + ++    +D DMAFC+N+R + + + + + Q +G
Sbjct: 119 GVWNVPYISSIYLVKGSSLRSDLRQEDLFHSGKLDPDMAFCSNVRQQDVFMFLTNRQPFG 178

Query: 509 HLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVT 568
           HL+  EN+     + +++E+  NP DW  +YIH  Y  ++L   +   PCPDV+WFPI T
Sbjct: 179 HLLSLENYQTTHLHNDLWEVFSNPEDWKEKYIHENY-TAVLKGKLVETPCPDVYWFPIFT 237

Query: 569 EKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVP 628
           E  C E V+ ME +GQWS G N D RL+ GYE VPT DIHM Q+     W +FL +Y+ P
Sbjct: 238 EVACDELVEEMEHFGQWSAGDNKDSRLQGGYENVPTIDIHMNQISFEREWHKFLVEYIAP 297

Query: 629 LQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCR 688
           + E+ + GY+ +  +  ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+VGVDYEGGGCR
Sbjct: 298 ITEKLYPGYYTK-AQFDLAFVVRYKPDEQPSLMPHHDASTFTLNIALNRVGVDYEGGGCR 356

Query: 689 FIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           F+RYNC+V A R GW LMHPGRLTHYHEGL  T+GTRYI +SF+DP
Sbjct: 357 FLRYNCSVKAPRKGWTLMHPGRLTHYHEGLPTTKGTRYIAVSFLDP 402


>gi|4884200|emb|CAB43221.1| hypothetical protein [Homo sapiens]
          Length = 365

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 182/357 (50%), Positives = 250/357 (70%), Gaps = 4/357 (1%)

Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDY 438
            +FYF +D+D+ L N   L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY
Sbjct: 12  CEFYFSLDADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDY 71

Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGI 497
           + ++   +   G+WNVPYI+  Y+++   ++     + +++ +  D DMAFC + R+KGI
Sbjct: 72  VELVQRKR--VGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGI 129

Query: 498 HLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQP 557
            L + +  E+G L+ +  +D +  +P+++++  NP+ W  +YIH  Y ++L  + +  QP
Sbjct: 130 FLHLSNQHEFGRLLATSRYDTEHLHPDLWQIFDNPVGWKEQYIHENYSRALEGEGIVEQP 189

Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGV 617
           CPDV+WFP+++E+ C E V  ME YGQWS G + D RL  GYE VPT DIHMKQVG    
Sbjct: 190 CPDVYWFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQ 249

Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
           W + LR YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN 
Sbjct: 250 WLQLLRTYVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNH 308

Query: 678 VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
            G+DYEGGGCRF+RY+C +++ R GW L+HPGRLTHYHEGL  T GTRYIM+SFVDP
Sbjct: 309 KGLDYEGGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP 365


>gi|449512121|ref|XP_002188714.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like,
           partial [Taeniopygia guttata]
          Length = 466

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/467 (41%), Positives = 302/467 (64%), Gaps = 9/467 (1%)

Query: 221 EDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK-TSGCTRCNL- 278
           ++I L F+ +  V   N  Y+T PV+IHGNG +K++LN  GNY+ + W   +GCT C+  
Sbjct: 5   DEIVLKFE-NSRVRARNLLYDTLPVVIHGNGPTKLQLNYLGNYIPQIWTFETGCTVCDEG 63

Query: 279 IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLF 338
           ++ L   K +  P +LI +FI++PT FL +F  ++ NL+YP ++I +F++N++E+H    
Sbjct: 64  LRSLLGFKDEALPMILIGIFIEQPTPFLSQFFLRLRNLHYPKQRIQLFIHNHEEHHLMEV 123

Query: 339 DDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVL 397
           D ++      +  V+ I  +  V + EARNL ++        D+YF +D++  L N + L
Sbjct: 124 DSFVEEHGREYLTVQVIGPDDEVENAEARNLGMDLCRKDPDCDYYFSLDAEVVLKNTETL 183

Query: 398 KYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYI 457
           + L+ +N+ +IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+   +   G+WNVPYI
Sbjct: 184 RILIEQNKLVIAPLVSRHEKLWSNFWGALSPDGYYARSEDYVDIVQRRR--VGLWNVPYI 241

Query: 458 TNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENF 516
           ++ YL+K   +++      ++    +D DMAFC N+RN+G+ + + +  ++GH++  EN+
Sbjct: 242 SSVYLVKGKALRSELEQGDLFHSGKLDADMAFCHNIRNQGVFMYLTNQHQFGHILSLENY 301

Query: 517 DPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFV 576
                + +++++  NP DW  +YIH  Y  +L    V   PCPDV+WFPI T+  C E V
Sbjct: 302 QTSHLHNDLWQIFSNPEDWREKYIHENYTAALKGKLVE-MPCPDVYWFPIFTDTACDELV 360

Query: 577 QIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIG 636
           + ME YGQWS G N D R++ GYE VPT DIHM Q+G    W +FL  Y+ P+ E+ + G
Sbjct: 361 EEMEHYGQWSTGDNTDSRIQGGYENVPTIDIHMNQIGFEREWYKFLLDYIAPITEKLYPG 420

Query: 637 YHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
           Y+ +  +  ++FVVRY+PDEQPSL PHHD+ST+TINIALN+VG+DYE
Sbjct: 421 YYTK-TQFELAFVVRYKPDEQPSLVPHHDASTFTINIALNRVGIDYE 466


>gi|16307441|gb|AAH10268.1| Plod1 protein [Mus musculus]
          Length = 364

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 182/356 (51%), Positives = 252/356 (70%), Gaps = 7/356 (1%)

Query: 381 FYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMN 440
           +YF VD+D  L  P+ L+ L+ +N+++IAPL+ R  + WSNFWG L+ADG+YARS DY++
Sbjct: 14  YYFSVDADVALTEPNSLRLLIEQNKNVIAPLMTRHGRLWSNFWGGLSADGYYARSEDYVD 73

Query: 441 IINGDQGGKGIWNVPYITNCYLMKTSVIKA--TNIKTIYTLNSMDYDMAFCTNLRNKGIH 498
           I+ G +   G+WNVPYI+N YL+K S ++A   N+  ++  + +D DM+FC N+R + + 
Sbjct: 74  IVQGRR--VGVWNVPYISNIYLIKGSALRAELQNVD-LFHYSKLDSDMSFCANVRQQEVF 130

Query: 499 LKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPC 558
           + + +   +GHL+  +N+     + +++E+  NP DW  +YIH  Y K+L    V   PC
Sbjct: 131 MFLTNRHTFGHLLSLDNYQTTHLHNDLWEVFSNPEDWKEKYIHENYTKALAGKLVET-PC 189

Query: 559 PDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW 618
           PDV+WFPI TE  C E V+ ME YGQWS G N D R++ GYE VPT DIHM Q+     W
Sbjct: 190 PDVYWFPIFTEAACDELVEEMEHYGQWSLGDNKDNRIQGGYENVPTIDIHMNQITFEREW 249

Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
            +FL +Y+ P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+T+NIALN+V
Sbjct: 250 HKFLVEYIAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFTVNIALNRV 308

Query: 679 GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           G DYEGGGCRF+RYNC+V A R GW L+HPGRLTHYHEGL  T+GTRYI +SFVDP
Sbjct: 309 GEDYEGGGCRFLRYNCSVRAPRKGWALLHPGRLTHYHEGLPTTKGTRYIAVSFVDP 364


>gi|339522069|gb|AEJ84199.1| procollagen-lysine 2-oxoglutarate 5-dioxygenase 1 [Capra hircus]
          Length = 365

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 179/357 (50%), Positives = 245/357 (68%), Gaps = 4/357 (1%)

Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDY 438
            +FYF +DSD+ + NP  L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY
Sbjct: 12  CEFYFSLDSDTVITNPQPLRILIEANRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDY 71

Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGI 497
           + ++   +   G+WNVPYI+  Y+++   ++     + +++    D DMAFC +LR+KGI
Sbjct: 72  VELVQRKR--VGVWNVPYISQAYVIRGEPLRTELPQREVFSGGDTDPDMAFCKSLRDKGI 129

Query: 498 HLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQP 557
            L + +  E+  L+ +  +D    +P+++++  NPLDW  +YIH  Y ++L  + +  QP
Sbjct: 130 FLHLSNQHEFARLLATSRYDTDHLHPDLWQIFDNPLDWKEQYIHENYSRALEGEGLVEQP 189

Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGV 617
           CPDV+WFP+ +E+ C E V+ ME +GQWS G + D RL  GYE VPT DIH KQVG    
Sbjct: 190 CPDVYWFPLPSERMCDELVEEMEHFGQWSGGRHEDSRLAGGYENVPTVDIHRKQVGYEAQ 249

Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
           W + LR YV P+ E     YH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN 
Sbjct: 250 WLQLLRTYVGPMTESLSPAYHTK-TRAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNH 308

Query: 678 VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
            G+DYEGGGCRF RY+C +++ R GW L+HPGRLTHYHEGL  T+G RY M+SFVDP
Sbjct: 309 KGLDYEGGGCRFRRYDCVISSPRKGWGLLHPGRLTHYHEGLPTTRGPRYTMVSFVDP 365


>gi|344245759|gb|EGW01863.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Cricetulus
           griseus]
          Length = 322

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 174/325 (53%), Positives = 232/325 (71%), Gaps = 4/325 (1%)

Query: 411 LLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA 470
           +L R  K WSNFWGAL+ D +YARS DY+ ++   +   G+WNVPYI+  Y+++   ++ 
Sbjct: 1   MLSRHGKLWSNFWGALSPDEYYARSEDYVELVQRKR--VGVWNVPYISQAYVIRGETLRT 58

Query: 471 T-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELI 529
               K +++ +  D DMAFC +LR+KGI L + +  E+G L+ +  +D    +P+++++ 
Sbjct: 59  ELPQKEVFSGSDTDPDMAFCKSLRDKGIFLHLSNQHEFGRLLATSRYDTDHLHPDLWQIF 118

Query: 530 RNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGT 589
            NP+DW  +YIH  Y ++L    +  QPCPDV+WFP++TE+ C E V+ ME YGQWS G 
Sbjct: 119 DNPVDWKEQYIHENYSRALDGQGLVEQPCPDVYWFPLLTEQMCDELVEEMEHYGQWSGGR 178

Query: 590 NNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFV 649
           + D RL  GYE VPT DIHMKQVG    W + LR YV P+ E  F GYH +  RA M+FV
Sbjct: 179 HEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLRTYVGPMTEYLFPGYHTK-TRAVMNFV 237

Query: 650 VRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPG 709
           VRYRPDEQPSLRPHHDSST+T+N+ALN  GVDYEGGGCRF+RY+C +++ R GW L+HPG
Sbjct: 238 VRYRPDEQPSLRPHHDSSTFTLNVALNHKGVDYEGGGCRFLRYDCRISSPRKGWALLHPG 297

Query: 710 RLTHYHEGLQVTQGTRYIMISFVDP 734
           RLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 298 RLTHYHEGLPTTRGTRYIMVSFVDP 322


>gi|193785082|dbj|BAG54235.1| unnamed protein product [Homo sapiens]
          Length = 418

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 178/377 (47%), Positives = 250/377 (66%), Gaps = 26/377 (6%)

Query: 380 DFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYM 439
           D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS DY+
Sbjct: 46  DYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYV 105

Query: 440 NIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---- 494
           +I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R     
Sbjct: 106 DIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMTLQ 163

Query: 495 -----------------KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDL 537
                            KG+ + I +  E+G L+ + N++    N +++++  NP+DW  
Sbjct: 164 REKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKE 223

Query: 538 RYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET 597
           +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG+WS G ++D R+  
Sbjct: 224 KYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISG 282

Query: 598 GYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQ 657
           GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q
Sbjct: 283 GYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQ 341

Query: 658 PSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEG 717
            SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRLTH HEG
Sbjct: 342 RSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEG 401

Query: 718 LQVTQGTRYIMISFVDP 734
           L V  GTRYI +SF+DP
Sbjct: 402 LPVKNGTRYIAVSFIDP 418


>gi|313240887|emb|CBY33173.1| unnamed protein product [Oikopleura dioica]
          Length = 590

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 219/588 (37%), Positives = 340/588 (57%), Gaps = 29/588 (4%)

Query: 18  FISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS- 76
            +S+  N V N  E   LVITVA+ +TDGY R+ +S   + L+ +T G+ + WLGGD++ 
Sbjct: 6   LVSLLSNSVLNARE--LLVITVATEKTDGYLRWEESVRYSGLKSRTFGIGEDWLGGDLTN 63

Query: 77  SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFN------TFDANI 130
             GGG+KVNLLK EL E     ++  L TD+YDVII+G   +I  RF+       +  N+
Sbjct: 64  GPGGGHKVNLLKKELAEYKGNSELYFLFTDAYDVIINGKEEEIFSRFDDIVSKVEYKTNV 123

Query: 131 VFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYA 190
           +  AE L WPD SL  KYP V  G R+L SG  +  A    +L+  R+I + +DDQL+Y 
Sbjct: 124 LISAEDLIWPDASLEPKYPLV-LGKRFLCSGAILARADVFLDLLEYRAIGDRDDDQLFYT 182

Query: 191 LLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLT--NTKYNTNPVIIH 248
             FL++ L+ K  I LD  A LF NL G+LE++ ++F        T  NTKY T P++IH
Sbjct: 183 EAFLNKELKEKFGIALDHKAELFFNLNGALEEVGIDFARSATGDNTVENTKYRTKPLVIH 242

Query: 249 GNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKHLDSLKPDQFPS--VLISVFIDKPTAF 305
           GNG SK ELN   NY+ + W+   GC  C+ + + + +K D   S  ++I+  ID  T F
Sbjct: 243 GNGPSKNELNRISNYVPQGWRPDYGCPACSKVLN-EEIKEDIDTSKDIVIAFIIDGITPF 301

Query: 306 LEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKE 365
           ++  L +IA+L+YPA+K  + +Y+N  +     D ++  F + +K+ K+I+    ++   
Sbjct: 302 VQNSLKRIASLDYPAEKTHLLIYSNTVWADERVDTFLEVFGSSYKSTKFISSKEKMSVTM 361

Query: 366 ARNLAVENSLHK-GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           AR  A++ +  K   +F FYVD    L NP V+  L+  N  L+AP + R  K WSN+WG
Sbjct: 362 ARKFALQLTDEKFSAEFVFYVDGYVQLTNPAVIGELIKTNVELVAPGMSRYGKLWSNYWG 421

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-- 482
           A+ +DGFY+RS DY++I+ G +   GIWN+P++   YL+  ++  A ++  I+   S   
Sbjct: 422 AVASDGFYSRSDDYLDIVQGTR--VGIWNMPFVNGAYLVHKNL--AADLIDIFAGISQSP 477

Query: 483 ------DYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWD 536
                 D D+ F +NLR  GI + + +   +G LVD E+    + +PE+++   N  DW+
Sbjct: 478 WQGKFNDPDLDFASNLRTLGIFMHVTNQAYWGRLVDREHMPVDRIHPELWQPEWNRPDWE 537

Query: 537 LRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQ 584
             Y+  +Y + L P+T  ++PCPDV  FP ++ K   + ++ ME YG+
Sbjct: 538 EDYLDTDYWRVLEPETEMDEPCPDVVAFPFLSSKGGFDMIEEMEHYGK 585


>gi|21428536|gb|AAM49928.1| LD37702p [Drosophila melanogaster]
          Length = 280

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 171/284 (60%), Positives = 212/284 (74%), Gaps = 4/284 (1%)

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
           ++NVP++T+ YL+K +   A + K        D DMA C +LRN GI +   + + +GHL
Sbjct: 1   MFNVPHVTSIYLVKKTAFDAISFKH----KEFDPDMAMCESLRNAGIFMYASNLRIFGHL 56

Query: 511 VDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEK 570
           V++++F+   T P+ Y L  N +DW  +YIHP Y   L       QPCPDV+WF IV++ 
Sbjct: 57  VNADDFNTTVTRPDFYTLFSNEIDWTEKYIHPNYSLQLNESNKIQQPCPDVYWFQIVSDA 116

Query: 571 FCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQ 630
           FC + V IMEA+  WSDG+NND RLE GYEAVPTRDIHMKQVGL  ++ +FL+ +V PLQ
Sbjct: 117 FCDDLVAIMEAHNGWSDGSNNDNRLEGGYEAVPTRDIHMKQVGLERLYLKFLQMFVRPLQ 176

Query: 631 EREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI 690
           ER F GY H P RA M+F+VRYRPDEQPSLRPHHDSSTYTINIA+N+ G+DY+GGGCRFI
Sbjct: 177 ERAFTGYFHNPPRALMNFMVRYRPDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRFI 236

Query: 691 RYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           RYNC+VT T+ GWMLMHPGRLTHYHEGL VT GTRYIMISF+DP
Sbjct: 237 RYNCSVTDTKKGWMLMHPGRLTHYHEGLLVTNGTRYIMISFIDP 280


>gi|291413228|ref|XP_002722881.1| PREDICTED: procollagen-lysine, 2-oxoglutarate 5-dioxygenase 3
           [Oryctolagus cuniculus]
          Length = 530

 Score =  352 bits (902), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 154/290 (53%), Positives = 209/290 (72%), Gaps = 2/290 (0%)

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
           +  +G+WNVPYI   Y+++   ++     + +++ +  D DMAFC +LR++GI L + + 
Sbjct: 242 RASRGVWNVPYIAQAYVIRGETLRTELPQREVFSSSDTDPDMAFCKSLRDQGIFLHLSNR 301

Query: 505 QEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
            E+G L+ +  +D    +P+++++  NP+DW  +YIH  Y ++L  D +  QPCPDV+WF
Sbjct: 302 HEFGRLLATSRYDTDHLHPDLWQIFDNPVDWKEQYIHENYSRALEGDGMVEQPCPDVYWF 361

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRK 624
           P+++E+ C E V+ ME YGQWS G + D RL  GYE VPT DIHMKQVG    W + LR 
Sbjct: 362 PLLSEQMCDELVEEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLRT 421

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           YV P+ E  F GYH +  RA M+FVVRYRPDEQPSLRPHHDSST+T+N+ALN  G+DYEG
Sbjct: 422 YVGPMTESLFPGYHTK-ARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYEG 480

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           GGCRF+RY+C ++A R GW L+HPGRLTHYHEGL  T+GTRYIM+SFVDP
Sbjct: 481 GGCRFLRYDCVISAPRKGWGLLHPGRLTHYHEGLPTTRGTRYIMVSFVDP 530



 Score =  194 bits (492), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 92/193 (47%), Positives = 137/193 (70%), Gaps = 1/193 (0%)

Query: 30  DEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLK 88
           + +K LVITVA+ ET+GY+RF++SAEV    V+TLGL Q W GGD++ ++GGG KV  LK
Sbjct: 42  EPEKLLVITVATAETEGYRRFLRSAEVFNYTVRTLGLGQEWRGGDVARTVGGGQKVRWLK 101

Query: 89  NELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKY 148
            E+++    +DM+I+  DSYDVI+ G  +++L++F    + ++F AE  CWP+  L ++Y
Sbjct: 102 KEMEQYADREDMVIMFVDSYDVILAGSPSELLKKFVQSGSRLLFSAESFCWPEWGLAEQY 161

Query: 149 PAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDT 208
           P VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K K+ LD 
Sbjct: 162 PEVGTGKRFLNSGGFIGFAPTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLKLNLDH 221

Query: 209 LANLFQNLYGSLE 221
            + +FQNL G+LE
Sbjct: 222 KSRIFQNLNGALE 234


>gi|47223418|emb|CAG04279.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 561

 Score =  341 bits (875), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 174/415 (41%), Positives = 254/415 (61%), Gaps = 5/415 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNLLKNE 90
           +K LV+TVA+ ETDG++RF+QSA      VK LG+ + W GGD+ +S+GGG KV LLK  
Sbjct: 1   EKLLVLTVATEETDGFQRFMQSAHYFNYSVKVLGMGEAWKGGDVGNSIGGGQKVRLLKEA 60

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           +  +   +D+++L  DSYD+I  GG  +IL +F   +  ++F AE L WPD  L DKYP 
Sbjct: 61  MKALADQEDLVVLFVDSYDLIFAGGPEEILRKFQQANHKVLFAAEGLIWPDKRLADKYPL 120

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V SG RYLNSGGF+GYA  I +L+S  ++ + +DDQL+Y  +++D   R    + LD   
Sbjct: 121 VRSGKRYLNSGGFMGYAPPINQLVSQWNLHDNDDDQLFYTKIYVDPLQRQTLNMTLDHKC 180

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +F  L G+ +++ L F  D  V + NT +++ P ++HGN  +KI LN  GNY+   W  
Sbjct: 181 QIFLTLNGAADEVLLKFGTDR-VRVRNTAHDSLPAVVHGNRNTKIFLNYLGNYVPHMWNY 239

Query: 270 TSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN 329
             GC+ C+    LD  +   +PSVL+ VFI+KPT FL EF  ++ +L+YP  K+ +F++N
Sbjct: 240 EHGCSHCD-KDILDLAQLKDYPSVLVGVFIEKPTPFLPEFFQRLLSLDYPKDKMKVFIHN 298

Query: 330 NQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGV-DFYFYVDSD 388
           N+ YH      +    +  F N K +     ++  EARN+ ++        DFYF +DSD
Sbjct: 299 NEVYHEKHIQKFWEENRNTFINFKIVGPEENLSQGEARNMGMDLCRKDATCDFYFSLDSD 358

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
             L N   LK LV +N  +I PL+ R  K WSNFWGAL+ DG+YARS DY++I+ 
Sbjct: 359 VMLTNSQTLKLLVEQNRKIIGPLVTRHSKLWSNFWGALSPDGYYARSEDYIDIVQ 413



 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 48/71 (67%), Positives = 56/71 (78%)

Query: 664 HDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
           HDSST+TINIALN    D++GGGCRF RYNC++ + R GW  MHPGRLTH HEGL  T G
Sbjct: 491 HDSSTFTINIALNNKETDFQGGGCRFHRYNCSIESPRKGWSFMHPGRLTHLHEGLPTTNG 550

Query: 724 TRYIMISFVDP 734
           TRYI +SF+DP
Sbjct: 551 TRYIAVSFIDP 561


>gi|344254154|gb|EGW10258.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Cricetulus
           griseus]
          Length = 587

 Score =  336 bits (861), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 177/447 (39%), Positives = 266/447 (59%), Gaps = 8/447 (1%)

Query: 51  IQSAEVNKLQVKTLGLHQPWLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
           + SA+     VK LG  Q W GGD ++S+GGG KV L+K  + +    +D++IL T+ +D
Sbjct: 1   MNSAKYFNYTVKVLGQGQEWRGGDGINSIGGGQKVRLMKEAMAQYASQEDLVILFTECFD 60

Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKD 169
           V+  GG  ++L++F   +  IVF A+ + WPD  L +KYP V  G RYLNSGGFIGYA  
Sbjct: 61  VVFAGGPEEVLKKFQKTNHKIVFAADGILWPDKRLAEKYPVVHIGKRYLNSGGFIGYAPY 120

Query: 170 IKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDL 229
           I  L+   ++++ +DDQL+Y  +++D   R    I LD    +FQ L G+ +++ L F+ 
Sbjct: 121 ISHLVQEWNLQDNDDDQLFYTKVYIDPVKREAFNITLDHKCKIFQALNGATDEVVLKFEN 180

Query: 230 DEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPD 288
            +   + NT Y T PV I+GNG +KI LN FGNY+  SW +  GC  C+    +D    D
Sbjct: 181 GK-SRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQEHGCALCDF-DTIDLSAVD 238

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
             P V I VFI++PT FL  FLN + +L+YP + + +F++N + YH      +    K  
Sbjct: 239 VHPKVTIGVFIEQPTPFLPRFLNLLLSLDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHE 298

Query: 349 FKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVNRNESL 407
              +K +     ++  EARN+ ++     +  D+YF VD+D  L NP  LK L+ +N  +
Sbjct: 299 ISTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSVDADVVLTNPRTLKNLIEQNRKI 358

Query: 408 IAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
           IAPL+ R  K WSNFWGAL+ DG+YARS DY++I+ G +   GIWNVPY+ N YL++   
Sbjct: 359 IAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGKR--VGIWNVPYMANVYLIQGKT 416

Query: 468 IKA-TNIKTIYTLNSMDYDMAFCTNLR 493
           +++  + +  +  + +D DMA C N R
Sbjct: 417 LRSEMSERNYFVRDKLDPDMALCRNAR 443



 Score =  194 bits (493), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 84/143 (58%), Positives = 109/143 (76%), Gaps = 1/143 (0%)

Query: 592 DKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVR 651
           D R+  GYE VPT DIHMKQ+GL  VW  F+R+++ P+  + F GY+ +   A ++FVV+
Sbjct: 446 DSRISGGYENVPTDDIHMKQIGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVK 504

Query: 652 YRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRL 711
           Y P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW  MHPGRL
Sbjct: 505 YSPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRL 564

Query: 712 THYHEGLQVTQGTRYIMISFVDP 734
           TH HEGL V  GTRYI +SF+DP
Sbjct: 565 THLHEGLPVKNGTRYIAVSFIDP 587


>gi|326428759|gb|EGD74329.1| hypothetical protein PTSG_12434 [Salpingoeca sp. ATCC 50818]
          Length = 853

 Score =  333 bits (854), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 227/736 (30%), Positives = 361/736 (49%), Gaps = 98/736 (13%)

Query: 77  SLGGGYKVNLLKNELDEMDITD--DMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGA 134
           S G G     L++  +++  T+  D+++ + D+ D ++     ++  +FN  D  I+F A
Sbjct: 135 SAGAGLGSKELRDVAEQLANTNPNDVVLFIGDAEDTVLLAEATELTRKFNALDCGILFPA 194

Query: 135 ERLCWPDTSLYDKYPAVGSGY-RYLNSGGFIGYAKDIKELISN----RSIKN-EEDDQLY 188
              C    ++   +P V  G+ R+L    F+      K L+ +     ++ N  E DQL 
Sbjct: 195 AIRCKRRCAM--DWPLVPEGHGRFLVPSAFMAKGDKFKLLVDSFPDLSTVPNMSESDQLI 252

Query: 189 YALLFLDETLRTKHKIVLDTLANLFQNLYGSLED-------IKLNFDLDEFVHLTNTKYN 241
            AL   D   R  + + LDT   +FQ L+G  ++        +  F  +E   L N   +
Sbjct: 253 -ALFMTD---RDFYGMKLDTNFAVFQPLFGYKDEWPAAVFAAEYEFHSEEDTRLRNKDTS 308

Query: 242 TNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDK 301
             P I+  +G +K+ L    NY+   W     T+    K L+S+ P+   ++  +V  + 
Sbjct: 309 EYPGILISHGNTKL-LTQLNNYMPLKWHPD--TQSLTSKTLESVDPNAAVTIAFNVLPES 365

Query: 302 PTAFLEEFLNKIANLN----YPAKKISMFVYNNQEYHAPLFDDYIHNF----KTMFKNVK 353
           P  FL+  L+ IA  +    +P   ++  V N    HA  + + + NF    K +F  V 
Sbjct: 366 P--FLQLVLDGIAAQDLLRTHPVTFVAAVVDNP---HAHTYVELVQNFTRDNKQLFAGVT 420

Query: 354 YIAHNSTVNSKEA-RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL 412
            + H    +S++A R L       +      Y  S + L N  VL  L+ ++  +++P++
Sbjct: 421 -VLHEPQEDSEQAMRKLFTVAMETQPTTHVLYHTSAARLMNSTVLGELLAQDLRVVSPMM 479

Query: 413 VRPFKAWSNFWGALNADG------------------------------------------ 430
            R    +SNFWGA   D                                           
Sbjct: 480 TREASFFSNFWGAATGDRDAQCFDDSAQCEAWAVAGECTKNEPYMKKHCQRSCEVCHAQG 539

Query: 431 -----FYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS----VIKATNIKTIYTLNS 481
                 Y RS DYM+II  +Q   G W VP ++   LMK +    V+KA +       + 
Sbjct: 540 APENIKYRRSADYMSIIKAEQ--TGTWAVPLVSEVILMKLNAFNIVVKALSQLETQPGSP 597

Query: 482 MDYDMAFCT----NLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDL 537
           + +D          LR+  + L +D+   YG L++ + F+    +P+V+ L  N   W  
Sbjct: 598 LRFDFPLTAYLLDQLRSSKVKLHVDNRHFYGLLINPDGFNANSVHPDVFLLAGNEQHWRD 657

Query: 538 RYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET 597
            YIHP+Y+     + V  + C D++ FP+ +E FC  F+ + EA G WS G+N+D RL++
Sbjct: 658 LYIHPDYEPYKKLEFVQGR-CWDIYNFPLFSELFCAHFIDVSEAVGTWSSGSNSDDRLKS 716

Query: 598 GYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQ 657
           GYE VPTRDIH  Q+G    W   LR++V P+ E +++GY  +  R  + FVV+Y+P+ Q
Sbjct: 717 GYEPVPTRDIHFNQMGFQETWTAILRRFVAPVAETQWVGYKLDG-RVTLDFVVKYQPEGQ 775

Query: 658 PSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEG 717
           P LR HHD+ST+++N+ALN++G D+EGGG RF R NC V   +MG  L+HPGRLTH HEG
Sbjct: 776 PFLRKHHDASTFSLNVALNRIGEDFEGGGTRFTRQNCTVLTNKMGHALIHPGRLTHQHEG 835

Query: 718 LQVTQGTRYIMISFVD 733
           L VT+GTRYI++SFVD
Sbjct: 836 LYVTKGTRYIIVSFVD 851


>gi|345322955|ref|XP_001506269.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           [Ornithorhynchus anatinus]
          Length = 501

 Score =  332 bits (852), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 169/465 (36%), Positives = 271/465 (58%), Gaps = 28/465 (6%)

Query: 106 DSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIG 165
           +SYDVI  GG  ++L +F   +  +VF A+ L WPD  L DKYP V  G R+LNSGGFIG
Sbjct: 26  ESYDVIFAGGPEELLRKFQKINHKVVFAADGLLWPDKRLADKYPIVHIGKRFLNSGGFIG 85

Query: 166 YAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKL 225
           Y   + +++   ++++ +DDQL+Y  +++D   R    I LD    +FQNL G+++++ L
Sbjct: 86  YGPSVNQIVQQWNLQDSDDDQLFYTKIYIDSIKRKAINITLDHKCRIFQNLNGAIDEVLL 145

Query: 226 NFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT-SGCTRCNLIKHLDS 284
            F+  + V   N+ Y T PV I+GNG +K +LN FGNY+  +W   +GCT CNL   +D 
Sbjct: 146 KFENGK-VRAKNSFYETLPVAINGNGPTKNQLNYFGNYIPNAWTIENGCTTCNL-DMIDL 203

Query: 285 LKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHN 344
                +P V I VFI++PT FL  FL+ +  L+YP + +S+F++NN+ YH      +   
Sbjct: 204 TSSKDYPKVTIGVFIEQPTPFLPRFLDLLLTLDYPKEALSLFIHNNEVYHEKHIKAFWEK 263

Query: 345 FKTMFKNVKYIAHNSTVNSKEARNLAVE-NSLHKGVDFYFYVDSDSHLDNPDVLKYLVNR 403
            K +   +K +    +++  EARN+ ++    ++  D+YF +D+D  L NP  L+ L+ +
Sbjct: 264 AKNIITTIKIVGPEESLSQAEARNMGMDVCRQNEHCDYYFSLDADVVLTNPSTLRLLIEQ 323

Query: 404 NESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
           N  +IAPL+ R  K WSNFWG L+ DG+YARS DY++I+ G++   G+WN+PY+ N YL+
Sbjct: 324 NRKIIAPLVTRHGKLWSNFWGTLSPDGYYARSEDYVDIVQGNR--VGLWNIPYMANVYLI 381

Query: 464 KTSVIKA-TNIKTIYTLNSMDYDMAFCTNLRN---------------------KGIHLKI 501
           K   ++A    +  +  + +D DMA C N R                      KG+ + I
Sbjct: 382 KGQTLRAEMKERNYFVRDKLDPDMALCKNAREMTLQREKDSPSPETFHMLRPPKGVFMYI 441

Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQK 546
            +  E+G L+ + N++    N +++++  NP+DW  +YI+  Y K
Sbjct: 442 SNRHEFGRLLSTANYNITHYNNDLWQIFENPVDWKEKYINRNYSK 486


>gi|241633659|ref|XP_002408696.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase, putative [Ixodes
           scapularis]
 gi|215501230|gb|EEC10724.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase, putative [Ixodes
           scapularis]
          Length = 285

 Score =  328 bits (840), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 154/286 (53%), Positives = 199/286 (69%), Gaps = 2/286 (0%)

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGH 509
           G+WNVP+I   YL+  +++ + +    +    +D DMAFC N+R KGI + + +   YGH
Sbjct: 1   GLWNVPFINTVYLINGTLLHSKDKFPSFISGLLDPDMAFCKNMREKGIFMYVTNMDTYGH 60

Query: 510 LVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTE 569
           LV+ E FD +  NP+ YE+  N +DW+ RYIH  Y K L PD   + PCPDV+WFP+VT+
Sbjct: 61  LVNPETFDLKLKNPDFYEIYSNQMDWERRYIHENYSKVLEPDFKVDMPCPDVYWFPVVTD 120

Query: 570 KFCHEFVQIMEAYGQWSDGTNNDKRLETGYE-AVPTRDIHMKQVGLAGVWAEFLRKYVVP 628
            FC   ++IME +GQWS G N  K L   Y+ +   + IH    G+   W  FLR+Y+ P
Sbjct: 121 IFCRHMIEIMENFGQWSSGKNEVKFLFFLYQQSSANKFIHFFIKGVQH-WLFFLREYIKP 179

Query: 629 LQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCR 688
           +QE+ F+GY H+P RA M+FVVRY PDEQ  LRPHHDSSTYTINIALN+  +DYEGGGC 
Sbjct: 180 VQEKVFLGYFHDPPRAIMNFVVRYHPDEQYFLRPHHDSSTYTINIALNRPKIDYEGGGCN 239

Query: 689 FIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           F+RYNC+V   + GW LMHPGRLTHYHEGL VT+GTRYIM+SFVDP
Sbjct: 240 FLRYNCSVVDLKRGWSLMHPGRLTHYHEGLPVTKGTRYIMVSFVDP 285


>gi|444510095|gb|ELV09466.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Tupaia
           chinensis]
          Length = 558

 Score =  319 bits (817), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 182/479 (37%), Positives = 265/479 (55%), Gaps = 50/479 (10%)

Query: 19  ISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGD-MSS 77
           + V   K  +I  DK LVITVA+ E+DG+ RF+QSA+     VK LG  + W GGD ++S
Sbjct: 24  LGVDSEKPVSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINS 83

Query: 78  LGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERL 137
           +GGG KV L+K  L +    DD+++L T+ +DV+  GG  ++L++F   +  +VF A+ +
Sbjct: 84  IGGGQKVRLMKEALGQYASQDDLVVLFTECFDVVFAGGPEEVLKKFQKTNHKVVFAADGI 143

Query: 138 CWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDET 197
            WPD  L DKYP V  G RYLNSGGFIGYA  I  ++   ++++ +DDQL+Y  +++D  
Sbjct: 144 LWPDKRLADKYPTVHFGKRYLNSGGFIGYAPYINRIVQQWNLQDNDDDQLFYTKIYIDPL 203

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
            R    I LD    +FQ L G+ +++ L F+  +     NT Y T PV I+GNG +KI L
Sbjct: 204 KREAINITLDHKCKIFQTLNGATDEVVLKFENGK-ARAKNTFYETLPVTINGNGPTKILL 262

Query: 258 NSFGNYLAKSW-KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANL 316
           N FGNY+  SW + +GCT C      D++       V                       
Sbjct: 263 NYFGNYIPNSWTQENGCTHC----ESDTINLSAVDEV----------------------- 295

Query: 317 NYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH 376
            Y  K I +F           FD   H   T    +K +     +   EARN+ ++    
Sbjct: 296 -YHEKDIKVF-----------FDKAKHEIST----IKIVGPEENLRQAEARNMGMDFCRQ 339

Query: 377 -KGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+YF VD+D  L NP  LK L+ +N  +IAPL+ R  K WSNFWGAL+ DG+YARS
Sbjct: 340 DEKCDYYFSVDADVVLTNPKTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARS 399

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLR 493
            DY++I+ G++   G+WNVPY+ N YL+K   +++  N +  +  + +D DMA C N R
Sbjct: 400 EDYVDIVQGNR--VGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAR 456



 Score =  108 bits (271), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 50/92 (54%), Positives = 67/92 (72%), Gaps = 1/92 (1%)

Query: 592 DKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVR 651
           D R+  GYE VPT DIHMKQ+ L  VW  F+R+++ P+  + F GY+ +   A ++FVV+
Sbjct: 459 DSRISGGYENVPTDDIHMKQIDLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVK 517

Query: 652 YRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
           Y PD Q SLRPHHD+ST+TINIALN VG D++
Sbjct: 518 YSPDRQRSLRPHHDASTFTINIALNNVGEDFQ 549


>gi|339235621|ref|XP_003379365.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Trichinella
           spiralis]
 gi|316977983|gb|EFV61016.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 [Trichinella
           spiralis]
          Length = 1093

 Score =  315 bits (808), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 193/554 (34%), Positives = 280/554 (50%), Gaps = 139/554 (25%)

Query: 147 KYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
           +YP V SG RYLNSG FIGYA DI ++I+ RS+++++DDQLYY  +FLD  LR KHKI L
Sbjct: 296 EYPVVKSGKRYLNSGAFIGYAPDIYKIITERSLRDDDDDQLYYTHIFLDPALREKHKIKL 355

Query: 207 DTLANLFQNLYGSLEDIKLNFDLD----EFVHLTNTKYNTNPVIIHGNGKSKIELNSFGN 262
           D+ + +FQNL+G+++D+ L+F         V L N  Y T PVIIHGNGKSK+ LN  GN
Sbjct: 356 DSTSAIFQNLHGAVDDVDLDFSPSGHRMRQVRLANLAYGTEPVIIHGNGKSKMHLNYLGN 415

Query: 263 YLAKSWK-TSGCTRCN-LIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPA 320
           Y+   W  T GC  CN  +  L+S   + FP V+++ FI+  T FL+++   I  L+YP 
Sbjct: 416 YIGNWWNPTDGCVACNDDLLELNSDNENDFPFVVLACFINSGTPFLDKYFESILRLDYPK 475

Query: 321 KKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKE--ARNLAVENSLHKG 378
            +I + ++N    HA   + +++    M     ++  +S ++  E  AR+ AV       
Sbjct: 476 SRIGIVIFNRP--HAVKVEHFVN---LMDGEYHFVQADSAISLTERNARDRAV------- 523

Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDY 438
                                      SLIAP+++R    WSNFWGALN DGFYARS DY
Sbjct: 524 ---------------------------SLIAPMMIRGEALWSNFWGALNDDGFYARSDDY 556

Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-DYDMAFCTNLRNKGI 497
           ++I   ++   G+WN+P+ +  YL++    + + + + Y+ N   D DM+F    R K  
Sbjct: 557 ISIAKRER--LGLWNIPHFSTAYLIRKD--RLSLLLSAYSYNGKNDPDMSFTQFCREK-- 610

Query: 498 HLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQP 557
                                               +W+ RY+  +Y  +L  D     P
Sbjct: 611 ------------------------------------EWEERYLDEKYWDTLSNDYEFELP 634

Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGV 617
           CPDV+ FP+ +++FC E + +ME YG+WS G+N                           
Sbjct: 635 CPDVYHFPLFSKQFCKEMIAVMENYGRWSSGSN--------------------------- 667

Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
                                  P  A M+FVVRY+PDEQP+LRPHHD+STYT++IALN+
Sbjct: 668 ----------------------LPPHAIMNFVVRYKPDEQPALRPHHDASTYTVDIALNK 705

Query: 678 VGVDYEGGGCRFIR 691
            G D+E    R  R
Sbjct: 706 AGEDFEVQMSRVGR 719


>gi|363539911|ref|YP_004894378.1| mg327 gene product [Megavirus chiliensis]
 gi|350611076|gb|AEQ32520.1| putative procollagen-lysine 2-oxoglutarate 5-dioxygenase [Megavirus
           chiliensis]
          Length = 889

 Score =  303 bits (775), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 230/731 (31%), Positives = 362/731 (49%), Gaps = 116/731 (15%)

Query: 30  DEDK-FLVITVASNETDGYKRFIQSAEVNKLQVKTL-GLHQPWLGGDMSSLGGGYKVNLL 87
           D DK F +I +     D + RFI+  E+  L    L  ++ P    D+S++         
Sbjct: 248 DTDKEFSIIYIGPTNGDSFARFIEYCELYSLPRIILDSINMP----DISTI--------- 294

Query: 88  KNELDEMDITDDMIILV-----TDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
             +L E+D  +D + +V      D+ + I      +I+ +FN+    I      +  P+ 
Sbjct: 295 HKKLAEIDNLEDKLFVVISVLPNDNCNFIPTAPPTEIINKFNS----ICHNKNGIIIPN- 349

Query: 143 SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISN-----RSIKNEEDDQLYYALLFLDET 197
                    G   + +    F G+   I+ +I +       +KN  +  L  A++F   T
Sbjct: 350 ---------GETSKTI----FCGWGNRIQRMIQDYLDKVDIVKNITNAALSTAIIF--NT 394

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
             T   I+ D    +F  +  S +DI  N    +  H    K+ T P I+  N    + L
Sbjct: 395 FITS-DIIKDDTCQIFCCV-NSEDDIVYNTTKSKISH---KKFGTTPSILFSNEIGNLVL 449

Query: 258 NSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQ-FPSVLISVFIDKPTAFLEEFLNKIANL 316
           N   NY   +W      R       +  +P    P+V IS+  DK  + ++     I  +
Sbjct: 450 NRIQNYTGNNWNEYYGYR-------NHTEPKTILPTVYISILSDKNPSVVD----IIQTI 498

Query: 317 NYPAKKISMFV----YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE 372
           +YP + +++ +     N+  Y   L               KYIA N              
Sbjct: 499 DYPRELLTVVITKGTINDNYYQEDL--------------EKYIATN-------------- 530

Query: 373 NSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFY 432
                  ++YF+++ D  L NP+VLK L+N N+ +IAPL+ R  ++W+NFWG L+ +G+Y
Sbjct: 531 ------CEYYFFINHDCILVNPNVLKELINLNKKIIAPLIRRGDESWTNFWGDLDKNGYY 584

Query: 433 ARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-MDYDMAFCTN 491
            RS DY +IING++  +G WNVP++   YL+  SVI++  +  ++T N+ +D DM  C N
Sbjct: 585 KRSHDYFDIINGER--RGCWNVPHVFGTYLIHRSVIES--VPDMFTKNTDIDADMRMCHN 640

Query: 492 LRNKGIHLKIDSTQEYGHLVDSENFDPQ-KTNPE--VYELIRNPLDWDLRYIHPEY--QK 546
           +R   IH+ + +   YG++       P+  TN E  V++      +W+ +Y+HPEY   K
Sbjct: 641 IRQHDIHIYLSNLNSYGYIQTELQIAPEIDTNKEVTVFDFSTRRSEWEKKYLHPEYFLNK 700

Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGT--NNDKRL-ETGYEAVP 603
           + L +    + C DVF FP+ + +FC E +Q ME YG+WS G   N D RL    YE VP
Sbjct: 701 NNLKNLRCTELCNDVFNFPLFSREFCSELIQTMEKYGKWSGGAGHNIDHRLGHNYYENVP 760

Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
           T+DI + +VGL   W   + +Y+ PL    +  Y  + V   ++FVVRY   +Q  L+ H
Sbjct: 761 TQDIQLFEVGLDKHWESIVNEYIAPLVRIVYSNYKTKSVH--LAFVVRYHWQQQSELQEH 818

Query: 664 HDSSTYTINIALNQV-GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
           HD+STYTINIALN+  G DYEGGG RFIR N +     +G   +HPG+ THYH+GL+ T 
Sbjct: 819 HDASTYTINIALNEGDGKDYEGGGSRFIRQNYSSINQEIGTANLHPGKCTHYHKGLKTTA 878

Query: 723 GTRYIMISFVD 733
           G RYI++SF++
Sbjct: 879 GIRYILVSFIN 889


>gi|448825278|ref|YP_007418209.1| putative procollagen-lysine 2-oxoglutarate 5-dioxygenase [Megavirus
           lba]
 gi|444236463|gb|AGD92233.1| putative procollagen-lysine 2-oxoglutarate 5-dioxygenase [Megavirus
           lba]
          Length = 889

 Score =  302 bits (773), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 228/730 (31%), Positives = 361/730 (49%), Gaps = 114/730 (15%)

Query: 30  DEDK-FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLK 88
           D DK F +I +     D + RFI+  E+  L         P +  D  ++     ++ + 
Sbjct: 248 DTDKEFSIIYIGPTNGDSFARFIEYCELYSL---------PRIILDSINIPD---ISTIH 295

Query: 89  NELDEMDITDDMIILV-----TDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
            +L E+D  +D + +V      D+ + I      +I+ +FN+    I      +  P+  
Sbjct: 296 KKLAEIDNLEDKLFVVISVLPNDNCNFIPTAPPTEIINKFNS----ICHNKNGIIIPN-- 349

Query: 144 LYDKYPAVGSGYRYLNSGGFIGYAKDIKELISN-----RSIKNEEDDQLYYALLFLDETL 198
                   G   + +    F G+   I+ +I +       +KN  +  L  A++F   T 
Sbjct: 350 --------GETSKTI----FCGWGNRIQRMIQDYLDKVDIVKNITNAALSTAIIF--NTF 395

Query: 199 RTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELN 258
            T   I+ D    +F  +  S +DI  N    +  H    K+ T P I+  N    + LN
Sbjct: 396 ITS-DIIKDDTCQIFCCV-NSEDDIVYNTTKSKISH---KKFGTTPSILFSNEIGNLVLN 450

Query: 259 SFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQ-FPSVLISVFIDKPTAFLEEFLNKIANLN 317
              NY   +W      R       +  +P    P+V IS+  DK  + ++     I  ++
Sbjct: 451 RIQNYTGNNWNEYYGYR-------NHTEPKTILPTVYISILSDKNPSVVD----IIQTID 499

Query: 318 YPAKKISMFV----YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVEN 373
           YP + +++ +     N+  Y   L               KYIA N               
Sbjct: 500 YPRELLTVVITKGTINDNYYQEDL--------------EKYIATN--------------- 530

Query: 374 SLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYA 433
                 ++YF+++ D  L NP+VLK L+N N+ +IAPL+ R  ++W+NFWG L+ +G+Y 
Sbjct: 531 -----CEYYFFINHDCILVNPNVLKELINLNKKIIAPLIRRGDESWTNFWGDLDKNGYYK 585

Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-MDYDMAFCTNL 492
           RS DY +IING++  +G WNVP++   YL+  SVI++  +  ++T N+ +D DM  C N+
Sbjct: 586 RSHDYFDIINGER--RGCWNVPHVFGTYLIHRSVIES--VPDMFTKNTDIDADMRMCHNI 641

Query: 493 RNKGIHLKIDSTQEYGHLVDSENFDPQ-KTNPE--VYELIRNPLDWDLRYIHPEY--QKS 547
           R   IH+ + +   YG++       P+  TN E  V++      +W+ +Y+HPEY   K+
Sbjct: 642 RQHDIHIYLSNLNSYGYIQTELQIAPEIDTNKEVTVFDFSTRRSEWEKKYLHPEYFLNKN 701

Query: 548 LLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGT--NNDKRL-ETGYEAVPT 604
            L +    + C DVF FP+ + +FC E +Q ME YG+WS G   N D RL    YE VPT
Sbjct: 702 NLKNLRCTELCNDVFNFPLFSREFCSELIQTMEKYGKWSGGAGHNIDHRLGHNYYENVPT 761

Query: 605 RDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH 664
           +DI + +VGL   W   + +Y+ PL    +  Y  + V   ++FVVRY   +Q  L+ HH
Sbjct: 762 QDIQLFEVGLDKHWESIVNEYIAPLVRIVYSNYKTKSVH--LAFVVRYHWQQQSELQEHH 819

Query: 665 DSSTYTINIALNQV-GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
           D+STYTINIALN+  G DYEGGG RFIR N +     +G   +HPG+ THYH+GL+ T G
Sbjct: 820 DASTYTINIALNEGDGKDYEGGGSRFIRQNYSSINQEIGTANLHPGKCTHYHKGLKTTAG 879

Query: 724 TRYIMISFVD 733
            RYI++SF++
Sbjct: 880 IRYILVSFIN 889


>gi|371943602|gb|AEX61430.1| putative procollagen-lysine [Megavirus courdo7]
          Length = 889

 Score =  302 bits (773), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 229/731 (31%), Positives = 362/731 (49%), Gaps = 116/731 (15%)

Query: 30  DEDK-FLVITVASNETDGYKRFIQSAEVNKLQVKTL-GLHQPWLGGDMSSLGGGYKVNLL 87
           D DK F +I +     D + RFI+  E+  L    L  ++ P    D+S++         
Sbjct: 248 DTDKEFSIIYIGPTNGDSFARFIEYCELYSLPRIILDSINMP----DISTI--------- 294

Query: 88  KNELDEMDITDDMIILV-----TDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
             +L E+D  +D + +V      D+ + I      +I+ +FN+    I      +  P+ 
Sbjct: 295 HKKLAEIDNLEDKLFVVISVLPNDNCNFIPTAPPTEIINKFNS----ICHNKNGIIIPN- 349

Query: 143 SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISN-----RSIKNEEDDQLYYALLFLDET 197
                    G   + +    F G+   I+ +I +       +KN  +  L  A++F   T
Sbjct: 350 ---------GETSKTI----FCGWGNRIQRMIQDYLDKVDIVKNITNAALSTAIMF--NT 394

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
             T   I+ D    +F  +  S +DI  N    +  H    K+ T P I+  N    + L
Sbjct: 395 FITS-DIIKDDTCQIFCCV-NSEDDIIYNTTKSKISH---KKFGTTPSILFSNEIGNLVL 449

Query: 258 NSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQ-FPSVLISVFIDKPTAFLEEFLNKIANL 316
           N   NY   +W      R       +  +P    P+V IS+  DK ++ ++     I  +
Sbjct: 450 NRIQNYTGNNWNEYYGYR-------NHTEPKTILPTVYISILSDKNSSVVD----IIQTI 498

Query: 317 NYPAKKISMFV----YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE 372
           +YP + +++ +     N+  Y   L               KYIA N              
Sbjct: 499 DYPRELLTVVITKGTINDNYYQEDL--------------EKYIATN-------------- 530

Query: 373 NSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFY 432
                  ++YF+++ D  L NP+VLK L+N N+ +IAPL+ R  ++W+NFWG L+ +G+Y
Sbjct: 531 ------CEYYFFINHDCILVNPNVLKELINLNKKIIAPLIRRGDESWTNFWGDLDKNGYY 584

Query: 433 ARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-MDYDMAFCTN 491
            RS DY +IING++  +G WNVP++   YL+  SVI++  +  ++T N+ +D DM  C N
Sbjct: 585 KRSHDYFDIINGER--RGCWNVPHVFGTYLIHRSVIES--VPDMFTKNTDIDADMRMCHN 640

Query: 492 LRNKGIHLKIDSTQEYGHLVDSENFDPQ-KTNPE--VYELIRNPLDWDLRYIHPEY--QK 546
           +R   IH+ + +   YG++       P+   N E  V++      +W+ +Y+HPEY   K
Sbjct: 641 IRQHDIHIYLSNLNSYGYIQTELQIAPEIDINKEVTVFDFSTRRSEWEKKYLHPEYFLNK 700

Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGT--NNDKRL-ETGYEAVP 603
           + L +    + C DVF FP+ + +FC E +Q ME YG+WS G   N D RL    YE VP
Sbjct: 701 NNLKNLRCTELCNDVFNFPLFSREFCSELIQTMEKYGKWSGGAGHNIDHRLGHNYYENVP 760

Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
           T+DI + +VGL   W   + +Y+ PL    +  Y  + V   ++FVVRY   +Q  L+ H
Sbjct: 761 TQDIQLFEVGLDKHWESIVNEYIAPLVRIVYSNYKTKSVH--LAFVVRYHWQQQSELQEH 818

Query: 664 HDSSTYTINIALNQV-GVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
           HD+STYTINIALN+  G DYEGGG RFIR N +     +G   +HPG+ THYH+GL+ T 
Sbjct: 819 HDASTYTINIALNEGDGKDYEGGGSRFIRQNYSSINQEIGTANLHPGKCTHYHKGLKTTA 878

Query: 723 GTRYIMISFVD 733
           G RYI++SF++
Sbjct: 879 GIRYILVSFIN 889


>gi|451927620|gb|AGF85498.1| family 25 protein [Moumouvirus goulette]
          Length = 890

 Score =  300 bits (769), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 188/510 (36%), Positives = 280/510 (54%), Gaps = 59/510 (11%)

Query: 235 LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPD-QFPSV 293
           +T  K  T P +++ +G S I LN   NY   +W      R       +S +P   +P++
Sbjct: 429 ITYNKTGTMPCVLYSSGMSNIILNRIQNYTGNNWNEYYGYR-------NSSEPLLTYPTI 481

Query: 294 LISVFIDK-PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNV 352
            +S  +DK PT       N I NL YP + +++ +   Q     LF  Y  +        
Sbjct: 482 YLSFRLDKNPT-----ITNIIENLEYPKELVTINIETGQ--GGDLF--YQQDI------- 525

Query: 353 KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL 412
                N  +NSK               ++YF+V+ D  + NP +LK L+   + ++APL+
Sbjct: 526 -----NKFLNSK--------------CEYYFFVNHDCVIVNPKILKELLELGKKVVAPLV 566

Query: 413 VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATN 472
            +  ++WSNFWG L+ +G+Y RS DY +I+NG++  +G WNVPYI+  YL+  SVI+   
Sbjct: 567 RKGTESWSNFWGDLDKNGYYNRSHDYFDILNGER--RGCWNVPYISGVYLIHRSVIEL-- 622

Query: 473 IKTIYTLNS-MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQK--TNP-EVYEL 528
           +  I++ N  +D DM  C NLR   IHL + +   YG + +    DP    T P  +++ 
Sbjct: 623 VPNIFSDNEKIDIDMRMCHNLREHDIHLYVSNINSYGFIQEEIKIDPNLDLTKPLTIHDF 682

Query: 529 IRNPLDWDLRYIHPEY--QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWS 586
                +W+ +Y+HPE+   K+ L +    + C DVF FP+ +++FC E +QIME YG+WS
Sbjct: 683 STRRDEWERKYLHPEFYLNKNNLKNLRCPELCSDVFNFPLFSKEFCSELIQIMEKYGKWS 742

Query: 587 DGT--NNDKRL-ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVR 643
            GT  N D RL    YE VPT+DI + +VGL   W   +  Y+ PL +  +  Y  + V 
Sbjct: 743 GGTGHNIDHRLGHNYYENVPTQDIQLFEVGLDKHWETIVMDYIAPLVKIIYGNYKTKSVH 802

Query: 644 APMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
             ++FVVRY    Q  L+ HHD+STYT+NIALN+ G DYEGGGC FIR         +G 
Sbjct: 803 --LAFVVRYHWQFQNELQEHHDASTYTVNIALNECGTDYEGGGCEFIRQKYVAKNQEIGT 860

Query: 704 MLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
             +HPGRLTH H+GL+ T GTRYI++SF++
Sbjct: 861 SNIHPGRLTHLHKGLKTTNGTRYILVSFIN 890


>gi|441432191|ref|YP_007354233.1| Glycosyltransferase family 25 fused to procollagen lysine
           2-oxoglutarate 5-dioxygenase [Acanthamoeba polyphaga
           moumouvirus]
 gi|440383271|gb|AGC01797.1| Glycosyltransferase family 25 fused to procollagen lysine
           2-oxoglutarate 5-dioxygenase [Acanthamoeba polyphaga
           moumouvirus]
          Length = 889

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 185/510 (36%), Positives = 274/510 (53%), Gaps = 59/510 (11%)

Query: 235 LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPD-QFPSV 293
           +T+ K  + P I++ +G S I LN   NY   +W      R       +S +P   +P+V
Sbjct: 428 ITHNKTGSMPCILYSSGISNIILNRIQNYTGNNWNEYYGFR-------NSSEPLLTYPTV 480

Query: 294 LISVFIDK-PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNV 352
            +S  +DK PT       + I  L YP + +++ + N        +   I+ F       
Sbjct: 481 YLSFRLDKNPT-----ITDIIEKLEYPKELMTINIENGST-EDLFYQKDINKF------- 527

Query: 353 KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL 412
                                 L    ++YF+V+ D  L NP +LK L+   + +IAPL+
Sbjct: 528 ----------------------LESKCEYYFFVNHDCVLINPKILKELLELGKKVIAPLV 565

Query: 413 VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATN 472
            +  ++WSNFWG +  +G+Y RS DY +I+NG++  +G WNVPYI+  YL+  SVIK+  
Sbjct: 566 RKGTESWSNFWGDIQENGYYNRSHDYFDILNGER--RGCWNVPYISGVYLIHRSVIKS-- 621

Query: 473 IKTIYTLNS-MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDP--QKTNP-EVYEL 528
           I  I+  N  +D DM  C NLR   IH+ + +   YG + +    DP    T P  +++L
Sbjct: 622 IPNIFIDNEKIDVDMRICHNLRQHDIHMYVSNINSYGFIQEEIKIDPTIDLTKPVTIHDL 681

Query: 529 IRNPLDWDLRYIHPEY--QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWS 586
                +W+ +Y+HPEY   K+ L +    + C DVF FP+ +++FC E +QIME YG+WS
Sbjct: 682 FTRRDEWERKYLHPEYYLNKNNLKNLRCPELCSDVFNFPLFSKEFCSELIQIMENYGKWS 741

Query: 587 DGTNN--DKRL-ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVR 643
            GT +  D RL    YE VPT+DI + +VGL   W   +  Y+ PL    +  Y  + V 
Sbjct: 742 GGTGHHIDHRLGHNYYENVPTQDIQLFEVGLDKHWETIVMDYISPLVRIIYGNYKTKSVH 801

Query: 644 APMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
             ++FVVRY    Q  L+ HHD+STYT+NIALN+ G DYEGGGC FIR         +G 
Sbjct: 802 --LAFVVRYHWQLQNELQEHHDASTYTVNIALNECGTDYEGGGCEFIRQKYIAKNQEVGT 859

Query: 704 MLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
             +HPGRLTH H+GL+ T G RYI++SF++
Sbjct: 860 SNIHPGRLTHLHKGLKTTNGIRYILVSFIN 889


>gi|371945194|gb|AEX63014.1| putative procollagen-lysine [Moumouvirus Monve]
          Length = 889

 Score =  295 bits (756), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 184/510 (36%), Positives = 274/510 (53%), Gaps = 59/510 (11%)

Query: 235 LTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPD-QFPSV 293
           +T+ K  + P I++ +G S I LN   NY   +W      R       +S +P   +P+V
Sbjct: 428 ITHNKTGSMPCILYSSGISNIILNRIQNYTGNNWNEYYGFR-------NSSEPLLTYPTV 480

Query: 294 LISVFIDK-PTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNV 352
            +S  +DK PT       + I  L YP + +++ + N        +   I+ F       
Sbjct: 481 YLSFRLDKNPT-----ITDIIEKLEYPKELMTINIENGST-EDLFYQKDINKF------- 527

Query: 353 KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL 412
                                 L    ++YF+V+ D  L NP +LK L+   + +IAPL+
Sbjct: 528 ----------------------LESKCEYYFFVNHDCVLINPKILKELLELGKKVIAPLV 565

Query: 413 VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATN 472
            +  ++WSNFWG +  +G+Y RS DY +I+NG++  +G WNVPYI+  YL+  SVI++  
Sbjct: 566 RKGTESWSNFWGDIQENGYYNRSHDYFDILNGER--RGCWNVPYISGVYLIHRSVIES-- 621

Query: 473 IKTIYTLNS-MDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDP--QKTNP-EVYEL 528
           I  I+  N  +D DM  C NLR   IH+ + +   YG + +    DP    T P  +++L
Sbjct: 622 IPNIFIDNEKIDVDMRICHNLRQHDIHMYVSNINSYGFIQEEIKIDPTIDLTKPVTIHDL 681

Query: 529 IRNPLDWDLRYIHPEY--QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWS 586
                +W+ +Y+HPEY   K+ L +    + C DVF FP+ +++FC E +QIME YG+WS
Sbjct: 682 FTRRDEWERKYLHPEYYLNKNNLKNLRCPELCSDVFNFPLFSKEFCSELIQIMENYGKWS 741

Query: 587 DGTNN--DKRL-ETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVR 643
            GT +  D RL    YE VPT+DI + +VGL   W   +  Y+ PL    +  Y  + V 
Sbjct: 742 GGTGHHIDHRLGHNYYENVPTQDIQLFEVGLDKHWETIVMDYISPLVRIIYGNYKTKSVH 801

Query: 644 APMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
             ++FVVRY    Q  L+ HHD+STYT+NIALN+ G DYEGGGC FIR         +G 
Sbjct: 802 --LAFVVRYHWQLQNELQEHHDASTYTVNIALNECGTDYEGGGCEFIRQKYIAKNQEVGT 859

Query: 704 MLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
             +HPGRLTH H+GL+ T G RYI++SF++
Sbjct: 860 SNIHPGRLTHLHKGLKTTNGIRYILVSFIN 889


>gi|326426536|gb|EGD72106.1| PLOD2 protein [Salpingoeca sp. ATCC 50818]
          Length = 527

 Score =  291 bits (745), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 166/480 (34%), Positives = 258/480 (53%), Gaps = 47/480 (9%)

Query: 282 LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNY--PAKKISMFVYNNQEYH----- 334
           LDS++    P V ++V + + + FLE  L+ +   +Y   A  I++ +    ++H     
Sbjct: 66  LDSIEDIHVP-VHMAVLVYEGSPFLEYVLSSLEQQHYVKDALTITLLLAPGMDWHLNHQL 124

Query: 335 -APLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDN 393
            +    D+ + +  +  +     H++ V   E+         H        +DS S L N
Sbjct: 125 ASSWQSDHSNKYAAIHVHSPASLHDAVVELVES---------HDSAQHLLLMDSRSRLTN 175

Query: 394 PDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN------------ADGFYARSFDYMNI 441
           PD LK+L++ ++  +AP+LVR  K WSNFW A +            A+  Y RS  Y++I
Sbjct: 176 PDTLKHLISLDKPAVAPMLVRQGKWWSNFWDAASQFHDVSPADFSPANVGYVRSNRYLDI 235

Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNL----RNKGI 497
           +  D+   G++ VP    C L++   I A         N+ + +  F   L      + +
Sbjct: 236 V--DRKQTGVFIVPLAFGCLLVRPDTIPAMKRALSAMPNTANAEWVFHLTLAYYLHQQQV 293

Query: 498 HLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY----QKSLLPDTV 553
            + + +  EYGHL++   FD  K +P+++ +  NP +W  +Y++  Y    +  L+P+  
Sbjct: 294 PIAVSNLLEYGHLINPTGFDSTKAHPDLFLVEENPAEWADKYLNELYWSFEEHGLIPN-- 351

Query: 554 NNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVG 613
               C DVF  P+ +  F    ++  E +GQWS+G N D+R++ GYE VPT+DIH  Q+G
Sbjct: 352 ----CTDVFKVPMFSPAFARNLIEECEHFGQWSNGDNKDERIQGGYEPVPTQDIHFNQIG 407

Query: 614 LAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINI 673
               W   LR+++ P+    + GY  E  R  + FVVRYRPD+Q  LRPHHD+ST T+N+
Sbjct: 408 FNNAWRFILRRFLRPVTSHYYTGYTLEG-RTTLDFVVRYRPDKQNYLRPHHDASTVTLNV 466

Query: 674 ALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           ALNQ GVDY+GGG RFIR NC +  T  GW  + PGRLTH HEGL+ T GTRYI++SF+D
Sbjct: 467 ALNQGGVDYQGGGTRFIRQNCTLINTPPGWGTLSPGRLTHLHEGLKTTAGTRYILVSFID 526


>gi|324519915|gb|ADY47513.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase [Ascaris suum]
          Length = 249

 Score =  291 bits (744), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 132/249 (53%), Positives = 178/249 (71%)

Query: 486 MAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQ 545
           M+FC   R+ G  + +D+   YG LV S++FD  K +PE+Y++  NP  W+ RYIH +Y 
Sbjct: 1   MSFCEFARHSGHFMYVDNRNYYGFLVVSDDFDTTKLHPEMYQIFDNPDLWESRYIHEKYF 60

Query: 546 KSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTR 605
            +       ++PC DVF FP+++E FC E ++ ME YGQWS G N D RL  GYE VPTR
Sbjct: 61  HARDGRIAIDEPCQDVFDFPLMSEAFCSELIEEMEHYGQWSSGKNQDDRLAGGYENVPTR 120

Query: 606 DIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHD 665
           DIHM Q+G    W   L +YV P+QE+ FIGY  +PV+A M FVVRYRPDEQ SL+PHHD
Sbjct: 121 DIHMNQIGFERHWLYMLDEYVRPIQEKLFIGYSQKPVQANMMFVVRYRPDEQSSLKPHHD 180

Query: 666 SSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
           +STY+I++ALN+ G+DY+GGG R++RYNC V A ++G+ ++ PGRLTH HEGL  T+GTR
Sbjct: 181 ASTYSIDVALNKRGIDYQGGGVRYVRYNCTVDADQIGYSMIFPGRLTHLHEGLPTTEGTR 240

Query: 726 YIMISFVDP 734
           YI +SF++P
Sbjct: 241 YIAVSFLNP 249


>gi|194379782|dbj|BAG58243.1| unnamed protein product [Homo sapiens]
          Length = 391

 Score =  286 bits (732), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 130/244 (53%), Positives = 171/244 (70%), Gaps = 2/244 (0%)

Query: 491 NLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLP 550
             R + + + + +    GHL+  +++     + +++E+  NP DW  +YIH  Y K+L  
Sbjct: 150 RFRQQDVFMFLTNRHTLGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAG 209

Query: 551 DTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMK 610
             V   PCPDV+WFPI TE  C E V+ ME +GQWS G N D R++ GYE VPT DIHM 
Sbjct: 210 KLVET-PCPDVYWFPIFTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMN 268

Query: 611 QVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYT 670
           Q+G    W +FL +Y+ P+ E+ + GY+    +  ++FVVRY+PDEQPSL PHHD+ST+T
Sbjct: 269 QIGFEREWHKFLLEYIAPMTEKLYPGYYTR-AQFDLAFVVRYKPDEQPSLMPHHDASTFT 327

Query: 671 INIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMIS 730
           INIALN+VGVDYEGGGCRF+RYNC++ A R GW LMHPGRLTHYHEGL  T+GTRYI +S
Sbjct: 328 INIALNRVGVDYEGGGCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVS 387

Query: 731 FVDP 734
           FVDP
Sbjct: 388 FVDP 391



 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 54/134 (40%), Positives = 81/134 (60%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNE 90
           ED  LV+TVA+ ET+G++RF +SA+    +++ LGL + W     +S GGG KV LLK  
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKA 84

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL  DSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 85  LEKHADKEDLVILFADSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPV 144

Query: 151 VGSGYRYLNSGGFI 164
           V  G R+     F+
Sbjct: 145 VSDGKRFRQQDVFM 158


>gi|256079279|ref|XP_002575916.1| procollagen-lysine2-oxoglutarate 5-dioxygenase [Schistosoma
           mansoni]
 gi|360044866|emb|CCD82414.1| putative procollagen-lysine,2-oxoglutarate 5-dioxygenase
           [Schistosoma mansoni]
          Length = 921

 Score =  283 bits (725), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 129/245 (52%), Positives = 172/245 (70%), Gaps = 4/245 (1%)

Query: 493 RNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDT 552
           R K + + +D+   +G+L D+ N+   K + ++++ + NP DW+ +YIHP+Y     P+ 
Sbjct: 678 RRKNVFMFVDNQMSFGYLTDANNYTKGKLHNDLWQTMDNPQDWEEQYIHPQYFNFAKPEV 737

Query: 553 VNN---QPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHM 609
                 QPCPDVFWFP+V+E FC   ++ +E YGQWS G N D RLE GYE VPTRDIHM
Sbjct: 738 TMTDIAQPCPDVFWFPLVSETFCKHLIEEVENYGQWSTGDNYDPRLEGGYENVPTRDIHM 797

Query: 610 KQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY 669
           +Q+G    W   L KYV  +Q++ F GY  +P  A M+FVVRY+PDEQPSLRPHHD+S+Y
Sbjct: 798 RQIGWEEHWLHVLEKYVHKIQKKLFQGYDDKPW-ARMNFVVRYKPDEQPSLRPHHDASSY 856

Query: 670 TINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
           TINI LNQ G DY+GGG R+ RYNC++  TR+GW L+ PGR+TH HEGL  T GTRYI +
Sbjct: 857 TINIGLNQPGKDYKGGGIRYNRYNCSIVDTRVGWALVSPGRVTHLHEGLPTTGGTRYIFV 916

Query: 730 SFVDP 734
           +FV+P
Sbjct: 917 TFVNP 921



 Score =  245 bits (626), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 137/356 (38%), Positives = 209/356 (58%), Gaps = 13/356 (3%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
           D  LV+TVA+ + D  +RF++S  +N  +VK LG    W GG ++ S GGG KVNLLK E
Sbjct: 342 DHVLVLTVATEKNDALQRFLRSCNLNGFKVKVLGEGSHWKGGHVAKSTGGGQKVNLLKEE 401

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L + D   D +IL  DSYDV+    V  +LE +  F + ++F AE  CWP  SL   YP 
Sbjct: 402 LAKGDYKPDQLILFVDSYDVVFMQNVAKLLEEYEKFKSKVIFSAEEFCWPQPSLQSSYPE 461

Query: 151 VGSG-YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
           V  G  RYLNSGGFIG   ++ +++++  IK+++DDQLYY  +FLD T RT + I LD  
Sbjct: 462 VKPGEKRYLNSGGFIGPTANLIKIVNHEPIKDDDDDQLYYTKIFLDSTSRTLYDIELDKT 521

Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
           + +FQNL G+  D++L+F+ D   +L N  ++T P+I HGNG  K+E NS  NYLA SW 
Sbjct: 522 SRIFQNLNGAFSDVELHFN-DVTGYLFNKIFSTTPIIAHGNGPIKVEFNSLSNYLAYSWS 580

Query: 270 -TSGCTRCNL--IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
            T  C +C+   I+  D L    +P V++ +FI++ T F+E F  +IA L+YP  ++ + 
Sbjct: 581 PTKNCQQCDEDNIEIQDIL---DYPLVVMGIFIEQGTPFIERFFERIAALSYPKSRLHVV 637

Query: 327 --VYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARN--LAVENSLHKG 378
             +  N  + + + + +   F   + +V ++  N    +   +N  + V+N +  G
Sbjct: 638 GHMAENSRFQSAVAESFNQTFGHQYFSVNWLEENLDEETARRKNVFMFVDNQMSFG 693


>gi|425701200|gb|AFX92362.1| putative procollagen-lysine 2-oxoglutarate 5-dioxygenase [Megavirus
           courdo11]
          Length = 889

 Score =  281 bits (719), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 229/731 (31%), Positives = 360/731 (49%), Gaps = 116/731 (15%)

Query: 30  DEDK-FLVITVASNETDGYKRFIQSAEVNKLQVKTL-GLHQPWLGGDMSSLGGGYKVNLL 87
           D DK F +I +     D + RFI+  E+  L    L  ++ P    D+S++         
Sbjct: 248 DTDKEFSIIYIGPTNGDSFARFIEYCELYSLPRIILDSINMP----DISTI--------- 294

Query: 88  KNELDEMDITDDMIILV-----TDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
             +L E+D  +D + +V      D+ + I      +I+ +FN+    I      +  P+ 
Sbjct: 295 HKKLAEIDNLEDKLFVVISVLPNDNCNFIPTAPPTEIINKFNS----ICHNKNGIIIPN- 349

Query: 143 SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISN-----RSIKNEEDDQLYYALLFLDET 197
                    G   + +    F G+   I+ +I +       +KN  +  L  A++F   T
Sbjct: 350 ---------GETSKTI----FCGWGNRIQRMIQDYLDKVDIVKNITNAALSTAIMF--NT 394

Query: 198 LRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
             T   I+ D    +F  +  S +DI  N    +  H    K+ T P I+  N    + L
Sbjct: 395 FITS-DIIKDDTCQIFCCV-NSEDDIIYNTTKSKISH---KKFGTTPSILFSNEIGNLVL 449

Query: 258 NSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQ-FPSVLISVFIDKPTAFLEEFLNKIANL 316
           N   NY   +W      R       +  +P    P+V IS+  DK  + ++     I  +
Sbjct: 450 NRIQNYTGNNWNEYYGYR-------NHTEPKTILPTVYISILSDKNPSVVD----IIQTI 498

Query: 317 NYPAKKISMFV----YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVE 372
           +YP + +++ +     N+  Y   L               KYIA N              
Sbjct: 499 DYPRELLTVVITKGTINDNYYQEDL--------------EKYIATN-------------- 530

Query: 373 NSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFY 432
                  ++YF+++ D  L NP+VLK L+N N+ +IAPL+ R  ++W+NFWG L+ +G+Y
Sbjct: 531 ------CEYYFFINHDCILVNPNVLKELINLNKKIIAPLIRRGDESWTNFWGDLDKNGYY 584

Query: 433 ARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-MDYDMAFCTN 491
            RS DY +IING++  +G WNVP++   YL+  SVI++  +  ++T N+ +D DM  C N
Sbjct: 585 KRSHDYFDIINGER--RGCWNVPHVFGTYLIHRSVIES--VPDMFTKNTDIDADMRMCHN 640

Query: 492 LRNKGIHLKIDSTQEYGHLVDSENFDPQ-KTNPE--VYELIRNPLDWDLRYIHPEY--QK 546
           +R   IH+ + +   YG++       P+   N E  V++      +W+ +Y+HPEY   K
Sbjct: 641 IRQHDIHIYLSNLNSYGYIQTELQIAPEIDINKEVTVFDFSTRRSEWEKKYLHPEYFLNK 700

Query: 547 SLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGT--NNDKRL-ETGYEAVP 603
           + L      + C DVF FP+ + +FC E +Q ME YG+WS G   N D RL    YE VP
Sbjct: 701 NNLKHLRCTELCNDVFNFPLFSREFCSELIQTMEKYGKWSGGAGHNIDHRLGHNYYENVP 760

Query: 604 TRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPH 663
           T+DI + +VGL   W   + +Y+ PL    +  Y  + V   ++FVVRY   +Q  L+ H
Sbjct: 761 TQDIQLFEVGLDKHWESIVNEYIAPLVRIVYSNYKTKSVH--LAFVVRYHWQQQSELQEH 818

Query: 664 HDSSTYTINIALNQ-VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
           HD+STYTINIALN+  G DYEGGG RFIR N +     +G   +HPG+ THYH+GL+ T 
Sbjct: 819 HDASTYTINIALNEGGGKDYEGGGSRFIRQNYSSINQEIGTANLHPGKCTHYHKGLKTTA 878

Query: 723 GTRYIMISFVD 733
           G RYI++SF++
Sbjct: 879 GIRYILVSFIN 889


>gi|432962878|ref|XP_004086761.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like,
           partial [Oryzias latipes]
          Length = 375

 Score =  279 bits (714), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 133/341 (39%), Positives = 211/341 (61%), Gaps = 4/341 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
           +  LVIT A+ ETDG++RF+++A     +V+ LGL + W GGD++ ++GGG KV  LK E
Sbjct: 36  ENLLVITAATEETDGFRRFMRTAREFNYKVQVLGLGEDWRGGDVARTVGGGQKVRWLKKE 95

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L +     +++I+  DSYDV++  G  ++L +F+     +VF AE  CWPD  L  KYP 
Sbjct: 96  LLKHSEEAELVIMFVDSYDVVLAAGPGELLAKFSRLGHRVVFSAEGFCWPDQRLASKYPQ 155

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V SG RYLNSGGFIG+A D+  ++   ++K+++DDQL+Y  ++LD   R K  I LD  +
Sbjct: 156 VHSGKRYLNSGGFIGFAADLSAIVQQWTLKDDDDDQLFYTRIYLDRNQRNKFNITLDHRS 215

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+++++ L F+      + N  Y+T PV+IHGNG +K++LN  GNY+  +W  
Sbjct: 216 QIFQNLNGAIDEVVLKFEKGR-ARVRNVAYDTLPVVIHGNGPTKLQLNYLGNYVPTAWTY 274

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GC  C++ ++  D    +Q P V ++VFI+ PT F+EEFL ++  LNYP  ++ +F++
Sbjct: 275 ENGCGVCDIDLRLFDDTPDEQMPLVHLAVFIEHPTPFMEEFLERLTTLNYPHSRLRLFIH 334

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNL 369
           NN  YH     ++    K +F N   +     +   +AR +
Sbjct: 335 NNVVYHEQHIQNFWLRHKNLFPNALLVGPEENLEENQARTM 375


>gi|311977606|ref|YP_003986726.1| probable procollagen-lysine,2-oxoglutarate 5-dioxygenase
           [Acanthamoeba polyphaga mimivirus]
 gi|82000136|sp|Q5UQC3.1|PLOD_MIMIV RecName: Full=Procollagen lysyl hydroxylase and
           glycosyltransferase; Short=LHGT; AltName: Full=Lysyl
           hydroxylase; AltName:
           Full=Procollagen-lysine,2-oxoglutarate 5-dioxygenase
 gi|55416853|gb|AAV50503.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase [Acanthamoeba
           polyphaga mimivirus]
 gi|308204267|gb|ADO18068.1| probable procollagen-lysine,2-oxoglutarate 5-dioxygenase
           [Acanthamoeba polyphaga mimivirus]
 gi|339061161|gb|AEJ34465.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase [Acanthamoeba
           polyphaga mimivirus]
          Length = 895

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 212/732 (28%), Positives = 339/732 (46%), Gaps = 108/732 (14%)

Query: 28  NIDEDK-FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNL 86
           N D DK F ++ +   + + + RF +  ++  L  K +   +     D  SL    +   
Sbjct: 246 NFDTDKQFRIVYIGPTKGNSFHRFTEYCKLYLLPYKVIDEKET---NDFVSLRSELQ--- 299

Query: 87  LKNELDEMDITDDMIILVT----DSYDVIIDGGVNDILERFN--TFDANIVFGAERLCWP 140
               L E D+   ++++V+    D  + I     N+ ++++   T D N +  A      
Sbjct: 300 ---SLSEQDLNTTLMLVVSVNHNDFCNTIPCAPTNEFIDKYKQLTTDTNSIVSA------ 350

Query: 141 DTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIK----NEEDDQLYYALLFLDE 196
                     V +G    N   FIG+A  I E I++   K    N E D     LL +  
Sbjct: 351 ----------VQNG---TNKTMFIGWANKISEFINHYHQKLTESNAETDINLANLLLISS 397

Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSK-I 255
                + +V D   NLFQ L     DI  +          N K    P +++ N  S  I
Sbjct: 398 ISSDFNCVVEDVEGNLFQ-LINEESDIVFSTTTSR----VNNKLGKTPSVLYANSDSSVI 452

Query: 256 ELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIA- 314
            LN   NY    W            H+  +K D  P + +S+ I K        + KIA 
Sbjct: 453 VLNKVENYTGYGWNEY------YGYHVYPVKFDVLPKIYLSIRIVKNAN-----VTKIAE 501

Query: 315 NLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENS 374
            L+YP + I++ +  ++  H   +   I  F                             
Sbjct: 502 TLDYPKELITVSISRSE--HDSFYQADIQKF----------------------------- 530

Query: 375 LHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN-ADGFYA 433
           L  G D+YFY+  D  +  P +LK L+  N+  + PL+ +  ++W+N+WG ++ ++G+Y 
Sbjct: 531 LLSGADYYFYISGDCIITRPTILKELLELNKDFVGPLMRKGTESWTNYWGDIDPSNGYYK 590

Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-----MDYDMAF 488
           RSFDY +II  D+   G WNVPY+ + YL+K SVI+   +  ++T NS      + DM  
Sbjct: 591 RSFDYFDIIGRDR--VGCWNVPYLASVYLIKKSVIE--QVPNLFTENSHMWNGSNIDMRL 646

Query: 489 CTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPE---VYELIRNPLDWDLRYIHPEYQ 545
           C NLR   + + + + + YGH+ DS N +     P    +Y+L     +W+ +Y+HPE+ 
Sbjct: 647 CHNLRKNNVFMYLSNLRPYGHIDDSINLEVLSGVPTEVTLYDLPTRKEEWEKKYLHPEFL 706

Query: 546 KSL--LPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNN--DKRLETGYEA 601
             L    D    + C DV+ FP+ T  FC E +++M+    WS G ++  D R+  G E+
Sbjct: 707 SHLQNFKDFDYTEICNDVYSFPLFTPAFCKEVIEVMDKANLWSKGGDSYFDPRI-GGVES 765

Query: 602 VPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLR 661
            PT+D  + +VGL   W   +  YV P     +  Y  + +   ++FVV+Y  + Q  L 
Sbjct: 766 YPTQDTQLYEVGLDKQWHYVVFNYVAPFVRHLYNNYKTKDIN--LAFVVKYDMERQSELA 823

Query: 662 PHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVT 721
           PHHDSSTYT+NIALN+ G +Y  GGC FIR+       ++G+  +H G+L  YH  L +T
Sbjct: 824 PHHDSSTYTLNIALNEYGKEYTAGGCEFIRHKFIWQGQKVGYATIHAGKLLAYHRALPIT 883

Query: 722 QGTRYIMISFVD 733
            G RYI++SFV+
Sbjct: 884 SGKRYILVSFVN 895


>gi|351737377|gb|AEQ60412.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase [Acanthamoeba
           castellanii mamavirus]
 gi|398257080|gb|EJN40688.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase [Acanthamoeba
           polyphaga lentillevirus]
          Length = 895

 Score =  275 bits (702), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 210/732 (28%), Positives = 339/732 (46%), Gaps = 108/732 (14%)

Query: 28  NIDEDK-FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNL 86
           N D DK F ++ +   + + + RF +  ++  L  K +   +     D  SL    +   
Sbjct: 246 NFDTDKQFRIVYIGPTKGNSFHRFTEYCKLYLLPYKVIDEKET---NDFVSLRSELQ--- 299

Query: 87  LKNELDEMDITDDMIILVT----DSYDVIIDGGVNDILERFN--TFDANIVFGAERLCWP 140
               L E D+   ++++V+    D  + I     N+ ++++   T D N +  A      
Sbjct: 300 ---SLSEQDLNTTLMLVVSVNHNDFCNTIPCAPTNEFIDKYKQLTTDTNSIVSA------ 350

Query: 141 DTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIK----NEEDDQLYYALLFLDE 196
                     V +G    N   F+G+A  I E I++   K    N E D     LL +  
Sbjct: 351 ----------VQNG---TNKTMFVGWANKISEFINHYHQKLTESNAETDINLANLLLISS 397

Query: 197 TLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSK-I 255
                + +V D   NLFQ L     DI  +          N K    P +++ N  S  I
Sbjct: 398 ISSDFNCVVEDIEGNLFQ-LINEESDIVFSTTTSR----VNNKLGKTPSVLYANSDSSVI 452

Query: 256 ELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIA- 314
            LN   NY    W            H+  +K D  P + +S+ I K        + KIA 
Sbjct: 453 VLNKVENYTGYGWNEY------YGYHVYPVKFDVLPKIYLSIRILKNAN-----VTKIAE 501

Query: 315 NLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENS 374
            L+YP + +++ +  ++  H   +   I  F                             
Sbjct: 502 TLDYPKELVTVSISRSE--HDNFYQADIQKF----------------------------- 530

Query: 375 LHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN-ADGFYA 433
           L  G D+YFY+  D  +  P +LK L+  N+  + PL+ +  ++W+N+WG ++ ++G+Y 
Sbjct: 531 LLSGADYYFYISGDCIITRPSILKELLELNKDFVGPLMRKGTESWTNYWGDIDPSNGYYK 590

Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-----MDYDMAF 488
           RSFDY +II  D+   G WNVPY+ + YL+K SVI+   +  ++T NS      + DM  
Sbjct: 591 RSFDYFDIIGRDR--VGCWNVPYLASVYLIKKSVIE--QVPNLFTENSHMWNGSNIDMRL 646

Query: 489 CTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPE---VYELIRNPLDWDLRYIHPEYQ 545
           C NLR   + + + + + YGH+ DS N +     P    +Y+L     +W+ +Y+HPE+ 
Sbjct: 647 CHNLRKNNVFMYLSNLRPYGHIDDSINLEVLSGVPTEVTLYDLPTRKEEWEKKYLHPEFL 706

Query: 546 KSL--LPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNN--DKRLETGYEA 601
             L    D    + C DV+ FP+ T  FC E +++M+    WS G ++  D R+  G E+
Sbjct: 707 NHLQNFKDFDYTEICNDVYSFPLFTPAFCKEVIEVMDKANLWSKGGDSYFDPRI-GGVES 765

Query: 602 VPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLR 661
            PT+D  + +VGL   W   +  YV P     +  Y  + +   ++FVV+Y  + Q  L 
Sbjct: 766 YPTQDTQLYEVGLDKQWHYVVFNYVAPFVRHLYNNYKTKDIN--LAFVVKYDMERQSELA 823

Query: 662 PHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVT 721
           PHHDSSTYT+N+ALN+ G  Y GGGC FIR+       ++G+  +H G+L  YH  L +T
Sbjct: 824 PHHDSSTYTLNVALNEYGSQYMGGGCEFIRHKFIWQGQKVGYATIHAGKLLAYHRALPIT 883

Query: 722 QGTRYIMISFVD 733
            G RYI++SFV+
Sbjct: 884 SGKRYILVSFVN 895


>gi|167522232|ref|XP_001745454.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776412|gb|EDQ90032.1| predicted protein [Monosiga brevicollis MX1]
          Length = 399

 Score =  272 bits (695), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 142/369 (38%), Positives = 207/369 (56%), Gaps = 27/369 (7%)

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGF------------- 431
           + S + L N   L +L+  +  +IAPLLVR  K WSNFWG+  A GF             
Sbjct: 36  IHSHARLTNSSALSHLMATDYDVIAPLLVRQNKYWSNFWGS--ASGFAPAVAAQALADAD 93

Query: 432 ---YARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV---IKATNIKTIYTLNSMDYD 485
              Y RS DY  I+   Q   G+W VP +    ++   V   +K              Y 
Sbjct: 94  RLGYMRSPDYYEIVERHQ--TGVWTVPVVFGAVVLSERVHDTLKEAAQDLAEGEAGWFYG 151

Query: 486 MAFCTNLRNK-GIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEY 544
           MA         G  +++ +   +GH+++++ +D    +P++Y    NP +W+  Y+H EY
Sbjct: 152 MAALAAHLRHAGSLVRVTNEHRFGHMINTDAYDASHLHPDMYLAQDNPAEWEAVYLHEEY 211

Query: 545 QKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPT 604
            +    +  + + C DV+  P ++ +F  E ++  E  G+WS+G + D RL+ GYE VPT
Sbjct: 212 NQ--FRELGDMEDCTDVYRVPALSARFAREMIEECENLGEWSNGQHTDNRLKGGYEPVPT 269

Query: 605 RDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH 664
           +DIH +Q+G    W  FLR Y+ P+    ++GYH +  R  + FVVRYRPD+Q  LRPHH
Sbjct: 270 QDIHFEQIGFKDTWQHFLRTYLGPVANHHYMGYHIQG-RTTLDFVVRYRPDKQSFLRPHH 328

Query: 665 DSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGT 724
           D+ST T+N+ALNQ GVDY+GGG  F+R NC +     GW  + PGRLTHYHEGL+ T GT
Sbjct: 329 DASTVTLNVALNQGGVDYQGGGTHFLRQNCTIKDAPPGWGTLSPGRLTHYHEGLKTTAGT 388

Query: 725 RYIMISFVD 733
           RYI++SF+D
Sbjct: 389 RYILVSFID 397


>gi|16877124|gb|AAH16834.1| PLOD2 protein, partial [Homo sapiens]
          Length = 210

 Score =  261 bits (667), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 115/211 (54%), Positives = 156/211 (73%), Gaps = 2/211 (0%)

Query: 524 EVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYG 583
           +++++  NP+DW  +YI+ +Y K +  + +  QPCPDVFWFPI +EK C E V+ ME YG
Sbjct: 2   DLWQIFENPVDWKEKYINRDYSK-IFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYG 60

Query: 584 QWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVR 643
           +WS G ++D R+  GYE VPT DIHMKQV L  VW  F+R+++ P+  + F GY+ +   
Sbjct: 61  KWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF- 119

Query: 644 APMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
           A ++FVV+Y P+ Q SLRPHHD+ST+TINIALN VG D++GGGC+F+RYNC++ + R GW
Sbjct: 120 ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGW 179

Query: 704 MLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
             MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 180 SFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 210


>gi|299930635|gb|ADJ58533.1| seminal fluid protein HACP031 [Heliconius erato]
          Length = 332

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 123/287 (42%), Positives = 189/287 (65%), Gaps = 6/287 (2%)

Query: 36  VITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDEMD 95
           V TVA++   G +RF++SA+V  + V+ LG+ + W+GG+M   GGG K+NLLK +L  ++
Sbjct: 47  VFTVATHNNHGLERFLRSAKVYGINVEVLGMGKKWVGGNMDHPGGGQKINLLKQKLKSLE 106

Query: 96  ITDDM--IILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGS 153
             +D   IIL TDS+DV+    + +I+++F      ++F AE  CWPD +L  KYP    
Sbjct: 107 KLEDRDRIILFTDSFDVMFLANLKEIVDKFTNMFVRVLFSAESFCWPDPTLSSKYPDTSM 166

Query: 154 GYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLF 213
              +LNSGGFIGY  D+  +++ + ++N +DDQLYY +++LDE  R KH+I LD  + +F
Sbjct: 167 TNAFLNSGGFIGYYSDVMAILNYKKVRNNDDDQLYYTMVYLDEEYRLKHRIALDHDSEIF 226

Query: 214 QNLYGSLEDIKLNFD-LDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTS- 271
           QNL G+L D++L  +  +++ ++ N   N  P+I+HGNG SK++LN F NYLA +W  S 
Sbjct: 227 QNLNGALSDVELVLNSTEDYPYIKNVVSNERPLIVHGNGPSKLKLNQFSNYLANAWSVSK 286

Query: 272 GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNY 318
           GC  C+  +    LK D  P+VL++VFI++PT FLEEFL +I  ++Y
Sbjct: 287 GCKMCD--EKYTVLKDDALPNVLMAVFIEQPTPFLEEFLTQIEKVDY 331


>gi|74180451|dbj|BAE34174.1| unnamed protein product [Mus musculus]
          Length = 398

 Score =  245 bits (625), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 134/375 (35%), Positives = 227/375 (60%), Gaps = 5/375 (1%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW-LGGDMSSLGGGYKVNLLKN 89
           ED  LV+TVA+ ET+G++RF +SA+    ++++LGL + W + G  ++ GGG KV LLK 
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWSVDGGPAAAGGGQKVRLLKK 84

Query: 90  ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
            L++    +D++IL  DSYDV+   G  ++L++F    + +VF AE   +PD  L  KYP
Sbjct: 85  ALEKHADKEDLVILFVDSYDVVFASGPRELLKKFQQAKSQVVFSAEEHIYPDRRLEAKYP 144

Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTL 209
            V  G R+L SGGFIGYA  + +L++    ++ + DQL+Y  +FL+   R +  I LD  
Sbjct: 145 TVPDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLFYTKIFLNPEKREQINISLDHR 204

Query: 210 ANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK 269
             +FQNL G+L+++ L F++   V   N  Y+T PV++HGNG +K++LN  GNY+ + W 
Sbjct: 205 CRIFQNLDGALDEVVLKFEMGH-VRARNLAYDTLPVVVHGNGPTKLQLNYLGNYIPRFWT 263

Query: 270 -TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV 327
             +GCT C+  ++ L  +  +  P+VL+ VFI++PT FL  F  ++  L YP K++ +F+
Sbjct: 264 FETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFLRLLRLRYPQKQMRLFI 323

Query: 328 YNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLH-KGVDFYFYVD 386
           +N + +H    + ++    + +++VK +     + + +ARN+  +     +   +YF VD
Sbjct: 324 HNQERHHKLQVEQFLAEHGSEYQSVKLVGPEVRMANADARNMGADLCRQDQTCTYYFSVD 383

Query: 387 SDSHLDNPDVLKYLV 401
           +D  L  P+ L+ L+
Sbjct: 384 ADVALTEPNSLRLLI 398


>gi|345309303|ref|XP_001514467.2| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like,
           partial [Ornithorhynchus anatinus]
          Length = 302

 Score =  243 bits (619), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 121/302 (40%), Positives = 192/302 (63%), Gaps = 4/302 (1%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDM-SSLGGGYKVNLLKNE 90
           +  LV+TVA+ ET+G++RF +SA+    +V+ LGL + W   D  ++ GGG K+ LLK+ 
Sbjct: 2   ENLLVLTVATRETEGFRRFKRSAQFFNYKVQVLGLGEDWSSEDEPTAAGGGQKIRLLKSA 61

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           L++    +D++IL TDSYDV+   G  ++L++F    + +VF AE L +PD  L  KYP 
Sbjct: 62  LEKHADKEDLVILFTDSYDVVFASGPKELLKKFKQAKSRVVFSAEELIYPDRRLEAKYPT 121

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           V  G R+L SG FIGYA ++ +L+++    + + DQL+Y  +FLD   R    I LD   
Sbjct: 122 VRDGKRFLGSGAFIGYAPNLSKLVADWKGLDNDSDQLFYTQVFLDPEKREAINISLDHRC 181

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWK- 269
            +FQNL G+L+++ L F+ +  V   N +Y+T PV+IHGNG +K++LN  GNY+ + W  
Sbjct: 182 RIFQNLNGALDEVVLKFE-NAQVRARNLEYDTLPVLIHGNGPTKLQLNYLGNYIPRVWTF 240

Query: 270 TSGCTRCNL-IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
            +GCT+C+  ++ L   + D  P VL+ VFI++PT FL  F  ++  L YP K++ +F++
Sbjct: 241 ETGCTQCDEGLRSLKGFEDDALPLVLVGVFIEQPTPFLSLFFRRLQALQYPKKQLQLFIH 300

Query: 329 NN 330
           N+
Sbjct: 301 NH 302


>gi|345321580|ref|XP_003430455.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 2-like,
           partial [Ornithorhynchus anatinus]
          Length = 183

 Score =  242 bits (617), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 106/182 (58%), Positives = 137/182 (75%), Gaps = 1/182 (0%)

Query: 553 VNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV 612
           V  +PCPDVFWFPI +EK C E V+ ME +GQWS G ++D R+  GYE VPT DIHM+Q+
Sbjct: 3   VVERPCPDVFWFPIFSEKACDELVEEMEHFGQWSGGKHHDSRISGGYENVPTDDIHMRQI 62

Query: 613 GLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
           GL   W  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHDSST+TIN
Sbjct: 63  GLENEWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDSSTFTIN 121

Query: 673 IALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
           IALN VG D++GGGC+FIRYNC++ + R GW  MHPGRLTH HEGL +  GTRYI +SF+
Sbjct: 122 IALNSVGEDFQGGGCKFIRYNCSIESPRKGWSFMHPGRLTHLHEGLPIKNGTRYIAVSFI 181

Query: 733 DP 734
           DP
Sbjct: 182 DP 183


>gi|355712271|gb|AES04294.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [Mustela
           putorius furo]
          Length = 213

 Score =  241 bits (614), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 105/215 (48%), Positives = 152/215 (70%), Gaps = 2/215 (0%)

Query: 495 KGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
           KG+ + I +  E+G L+ + N++    N +++++  NP+DW  +YI+ +Y K +  + + 
Sbjct: 1   KGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSK-IFTENIV 59

Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
            QPCPDVFWFPI +EK C E V+ ME YGQWS G ++D R+  GYE VPT DIHMKQ+ L
Sbjct: 60  EQPCPDVFWFPIFSEKACDELVEEMEHYGQWSGGKHHDSRISGGYENVPTDDIHMKQIDL 119

Query: 615 AGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIA 674
             VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TINIA
Sbjct: 120 ENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTINIA 178

Query: 675 LNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPG 709
           LN VG D++GGGC+F+RYNC++ + R GW  MHPG
Sbjct: 179 LNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPG 213


>gi|324502308|gb|ADY41016.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase [Ascaris suum]
          Length = 379

 Score =  219 bits (557), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 119/368 (32%), Positives = 211/368 (57%), Gaps = 13/368 (3%)

Query: 9   CLILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQ 68
            + LS  +F + +  N   ++      V+TV     D  +R  +SA  +++Q+  L   Q
Sbjct: 6   VVALSLSLFILRLDVNAATSLH-----VVTVVIEHQDALERLQRSANAHEIQLNILRHDQ 60

Query: 69  PWLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF--NTF 126
                  S LGGG K+ +L++ L+      D+I+L  D+   II+G   +IL+RF  +  
Sbjct: 61  L---ASSSHLGGGEKLRILRDGLEIYKDRSDLILLYVDANKAIINGREEEILKRFMDSYS 117

Query: 127 DANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQ 186
           ++ IVF ++  C+PD  L  +YP V  G R+LNS  FIGYA  I EL++++S++N  D+Q
Sbjct: 118 NSQIVFSSDNYCFPDEELTQRYPIVEKGKRFLNSAAFIGYANKIWELLNSQSLENINDEQ 177

Query: 187 LYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVI 246
           ++Y   FLDE LR + ++VLD+ + +F ++  S ++I L+F  +   ++TN  + T+P+I
Sbjct: 178 IFYTHRFLDERLRNRLQMVLDSTSQIFHSVDVSKDEITLDFSDNGDAYITNVIHKTHPLI 237

Query: 247 IHGNGKSKIELNSFGNYLAKSWKTS-GCTRCNL--IKHLDSLKPDQFPSVLISVFIDKPT 303
           IHG+  +K+ LN  GNY+ K+W    GC  C+   +  L      ++P + +++ + KP 
Sbjct: 238 IHGDESNKLMLNYLGNYIGKAWSADFGCRDCSAQRVNFLKDNAEQEWPKLTLAIMLAKPI 297

Query: 304 AFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNS 363
            F+EEFL K+  L YPA KI +++Y+NQ+Y+    ++++   +  +  V++ +    +  
Sbjct: 298 PFVEEFLTKVEKLEYPASKIDLYLYSNQKYNEREVNEFLRRVRGKYSWVEWDSGEVEIGE 357

Query: 364 KEARNLAV 371
           +EAR  A+
Sbjct: 358 REARRTAI 365


>gi|194387172|dbj|BAG59952.1| unnamed protein product [Homo sapiens]
          Length = 333

 Score =  214 bits (545), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 102/225 (45%), Positives = 155/225 (68%), Gaps = 2/225 (0%)

Query: 32  DKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNE 90
           +K LVITVA+ ET+GY RF++SAE     V+TLGL + W GGD++ ++GGG KV  LK E
Sbjct: 41  EKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWLKKE 100

Query: 91  LDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPA 150
           +++    +DMII+  DSYDVI+ G   ++L++F    + ++F AE  CWP+  L ++YP 
Sbjct: 101 MEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQYPE 160

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
           VG+G R+LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD  +
Sbjct: 161 VGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLDHKS 220

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKI 255
            +FQNL G+L+++ L FD +  V + N  Y+T P+++HGNG +K+
Sbjct: 221 RIFQNLNGALDEVVLKFDRNR-VRIRNVAYDTLPIVVHGNGPTKV 264


>gi|311978107|ref|YP_003987227.1| putative procollagen-lysine,2-oxoglutarate dioxygenase
           [Acanthamoeba polyphaga mimivirus]
 gi|81999712|sp|Q5UNV6.1|YR699_MIMIV RecName: Full=Uncharacterized protein R699
 gi|55417310|gb|AAV50960.1| unknown [Acanthamoeba polyphaga mimivirus]
 gi|308204997|gb|ADO18798.1| putative procollagen-lysine,2-oxoglutarate dioxygenase
           [Acanthamoeba polyphaga mimivirus]
 gi|339061637|gb|AEJ34941.1| hypothetical protein MIMI_R699 [Acanthamoeba polyphaga mimivirus]
 gi|351737875|gb|AEQ60910.1| hypothetical protein [Acanthamoeba castellanii mamavirus]
 gi|398257501|gb|EJN41109.1| hypothetical protein lvs_R606 [Acanthamoeba polyphaga
           lentillevirus]
          Length = 455

 Score =  206 bits (525), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 153/493 (31%), Positives = 254/493 (51%), Gaps = 54/493 (10%)

Query: 30  DEDKFLV--ITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSS-LGGGYKVNL 86
           ++D  LV  I ++ ++TDG  RF +  + + LQ   +G  + W GG++ S  GGG K+N 
Sbjct: 6   NDDNLLVLGIGISVHKTDGVLRFEKYCQAHNLQYMIVGEGKKWNGGNLESEAGGGQKINE 65

Query: 87  LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILE--RFNTFDANIVFGAERLCWPDTSL 144
           L   L+   I D+ +I+V D+YD+I   G  +IL   RF T D  +VF +E  CWPD SL
Sbjct: 66  LLIALES--IKDNKLIVVCDTYDLIPLSGPEEILRKYRFLTPDNKVVFSSELYCWPDASL 123

Query: 145 YDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKI 204
            ++YP V + Y+YLNSG F+GY  DI E+I N  +K+ +DDQL++++ F++       KI
Sbjct: 124 VERYPKVDTKYKYLNSGAFMGYRDDIYEMIKN-GVKDRDDDQLFFSIKFIETD-----KI 177

Query: 205 VLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSF-GNY 263
           VLD    LFQ +Y    D+ ++ +      + N   N+ PV  HGNG +K  LN   G +
Sbjct: 178 VLDYKCELFQAMYRCNSDLVVHKN-----RIFNGYTNSYPVFAHGNGPAKKLLNHMEGYF 232

Query: 264 LAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDK-PTAFLEEFLNKIANLNYPAKK 322
           + +    S  T       +++ K D  P V  ++++D    + L++FL K+A++ Y  K 
Sbjct: 233 MTEPIDGSSNT-------INTFKLDNEPKVFFALYVDSNDLSALKQFLGKVASIQYGNKV 285

Query: 323 ISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFY 382
           I ++  ++ E +  L      N+ T     KY+                ++       FY
Sbjct: 286 IYLYDRSDNEQNRKLIQISYPNYHTGV--TKYV---------------FDDFKKSDAQFY 328

Query: 383 FYVDSDSHLDNPDVLKYL---VNRNESLIAPLL-VRPFKAWSNFWGALNADGFYARSFDY 438
           F ++ +  +   D+L  L   V  N  +I+P++        +NFWG +  DG+Y RS +Y
Sbjct: 329 FLLEQNCIITKKDILHELIMQVKDNHRVISPMIGYEQNSTRTNFWGDI-EDGYYKRSENY 387

Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIH 498
           +++       +G+WNVPY+    LM  SV++  ++  +      D DM  C +LR   I 
Sbjct: 388 LDL--AKHKVRGLWNVPYVYGVILMHESVVRNWDLSMV---KYNDKDMDLCFSLRKHTIF 442

Query: 499 LKIDSTQEYGHLV 511
           + + +   YG++V
Sbjct: 443 MYMINNNNYGYMV 455


>gi|451927695|gb|AGF85573.1| hypothetical protein glt_00768 [Moumouvirus goulette]
          Length = 449

 Score =  198 bits (503), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 150/484 (30%), Positives = 244/484 (50%), Gaps = 51/484 (10%)

Query: 35  LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSS-LGGGYKVNLLKNELDE 93
           L I V+ N+ DG  RF    +   L  K +G  + W GGDMS+ +GGG KVN L   L+E
Sbjct: 7   LGIGVSPNKNDGVLRFETYCKAFNLPYKIVGDGKIWNGGDMSAGVGGGQKVNELLRTLNE 66

Query: 94  MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKYPA 150
             I ++ +++V D++D+    G  +I E++      + +I+F +E  CWPD SL + YP 
Sbjct: 67  --INENKLLVVCDTFDLFPVSGAKEIYEKYMKLCNGNKSIIFSSEVYCWPDKSLVNVYPV 124

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
             S Y+YLNSG FIGY  D+++L+SN  I + +DDQLYY   FL         I+LD   
Sbjct: 125 TESKYKYLNSGSFIGYRDDLQKLVSN--ILDTDDDQLYYTKKFL-----RGENIILDYNC 177

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT 270
            LFQ + G   D+ ++ +      + N    + P+ +HGNG SK  LN   NY+      
Sbjct: 178 QLFQAINGCKSDLIVHKN-----RVFNKYTKSYPIFLHGNGSSKTYLNHLENYIEP---- 228

Query: 271 SGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN 330
                 +LI     +   Q P + I++++D         LN         KKI    Y+N
Sbjct: 229 -----LSLIDMPQDIITHQ-PKIFIALYVD------TSLLNNFTQFFESVKKID---YDN 273

Query: 331 QEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           +  +  ++D + ++      N+    + S + + E  +      ++ G DFY  ++ +  
Sbjct: 274 KNIY--VYDKFQNDQMEQLINLLGFVYKSNITNYEFNDF-----INSGCDFYCLMEQNYI 326

Query: 391 LDNPDVLKY---LVNRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
                 LK    L+N N  ++APLL+ +    +SNFWG+L+  G+Y RS DY+N++  ++
Sbjct: 327 TTRTTFLKEIIPLLNNNHRIVAPLLISKSNSCFSNFWGSLDNKGYYERSEDYLNLMTREK 386

Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
              G+WNVPY++   +   S+I   ++K  Y     D DM  C NLR   + + + +   
Sbjct: 387 --IGLWNVPYVSGLIIFDKSIILNWDLKQ-YNDYKNDRDMNLCFNLRKHTLFMYMCNLDN 443

Query: 507 YGHL 510
           YG++
Sbjct: 444 YGYI 447


>gi|425701123|gb|AFX92285.1| putative procollagen-lysine 2-oxoglutarate dioxygenase [Megavirus
           courdo11]
          Length = 453

 Score =  194 bits (492), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 155/491 (31%), Positives = 251/491 (51%), Gaps = 58/491 (11%)

Query: 35  LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDE 93
           L I V+  + DG  RF +  ++  L    +G  + W GGDMS   GGG K+N L   L+ 
Sbjct: 7   LGIGVSLKKNDGVLRFEKYCQIFDLPYTIVGDGKIWKGGDMSVGAGGGQKINELLIALET 66

Query: 94  MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKY-- 148
             ITD+ +I+V D++D+       +IL +++        +VF +E  CWP+ +L + Y  
Sbjct: 67  --ITDNKLIIVCDTFDLFPVANKQEILNKYHQICREKERVVFSSEVYCWPEKNLANIYTQ 124

Query: 149 --PAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
             P + S YRYLNSG F+G   DI  L++N  I + +DDQLY+   +L  +      I+L
Sbjct: 125 IYPKIISKYRYLNSGSFMGRRNDICALLNN--ILDTDDDQLYFTKKYLQSS-----NIIL 177

Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
           DT   LFQ + GS +DI ++   D  ++   TK  T P+ IHGNG +K  LN   N L  
Sbjct: 178 DTECQLFQAINGSTDDIGIH---DNRIYNKYTK--TFPIFIHGNGPAKTFLNYLENNLHP 232

Query: 267 SWKTSGCTRCNLIKHLDS-LKPDQFPSVLISVFIDKPT-AFLEEFLNKIANLNYPAKKIS 324
                     +L+  +++ L  DQ+  V I+++ID  + + L+ FL+ +  +N   K I 
Sbjct: 233 K---------SLVNIMNTKLISDQYK-VFIALYIDSNSISELKTFLDSVTKINCTNKIIY 282

Query: 325 MFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFY 384
           ++  ++ +Y           FK + + + +I       S    N    + +    D+YF 
Sbjct: 283 VYDKSHSDY-----------FKQLLEMLGFIY------SSNVSNYVFIDFIKSNCDYYFL 325

Query: 385 VDSDSHLDNP---DVLKYLVNRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMN 440
           ++ +  L N    ++L +L   N  +++PLL+ +    ++NFWGAL+ +G+Y RS DY+N
Sbjct: 326 LEQNCILTNSMTLEILIHLCQNNNRIVSPLLIGKENTNFANFWGALDKNGYYKRSDDYLN 385

Query: 441 IINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLK 500
           II   Q   G+WNVPYI    L   S+I   N+   Y  +  D DM  C NLR   + + 
Sbjct: 386 IIR--QEKIGLWNVPYIYGVILFNKSIINDWNLSQ-YEKHKDDRDMNLCFNLRKHTLFMY 442

Query: 501 IDSTQEYGHLV 511
             +   YG+++
Sbjct: 443 TCNLDCYGYII 453


>gi|371945290|gb|AEX63110.1| putative procollagen-lysine [Moumouvirus Monve]
          Length = 451

 Score =  193 bits (490), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 152/486 (31%), Positives = 247/486 (50%), Gaps = 53/486 (10%)

Query: 35  LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDE 93
           L I V+ N+ DG  RF    +   L  K +G  + W GGDMS  +GGG KVN L   L+E
Sbjct: 7   LGIGVSPNKNDGVLRFETYCKAFNLSYKIVGDGKIWNGGDMSVGMGGGQKVNELLQVLNE 66

Query: 94  MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKYPA 150
             + ++ +++V D++D+    GV +I E++      + +I+F +E  CWPD +L + YP 
Sbjct: 67  --VNENKLLIVCDTFDLFPVSGVEEIYEKYKKLCNGNKSIIFSSEVYCWPDKNLANFYPL 124

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
             S Y+YLNSG F+GY  D+ +L+SN  I + +DDQLYY   FL         I+LD   
Sbjct: 125 TESKYKYLNSGSFMGYRDDLHKLVSN--ILDNDDDQLYYTKKFLQ-----GENIILDQNC 177

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE--LNSFGNYLAKSW 268
            LFQ + G   D+ ++ +      + N    + P+ IHGNG SK +  LN   NY+    
Sbjct: 178 QLFQAINGCKSDLIVHKN-----RIFNKYTKSYPIFIHGNGPSKTKKFLNRLENYIEPLL 232

Query: 269 KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
                    L+    +++  Q P V I++++D         LN         KKI    Y
Sbjct: 233 ---------LVDIPKTIETPQ-PKVFIALYVD------TSLLNNFTQFFESVKKID---Y 273

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSD 388
           +N+E +  ++D + +N      N+    + S + + E      ++ ++ G D+Y  ++ +
Sbjct: 274 DNKEIY--IYDKFQNNQIEQLINLLGFVYKSNITNYE-----FDDFINSGCDYYCLMEQN 326

Query: 389 SHLDNPDVLKYLV---NRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNIING 444
             +   + LK ++   N +  +IAPLLV +    ++NFWG+L+  G+Y RS +Y++ I  
Sbjct: 327 YIVTKTNFLKEIIPLCNNHHRIIAPLLVSKSNNYFTNFWGSLDKKGYYKRSKNYLSWIMR 386

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
           ++   G+WNVPYIT   +   SVI   N+K  Y     D DM  C NLR   + + + + 
Sbjct: 387 EK--IGLWNVPYITGVIIFDKSVILNWNLKQ-YDNYKNDRDMNLCFNLRKHTLFMYMCNL 443

Query: 505 QEYGHL 510
             YG +
Sbjct: 444 DNYGFI 449


>gi|441432117|ref|YP_007354159.1| hypothetical protein Moumou_00179 [Acanthamoeba polyphaga
           moumouvirus]
 gi|440383197|gb|AGC01723.1| hypothetical protein Moumou_00179 [Acanthamoeba polyphaga
           moumouvirus]
          Length = 451

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 150/486 (30%), Positives = 247/486 (50%), Gaps = 53/486 (10%)

Query: 35  LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSS-LGGGYKVNLLKNELDE 93
           L I V+ N+ DG  RF    +   L  K +G  + W GGDMS+  GGG KVN L   L+E
Sbjct: 7   LGIGVSPNKNDGVLRFETYCKSFNLPYKIVGDGKIWNGGDMSAGAGGGQKVNELLQVLNE 66

Query: 94  MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKYPA 150
             + ++ +++V D++D+    GV +I E++      + +I+F +E  CWPD +L + YP 
Sbjct: 67  --VNENKLLIVCDTFDLFPVSGVEEIYEKYKKLCNGNKSIIFSSEVYCWPDKNLANFYPL 124

Query: 151 VGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLA 210
             S Y+YLNSG F+GY  D+ +L+SN  I + +DDQLYY   FL         I+LD   
Sbjct: 125 TESKYKYLNSGSFMGYRDDLHKLVSN--ILDNDDDQLYYTKKFL-----QGENIILDHNC 177

Query: 211 NLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIE--LNSFGNYLAKSW 268
            LFQ + G   DI ++ +      + N    + P+ IHGNG SK +  LN   NY+    
Sbjct: 178 QLFQAINGCKSDIVVHKN-----RIFNKYTKSYPIFIHGNGPSKTKKFLNRLENYIEPLL 232

Query: 269 KTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY 328
                    L+    +++  Q P V I++++D         LN         KKI    Y
Sbjct: 233 ---------LVDIPKTIETPQ-PKVFIALYVD------TSLLNNFTQFFESVKKID---Y 273

Query: 329 NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSD 388
           +N+E +  ++D + +N      N+    + S + + E      ++ ++ G D+Y  ++ +
Sbjct: 274 DNKEIY--IYDKFQNNQIEQLINLLGFVYKSNITNYE-----FDDFINSGCDYYCLMEQN 326

Query: 389 SHLDNPDVLKYLV---NRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNIING 444
             +     LK ++   N +  +I+PLL+ +    ++NFWG+L+  G+Y RS DY++++  
Sbjct: 327 YVITKTTFLKEIIPLFNNHHRIISPLLMSKNNSCFTNFWGSLDDKGYYERSEDYLSLVAR 386

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDST 504
           ++   G+WNVPYI+   +   SVI   N+K  Y     D DM  C NLR   + + + + 
Sbjct: 387 EK--IGLWNVPYISGVIIFDKSVILNWNLKQ-YDNYKNDRDMNLCFNLRKYTLFMYMCNL 443

Query: 505 QEYGHL 510
             YG +
Sbjct: 444 DNYGFI 449


>gi|448825199|ref|YP_007418130.1| putative procollagen-lysine 2-oxoglutarate dioxygenase [Megavirus
           lba]
 gi|444236384|gb|AGD92154.1| putative procollagen-lysine 2-oxoglutarate dioxygenase [Megavirus
           lba]
          Length = 453

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 154/490 (31%), Positives = 248/490 (50%), Gaps = 56/490 (11%)

Query: 35  LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDE 93
           L I V+  + DG  RF +  ++  L    +G  + W GGDMS   GGG K+N L   L+ 
Sbjct: 7   LGIGVSLKKNDGVLRFEKYCQIFDLPYIIVGDGKIWKGGDMSVGAGGGQKINELLIALET 66

Query: 94  MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKY-- 148
             ITD+ +I+V D++D+       +IL +++        +VF +E  CWP+ +L + Y  
Sbjct: 67  --ITDNKLIIVCDTFDLFPVANKQEILNKYHQICGEKERVVFSSEVYCWPEKNLANIYTQ 124

Query: 149 --PAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
             P + S YRYLNSG F+G   DI  L++N  I + +DDQL++   +L  +      I+L
Sbjct: 125 IYPKIISKYRYLNSGSFMGRRNDICALLNN--ILDTDDDQLFFTKKYLQSS-----NIIL 177

Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
           DT   LFQ + GS +DI ++   D  ++   TK  T P+ IHGNG +K  LN   N L  
Sbjct: 178 DTECQLFQAINGSTDDIGIH---DNRIYNKYTK--TFPIFIHGNGPAKTFLNYLENNLHP 232

Query: 267 SWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTA-FLEEFLNKIANLNYPAKKISM 325
                  +  N++     L  DQ+  V I+++ID  +   L+ FL+ +  +N   K I +
Sbjct: 233 K------SLVNIVN--TKLVSDQYK-VFIALYIDSNSINELKIFLDSVTKINCTNKIIYV 283

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
           +  ++ +Y           FK + + + +I       S    N    + +    D+YF +
Sbjct: 284 YDKSHSDY-----------FKQLLEMLGFIY------SSNVSNYVFVDFIKSDCDYYFLL 326

Query: 386 DSDSHLDNP---DVLKYLVNRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNI 441
           + +  L N    ++L +L   N  +++PLL+ +    ++NFWGAL+ +G+Y RS DY+NI
Sbjct: 327 EQNCILTNSMTLEILIHLCQNNNRIVSPLLIGKENTNFANFWGALDKNGYYKRSDDYLNI 386

Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
           I   Q   G+WNVPYI    L   S+I   N+   Y  +  D DM  C NLR   + +  
Sbjct: 387 IR--QEKIGLWNVPYIYGVILFNKSIINDWNLSQ-YEKHKDDRDMNLCFNLRKHTLFMYT 443

Query: 502 DSTQEYGHLV 511
            +   YG+++
Sbjct: 444 CNLDCYGYII 453


>gi|371943512|gb|AEX61341.1| putative procollagen-lysine [Megavirus courdo7]
          Length = 453

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 154/490 (31%), Positives = 248/490 (50%), Gaps = 56/490 (11%)

Query: 35  LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDE 93
           L I V+  + DG  RF +  ++  L    +G  + W GGDMS   GGG K+N L   L+ 
Sbjct: 7   LGIGVSLKKNDGVLRFEKYCQIFDLPYIIVGDGKIWKGGDMSVGAGGGQKINELLIALET 66

Query: 94  MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKY-- 148
             ITD+ +I+V D++D+       +IL +++        +VF +E  CWP+ +L + Y  
Sbjct: 67  --ITDNKLIIVCDTFDLFPVANKQEILNKYHQICGEKERVVFSSEVYCWPEKNLANIYTQ 124

Query: 149 --PAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
             P + S YRYLNSG F+G   DI  L++N  I + +DDQL++   +L  +      I+L
Sbjct: 125 IYPKIISKYRYLNSGSFMGRRNDICALLNN--ILDTDDDQLFFTKKYLQSS-----NIIL 177

Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
           DT   LFQ + GS +DI ++   D  ++   TK  T P+ IHGNG +K  LN   N L  
Sbjct: 178 DTECQLFQAINGSTDDIGIH---DNRIYNKYTK--TFPIFIHGNGPAKTFLNYLENNLHP 232

Query: 267 SWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTA-FLEEFLNKIANLNYPAKKISM 325
                  +  N++     L  DQ+  V I+++ID  +   L+ FL+ +  +N   K I +
Sbjct: 233 K------SLVNIVN--TKLVSDQYK-VFIALYIDSNSINELKIFLDSVTKINCTNKIIYV 283

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
           +  ++ +Y           FK + + + +I       S    N    + +    D+YF +
Sbjct: 284 YDKSHSDY-----------FKQLLEMLGFIY------SSNVSNYVFVDFIKSDCDYYFLL 326

Query: 386 DSDSHLDNP---DVLKYLVNRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNI 441
           + +  L N    ++L +L   N  +++PLL+ +    ++NFWGAL+ +G+Y RS DY+NI
Sbjct: 327 EQNCILTNSMTLEILIHLCQNNNRIVSPLLIGKENTNFANFWGALDKNGYYKRSDDYLNI 386

Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
           I   Q   G+WNVPYI    L   S+I   N+   Y  +  D DM  C NLR   + +  
Sbjct: 387 IR--QEKIGLWNVPYIYGVILFNKSIINDWNLSQ-YEKHKDDRDMNLCFNLRKHTLFMYT 443

Query: 502 DSTQEYGHLV 511
            +   YG+++
Sbjct: 444 CNLDCYGYII 453


>gi|363540743|ref|YP_004894301.1| mg250 gene product [Megavirus chiliensis]
 gi|350611908|gb|AEQ33352.1| putative procollagen-lysine 2-oxoglutarate dioxygenase [Megavirus
           chiliensis]
          Length = 453

 Score =  189 bits (481), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 154/490 (31%), Positives = 248/490 (50%), Gaps = 56/490 (11%)

Query: 35  LVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELDE 93
           L I V+  + DG  RF +  ++  L    +G  + W GGDMS   GGG K+N L   L+ 
Sbjct: 7   LGIGVSLKKNDGVLRFEKYCQIFDLPYIIVGDGKIWKGGDMSVGAGGGQKINELLIALET 66

Query: 94  MDITDDMIILVTDSYDVIIDGGVNDILERFNTF---DANIVFGAERLCWPDTSLYDKY-- 148
             ITD+ +I+V D++D+       +IL +++        +VF +E  CWP+ +L + Y  
Sbjct: 67  --ITDNKLIIVCDTFDLFPVANKQEILNKYHQICGEKERVVFSSEVYCWPEKNLANIYTQ 124

Query: 149 --PAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVL 206
             P + S YRYLNSG F+G   DI  L++N  I + +DDQL++   +L  +      I+L
Sbjct: 125 IYPKIISKYRYLNSGSFMGRRNDICALLNN--ILDTDDDQLFFTKKYLQSS-----NIIL 177

Query: 207 DTLANLFQNLYGSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
           DT   LFQ + GS +DI ++   D  ++   TK  T P+ IHGNG +K  LN   N L  
Sbjct: 178 DTECQLFQAINGSTDDIGIH---DNRIYNKYTK--TFPIFIHGNGPAKTFLNYLENNLHP 232

Query: 267 SWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTA-FLEEFLNKIANLNYPAKKISM 325
                  +  N++     L  DQ+  V I+++ID  +   L+ FL+ +  +N   K I +
Sbjct: 233 K------SLVNIVN--TKLVSDQYK-VFIALYIDSNSINELKIFLDSVTKINCTNKIIYV 283

Query: 326 FVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYV 385
           +  ++ +Y           FK + + + +I       S    N    + +    D+YF +
Sbjct: 284 YDKSHSDY-----------FKQLLEMLGFIY------SSNVSNYVFVDFIKSDCDYYFLL 326

Query: 386 DSDSHLDNP---DVLKYLVNRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNI 441
           + +  L N    ++L +L   N  +++PLL+ +    ++NFWGAL+ +G+Y RS DY+NI
Sbjct: 327 EQNCILTNSMTLEILIHLCQNNNRIVSPLLIGKENTNFANFWGALDKNGYYKRSDDYLNI 386

Query: 442 INGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKI 501
           I   Q   G+WNVPYI    L   S+I   N+   Y  +  D DM  C NLR   + +  
Sbjct: 387 IR--QEKIGLWNVPYIYGVILFNKSIINDWNLSQ-YEKHKDDRDMNLCFNLRKYTLFMYT 443

Query: 502 DSTQEYGHLV 511
            +   YG+++
Sbjct: 444 CNLDCYGYII 453


>gi|167535270|ref|XP_001749309.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772175|gb|EDQ85830.1| predicted protein [Monosiga brevicollis MX1]
          Length = 623

 Score =  184 bits (467), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 149/593 (25%), Positives = 255/593 (43%), Gaps = 102/593 (17%)

Query: 109 DVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS--------LYDKYPAVGSG-YRYLN 159
           + ++ G + ++   F    A I+  A  LC    +        L D +P V  G  RY +
Sbjct: 38  EALLLGEMVELQRNFQQQPARILMAATHLCRSACAFAGATSWRLTDDWPDVARGSARYGD 97

Query: 160 SGGFIGYAKDIKELISNRSIKNEEDDQLYYAL------LFLDETLRTKHKIVLDTLANLF 213
           +   + Y+ D++ L+ +R   N+       A       L+LD+  R +  + +D  +   
Sbjct: 98  ASALVAYSADMQALL-DRIAPNQPTSAFRVAASKQIISLYLDDASRAQLGLDVDASSAFV 156

Query: 214 QNLYG---SLEDIKLNFDLDEF----VHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAK 266
           Q+L G   S +   L FD          L NT     P ++   G  K+ L++  NY+  
Sbjct: 157 QHLRGLGDSFDTRYLRFDFHHRGTNDTRLINTVTRQLPWLVTAGGNGKL-LDAISNYVPM 215

Query: 267 SWKTS-GCTRC--NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKI 323
            W    GC  C  +  +H     P+    V++++ ++  + FL   L ++A  +   +++
Sbjct: 216 KWHQDLGCLHCVNDATQH-----PEH--KVVMALVVELRSPFLRAVLERLAQQSLSPQQM 268

Query: 324 SMFVYNNQE----YHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEAR-NLAVENSLHKG 378
           ++ V   +      +  L  ++   FK  F +++ +A       +EA    A   S  +G
Sbjct: 269 ALIVGIEEGDMSVTYTSLVQNFTEEFKDSFASIQIVAGLKGRALREALFQGAAAVSGFQG 328

Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGF------- 431
               F + S ++L NP+   +L+N N+S++AP+L R  K +SNFWGA++ D         
Sbjct: 329 AS-TFLISSLTYLTNPNTTAHLLNENQSVLAPVLPRHQKLYSNFWGAIDGDARSHCHDFH 387

Query: 432 ----------------------------------------YARSFDYMNIINGDQGGKGI 451
                                                   Y RS+DY +I   +  G   
Sbjct: 388 ATCPAWQLAGECETNEVWMSNNCAKACQACQVPGDVQGVRYKRSWDYRDIATREVQG--- 444

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQE 506
                     L+K +   A   +   +        +D+D+     L      +K+D+ + 
Sbjct: 445 ------VCALLLKPTAALALQQQLSTSPEHENYLPVDWDLKLTEWLHAAKFEVKVDNQES 498

Query: 507 YGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI 566
           +G L+D  NFD +KT+P+++ +  NP  W   YIHP+YQ     D V  + C D++ FP+
Sbjct: 499 FGTLIDPTNFDSRKTHPDMFLVEANPEPWADIYIHPDYQPYKKLDFVQGR-CWDIYNFPL 557

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
            +E+FC E +Q  E    WS G N DKRL+ GYE VPTRDIH  Q+     W+
Sbjct: 558 FSEQFCGEMIQWAETMNLWSGGDNKDKRLKGGYEPVPTRDIHFNQMDFQSAWS 610


>gi|194391238|dbj|BAG60737.1| unnamed protein product [Homo sapiens]
          Length = 243

 Score =  172 bits (435), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 79/181 (43%), Positives = 120/181 (66%), Gaps = 2/181 (1%)

Query: 98  DDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRY 157
           +DMII+  DSYDVI+ G   ++L++F    + ++F AE  CWP+  L ++YP VG+G R+
Sbjct: 8   EDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQYPEVGTGKRF 67

Query: 158 LNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLY 217
           LNSGGFIG+A  I +++     K+++DDQL+Y  L+LD  LR K  + LD  + +FQNL 
Sbjct: 68  LNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLDHKSRIFQNLN 127

Query: 218 GSLEDIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKT-SGCTRC 276
           G+L+++ L FD +  V + N  Y+T P+++HGNG +K++LN  GNY+   W    GC  C
Sbjct: 128 GALDEVVLKFDRNR-VRIRNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNGWTPEGGCGFC 186

Query: 277 N 277
           N
Sbjct: 187 N 187


>gi|209736298|gb|ACI69018.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor [Salmo
           salar]
          Length = 317

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 86/204 (42%), Positives = 126/204 (61%), Gaps = 9/204 (4%)

Query: 8   NCLILSCVVFFISVHCN---KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTL 64
           +C+ + CV+    +  +   + + I  D  LVITVA+ +TDG+     S+  +   VK L
Sbjct: 4   SCIAMVCVLLLGWMQSSLGAEQRVISPDNLLVITVATEDTDGF-----SSSSSNYTVKVL 58

Query: 65  GLHQPWLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERF 123
           GL + W GGD++ ++GGG KV  LK EL +     D++IL  DSYDVI+  G  ++L +F
Sbjct: 59  GLGEQWKGGDVARTVGGGQKVRWLKTELLKHSDKKDLVILFVDSYDVILASGPEELLWKF 118

Query: 124 NTFDANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEE 183
           +     +VF AE  CWPD  L  KYPAV +G RYLNSGGFIGYA ++ E++     K+ +
Sbjct: 119 SRLGHRMVFSAEGFCWPDQKLAPKYPAVHTGKRYLNSGGFIGYAPELSEIVQQWKHKDND 178

Query: 184 DDQLYYALLFLDETLRTKHKIVLD 207
           DDQL+Y  ++LD+  RTK+ + LD
Sbjct: 179 DDQLFYTKIYLDKVQRTKYNMTLD 202



 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 48/97 (49%), Positives = 71/97 (73%), Gaps = 2/97 (2%)

Query: 374 SLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYA 433
           +L    ++YF +D+D  + NPDVL+ L+  N+S+IAP+L R  K WSNFWGAL+ +GFY+
Sbjct: 200 TLDHQCEYYFSIDADVVIVNPDVLRVLIEENKSVIAPMLSRHGKLWSNFWGALSPEGFYS 259

Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA 470
           RS DY++I+ G +   G+WNVPYIT  Y++K SV++ 
Sbjct: 260 RSEDYIDIVQGKR--IGLWNVPYITQVYMIKGSVLRG 294


>gi|410931371|ref|XP_003979069.1| PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 1-like,
           partial [Takifugu rubripes]
          Length = 195

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 81/191 (42%), Positives = 123/191 (64%), Gaps = 3/191 (1%)

Query: 10  LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
           L +S    FI   C + + I E+K LV+TVA+ +TDG++RF++SA+     VK +G  + 
Sbjct: 7   LWISVCALFILTSCEE-QRIPEEKLLVVTVATKDTDGFRRFLRSAKHFNYTVKVVGRDEK 65

Query: 70  WLGGD-MSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
           W+GG+ M + GGG KV LLK+ L+EM    D IIL TDSYDV+   G  ++L++F     
Sbjct: 66  WIGGNYMGAPGGGQKVRLLKSALEEMK-NQDKIILFTDSYDVVFASGPXELLKKFQQARH 124

Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
            +VF +E L WPD  L DKYP V  G R+L SGGFIGY  +++E+++  S ++++ DQL+
Sbjct: 125 KVVFSSESLIWPDRHLEDKYPHVREGNRFLGSGGFIGYLANVREMVAEWSGEDDDSDQLF 184

Query: 189 YALLFLDETLR 199
           +  +++D   R
Sbjct: 185 FTRIYIDAAKR 195


>gi|147744648|gb|ABQ51191.1| procollagen-lysine 2-oxoglutarate 5-dioxygenase 3, partial [Capra
           hircus]
          Length = 216

 Score =  154 bits (388), Expect = 2e-34,   Method: Composition-based stats.
 Identities = 77/218 (35%), Positives = 128/218 (58%), Gaps = 7/218 (3%)

Query: 272 GCTRCNLIKH-LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN 330
           GC  CN  +  L   +P   P VL++VF+++PT FL  FL ++  L+YP  ++++F++NN
Sbjct: 3   GCGFCNQDRRPLPGGQPP--PRVLLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTLFLHNN 60

Query: 331 QEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHK-GVDFYFYVDSDS 389
           + YH P  DD     +  F  VK +     +   EAR++A++        +FYF +D+D+
Sbjct: 61  EVYHEPHIDDSWPQLQDHFSAVKLVGPEEALTPGEARDMAMDICRQDPKCEFYFSLDADT 120

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            + NP  L+ L+  N  +IAP+L R  K WSNFWGAL+ D +YARS DY+ ++   +   
Sbjct: 121 VITNPQTLRILIEANRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQRKR--V 178

Query: 450 GIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDM 486
           G+WNVPYI+  Y+++   ++     + +++ +  D DM
Sbjct: 179 GVWNVPYISQAYVIRGETLRTELPQREVFSGSDTDPDM 216


>gi|364023677|gb|AEW46913.1| seminal fluid protein CSSFP065 [Chilo suppressalis]
          Length = 178

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 76/167 (45%), Positives = 111/167 (66%), Gaps = 4/167 (2%)

Query: 63  TLGLHQPWLGGDMSSLGGGYKVNLLKNELDEMDITDD---MIILVTDSYDVIIDGGVNDI 119
            L   + W GGDM   GGG K+N+LK+EL ++  +DD    IIL TDSYD++    + DI
Sbjct: 1   VLAKGKEWTGGDMKYAGGGQKINILKDELSKLMKSDDNKDRIILFTDSYDIMFLSTLEDI 60

Query: 120 LERFNTF-DANIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRS 178
           L++F +F D  ++F AE+ CWPD+ L   YP       YLNSG FIGY  ++ E+++++ 
Sbjct: 61  LKKFKSFKDTRVLFSAEQFCWPDSKLAGHYPKTEVANPYLNSGAFIGYLPELLEILNHKP 120

Query: 179 IKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKL 225
           IK+++DDQLYY  ++LD+ LR   KI LD  + +FQNLYG+L D++L
Sbjct: 121 IKDQDDDQLYYTKIYLDKELRHNLKISLDHDSKIFQNLYGALSDVQL 167


>gi|55250037|gb|AAH85460.1| Procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 [Danio rerio]
          Length = 165

 Score =  147 bits (372), Expect = 2e-32,   Method: Composition-based stats.
 Identities = 70/154 (45%), Positives = 103/154 (66%), Gaps = 3/154 (1%)

Query: 10  LILSCVVFFISVHCNKVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQP 69
           ++++CV   + +  NK  +I  +K LV+TVA+ ETDG+ RF+QSA      VK LG+ + 
Sbjct: 13  MLVTCVHCTLGMETNK--DIPTEKLLVLTVATQETDGFLRFMQSANYFNFNVKVLGMGEE 70

Query: 70  WLGGDMS-SLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDA 128
           W GGD+  S+GGG KV LLK  ++ +D  +D+++L  DSYD+I  GG  +IL +F   + 
Sbjct: 71  WKGGDVGRSIGGGQKVRLLKEAMESLDQQEDLVVLFVDSYDLIFAGGAEEILRKFQQSNH 130

Query: 129 NIVFGAERLCWPDTSLYDKYPAVGSGYRYLNSGG 162
            +VF AE + WPD+ L +KYP+V SG R+LNSGG
Sbjct: 131 KVVFAAEGIIWPDSQLAEKYPSVRSGKRFLNSGG 164


>gi|76162576|gb|AAX30505.2| SJCHGC04226 protein [Schistosoma japonicum]
          Length = 179

 Score =  142 bits (357), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 72/169 (42%), Positives = 107/169 (63%), Gaps = 2/169 (1%)

Query: 34  FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNELD 92
            LV+TVA+ + D   RF++S  +N  +VK LG    W GG+++ S GGG KVN+LK+EL 
Sbjct: 7   ILVLTVATEKNDALDRFLRSCSLNGFEVKVLGEGSYWKGGNVAKSTGGGQKVNILKDELA 66

Query: 93  EMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG 152
           +     D ++L  DSYDV+    V ++L+ +  F++ ++F AE  CWP  SL   YP V 
Sbjct: 67  KSTYRPDQLVLFVDSYDVVFMQNVANLLKGYERFESKVIFSAEEFCWPQPSLKSLYPEVK 126

Query: 153 SG-YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRT 200
            G  RYLNSGGFIG   ++ +++++  I +++DDQLYY  +FLD  LR 
Sbjct: 127 PGERRYLNSGGFIGPVANLIKIVNHTPINDDDDDQLYYTNIFLDSKLRV 175


>gi|74199791|dbj|BAE20730.1| unnamed protein product [Mus musculus]
          Length = 218

 Score =  123 bits (308), Expect = 4e-25,   Method: Composition-based stats.
 Identities = 65/159 (40%), Positives = 102/159 (64%), Gaps = 1/159 (0%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPW-LGGDMSSLGGGYKVNLLKN 89
           ED  LV+TVA+ ET+G++RF +SA+    ++++LGL + W + G  ++ GGG KV LLK 
Sbjct: 25  EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWSVDGGPAAAGGGQKVRLLKK 84

Query: 90  ELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP 149
            L++    +D++IL  DSYDV+   G  ++L++F    + +VF AE   +PD  L  KYP
Sbjct: 85  ALEKHADKEDLVILFVDSYDVVFASGPRELLKKFQQAKSQVVFSAEEHIYPDRRLEAKYP 144

Query: 150 AVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLY 188
            V  G R+L SGGFIGYA  + +L++    ++ + DQL+
Sbjct: 145 TVPDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLF 183


>gi|313240888|emb|CBY33174.1| unnamed protein product [Oikopleura dioica]
          Length = 136

 Score =  122 bits (306), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 61/137 (44%), Positives = 82/137 (59%), Gaps = 14/137 (10%)

Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAP-MSFVVRYRPDEQPSLRPHHDSS 667
           M Q+GL   W   ++ Y  P+  + + GY+  P   P + FVVRY+P EQ  LRPHHDSS
Sbjct: 1   MNQIGLQDEWLYVVKTYAAPMVSKFYTGYN--PDNKPNLMFVVRYKPGEQDRLRPHHDSS 58

Query: 668 TYTINIALNQVGVDYEGGGCRFIRYNCNVTAT-----------RMGWMLMHPGRLTHYHE 716
           T+T  IALN+  +D+EGGG  F RY C+V  +           + G     PGRLTH H 
Sbjct: 59  TWTFQIALNRPNIDFEGGGTYFTRYKCSVVGSATEQDSRSLEVKQGMGFAFPGRLTHQHA 118

Query: 717 GLQVTQGTRYIMISFVD 733
           GL  T+GTRYI+++F+D
Sbjct: 119 GLPTTKGTRYILVNFMD 135


>gi|324538590|gb|ADY49540.1| Procollagen-lysine,2-oxoglutarate 5-dioxygenase, partial [Ascaris
           suum]
          Length = 144

 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 59/142 (41%), Positives = 92/142 (64%), Gaps = 3/142 (2%)

Query: 164 IGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDI 223
           +GYA +I ++I+   + +++DDQLYY  ++LDE LR   K+ LD+++ +FQNL G  EDI
Sbjct: 1   MGYATEIWQIINAYPVADKDDDQLYYTNVYLDEKLRNSLKMTLDSMSYIFQNLNGVREDI 60

Query: 224 KLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTS-GCTRCNLIKH- 281
            L FD +    + N  YNT+P+IIHGNG SK+ LN   NY+ K+W    GC  C    + 
Sbjct: 61  ALEFDDNGDAQVANIPYNTHPLIIHGNGPSKLFLNHLANYIGKAWSAQRGCLFCETSNYV 120

Query: 282 -LDSLKPDQFPSVLISVFIDKP 302
            L+ +  +++PS+ +++FI KP
Sbjct: 121 NLEDIPEERWPSLTLAIFIAKP 142


>gi|89892066|gb|ABD78864.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 [Bubalus bubalis]
          Length = 93

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 48/92 (52%), Positives = 70/92 (76%), Gaps = 1/92 (1%)

Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
           +GL  VW  F+R+++ P+  + F GY+ +   A ++FVV+Y P+ Q SLRPHHD+ST+TI
Sbjct: 1   IGLENVWLHFIREFIAPVTLKVFAGYYTKGF-ALLNFVVKYSPERQRSLRPHHDASTFTI 59

Query: 672 NIALNQVGVDYEGGGCRFIRYNCNVTATRMGW 703
           NIALN VG D++GGGC+F+RYNC++ + R GW
Sbjct: 60  NIALNNVGEDFQGGGCKFLRYNCSIESPRKGW 91


>gi|47212320|emb|CAF91258.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 52

 Score = 99.0 bits (245), Expect = 8e-18,   Method: Composition-based stats.
 Identities = 39/52 (75%), Positives = 43/52 (82%)

Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           +GGGCRF+RYNC+V A R GW LMHPGRLTHYHEGL  T G RYI +SFVDP
Sbjct: 1   QGGGCRFLRYNCSVNAPRKGWALMHPGRLTHYHEGLPTTAGVRYIAVSFVDP 52


>gi|291225618|ref|XP_002732798.1| PREDICTED: Dynein intermediate chain 2, ciliary-like [Saccoglossus
           kowalevskii]
          Length = 858

 Score = 94.4 bits (233), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 45/87 (51%), Positives = 62/87 (71%), Gaps = 4/87 (4%)

Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS 466
           +IAPL+ RP K WSN WGAL+ DGFYARS DY++I+ G++  KG+WN+P+ITN YL++  
Sbjct: 5   IIAPLVSRPGKLWSNCWGALSDDGFYARSDDYVDIVKGNR--KGVWNMPHITNLYLVQGD 62

Query: 467 VIKATNIKTIYTLNSMDYDMAFCTNLR 493
           V K   +  IY+   +D DMA   +LR
Sbjct: 63  VFKKHKVSFIYS--DLDADMALTRHLR 87


>gi|397584720|gb|EJK53060.1| hypothetical protein THAOC_27572, partial [Thalassiosira oceanica]
          Length = 573

 Score = 92.0 bits (227), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 85/319 (26%), Positives = 149/319 (46%), Gaps = 37/319 (11%)

Query: 24  NKVKNIDEDKFLVITVASNETD--GYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGG 81
           ++++ +D D  L++  A+ + +  GY+   +SA      +  +   + W          G
Sbjct: 267 SELEKLDGDADLIVLTAATDPEHFGYQSLKRSATYFGHSLLNVLRGKKW---------EG 317

Query: 82  YKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDAN------------ 129
           Y   L+        +  + +IL  D YD ++  G +DIL  +N                 
Sbjct: 318 YNTKLIWTRKVLESVDSNQLILFVDGYDTMLQSGPDDILRSYNEMVEKFRSKWNCEDCVE 377

Query: 130 -IVFGAERLCWPDTSLYDKYPAVGSGYR----YLNSGGFIGYAKDIKELISNRSI-KNEE 183
            + FGAE LCWP  ++ ++Y    S Y     YLNSG +IG A  I+ ++ +    K ++
Sbjct: 378 PVFFGAEHLCWPSKTVCEQYVNGTSEYSADNPYLNSGTYIGRAGSIRAILEDVDPEKPDD 437

Query: 184 DDQLYYALLFLDETLR-TKHKIVLDTLANLFQNLYGSLEDIKLN-FDLDEFVHLTNTKYN 241
           DDQLYY+L  +    + T   IVLD+   LF  L G   D  ++   LD +++ +N K  
Sbjct: 438 DDQLYYSLKLVAFVEKGTGVPIVLDSDQRLFYALLGRSSDWTISEKSLDYWLYHSNNKDT 497

Query: 242 -TNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKHLDSLKPDQFPSVLISVFID 300
            T P ++HG G +K  L    NYL  ++     T+ ++ K +D+   D F  +++ +F  
Sbjct: 498 PTLPAVLHGQGPAKHTLIGITNYLPGAYSDFYGTKQHIHKVVDA---DSFHPLVVGLFYT 554

Query: 301 KPTAFLEE--FLNKIANLN 317
           + T+   E  F++ I  L+
Sbjct: 555 ELTSDKHERDFISGIKALD 573


>gi|156390789|ref|XP_001635452.1| predicted protein [Nematostella vectensis]
 gi|156222546|gb|EDO43389.1| predicted protein [Nematostella vectensis]
          Length = 589

 Score = 90.1 bits (222), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 77/300 (25%), Positives = 138/300 (46%), Gaps = 39/300 (13%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFK 346
           ++P+VL+SV        L  +L  I NL+YP  +IS+++ +  N++    L  ++ +N K
Sbjct: 34  KYPTVLLSVIARNAAHLLPNWLGCIENLDYPKDRISIWITSDHNEDNTTELLKEWANNAK 93

Query: 347 TMFKNVKYIAHNSTVNSKEA-----------------RNLAVENSLHKGVDFYFYVDSDS 389
            ++  V      S  N  +                  R LA++ + +   D+ F VD D+
Sbjct: 94  HLYHRVTMNFTGSPSNYGDVLEASDWTDERYAHVAYLRQLALDTARYWWADYLFVVDCDN 153

Query: 390 HLDNPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
            L NP  L+ L++  +++++P+L       A+SNFWG ++  G+Y R+  Y  I+N ++ 
Sbjct: 154 FLFNPITLRQLMHEEKTVVSPMLEVFGNKSAYSNFWGGMDESGYYKRTDQYFTILNREK- 212

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + YL+      ++ ++  Y     DY       + F  + R  G+ L I
Sbjct: 213 -VGTFEVPMVHSTYLVDLRRRASSELR--YYPPHPDYRGHHDDILVFAHSARMAGVKLHI 269

Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI--HPEYQKSLLPDTVNNQPCP 559
            +   YGHL+      P +    + ++    LD  L Y   HPE+   L P  +   P P
Sbjct: 270 INKHIYGHLI-----LPFEARESLEDMRIQFLDGKLGYYVDHPEHLMPLSPH-LTVPPVP 323


>gi|242027195|ref|XP_002433321.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase, putative
           [Pediculus humanus corporis]
 gi|212519132|gb|EEB20583.1| procollagen-lysine,2-oxoglutarate 5-dioxygenase, putative
           [Pediculus humanus corporis]
          Length = 144

 Score = 89.7 bits (221), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 49/116 (42%), Positives = 78/116 (67%), Gaps = 4/116 (3%)

Query: 164 IGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDI 223
           IGYA ++ E++++RSI +++DDQL+Y   +L+ETLR   KI LD  + +F NL+G+++++
Sbjct: 2   IGYAPELYEILTHRSIDDDDDDQLFYTQAYLNETLRNNLKIKLDHKSQIFHNLHGAMDEL 61

Query: 224 KLNFDLDEFVHLTNTKYNTNPVIIHGNGKS--KIELNSFGNYLAKSWKTS-GCTRC 276
            L F   E  +L N +  ++P+I+HGNG +  K+ LN+ GNYL   W T  GC  C
Sbjct: 62  SLKFKNHE-PYLENEQMKSHPLILHGNGPTVVKVGLNNLGNYLPNCWNTRDGCVSC 116


>gi|118094236|ref|XP_422290.2| PREDICTED: procollagen galactosyltransferase 2 [Gallus gallus]
          Length = 627

 Score = 85.9 bits (211), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 60/245 (24%), Positives = 113/245 (46%), Gaps = 30/245 (12%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P+VL+++      + L  FL  +  L YP  +I+++V   +N +    +  +++ N + +
Sbjct: 54  PTVLLAIIARNAASALPHFLGCVERLRYPKSRIALWVATDHNADNTTAILREWLKNVQNL 113

Query: 349 FKNVKYIAHNSTVNSKEA-----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
           + +V++       +  E                  R  A+  +  K  D+  ++D+D+ L
Sbjct: 114 YHDVEWRPMEDPQSYPEEMGPKHWPSSRFTHVMKLRQAALRAAREKWSDYVLFLDTDNLL 173

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            NP+ L  L+  N++L+AP+L   F  +SNFW  +   G+Y R+ DY  I   +    G 
Sbjct: 174 TNPETLNLLIAENKTLVAPMLESRF-LYSNFWCGITPQGYYKRTLDYPLI--REWKRTGC 230

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDSTQ 505
           + VP I + +L+   + K  + K ++     DY       M F  + R  GI + I + +
Sbjct: 231 FAVPMIHSTFLI--DLRKEASTKLMFYPPHQDYTWSFDDIMVFAFSSRQAGIQMFICNRE 288

Query: 506 EYGHL 510
            YG L
Sbjct: 289 HYGFL 293


>gi|410902625|ref|XP_003964794.1| PREDICTED: procollagen galactosyltransferase 1-like [Takifugu
           rubripes]
          Length = 611

 Score = 83.6 bits (205), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 67/287 (23%), Positives = 137/287 (47%), Gaps = 33/287 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P VL+++        L  FL  I  LNYP ++++++V   +NQ+    +  D++   ++ 
Sbjct: 41  PRVLLALICRNSEHSLPYFLGTIERLNYPKERMALWVATDHNQDNTTVILHDWLVKMQSF 100

Query: 349 FKNVKYIAHNSTVNSKEA-----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
           + NV++      ++ ++                  R +A+E++     D++   D D+ L
Sbjct: 101 YHNVEWRPKEKPIHYEDEAGPKDWTDLRYEHVMKLRQVALESAREMWADYFMLADCDNLL 160

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            NPDVL  L+  N+++I+P+L     A+SNFW  +++ G+Y R+  Y+ I    Q  KG 
Sbjct: 161 TNPDVLWMLMKENKTIISPML-ESRAAYSNFWCGMSSQGYYKRTPAYIPI--RKQVRKGC 217

Query: 452 WNVPYITNCYL--MKTSVIKATNIKTIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQEY 507
           + VP + +  L  ++    +  +    +   S  +D  + F  + +   + + + + + Y
Sbjct: 218 FAVPMVHSTLLIDLRKEASRQLSFHPPHPEYSWAFDDIIVFAFSAQMADVQMFVCNKETY 277

Query: 508 GHL---VDSENFDPQKTNPEVYELI----RNPLDWDLRYIHPEYQKS 547
           G+L   + S N    + +  ++ L+    RNPL    +YIH   +K+
Sbjct: 278 GYLPVPLRSHNTLQDEADSFLHCLLEASARNPLVMPSKYIHVPRKKT 324


>gi|198415096|ref|XP_002129882.1| PREDICTED: similar to GLT25D1 protein [Ciona intestinalis]
          Length = 594

 Score = 82.4 bits (202), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 60/243 (24%), Positives = 118/243 (48%), Gaps = 30/243 (12%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQ------------------E 332
           P+V + +F+      L  FL  + +LNYP K++S+++  +                   E
Sbjct: 55  PTVFVPIFVRNKAHALPYFLKCLYDLNYPKKRLSLWIVTDHNSDNSSQILEKWTNTVKHE 114

Query: 333 YHAPLFD--DYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           YH  +F+  D    +K    N+ +      +   + R  A+E +     DF  Y+D+D+ 
Sbjct: 115 YHDLVFEKPDTEWFYKEQKGNLHW-PEERHIKMLQLRQQALEKARKMWSDFILYLDADNM 173

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L NP  L++L++R+ +++AP+L     +++NFW   + +G+Y R+ +Y  I N +    G
Sbjct: 174 LINPHTLQHLISRDLTIVAPMLTT-IASYANFWADQDENGYYKRADNYFEIRNRETV--G 230

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLNS-----MDYDMAFCTNLRNKGIHLKIDSTQ 505
           ++ VP + + +L+     K+  ++  + L+      +D  + F  + +   I L ID+T 
Sbjct: 231 VFEVPMVHSTFLVNLVARKSRKLR-FWPLHEDYYLLVDDIIVFSIHAKLADIPLYIDNTH 289

Query: 506 EYG 508
            YG
Sbjct: 290 IYG 292


>gi|373955086|ref|ZP_09615046.1| hypothetical protein Mucpa_3485 [Mucilaginibacter paludis DSM
           18603]
 gi|373891686|gb|EHQ27583.1| hypothetical protein Mucpa_3485 [Mucilaginibacter paludis DSM
           18603]
          Length = 260

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 67/197 (34%), Positives = 94/197 (47%), Gaps = 22/197 (11%)

Query: 36  VITVASN-ETDGYKRFIQ-SAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDE 93
           VITVAS+ +   Y  F++ S E   L   TL     +    +       K  LL   L +
Sbjct: 3   VITVASDLKNTSYLSFLKASCEFYHLDATTLYYSDVYFSNRI-------KDALLNTHLTQ 55

Query: 94  MDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVGS 153
               DD IIL TD+ D +      +I+++FN F+  ++F AE  CWPD S+   YPA   
Sbjct: 56  F--ADDEIILFTDAIDAVFVAEQKEIIDKFNHFNCPLLFSAEVNCWPDKSMEKNYPAPSV 113

Query: 154 GYRYLNSGGFIGYAKDIKELISNRSI----KNEE---DDQLYYALLFLDETLRTKHKIVL 206
            +RYLNSG FIG A  +K L     I    KN      +Q Y+ L+F +E+      I L
Sbjct: 114 HFRYLNSGAFIGRAGYLKYLYEKYPIFEIGKNPAYFWSNQYYWNLVFQNESAN----IQL 169

Query: 207 DTLANLFQNLYGSLEDI 223
           D    LF N   ++ +I
Sbjct: 170 DHSGELFFNTSITISNI 186


>gi|340378483|ref|XP_003387757.1| PREDICTED: procollagen galactosyltransferase 1-like [Amphimedon
           queenslandica]
          Length = 594

 Score = 80.5 bits (197), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 76/311 (24%), Positives = 138/311 (44%), Gaps = 45/311 (14%)

Query: 284 SLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYH--APLFDDY 341
           SL+ +  P V +++        L ++L  I  LNYP  KI + +Y  Q       L  ++
Sbjct: 32  SLQQESRPLVYLAILSRNAAHLLPQYLGYIEGLNYPKDKIIIGLYIGQSVDNTTNLLLEW 91

Query: 342 IHNFKTMFKNVKYIAHN-----------STVNSK-----EARNLAVENSLHKGVDFYFYV 385
             N ++++ NV                 S  +S+     + R   + N+     ++ F+V
Sbjct: 92  SENVRSIYNNVLIYEDGDIFPLGDSELFSWSDSRLEYMCKLRQDVLSNARMARAEYLFFV 151

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLV-RPFKAWSNFWGALNADGFYARSFDYMNIING 444
           D D+ L NPDVL  L+   + ++APLL+    +A+SNFWG    +G+Y R+ +Y+ I+  
Sbjct: 152 DCDNFLINPDVLIRLIEAKKPIVAPLLIYDKERAFSNFWGGQKENGYYLRTEEYLPIVT- 210

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHL 499
            +   G + VP + +  L+    + + ++      T Y  N +D  +    + R  GI +
Sbjct: 211 -RSNLGCFKVPLVHSTLLIDLRTVSSESLAYWPPPTEYKWN-IDDIILLSYSARVNGIGM 268

Query: 500 KIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCP 559
            I +T  +G+L+ +  +        +   IR   +W L+ I            VN+ P P
Sbjct: 269 YILNTDVFGYLLKTGEY------ASLEHAIRETDNWKLKTI------------VNHYPVP 310

Query: 560 DVFWFPIVTEK 570
              +  I +EK
Sbjct: 311 VSQFISIHSEK 321


>gi|402593476|gb|EJW87403.1| hypothetical protein WUBG_01688, partial [Wuchereria bancrofti]
          Length = 69

 Score = 80.1 bits (196), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 32/58 (55%), Positives = 43/58 (74%)

Query: 677 QVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           + G DYEGGG R+ RYNC V+A ++G+  M P +LTH HEG  +T GTRYI +SF++P
Sbjct: 12  ESGRDYEGGGIRYARYNCTVSADQIGYAAMFPAQLTHMHEGFPITSGTRYIAVSFLNP 69


>gi|3043692|dbj|BAA25510.1| KIAA0584 protein [Homo sapiens]
          Length = 738

 Score = 80.1 bits (196), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 60/249 (24%), Positives = 114/249 (45%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL++V        L  FL  +  L+YP  +++++    +N +    +F +++ N
Sbjct: 161 PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEIFREWLKN 220

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I       S+ A     R  A+  +  K  D+  ++D 
Sbjct: 221 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 280

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  L+  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 281 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKR- 338

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 339 -TGCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 395

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 396 CNREHYGYL 404


>gi|427798775|gb|JAA64839.1| Putative procollagen-lysine 2-oxoglutarate 5-dioxygenase, partial
           [Rhipicephalus pulchellus]
          Length = 344

 Score = 80.1 bits (196), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 61/255 (23%), Positives = 115/255 (45%), Gaps = 34/255 (13%)

Query: 283 DSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDD 340
           D L+P   P+VLI+V +      L  F   +   +YP  +IS+++Y  +N +  A + D 
Sbjct: 31  DKLEP---PTVLIAVILRNKAHVLPHFFGYLEQQSYPKSRISLWIYTDHNVDQTAEMVDT 87

Query: 341 YIHNFKTMFKNVKYIAHNSTV-----------------NSKEARNLAVENSLHKGVDFYF 383
           +       + NV   + +                    +    R  A++ +     DF F
Sbjct: 88  WAEAVSNEYHNVNVTSEDGEAFFPDEEGSQKWTAQRYWHVIRLREEAIQVARTLWADFIF 147

Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
           ++D D+ L NP  ++ LV  N ++IAP+L     A+SNFW  +N  G+Y R+ +YM I+ 
Sbjct: 148 FLDGDAMLSNPKTIQDLVEENRTIIAPML-DSRSAYSNFWCGMNEKGYYERTDEYMPILE 206

Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM-------DYDMAFCTNLRNKG 496
            ++   G++ V  + +  L+  +   A + K  Y    +       D  + F  + +   
Sbjct: 207 KEK--VGVFPVVMVHSATLINLN--HANSRKLTYDPQKLEGYTGPNDDVITFAHSAKFAA 262

Query: 497 IHLKIDSTQEYGHLV 511
           + + I +  +YGH++
Sbjct: 263 VEMFISNKDQYGHIL 277


>gi|16506820|ref|NP_055916.1| procollagen galactosyltransferase 2 precursor [Homo sapiens]
 gi|74750765|sp|Q8IYK4.1|GT252_HUMAN RecName: Full=Procollagen galactosyltransferase 2; AltName:
           Full=Glycosyltransferase 25 family member 2; AltName:
           Full=Hydroxylysine galactosyltransferase 2; Flags:
           Precursor
 gi|12620188|gb|AAG60609.1|AF288389_1 C1orf17 [Homo sapiens]
 gi|23273043|gb|AAH35672.1| Glycosyltransferase 25 domain containing 2 [Homo sapiens]
 gi|119611578|gb|EAW91172.1| glycosyltransferase 25 domain containing 2, isoform CRA_c [Homo
           sapiens]
 gi|168278659|dbj|BAG11209.1| glycosyltransferase 25 domain-containing protein 2 [synthetic
           construct]
 gi|325463379|gb|ADZ15460.1| glycosyltransferase 25 domain containing 2 [synthetic construct]
          Length = 626

 Score = 80.1 bits (196), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 60/249 (24%), Positives = 114/249 (45%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL++V        L  FL  +  L+YP  +++++    +N +    +F +++ N
Sbjct: 49  PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEIFREWLKN 108

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I       S+ A     R  A+  +  K  D+  ++D 
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  L+  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKRT 227

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 228 --GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 284 CNREHYGYL 292


>gi|332811368|ref|XP_524994.3| PREDICTED: procollagen galactosyltransferase 2 [Pan troglodytes]
 gi|410298208|gb|JAA27704.1| glycosyltransferase 25 domain containing 2 [Pan troglodytes]
          Length = 626

 Score = 80.1 bits (196), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 60/249 (24%), Positives = 114/249 (45%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL++V        L  FL  +  L+YP  +++++    +N +    +F +++ N
Sbjct: 49  PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEIFREWLKN 108

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I       S+ A     R  A+  +  K  D+  ++D 
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  L+  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKRT 227

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 228 --GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 284 CNREHYGYL 292


>gi|301611908|ref|XP_002935453.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase
           1-B-like [Xenopus (Silurana) tropicalis]
          Length = 610

 Score = 79.7 bits (195), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 54/238 (22%), Positives = 118/238 (49%), Gaps = 17/238 (7%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P + P+VLI++        L E L  +  L+YP ++IS++V   +N +    +  +++ N
Sbjct: 38  PLRRPTVLIALLARNSEGSLPEVLGALERLHYPKERISLWVATDHNIDNTTQMLREWLIN 97

Query: 345 FKTMFKNV--------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDV 396
            +  + +V        +Y  +++  +       ++ ++     D+ F++D+D+ L NP+ 
Sbjct: 98  VQNQYHHVEWRPQEHPRYWGYSACFSLSSIHPDSLTSAREMWADYIFFLDADNLLTNPET 157

Query: 397 LKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPY 456
           L  L+  N++++AP++     A+SNFW  +   G+Y R+  YM I   ++  KG + VP 
Sbjct: 158 LNRLIAENKTIVAPMM-ESRAAYSNFWCGMTTQGYYRRTPAYMPIRRRER--KGCFPVPM 214

Query: 457 ITNCYL--MKTSVIKATNIKTIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQEYGHL 510
           + + +L  ++    +  +    +   +  YD  + F  + R   + + I + + YG+L
Sbjct: 215 VHSTFLIDLRKEASQQLDFYPPHADYTWAYDDIIVFAFSCRQADVQMFICNKEIYGYL 272


>gi|157073889|ref|NP_001096660.1| procollagen galactosyltransferase 1-A precursor [Xenopus laevis]
 gi|160385807|sp|A0JPH3.1|G251A_XENLA RecName: Full=Procollagen galactosyltransferase 1-A; AltName:
           Full=Glycosyltransferase 25 family member 1-A; AltName:
           Full=Hydroxylysine galactosyltransferase 1-A; Flags:
           Precursor
 gi|117558235|gb|AAI27423.1| Glt25d1b protein [Xenopus laevis]
          Length = 611

 Score = 79.7 bits (195), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 56/243 (23%), Positives = 117/243 (48%), Gaps = 26/243 (10%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P+VLI++        L E L  +  L+YP ++IS++V   +N +    +  +++ N +  
Sbjct: 41  PTVLIALLARNSEGSLPEVLGALDTLHYPKERISLWVATDHNLDNTTEILREWLINVQNQ 100

Query: 349 FKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHL 391
           + +V                 K+ +H+      + R  A+ ++     D+ F++D+D+ L
Sbjct: 101 YHHVEWRPQEHPRWFKDEEGPKHWSHSRYEYIMKLRQAALTSAREMWADYIFFLDADNLL 160

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            NP+ L  L+  N++++AP+L     A+SNFW  +   G+Y R+  YM I   ++  +G 
Sbjct: 161 TNPETLNLLIAENKTVVAPML-DSRAAYSNFWCGMTTQGYYRRTPAYMPIRRRER--RGC 217

Query: 452 WNVPYITNCYL--MKTSVIKATNIKTIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQEY 507
           + VP + + +L  ++    +  N    +   +  +D  + F  + R   + + + + + Y
Sbjct: 218 FPVPMVHSTFLIDLRKEASQQLNFYPPHADYTWAFDDIIVFAFSCRQADVQMFLCNKEIY 277

Query: 508 GHL 510
           GHL
Sbjct: 278 GHL 280


>gi|194376002|dbj|BAG57345.1| unnamed protein product [Homo sapiens]
          Length = 554

 Score = 79.3 bits (194), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 59/254 (23%), Positives = 109/254 (42%), Gaps = 40/254 (15%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFK 346
           P Q P+VL++V        L  FL  +  L+YP  +++++   +        D+    F+
Sbjct: 49  PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHN-----VDNTTEIFR 103

Query: 347 TMFKNVKYIAH------------------------NSTVNSKEARNLAVENSLHKGVDFY 382
              KNV+ + H                        +   +  + R  A+  +  K  D+ 
Sbjct: 104 ERLKNVQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYI 163

Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
            ++D D+ L NP  L  L+  N++++AP+L      +SNFW  +   GFY R+ DY+ I 
Sbjct: 164 LFIDVDNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQI- 221

Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKG 496
             +    G + VP + + +L+   + K  + K  +     DY       + F  + R  G
Sbjct: 222 -REWKRTGCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAG 278

Query: 497 IHLKIDSTQEYGHL 510
           I + + + + YG+L
Sbjct: 279 IQMYLCNREHYGYL 292


>gi|390477037|ref|XP_003735231.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase 2
           [Callithrix jacchus]
          Length = 831

 Score = 79.3 bits (194), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 59/249 (23%), Positives = 114/249 (45%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL++V        L  FL  +  L+YP  +++++    +N +    +  +++ N
Sbjct: 254 PLQSPTVLVAVLARNAAHSLPHFLGCLERLDYPKSRMAVWAATDHNVDNTTEILREWLKN 313

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I      +S+ A     R  A+  +  K  D+  ++D 
Sbjct: 314 VQRLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 373

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  L+  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 374 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKR- 431

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 432 -TGCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 488

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 489 CNREHYGYL 497


>gi|395825231|ref|XP_003785842.1| PREDICTED: procollagen galactosyltransferase 2 [Otolemur garnettii]
          Length = 668

 Score = 79.0 bits (193), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 58/249 (23%), Positives = 115/249 (46%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL+ V        L  FL  +  L+YP  +++++    +N +    +F +++ N
Sbjct: 91  PLQRPTVLVVVLARNAAHALPPFLGCLERLDYPKSRMAIWAATDHNVDNTTEIFREWLKN 150

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I      +S+ A     R  A+  +  +  D+  ++D 
Sbjct: 151 VQKLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTARERWSDYILFIDV 210

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  ++  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 211 DNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYIQIREWKR- 268

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K ++     DY       + F  + R  GI + +
Sbjct: 269 -LGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQMHL 325

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 326 CNREHYGYL 334


>gi|47220022|emb|CAG12170.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 635

 Score = 79.0 bits (193), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 50/192 (26%), Positives = 95/192 (49%), Gaps = 22/192 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P +L+++        L  FL  I  LNYP ++++++V   +NQ+  A +  D++   +  
Sbjct: 40  PRILLALVCRNSEHSLPYFLGTIERLNYPKERMALWVATDHNQDNTAVILRDWLVKMQDF 99

Query: 349 FKNVKYIAHNSTVNSKEA-----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
           + NV++         ++                  R +A+E++     D++   D D+ L
Sbjct: 100 YHNVEWRPKEKPTRYEDEAGPKDWTDPRYEHVMKLRQVALESAREMWADYFMLADCDNLL 159

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            NPDVL  L+  N+++I+P+L     A+SNFW  +++ G+Y R+  Y+ I    Q  KG 
Sbjct: 160 TNPDVLWMLMKENKTIISPML-ESRGAYSNFWCGMSSQGYYKRTPAYIPI--RKQVRKGC 216

Query: 452 WNVPYITNCYLM 463
           + VP + +  L+
Sbjct: 217 FAVPMVHSTLLI 228


>gi|326924714|ref|XP_003208570.1| PREDICTED: procollagen galactosyltransferase 2-like [Meleagris
           gallopavo]
          Length = 552

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 57/230 (24%), Positives = 105/230 (45%), Gaps = 30/230 (13%)

Query: 306 LEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNS 363
           L  FL  +  L YP  +I+++V   +N +    +  +++ N + ++ +V++       + 
Sbjct: 4   LPHFLGCVERLRYPKSRIALWVATDHNVDNTTAILREWLKNVQNLYHDVEWRPMEDPQSY 63

Query: 364 KEA-----------------RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNES 406
            E                  R  A+  +  K  D+  ++D+D+ L NP+ L  L+  N++
Sbjct: 64  PEEMGPKHWPSSRFTHVMKLRQAALRAAREKWSDYVLFLDTDNLLTNPETLNLLIAENKT 123

Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS 466
           L+AP+L   F  +SNFW  +   G+Y R+ DY  I    +   G + VP I + +L+   
Sbjct: 124 LVAPMLESRF-LYSNFWCGITPQGYYKRTLDYPLIREWKR--TGCFAVPMIHSTFLI--D 178

Query: 467 VIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDSTQEYGHL 510
           + K  + K ++     DY       M F  + R  GI + I + + YG L
Sbjct: 179 LRKEASTKLMFYPPHQDYTWSFDDIMVFAFSSRQAGIQMFICNREHYGFL 228


>gi|297281262|ref|XP_002802062.1| PREDICTED: procollagen galactosyltransferase 2-like [Macaca
           mulatta]
          Length = 626

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 59/249 (23%), Positives = 114/249 (45%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL++V        L  FL  +  L+YP  +++++    +N +    +  +++ N
Sbjct: 49  PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 108

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I      +S+ A     R  A+  +  K  D+  ++D 
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  L+  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKRS 227

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 228 --GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 284 CNREHYGYL 292


>gi|402857849|ref|XP_003893450.1| PREDICTED: procollagen galactosyltransferase 2 [Papio anubis]
          Length = 626

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 59/249 (23%), Positives = 114/249 (45%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL++V        L  FL  +  L+YP  +++++    +N +    +  +++ N
Sbjct: 49  PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 108

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I      +S+ A     R  A+  +  K  D+  ++D 
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  L+  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKRS 227

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 228 --GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 284 CNREHYGYL 292


>gi|426333026|ref|XP_004028088.1| PREDICTED: procollagen galactosyltransferase 2 [Gorilla gorilla
           gorilla]
          Length = 626

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 60/249 (24%), Positives = 113/249 (45%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL++V        L  FL  +  L+YP  +++++    +N +    +F +++ N
Sbjct: 49  PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEIFREWLKN 108

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I       S+ A     R  A+  +  K  D+  ++D 
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  L+  N+++ AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 169 DNFLTNPQTLNLLIAENKTIAAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKRT 227

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 228 --GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 284 CNREHYGYL 292


>gi|160395584|sp|Q7Q021.4|GLT25_ANOGA RecName: Full=Glycosyltransferase 25 family member; Flags:
           Precursor
          Length = 592

 Score = 77.8 bits (190), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/205 (22%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 288 DQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNF 345
           +Q P+V+++V +      L  F + + +L+YP  ++S+++ +  N++    +   ++   
Sbjct: 21  EQLPTVMVAVLVRNKAHTLPYFFSYLEDLDYPKDRMSLWIRSDHNEDRSIEITKAWLKRT 80

Query: 346 KTMFKNV--KYIAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDSD 388
            +++ +V  KY +      S++                +  A++ +     D+ F++D+D
Sbjct: 81  SSLYHSVDFKYRSERGKRESEKTSTHWNEERFSDVIRLKQDALQAARMMWADYIFFIDAD 140

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L N + L  L+ R   ++AP+LV     +SNFW  + +D +Y R+ DY  I+N DQ G
Sbjct: 141 VFLTNSNTLGKLIERKLPIVAPMLVSD-GLYSNFWCGMTSDYYYQRTDDYKKILNYDQIG 199

Query: 449 KGIWNVPYITNCYLMKTSVIKATNI 473
           +  W VP +    L+  ++ +   +
Sbjct: 200 Q--WPVPMVHTAVLVSLNIAQTRQL 222


>gi|350589106|ref|XP_003482786.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase
           2-like [Sus scrofa]
          Length = 626

 Score = 77.8 bits (190), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/247 (23%), Positives = 113/247 (45%), Gaps = 30/247 (12%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
           Q P+VL+++        L  FL  +  L+YP  +++++    +N +    +  +++ N +
Sbjct: 51  QRPTVLVAILARNAAHSLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKNVQ 110

Query: 347 TMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDSDS 389
             +  V++            I       S+ A     R  A+  +  K  D+  ++D D+
Sbjct: 111 RAYHYVEWRPMDEPESYPDEIGPKHWPGSRFAHVMKLRQAALRTAREKWSDYILFIDVDN 170

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NP  L  L+  N++++AP+L      +SNFW  +   GFY R+ DY+ I    +   
Sbjct: 171 FLTNPQTLSLLMAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR--L 227

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDS 503
           G + VP + + +L+   + K  + K ++     DY       + F  + R  GI + + +
Sbjct: 228 GCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYLCN 285

Query: 504 TQEYGHL 510
           T+ YG+L
Sbjct: 286 TEHYGYL 292


>gi|158300399|ref|XP_320324.3| AGAP012208-PA [Anopheles gambiae str. PEST]
 gi|157013141|gb|EAA00118.3| AGAP012208-PA [Anopheles gambiae str. PEST]
          Length = 554

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/205 (22%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 288 DQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNF 345
           +Q P+V+++V +      L  F + + +L+YP  ++S+++ +  N++    +   ++   
Sbjct: 21  EQLPTVMVAVLVRNKAHTLPYFFSYLEDLDYPKDRMSLWIRSDHNEDRSIEITKAWLKRT 80

Query: 346 KTMFKNV--KYIAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDSD 388
            +++ +V  KY +      S++                +  A++ +     D+ F++D+D
Sbjct: 81  SSLYHSVDFKYRSERGKRESEKTSTHWNEERFSDVIRLKQDALQAARMMWADYIFFIDAD 140

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L N + L  L+ R   ++AP+LV     +SNFW  + +D +Y R+ DY  I+N DQ G
Sbjct: 141 VFLTNSNTLGKLIERKLPIVAPMLVSD-GLYSNFWCGMTSDYYYQRTDDYKKILNYDQIG 199

Query: 449 KGIWNVPYITNCYLMKTSVIKATNI 473
           +  W VP +    L+  ++ +   +
Sbjct: 200 Q--WPVPMVHTAVLVSLNIAQTRQL 222


>gi|426240022|ref|XP_004013914.1| PREDICTED: procollagen galactosyltransferase 2 [Ovis aries]
          Length = 626

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 57/249 (22%), Positives = 111/249 (44%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL+ V        L  FL  +  L+YP  +++++    +N +    +  +++ N
Sbjct: 49  PLQRPTVLVVVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 108

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            +  +  V                 K+   +   +  + R  A+  +  K  D+  ++D 
Sbjct: 109 VQQSYHYVEWRPMDEPESYPDEIGPKHWPASRFAHVMKLRQAALRTAREKWSDYILFIDV 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  L+  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 226

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K ++     DY       + F  + R  GI + +
Sbjct: 227 -LGCFPVPMVHSTFLI--DLRKEASAKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 284 CNREHYGYL 292


>gi|147899177|ref|NP_001088623.1| procollagen galactosyltransferase 1-B precursor [Xenopus laevis]
 gi|82179978|sp|Q5U483.1|G251B_XENLA RecName: Full=Procollagen galactosyltransferase 1-B; AltName:
           Full=Glycosyltransferase 25 family member 1-B; AltName:
           Full=Hydroxylysine galactosyltransferase 1-B; Flags:
           Precursor
 gi|55153756|gb|AAH85226.1| Glt25d1a protein [Xenopus laevis]
          Length = 611

 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 57/249 (22%), Positives = 118/249 (47%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYH--APLFDDYIHN 344
           P + P+VLI+V        L E L  +  L+YP ++IS++V  +  +   + +  +++ N
Sbjct: 37  PFRSPTVLIAVLARNSEGSLPEVLGALDRLHYPKERISLWVATDHNFDNTSQILREWLIN 96

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            +  + +V                 K+ +H+      + R  A+ ++     D+ F++D+
Sbjct: 97  VQNQYHHVEWRPQEHPRWFRDEESPKHWSHSRYEYVMKLRQAALTSAREMWADYIFFLDA 156

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L N + L  L+  N++++AP+L     A+SNFW  +   G+Y R+  YM I   ++ 
Sbjct: 157 DNLLTNSETLNLLIAENKTVVAPML-ESRAAYSNFWCGMTTQGYYRRTPAYMPIRRRER- 214

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA------FCTNLRNKGIHLKI 501
            +G + VP + + +L+   + K  + +  +     DY  A      F  + R   + + +
Sbjct: 215 -QGCFPVPMVHSTFLI--DLRKEASQQLDFYPPHADYTWAFDDIIVFAFSCRQAEVQMFL 271

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 272 CNKEIYGYL 280


>gi|441624487|ref|XP_004088995.1| PREDICTED: procollagen galactosyltransferase 2 isoform 2 [Nomascus
           leucogenys]
          Length = 554

 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 59/249 (23%), Positives = 112/249 (44%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL++V        L  FL  +  L+YP   ++++    +N +    +  +++ N
Sbjct: 49  PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKTTMAIWAATDHNVDNTTEILREWLKN 108

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I       S+ A     R  A+  +  K  D+  ++D 
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  L+  N++++AP+L      +SNFW  +   GFY R+ DY+ I   +  
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQI--REWK 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 226 RTGCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 284 CNREHYGYL 292


>gi|195998972|ref|XP_002109354.1| hypothetical protein TRIADDRAFT_21834 [Trichoplax adhaerens]
 gi|190587478|gb|EDV27520.1| hypothetical protein TRIADDRAFT_21834 [Trichoplax adhaerens]
          Length = 546

 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 72/270 (26%), Positives = 132/270 (48%), Gaps = 37/270 (13%)

Query: 284 SLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQ--------EY 333
           +L+ ++ P+V+I+V     + FL   L  +ANLNY  K+I+ ++   NNQ        E+
Sbjct: 8   ALEYNRLPAVVIAVLARDASDFLPTSLACLANLNYDKKRIAFWIATDNNQDQTEEMLVEW 67

Query: 334 HAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEAR--------NLAVENSLHKGVDFYFYV 385
            + +  DY H  + M  N   +  + ++    +R         LA+  +L    D+  +V
Sbjct: 68  KSQVESDY-HRVEIMTSNNYSLQTDLSLQWTPSRYRHLLQLRQLALAAALKYWADYVLFV 126

Query: 386 DSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKA-WSNFWGALNADGFYARSFDYMNIING 444
           D+D+ L  PD L  L+  N +++APLL+    + +SNFW  ++  G+Y R+ DY+  +  
Sbjct: 127 DADNFLTEPDTLIELIKSNRTMVAPLLIESRHSYYSNFWCGVDEQGYYRRTEDYLPTLKR 186

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS-----MDYDMAFCTNLRNKGIHL 499
           ++  KG+  V  I + +L+  +  K+    + Y  +S     +D  + F  + +  GI  
Sbjct: 187 ER--KGVLQVAMIHSTFLIDLNR-KSVEKFSFYPPHSSYQGHIDDLLIFSYSAKMAGIPF 243

Query: 500 KIDSTQEYGHLVDSENFDPQKTNPEVYELI 529
            + + + YG+L  S         P+V ELI
Sbjct: 244 HLLNNKIYGYLFSS---------PQVQELI 264


>gi|68357136|ref|XP_694217.1| PREDICTED: procollagen galactosyltransferase 1 [Danio rerio]
          Length = 609

 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 63/245 (25%), Positives = 113/245 (46%), Gaps = 30/245 (12%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P V+I++        L  FL  I  LNYP  +I+++V   +N +    L  D++ N + +
Sbjct: 39  PRVMIALICRNNQHSLPHFLGTIERLNYPKDRIALWVATDHNVDNTTYLLRDWLINVQKL 98

Query: 349 FKNVKYIAHN--STVNSKEA---------------RNLAVENSLHKGVDFYFYVDSDSHL 391
           +  V++      S  N +E                R  A+E++     D+   +D D+ L
Sbjct: 99  YHYVEWRPKEQPSQYNDEEGPKDWTNERYAYVMKLRQAALESAREMWADYLMMIDCDNLL 158

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            N DVL  L+  N++++AP++     A+SNFW  + + G+Y R+  Y+ I    Q  KG 
Sbjct: 159 INQDVLWKLIKENKTIVAPMM-ESRAAYSNFWCGMTSQGYYKRTPAYIPI--RKQVRKGC 215

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA------FCTNLRNKGIHLKIDSTQ 505
           + VP + + +L+   + K  + +  +     DY  A      F  + R   + + I + +
Sbjct: 216 FAVPMVHSTFLV--DLRKEASRQLAFHPPHPDYTWAFDDIIVFAFSARIAEVQMFICNRE 273

Query: 506 EYGHL 510
            YGHL
Sbjct: 274 IYGHL 278


>gi|348577951|ref|XP_003474747.1| PREDICTED: procollagen galactosyltransferase 2-like [Cavia
           porcellus]
          Length = 623

 Score = 76.6 bits (187), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 59/249 (23%), Positives = 113/249 (45%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL++V        L  FL  +  L+YP  +++++    +N +    +  +++ N
Sbjct: 46  PLQKPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 105

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I      +S+ A     R  A+  +  K  D+  ++D 
Sbjct: 106 VQRLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 165

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L N   L  L+  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 166 DNFLTNTQTLSLLIAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 223

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 224 -MGCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 280

Query: 502 DSTQEYGHL 510
            + Q YG+L
Sbjct: 281 CNRQHYGYL 289


>gi|170741659|ref|YP_001770314.1| glycosyl transferase family protein [Methylobacterium sp. 4-46]
 gi|168195933|gb|ACA17880.1| glycosyl transferase family 2 [Methylobacterium sp. 4-46]
          Length = 661

 Score = 76.6 bits (187), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 66/264 (25%), Positives = 124/264 (46%), Gaps = 33/264 (12%)

Query: 279 IKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAP 336
           ++ L S  P+  P VL++V   +    L+ +L+ I  L+YP   I + V   NN +    
Sbjct: 381 LRPLRSRLPEPAPRVLLAVLAKQKEPVLDLYLDCIEALDYPKSSIVLCVRTNNNTDRTGG 440

Query: 337 LFDDYIHNFKTMFKNVKY------------IAHN------STVNSKEARNLAVENSLHKG 378
           +   ++     ++  + +              H       + + +   R+LA+  +L + 
Sbjct: 441 MLRAWLDRVGGLYAGIVFDDADVPEPVQDLAVHEWTPQRFAVLGAIRQRSLAL--TLARD 498

Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYARSF 436
             FYF  D+D+ L  P  L+ LV+ N  ++AP+L  V+P   ++NF  A++A G++A S 
Sbjct: 499 CAFYFVADADNFL-IPSTLRDLVSLNLPIVAPMLREVKPGSRYANFHAAVDAQGYFAESR 557

Query: 437 DYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD-MAFCTNLRNK 495
           DY  ++  ++   G+  VP +   YL++   I        Y   S  ++ + F  + R +
Sbjct: 558 DYDALL--ERRILGVVEVPVVHCTYLVRADAIPLLR----YEDGSGRHEYVVFSDHARRR 611

Query: 496 GIHLKIDSTQEYGHLVDSENFDPQ 519
           GI   +D+ + YG L   E+ DP+
Sbjct: 612 GIPQYLDNRRCYGCLT-LEDDDPE 634


>gi|332230643|ref|XP_003264502.1| PREDICTED: procollagen galactosyltransferase 2 isoform 1 [Nomascus
           leucogenys]
          Length = 626

 Score = 76.3 bits (186), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 59/249 (23%), Positives = 112/249 (44%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL++V        L  FL  +  L+YP   ++++    +N +    +  +++ N
Sbjct: 49  PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKTTMAIWAATDHNVDNTTEILREWLKN 108

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I       S+ A     R  A+  +  K  D+  ++D 
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  L+  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKGFYKRTPDYVQIREWKRT 227

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 228 --GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 284 CNREHYGYL 292


>gi|359319948|ref|XP_849763.2| PREDICTED: procollagen galactosyltransferase 2 isoform 1 [Canis
           lupus familiaris]
          Length = 564

 Score = 76.3 bits (186), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 58/249 (23%), Positives = 114/249 (45%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL++V        L  FL  +  L+YP  +++++    +N +    +  +++ N
Sbjct: 49  PVQRPTVLVAVLARNAAHALPPFLGCLERLDYPKGRMAIWAATDHNVDNTTEILREWLKN 108

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            +  +  V++            I      +S+ A     R  A+  +  K  D+  ++D 
Sbjct: 109 VQRFYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  ++  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 169 DNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYVQIREWKR- 226

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K ++     DY       + F  + R  GI + +
Sbjct: 227 -LGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 284 CNREHYGYL 292


>gi|344278455|ref|XP_003411009.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase
           2-like [Loxodonta africana]
          Length = 763

 Score = 75.9 bits (185), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 58/252 (23%), Positives = 111/252 (44%), Gaps = 30/252 (11%)

Query: 284 SLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDY 341
           S  P Q P    ++        L  FL  +  L+YP  +++++V   +N +    +  ++
Sbjct: 183 SESPMQNPRCSWAILARNAAHTLSHFLGCLERLDYPKSRMAIWVATDHNVDNTTEILREW 242

Query: 342 IHNFKTMFKNVKY------------IAHNSTVNSK-----EARNLAVENSLHKGVDFYFY 384
           + N + ++  V++            I      NS+       R  A+  +  K  D+  +
Sbjct: 243 LKNIQRLYHYVEWRPMDEPQSYPDEIGPKHWPNSRFTHVMRLRQAALRTAREKWSDYILF 302

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D D+ L NP  L  ++  N++++AP+L      +SNFW  +   GFY R+ DY+ I   
Sbjct: 303 IDVDNFLTNPKTLDLMIAENKTIVAPML-ESRSLYSNFWCGITPQGFYKRTPDYLQIREW 361

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIH 498
            +   G + VP + +  L+   + K  + K ++     DY       M F  + R  GI 
Sbjct: 362 KR--TGCFPVPMVHSTLLI--DLRKEASDKLMFYPPHQDYTWTFDDIMVFAFSSRQAGIQ 417

Query: 499 LKIDSTQEYGHL 510
           + + + + YG+L
Sbjct: 418 MYLCNREHYGYL 429


>gi|300796728|ref|NP_001178231.1| procollagen galactosyltransferase 2 precursor [Bos taurus]
 gi|296478943|tpg|DAA21058.1| TPA: glycosyltransferase 25 domain containing 2 [Bos taurus]
          Length = 626

 Score = 75.9 bits (185), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 57/249 (22%), Positives = 111/249 (44%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL+ V        L  FL  +  L+YP  +++++    +N +    +  +++ N
Sbjct: 49  PLQRPTVLVVVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 108

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            +  +  V                 K+   +   +  + R  A+  +  K  D+  ++D 
Sbjct: 109 VQKAYHYVEWRPMDEPESYPDEIGPKHWPASRFAHVMKLRQAALRTAREKWSDYILFIDV 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  L+  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 169 DNFLTNPQTLNLLMAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 226

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K ++     DY       + F  + R  GI + +
Sbjct: 227 -LGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 283

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 284 CNREHYGYL 292


>gi|327277409|ref|XP_003223457.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase
           2-like [Anolis carolinensis]
          Length = 631

 Score = 75.5 bits (184), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 60/252 (23%), Positives = 110/252 (43%), Gaps = 40/252 (15%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
           Q P+V +++        L  FL  +  L YP  +++++V   +       D+  +  K  
Sbjct: 56  QKPTVFLAILARNAAGSLPHFLGCLERLRYPKPRMAVWVAKERN-----VDNTTNILKEW 110

Query: 349 FKNVKYIAH-------------------NSTVNSKEA-----RNLAVENSLHKGVDFYFY 384
            KNV+ + H                       NS+ A     R  A+  +  K  D+  +
Sbjct: 111 LKNVQKLYHYLXWRPMEEPHSYPEEIGPKHWPNSRFAHVMKLRQAALRTAREKWSDYIMF 170

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D+ L NPDVL  ++  N++++AP+L      +SNFW  +   G+Y R+ DY  I   
Sbjct: 171 IDADNFLTNPDVLNLMIAENKTIVAPML-ESRNLYSNFWCGMTPQGYYKRTPDYSLIREW 229

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIH 498
            +   G + VP + + +L+   + K  + K ++     DY       + F  + R   I 
Sbjct: 230 KR--TGCFAVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQADIQ 285

Query: 499 LKIDSTQEYGHL 510
           + I + + YG+L
Sbjct: 286 MYICNREHYGYL 297


>gi|46447512|ref|YP_008877.1| procollagen-lysine 5-dioxygenase [Candidatus Protochlamydia
           amoebophila UWE25]
 gi|46401153|emb|CAF24602.1| putative procollagen-lysine 5-dioxygenase [Candidatus
           Protochlamydia amoebophila UWE25]
          Length = 295

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 66/267 (24%), Positives = 133/267 (49%), Gaps = 32/267 (11%)

Query: 292 SVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTMF 349
           +VL+++        L  FLN I +L+Y  K IS++++  NN +    + + ++     ++
Sbjct: 34  TVLLALLARNKEHTLPAFLNCIEHLDYDKKCISIYIHTNNNIDKTQEILEAWVKEKGNLY 93

Query: 350 KNVKYIAH--NSTVNSK-------------EARNLAVENSLHKGVDFYFYVDSDSHLDNP 394
           K+V ++    N+ + ++             + RN ++E +     D+YF VD D+ +   
Sbjct: 94  KDVIFVKQDLNTVLTNRPHEWTPERFKILAKIRNDSLEYAKLLKSDYYFVVDCDNFI-TA 152

Query: 395 DVLKYLVNRNESLIAPLLVRPFKA---WSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
           D LK L+ +++ +IAPLL R  +    +SNF+ A++  G+Y    DY+ I++ ++   G+
Sbjct: 153 DTLKDLIKQDKPIIAPLL-RSLETNNYYSNFFCAIDETGYYGYHLDYLKIVSYEKI--GV 209

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA-FCTNLRNKGIHLKIDSTQEYGHL 510
           + VP +   YL+++  +   +    Y   S DY+   F    R K +   I + ++YG+L
Sbjct: 210 FKVPVVHCTYLIQSKYLDQLS----YIDGSEDYEFVIFSRKAREKNVDQYISNEKKYGYL 265

Query: 511 V---DSENFDPQKTNPEVYELIRNPLD 534
           V   D+ + + +K       ++R   D
Sbjct: 266 VHFFDNLSLEEEKERMASINILRRIAD 292


>gi|410053448|ref|XP_512497.4| PREDICTED: procollagen galactosyltransferase 1, partial [Pan
           troglodytes]
          Length = 484

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/248 (22%), Positives = 118/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 48  PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K+++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283

Query: 503 STQEYGHL 510
           + +EYG L
Sbjct: 284 NKEEYGFL 291


>gi|417403445|gb|JAA48526.1| Putative procollagen galactosyltransferase 2 [Desmodus rotundus]
          Length = 626

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL+++        L  FL  +  L+YP  +++++    +N +    +  +++ N
Sbjct: 49  PLQRPTVLVALLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 108

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            +  +  V                 K+   +   +  + R  A+  +  K  D+  ++D 
Sbjct: 109 VQRAYHYVEWRPMEEPESYPDEIGPKHWPASRFAHVMKLRQAALRTARDKWSDYILFIDV 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  ++  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 169 DNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYIQIREWKR- 226

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K ++     DY       + F  + R  GI + +
Sbjct: 227 -TGCFPVPMVHSTFLI--DLRKEASDKLMFHPPHQDYAWTFDDIIVFAFSSRQAGIQMYL 283

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 284 CNREHYGYL 292


>gi|426387749|ref|XP_004060325.1| PREDICTED: procollagen galactosyltransferase 1 [Gorilla gorilla
           gorilla]
          Length = 585

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 53/229 (23%), Positives = 107/229 (46%), Gaps = 27/229 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFK 346
           P Q P VLI++        L   L  +  L +P ++ +++ Y ++E              
Sbjct: 48  PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWSYPDEE-------------- 93

Query: 347 TMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNES 406
                 K+ + +   +  + R  A++++     D+  +VD+D+ + NPD L  L+  N++
Sbjct: 94  ----GPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKT 149

Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS 466
           ++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+  +G + VP + + +L+   
Sbjct: 150 VVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR--RGCFAVPMVHSTFLIDLR 206

Query: 467 VIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
              + N+        YT  S D  + F  + +   + + + + +EYG L
Sbjct: 207 KAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVCNKEEYGFL 254


>gi|334321909|ref|XP_001375578.2| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase
           2-like [Monodelphis domestica]
          Length = 631

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/247 (21%), Positives = 113/247 (45%), Gaps = 30/247 (12%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
           Q P+V +++        L  FL  +  L+YP  +++++    +N +    +  +++ N +
Sbjct: 56  QRPTVFVTILARIAAHTLPHFLGCLERLDYPKDRMAIWAATDHNIDNTTEILREWLKNVQ 115

Query: 347 TMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
            ++  V                 K+   +   +  + R  A+  +  K  D+  ++D D+
Sbjct: 116 KLYHYVEWRPMDDPQSYPDEIGPKHWPGSRFTHVMKLRQAALRTAREKWSDYILFIDVDN 175

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NP  L  +++ N++++AP+L      +SNFW  +   G+Y R+ DY+ I    +  +
Sbjct: 176 FLTNPQTLNLMISENKTIVAPML-ESRSLYSNFWCGITPQGYYKRTPDYIQIREWKR--R 232

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDS 503
           G + VP + + +L+   + K  + K ++     DY       + F  + R  GI + + +
Sbjct: 233 GCFPVPMVHSTFLI--DLRKEASQKLMFFPPHQDYSWTFDDIIVFAFSSRQAGIQMYLCN 290

Query: 504 TQEYGHL 510
            + YG+L
Sbjct: 291 REHYGYL 297


>gi|326431358|gb|EGD76928.1| hypothetical protein PTSG_07269 [Salpingoeca sp. ATCC 50818]
          Length = 858

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 68/240 (28%), Positives = 117/240 (48%), Gaps = 30/240 (12%)

Query: 28  NIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLL 87
            I + + LV+TVA++     + FI   E+ +  V  +G      G      G G+K+  +
Sbjct: 161 GIVKPRLLVMTVATHR----EPFI---ELTEQSVGNIGKKLLVAGEGEFFKGYGWKLKKV 213

Query: 88  KNELDEMDITDDM-IILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYD 146
           +  L  +   DD  ++L TDS+D  +    +++++ F + +A +V  AE  CWP+  L  
Sbjct: 214 RETL--LKYKDDYDMVLFTDSFDSFVFAEEDELIDTFRSMNAPMVVSAEVNCWPNPELAT 271

Query: 147 KYPAVGS--GYRYLNSGGFIGYAKDIKEL------ISNRSIKNEEDDQLYYALLFLDETL 198
           + P   S   Y Y NSGG++GY   I  L      I ++S   ++  +L  A++  ++  
Sbjct: 272 EMPPSSSVGHYPYPNSGGYMGYLGYILHLYNDVIAIHHKSDCCDDQGELIKAVVLDNKAF 331

Query: 199 RTKHKIVLDTLANLFQNLYGSLE-DIKLNFDLDEFVHLTNTKYNTNPVIIHGNGKSKIEL 257
           R  H+ V      LFQ L+GS + D+ +    D  +H  N   +T+P ++H NG  K  L
Sbjct: 332 RIDHQAV------LFQTLFGSAKRDVVVR---DGRIH--NQATHTSPAVVHANGWDKGPL 380


>gi|449507875|ref|XP_004176247.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase 2
           [Taeniopygia guttata]
          Length = 621

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/245 (22%), Positives = 108/245 (44%), Gaps = 30/245 (12%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P+VL+++        L   L  I  L+YP  +I+++    +N +    +  +++ N + +
Sbjct: 44  PTVLLAIIARNAAHTLPHVLGCIERLSYPKSRIALWAATDHNIDNTTAILREWLKNVQHL 103

Query: 349 FKNVKYIAHNSTVNSKEA-----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
           + +V++       +  E                  R  A+  +  K  D+  + D+D+ L
Sbjct: 104 YHDVEWRPMEEPPSYPEEIGPKHWPSSRFTHVMKLRQAALRTAREKWSDYILFTDADNLL 163

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            NP+ L  L+  N++L+AP+L      +SNFW  +   G+Y R+ +Y  I   +    G 
Sbjct: 164 TNPETLNLLIAENKTLVAPML-ESRSLYSNFWCGITPQGYYKRTLEYPLI--REWKRMGC 220

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDSTQ 505
           + VP I + +L+   + K  + K  +     DY       M F  + R  G+ + + + +
Sbjct: 221 FAVPMIHSTFLI--DLRKEASAKLAFYPPHQDYTWSFDDIMVFAFSSRQAGVQMFVCNRE 278

Query: 506 EYGHL 510
            YG L
Sbjct: 279 HYGFL 283


>gi|194272156|ref|NP_001123548.1| procollagen galactosyltransferase 2 precursor [Danio rerio]
 gi|159570814|emb|CAP19485.1| novel protein similar to vertebrate glycosyltransferase 25 domain
           containing 1 (GLT25D1) [Danio rerio]
          Length = 613

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 56/245 (22%), Positives = 112/245 (45%), Gaps = 30/245 (12%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P V+I++        L  +L+ I  L+YP  +I+++    +N +    +  +++ N ++ 
Sbjct: 42  PKVMIAILARNSAHSLPYYLDCIDRLDYPKDRIAIWAATDHNVDNSTAMLREWLKNRQSR 101

Query: 349 FKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHL 391
           +  V                 K+ + +   +  + R  A++ +  +  D+  YVDSD+ L
Sbjct: 102 YHYVEWRPMEEPRSYTDEWGPKHWSSSRVSHVMKLRQAALKAARARWADYILYVDSDNLL 161

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            NP VL  L+  N +L+AP+L      +SNFW  +   G+Y R+ DY  I    +   G 
Sbjct: 162 TNPRVLNLLMAENLTLVAPML-DSRSLYSNFWCGITPQGYYKRTPDYQPIREWKR--LGC 218

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDSTQ 505
           ++VP + + +L+   + ++  +   +     DY       M F  + R  G+ + + + +
Sbjct: 219 FSVPMVHSTFLL--DLRRSATLDMAFYPPHPDYSWAFDDIMVFAFSAREAGVQMYVCNRE 276

Query: 506 EYGHL 510
            YG L
Sbjct: 277 HYGFL 281


>gi|395530942|ref|XP_003767545.1| PREDICTED: procollagen galactosyltransferase 2 [Sarcophilus
           harrisii]
          Length = 630

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/247 (21%), Positives = 113/247 (45%), Gaps = 30/247 (12%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
           Q P+V +++        L  FL  +  L+YP  +++++    +N +    +  +++ N +
Sbjct: 55  QKPTVFVAILARNAAHTLPHFLGCLERLDYPKDRMAIWAATDHNVDNTTEILREWLKNVQ 114

Query: 347 TMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
            ++  V                 K+   +   +  + R  A+  +  K  D+  ++D D+
Sbjct: 115 KLYHYVEWRPMDDPQSYPDEIGPKHWPGSRFTHVMKLRQAALRTAREKWSDYILFIDVDN 174

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NP  L  +++ N++++AP+L      +SNFW  +   G+Y R+ DY+ I   +   +
Sbjct: 175 FLTNPQTLNLMISENKTIVAPML-ESRSLYSNFWCGITPQGYYKRTPDYIQI--REWKRR 231

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDS 503
           G + VP + + +L+   + K  + K ++     DY       + F  + R  GI + + +
Sbjct: 232 GCFPVPMVHSTFLI--DLRKEASQKLMFFPPHQDYAWTFDDIIVFAFSSRQAGIQMYLCN 289

Query: 504 TQEYGHL 510
            + YG+L
Sbjct: 290 REHYGYL 296


>gi|392352745|ref|XP_222718.6| PREDICTED: procollagen galactosyltransferase 2 [Rattus norvegicus]
          Length = 633

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 57/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+V + V        L  FL  +  L+YP  +++++    +N +    +  +++ N
Sbjct: 48  PLQKPTVFVVVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 107

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I      +S+ A     R  A+  +  K  D+  ++D 
Sbjct: 108 VQRLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  ++  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 168 DNFLTNPQTLNLMIAENKTILAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKRT 226

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 227 --GCFPVPMVHSTFLI--DLRKEASDKLSFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 283 CNKEHYGYL 291


>gi|119605029|gb|EAW84623.1| glycosyltransferase 25 domain containing 1, isoform CRA_c [Homo
           sapiens]
          Length = 565

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/248 (22%), Positives = 118/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 48  PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K+++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283

Query: 503 STQEYGHL 510
           + +EYG L
Sbjct: 284 NKEEYGFL 291


>gi|380019473|ref|XP_003693629.1| PREDICTED: glycosyltransferase 25 family member-like [Apis florea]
          Length = 558

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 67/245 (27%), Positives = 114/245 (46%), Gaps = 27/245 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P+VLI + +      L  FL  +  L YP K+I ++++  NN +    +   +++N    
Sbjct: 18  PTVLIIILVRNKAHTLPYFLTFLERLTYPKKRIHLWIHSDNNIDNSIEILSTWLNNESNK 77

Query: 349 FKNVK---------YIAHNSTVNSKEARNLAV----ENSLHKG----VDFYFYVDSDSHL 391
           +  V+         +   N   N    R L V    E +L  G     DF + +D+D  L
Sbjct: 78  YHGVQINFDENSKGFDDENGITNWSAQRFLHVINLREEALKAGRNIWADFIWMLDADVFL 137

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            NP+ L  L+ +N+ +IAPLL +    +SNFW  + +D +Y R+ +Y  I+  ++  KG 
Sbjct: 138 TNPNTLDELILKNQIVIAPLL-KSDGLYSNFWAGMTSDYYYLRTKEYEPILFREK--KGC 194

Query: 452 WNVPYITNCYLM----KTSVIKATNIKTIYTLNS-MDYDMAFCTNLRNKGIHLKIDSTQE 506
           +NVP I +  L+    + S     N K +Y  N  +D  + F       G+ L I +   
Sbjct: 195 FNVPMIHSAVLINLRKQLSDFLTYNPKKLYQYNGPIDDIITFAVGANKTGVPLFICNDNI 254

Query: 507 YGHLV 511
           YG ++
Sbjct: 255 YGFIM 259


>gi|148707509|gb|EDL39456.1| glycosyltransferase 25 domain containing 2, isoform CRA_a [Mus
           musculus]
          Length = 469

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+V + V        L  FL  +  L+YP  +++++    +N +    +  +++ +
Sbjct: 48  PPQKPTVFVVVLARNAAHTLPYFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKS 107

Query: 345 FKTMFKNVKYIAHNST------------VNSK-----EARNLAVENSLHKGVDFYFYVDS 387
            + ++  V++   N               NS+     + R  A+  +  K  D+  ++D 
Sbjct: 108 VQRLYHYVEWRPMNEPESYPDEIGPKHWPNSRFSHVMKLRQAALRTAREKWSDYILFIDV 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  ++  N++++AP+L      +SNFW  +   GFY R+ DY+ I   +  
Sbjct: 168 DNFLTNPQTLNLMIVENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQI--REWK 224

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 225 RMGCFPVPMVHSTFLI--DLRKEASDKLAFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 283 CNKEHYGYL 291


>gi|148707510|gb|EDL39457.1| glycosyltransferase 25 domain containing 2, isoform CRA_b [Mus
           musculus]
          Length = 625

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+V + V        L  FL  +  L+YP  +++++    +N +    +  +++ +
Sbjct: 48  PPQKPTVFVVVLARNAAHTLPYFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKS 107

Query: 345 FKTMFKNVKYIAHNST------------VNSK-----EARNLAVENSLHKGVDFYFYVDS 387
            + ++  V++   N               NS+     + R  A+  +  K  D+  ++D 
Sbjct: 108 VQRLYHYVEWRPMNEPESYPDEIGPKHWPNSRFSHVMKLRQAALRTAREKWSDYILFIDV 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  ++  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 168 DNFLTNPQTLNLMIVENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 226 -MGCFPVPMVHSTFLI--DLRKEASDKLAFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 283 CNKEHYGYL 291


>gi|26343025|dbj|BAC35169.1| unnamed protein product [Mus musculus]
          Length = 625

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+V + V        L  FL  +  L+YP  +++++    +N +    +  +++ +
Sbjct: 48  PPQKPTVFVVVLARNAAHTLPYFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKS 107

Query: 345 FKTMFKNVKYIAHNST------------VNSK-----EARNLAVENSLHKGVDFYFYVDS 387
            + ++  V++   N               NS+     + R  A+  +  K  D+  ++D 
Sbjct: 108 VQRLYHYVEWRPMNEPESYPDEIGPKHWPNSRFSHVMKLRQAALRTAREKWSDYILFIDV 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  ++  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 168 DNFLTNPQTLNLMIVENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 226 -MGCFPVPMVHSTFLI--DLRKEASDKLAFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 283 CNKEHYGYL 291


>gi|45768794|gb|AAH68118.1| Glycosyltransferase 25 domain containing 2 [Mus musculus]
          Length = 625

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+V + V        L  FL  +  L+YP  +++++    +N +    +  +++ +
Sbjct: 48  PPQKPTVFVVVLARNAAHTLPYFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKS 107

Query: 345 FKTMFKNVKYIAHNST------------VNSK-----EARNLAVENSLHKGVDFYFYVDS 387
            + ++  V++   N               NS+     + R  A+  +  K  D+  ++D 
Sbjct: 108 VQRLYHYVEWRPMNEPESYPDEIGPKHWPNSRFSHVMKLRQAALRTAREKWSDYILFIDV 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  ++  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 168 DNFLTNPQTLNLMIVENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 226 -MGCFPVPMVHSTFLI--DLRKEASDKLAFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 283 CNKEHYGYL 291


>gi|293341373|ref|XP_001070927.2| PREDICTED: procollagen galactosyltransferase 2 [Rattus norvegicus]
 gi|149058405|gb|EDM09562.1| glycosyltransferase 25 domain containing 2 (predicted) [Rattus
           norvegicus]
          Length = 625

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 57/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+V + V        L  FL  +  L+YP  +++++    +N +    +  +++ N
Sbjct: 48  PLQKPTVFVVVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 107

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I      +S+ A     R  A+  +  K  D+  ++D 
Sbjct: 108 VQRLYHYVEWRPMDEPESYPDEIGPKHWPSSRFAHVMKLRQAALRTAREKWSDYILFIDV 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  ++  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 168 DNFLTNPQTLNLMIAENKTILAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 226 -TGCFPVPMVHSTFLI--DLRKEASDKLSFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 283 CNKEHYGYL 291


>gi|228008340|ref|NP_808424.3| procollagen galactosyltransferase 2 precursor [Mus musculus]
 gi|160395572|sp|Q6NVG7.2|GT252_MOUSE RecName: Full=Procollagen galactosyltransferase 2; AltName:
           Full=Glycosyltransferase 25 family member 2; AltName:
           Full=Hydroxylysine galactosyltransferase 2; Flags:
           Precursor
          Length = 625

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/249 (22%), Positives = 112/249 (44%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+V + V        L  FL  +  L+YP  +++++    +N +    +  +++ +
Sbjct: 48  PPQKPTVFVVVLARNAAHTLPYFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKS 107

Query: 345 FKTMFKNVKYIAHNST------------VNSK-----EARNLAVENSLHKGVDFYFYVDS 387
            + ++  V++   N               NS+     + R  A+  +  K  D+  ++D 
Sbjct: 108 VQRLYHYVEWRPMNEPESYPDEIGPKHWPNSRFSHVMKLRQAALRTAREKWSDYILFIDV 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  ++  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 168 DNFLTNPQTLNLMIVENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K  +     DY       + F  + R  GI + +
Sbjct: 226 -MGCFPVPMVHSTFLI--DLRKEASDKLAFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 282

Query: 502 DSTQEYGHL 510
            + + YG+L
Sbjct: 283 CNKEHYGYL 291


>gi|31377697|ref|NP_078932.2| procollagen galactosyltransferase 1 precursor [Homo sapiens]
 gi|74715064|sp|Q8NBJ5.1|GT251_HUMAN RecName: Full=Procollagen galactosyltransferase 1; AltName:
           Full=Glycosyltransferase 25 family member 1; AltName:
           Full=Hydroxylysine galactosyltransferase 1; Flags:
           Precursor
 gi|22761754|dbj|BAC11684.1| unnamed protein product [Homo sapiens]
 gi|80478641|gb|AAI08309.1| Glycosyltransferase 25 domain containing 1 [Homo sapiens]
 gi|119605028|gb|EAW84622.1| glycosyltransferase 25 domain containing 1, isoform CRA_b [Homo
           sapiens]
          Length = 622

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/248 (22%), Positives = 118/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 48  PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K+++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283

Query: 503 STQEYGHL 510
           + +EYG L
Sbjct: 284 NKEEYGFL 291


>gi|410222202|gb|JAA08320.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
 gi|410222204|gb|JAA08321.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
 gi|410259730|gb|JAA17831.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
 gi|410259732|gb|JAA17832.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
 gi|410259734|gb|JAA17833.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
 gi|410259736|gb|JAA17834.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
 gi|410300922|gb|JAA29061.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
 gi|410300924|gb|JAA29062.1| glycosyltransferase 25 domain containing 1 [Pan troglodytes]
          Length = 622

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/248 (22%), Positives = 118/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 48  PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K+++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283

Query: 503 STQEYGHL 510
           + +EYG L
Sbjct: 284 NKEEYGFL 291


>gi|301608466|ref|XP_002933810.1| PREDICTED: procollagen galactosyltransferase 2-like [Xenopus
           (Silurana) tropicalis]
          Length = 616

 Score = 73.9 bits (180), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 60/270 (22%), Positives = 121/270 (44%), Gaps = 33/270 (12%)

Query: 270 TSGCTRCNLIKHLDSLKPD---QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMF 326
           T+ C     +   +++ P+   Q PSVLI++        L  F++ I  L+YP  +I+++
Sbjct: 19  TNLCASAEELNIEEAVLPESSLQKPSVLIAIIARNAAHTLPYFMDCIDKLDYPKSRIAIW 78

Query: 327 VY--NNQEYHAPLFDDYIHNFKTMFKNV-----------------KYIAHNSTVNSKEAR 367
               +N +    +  +++ + + ++  V                 K+   +   +  + R
Sbjct: 79  AATDHNIDNTTAILREWLKSVQKLYHYVEWRPMAEPQSYADELGPKHWPASRFAHVMKLR 138

Query: 368 NLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN 427
             A+  +  K  D+  Y+D+D+ L NP  L  ++  N++++AP+L      +SNFW  + 
Sbjct: 139 QAALRTAKEKWSDYVLYIDADNFLTNPQTLNLMMKENKTIVAPML-ESRTLYSNFWCGMT 197

Query: 428 ADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA 487
             G+Y R+ DY+ I   +    G + VP + +  L+      + N++  +     DY  A
Sbjct: 198 PQGYYKRTPDYVLI--REWKRLGCFPVPMVHSTILIDLRKEASKNLQ--FYPPQEDYTWA 253

Query: 488 ------FCTNLRNKGIHLKIDSTQEYGHLV 511
                 F  + R  GI + I + + YG+L 
Sbjct: 254 FDDIIVFAFSSRQAGIQMYICNREHYGYLA 283


>gi|417411747|gb|JAA52300.1| Putative procollagen galactosyltransferase 1, partial [Desmodus
           rotundus]
          Length = 579

 Score = 73.9 bits (180), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 56/248 (22%), Positives = 117/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 5   PLQAPRVLIALLARNAAHALPATLGALERLQHPRERTALWVATDHNSDNTSAVLREWLVA 64

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K+++ +V                 K+ +     +  + R  A++++     D+  +VDS
Sbjct: 65  VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDTRYEHVMKLRQAALKSARDMWADYILFVDS 124

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 125 DNLILNPDTLTLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 182

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+      T YT  S D  + F  + +   + + + 
Sbjct: 183 -QGCFAVPMVHSTFLIDLRKAASRNLAFYPPHTDYTW-SFDDIIVFAFSCKQAEVQMYVC 240

Query: 503 STQEYGHL 510
           + + YG L
Sbjct: 241 NKEVYGFL 248


>gi|157823499|ref|NP_001099537.1| procollagen galactosyltransferase 1 precursor [Rattus norvegicus]
 gi|149036101|gb|EDL90767.1| glycosyltransferase 25 domain containing 1 (predicted) [Rattus
           norvegicus]
 gi|169642770|gb|AAI60899.1| Glycosyltransferase 25 domain containing 1 [Rattus norvegicus]
          Length = 617

 Score = 73.9 bits (180), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 56/248 (22%), Positives = 116/248 (46%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 43  PLQAPRVLIALLARNAAPALPATLGALERLRHPRERTALWVATDHNTDNTSAILREWLVA 102

Query: 345 FKTMFKNVKY--IAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDS 387
            K ++ +V++      S+   +E                R  A++++     D+  +VDS
Sbjct: 103 VKGLYHSVEWRPAEEPSSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDS 162

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 163 DNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 220

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 221 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 278

Query: 503 STQEYGHL 510
           + + YG L
Sbjct: 279 NKEVYGFL 286


>gi|13470787|ref|NP_102356.1| hypothetical protein mll0582 [Mesorhizobium loti MAFF303099]
 gi|14021530|dbj|BAB48142.1| mll0582 [Mesorhizobium loti MAFF303099]
          Length = 931

 Score = 73.9 bits (180), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 55/241 (22%), Positives = 113/241 (46%), Gaps = 28/241 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P +L+++   +    L  +L  I  L+YP   I +++   NN +    +  +++     +
Sbjct: 405 PRILVTILAKQKEPALPLYLECIEALDYPKASIVLYIRTNNNTDRTEHILREWVERVGHL 464

Query: 349 FKNVKYIAHNSTVNSKE----------------ARNLAVENSLHKGVDFYFYVDSDSHLD 392
           +  V++ A N     ++                 RN+++  +L    DFYF  D D+ + 
Sbjct: 465 YAAVEFDASNVADRVEQFGEHEWNETRFRVLGRIRNISLRKTLEHSCDFYFVADVDNFV- 523

Query: 393 NPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
            P  L+ LV  +  ++APLL  + P + +SN+   ++A+G+Y +   Y  ++N  +  +G
Sbjct: 524 RPATLRELVALDVPIVAPLLRSISPGQYYSNYHAEIDANGYYMQCDQYGWVLN--RHVRG 581

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD-MAFCTNLRNKGIHLKIDSTQEYGH 509
           I  +P +   YL++  V+     +  Y   +  Y+ + F  + R  GI   +D+ Q YG+
Sbjct: 582 IIEMPLVHCTYLVRADVLP----ELTYEDATSRYEYVIFADSARKAGIVQYMDNRQVYGY 637

Query: 510 L 510
           +
Sbjct: 638 I 638


>gi|355755605|gb|EHH59352.1| Procollagen galactosyltransferase 1 [Macaca fascicularis]
          Length = 558

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 55/248 (22%), Positives = 117/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 48  PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K ++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 108 VKNLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-NSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283

Query: 503 STQEYGHL 510
           + +EYG L
Sbjct: 284 NKEEYGFL 291


>gi|355703306|gb|EHH29797.1| Procollagen galactosyltransferase 1 [Macaca mulatta]
          Length = 622

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 57/248 (22%), Positives = 116/248 (46%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 48  PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107

Query: 345 FKTMFKNVKY-IAHNSTVNSKEA----------------RNLAVENSLHKGVDFYFYVDS 387
            K ++ +V++  A      S E                 R  A++++     D+  +VD+
Sbjct: 108 VKNLYHSVEWRPAEEPRSYSDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283

Query: 503 STQEYGHL 510
           + +EYG L
Sbjct: 284 NKEEYGFL 291


>gi|432868112|ref|XP_004071417.1| PREDICTED: procollagen galactosyltransferase 1-like [Oryzias
           latipes]
          Length = 610

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 57/245 (23%), Positives = 113/245 (46%), Gaps = 30/245 (12%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P V +++        L  FL  I  LNYP  +++++V   +N +    +  D++   + +
Sbjct: 40  PRVHVALICRNSEHSLPHFLGTIERLNYPKDRMALWVATDHNVDNTTAVLRDWLIKVQNL 99

Query: 349 FKNVKYIAHNS--TVNSKEA---------------RNLAVENSLHKGVDFYFYVDSDSHL 391
           +  V++       + + +E                R  A+E++     D++  VD D+ L
Sbjct: 100 YHYVEWRPQEEPRSYDDEEGPKHWTDLRYEHVMKLRQAALESAREMWADYFMLVDCDNLL 159

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            NP+VL  L+  N+++IAP+L     A+SNFW  + ++G+Y R+  Y+ I    Q  KG 
Sbjct: 160 TNPNVLWKLIQENKTIIAPML-ESRAAYSNFWCGMTSEGYYRRTPAYIPIRR--QVRKGC 216

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA------FCTNLRNKGIHLKIDSTQ 505
           + VP + + +L+   + K  + +  +     DY  A      F  + R   + + + + +
Sbjct: 217 FAVPMVHSTFLI--DLRKEASKQLAFYPPHPDYSWAFDDIIVFAYSARMADVQMFVCNRE 274

Query: 506 EYGHL 510
            YG+ 
Sbjct: 275 SYGYF 279


>gi|383862287|ref|XP_003706615.1| PREDICTED: glycosyltransferase 25 family member-like [Megachile
           rotundata]
          Length = 570

 Score = 73.6 bits (179), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 68/246 (27%), Positives = 116/246 (47%), Gaps = 29/246 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYI------ 342
           PSVLI++ +      L  FL  +  LNYP ++I +++   NN +    +   ++      
Sbjct: 28  PSVLITILVRNKAHTLPYFLTFLEQLNYPKQRIHLWICSDNNIDKSIEILSTWLNRTAKE 87

Query: 343 -HNFKTMF--KNVKYIAHNSTVNSKEARNLAV----ENSLHKG----VDFYFYVDSDSHL 391
            H  +T F  K+V +   N   +    R L V    E +L+ G     DF + +D+D  +
Sbjct: 88  YHGVETSFDEKSVGFEDENGVAHWSMQRFLHVIKLREAALNAGRNIWADFVWMLDADVFI 147

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            NP  L  L++RN++ +APLL +    +SNFW  +  D +Y R+  Y  I+  ++  KG 
Sbjct: 148 TNPYTLNELISRNQTAVAPLL-KSDGLYSNFWAGMTNDYYYLRTDKYEPILYREE--KGC 204

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDSTQ 505
           ++VP I +  L+      +  + T    N  DYD      + F    +  GI L I +  
Sbjct: 205 FSVPMIHSAVLIDLRTHLSDQL-TYNPKNLNDYDGPIDDIITFAIGAKKFGIPLFICNAN 263

Query: 506 EYGHLV 511
            YG+++
Sbjct: 264 VYGYIM 269


>gi|327282249|ref|XP_003225856.1| PREDICTED: procollagen galactosyltransferase 1-like [Anolis
           carolinensis]
          Length = 527

 Score = 73.6 bits (179), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 57/249 (22%), Positives = 113/249 (45%), Gaps = 30/249 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P + P VL+++        L   L  +  L +P  + +++V   +N +    +  +++ N
Sbjct: 33  PPRAPRVLVALLARNAAHSLPAALGCLERLRHPKDRTALWVATDHNVDNTTAVLREWLTN 92

Query: 345 FKTMFKNVKY--------------IAHNSTVNSKEA---RNLAVENSLHKGVDFYFYVDS 387
            K+M+ +V++                H S    +     R  A++ +     D+  +VDS
Sbjct: 93  VKSMYHSVEWRPMELPRSYPDEEGPKHWSNFRYEHVMKLRQAALQAARDMWADYILFVDS 152

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ +   D+ 
Sbjct: 153 DNLLTNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPVRKRDR- 210

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA------FCTNLRNKGIHLKI 501
            KG + VP + + +L+      + N+  ++     DY  A      F    R   + + +
Sbjct: 211 -KGCFAVPMVHSTFLINLQKEASQNL--VFYPPHPDYTWAFDDIIVFAFACRQAEVQMYV 267

Query: 502 DSTQEYGHL 510
            + + YG L
Sbjct: 268 CNKEVYGFL 276


>gi|326664713|ref|XP_686329.4| PREDICTED: procollagen galactosyltransferase 2 [Danio rerio]
          Length = 584

 Score = 73.6 bits (179), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 58/245 (23%), Positives = 111/245 (45%), Gaps = 29/245 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P V I++        L  FL  I  L+YP  +IS++    +N +    +  ++I   + +
Sbjct: 27  PKVAIAILARNSEHSLPYFLGCIERLDYPKDRISIWAATDHNTDNTTGMLREWIAGVEDL 86

Query: 349 FKNV------------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           + +V                  K+       +  + R  A++++  +  D+  + DSD+ 
Sbjct: 87  YHSVQLHTMEQEKSSYVDELGPKHWPETRFTHVMKLRQAALKSARAQWADYVLFTDSDNL 146

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N  VL  L++ N +L+AP+L      +SNFW  + + G+Y R+  Y+ I    +   G
Sbjct: 147 LTNTQVLNQLISENRTLVAPML-DSRTLYSNFWCGMTSQGYYKRTPHYVPIRTWKR--TG 203

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQ 505
              VP I +  L+     +A+ +   Y ++     ++D  MAF  + R  G+ + I + +
Sbjct: 204 CHPVPMIHSTMLIDLRR-RASELLAFYPVHHHYLWALDDIMAFAFSARQTGVQMFICNRE 262

Query: 506 EYGHL 510
            YG+L
Sbjct: 263 HYGYL 267


>gi|395847891|ref|XP_003796597.1| PREDICTED: procollagen galactosyltransferase 1 [Otolemur garnettii]
          Length = 623

 Score = 73.2 bits (178), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 56/248 (22%), Positives = 117/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 49  PLQAPRVLIALLARNAAHALPTTLGALERLRHPPERTALWVATDHNMDNTSAVLREWLVA 108

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K+++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 109 MKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHIMKLRQAALKSARDMWADYILFVDA 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 169 DNLLLNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 226

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   I + + 
Sbjct: 227 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEIQMYVC 284

Query: 503 STQEYGHL 510
           + + YG L
Sbjct: 285 NKEVYGFL 292


>gi|402904728|ref|XP_003915192.1| PREDICTED: procollagen galactosyltransferase 1 [Papio anubis]
          Length = 622

 Score = 73.2 bits (178), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 55/248 (22%), Positives = 117/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 48  PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K+++ +V                 K+ + +   +  + R  A++++     D+  +VD 
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDV 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283

Query: 503 STQEYGHL 510
           + +EYG L
Sbjct: 284 NKEEYGFL 291


>gi|340715525|ref|XP_003396262.1| PREDICTED: glycosyltransferase 25 family member-like [Bombus
           terrestris]
          Length = 569

 Score = 73.2 bits (178), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 56/193 (29%), Positives = 95/193 (49%), Gaps = 24/193 (12%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P+VLI++ +      L  FL  +  L YP ++I +++   NN +    +   ++ N ++ 
Sbjct: 28  PTVLITILVRNKAHTLPYFLTFLEQLTYPKERIHLWICSDNNIDNSIEILSAWLKNERSK 87

Query: 349 FKNVKY--------------IAHNSTVNSKEARNLAVENSLHKG----VDFYFYVDSDSH 390
           +  V+               IAH S        NL  E +LH G     DF + +D+D  
Sbjct: 88  YHGVEINFDEKSNGFEDENEIAHWSPQRFLHVINLR-EEALHAGRNIWADFIWMLDADVF 146

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L NP+ L  L+ +NE+++APLL +    +SNFW  + +D +Y R+  Y  I+  +   KG
Sbjct: 147 LTNPNTLNELILKNETVVAPLL-KSDGLYSNFWAGVTSDFYYLRTEKYEPILFREI--KG 203

Query: 451 IWNVPYITNCYLM 463
            +NVP I +  L+
Sbjct: 204 CFNVPMIHSAVLI 216


>gi|350422829|ref|XP_003493297.1| PREDICTED: glycosyltransferase 25 family member-like [Bombus
           impatiens]
          Length = 569

 Score = 73.2 bits (178), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 57/193 (29%), Positives = 96/193 (49%), Gaps = 24/193 (12%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P+VLI++ +      L  FL  +  L YP ++I +++   NN +    +   ++ N ++ 
Sbjct: 28  PTVLITILVRNKAHTLPYFLTFLEQLTYPKERIHLWICSDNNIDNSIEILSAWLKNERSK 87

Query: 349 FKNVKYIAHNSTVNSKEARN----------LAV----ENSLHKG----VDFYFYVDSDSH 390
           +  V+ I  N   N  E  N          L V    E +LH G     DF + +D+D  
Sbjct: 88  YHGVE-INFNEKSNGFEDENEISHWSPQRFLHVINLREEALHAGRNIWADFIWMLDADVF 146

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L NP+ L  L+ +NE+++APLL +    +SNFW  + +D +Y R+  Y  I+  +   KG
Sbjct: 147 LTNPNTLNELILKNETVVAPLL-KSDGLYSNFWAGMTSDFYYLRTEKYEPILFREI--KG 203

Query: 451 IWNVPYITNCYLM 463
            +NVP I +  L+
Sbjct: 204 CFNVPMIHSAVLI 216


>gi|390347653|ref|XP_783019.3| PREDICTED: procollagen galactosyltransferase 1-like
           [Strongylocentrotus purpuratus]
          Length = 646

 Score = 72.8 bits (177), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 59/270 (21%), Positives = 124/270 (45%), Gaps = 34/270 (12%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYH--APLFDDYIHNFKTM 348
           P+V I +        L  F   +  LNYP  +I++++  +       P+  ++I      
Sbjct: 46  PTVFIPILARNKAHTLPHFFGYLERLNYPKDRITLWIRADHSVDNTIPMLREWIQRVAHY 105

Query: 349 FKNVKY-IAHNSTVNSKEA----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
           +  V Y    +  V + E                 R+ A++ + +   D+++ +D D+ +
Sbjct: 106 YHTVDYAFEEHPQVYALEKGPHDWPSARFNHLIDLRDQALQEARNVWADYFYTMDVDNFV 165

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
              ++L  L++  +++IAP+L +    +SNFWG + + GFY R+ +Y+ I+   +   G+
Sbjct: 166 WEQNILDVLMSEKKTIIAPML-QSTTYYSNFWGGVTSKGFYKRTKEYVKIVK--RNVTGV 222

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTL----NSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
           + VP + + YL+  +  +AT+  T   L      +D  + F  + +  GI   I +   Y
Sbjct: 223 FKVPMVHSTYLINLNH-EATDKLTYKPLKDYAQDLDDMLTFAHSAKKAGISFYITNKDHY 281

Query: 508 GHLVDSENFDPQKTNP---EVYELIRNPLD 534
           G ++    + P+  +P   EV +++   L+
Sbjct: 282 GAML----YPPESHHPLKEEVEQMLHTKLE 307


>gi|348525092|ref|XP_003450056.1| PREDICTED: procollagen galactosyltransferase 1-like [Oreochromis
           niloticus]
          Length = 657

 Score = 72.8 bits (177), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 61/244 (25%), Positives = 109/244 (44%), Gaps = 28/244 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P V+I++        L  FL  I  LNYP  +I+++V   +N +    +  +++   +  
Sbjct: 86  PRVVIALVCRNSAHSLPLFLGTIERLNYPKDRIALWVATDHNVDNTTAILREWLIKVQNY 145

Query: 349 FKNVKY------IAHNSTVNSKEARNL-----------AVENSLHKGVDFYFYVDSDSHL 391
           +  V++       A    V  K   NL           A++ +     D+    D D+ L
Sbjct: 146 YHYVEWRPEDEPSAFEDEVGPKHWNNLRYEHVMKLRQAALDTAREIWADYLLVADCDNLL 205

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            N DVL  L+  N++++AP+L     A+SNFW  + + G+Y R+  YM I    Q  +G 
Sbjct: 206 TNQDVLWKLMRENKTIVAPML-ESRAAYSNFWCGMTSQGYYRRTPAYMPIRR--QERRGC 262

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQE 506
           + VP + + YLM     +A+     Y  +     ++D  + F  + R   + + I + + 
Sbjct: 263 FPVPMVHSTYLMDLRK-EASRQLAFYPPHPEYSWALDDVIVFAYSARMADVQMYICNKET 321

Query: 507 YGHL 510
           YGH 
Sbjct: 322 YGHF 325


>gi|440908235|gb|ELR58279.1| Procollagen galactosyltransferase 2, partial [Bos grunniens mutus]
          Length = 610

 Score = 72.8 bits (177), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 58/251 (23%), Positives = 112/251 (44%), Gaps = 32/251 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL+ V        L  FL  +  L+YP  +++++    +N +    +  +++ N
Sbjct: 31  PLQRPTVLVVVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEILREWLKN 90

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            +  +  V                 K+   +   +  + R  A+  +  K  D+  ++D 
Sbjct: 91  VQKAYHYVEWRPMDEPESYPDEIGPKHWPASRFAHVMKLRQAALRTAREKWSDYILFIDV 150

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL--NADGFYARSFDYMNIINGD 445
           D+ L NP  L  L+  N++++AP+L      +SNFW  +   A GFY R+ DY+ I    
Sbjct: 151 DNFLTNPQTLNLLMAENKTIVAPML-ESRGLYSNFWCGITPQASGFYKRTPDYLQIREWK 209

Query: 446 QGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHL 499
           +   G + VP + + +L+   + K  + K ++     DY       + F  + R  GI +
Sbjct: 210 R--LGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQM 265

Query: 500 KIDSTQEYGHL 510
            + + + YG+L
Sbjct: 266 YLCNREHYGYL 276


>gi|312376729|gb|EFR23732.1| hypothetical protein AND_12342 [Anopheles darlingi]
          Length = 332

 Score = 72.8 bits (177), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 48/205 (23%), Positives = 96/205 (46%), Gaps = 23/205 (11%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFK 346
           Q PSV+I+V I      L  F   +  L+YP  ++S+++ +  N++    +   ++    
Sbjct: 40  QPPSVMIAVLIRNKEHTLPYFFTYLEELDYPKDRLSIWIRSDHNEDRSIEITKAWLKRST 99

Query: 347 TMFKNVKYIAHNSTVNSKEA------------------RNLAVENSLHKGVDFYFYVDSD 388
            ++ +V +         +E+                  +  A++ +     D+  ++D+D
Sbjct: 100 PLYHSVDFKYRTEPAGKRESEKTYTHWTEDRFADVIRLKEEALQTARKMWADYVLFLDAD 159

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L NP  LK L++    ++AP+LV     +SNFW  + AD +Y R+ DY  I+N +  G
Sbjct: 160 VFLTNPRSLKALIDLKLPIVAPMLVSD-GLYSNFWCGMTADYYYHRTDDYKKILNYELVG 218

Query: 449 KGIWNVPYITNCYLMKTSVIKATNI 473
           +  W VP + +  L+  +V ++  +
Sbjct: 219 Q--WAVPMVHSAVLVDLNVAESRRL 241


>gi|160333551|ref|NP_001103992.1| procollagen galactosyltransferase 1 precursor [Danio rerio]
 gi|160395521|sp|A5PMF6.1|GT251_DANRE RecName: Full=Procollagen galactosyltransferase 1; AltName:
           Full=Glycosyltransferase 25 family member 1; AltName:
           Full=Hydroxylysine galactosyltransferase 1; Flags:
           Precursor
          Length = 604

 Score = 72.8 bits (177), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 71/308 (23%), Positives = 133/308 (43%), Gaps = 51/308 (16%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P VL+++        L   L  I  LNYP  +++++V   +N +    +  +++ N +  
Sbjct: 34  PRVLVALVCRNSAHSLPHVLGAIDRLNYPKDRMAVWVATDHNSDNTTEILREWLVNVQNF 93

Query: 349 FKNVKYIAHNS-TVNSKEA----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
           +  V++   +  +V   E+                R  A+E +     D++  VD D+ L
Sbjct: 94  YHYVEWRPQDEPSVYEGESGPKHWTNLRYEHVMKLRQAALETAREMWADYFMLVDCDNLL 153

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            N DVL  L+  N++++AP+L     A+SNFW  + + G+Y R+  YM I    Q  KG 
Sbjct: 154 TNRDVLWKLMRENKTIVAPML-ESRAAYSNFWCGMTSQGYYKRTPAYMPIRR--QERKGC 210

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA------FCTNLRNKGIHLKIDSTQ 505
           + VP + +  L+   + K  + +  +     DY  A      F  + R   + + I + +
Sbjct: 211 FAVPMVHSTLLL--DLRKEASRQLAFFPPHPDYTWAFDDIIIFAFSARMAEVQMYICNRE 268

Query: 506 EYGHLV-----------DSENFDPQKTNPEVYELIRNPLDWDLRYIHPEYQKSLLPDTVN 554
            YG+             ++E+F     + ++  ++RNP       I P    SL+P   +
Sbjct: 269 TYGYFPVPLRSQNSLQDEAESF----LHSQLEVMVRNPP------IEPSVYLSLMPKQTD 318

Query: 555 NQPCPDVF 562
                +VF
Sbjct: 319 KMGFDEVF 326


>gi|397493909|ref|XP_003817838.1| PREDICTED: LOW QUALITY PROTEIN: procollagen galactosyltransferase 1
           [Pan paniscus]
          Length = 622

 Score = 72.4 bits (176), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 54/248 (21%), Positives = 118/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 48  PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K+++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + +PD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 168 DNLILSPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283

Query: 503 STQEYGHL 510
           + +EYG L
Sbjct: 284 NKEEYGFL 291


>gi|119605027|gb|EAW84621.1| glycosyltransferase 25 domain containing 1, isoform CRA_a [Homo
           sapiens]
          Length = 645

 Score = 72.4 bits (176), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 54/248 (21%), Positives = 117/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 48  PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K+++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +  G +++  
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAGTYMRAT 283

Query: 503 STQEYGHL 510
             + + HL
Sbjct: 284 GPRLFLHL 291


>gi|301787505|ref|XP_002929168.1| PREDICTED: procollagen galactosyltransferase 2-like, partial
           [Ailuropoda melanoleuca]
          Length = 630

 Score = 72.4 bits (176), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 61/266 (22%), Positives = 117/266 (43%), Gaps = 32/266 (12%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL+++        L  FL  +  L++    + M+    +N +    +  +++ N
Sbjct: 53  PMQRPTVLVAILARNAAHALPHFLGCLERLDFAKSPLIMWAATDHNVDNTTEILREWLKN 112

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            ++ +  V++            I       S+ A     R  A+  +  K  D+  ++D 
Sbjct: 113 VQSFYHYVEWRPMDEPESYPDEIGPKHWPGSRFAHVMKLRQAALRTAREKWSDYILFIDV 172

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NP  L  ++  N++++AP+L      +SNFW  +   GFY R+ DY+ I    + 
Sbjct: 173 DNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGITPQGFYKRTPDYLQIREWKR- 230

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKI 501
             G + VP + + +L+   + K  + K ++     DY       + F  + R  GI + +
Sbjct: 231 -LGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFSSRQAGIQMYL 287

Query: 502 DSTQEYGHLVDSENFDPQKTNPEVYE 527
            + + YG+L       PQ+T  E  E
Sbjct: 288 CNREHYGYL--PIPLKPQQTLQEEIE 311


>gi|383421633|gb|AFH34030.1| procollagen galactosyltransferase 1 precursor [Macaca mulatta]
          Length = 622

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 54/248 (21%), Positives = 117/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 48  PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K ++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 108 VKNLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + +PD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 168 DNLILSPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283

Query: 503 STQEYGHL 510
           + +EYG L
Sbjct: 284 NKEEYGFL 291


>gi|410926753|ref|XP_003976837.1| PREDICTED: procollagen galactosyltransferase 2-like [Takifugu
           rubripes]
          Length = 617

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/245 (22%), Positives = 113/245 (46%), Gaps = 26/245 (10%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFK 346
           Q P+V+I++        L  +L  +  LNYP  +IS++  +  N +    +  +++   +
Sbjct: 57  QPPTVVIAILARNSAHSLPYYLGALERLNYPKDRISVWAASDHNVDNTTAVLKEWLTAMQ 116

Query: 347 TMFKNVKYIAHNSTV------------NSK-----EARNLAVENSLHKGVDFYFYVDSDS 389
             + +V++   +               NS+     + +  A+  +  +  D+  Y D+D+
Sbjct: 117 QFYHHVEWRPMDQPTWYAGELGPKHWPNSRYEYVMKLKQAALGFARKRWADYILYADADN 176

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NPD L  L+  N+S++AP+L     A+SNFW  +   G+Y R+ +Y    +  +   
Sbjct: 177 ILTNPDTLNLLIAENKSVVAPML-HSQGAYSNFWCGITPQGYYRRTAEYFPTRHRHR--L 233

Query: 450 GIWNVPYITNCYL--MKTSVIKATNIKTIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQ 505
           G + VP + +  L  ++   +K       +   S  YD  + F  + R +GI + + + +
Sbjct: 234 GCFPVPMVHSTMLLDLRKEGMKRLAFFPPHADYSWPYDDIIVFAFSCRTEGIQMYLCNKE 293

Query: 506 EYGHL 510
            YG+L
Sbjct: 294 RYGYL 298


>gi|148697003|gb|EDL28950.1| glycosyltransferase 25 domain containing 1, isoform CRA_b [Mus
           musculus]
          Length = 478

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 53/247 (21%), Positives = 114/247 (46%), Gaps = 26/247 (10%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 53  PLQAPRVLIALLARNAAPALPATLGALEQLRHPRERTALWVATDHNTDNTSAILREWLVA 112

Query: 345 FKTMFKNVKY--IAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDS 387
            K ++ +V++      S+   +E                R  A++++     D+  ++D 
Sbjct: 113 VKGLYHSVEWRPAEEPSSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFMDI 172

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 173 DNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 230

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYT----LNSMDYDMAFCTNLRNKGIHLKIDS 503
            +G + VP + + +L+      + N+    T      S D  + F  + +   + + + +
Sbjct: 231 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPTHPDYTWSFDDIIVFAFSCKQAEVQMYVCN 289

Query: 504 TQEYGHL 510
            + YG L
Sbjct: 290 KEVYGFL 296


>gi|348508948|ref|XP_003442014.1| PREDICTED: procollagen galactosyltransferase 1-like [Oreochromis
           niloticus]
          Length = 610

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 94/192 (48%), Gaps = 22/192 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P VL+++        L  FL  I  LNYP  +++++V   +N++    +  D++   + +
Sbjct: 40  PRVLLALICRNSEHSLPYFLGTIERLNYPKDRMALWVATDHNEDNTTAILRDWLVKVQKL 99

Query: 349 FKNVKYIAHNSTVNSKEA-----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
           +  V++       + ++                  R  A+E++     D++   D D+ L
Sbjct: 100 YHYVEWRPKEEPRSYEDEEGPKDWIDPRYEHVMKLRQAALESAREMWADYFMLADCDNLL 159

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            N +VL+ L+ +N+++IAP+L     A+SNFW  + + G+Y R+  Y+ +    Q  KG 
Sbjct: 160 TNSNVLRGLMKQNKTIIAPML-ESRAAYSNFWCGMTSQGYYKRTPAYIPV--RKQIRKGC 216

Query: 452 WNVPYITNCYLM 463
           + VP + + +L+
Sbjct: 217 FAVPMVHSTFLI 228


>gi|311249255|ref|XP_003123541.1| PREDICTED: procollagen galactosyltransferase 1-like [Sus scrofa]
 gi|456752987|gb|JAA74072.1| glycosyltransferase 25 domain containing 1 [Sus scrofa]
          Length = 623

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/248 (21%), Positives = 117/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 49  PLQAPRVLIALLARNAAHALPSTLGALERLRHPRERTALWVATDHNSDNTSAVLREWLVA 108

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K+++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 109 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 169 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 226

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 227 -QGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 284

Query: 503 STQEYGHL 510
           + + YG L
Sbjct: 285 NKEVYGFL 292


>gi|119611576|gb|EAW91170.1| glycosyltransferase 25 domain containing 2, isoform CRA_a [Homo
           sapiens]
          Length = 638

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 60/261 (22%), Positives = 114/261 (43%), Gaps = 42/261 (16%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P+VL++V        L  FL  +  L+YP  +++++    +N +    +F +++ N
Sbjct: 49  PLQSPTVLVAVLARNAAHTLPHFLGCLERLDYPKSRMAIWAATDHNVDNTTEIFREWLKN 108

Query: 345 FKTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDS 387
            + ++  V++            I       S+ A     R  A+  +  K  D+  ++D 
Sbjct: 109 VQRLYHYVEWRPMDEPESYPDEIGPKHWPTSRFAHVMKLRQAALRTAREKWSDYILFIDV 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNA------------DGFYARS 435
           D+ L NP  L  L+  N++++AP+L      +SNFW  +               GFY R+
Sbjct: 169 DNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGITPKAKNTTHLFALLQGFYKRT 227

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFC 489
            DY+ I    +   G + VP + + +L+   + K  + K  +     DY       + F 
Sbjct: 228 PDYVQIREWKRT--GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTWTFDDIIVFA 283

Query: 490 TNLRNKGIHLKIDSTQEYGHL 510
            + R  GI + + + + YG+L
Sbjct: 284 FSSRQAGIQMYLCNREHYGYL 304


>gi|355690359|gb|AER99127.1| glycosyltransferase 25 domain containing 1 [Mustela putorius furo]
          Length = 579

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/248 (21%), Positives = 117/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P  + +++V   +N +  + +  +++  
Sbjct: 5   PLQAPRVLIALVARNAAHALPATLGALERLRHPRGRTALWVATDHNSDNTSAVLREWLVA 64

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K+++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 65  VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHIMKLRQAALKSARDMWADYILFVDA 124

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NP+ L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 125 DNLILNPNTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 182

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 183 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 240

Query: 503 STQEYGHL 510
           + +EYG L
Sbjct: 241 NKEEYGFL 248


>gi|348556988|ref|XP_003464302.1| PREDICTED: procollagen galactosyltransferase 1-like [Cavia
           porcellus]
          Length = 627

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 51/221 (23%), Positives = 106/221 (47%), Gaps = 24/221 (10%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L++P ++ +++V   +N +  + +  +++  
Sbjct: 53  PLQAPRVLIALLARNAAHALPATLGALERLHHPRERTALWVATDHNADNTSAVLREWLVA 112

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K ++ +V                 K+ +     +  + R  A++ +     D+  +VD+
Sbjct: 113 VKGLYHSVEWRPAEEPRSYPDEEGPKHWSDTRYEHVMKLRQAALKAARDMWADYILFVDA 172

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ L NPD L+ L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   ++ 
Sbjct: 173 DNLLVNPDTLRLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRER- 230

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAF 488
            +G + VP + + +L+   + KA +    +     DY  AF
Sbjct: 231 -RGCFAVPMVHSTFLL--DLRKAASRSLAFYPPHPDYTWAF 268


>gi|291229542|ref|XP_002734736.1| PREDICTED: glycosyltransferase 25 domain containing 2-like, partial
           [Saccoglossus kowalevskii]
          Length = 576

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 55/259 (21%), Positives = 128/259 (49%), Gaps = 29/259 (11%)

Query: 277 NLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYH 334
           N+++++ +    Q P++ + +        L  FL  I  L+YP  ++ +++ +  N +  
Sbjct: 35  NVVENVHAESEFQNPTIFLPILARNKAHTLPVFLAYIDRLDYPKSRMRIWIQSDHNIDNT 94

Query: 335 APLFDDYIHNFKTMFKNV---------KYIAHNSTVNSKEAR--------NLAVENSLHK 377
             +  +++ N K  ++++         KY      ++  E R          A++ +  +
Sbjct: 95  TSILKEWVSNVKHTYRSIDESYADEPDKYSTEVGPLDWPEERFSHMIKLRQEALDEARRQ 154

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
             DF F+VD D+ ++ P  L  L+   +++IAP++     A++NFW  ++  G+Y R+ +
Sbjct: 155 WADFIFFVDCDNFIEEPQTLNLLIAEKKTIIAPMM-ESDSAYANFWCGVDDQGYYIRTPE 213

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYT-LNSM--DYD--MAFCTNL 492
           Y+  +  ++  KG + VP + + +L+   + +++++K  +  LNS   DYD  + F  + 
Sbjct: 214 YLPTLRRER--KGCFPVPMVHSTFLI--DLRRSSSLKLQFNPLNSYRGDYDDILIFAYSA 269

Query: 493 RNKGIHLKIDSTQEYGHLV 511
           +   I + + +T  +G L+
Sbjct: 270 KIAEIQMYVLNTWYFGMLL 288


>gi|22760716|dbj|BAC11307.1| unnamed protein product [Homo sapiens]
          Length = 622

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/248 (21%), Positives = 116/248 (46%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  +    +++  
Sbjct: 48  PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTELREWLVA 107

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K+++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 108 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 167

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+     D+ 
Sbjct: 168 DNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPTRKRDR- 225

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 226 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 283

Query: 503 STQEYGHL 510
           + +EYG L
Sbjct: 284 NKEEYGFL 291


>gi|397640090|gb|EJK73928.1| hypothetical protein THAOC_04424 [Thalassiosira oceanica]
          Length = 569

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 49/119 (41%), Positives = 62/119 (52%), Gaps = 10/119 (8%)

Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
           AE L   +  L ER F G     VRA   FVVRY P  QP+LR H DSS  + NI LN  
Sbjct: 309 AEKLNARLSVLMERTF-GVFRGAVRANDIFVVRYEPGGQPNLRRHTDSSFISFNIILND- 366

Query: 679 GVDYEGGGCRFIRY----NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
              +EGGG RF       + +V    +G+ ++    +   HEGL  T GTRYI++ F D
Sbjct: 367 --GFEGGGTRFHSRPDGTHIDVKPPAVGYGILSNANI--LHEGLATTNGTRYILVGFDD 421


>gi|170784829|ref|NP_666323.2| procollagen galactosyltransferase 1 precursor [Mus musculus]
 gi|160395574|sp|Q8K297.2|GT251_MOUSE RecName: Full=Procollagen galactosyltransferase 1; AltName:
           Full=Glycosyltransferase 25 family member 1; AltName:
           Full=Hydroxylysine galactosyltransferase 1; Flags:
           Precursor
 gi|34785210|gb|AAH56951.1| Glycosyltransferase 25 domain containing 1 [Mus musculus]
 gi|148697002|gb|EDL28949.1| glycosyltransferase 25 domain containing 1, isoform CRA_a [Mus
           musculus]
          Length = 617

 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 53/247 (21%), Positives = 114/247 (46%), Gaps = 26/247 (10%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 43  PLQAPRVLIALLARNAAPALPATLGALEQLRHPRERTALWVATDHNTDNTSAILREWLVA 102

Query: 345 FKTMFKNVKY--IAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDS 387
            K ++ +V++      S+   +E                R  A++++     D+  ++D 
Sbjct: 103 VKGLYHSVEWRPAEEPSSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFMDI 162

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 163 DNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 220

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN----SMDYDMAFCTNLRNKGIHLKIDS 503
            +G + VP + + +L+      + N+    T      S D  + F  + +   + + + +
Sbjct: 221 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPTHPDYTWSFDDIIVFAFSCKQAEVQMYVCN 279

Query: 504 TQEYGHL 510
            + YG L
Sbjct: 280 KEVYGFL 286


>gi|21595163|gb|AAH32165.1| Glycosyltransferase 25 domain containing 1 [Mus musculus]
          Length = 617

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 54/248 (21%), Positives = 115/248 (46%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 43  PLQAPRVLIALLARNAAPALPATLGALEQLRHPRERTALWVATDHNTDNTSAILREWLVA 102

Query: 345 FKTMFKNVKY--IAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDS 387
            K ++ +V++      S+   +E                R  A++++     D+  ++D 
Sbjct: 103 VKGLYHSVEWRPAEEPSSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFMDI 162

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 163 DNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 220

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 221 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 278

Query: 503 STQEYGHL 510
           + + YG L
Sbjct: 279 NKEVYGFL 286


>gi|74217150|dbj|BAE43293.1| unnamed protein product [Mus musculus]
          Length = 617

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 53/247 (21%), Positives = 114/247 (46%), Gaps = 26/247 (10%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 43  PLQAPRVLIALLARNAAPALPATLGALEQLRHPRERTALWVATDHNTDNTSAILREWLVA 102

Query: 345 FKTMFKNVKY--IAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDS 387
            K ++ +V++      S+   +E                R  A++++     D+  ++D 
Sbjct: 103 VKGLYHSVEWRPAEEPSSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFMDI 162

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 163 DNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 220

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN----SMDYDMAFCTNLRNKGIHLKIDS 503
            +G + VP + + +L+      + N+    T      S D  + F  + +   + + + +
Sbjct: 221 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPTHPDYTWSFDDIIVFAFSCKQAEVQMYVCN 279

Query: 504 TQEYGHL 510
            + YG L
Sbjct: 280 KEVYGFL 286


>gi|431921989|gb|ELK19162.1| Glycosyltransferase 25 family member 1 [Pteropus alecto]
          Length = 624

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 54/248 (21%), Positives = 117/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 50  PLQEPRVLIALLARNAAHALPATLGALERLRHPRERTALWVATDHNSDNTSTVLREWLVA 109

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K ++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 110 VKNLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 169

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L++ N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 170 DNLILNPDTLTLLISENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 227

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 228 -RGCFAVPMVHSTFLIDLRKSASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 285

Query: 503 STQEYGHL 510
           + + YG L
Sbjct: 286 NKEVYGFL 293


>gi|391347179|ref|XP_003747842.1| PREDICTED: glycosyltransferase 25 family member-like [Metaseiulus
           occidentalis]
          Length = 587

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 60/266 (22%), Positives = 128/266 (48%), Gaps = 38/266 (14%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEY-HAP-LFDDYIHNFKTM 348
           P +L+ +        L  F   + NL+YP K I +++ ++  + + P + D +    K+ 
Sbjct: 46  PDILVVILASNEEHTLPIFFGCLENLDYPKKSIELYIRSDHNHDNTPFMLDTWCAARKSE 105

Query: 349 FKNVK------------------YIAHNSTVNSKE-ARNLAVENSLHKGVDFYFYVDSDS 389
           + ++                      + + +  KE A N A E    KG D+ F++D+D+
Sbjct: 106 YADISLDIRMLPTHYDEKDIHWPMSRYRTMIELKEDALNYARE----KGFDYIFFLDTDA 161

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            + N D+L  L++ N++++APLL +    +SNFWG ++  G+Y RS +Y  I+  ++G  
Sbjct: 162 FITNLDLLNDLISVNKTIVAPLL-QSASLYSNFWGDMDKKGYYLRSTNYTEIV--ERGIV 218

Query: 450 GIWNVPYITNCYLMKTSVIKATNI----KTIYTLNSMDYD--MAFCTNLRNKGIHLKIDS 503
           G + V  + +  L+K +   +T +    + +    S+  D  + F  + +   +   + +
Sbjct: 219 GSFPVRLVHSAVLVKLTDEASTALTFVREKVDNFESIPQDDIITFARSAQTNNVPQYVTN 278

Query: 504 TQEYGHLVDSENFDPQKTNPEVYELI 529
            +E G+++ S    P+  + E+ +L+
Sbjct: 279 EKENGYMLRS----PESLSQEIQDLV 300


>gi|405967145|gb|EKC32345.1| Glycosyltransferase 25 family member 1 [Crassostrea gigas]
          Length = 600

 Score = 69.7 bits (169), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 56/245 (22%), Positives = 118/245 (48%), Gaps = 33/245 (13%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFKTM 348
           P+V+I++ +      L  F   +  LNYP  +IS ++ +  N++  A +  +++   K +
Sbjct: 45  PTVMIAILVRNKAHILPWFFGHLEKLNYPKNRISFWIRSDHNEDDSARMLREWVDANKNV 104

Query: 349 FKNVKYIA--------------HNSTVNSKEA---RNLAVENSLHKGVDFYFYVDSDSHL 391
           + ++  +               H ST    +    R  A+  +     D+ F +D+D  L
Sbjct: 105 YHHIDLVIEDNKDKYEDEIGPLHWSTKRFDKVIALRENALLAARRAWADYLFMLDADVVL 164

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPF-KAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           +N + L  L++  + +IAP+L     + +SNFWG ++  G+Y R+  Y +I+  ++   G
Sbjct: 165 ENRNTLTQLIDAKQPIIAPMLNASIGETYSNFWGGMDEMGYYKRAPGYFDIL--ERKRLG 222

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLNSMD-YD------MAFCTNLRNKGIHLKIDS 503
           ++ VP +    L+   ++++ +    +T N  + YD      + F  N+R  G+ + I +
Sbjct: 223 VFEVPMVHTALLLDMHLMESDS----FTYNKPEGYDGPHDDIIIFGLNVRKAGMVMHIMN 278

Query: 504 TQEYG 508
           T+ +G
Sbjct: 279 TEYFG 283


>gi|348527790|ref|XP_003451402.1| PREDICTED: procollagen galactosyltransferase 2-like [Oreochromis
           niloticus]
          Length = 731

 Score = 69.3 bits (168), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 57/261 (21%), Positives = 107/261 (40%), Gaps = 43/261 (16%)

Query: 283 DSLKPDQF---PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFD 339
           + +KP+     P V+I +        L  +L  I  L YP ++I+++   +        D
Sbjct: 149 EQVKPESSLLKPKVMIVIVARNAAHSLPYYLGCIERLEYPKERIAIWAATDHN-----VD 203

Query: 340 DYIHNFKTMFKNVKYIAHNSTVNSKEA------------------------RNLAVENSL 375
           +     +   K  ++I H       E                         R  A++ + 
Sbjct: 204 NTTAMLREWLKRAQHIYHFVEWRPMEEPRSYTDEWGPKHWPPSRFNHVMKLRQAALKAAR 263

Query: 376 HKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARS 435
            +  D+  +VDSD+ L NP VL  ++  N +L+AP+L      +SNFW  +   G+Y R+
Sbjct: 264 ERWADYILFVDSDNLLTNPRVLNLMMAENLTLVAPML-ESRSLYSNFWCGMTPQGYYKRT 322

Query: 436 FDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFC 489
            DY  I    +   G + VP + + +L+      ++++  ++     DY       M F 
Sbjct: 323 PDYQPIREWKR--LGCFPVPMVHSTFLLDLRRESSSDL--VFYPPHPDYSWAFDDIMVFA 378

Query: 490 TNLRNKGIHLKIDSTQEYGHL 510
            + R  G+ + + + + YG L
Sbjct: 379 FSARQAGVQMYVCNREHYGFL 399


>gi|91078804|ref|XP_970300.1| PREDICTED: similar to Glycosyltransferase 25 family member
           [Tribolium castaneum]
 gi|270003725|gb|EFA00173.1| hypothetical protein TcasGA2_TC002995 [Tribolium castaneum]
          Length = 559

 Score = 69.3 bits (168), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 92/192 (47%), Gaps = 22/192 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFKTM 348
           P+VLI+V        L  FL  + NL+YP  +IS+++ +  N +    +   +I+  K  
Sbjct: 24  PTVLIAVLARNKAHTLPYFLTTLENLDYPKNRISLWIRSDHNSDKTIEILRKWINAVKDE 83

Query: 349 FKNV--KYIAHNSTVNSKEA---------------RNLAVENSLHKGVDFYFYVDSDSHL 391
           ++ +  +++  N     +                 R  ++  +     D+Y+ +D D  L
Sbjct: 84  YRMISTEFVEENEGYPDESGPAHWTPERFNHVIDLRESSLNFARKIWADYYWTIDCDVFL 143

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            NP  L  L+++  +++AP+L +    +SNFW  +  D +Y R+ DY  ++N  +   G 
Sbjct: 144 TNPKTLDILISKGYTVVAPML-KSDGLYSNFWYGMTDDYYYQRTEDYKPVVN--RENIGC 200

Query: 452 WNVPYITNCYLM 463
           +NVP + +C L+
Sbjct: 201 FNVPMVHSCVLV 212


>gi|346473379|gb|AEO36534.1| hypothetical protein [Amblyomma maculatum]
          Length = 315

 Score = 69.3 bits (168), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 60/271 (22%), Positives = 112/271 (41%), Gaps = 46/271 (16%)

Query: 272 GCTRCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQ 331
           G  RC      D L   + P++LI+V +      L  F   +   +YP  ++S+++Y + 
Sbjct: 21  GVVRCTTRD--DKL---ESPTLLIAVVLRNKAHVLPHFFGYLERQSYPKSRVSLWIYTDH 75

Query: 332 EYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEA------------------------R 367
                  D       T  +      HN  V  ++                         R
Sbjct: 76  S-----VDTTAEMVNTWAEEASGDYHNVNVTKEDGDAFFPDEDGVQKWTSERYWHIIRLR 130

Query: 368 NLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN 427
             A+  +     DF  ++D D+ L NP  ++ LV  N ++IAP+L     A+SNFW  +N
Sbjct: 131 EEAIHVARAMWADFVLFLDGDALLSNPKTIQDLVEENRTIIAPML-DSRSAYSNFWCGMN 189

Query: 428 ADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSM----- 482
             G+Y R+ +YM I+  ++   G++ V  + +  L+  +   A + K  Y    +     
Sbjct: 190 EKGYYKRTDEYMPILEREK--IGVFPVVMVHSATLINLN--HADSRKLTYDPRKLQGYTG 245

Query: 483 --DYDMAFCTNLRNKGIHLKIDSTQEYGHLV 511
             D  + F  + +  G+ + + +  +YGH++
Sbjct: 246 PNDDVITFAHSAKFAGVEMFVSNKDQYGHIL 276


>gi|323452214|gb|EGB08089.1| hypothetical protein AURANDRAFT_71687 [Aureococcus anophagefferens]
          Length = 1302

 Score = 69.3 bits (168), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 58/203 (28%), Positives = 93/203 (45%), Gaps = 17/203 (8%)

Query: 535 WDLRYIHPEYQKSLLPDTVNNQPCPDVFWF--PIVTEKFCHEFVQIMEAYGQWSDGTNND 592
           W  R +  E  +    D  +++P   V+ F  P+V    C + + I EA+     G    
Sbjct: 699 WAKRRVPFEALERPKSDDDDDEPPAYVYAFDEPVVPAASCADAIAIAEAHASHGGGWTTA 758

Query: 593 KRLETGYEAVPTRDIHMKQVGLAGVW-AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVR 651
           +       AVPT D+ +++V     W  + LR  + P       G     +R   +F+V+
Sbjct: 759 RHF-----AVPTTDVPVREVPALLKWFNDALRSSIFPALG-ALYGLDPARLRVIDAFLVK 812

Query: 652 YRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF--IRYNCNVTATRMGWMLMHPG 709
           Y    Q SL  H D S  +I + LN    DY+GGG  F  +R   N  A   G ++  PG
Sbjct: 813 YSAAAQRSLPLHSDQSQISITLPLNS-SADYDGGGTYFHDLRQAVNRDA---GGLVAFPG 868

Query: 710 RLTHYHEGLQVTQGTRYIMISFV 732
            L H   G  +T+GTR+++++F+
Sbjct: 869 FLPHA--GHAITRGTRFVVVAFL 889


>gi|73986206|ref|XP_541950.2| PREDICTED: procollagen galactosyltransferase 1 [Canis lupus
           familiaris]
          Length = 623

 Score = 69.3 bits (168), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 53/248 (21%), Positives = 117/248 (47%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 49  PLQAPRVLIALVARNAAHALPATLGALERLRHPRERTALWVATDHNSDNTSAVLREWLVA 108

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K+++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 109 VKSLYHSVEWRPAEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NP+ L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+ 
Sbjct: 169 DNLILNPNTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR- 226

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 227 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 284

Query: 503 STQEYGHL 510
           + + YG L
Sbjct: 285 NKEVYGFL 292


>gi|260797405|ref|XP_002593693.1| hypothetical protein BRAFLDRAFT_107673 [Branchiostoma floridae]
 gi|229278921|gb|EEN49704.1| hypothetical protein BRAFLDRAFT_107673 [Branchiostoma floridae]
          Length = 384

 Score = 68.9 bits (167), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 48/194 (24%), Positives = 94/194 (48%), Gaps = 22/194 (11%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEY----- 333
           Q+P++ +++        L   L  +   +YP  ++++++ ++          QE+     
Sbjct: 2   QWPTIFVAILARNKAHSLPYTLGYLERQDYPKSRLALWIQSDHNIDNTSAVIQEWLDGVG 61

Query: 334 HAPLFDDYIH-NFKTMFKNVKYIAHNSTVNSKEA---RNLAVENSLHKGVDFYFYVDSDS 389
           H     D+ H +    F + +   H S    +     R  A+E +  +  DF F +D+D+
Sbjct: 62  HLYHHVDFYHKDAPNYFPDEEGANHWSGTRLRHVIKLRQQALEYARKRWADFMFCMDADN 121

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            + NP  LK L+ +N  +IAP+L     A+SNFW  +   G+Y R+ +YM  I  ++  +
Sbjct: 122 LVTNPRTLKLLIAQNRPIIAPML-ESSTAYSNFWCGMTEKGYYMRTDEYMPTI--ERKRR 178

Query: 450 GIWNVPYITNCYLM 463
           G++ VP + + YL+
Sbjct: 179 GVFPVPMVHSTYLV 192


>gi|149944687|ref|NP_001092425.1| procollagen galactosyltransferase 1 precursor [Bos taurus]
 gi|160395520|sp|A5PK45.1|GT251_BOVIN RecName: Full=Procollagen galactosyltransferase 1; AltName:
           Full=Glycosyltransferase 25 family member 1; AltName:
           Full=Hydroxylysine galactosyltransferase 1; Flags:
           Precursor
 gi|148744100|gb|AAI42351.1| GLT25D1 protein [Bos taurus]
 gi|296486064|tpg|DAA28177.1| TPA: glycosyltransferase 25 domain containing 1 precursor [Bos
           taurus]
          Length = 623

 Score = 68.9 bits (167), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 53/248 (21%), Positives = 116/248 (46%), Gaps = 28/248 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 49  PLQAPRVLIALLARNAAHALPATLGALERLRHPRERTALWVATDHNADNTSAVLREWLVA 108

Query: 345 FKTMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDS 387
            K ++ +V                 K+ + +   +  + R  A++++     D+  +VD+
Sbjct: 109 VKGLYHSVEWRPSEEPRSYPDEEGPKHWSDSRYEHVMKLRQAALKSARDMWADYILFVDA 168

Query: 388 DSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQG 447
           D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   ++ 
Sbjct: 169 DNLILNPDTLTLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRKRER- 226

Query: 448 GKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
            +G + VP + + +L+      + N+        YT  S D  + F  + +   + + + 
Sbjct: 227 -RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYVC 284

Query: 503 STQEYGHL 510
           + + YG L
Sbjct: 285 NKEVYGFL 292


>gi|159793543|gb|ABW99101.1| procollagen-lysine 5-dioxygenase [Drosophila melanogaster]
          Length = 44

 Score = 68.9 bits (167), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 27/41 (65%), Positives = 33/41 (80%)

Query: 556 QPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLE 596
           QPCPDV+WF IV++ FC + V IMEA+  WSDG+NND RLE
Sbjct: 4   QPCPDVYWFQIVSDAFCDDLVAIMEAHNGWSDGSNNDNRLE 44


>gi|395750709|ref|XP_002828943.2| PREDICTED: procollagen galactosyltransferase 1, partial [Pongo
           abelii]
          Length = 462

 Score = 68.9 bits (167), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 56/249 (22%), Positives = 115/249 (46%), Gaps = 29/249 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHN 344
           P Q P VLI++        L   L  +  L +P ++ +++V   +N +  + +  +++  
Sbjct: 48  PLQAPRVLIALLARNAAHALPTTLGALERLRHPRERTALWVATDHNMDNTSTVLREWLVA 107

Query: 345 FKTMFKNVKY-IAHNSTVNSKEA-------RNLAVENSLHKGVDFYF----------YVD 386
            K+++ +V++  A   ++            + L   +SL     F            +VD
Sbjct: 108 VKSLYHSVEWRPAEEPSLGPSTGFAVYLLPKALGSMDSLPPPSSFLAHADAVWGVLQFVD 167

Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           +D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+
Sbjct: 168 ADNLILNPDTLSLLIAENKTVVAPMLDS-RAAYSNFWCGMTSQGYYKRTPAYIPIRKRDR 226

Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKI 501
             +G + VP + + +L+      + N+        YT  S D  + F  + +   + + +
Sbjct: 227 --RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQMYV 283

Query: 502 DSTQEYGHL 510
            + +EYG L
Sbjct: 284 CNKEEYGFL 292


>gi|432848534|ref|XP_004066393.1| PREDICTED: procollagen galactosyltransferase 1-like [Oryzias
           latipes]
          Length = 414

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 54/229 (23%), Positives = 104/229 (45%), Gaps = 27/229 (11%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFK 346
           P   P V++++        L   L  I  LNYP  ++++  + ++    P          
Sbjct: 34  PLLAPRVVVALICRNAEHCLPLVLGAIERLNYPKDRVALCRFTDEV--GP---------- 81

Query: 347 TMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNES 406
             + N++Y       +  + R  A+  +     D+    D D+ L NPDVL  L++ N++
Sbjct: 82  KHWNNLRY------EHVMKLRQAALNTAREIWADYILMTDCDNLLTNPDVLWKLMSENKT 135

Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTS 466
           ++AP+L     A+SNFW  + + G+Y R+  YM I    Q  +G + VP + + YL+   
Sbjct: 136 IVAPML-ESRAAYSNFWCGMTSQGYYKRTPAYMPIRR--QERRGCFAVPMVHSTYLVDLR 192

Query: 467 VIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
              + N+   Y  +     ++D  + F  + R   + + + + + YG+L
Sbjct: 193 KEASRNL-AFYPPHEEYNWALDDVIVFAYSARMADVQMYVCNKETYGYL 240


>gi|414075459|ref|YP_006994777.1| procollagen-lysine 5-dioxygenase [Anabaena sp. 90]
 gi|413968875|gb|AFW92964.1| procollagen-lysine 5-dioxygenase [Anabaena sp. 90]
          Length = 239

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 64/211 (30%), Positives = 97/211 (45%), Gaps = 31/211 (14%)

Query: 50  FIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYD 109
           F++SA+   + VK LG    W    +       ++ L+  EL   D+ DD I+LVTD++D
Sbjct: 24  FLRSAKKQNIDVKVLGEGLEWSANSL-------RLPLILKELK--DVKDDTIVLVTDAFD 74

Query: 110 VIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP------AVGSGYRYLNSGGF 163
           V+     N I E+F      I+F AE+  W  +  Y++Y        V   Y+YLN+G F
Sbjct: 75  VLYVQNANSIYEKFIQGGYKILFAAEK--WY-SHQYEEYKDFYDSIKVPYDYKYLNAGTF 131

Query: 164 IGYAKDIKELISN-----RSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYG 218
           +GY K + E+I N        +N  D +LY    F          + LD   ++F    G
Sbjct: 132 MGYKKYVCEMIDNILSYPNFHENGSDQRLYGKYCF-----ENPETVTLDYCCDIFWCTAG 186

Query: 219 SLEDIKLNFDL-DEFVHLTNTKYNTNPVIIH 248
             E +   +D+ + FV   N    T P IIH
Sbjct: 187 EWEILPELYDIHNGFV--LNKLTGTYPAIIH 215


>gi|345482468|ref|XP_001608141.2| PREDICTED: glycosyltransferase 25 family member-like [Nasonia
           vitripennis]
          Length = 567

 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 61/250 (24%), Positives = 111/250 (44%), Gaps = 37/250 (14%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFK 350
           P++L+ +        L   L+ +  L+YP  +I++++Y++        D+ I   K    
Sbjct: 27  PNILVGILARNKAHTLPYTLSYLEKLDYPKDRIALWIYSDNN-----VDNTIEVLKKWLT 81

Query: 351 NVK--YIAHNSTVNSK----------------------EARNLAVENSLHKGVDFYFYVD 386
             K  Y   N+T++ +                      + R   +  +     DF F +D
Sbjct: 82  VQKDNYFMVNATLDEESHGHDDEKGIADWSSKRFEHIIKLREEVLNYARRIWADFIFMLD 141

Query: 387 SDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           +D  L NP  L  L+ +NE+++APLL +    +SNFW  ++ D +Y R+ DY +I+N   
Sbjct: 142 ADVFLTNPKTLDSLIRKNETVVAPLL-KSDGMYSNFWAGMSDDFYYKRTDDYESILNNKV 200

Query: 447 GGKGIWNVPYITNCYLM----KTSVIKATNIKTIYTLNS-MDYDMAFCTNLRNKGIHLKI 501
              G + VP + +  L+    K S     N K I   N  +D  + F  + +   I L +
Sbjct: 201 S--GCFPVPMVHSAVLIDLRRKNSDYLTYNFKNINNYNGPIDDIITFALSAKYSDISLNV 258

Query: 502 DSTQEYGHLV 511
            + Q+YG ++
Sbjct: 259 CNDQKYGFIM 268


>gi|328789321|ref|XP_397154.3| PREDICTED: glycosyltransferase 25 family member-like [Apis
           mellifera]
          Length = 567

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 65/245 (26%), Positives = 109/245 (44%), Gaps = 27/245 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P+VLI + +      L  FL  +  L YP K+I +++   NN +    +   +++N    
Sbjct: 28  PTVLIIILVRNKAHTLPYFLTFLERLTYPKKRIHLWICSDNNIDNSIEILSAWLNNESNK 87

Query: 349 FKNVK---------YIAHNSTVNSKEARNLAV----ENSLHKG----VDFYFYVDSDSHL 391
           +  V+         +       N    R L V    E +L  G     DF + +D+D  L
Sbjct: 88  YHGVQINFDEKSKGFDDEKGITNWSAQRFLHVINLREEALKAGRNMWADFIWMLDADVFL 147

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            NP+ L  L+ +N+ +IAPLL +    +SNFW  +  D +Y R+ +Y  I+  ++  KG 
Sbjct: 148 TNPNTLDELILKNQIVIAPLL-KSDGLYSNFWAGMTNDYYYLRTKEYEPILFREK--KGC 204

Query: 452 WNVPYITNCYLM----KTSVIKATNIKTIYTLNSMDYD-MAFCTNLRNKGIHLKIDSTQE 506
           +NVP I +  L+    + S     N   +Y  N    D + F       G+ L I +   
Sbjct: 205 FNVPMIHSAVLIDLRKQISDFLTYNPNKLYQYNGPTDDIITFAVGANKTGVPLFICNDNT 264

Query: 507 YGHLV 511
           YG ++
Sbjct: 265 YGFIM 269


>gi|301630121|ref|XP_002944176.1| PREDICTED: glycosyltransferase 25 family member 3-like [Xenopus
           (Silurana) tropicalis]
          Length = 590

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 55/244 (22%), Positives = 110/244 (45%), Gaps = 28/244 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPL--FDDYIHNFKTM 348
           PS++I++        L   L  +  L+YP  +IS++   +    A L    D++   + +
Sbjct: 31  PSLVIALIARNAAHALPYSLGALERLDYPRDRISLWCATDHNEDATLDVLQDWLEAIRPL 90

Query: 349 FKNVKYIA------HNSTVNSKE-----------ARNLAVENSLHKGVDFYFYVDSDSHL 391
           + ++++ A      +      K+            R  A+  +  K  D+  YVD+D+ L
Sbjct: 91  YHSLEWKAEVAPRWYPQETGPKDWPKERYEYVMKLRQEALSYAREKKADYIMYVDADNVL 150

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            N   ++ L+  N++L+AP+L      +SNFW  +N  GFY R+ DY    N  +   G 
Sbjct: 151 TNVHTVRLLMTENKTLVAPMLDSQ-TGFSNFWCGINPQGFYRRTPDYYPTRNRQR--TGC 207

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQE 506
           ++VP + + +L+     ++  +   Y L+     + D  + F  +    G+   + +T  
Sbjct: 208 FSVPMVHSTFLIDLQKEESHGL-AFYPLHPNYTWTFDDIIVFAYSCLAAGVQGYVCNTHR 266

Query: 507 YGHL 510
           YG++
Sbjct: 267 YGYV 270


>gi|312113105|ref|YP_004010701.1| family 2 glycosyl transferase [Rhodomicrobium vannielii ATCC 17100]
 gi|311218234|gb|ADP69602.1| glycosyl transferase family 2 [Rhodomicrobium vannielii ATCC 17100]
          Length = 676

 Score = 67.4 bits (163), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 58/249 (23%), Positives = 114/249 (45%), Gaps = 30/249 (12%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P VL+++   +   FL   L  I +L+YP   I +++   NN +    +  ++      +
Sbjct: 399 PRVLVAILAKQKEEFLPLHLECIESLDYPKSSIVLYIRTNNNTDGTERILREWAKRVGHL 458

Query: 349 FKNVKYIAHNSTVNSKE----------------ARNLAVENSLHKGVDFYFYVDSDSHLD 392
           + +V++ A    V  ++                 RN+++  +L    DFYF  D D+ + 
Sbjct: 459 YADVEFDAEEVEVPVEQFSVHEWNETRFDVLGHIRNVSLSRALAHRCDFYFVADVDNFI- 517

Query: 393 NPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
            P  L+ LV  +  ++AP L  + P   +SN+   ++A G++     Y  I+N  +  +G
Sbjct: 518 RPCTLRELVALDLPIVAPFLRSLSPDDPYSNYHAEIDASGYFEDCDQYSWILN--RWIRG 575

Query: 451 IWNVPYITNC-YLMKTSVIKATNIKTIYTLNSMDYD-MAFCTNLRNKGIHLKIDSTQEYG 508
           +  VP +T+C YL++  V+     +  Y   +  ++ + F  + R  GI    D+ Q YG
Sbjct: 576 VIEVP-VTHCTYLIRADVLG----ELAYRDGTARHEYVIFSESARRHGIPQYFDNRQVYG 630

Query: 509 HLVDSENFD 517
           ++   +  D
Sbjct: 631 YIAFGDGHD 639


>gi|256081803|ref|XP_002577157.1| cerebral cell adhesion molecule related [Schistosoma mansoni]
 gi|350645736|emb|CCD59498.1| cerebral cell adhesion molecule related [Schistosoma mansoni]
          Length = 680

 Score = 67.4 bits (163), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 67/287 (23%), Positives = 127/287 (44%), Gaps = 67/287 (23%)

Query: 275 RCNLIKHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY------ 328
           + +L+K+L+ L     P++ I V +      L  FLN I N  YP K+I++  Y      
Sbjct: 105 KISLLKNLNRL----MPTLCIGVLVRNKAHTLPYFLNGIENQQYPTKRITLIFYVDNTID 160

Query: 329 ------------NNQEYHAPLFDDYIHNFKTMFKNVK------YIAHNSTVNSK---EAR 367
                       N  +YH  + +  ++  K+ ++++       +  H  ++  K   EAR
Sbjct: 161 SSEIILNEWIQCNKDKYHRIILE--VNTTKSEYEHLSKMWTLDHYLHVISLRQKLLDEAR 218

Query: 368 NLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVN-------------------RNESLI 408
           N+          DFY  +D+D  L NP  +++L+N                    N  ++
Sbjct: 219 NI--------WADFYLSIDADVILMNPLTIEHLINVMLDSTISTSKSNLNHKIDENIIIL 270

Query: 409 APLL-VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSV 467
           APL+     + +SNFWGA++ +G+Y RS  Y +I    +  +G++ V  + + +L+    
Sbjct: 271 APLMNCTSSEHYSNFWGAMSEEGYYLRSEHYFDI--QKRRIQGVYPVAMVHSIFLVNLQF 328

Query: 468 IKATNI----KTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            ++  I      I     +D  + F  +++   I   +D+TQ YG++
Sbjct: 329 YQSEQIGYSPAPINYTGPVDDIIIFSRSVQRAEIDFYLDNTQFYGYI 375


>gi|126322946|ref|XP_001368839.1| PREDICTED: procollagen galactosyltransferase 1-like [Monodelphis
           domestica]
          Length = 623

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/194 (22%), Positives = 96/194 (49%), Gaps = 22/194 (11%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
           Q P VLI++        L   L  +  L +P  + +++V   +N +  + +  +++   K
Sbjct: 51  QAPRVLIALIARNAAHALPSTLGALERLRHPRDRTALWVATDHNVDNTSAVLREWLVGVK 110

Query: 347 TMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           +++  V                 K+ +++   +  + R  A++++     D+  ++D+D+
Sbjct: 111 SLYHYVEWRPMEEPRSYPDEEGPKHWSNSRYEHVMKLRQAALKSARDMWADYILFLDADN 170

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+  +
Sbjct: 171 LLINPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRRRDR--R 227

Query: 450 GIWNVPYITNCYLM 463
           G + VP + + +L+
Sbjct: 228 GCFAVPMVHSTFLI 241


>gi|403266623|ref|XP_003925468.1| PREDICTED: procollagen galactosyltransferase 2 [Saimiri boliviensis
           boliviensis]
          Length = 933

 Score = 67.0 bits (162), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/149 (28%), Positives = 75/149 (50%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A+  +  K  D+  ++D D+ L NP  L  L+  N++++AP+L      +SNFW  +
Sbjct: 455 RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLLMAENKTIVAPML-ESRGLYSNFWCGI 513

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN-----S 481
              GFY R+ DY+ I    +   G + VP + + +L+     +A+N  T Y  +     +
Sbjct: 514 TPKGFYKRTPDYVQIREWKR--TGCFPVPMVHSTFLIDLRK-EASNKLTFYPPHQDYTWT 570

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + R  GI + + + + YG+L
Sbjct: 571 FDDIIVFAFSSRQAGIQMYLCNREHYGYL 599


>gi|395512663|ref|XP_003760555.1| PREDICTED: procollagen galactosyltransferase 1 [Sarcophilus
           harrisii]
          Length = 611

 Score = 67.0 bits (162), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/194 (22%), Positives = 96/194 (49%), Gaps = 22/194 (11%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
           Q P VLI++        L   L  +  L +P  + +++V   +N +  + +  +++   K
Sbjct: 39  QAPRVLIALIARNAAHALPSTLGALERLRHPRDRTALWVATDHNVDNTSAVLREWLVGVK 98

Query: 347 TMFKNV-----------------KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           +++  V                 K+ +++   +  + R  A++++     D+  ++D+D+
Sbjct: 99  SLYHYVEWRPMEEPRSYPDEDGPKHWSNSRYEHVMKLRQAALKSARDMWADYILFLDADN 158

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   D+  +
Sbjct: 159 LLINPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRRRDR--R 215

Query: 450 GIWNVPYITNCYLM 463
           G + VP + + +L+
Sbjct: 216 GCFAVPMVHSTFLI 229


>gi|292618105|ref|XP_684212.3| PREDICTED: glycosyltransferase 25 family member 3 isoform 1 [Danio
           rerio]
          Length = 591

 Score = 66.6 bits (161), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 55/247 (22%), Positives = 110/247 (44%), Gaps = 30/247 (12%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
           Q P+V+I++        L  +L  +  LNYP ++IS++    +N +    +  +++   +
Sbjct: 31  QPPTVVIAIIARNAAHSLPHYLGALERLNYPKERISVWAATDHNIDNTTAMLREWLTVMQ 90

Query: 347 TMFKNVKY------------IAHNSTVNSK-----EARNLAVENSLHKGVDFYFYVDSDS 389
           T +  V++            +      NS+     + +  A+  +  +  D+  Y D+D+
Sbjct: 91  TQYHYVEWRPSDKPTSYAGELGPKHWTNSRYEYIMKLKQAALNFAKKRWADYILYSDTDN 150

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NPD L  L+  N+S+IAP+L     A+SN+W  +   G+Y R+ +Y       +   
Sbjct: 151 ILTNPDTLHLLMAENKSVIAPMLDSQ-SAYSNYWCGITPQGYYRRTAEYFP--TKQRQRL 207

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIHLKIDS 503
           G + VP + +  L+   + K    K  +     DY       + F  + R   + + + +
Sbjct: 208 GCYPVPMVHSTVLL--DLRKQGTRKVSFHPPHKDYSWPFDDIIVFAFSCRVSEVQMYLCN 265

Query: 504 TQEYGHL 510
            + YG+L
Sbjct: 266 KERYGYL 272


>gi|380798427|gb|AFE71089.1| procollagen galactosyltransferase 2 precursor, partial [Macaca
           mulatta]
          Length = 551

 Score = 66.2 bits (160), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 73/150 (48%), Gaps = 11/150 (7%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A+  +  K  D+  ++D D+ L NP  L  L+  N++++AP+L      +SNFW  +
Sbjct: 73  RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGI 131

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
              GFY R+ DY+ I    +   G + VP + + +L+   + K  + K  +     DY  
Sbjct: 132 TPKGFYKRTPDYVQIREWKRS--GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTW 187

Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
                + F  + R  GI + + + + YG+L
Sbjct: 188 TFDDIIVFAFSSRQAGIQMYLCNREHYGYL 217


>gi|441628755|ref|XP_003275861.2| PREDICTED: procollagen galactosyltransferase 1 [Nomascus
           leucogenys]
          Length = 703

 Score = 66.2 bits (160), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 78/149 (52%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VD+D+ + NPD L  L+  N++++AP+L     A+SNFW  +
Sbjct: 228 RQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKTVVAPMLDS-RAAYSNFWCGM 286

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
            + G+Y R+  Y+ I   D+  +G + VP + + +L+      + N+        YT  S
Sbjct: 287 TSQGYYKRTPAYIPIRKRDR--RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 343

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + +   + + + + +EYG L
Sbjct: 344 FDDIIVFAFSCKQAEVQMYVCNKEEYGFL 372


>gi|355558949|gb|EHH15729.1| hypothetical protein EGK_01859, partial [Macaca mulatta]
 gi|355759604|gb|EHH61640.1| hypothetical protein EGM_19672, partial [Macaca fascicularis]
          Length = 533

 Score = 66.2 bits (160), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 73/150 (48%), Gaps = 11/150 (7%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A+  +  K  D+  ++D D+ L NP  L  L+  N++++AP+L      +SNFW  +
Sbjct: 60  RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGI 118

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
              GFY R+ DY+ I    +   G + VP + + +L+   + K  + K  +     DY  
Sbjct: 119 TPKGFYKRTPDYVQIREWKRS--GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTW 174

Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
                + F  + R  GI + + + + YG+L
Sbjct: 175 TFDDIIVFAFSSRQAGIQMYLCNREHYGYL 204


>gi|157136453|ref|XP_001656834.1| hypothetical protein AaeL_AAEL003481 [Aedes aegypti]
 gi|122095142|sp|Q17FB8.1|GLT25_AEDAE RecName: Full=Glycosyltransferase 25 family member; Flags:
           Precursor
 gi|108881003|gb|EAT45228.1| AAEL003481-PA [Aedes aegypti]
          Length = 607

 Score = 66.2 bits (160), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 67/271 (24%), Positives = 122/271 (45%), Gaps = 36/271 (13%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFK 346
           Q P VLI   I      L  F + + +  YP  +IS++  +  N++    +   ++    
Sbjct: 27  QSPKVLIVSLIRNKEHTLPYFFSYLEDQEYPKDRISLWFRSDHNEDRSIDIIKAWLKRVT 86

Query: 347 TMFKNV---------KYIAHNSTVNSKEARNLAV----ENSLHKG----VDFYFYVDSDS 389
             + +V         K     S+ +  E R   V    + +L KG     DF  ++D+D 
Sbjct: 87  KKYHSVDFGYRSDAAKRYDEKSSTHWSEDRFADVIRLKQEALDKGRKMWADFVLFLDADV 146

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NP+ +  LV+ N  ++AP+L+     +SNFW  + AD +Y R+ +Y  I+N ++ G+
Sbjct: 147 LLTNPNTIAKLVSLNLPIVAPMLLSD-GLYSNFWCGMTADYYYHRTDEYKEILNYEKTGE 205

Query: 450 GIWNVPYITNCYLMKTSVIKATNIK-------TIYTLNSMDYDMAFCTNLRNKGIHLKID 502
             + VP + +  ++  +V ++ N+          +    +D  + F  +     I + I 
Sbjct: 206 --FPVPMVHSAVMVNINVQQSLNLSFDKRRLPPGHYTGPVDDIIIFAMSANYSSIPMYIS 263

Query: 503 STQEYGH-LVDSENFDP------QKTNPEVY 526
           ++  YG+ LV  E  DP      Q TN +VY
Sbjct: 264 NSASYGYILVPLEQGDPLEKDLEQLTNTKVY 294


>gi|119611577|gb|EAW91171.1| glycosyltransferase 25 domain containing 2, isoform CRA_b [Homo
           sapiens]
 gi|193787801|dbj|BAG53004.1| unnamed protein product [Homo sapiens]
          Length = 506

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 73/150 (48%), Gaps = 11/150 (7%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A+  +  K  D+  ++D D+ L NP  L  L+  N++++AP+L      +SNFW  +
Sbjct: 28  RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGI 86

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
              GFY R+ DY+ I    +   G + VP + + +L+   + K  + K  +     DY  
Sbjct: 87  TPKGFYKRTPDYVQIREWKRT--GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTW 142

Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
                + F  + R  GI + + + + YG+L
Sbjct: 143 TFDDIIVFAFSSRQAGIQMYLCNREHYGYL 172


>gi|397489276|ref|XP_003815656.1| PREDICTED: procollagen galactosyltransferase 2 [Pan paniscus]
          Length = 506

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 73/150 (48%), Gaps = 11/150 (7%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A+  +  K  D+  ++D D+ L NP  L  L+  N++++AP+L      +SNFW  +
Sbjct: 28  RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGI 86

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
              GFY R+ DY+ I    +   G + VP + + +L+   + K  + K  +     DY  
Sbjct: 87  TPKGFYKRTPDYVQIREWKRT--GCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDYTW 142

Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
                + F  + R  GI + + + + YG+L
Sbjct: 143 TFDDIIVFAFSSRQAGIQMYLCNREHYGYL 172


>gi|296233252|ref|XP_002761953.1| PREDICTED: procollagen galactosyltransferase 1 [Callithrix jacchus]
          Length = 738

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 78/149 (52%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VD+D+ + NPD L  L+  N++++AP+L     A+SNFW  +
Sbjct: 263 RQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKTVVAPMLDS-RAAYSNFWCGM 321

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
            + G+Y R+  Y+ I   D+  +G + VP + + +L+      + N+        YT  S
Sbjct: 322 TSQGYYRRTPAYIPIRKRDR--QGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 378

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + +   + + + + +EYG L
Sbjct: 379 FDDIIVFAFSCKQAEVQMYVCNKEEYGFL 407


>gi|410917374|ref|XP_003972161.1| PREDICTED: procollagen galactosyltransferase 1-like [Takifugu
           rubripes]
          Length = 609

 Score = 65.9 bits (159), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 56/244 (22%), Positives = 113/244 (46%), Gaps = 28/244 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P V+I++        L  FL  I  LNYP  +I+++V   +N++    +   ++   +  
Sbjct: 38  PRVVIALICRNSAHSLPLFLGTIERLNYPKDRIALWVATDHNKDNTTSILRSWLIGVQND 97

Query: 349 FKNVKYIAHN-STVNSKEA----------------RNLAVENSLHKGVDFYFYVDSDSHL 391
           +  V++   + S+  + E                 R  A++ +     D+   VD D+ L
Sbjct: 98  YHYVEWRPDDESSAFADETGPKHWNNLRYEHVMKLRQAALDTAREIWADYILVVDCDNLL 157

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            N DVL  L++ N++++AP+L     A+SNFW  + + G+Y R+  Y+ I   ++  +G 
Sbjct: 158 TNQDVLWKLMSENKTIVAPML-ESRAAYSNFWCGMTSQGYYKRTPAYIPIRKRER--RGC 214

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNS-----MDYDMAFCTNLRNKGIHLKIDSTQE 506
           + VP + + YL+     +A+     Y  +S     +D  + F  + R   + + + + + 
Sbjct: 215 FAVPMVHSTYLVDLRK-EASRQLAFYPPHSEYSWALDDVIVFAYSARMADVQMYVCNKEI 273

Query: 507 YGHL 510
           YG+ 
Sbjct: 274 YGYF 277


>gi|255080018|ref|XP_002503589.1| predicted protein [Micromonas sp. RCC299]
 gi|226518856|gb|ACO64847.1| predicted protein [Micromonas sp. RCC299]
          Length = 898

 Score = 65.9 bits (159), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 54/171 (31%), Positives = 79/171 (46%), Gaps = 11/171 (6%)

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVG-LAGVWAEFLR 623
           P++TE  C E+V++ E  G+   G    +     + AVPT DI +  +  L  +W   +R
Sbjct: 733 PLMTEAECAEWVRLAEKAGEARGGWTTSR-----HYAVPTTDIPVHAIPDLLPLWNALMR 787

Query: 624 KYVVPLQEREFIGYHHEP--VRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVD 681
             +  L          +P  VR   +FVVRY    Q  L  H D S  ++ +ALN  G +
Sbjct: 788 DKLASLLSAACPEEMPKPSSVRVHDAFVVRYEAGAQHHLPMHADQSAVSVTLALNDEG-E 846

Query: 682 YEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
           YEGGG  F            G ++   G L H   G  VT+G RYI+ +F+
Sbjct: 847 YEGGGTTFAVPVGKTVRPGRGHVVAFKGGLQHG--GSPVTRGVRYIVAAFL 895


>gi|292621863|ref|XP_002664798.1| PREDICTED: procollagen galactosyltransferase 1-like, partial [Danio
           rerio]
          Length = 535

 Score = 65.9 bits (159), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 56/213 (26%), Positives = 96/213 (45%), Gaps = 32/213 (15%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A+E +     D++  VD D+ L N DVL  L+  N++++AP+L     A+SNFW  +
Sbjct: 60  RQAALETAREMWADYFMLVDCDNLLTNRDVLWKLMRENKTIVAPML-ESRAAYSNFWCGM 118

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDM 486
            + G+Y R+  YM I    Q  KG + VP + +  LM   + K  + +  +     DY  
Sbjct: 119 TSQGYYKRTPAYMPIRR--QERKGCFAVPMVHSTLLM--DLRKEASRQLAFFPPHPDYTW 174

Query: 487 A------FCTNLRNKGIHLKIDSTQEYGHLV-----------DSENFDPQKTNPEVYELI 529
           A      F  + R   + + I + + YG+             ++E+F     + ++  ++
Sbjct: 175 AFDDIIIFAFSARMAEVQMYICNRETYGYFPVPLRSQNSLQDEAESF----LHSQLEVMV 230

Query: 530 RNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVF 562
           RNP       I P    SL+P   +     +VF
Sbjct: 231 RNPP------IEPSVYLSLMPKQTDKMGFDEVF 257


>gi|46446818|ref|YP_008183.1| hypothetical protein pc1184 [Candidatus Protochlamydia amoebophila
           UWE25]
 gi|46400459|emb|CAF23908.1| hypothetical protein pc1184 [Candidatus Protochlamydia amoebophila
           UWE25]
          Length = 547

 Score = 65.5 bits (158), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 59/249 (23%), Positives = 113/249 (45%), Gaps = 29/249 (11%)

Query: 293 VLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFV--YNNQEYHAPLFDDYIHNFKTMFK 350
           V +   ID     +  FL  I  L Y   K+ + +   N  ++   +   ++   +  ++
Sbjct: 40  VWVGAIIDNHDQLIPPFLLTIEKLYYDKAKMHLQIDCCNQNKHVRKIVMQWVEKNRKFYQ 99

Query: 351 NVKYIAHNSTVNSKE-----------ARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKY 399
           ++ ++ H S+++ K+            +N  + N   +  ++   + SD  L  P  LKY
Sbjct: 100 SLVFVDHTSSIDEKKHFIEKNKVLANIKNGYLANCQQQSCNYCLILSSDM-LIAPHTLKY 158

Query: 400 LVNRNESLIAPLLVRPFKA----WSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVP 455
           L+ +++ +I+PLL RPF      + NF+  +  +G+Y    DY+ I N  +   G + VP
Sbjct: 159 LIEKDKPIISPLL-RPFPQPHDPYRNFFCDVTEEGYYKHHEDYLAIANRQK--LGTFQVP 215

Query: 456 YITNCYLMKTSVIKATNIKTIYTLNSMDYD-MAFCTNLRNKGIHLKIDSTQEYG---HLV 511
            +   YL++   +   +    +T    +Y+ +AF T  R K +   I + +E+G   HL 
Sbjct: 216 CVHGVYLIQAPFLSQLS----FTEGFKNYEFLAFSTYARKKNMGQFICNEREFGFLMHLS 271

Query: 512 DSENFDPQK 520
           D    D QK
Sbjct: 272 DDATLDQQK 280


>gi|344241371|gb|EGV97474.1| Glycosyltransferase 25 family member 1 [Cricetulus griseus]
          Length = 948

 Score = 65.5 bits (158), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 77/149 (51%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VDSD+ + NPD L  L+  N++++AP+L     A+SNFW  +
Sbjct: 486 RQAALKSARDMWADYIMFVDSDNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 544

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
            + G+Y R+  Y+ I   D+  +G + VP + + +L+      + N+        YT  S
Sbjct: 545 TSQGYYKRTPAYIPIRRRDR--RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 601

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + +   + + + + + YG L
Sbjct: 602 FDDIIVFAFSCKQAEVQMYVCNKEVYGFL 630


>gi|321463619|gb|EFX74634.1| hypothetical protein DAPPUDRAFT_199801 [Daphnia pulex]
          Length = 623

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 48/205 (23%), Positives = 96/205 (46%), Gaps = 28/205 (13%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFKTM 348
           P+VL+++ +      L  FL     L+YP  ++++++ +  NQ+    + + ++ + +  
Sbjct: 30  PTVLVTLLVRNKAHTLPYFLKLFEELDYPKNRLTLWIKSDQNQDQSLEIMNKWVSSVE-- 87

Query: 349 FKNVKYIAHNSTVNSKEARNLAV----------------ENSLHKG----VDFYFYVDSD 388
            K+  +I H  T  S  A +  +                E +L KG     DF ++VD D
Sbjct: 88  -KSYHHIYHELTTTSPSAVDDKIPTNWTEERFKHIINLREEALDKGRELWADFVWFVDCD 146

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGG 448
             L N   LK +VN N  ++AP+L      +SN+W  +  D +Y R+ +Y  I   ++  
Sbjct: 147 VFLTNNQTLKIMVNTNYPVVAPML-DTLSLYSNYWCGMGLDYYYRRTDEYKPI--REREN 203

Query: 449 KGIWNVPYITNCYLMKTSVIKATNI 473
           KG   V  + +C+++    +++  +
Sbjct: 204 KGCHRVIVVHSCFMVDLRQVESQRL 228


>gi|432952470|ref|XP_004085089.1| PREDICTED: procollagen galactosyltransferase 2-like [Oryzias
           latipes]
          Length = 592

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 54/245 (22%), Positives = 111/245 (45%), Gaps = 26/245 (10%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
           Q P+V++++        L  +L  +  LNYP  +IS++    +N +    +  +++   +
Sbjct: 32  QPPTVVVAIIARNAAHALPYYLGALERLNYPKDRISVWAATDHNVDNTTAILREWLTVMQ 91

Query: 347 TMFKNVKYIAHNSTV------------NSK-----EARNLAVENSLHKGVDFYFYVDSDS 389
             +  V++   +               NS+     + +  A+  +  +  D+  Y D+D+
Sbjct: 92  KYYHYVEWRPMDQPTSYAGELGPKHWPNSRYEYVMKLKQAALNFARKRWADYILYADTDN 151

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NPD L+ ++  N+S+IAP+L     A+SNFW  +   G+Y R+ +Y    +  +   
Sbjct: 152 ILTNPDTLQLMIAENKSVIAPMLDSQ-GAYSNFWCGITPQGYYRRTAEYFPTRHRHR--L 208

Query: 450 GIWNVPYITNCYL--MKTSVIKATNIKTIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQ 505
           G + VP + +  L  ++   +K       +   S  YD  + F  + R   I + + + +
Sbjct: 209 GCFPVPMVHSTVLLNLRKEGMKKLAFYPPHKDYSWPYDDIIVFAFSCRAAEIQMYLCNKE 268

Query: 506 EYGHL 510
            YG+L
Sbjct: 269 RYGYL 273


>gi|241835874|ref|XP_002415078.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase, putative [Ixodes
           scapularis]
 gi|215509290|gb|EEC18743.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase, putative [Ixodes
           scapularis]
          Length = 322

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 88/197 (44%), Gaps = 26/197 (13%)

Query: 276 CNLIKHLDSLKPD----QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN- 330
           C+L+    +   D    + P+V I+V        L  F   +   NYP  +IS+++Y + 
Sbjct: 14  CSLLLATRAWASDDEKLELPTVFIAVIARNKAHVLPHFFGYLEQQNYPKSRISLWIYTDH 73

Query: 331 ---------QEYHAPLFDDYIHNFK-TMFKNVKYIAHNSTVNSKEA---------RNLAV 371
                    + +     DDY HN   T  ++  + A  + V    A         R  A+
Sbjct: 74  NSDDTEDILEAWAEAKSDDY-HNVNLTREESDAFYADENGVQKWTAERYWHVIRLREEAL 132

Query: 372 ENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGF 431
             +     DF  ++D D+ L +P  +  LV  N++++AP+L     A+SNFW  +   G+
Sbjct: 133 NLARSLWADFILFLDCDALLTSPKTILDLVRANKTVVAPML-DSRSAYSNFWCGMTEKGY 191

Query: 432 YARSFDYMNIINGDQGG 448
           Y R+ DYM I+  ++ G
Sbjct: 192 YLRTDDYMPILERERVG 208


>gi|338724828|ref|XP_001489806.3| PREDICTED: procollagen galactosyltransferase 2 [Equus caballus]
          Length = 572

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/161 (26%), Positives = 77/161 (47%), Gaps = 13/161 (8%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A+  +  K  D+  ++D D+ L NP  L  ++  N++++AP+L      +SNFW  +
Sbjct: 94  RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGI 152

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
              GFY R+ DY  I    +   G + VP + + +L+   + K  + K ++     DY  
Sbjct: 153 TPQGFYKRTPDYPQIREWKR--MGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTW 208

Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHLVDSENFDPQKT 521
                + F  + R  GI + + + + YG+L       PQ+T
Sbjct: 209 TFDDIIVFAFSSRQAGIQMYLCNREHYGYL--PIPLKPQQT 247


>gi|224005863|ref|XP_002291892.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220972411|gb|EED90743.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 562

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/142 (35%), Positives = 72/142 (50%), Gaps = 24/142 (16%)

Query: 603 PTRDIHMKQVGLAGV---W-AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDE-Q 657
           PT D+++     +G    W A+ L   + P+ ER F G     VRA   FVVRY  +  Q
Sbjct: 265 PTTDLNLVTDPFSGEDREWLAQRLDARMAPIIERAF-GISRGAVRANDIFVVRYDAEAGQ 323

Query: 658 PSLRPHHDSSTYTINIALNQVGVDYEGGGCRF--------IRYNCNVTATRMGWMLMHPG 709
           P+LR H DSS  + NI LN    +++GGG RF        I  +  V  T +   ++   
Sbjct: 324 PNLRVHTDSSHLSFNILLND---EFDGGGTRFHHRIDKSHIDIHPEVGETLLSHAMI--- 377

Query: 710 RLTHYHEGLQVTQGTRYIMISF 731
               +HEGL  T+GTRYI++ F
Sbjct: 378 ----FHEGLPTTKGTRYILVGF 395


>gi|354473914|ref|XP_003499177.1| PREDICTED: procollagen galactosyltransferase 1 [Cricetulus griseus]
          Length = 571

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 77/149 (51%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VDSD+ + NPD L  L+  N++++AP+L     A+SNFW  +
Sbjct: 109 RQAALKSARDMWADYIMFVDSDNLITNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 167

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
            + G+Y R+  Y+ I   D+  +G + VP + + +L+      + N+        YT  S
Sbjct: 168 TSQGYYKRTPAYIPIRRRDR--RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 224

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + +   + + + + + YG L
Sbjct: 225 FDDIIVFAFSCKQAEVQMYVCNKEVYGFL 253


>gi|348513873|ref|XP_003444465.1| PREDICTED: procollagen galactosyltransferase 1-like [Oreochromis
           niloticus]
          Length = 591

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 58/245 (23%), Positives = 108/245 (44%), Gaps = 26/245 (10%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQ---------EYHAPLFD 339
           Q P+V+I++        L  +L  +  LNYP  +IS++   +          +    +  
Sbjct: 31  QPPTVVIAIIARNTAHSLPYYLGALERLNYPKDRISVWAATDHNIDNTTAILKEWLTVMQ 90

Query: 340 DYIH--NFKTMFKNVKY---IAHNSTVNSK-----EARNLAVENSLHKGVDFYFYVDSDS 389
            Y H   ++ M K   Y   +      NS+     + +  A+  +  +  D+  Y D+D+
Sbjct: 91  KYYHYVEWRPMDKPTSYAGELGPKHWPNSRYEYVMKLKQAALNFARKRWADYILYADTDN 150

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NP+ L  L+  N+S+IAP+L  P  A+SN+W  +   G+Y R+ +Y    +  +   
Sbjct: 151 ILTNPESLNLLIAENKSVIAPMLDSP-GAYSNYWCGITPQGYYRRTAEYFPTRHRHR--V 207

Query: 450 GIWNVPYITNCYL--MKTSVIKATNIKTIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQ 505
           G + VP + +  L  ++   +K       +   S  YD  + F  + R   I + + +  
Sbjct: 208 GCFPVPMVHSTLLLDLRKEGMKKLAFYPPHEDYSWPYDDIIVFAFSCRAAEIQMYLCNKD 267

Query: 506 EYGHL 510
            YG+L
Sbjct: 268 RYGYL 272


>gi|47216930|emb|CAG04872.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 615

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 53/232 (22%), Positives = 106/232 (45%), Gaps = 26/232 (11%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFK 346
           Q P+V+I++        L  +L  +  LNYP  +IS++    +N +    +  +++   +
Sbjct: 31  QPPTVVIAILARNSAHSLPYYLGALERLNYPKDRISVWAATDHNLDNTTAVLREWLTVMQ 90

Query: 347 TMFKNVKY------------IAHNSTVNSK-----EARNLAVENSLHKGVDFYFYVDSDS 389
             + +V++            +      NS+     + +  A+  +  +  D+  Y D+D+
Sbjct: 91  QFYHHVEWRPLEQPTSYAGELGPKHWPNSRYEYLMKLKQAALNFARKRWADYILYADTDN 150

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L NPD L+ L+  N+S+IAP+L     A+SNFW  +   G+Y R+ +Y    +  +   
Sbjct: 151 ILTNPDTLQLLIAENKSVIAPML-HSQGAYSNFWCGITPQGYYRRTAEYFPTRHRHR--L 207

Query: 450 GIWNVPYITNCYLMKTSVIKATNIKTI--YTLNSMDYD--MAFCTNLRNKGI 497
           G + VP + +  L+        N+     +   S  YD  + F  + R++G+
Sbjct: 208 GCFPVPMVHSTLLLDLRKEGMRNLAFFPPHADYSWPYDDIIVFAFSCRSEGV 259


>gi|410986010|ref|XP_003999305.1| PREDICTED: procollagen galactosyltransferase 2 [Felis catus]
          Length = 506

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 74/150 (49%), Gaps = 11/150 (7%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A+  +  +  D+  ++D D+ L NP  L  ++  N++++AP+L      +SNFW  +
Sbjct: 28  RQAALRTAREQWSDYILFIDVDNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGI 86

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
              GFY R+ DY+ I    +   G + VP + + +L+   + K  + K ++     DY  
Sbjct: 87  TPQGFYKRTPDYLQIREWKR--LGCFPVPMVHSTFLI--DLRKEASGKLMFYPPHQDYTW 142

Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
                + F  + R  GI + + + + YG+L
Sbjct: 143 TFDDIIVFAFSSRQAGIQMYLCNREHYGYL 172


>gi|47213906|emb|CAF95848.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 601

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 106/262 (40%), Gaps = 47/262 (17%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLF--------DDYI 342
           P V+I+V        L  +L  I  L YP ++I++    N    + L         D  +
Sbjct: 13  PKVMIAVLARNAAHSLPHYLGCIEKLEYPKERIAICGLTNSGTWSQLIHMLQMAAADHNV 72

Query: 343 HNFKTMFKN-VKYIAH--------------------------NSTVNSK-EARNLAVENS 374
            N   M +  +K+  H                           S  N   + R  A++ +
Sbjct: 73  DNTTAMLREWLKWAQHVYHYVEWRPMDEPRSYTDEWGPKHWPPSRFNHLLKLRQAALKAA 132

Query: 375 LHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYAR 434
             +  D+  +VDSD+ L NP VL  L+  N +L+AP+L      +SNFW  +   G+Y R
Sbjct: 133 RERWADYILFVDSDNLLTNPRVLTLLMAENLTLLAPML-ESRSLYSNFWCGVTPQGYYKR 191

Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAF 488
           + DY  I    +   G + VP + + +L+   + + ++    +     DY       M F
Sbjct: 192 TPDYQPIREWKR--LGCFPVPMVHSTFLL--DLRRESSRDLAFYPPHPDYSWAFDDIMVF 247

Query: 489 CTNLRNKGIHLKIDSTQEYGHL 510
             + R  G+ + + + + YG L
Sbjct: 248 AFSARQAGVQMHVCNREHYGFL 269


>gi|345325485|ref|XP_001516115.2| PREDICTED: procollagen galactosyltransferase 2-like
           [Ornithorhynchus anatinus]
          Length = 625

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 74/149 (49%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A+  +  K  D+  ++D+D+ L NP  L  ++  N++++AP+L      +SNFW  +
Sbjct: 147 RQAALRTAREKWSDYVLFIDADNFLTNPQTLNLMIAENKTIVAPML-ESRSLYSNFWCGI 205

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNS 481
              G+Y R+ DY+ I   +    G + VP + + +L+    + +  +        YT  +
Sbjct: 206 TPQGYYKRTPDYVQI--REWKRIGCFAVPMVHSTFLIDLRKVASDKLSFFPPHQDYTW-T 262

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + R  GI + + + + YG+L
Sbjct: 263 FDDIIVFAFSSRQAGIQMYLCNREHYGYL 291


>gi|403303560|ref|XP_003942394.1| PREDICTED: procollagen galactosyltransferase 1 [Saimiri boliviensis
           boliviensis]
          Length = 630

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 39/149 (26%), Positives = 78/149 (52%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VD+D+ + NPD L  L+  N++++AP+L     A+SNFW  +
Sbjct: 155 RQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 213

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
            + G+Y R+  Y+ I   ++  +G + VP + + +L+      + N+        YT  S
Sbjct: 214 TSQGYYKRTPAYIPIRKRER--QGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 270

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + +   + + + + +EYG L
Sbjct: 271 FDDIIVFAFSCKQAEVQMYVCNKEEYGFL 299


>gi|351705537|gb|EHB08456.1| Glycosyltransferase 25 family member 2 [Heterocephalus glaber]
          Length = 508

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 42/152 (27%), Positives = 74/152 (48%), Gaps = 13/152 (8%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A+  +  K  D+  ++D D+ L NP  L  ++  N++++AP+L      +SNFW  +
Sbjct: 28  RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGI 86

Query: 427 --NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDY 484
              A GFY R+ DY+ I    +   G + VP + + +L+   + K  + K  +     DY
Sbjct: 87  TPQASGFYKRTPDYLQIREWKR--MGCFPVPMVHSTFLI--DLRKEASDKLTFYPPHQDY 142

Query: 485 D------MAFCTNLRNKGIHLKIDSTQEYGHL 510
                  + F  + R  GI + + + Q YG+L
Sbjct: 143 TWTFDDIIVFAFSSRQAGIQMYLCNRQHYGYL 174


>gi|354481442|ref|XP_003502910.1| PREDICTED: procollagen galactosyltransferase 2 [Cricetulus griseus]
          Length = 545

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 73/149 (48%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A+  +  K  D+  ++D D+ L NP  L  ++  N +++AP+L      +SNFW  +
Sbjct: 67  RQAALRTAREKWSDYILFIDVDNFLTNPQTLTLMIAENRTIVAPML-ESRGLYSNFWCGI 125

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN-----S 481
              GFY R+ DY+ I   +    G + VP + + +L+     +A+N    Y  +     +
Sbjct: 126 TPQGFYKRTPDYLQI--REWKRIGCFPVPMVHSTFLIDLRK-EASNNLAFYPPHQDYTWT 182

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + R  GI + + + + YG+L
Sbjct: 183 FDDIIVFAFSSRQAGIQMYLCNKEHYGYL 211


>gi|332025630|gb|EGI65792.1| Glycosyltransferase 25 family member [Acromyrmex echinatior]
          Length = 357

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 50/192 (26%), Positives = 94/192 (48%), Gaps = 22/192 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P+VLISV +      L  FL+ + N +YP K+IS ++   NN +    + + +I++   M
Sbjct: 28  PTVLISVLVRNKAHTLPYFLSLLENQDYPKKRISFWIRSDNNVDNSIEILNKWINSRSKM 87

Query: 349 FKNVKYIAHNSTVNSKEARNLA-------------VENSLHKG----VDFYFYVDSDSHL 391
           + ++    + S+   ++ R++A              E +L        DF   +D+D  L
Sbjct: 88  YHSMNVHLNASSTGFEDERSIADWSPRRFAHIIDLREQALDYAKEIWADFILMLDADVFL 147

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            NP  ++ L+++  +++APLL R    +SNFW  +  + +Y R+  Y  I+  ++     
Sbjct: 148 INPSTIRNLIHKEYTVVAPLL-RSDGMYSNFWAGMTTEHYYLRTELYEPILFREKIDCH- 205

Query: 452 WNVPYITNCYLM 463
            NVP I +  L+
Sbjct: 206 -NVPMIHSVVLI 216


>gi|241757469|ref|XP_002401539.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase, putative [Ixodes
           scapularis]
 gi|215508473|gb|EEC17927.1| procollagen-lysine, 2-oxoglutarate 5-dioxygenase, putative [Ixodes
           scapularis]
          Length = 52

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 28/50 (56%), Positives = 35/50 (70%)

Query: 591 NDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHE 640
            D+RL  GYE VPTRDIHM QV     W  FLR+Y+ P+QE+ F+GY H+
Sbjct: 2   QDERLAGGYENVPTRDIHMNQVNFEQHWLFFLREYIKPVQEKVFLGYFHD 51


>gi|443714373|gb|ELU06820.1| hypothetical protein CAPTEDRAFT_153006 [Capitella teleta]
          Length = 550

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 88/193 (45%), Gaps = 22/193 (11%)

Query: 294 LISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFKTMFKN 351
           +I+ F+      +  FL  +  L+Y  +KI++++ +  N +  A L   +I   K M+ +
Sbjct: 1   MIAFFVRNKAHTIPYFLYYLEQLDYDKQKINLWIRSDHNVDLSASLIKHWIPKAKKMYNH 60

Query: 352 VKYIAHNSTVNSKEAR-----------------NLAVENSLHKGVDFYFYVDSDSHLDNP 394
           V +   NST    + R                   A+  S    + + FY+D D+ L N 
Sbjct: 61  VSFKDDNSTSAFSDERGPFDWSADRMKHMIMLRQEALNVSRQMNLRYIFYIDVDNILVNS 120

Query: 395 DVLKYLVNRNESLIAPLL-VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWN 453
            VL++L++     +AP+L       +SNFW  ++  GFY R+ +Y  I    +  +G + 
Sbjct: 121 QVLRHLISLQRIAVAPMLNTTASPHYSNFWAGMDEQGFYKRTLEYKPI--QLRHTQGTFQ 178

Query: 454 VPYITNCYLMKTS 466
           VP I +  L+  S
Sbjct: 179 VPMIHSTLLLDLS 191


>gi|47206702|emb|CAF89946.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 270

 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 73/150 (48%), Gaps = 11/150 (7%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++ +  +  D+  +VDSD+ L NP VL  L+  N +L+AP+L      +SNFW  +
Sbjct: 101 RQAALKAARERWADYILFVDSDNLLTNPRVLTLLMAENLTLLAPML-ESRSLYSNFWCGV 159

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
              G+Y R+ DY  I    +   G + VP + + +L+   + + ++    +     DY  
Sbjct: 160 TPQGYYKRTPDYQPIREWKR--LGCFPVPMVHSTFLL--DLRRESSRDLAFYPPHPDYSW 215

Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
                M F  + R  G+ + + + + YG L
Sbjct: 216 AFDDIMVFAFSARQAGVQMHVCNREHYGFL 245


>gi|307213490|gb|EFN88899.1| Glycosyltransferase 25 family member [Harpegnathos saltator]
          Length = 347

 Score = 62.8 bits (151), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 52/192 (27%), Positives = 95/192 (49%), Gaps = 22/192 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIH----- 343
           P+VLI++ +      L  FL+ +  L+YP +++ +++   NN +    + + +I+     
Sbjct: 9   PTVLITILVRNKAHTLPYFLSLMEQLDYPKERMCLWICSDNNVDNTIEILNKWINSEGKK 68

Query: 344 --------NFKTM-FKNVKYIAHNSTVNSKEARNL---AVENSLHKGVDFYFYVDSDSHL 391
                   N  +M F++ K I   S+       NL   A+  +     DF + +D+D  L
Sbjct: 69  YHCLNVHLNATSMGFEDEKTITDWSSRRFAHVINLREQALNYARQIWTDFIWMLDADVFL 128

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            N   L+ LV + E+++APLL +    +SNFW  + A+ +YAR+  Y  I+  ++   G 
Sbjct: 129 TNSSTLRNLVLKGETVVAPLL-KSDGMYSNFWAGMTAEYYYARTDQYEPILYREE--IGC 185

Query: 452 WNVPYITNCYLM 463
            NVP I +  L+
Sbjct: 186 HNVPMIHSAVLI 197


>gi|444726648|gb|ELW67172.1| Procollagen galactosyltransferase 1 [Tupaia chinensis]
          Length = 983

 Score = 62.8 bits (151), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 61/107 (57%), Gaps = 3/107 (2%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VD+D+ + NPD L  L+  N++++AP+L     A+SNFW  +
Sbjct: 548 RQAALKSARDMWADYILFVDADNFILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 606

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNI 473
            + G+Y R+  Y+ I   D+  +G + VP + + +L+      + N+
Sbjct: 607 TSQGYYKRTPAYIPIRKRDR--QGCFAVPMVHSTFLIDLRKAASRNL 651


>gi|432090315|gb|ELK23745.1| Procollagen galactosyltransferase 1 [Myotis davidii]
          Length = 578

 Score = 62.8 bits (151), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 36/122 (29%), Positives = 67/122 (54%), Gaps = 5/122 (4%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VD+D+ + NPD L  L+  N++++AP+L     A+SNFW  +
Sbjct: 118 RQAALKSARDMWADYILFVDADNLILNPDTLTLLIAENKTVVAPML-DSRAAYSNFWCGM 176

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDM 486
            + G+Y R+  Y+ I   D+  +G + VP + + +L+   + KA +    +     DY  
Sbjct: 177 TSQGYYKRTPAYIPIRKRDR--QGCFAVPMVHSTFLI--DLRKAASRSLAFYPPHTDYTW 232

Query: 487 AF 488
           AF
Sbjct: 233 AF 234


>gi|426230314|ref|XP_004009220.1| PREDICTED: procollagen galactosyltransferase 1 [Ovis aries]
          Length = 618

 Score = 62.8 bits (151), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 39/149 (26%), Positives = 77/149 (51%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VD+D+ + NPD L  L+  N++++AP+L     A+SNFW  +
Sbjct: 143 RQAALKSARDMWADYILFVDADNLILNPDTLTLLIAENKTVVAPML-DSRAAYSNFWCGM 201

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
            + G+Y R+  Y+ I   D+  +G + VP + + +L+      + N+        YT  S
Sbjct: 202 TSQGYYKRTPAYIPIRKRDR--RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 258

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + +   + + + + + YG L
Sbjct: 259 FDDIIVFAFSCKQAEVQMYVCNREVYGFL 287


>gi|410950910|ref|XP_003982145.1| PREDICTED: procollagen galactosyltransferase 1, partial [Felis
           catus]
          Length = 535

 Score = 62.8 bits (151), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 39/149 (26%), Positives = 77/149 (51%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VD+D+ + NPD L  L+  N++++AP+L     A+SNFW  +
Sbjct: 60  RQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 118

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
            + G+Y R+  Y+ I   D+  +G + VP + + +L+      + N+        YT  S
Sbjct: 119 TSQGYYKRTPAYIPIRKRDR--QGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 175

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + +   + + + + + YG L
Sbjct: 176 FDDIIVFAFSCKQAEVQMYVCNKEVYGFL 204


>gi|312068784|ref|XP_003137376.1| hypothetical protein LOAG_01790 [Loa loa]
          Length = 102

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 29/76 (38%), Positives = 46/76 (60%), Gaps = 1/76 (1%)

Query: 33  KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS-SLGGGYKVNLLKNEL 91
           K LV+TVA+ ETDG +R  ++A  N  +++  G+ + W GG+     GGG K+ +L+  L
Sbjct: 27  KLLVVTVATEETDGLRRLKRTAHTNHFRLEVFGMGEEWRGGNTRVEQGGGQKIRILRKSL 86

Query: 92  DEMDITDDMIILVTDS 107
            +    DD+IIL  D+
Sbjct: 87  GKYKDRDDLIILFVDA 102


>gi|410931648|ref|XP_003979207.1| PREDICTED: procollagen galactosyltransferase 2-like, partial
           [Takifugu rubripes]
          Length = 536

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 73/150 (48%), Gaps = 11/150 (7%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++ +  +  D+  +VDSD+ L NP VL  L+  N +L+AP+L      +SNFW  +
Sbjct: 60  RQAALKAARERWADYILFVDSDNLLTNPRVLTLLMAENLTLVAPML-ESRSLYSNFWCGV 118

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD- 485
              G+Y R+ DY  I    +   G + VP + + +L+   + + ++    +     DY  
Sbjct: 119 TPQGYYKRTPDYQPIREWKR--LGCFPVPMVHSTFLL--DLRRESSRDLAFYPPHPDYSW 174

Query: 486 -----MAFCTNLRNKGIHLKIDSTQEYGHL 510
                M F  + R  G+ + + + + YG L
Sbjct: 175 AFDDIMVFAFSARQVGVQMHVCNREHYGLL 204


>gi|303291280|ref|XP_003064926.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226453597|gb|EEH50906.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 383

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 57/258 (22%), Positives = 106/258 (41%), Gaps = 39/258 (15%)

Query: 31  EDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGG----DMSSLGGGYKVNL 86
           ++K +  T +   T G    + SA  N   +  LG++  ++G      +  L G      
Sbjct: 78  QNKLVFFTYSDRVTTGLCLSMLSAASNGFLLHVLGINDTYVGDVHEPKLKKLYGMKSFLS 137

Query: 87  LKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTF-----DANIVFGAERLCWP- 140
            +  L+   + D+ +++  D+ DV+  G  ++ L              I+   ER CWP 
Sbjct: 138 DRRALERYGLGDETVLVFADASDVLYLGSRDEALHTLQQLLGPLERGIILISGERNCWPF 197

Query: 141 ---DTSLY-------DKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSI---KNEEDDQL 187
              D  L        +++P   S +R+LN+G + G  K ++  +         N  DDQL
Sbjct: 198 VHYDKELTAGGREKCEEFPHRNSSFRFLNAGAYAGAIKPMRAFLKTLHAGIPSNVSDDQL 257

Query: 188 YYALLFLDETLRTKH---KIVLDTLANLFQNLYGSLEDIKLNFDLDEFV----------- 233
            +  L+  +    +H   ++V+D  + +FQ   G L  ++     DE V           
Sbjct: 258 VFQELYSKQVREGRHELFELVIDHASKMFQT--GHLTSLEGAGTFDEPVPMNAYFNAGIG 315

Query: 234 HLTNTKYNTNPVIIHGNG 251
            + N++  T P ++H NG
Sbjct: 316 RVVNSESETRPFLVHFNG 333


>gi|349992099|dbj|GAA36581.1| collagen beta-1 O-galactosyltransferase [Clonorchis sinensis]
          Length = 673

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 60/252 (23%), Positives = 114/252 (45%), Gaps = 34/252 (13%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P+V I V +      L  FL+ +   +Y  K+I +     N+ +    +   +I +    
Sbjct: 120 PTVFIGVLVRNKAHALPYFLHGLETQDYLTKRIQLLFLADNSIDDSVNVLSQWIDSVSER 179

Query: 349 FKNVK------YIAHNSTVNSKEARNLAVE-----NSLHKG-VDFYFYVDSDSHLDNPDV 396
           +  V       Y+AH+   +++   ++A+      N+  K   DFY  +D+D  L NP  
Sbjct: 180 YHQVNLEIGGDYLAHSKMWSTEHYEHVALLRQRLLNAARKSWADFYLTIDADVILMNPGT 239

Query: 397 LKYLVNRNES-------LIAPL-LVRPF------KAWSNFWGALNADGFYARSFDYMNII 442
           LK+LV   +S       L+ PL ++ P       + +SNFWGA+   G+YARS  Y +I 
Sbjct: 240 LKHLVESAQSPGKIVSELLDPLPVISPLMNCTSSEFYSNFWGAMTETGYYARSDTYFDIQ 299

Query: 443 NGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNS----MDYDMAFCTNLRNKGIH 498
              +   G++ VP + + +L+      + N++     +     +D  + F  + +   + 
Sbjct: 300 R--RLVLGLFEVPMVHSIFLVNLRHKLSENLRYFPPPSGYKGPLDDLIIFARSAQLSNVP 357

Query: 499 LKIDSTQEYGHL 510
             +D+ + YG+L
Sbjct: 358 FYLDNREFYGYL 369


>gi|440904329|gb|ELR54855.1| Procollagen galactosyltransferase 1, partial [Bos grunniens mutus]
          Length = 544

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 77/149 (51%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VD+D+ + NPD L  L+  N++++AP+L     A+SNFW  +
Sbjct: 69  RQAALKSARDMWADYILFVDADNLILNPDTLTLLIAENKTVVAPML-DSRAAYSNFWCGM 127

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
            + G+Y R+  Y+ I   ++  +G + VP + + +L+      + N+        YT  S
Sbjct: 128 TSQGYYKRTPAYIPIRKRER--RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 184

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + +   + + + + + YG L
Sbjct: 185 FDDIIVFAFSCKQAEVQMYVCNKEVYGFL 213


>gi|344283113|ref|XP_003413317.1| PREDICTED: procollagen galactosyltransferase 1-like [Loxodonta
           africana]
          Length = 540

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/149 (26%), Positives = 77/149 (51%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VD+D+ + NPD L  L+  N++++AP+L     A+SNFW  +
Sbjct: 99  RQAALKSARDMWADYILFVDADNLILNPDTLTLLMAENKTVVAPML-DSRAAYSNFWCGM 157

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
            + G+Y R+  Y+ I   D+  +G + VP + + +L+      + N+        YT  S
Sbjct: 158 TSQGYYKRTPAYIPIRKRDR--QGCFPVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-S 214

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + +   + + + + + YG L
Sbjct: 215 FDDIIVFAFSCKQAEVQMYVCNKEVYGFL 243


>gi|297276457|ref|XP_001114885.2| PREDICTED: procollagen galactosyltransferase 1-like [Macaca
           mulatta]
          Length = 474

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 70/132 (53%), Gaps = 9/132 (6%)

Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
           +VD+D+ + NPD L  L+  N++++AP+L     A+SNFW  + + G+Y R+  Y+ I  
Sbjct: 26  FVDADNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGMTSQGYYKRTPAYIPIRK 84

Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIH 498
            D+  +G + VP + + +L+      + N+        YT  S D  + F  + +   + 
Sbjct: 85  RDR--RGCFAVPMVHSTFLIDLRKAASRNLAFYPPHPDYTW-SFDDIIVFAFSCKQAEVQ 141

Query: 499 LKIDSTQEYGHL 510
           + + + +EYG L
Sbjct: 142 MYVCNKEEYGFL 153


>gi|281343517|gb|EFB19101.1| hypothetical protein PANDA_000528 [Ailuropoda melanoleuca]
          Length = 535

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 77/149 (51%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VD+D+ + NP+ L  L+  N++++AP+L     A+SNFW  +
Sbjct: 60  RQAALKSARDMWADYILFVDADNLILNPNTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 118

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
            + G+Y R+  Y+ I   D+  +G + VP + + +L+      + N+        YT  S
Sbjct: 119 TSQGYYKRTPAYIPIRKRDR--RGCFAVPMVHSTFLIDLRKSASRNLAFYPPHPDYTW-S 175

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + +   + + + + + YG L
Sbjct: 176 FDDIIVFAFSCKQAEVQMYVCNKEMYGFL 204


>gi|301753877|ref|XP_002912839.1| PREDICTED: procollagen galactosyltransferase 1-like [Ailuropoda
           melanoleuca]
          Length = 542

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 77/149 (51%), Gaps = 9/149 (6%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VD+D+ + NP+ L  L+  N++++AP+L     A+SNFW  +
Sbjct: 67  RQAALKSARDMWADYILFVDADNLILNPNTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 125

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIK-----TIYTLNS 481
            + G+Y R+  Y+ I   D+  +G + VP + + +L+      + N+        YT  S
Sbjct: 126 TSQGYYKRTPAYIPIRKRDR--RGCFAVPMVHSTFLIDLRKSASRNLAFYPPHPDYTW-S 182

Query: 482 MDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
            D  + F  + +   + + + + + YG L
Sbjct: 183 FDDIIVFAFSCKQAEVQMYVCNKEMYGFL 211


>gi|149757348|ref|XP_001499949.1| PREDICTED: procollagen galactosyltransferase 1 [Equus caballus]
          Length = 548

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 61/107 (57%), Gaps = 3/107 (2%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VD+D+ + NPD L  L+  N++++AP+L     A+SNFW  +
Sbjct: 73  RQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 131

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNI 473
            + G+Y R+  Y+ I   ++  +G + VP + + +L+      + N+
Sbjct: 132 TSQGYYRRTPAYIPIRKRER--RGCFAVPMVHSTFLIDLRKAASRNL 176


>gi|326402622|ref|YP_004282703.1| hypothetical protein ACMV_04740 [Acidiphilium multivorum AIU301]
 gi|325049483|dbj|BAJ79821.1| hypothetical protein ACMV_04740 [Acidiphilium multivorum AIU301]
          Length = 667

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 62/270 (22%), Positives = 114/270 (42%), Gaps = 40/270 (14%)

Query: 284 SLKPDQF--------PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEY 333
           +LKP++         P VLI++   +   FL  +L+ I  L+YP   I +++   NN + 
Sbjct: 385 ALKPERLLRSGTTTAPRVLIAILAKQKEEFLPLYLDCIEALDYPKSSIVLYIRTNNNTDR 444

Query: 334 HAPLFDDYIHNFKTMFKNVKYIAHNSTVNSK------------------EARNLAVENSL 375
              +  ++I      +  V++    S V+ +                    RN ++  + 
Sbjct: 445 TEEILREWIARVGHSYAAVEF--DPSDVDERVEQFGAHEWNAIRFRVLGRIRNESLRKTR 502

Query: 376 HKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYA 433
             G D+YF  D D+ +     L+ LV     ++APLL    P   +SN    ++ +G++ 
Sbjct: 503 EHGCDWYFVADIDNFIRRC-TLRELVATGLPIVAPLLRDAEPSSYYSNLHAEIDDNGYFR 561

Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTN-L 492
               Y  I++  +  +G+  VP +   Y ++  VI+  N    Y   S  Y+    ++  
Sbjct: 562 DCAQYELIMS--RRIQGLIEVPLVHCTYAVRADVIEHLN----YDDGSGRYEYVILSDSA 615

Query: 493 RNKGIHLKIDSTQEYGHLVDSENFDPQKTN 522
           R   I    D+ Q YG++  S+N D    N
Sbjct: 616 RKASIPQYFDNRQVYGYITFSKNPDQYDEN 645


>gi|301758784|ref|XP_002915272.1| PREDICTED: glycosyltransferase 25 family member 3-like [Ailuropoda
           melanoleuca]
          Length = 590

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 52/255 (20%), Positives = 110/255 (43%), Gaps = 30/255 (11%)

Query: 280 KHLDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN--------- 330
           K+L++  P   P+V++++        L  +L  +  L+YP  +++++   +         
Sbjct: 17  KNLEASPP--LPAVVLTILARNAEHSLPHYLGALERLDYPRARLALWCATDHNTDNTTQM 74

Query: 331 -QEYHAPLFDDYIHNF------KTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVD 380
            +E+ A + DDY   F         + + +   H +    +   E +  A+  +   G D
Sbjct: 75  LREWLAAVGDDYAAVFWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGAD 134

Query: 381 FYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMN 440
           +  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ DY  
Sbjct: 135 YILFADTDNILTNNQTLRLLIEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFP 193

Query: 441 IINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNK 495
             N  +  +G + VP + + +L+      A  +        YT    D  + F    +  
Sbjct: 194 TKNRQR--RGCFRVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYACQAA 250

Query: 496 GIHLKIDSTQEYGHL 510
           G+ + + +   YG++
Sbjct: 251 GVTVHVCNEHRYGYM 265


>gi|313229149|emb|CBY23734.1| unnamed protein product [Oikopleura dioica]
          Length = 576

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 48/196 (24%), Positives = 93/196 (47%), Gaps = 26/196 (13%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           PS+L+ VF+      L  F   +   NYP  +I ++    +N +    +   +   ++  
Sbjct: 29  PSILLPVFVRNKEHALPYFFGGLERQNYPKSRIRLWFVTDHNADNSLEVIKAWKEAWEME 88

Query: 349 FKNVKYIAHN------STVNSK------------EARNLAVENSLHKGVDFYFYVDSDSH 390
           + ++K    +      S  +++            + R  A+ ++ +  VD+ F +D+D+ 
Sbjct: 89  YMDIKIEIRDPRKGFWSDADTELSWSPNRYDHILKLRQQALNHARNMLVDYLFMIDADNI 148

Query: 391 LDNPDVLKYLVNRNESLIAPLLVR--PFKAWSNFWGALNAD-GFYARSFDYMNIINGDQG 447
           L  P +L+ LV R++ ++ P+L    PF   SN+W   NA+ G+Y R  DY +I   +Q 
Sbjct: 149 LVQPSLLRKLVLRDKPIVGPMLETGVPF---SNYWTNQNAETGYYERGDDYYDIRYYEQD 205

Query: 448 GKGIWNVPYITNCYLM 463
              +  VP + +CYL+
Sbjct: 206 FLNVHKVPMLHSCYLI 221


>gi|281346524|gb|EFB22108.1| hypothetical protein PANDA_019266 [Ailuropoda melanoleuca]
          Length = 635

 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 61/276 (22%), Positives = 114/276 (41%), Gaps = 42/276 (15%)

Query: 287 PDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAK-KISMFVYNNQEYHAPLFDDYIHNF 345
           P Q P+VL+++        L  FL  +  L +    K      +N +    +  +++ N 
Sbjct: 48  PMQRPTVLVAILARNAAHALPHFLGCLERLXHAKSLKSKAATDHNVDNTTEILREWLKNV 107

Query: 346 KTMFKNVKY------------IAHNSTVNSKEA-----RNLAVENSLHKGVDFYFYVDSD 388
           ++ +  V++            I       S+ A     R  A+  +  K  D+  ++D D
Sbjct: 108 QSFYHYVEWRPMDEPESYPDEIGPKHWPGSRFAHVMKLRQAALRTAREKWSDYILFIDVD 167

Query: 389 SHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNA-----------DGFYARSFD 437
           + L NP  L  ++  N++++AP+L      +SNFW  +              GFY R+ D
Sbjct: 168 NFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGITPQAKQSPPISFFQGFYKRTPD 226

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTN 491
           Y+ I    +   G + VP + + +L+   + K  + K ++     DY       + F  +
Sbjct: 227 YLQIREWKR--LGCFPVPMVHSTFLI--DLRKEASDKLMFYPPHQDYTWTFDDIIVFAFS 282

Query: 492 LRNKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYE 527
            R  GI + + + + YG+L       PQ+T  E  E
Sbjct: 283 SRQAGIQMYLCNREHYGYLPIP--LKPQQTLQEEIE 316


>gi|410979348|ref|XP_003996047.1| PREDICTED: glycosyltransferase 25 family member 3 [Felis catus]
          Length = 560

 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 42/203 (20%), Positives = 88/203 (43%), Gaps = 22/203 (10%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 86  LPAVVLTILARNAEHSLPHYLGALERLDYPRARLALWCATDHNTDNTTQMLQEWLAAVGD 145

Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
           DY             + + +   H +    +   E +  A+  +   G D+  + D+D+ 
Sbjct: 146 DYAAVVWRPEGAPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 205

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ DY    N  +  +G
Sbjct: 206 LTNNQTLRLLIEQRLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 262

Query: 451 IWNVPYITNCYLMKTSVIKATNI 473
            + VP + + +L+      A  +
Sbjct: 263 CFRVPMVHSTFLVSLRAEGAAQL 285


>gi|313215923|emb|CBY37331.1| unnamed protein product [Oikopleura dioica]
          Length = 579

 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 47/196 (23%), Positives = 93/196 (47%), Gaps = 26/196 (13%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           PS+L+ VF+      L  F   +   NYP  +I ++    +N +    +   +   ++  
Sbjct: 29  PSILLPVFVRNKEHALPYFFGGLERQNYPKSRIRLWFVTDHNADNSLEVIKAWKEAWEME 88

Query: 349 FKNVKYIAHN------STVNSK------------EARNLAVENSLHKGVDFYFYVDSDSH 390
           + ++K    +      S  +++            + R  A+ ++ +  VD+ F +D+D+ 
Sbjct: 89  YMDIKIEIRDPRKGFWSDADTELSWSPNRYDHILKLRQQALNHARNMLVDYLFMIDADNI 148

Query: 391 LDNPDVLKYLVNRNESLIAPLLVR--PFKAWSNFWGALNAD-GFYARSFDYMNIINGDQG 447
           L  P +++ LV R++ ++ P+L    PF   SN+W   NA+ G+Y R  DY +I   +Q 
Sbjct: 149 LVQPSLIRKLVLRDKPIVGPMLETGVPF---SNYWTNQNAETGYYERGDDYYDIRYYEQD 205

Query: 448 GKGIWNVPYITNCYLM 463
              +  VP + +CYL+
Sbjct: 206 FLNVHKVPMLHSCYLI 221


>gi|281349465|gb|EFB25049.1| hypothetical protein PANDA_003209 [Ailuropoda melanoleuca]
          Length = 569

 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 49/245 (20%), Positives = 104/245 (42%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          +E+ A + D
Sbjct: 4   LPAVVLTILARNAEHSLPHYLGALERLDYPRARLALWCATDHNTDNTTQMLREWLAAVGD 63

Query: 340 DYIHNF------KTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
           DY   F         + + +   H +    +   E +  A+  +   G D+  + D+D+ 
Sbjct: 64  DYAAVFWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 123

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ DY    N  +  +G
Sbjct: 124 LTNNQTLRLLIEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 180

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+      A  +        YT    D  + F    +  G+ + + +  
Sbjct: 181 CFRVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVTVHVCNEH 239

Query: 506 EYGHL 510
            YG++
Sbjct: 240 RYGYM 244


>gi|307166664|gb|EFN60661.1| Glycosyltransferase 25 family member [Camponotus floridanus]
          Length = 357

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 47/194 (24%), Positives = 92/194 (47%), Gaps = 26/194 (13%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN------------------QE 332
           P+VLI++ +      L  FL+ +   +YP K+I +++ ++                  ++
Sbjct: 28  PTVLIAILVRNKAHTLPYFLSLLERQDYPKKRICLWIRSDHNVDRSIEILNKWIGLEGKK 87

Query: 333 YHAPLFDDYIHNFKTMFKNVKYIAHNST---VNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           YH    +  ++   T F++ +  A  S     +  + R  A+  +     DF F +D+D 
Sbjct: 88  YHC--LNIQLNATSTRFEDERTFADWSPRRFAHVIDLREQALNYAREIWADFIFMLDADV 145

Query: 390 HLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGK 449
            L N   ++ LV + ++++APLL R    +SNFW  + A+ +Y R+  Y  I+  ++   
Sbjct: 146 FLTNSSTMRDLVLKGQTVVAPLL-RSDGMYSNFWAGITAEYYYVRTDLYEPILFREK--T 202

Query: 450 GIWNVPYITNCYLM 463
           G  NVP + +  L+
Sbjct: 203 GCHNVPMVHSAVLI 216


>gi|73968112|ref|XP_851283.1| PREDICTED: glycosyltransferase 25 family member 3 [Canis lupus
           familiaris]
          Length = 595

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 48/244 (19%), Positives = 105/244 (43%), Gaps = 26/244 (10%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 30  LPAVVLTILARNAEHSLPHYLGALERLDYPRARLALWCATDHNTDNTTEMLQEWLAAVGD 89

Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
           DY             + + +   H +    +   E +  A+  +   G D+  + D+D+ 
Sbjct: 90  DYATVVWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+++   ++AP+L      +SNFW  +   G+Y R+ DY    N  +  +G
Sbjct: 150 LTNNQTLRLLIDQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 206

Query: 451 IWNVPYITNCYLMKTSVIKATNIK--TIYTLNSMDYD--MAFCTNLRNKGIHLKIDSTQE 506
            + VP + + +L+      A  +     +   S  +D  + F    +  G+ + + +   
Sbjct: 207 CFQVPMVHSTFLVSLRTEGAAQLAFYPPHPNYSWPFDDIIVFAYACQAVGVTIHVCNEHR 266

Query: 507 YGHL 510
           YG++
Sbjct: 267 YGYM 270


>gi|338979866|ref|ZP_08631205.1| Glycosyl transferase family protein [Acidiphilium sp. PM]
 gi|338209221|gb|EGO97001.1| Glycosyl transferase family protein [Acidiphilium sp. PM]
          Length = 658

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/270 (22%), Positives = 114/270 (42%), Gaps = 40/270 (14%)

Query: 284 SLKPDQF--------PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEY 333
           +LKP++         P VLI++   +   FL  +L+ I  L+YP   I +++   NN + 
Sbjct: 376 ALKPERLLRSGTTTAPRVLIAILAKQKEEFLPLYLDCIEALDYPKSSIVLYIRTNNNTDR 435

Query: 334 HAPLFDDYIHNFKTMFKNVKYIAHNSTVNSK------------------EARNLAVENSL 375
              +  ++I      +  V++    S V+ +                    RN ++  + 
Sbjct: 436 TEEILREWIARVGHSYAAVEF--DPSDVDERVEQFGAHEWNAIRFRVLGRIRNESLRKTR 493

Query: 376 HKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYA 433
             G D+YF  D D+ +     L+ LV     ++APLL    P   +SN    ++ +G++ 
Sbjct: 494 EHGCDWYFVADIDNFIRRC-TLRELVATGLPIVAPLLRDAEPSSYYSNLHAEIDDNGYFR 552

Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD-MAFCTNL 492
               Y  I++  +  +G+  VP +   Y ++  VI+  N    Y   S  ++ +    + 
Sbjct: 553 DCAQYELIMS--RRIQGLIEVPLVHCTYAVRADVIEHLN----YDDGSGRHEYVVLSDSA 606

Query: 493 RNKGIHLKIDSTQEYGHLVDSENFDPQKTN 522
           R   I    D+ Q YG++  S+N D    N
Sbjct: 607 RKASIPQYFDNRQVYGYITFSKNPDQDDEN 636


>gi|255072887|ref|XP_002500118.1| predicted protein [Micromonas sp. RCC299]
 gi|226515380|gb|ACO61376.1| predicted protein [Micromonas sp. RCC299]
          Length = 505

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/129 (28%), Positives = 59/129 (45%), Gaps = 12/129 (9%)

Query: 33  KFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELD 92
           K LV+   + + D  K F+ S   + L     G+   W   +   +G   K +LL+   D
Sbjct: 201 KDLVVATHTTDKDASKLFMASIHKHGLAASVSGVGTWWHSHEDKEIG--LKASLLRLPAD 258

Query: 93  EMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYPAVG 152
           E     D ++++ DS D +      ++L RF   DA+IV G E  CWP  + +      G
Sbjct: 259 E-----DPLVILADSDDSMFTCDAEEMLSRFEELDADIVVGTETRCWPPEASH-----CG 308

Query: 153 SGYRYLNSG 161
            GY++L  G
Sbjct: 309 DGYKHLEEG 317


>gi|114626942|ref|XP_001157210.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 1 [Pan
           troglodytes]
 gi|410224368|gb|JAA09403.1| cerebral endothelial cell adhesion molecule [Pan troglodytes]
 gi|410257424|gb|JAA16679.1| cerebral endothelial cell adhesion molecule [Pan troglodytes]
 gi|410333099|gb|JAA35496.1| cerebral endothelial cell adhesion molecule [Pan troglodytes]
          Length = 595

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 49/245 (20%), Positives = 103/245 (42%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 30  LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89

Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           DY            F    ++ K+          E +  A+  + + G D+  + D+D+ 
Sbjct: 90  DYAAVVWRPEGEPRFYPDEESPKHWTKERHQFLMELKQEALTFARNWGADYILFADTDNI 149

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G
Sbjct: 150 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+      A  +        YT    D  + F    +  G+ + + +  
Sbjct: 207 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265

Query: 506 EYGHL 510
            YG++
Sbjct: 266 RYGYM 270


>gi|355678476|gb|AER96128.1| cerebral endothelial cell adhesion molecule [Mustela putorius furo]
          Length = 597

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 49/245 (20%), Positives = 103/245 (42%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 30  LPAVVLTILARNAEHSLPHYLGALERLDYPRARLALWCATDHNTDNSTQMLQEWLAAVGD 89

Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
           DY             + + +   H +    +   E +  A+  +   G D+  + D+D+ 
Sbjct: 90  DYAAVVWRPEGDPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ DY    N  +  +G
Sbjct: 150 LTNNQTLRLLIEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 206

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+      A  +        YT    D  + F    +  G+ + + +  
Sbjct: 207 CFRVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVTVHVCNEH 265

Query: 506 EYGHL 510
            YG++
Sbjct: 266 RYGYM 270


>gi|307102945|gb|EFN51210.1| expressed protein [Chlorella variabilis]
          Length = 666

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/194 (24%), Positives = 85/194 (43%), Gaps = 22/194 (11%)

Query: 72  GGDMSSLGGGYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDAN-- 129
            G+ S +  G ++  L++    +   D  I+L+ D+ D +I      +L+ +N       
Sbjct: 447 AGEFSQVAWGMRLKALRDFAARLTRRD--IVLMADARDALIGASPEALLDTYNDTVGGQR 504

Query: 130 -IVFGAERLCWP----DTSLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEED 184
            ++FGAE  CW        + + YP  G+ YR+LN+G  +G A  I+ L+ + SI    D
Sbjct: 505 LVLFGAEPHCWQHDLCPPEVVEGYPETGTPYRFLNAGTVMGPADVIRRLL-DASIDWAAD 563

Query: 185 ----DQLYYALLFLDETLRT---KHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVHLTN 237
                  ++   FL   L+    +  + +D+   +F   +    D+             N
Sbjct: 564 TAHRQPGFHDQGFLHGLLKAGPQRRLMAVDSRCRVFCAFFSRQHDLACTRR-----GWLN 618

Query: 238 TKYNTNPVIIHGNG 251
           T   T P+I+HG+G
Sbjct: 619 TYTGTYPLILHGSG 632


>gi|403299720|ref|XP_003940624.1| PREDICTED: glycosyltransferase 25 family member 3 [Saimiri
           boliviensis boliviensis]
          Length = 595

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/245 (20%), Positives = 104/245 (42%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 30  LPAVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTEMLQEWLAAVGD 89

Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
           DY             + + +   H +    +   E +  A+  +   G D+  + D+D+ 
Sbjct: 90  DYATVVWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFAREWGADYILFADTDNI 149

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ LV +   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G
Sbjct: 150 LTNNQTLRLLVGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            ++VP + + +L+      A  +        YT    D  + F    +  G+ + + +  
Sbjct: 207 CFHVPMVHSTFLVSLRAEGADQLAFYPPHRNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265

Query: 506 EYGHL 510
            YG++
Sbjct: 266 RYGYM 270


>gi|363740426|ref|XP_003642326.1| PREDICTED: glycosyltransferase 25 family member 3 [Gallus gallus]
          Length = 596

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 56/243 (23%), Positives = 111/243 (45%), Gaps = 33/243 (13%)

Query: 296 SVFIDKPTAF---LEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTMFK 350
           SV I +P      L   L  + +L++PA  I+++    +N +    +  +++    + + 
Sbjct: 40  SVPIYRPIPIPHSLPHCLGALESLDFPAGNIALWCATDHNSDNTTAMLQEWLQAVGSNYH 99

Query: 351 NVKYIAHNST------VNSKEARNLAVEN---------SLHKGV--DFYFYVDSDSHLDN 393
           +V + A          +  K   +   EN         S  +G+  D+  +VD+DS L N
Sbjct: 100 SVAWKAEEGPSSYPDELGPKHWSDKRYENLMRLKQEALSYARGLRADYILFVDTDSILTN 159

Query: 394 PDVLKYLVNRNESLIAPLL-VRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIW 452
              L +L+ +N+S++AP+L  + F  +SNFW  +   GFY R+ DY    N  +  +G +
Sbjct: 160 NQTLTFLMAQNKSVVAPMLDSQTF--YSNFWCGITPQGFYRRTADYFPTKNRQR--RGCF 215

Query: 453 NVPYITNCYLM-----KTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            VP +   +L+     +T+ +        YT  + D  + F  + +  G  + + + Q +
Sbjct: 216 AVPMVYATFLIDLRKEETAQLAFYPPHPNYTW-AFDDIIVFAYSCQEAGAEVHVCNQQRF 274

Query: 508 GHL 510
           G++
Sbjct: 275 GYI 277


>gi|440894671|gb|ELR47071.1| Glycosyltransferase 25 family member 3, partial [Bos grunniens
           mutus]
          Length = 579

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/245 (20%), Positives = 103/245 (42%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          +E+ A + D
Sbjct: 14  LPTVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTAMLREWLAAVGD 73

Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
           DY             + + +   H +    +   E +  A+  +   G D+  + D+D+ 
Sbjct: 74  DYAAVVWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 133

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+     ++AP+L      +SNFW  +   G+Y R+ DY    N  +  +G
Sbjct: 134 LTNNQTLRLLIEPGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 190

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+     + T     Y  +       D  + F    +  G+ + + + Q
Sbjct: 191 CFRVPMVHSTFLVSLRA-EGTAQLAFYPPHPNYTWPFDDIIVFAYACQAAGVAVHVCNEQ 249

Query: 506 EYGHL 510
            YG+L
Sbjct: 250 RYGYL 254


>gi|334311907|ref|XP_001367449.2| PREDICTED: glycosyltransferase 25 family member 3 [Monodelphis
           domestica]
          Length = 705

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/193 (22%), Positives = 86/193 (44%), Gaps = 22/193 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V+I+V        L  +L  +  L+YP  +++++   +          QE+ A +  
Sbjct: 141 LPTVVIAVLARNAGYSLPHYLGALERLDYPRARLALWCATDHNVDNTTEILQEWLAAMGK 200

Query: 340 DYIHN-FKTMFKNVKYIAHNSTVNSKEARNL--------AVENSLHKGVDFYFYVDSDSH 390
           +Y    ++   +   Y    S     + R+         A++ +   G D+  + D+D+ 
Sbjct: 201 EYAEVVWRPEGEPRLYPDEESPKQWTKERHQFLMELKQEALDFARAWGADYILFADTDNI 260

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   LK+L+     ++AP+L      +SNFW  +   G+Y R+ DY    N  +  +G
Sbjct: 261 LTNNQTLKFLIGEGLPVVAPMLDS-QTYYSNFWCGITPQGYYRRTSDYFPTKNRQR--QG 317

Query: 451 IWNVPYITNCYLM 463
            + VP + + +L+
Sbjct: 318 CFRVPMVHSTFLL 330


>gi|46411176|ref|NP_997181.1| probable inactive glycosyltransferase 25 family member 3 precursor
           [Mus musculus]
 gi|160395523|sp|A3KGW5.1|GT253_MOUSE RecName: Full=Probable inactive glycosyltransferase 25 family
           member 3; AltName: Full=Cerebral endothelial cell
           adhesion molecule; Flags: Precursor
 gi|148676479|gb|EDL08426.1| cerebral endothelial cell adhesion molecule 1 [Mus musculus]
 gi|187953029|gb|AAI38848.1| Cerebral endothelial cell adhesion molecule [Mus musculus]
          Length = 592

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/266 (19%), Positives = 113/266 (42%), Gaps = 30/266 (11%)

Query: 284 SLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEY 333
           S+     P+V++++        L  +L  +  L+YP  +++++   +          +E+
Sbjct: 21  SVTEPTLPTVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNMDNTTGMLREW 80

Query: 334 HAPLFDDYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFY 384
            A +  DY             + + +   H +    +   E R  A+  +   G D+  +
Sbjct: 81  LAAVGRDYATVVWKPEEEARSYPDEQGPKHWTKERHQFLMELRQEALAFARDWGADYILF 140

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
            D+D+ L N   LK L++R   ++AP+L      +SNFW  +   G+Y R+ +Y    N 
Sbjct: 141 ADTDNILTNNQTLKLLIDRQLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNR 199

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIK--TIYTLNSMDYD--MAFCTNLRNKGIHLK 500
            +  +G + VP + + +L+     +   +     +   S  +D  + F    +  G+ + 
Sbjct: 200 QR--QGCFRVPMVHSTFLLSLQTEETARLAFYPPHPNYSWPFDDIIVFAYACQAAGVSMH 257

Query: 501 IDSTQEYGHL----VDSENFDPQKTN 522
           + +   YG++       ++ + +KTN
Sbjct: 258 VCNDHRYGYMNVVVKPHQSLEEEKTN 283


>gi|193788560|ref|NP_057258.3| probable inactive glycosyltransferase 25 family member 3 precursor
           [Homo sapiens]
 gi|74744901|sp|Q5T4B2.1|GT253_HUMAN RecName: Full=Probable inactive glycosyltransferase 25 family
           member 3; AltName: Full=Cerebral endothelial cell
           adhesion molecule; Flags: Precursor
 gi|119608193|gb|EAW87787.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_a [Homo
           sapiens]
          Length = 595

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 30  LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89

Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           DY            F    +  K+          E +  A+  + + G D+  + D+D+ 
Sbjct: 90  DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARNWGADYILFADTDNI 149

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G
Sbjct: 150 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L       A  +        YT    D  + F    +  G+ + + +  
Sbjct: 207 CFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265

Query: 506 EYGHL 510
            YG++
Sbjct: 266 RYGYM 270


>gi|297270127|ref|XP_001111820.2| PREDICTED: glycosyltransferase 25 family member 3-like [Macaca
           mulatta]
          Length = 714

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 149 LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNMDNTTEMLQEWLAAVGD 208

Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           DY            F    +  K+          E +  A+  +   G D+  + D+D+ 
Sbjct: 209 DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 268

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G
Sbjct: 269 LTNNQTLRLLMGQGLPVVAPMLDS-QTYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 325

Query: 451 IWNVPYITNCYLMKTSVIKATNIK-----TIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+      A  +        YT    D  + F    +  G+ + + +  
Sbjct: 326 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVAVHVCNEH 384

Query: 506 EYGHL 510
            YG++
Sbjct: 385 RYGYI 389


>gi|148259400|ref|YP_001233527.1| glycosyl transferase family protein [Acidiphilium cryptum JF-5]
 gi|146401081|gb|ABQ29608.1| glycosyl transferase, family 2 [Acidiphilium cryptum JF-5]
          Length = 667

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 61/270 (22%), Positives = 113/270 (41%), Gaps = 40/270 (14%)

Query: 284 SLKPDQF--------PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEY 333
           +LKP +         P VLI++   +   FL  +L+ I  L+YP   I +++   NN + 
Sbjct: 385 ALKPKRLLRSGTTTAPRVLIAILAKQKEEFLPLYLDCIEALDYPKSSIVLYIRTNNNTDR 444

Query: 334 HAPLFDDYIHNFKTMFKNVKYIAHNSTVNSK------------------EARNLAVENSL 375
              +  ++I      +  V++    S V+ +                    RN ++  + 
Sbjct: 445 TEEILREWIARVGHSYAAVEF--DPSDVDERVEQFGAHEWNAIRFRVLGRIRNESLRKTR 502

Query: 376 HKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL--VRPFKAWSNFWGALNADGFYA 433
             G D+YF  D D+ +     L+ LV     ++APLL    P   +SN    ++ +G++ 
Sbjct: 503 EHGCDWYFVADIDNFIRRC-TLRELVATGLPIVAPLLRDAEPSSYYSNLHAEIDDNGYFR 561

Query: 434 RSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD-MAFCTNL 492
               Y  I++  +  +G+  VP +   Y ++  VI+  N    Y   S  ++ +    + 
Sbjct: 562 DCAQYELIMS--RRIQGLIEVPLVHCTYAVRADVIEHLN----YDDGSGRHEYVVLSDSA 615

Query: 493 RNKGIHLKIDSTQEYGHLVDSENFDPQKTN 522
           R   I    D+ Q YG++  S+N D    N
Sbjct: 616 RKASIPQYFDNRQVYGYITFSKNPDQYDEN 645


>gi|83318248|gb|AAI08699.1| CERCAM protein [Homo sapiens]
          Length = 558

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 30  LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89

Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           DY            F    +  K+          E +  A+  + + G D+  + D+D+ 
Sbjct: 90  DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARNWGADYILFADTDNI 149

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G
Sbjct: 150 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L       A  +        YT    D  + F    +  G+ + + +  
Sbjct: 207 CFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265

Query: 506 EYGHL 510
            YG++
Sbjct: 266 RYGYM 270


>gi|395741044|ref|XP_002820323.2| PREDICTED: glycosyltransferase 25 family member 3-like [Pongo
           abelii]
          Length = 543

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 30  LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89

Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           DY            F    +  K+          E +  A+  +   G D+  + D+D+ 
Sbjct: 90  DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G
Sbjct: 150 LTNNQTLRLLMGQELPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+      A  +        YT    D  + F    +  G+ + + +  
Sbjct: 207 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265

Query: 506 EYGHL 510
            YG++
Sbjct: 266 RYGYM 270


>gi|402896403|ref|XP_003911291.1| PREDICTED: glycosyltransferase 25 family member 3, partial [Papio
           anubis]
          Length = 548

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 30  LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89

Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           DY            F    +  K+          E +  A+  +   G D+  + D+D+ 
Sbjct: 90  DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G
Sbjct: 150 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+      A  +        YT    D  + F    +  G+ + + +  
Sbjct: 207 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVAVHVCNEH 265

Query: 506 EYGHL 510
            YG++
Sbjct: 266 RYGYI 270


>gi|387542892|gb|AFJ72073.1| glycosyltransferase 25 family member 3 precursor [Macaca mulatta]
          Length = 595

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 30  LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89

Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           DY            F    +  K+          E +  A+  +   G D+  + D+D+ 
Sbjct: 90  DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G
Sbjct: 150 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+      A  +        YT    D  + F    +  G+ + + +  
Sbjct: 207 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVAVHVCNEH 265

Query: 506 EYGHL 510
            YG++
Sbjct: 266 RYGYI 270


>gi|426363201|ref|XP_004048734.1| PREDICTED: glycosyltransferase 25 family member 3 [Gorilla gorilla
           gorilla]
          Length = 595

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 48/245 (19%), Positives = 106/245 (43%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 30  LPAVVLAIQARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 89

Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
           DY            ++ + +   H +    +   E +  A+  + + G D+  + D+D+ 
Sbjct: 90  DYAAVVWRPEGEPRVYPDEEGPKHWTKERHQFLMELKQEALTFARNWGADYILFADTDNI 149

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N  +L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G
Sbjct: 150 LTNNQILRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+      A  +        YT    D  + F    +  G+ + + +  
Sbjct: 207 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265

Query: 506 EYGHL 510
            YG++
Sbjct: 266 RYGYM 270


>gi|380796385|gb|AFE70068.1| glycosyltransferase 25 family member 3 precursor, partial [Macaca
           mulatta]
          Length = 576

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/245 (20%), Positives = 101/245 (41%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 11  LPAVVLAILARNAEHSLPHYLGALERLDYPRARMALWCATDHNVDNTTEMLQEWLAAVGD 70

Query: 340 DYIH---------NFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           DY            F    +  K+          E +  A+  +   G D+  + D+D+ 
Sbjct: 71  DYAAVVWRPEGEPRFYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 130

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G
Sbjct: 131 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 187

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+      A  +        YT    D  + F    +  G+ + + +  
Sbjct: 188 CFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVAVHVCNEH 246

Query: 506 EYGHL 510
            YG++
Sbjct: 247 RYGYI 251


>gi|328697541|ref|XP_001943906.2| PREDICTED: glycosyltransferase 25 family member-like [Acyrthosiphon
           pisum]
          Length = 374

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 43/201 (21%), Positives = 97/201 (48%), Gaps = 25/201 (12%)

Query: 282 LDSLKPDQFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFD 339
           +D  KP+   +V +++ I      L  F + + +L+YP  ++ +++   +N +    + +
Sbjct: 24  VDDRKPN---TVFVAILIRNKAHTLPYFFSALESLDYPKDRMHLWIRCDHNIDNSTQILN 80

Query: 340 DYIHNFKTMFKNVKY-IAHNSTVNSKEA----------------RNLAVENSLHKGVDFY 382
            ++     ++ +V   I ++ST    E+                R  A++ +     D+ 
Sbjct: 81  KWLKTSGAVYHSVNVKIDNDSTKYDDESGPAHWPHSRFQHIVQLRESALQTARDSWADYI 140

Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNII 442
           +++D D+ + N   L++L+ +N  ++AP+L +    +SNFW  +  + +Y R+ DY  I+
Sbjct: 141 WFLDCDAFIINKSTLRHLIKKNYPVVAPML-KSDGLYSNFWCGMTDNYYYKRTSDYAPIV 199

Query: 443 NGDQGGKGIWNVPYITNCYLM 463
             +   KG + VP I +  L+
Sbjct: 200 --EWKTKGCYQVPMIHSSVLI 218


>gi|296190930|ref|XP_002743398.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 1
           [Callithrix jacchus]
          Length = 595

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 49/245 (20%), Positives = 103/245 (42%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A + D
Sbjct: 30  LPAVVLAILARNAEHSLPHYLGALERLDYPRARLALWYATDHNVDNTTEMLQEWLAAVGD 89

Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
           DY             + + +   H +    +   E +  A+  +   G D+  + D+D+ 
Sbjct: 90  DYATVVWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G
Sbjct: 150 LTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RG 206

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+      A  I        YT    D  + F    +  G+ + + +  
Sbjct: 207 CFRVPMVHSTFLVSLRAEGADQIAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNEH 265

Query: 506 EYGHL 510
            YG++
Sbjct: 266 RYGYM 270


>gi|47188856|emb|CAG14621.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 52

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 32/48 (66%)

Query: 591 NDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYH 638
            DKR+  GYE VPT DIHMKQ+G    W  F+R+++ P+  + F GY+
Sbjct: 2   QDKRIAGGYETVPTDDIHMKQIGFNKEWLHFIREFISPVTLKVFSGYY 49


>gi|156120717|ref|NP_001095505.1| probable inactive glycosyltransferase 25 family member 3 precursor
           [Bos taurus]
 gi|160395522|sp|A7MB73.1|GT253_BOVIN RecName: Full=Probable inactive glycosyltransferase 25 family
           member 3; AltName: Full=Cerebral endothelial cell
           adhesion molecule; Flags: Precursor
 gi|154425666|gb|AAI51374.1| CERCAM protein [Bos taurus]
          Length = 595

 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 48/245 (19%), Positives = 103/245 (42%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          +E+ A + D
Sbjct: 30  LPTVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTAMLREWLAAVGD 89

Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
           +Y             + + +   H +    +   E +  A+  +   G D+  + D+D+ 
Sbjct: 90  NYAAVVWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+     ++AP+L      +SNFW  +   G+Y R+ DY    N  +  +G
Sbjct: 150 LTNNQTLRLLIEPGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 206

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+     + T     Y  +       D  + F    +  G+ + + + Q
Sbjct: 207 CFRVPMVHSTFLVSLRA-EGTGQLAFYPPHPNYTWPFDDIIVFAYACQAAGVAVHVCNEQ 265

Query: 506 EYGHL 510
            YG+L
Sbjct: 266 RYGYL 270


>gi|395510081|ref|XP_003759312.1| PREDICTED: glycosyltransferase 25 family member 3, partial
           [Sarcophilus harrisii]
          Length = 564

 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 22/192 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFDD 340
           P+V+I+V        L  +L  +  L+YP  +++++   +          QE+   +  D
Sbjct: 1   PAVVIAVLARNAGYSLPYYLGALERLDYPRARLALWCATDHNVDNTTEILQEWLTAVGKD 60

Query: 341 YIHN-FKTMFKNVKYIAHNSTVNSKEARNL--------AVENSLHKGVDFYFYVDSDSHL 391
           Y    ++   +   Y    S     + R+         A++ +   G D+  + D+D+ L
Sbjct: 61  YAEVVWRPEGEPRLYPDEESPKQWTKERHQFLMELKQEALDFARAWGADYILFADTDNIL 120

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            N   LK+L+     ++AP+L      +SNFW  +   G+Y R+ DY    N  +  +G 
Sbjct: 121 TNNQTLKFLIGEGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTTDYFPTKNRQR--QGC 177

Query: 452 WNVPYITNCYLM 463
           + VP + + +L+
Sbjct: 178 FQVPMVHSAFLL 189


>gi|326930289|ref|XP_003211280.1| PREDICTED: glycosyltransferase 25 family member 3-like [Meleagris
           gallopavo]
          Length = 541

 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 71/138 (51%), Gaps = 11/138 (7%)

Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLL-VRPFKAWSNFWGALNADGFYARSFD 437
            D+  +VD+DS L N   L +L+ +N+S++AP+L  + F  +SNFW  +   GFY R+ D
Sbjct: 90  ADYILFVDTDSILTNNQTLTFLMAQNKSVVAPMLDSQTF--YSNFWCGITPQGFYRRTAD 147

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLM-----KTSVIKATNIKTIYTLNSMDYDMAFCTNL 492
           Y    N  +  +G + VP +   +L+     +T+ +        YT  + D  + F  + 
Sbjct: 148 YFPTKNRQR--RGCFAVPMVYATFLIDLQKEETAQLAFYPPHPNYTW-AFDDIIVFAYSC 204

Query: 493 RNKGIHLKIDSTQEYGHL 510
           +  G  + + + Q +G++
Sbjct: 205 QEAGAEVHVCNQQRFGYI 222


>gi|296482048|tpg|DAA24163.1| TPA: glycosyltransferase 25 family member 3 precursor [Bos taurus]
          Length = 531

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 48/245 (19%), Positives = 103/245 (42%), Gaps = 28/245 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          +E+ A + D
Sbjct: 30  LPTVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTAMLREWLAAVGD 89

Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
           +Y             + + +   H +    +   E +  A+  +   G D+  + D+D+ 
Sbjct: 90  NYAAVVWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNI 149

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L+     ++AP+L      +SNFW  +   G+Y R+ DY    N  +  +G
Sbjct: 150 LTNNQTLRLLIEPGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 206

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+     + T     Y  +       D  + F    +  G+ + + + Q
Sbjct: 207 CFRVPMVHSTFLVSLRA-EGTGQLAFYPPHPNYTWPFDDIIVFAYACQAAGVAVHVCNEQ 265

Query: 506 EYGHL 510
            YG+L
Sbjct: 266 RYGYL 270


>gi|149039147|gb|EDL93367.1| rCG45647, isoform CRA_a [Rattus norvegicus]
          Length = 596

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 50/261 (19%), Positives = 110/261 (42%), Gaps = 32/261 (12%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A +  
Sbjct: 31  LPTVVLAILARNAEHSLPHYLGALERLDYPRARLALWGATDHNVDNTTGMLQEWLAAVGR 90

Query: 340 DYI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSH 390
           DY        +    + + +   H +    +   E +  A+  +   G D+  + D+D+ 
Sbjct: 91  DYATVVWKSEDEARSYPDEQGPKHWTRERHQFLMELKQEALAFARDWGADYILFADTDNI 150

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L++R   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G
Sbjct: 151 LTNNQTLRLLIDRQLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--QG 207

Query: 451 IWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQ 505
            + VP + + +L+     +   +        YT    D  + F    +  G+ + + +  
Sbjct: 208 CFRVPMVHSTFLVSLQTEETARLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNDH 266

Query: 506 EYGHL----VDSENFDPQKTN 522
            YG++       +  + +KTN
Sbjct: 267 RYGYMNVGVKPHQGLEEEKTN 287


>gi|162951747|gb|ABY21735.1| LD07116p [Drosophila melanogaster]
          Length = 639

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 107/235 (45%), Gaps = 38/235 (16%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P+VLI++ +      L  FL+ +   +YP ++I++++   ++ +    L   ++ N   +
Sbjct: 57  PTVLIALLVRNKAHILPMFLSYLEQQDYPKERIAIWLRCDHSNDDSIELLRQWLDNSGDL 116

Query: 349 FKNVKY---IAHNSTVN---------SKEARNLAV-ENSLHKG----VDFYFYVDSDSHL 391
           + +V Y       S VN         S+    +A+ E +   G     D+ F++D+D  L
Sbjct: 117 YHSVSYEFKPEEQSFVNGTSPYEWPASRFKHLIALKEEAFQYGRDIWADYVFFLDADVLL 176

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            + D LK L      ++AP+L+     +SNFW  +  D +Y R+ +Y  I +  +  +G 
Sbjct: 177 TSKDSLKVLTRLQLPIVAPMLISE-SLYSNFWCGMTEDYYYRRTDEYKEIYHVKK--QGS 233

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
           + VP      ++ T+V+   N + +  L          T  RNK + L+    QE
Sbjct: 234 FPVP------MVHTAVLVNMNHRAVRNL----------TFDRNKLVELQKSRQQE 272


>gi|24581946|ref|NP_723087.1| CG31915 [Drosophila melanogaster]
 gi|74864910|sp|Q8IPK4.1|GLT25_DROME RecName: Full=Glycosyltransferase 25 family member; Flags:
           Precursor
 gi|22945672|gb|AAN10543.1| CG31915 [Drosophila melanogaster]
          Length = 612

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 107/235 (45%), Gaps = 38/235 (16%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P+VLI++ +      L  FL+ +   +YP ++I++++   ++ +    L   ++ N   +
Sbjct: 30  PTVLIALLVRNKAHILPMFLSYLEQQDYPKERIAIWLRCDHSNDDSIELLRQWLDNSGDL 89

Query: 349 FKNVKY---IAHNSTVN---------SKEARNLAV-ENSLHKG----VDFYFYVDSDSHL 391
           + +V Y       S VN         S+    +A+ E +   G     D+ F++D+D  L
Sbjct: 90  YHSVSYEFKPEEQSFVNGTSPYEWPASRFKHLIALKEEAFQYGRDIWADYVFFLDADVLL 149

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            + D LK L      ++AP+L+     +SNFW  +  D +Y R+ +Y  I +  +  +G 
Sbjct: 150 TSKDSLKVLTRLQLPIVAPMLISE-SLYSNFWCGMTEDYYYRRTDEYKEIYHVKK--QGS 206

Query: 452 WNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
           + VP      ++ T+V+   N + +  L          T  RNK + L+    QE
Sbjct: 207 FPVP------MVHTAVLVNMNHRAVRNL----------TFDRNKLVELQKSRQQE 245


>gi|219127596|ref|XP_002184018.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217404741|gb|EEC44687.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 487

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 58/120 (48%), Gaps = 15/120 (12%)

Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGV 680
            L + + P   R F G     +RA   FVVRY   ++  L  H D    +INI LN    
Sbjct: 215 LLDRRLAPQLARIF-GIPVTSIRANDMFVVRYDAGKRAHLTNHTDDGDISINILLND--- 270

Query: 681 DYEGGGCRFIRYN-------CNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           ++ GGG RF  +N        +V  TR+G +L H   +   HEG  ++QG R I++ F+ 
Sbjct: 271 EFRGGGTRF--WNRILKTPFAHVQPTRVGQLLTHSALIN--HEGYHISQGLRMILVGFLS 326


>gi|449277697|gb|EMC85780.1| Glycosyltransferase 25 family member 2, partial [Columba livia]
          Length = 176

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/175 (25%), Positives = 79/175 (45%), Gaps = 30/175 (17%)

Query: 299 IDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFD--DYIHNFKTMFKNVKYIA 356
           +D  TA L E+L  + NL           Y++ E+  P+ +   Y   F       K+  
Sbjct: 6   VDNTTAILREWLKNVQNL-----------YHDVEWR-PMEEPQSYPEEF-----GPKHWP 48

Query: 357 HNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPF 416
            +   +  + R  A+  +  K  D+  ++D+D+ L NP+ L  L+  N++L+AP+L    
Sbjct: 49  SSRFTHVMKLRQAALRAAREKWSDYILFIDTDNLLTNPETLNLLIAENKTLVAPML-ESR 107

Query: 417 KAWSNFWGALNA--------DGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
             +SNFW  +           G+Y R+ DY  I   +    G + VP I + +L+
Sbjct: 108 SLYSNFWCGITPQATLSFCLQGYYKRTLDYPLI--REWKRTGCFAVPMIHSTFLI 160


>gi|354499487|ref|XP_003511840.1| PREDICTED: glycosyltransferase 25 family member 3 [Cricetulus
           griseus]
 gi|344244074|gb|EGW00178.1| Glycosyltransferase 25 family member 3 [Cricetulus griseus]
          Length = 592

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 42/193 (21%), Positives = 85/193 (44%), Gaps = 22/193 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFD 339
            P+V++++        L  +L  +  L+YP  +++++   +          QE+ A +  
Sbjct: 27  LPTVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTEMLQEWLAAVGR 86

Query: 340 DYIHN-FKTMFKNVKYIAHNSTVNSKEARNL--------AVENSLHKGVDFYFYVDSDSH 390
           DY    +K   +   Y    S  +  + R+         A+  +   G D+  + D+D+ 
Sbjct: 87  DYAAVVWKPEEEARPYPDEQSPKHWTKERHQFLMELKQEALTFARAWGADYILFSDTDNI 146

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L  R   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G
Sbjct: 147 LTNNQTLRLLTERQLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--QG 203

Query: 451 IWNVPYITNCYLM 463
            + VP + + +L+
Sbjct: 204 CFRVPMVHSTFLV 216


>gi|294931519|ref|XP_002779915.1| hypothetical protein Pmar_PMAR002313 [Perkinsus marinus ATCC 50983]
 gi|239889633|gb|EER11710.1| hypothetical protein Pmar_PMAR002313 [Perkinsus marinus ATCC 50983]
          Length = 339

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 48/193 (24%), Positives = 79/193 (40%), Gaps = 33/193 (17%)

Query: 86  LLKNELDEMDITDDMIILVTDSYDVII------DGGVNDILERFNTFDANIVFGAERLCW 139
           +L N L  M    D +++  D+ DV        +  V          +  I+  AER CW
Sbjct: 140 VLLNRLKSMPT--DALMVFNDALDVWFTPHASEEAFVKAFERELQIPEDTILVSAERNCW 197

Query: 140 PDTSLYD---KYPAVGSG--YRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFL 194
           P          YPA G G  Y+Y N+GG++G  K          I + +D+Q      + 
Sbjct: 198 PPPERMPYCRDYPASGHGTTYKYANTGGWMGRVK----TTWTACIMDGKDEQGCVQWFYR 253

Query: 195 DETLRTKH-------KIVLDTLANLFQNLYGS---------LEDIKLNFDLDEFVHLTNT 238
           D     ++       +I LD    ++Q L+G+         LE  +  F  ++   L N 
Sbjct: 254 DAKESRQYRENVGAFRIALDDTQMIWQTLWGTKFANAERAFLEVDRAGFGEEDAGKLVNP 313

Query: 239 KYNTNPVIIHGNG 251
           + +T P+++H NG
Sbjct: 314 ETSTTPLVVHFNG 326


>gi|357607512|gb|EHJ65551.1| hypothetical protein KGM_15156 [Danaus plexippus]
          Length = 516

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/223 (21%), Positives = 100/223 (44%), Gaps = 27/223 (12%)

Query: 315 NLNYPAKKISMFVYN--NQEYHAPLFDDYIHNFKTMFKNV-----------------KYI 355
           NL+YP  +I ++  +  N ++   +  D+++ F T++  V                  + 
Sbjct: 2   NLDYPKDRIFLWFRSDYNSDHSVDVLRDFVNKFGTLYNRVHLSYNTSKQKFDDELSPTHW 61

Query: 356 AHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRP 415
           +H+  ++  + R + ++ +  +  D+ F +D+D  L NP  L++L+ +   ++AP+LV  
Sbjct: 62  SHSRFMHLIKWREMGIKFAKRQWADYVFMLDADVFLTNPQTLRHLIQKQLRVVAPMLVSD 121

Query: 416 FKAWSNFWGALNADGFYARSF--DYMNIINGDQGGKGIWNVPYITNCYLM-----KTSVI 468
            + +SNFW +++ D  Y  +   ++  +   ++   G   VP I    LM     K+  I
Sbjct: 122 -RYYSNFWLSVDDDFNYRLNHEDEFYPLYEYNELYMGCHIVPVIYGAVLMDLRSKKSDYI 180

Query: 469 KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLV 511
                K +  L  +   + F  N     I L I +   +G++ 
Sbjct: 181 TYDPYKIVDYLGPLQDHIIFAVNAMRNNISLHICNDDFFGYIT 223


>gi|308800012|ref|XP_003074787.1| SmkH (IC) [Ostreococcus tauri]
 gi|116061327|emb|CAL52045.1| SmkH (IC) [Ostreococcus tauri]
          Length = 637

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 76/172 (44%), Gaps = 28/172 (16%)

Query: 572 CHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV-GLAGVWAEFLRKYVVPLQ 630
           C  +V+  E+  +   G +  +     ++AVPT D+ + ++ G+   W       + P  
Sbjct: 480 CPSWVEAAESVARSRGGWDTAR-----HKAVPTTDLPIHEIPGVMEQWNRLFSVVISPFI 534

Query: 631 EREFIGYHHEPVRAPMSF---------VVRYRPDE-QPSLRPHHDSSTYTINIALNQVGV 680
              F        R P SF         VV+Y  +E Q  L  H D   +++ +AL+    
Sbjct: 535 RDRF--------RLPTSFGTLYVHDAFVVKYNANEGQRELPVHTDQGQFSLTLALHDTQ- 585

Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
           DY GGG  F  + C +   R G  +     LTH   G+ +T G RYI+++F+
Sbjct: 586 DYSGGGTIFPEHEC-IVRPRCGDFVAFRSSLTH--GGVPITAGVRYIVVAFL 634


>gi|323456551|gb|EGB12418.1| hypothetical protein AURANDRAFT_61110 [Aureococcus anophagefferens]
          Length = 794

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 9/119 (7%)

Query: 621 FLRKYVVPLQERE---FIG-YHHEPVRAPMSFVVRYRPDEQ-PSLRPHHDSSTYTINIAL 675
           ++R  V  L  R    F G +   P R    F VRY  +     +R H D S  ++++AL
Sbjct: 500 YVRALVASLAARATLLFPGTFAGAPARVLDCFFVRYDAERCFAEMRDHVDESAVSVSLAL 559

Query: 676 NQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVDP 734
           N  G DY+GGG        NV     G +   PG +TH   G+ VT+GTR I+  F+ P
Sbjct: 560 NDAG-DYDGGGLHVAAAG-NVLNGPAGSVFCFPGAITH--GGVAVTRGTRRILSLFLVP 614


>gi|431898872|gb|ELK07242.1| Glycosyltransferase 25 family member 3, partial [Pteropus alecto]
          Length = 600

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 46/243 (18%), Positives = 103/243 (42%), Gaps = 28/243 (11%)

Query: 292 SVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFDDY 341
           +V++++        L  +L  +  L+YP  +++++   +          QE+ A + +DY
Sbjct: 6   AVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTEMLQEWLAAVGNDY 65

Query: 342 I------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSHLD 392
                        + + +   H +    +   E +  A+  +   G D+  + D+D+ L 
Sbjct: 66  AAVVWRPEGEPRSYPDEESPKHWTKERYQFLMELKQEALTFARGWGADYILFADTDNILT 125

Query: 393 NPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIW 452
           N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G +
Sbjct: 126 NNQTLRLLIEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RGCF 182

Query: 453 NVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQEY 507
            VP + + +L+      A  +        YT    D  + F  + +  G+ + + +   Y
Sbjct: 183 RVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYSCQAAGVSVHVCNEHRY 241

Query: 508 GHL 510
           G++
Sbjct: 242 GYM 244


>gi|83954578|ref|ZP_00963289.1| hypothetical protein NAS141_15193 [Sulfitobacter sp. NAS-14.1]
 gi|83840862|gb|EAP80033.1| hypothetical protein NAS141_15193 [Sulfitobacter sp. NAS-14.1]
          Length = 380

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 61/145 (42%), Gaps = 22/145 (15%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREF---IGYHHEPVRA 644
           G   D R E GY A P         G  G + E +  Y+ P+    F   +GY  +    
Sbjct: 114 GAMLDPRSE-GYLAAP---------GFQGFYREMMDAYMRPVSRLLFPDVVGYDTQT--- 160

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI---RYNCNVTATRM 701
              F +R++  +  SLRPH D+S  T+NI LN  G  Y G    FI              
Sbjct: 161 -FGFSIRWQASKDTSLRPHSDASAVTLNINLNLPGEGYSGSAVSFIDPVSRRVEKLTFEP 219

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRY 726
           G  L+H G + H  E   +T+G RY
Sbjct: 220 GTALIHHGSVPHASE--PITEGERY 242


>gi|380807617|gb|AFE75684.1| procollagen galactosyltransferase 1 precursor, partial [Macaca
           mulatta]
          Length = 151

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 23/69 (33%), Positives = 43/69 (62%), Gaps = 1/69 (1%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++++     D+  +VD+D+ + NPD L  L+  N++++AP+L     A+SNFW  +
Sbjct: 82  RQAALKSARDMWADYILFVDADNLILNPDTLSLLIAENKTVVAPML-DSRAAYSNFWCGM 140

Query: 427 NADGFYARS 435
            + G+Y R+
Sbjct: 141 TSQGYYKRT 149


>gi|156347859|ref|XP_001621780.1| predicted protein [Nematostella vectensis]
 gi|156208037|gb|EDO29680.1| predicted protein [Nematostella vectensis]
          Length = 248

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 45/179 (25%), Positives = 81/179 (45%), Gaps = 21/179 (11%)

Query: 560 DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
           +V+  P+ TE FC +F++ +E + + SD           Y       + +  +G    + 
Sbjct: 38  EVYRLPVFTESFCEQFIEELEHF-ESSDVPRGRPNTMNNY------GVLLSDLGFDEHFI 90

Query: 620 EFLRK-YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
             LR+ Y+ P+    F  +  + + +  +F V Y P +   L  H+D++  T+++ L   
Sbjct: 91  NPLRREYLQPITALLFPQWGGDGLDSHKAFTVHYMPGKDTELSYHYDNAEVTLSVCL--- 147

Query: 679 GVDYEGGGCRF--IRY------NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
           G ++ GG   F  +R        C     R  + L+H G+  H H  L  TQG+RY +I
Sbjct: 148 GREFSGGDLYFGDMRQVLLEDTQCTEVENRPTYGLLHRGQ--HMHGALPTTQGSRYNLI 204


>gi|328771198|gb|EGF81238.1| hypothetical protein BATDEDRAFT_87864 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 324

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 42/130 (32%), Positives = 65/130 (50%), Gaps = 17/130 (13%)

Query: 81  GYKVNLLKNELDEMDITDDMIILVTDSYDVIIDGG--VNDILERFNTF-----DANIVFG 133
           G ++ +L + L  +   +D +I+ +DS DVII  G  V++++ R+N+         + F 
Sbjct: 81  GLRIRILHDYL--LTQPEDRLIVWSDSDDVIITPGTTVSELISRYNSLVDLYNGPRVFFA 138

Query: 134 AERLCWPDTSLYDKY--PAVGSG------YRYLNSGGFIGYAKDIKELISNRSIKNEEDD 185
           AE  C+P   L+  Y  P    G      +RYLN+G  IG A  I+ LI      +  DD
Sbjct: 139 AEIACYPRGDLWSNYTDPEHIQGKKTYTPFRYLNAGIMIGPAGLIRRLIQVVYQHDCYDD 198

Query: 186 QLYYALLFLD 195
           QL + L  LD
Sbjct: 199 QLLFTLALLD 208


>gi|344338865|ref|ZP_08769796.1| hypothetical protein ThimaDRAFT_1534 [Thiocapsa marina 5811]
 gi|343801447|gb|EGV19390.1| hypothetical protein ThimaDRAFT_1534 [Thiocapsa marina 5811]
          Length = 276

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 66/134 (49%), Gaps = 15/134 (11%)

Query: 94  MDITDDMIILVTDSYDVIIDGGVNDILERFNT-FDANIVFGAERLCWPDTSLYDKYP-AV 151
           +D  +   +L  DS D +I G    +++RF   F+ +IVFGA+RL WP    + ++  A+
Sbjct: 125 LDTIETPYVLYADSRDALILGNPEILVDRFEGHFETDIVFGADRLSWPPLPRFKRFERAM 184

Query: 152 GSG----YRYLNSGGFIGYAKDIKELISNR-----SIKNEEDDQLYYALLFLDETLRTKH 202
            +G    + YLN G +IG     ++L +       + +  + +Q     L+++       
Sbjct: 185 AAGQPGDFHYLNGGTWIGRTAFCRDLFAAALEIPPTPEAPDSEQGILRTLWMER----PS 240

Query: 203 KIVLDTLANLFQNL 216
           +I LD    +FQN+
Sbjct: 241 EIALDYRCRMFQNI 254


>gi|395824283|ref|XP_003785400.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 2
           [Otolemur garnettii]
          Length = 547

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 34/138 (24%), Positives = 66/138 (47%), Gaps = 9/138 (6%)

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
           G D+  + D+D+ L N   L+ LV++   ++AP+L      +SNFW  +   G+Y R+ +
Sbjct: 91  GADYILFADTDNILTNNQTLRLLVDQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 149

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNL 492
           Y    N  +  +G ++VP + + +L+      A  +        YT    D  + F    
Sbjct: 150 YFPTKNRQR--RGCFSVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYAC 206

Query: 493 RNKGIHLKIDSTQEYGHL 510
           +  G+ + + + Q YG++
Sbjct: 207 QAAGVSVHVCNDQRYGYM 224


>gi|294868172|ref|XP_002765417.1| hypothetical protein Pmar_PMAR002413 [Perkinsus marinus ATCC 50983]
 gi|239865436|gb|EEQ98134.1| hypothetical protein Pmar_PMAR002413 [Perkinsus marinus ATCC 50983]
          Length = 624

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 47/193 (24%), Positives = 79/193 (40%), Gaps = 33/193 (17%)

Query: 86  LLKNELDEMDITDDMIILVTDSYDVII------DGGVNDILERFNTFDANIVFGAERLCW 139
           +L N L  M    D +++  D+ DV        +  V          +  I+  AER CW
Sbjct: 425 VLLNRLKSMP--SDALMIFNDALDVWFTPHASEEAFVKAFERELQIPEDTILVSAERNCW 482

Query: 140 PDTSLYD---KYPAV--GSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFL 194
           P          YPA   G+ Y+Y N+GG++G  K          I + +D+Q      + 
Sbjct: 483 PPPERMPYCRDYPASEHGTTYKYANTGGWMGRVK----TTWTACIMDGKDEQGCVQWFYR 538

Query: 195 DETLRTKH-------KIVLDTLANLFQNLYGS---------LEDIKLNFDLDEFVHLTNT 238
           D     ++       +I LD    ++Q L+G+         LE  +  F  ++   L N 
Sbjct: 539 DAKESRQYRENVGAFRIALDDTQMIWQTLWGTKFANVERAFLEVDRAGFGEEDAGKLVNP 598

Query: 239 KYNTNPVIIHGNG 251
           + +T P+++H NG
Sbjct: 599 ETSTTPLVVHFNG 611


>gi|224006610|ref|XP_002292265.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971907|gb|EED90240.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 288

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 58/115 (50%), Gaps = 8/115 (6%)

Query: 622 LRKYVVPLQEREFIGY---HHEPVRAPMSFVVRYRPDE-QPSLRPHHDSSTYTINIALNQ 677
           L + + PL  ++F  Y     + +R    FVV+Y  +  Q  L+PH D S  + NIALN 
Sbjct: 159 LVERIYPLLRQQFGMYLPDGGKSLRVADGFVVKYDAEGGQAELKPHRDGSVLSFNIALNP 218

Query: 678 VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
              +++GGG  F   +  V   + G ++ H   L H   G  +T G RYIM+ FV
Sbjct: 219 AD-EFDGGGTWFQSLDGAVKIDQ-GEVVSHSSSLLHGGHG--ITSGKRYIMVCFV 269


>gi|344271834|ref|XP_003407742.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase 25 family
           member 3-like [Loxodonta africana]
          Length = 596

 Score = 52.4 bits (124), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 47/244 (19%), Positives = 102/244 (41%), Gaps = 28/244 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFDD 340
           P+V++ +        L  +L  +  L+YP  +++++   +          QE+ A + +D
Sbjct: 32  PAVVLVILARNAEHSLPHYLGALERLDYPRARLALWCATDHNIDNTKEMLQEWLAAVGND 91

Query: 341 YI------HNFKTMFKNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSHL 391
           Y             + + +   H +    +   E +  A+  +   G D+  + D+D+ L
Sbjct: 92  YAAVVWRPEGEPRSYPDEEGPKHWTKERYQFLMELKQEALTFARDWGADYILFADTDNIL 151

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y    N  +  +G 
Sbjct: 152 TNNQTLQLLMEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEYFPTKNRQR--RGC 208

Query: 452 WNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLRNKGIHLKIDSTQE 506
           + VP + + +L+      A  +        YT    D  + F    +  G+ + + +   
Sbjct: 209 FRVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYACQAAGVSVHVCNQHR 267

Query: 507 YGHL 510
           YG++
Sbjct: 268 YGYM 271


>gi|387219649|gb|AFJ69533.1| hypothetical protein NGATSA_3030300 [Nannochloropsis gaditana
           CCMP526]
          Length = 324

 Score = 52.4 bits (124), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 67/136 (49%), Gaps = 8/136 (5%)

Query: 601 AVPTRDIHMKQVGLAGVW-AEFLRKYVVPLQEREFIGYHHEPVRAPM--SFVVRYRPDE- 656
           A PT D+ ++++  +  W    L++ + P     F     +  +  +  +F+V+Y  D  
Sbjct: 174 AYPTTDVPLQELPRSLAWFNRQLQEKIYPCLATNFASALPDSSKLKVVDAFIVKYDADGG 233

Query: 657 QPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHE 716
           Q  L+PH D S  + NIALN    ++EGGG  F   +  +     G ++ H   +   H 
Sbjct: 234 QTQLKPHRDGSVVSFNIALNP-SSEFEGGGTYFAGLDQGLR-IEQGHIVTHASNV--LHG 289

Query: 717 GLQVTQGTRYIMISFV 732
           G  ++ G RYI++SFV
Sbjct: 290 GHPISAGKRYILVSFV 305


>gi|348569847|ref|XP_003470709.1| PREDICTED: glycosyltransferase 25 family member 3-like [Cavia
           porcellus]
          Length = 591

 Score = 52.4 bits (124), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 39/203 (19%), Positives = 87/203 (42%), Gaps = 22/203 (10%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYI----H 343
            PSV++++        L  +L  +  L+YP  +++++    +N +    +  +++    H
Sbjct: 27  LPSVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNIDNTTAMLREWLAAVGH 86

Query: 344 NFKTMF-------------KNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           ++  +              +  K+          E +  A+  +   G D+  + D+D+ 
Sbjct: 87  HYAAVIWRPEGEPRSYPDEEGPKHWTKERHQFLMELKQEALTFARAWGADYILFADTDNI 146

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L++L  +   ++AP+L      +SNFW  +   G+Y R+ DY    N  +  +G
Sbjct: 147 LTNNQTLRFLTEQALPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTTDYFPTKNRQR--QG 203

Query: 451 IWNVPYITNCYLMKTSVIKATNI 473
            + VP + + +L+      A  +
Sbjct: 204 CFRVPMVHSTFLVSLRAEGADQL 226


>gi|156355246|ref|XP_001623582.1| predicted protein [Nematostella vectensis]
 gi|156210297|gb|EDO31482.1| predicted protein [Nematostella vectensis]
          Length = 344

 Score = 52.4 bits (124), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 45/179 (25%), Positives = 81/179 (45%), Gaps = 21/179 (11%)

Query: 560 DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
           +V+  P+ TE FC +F++ +E + + SD           Y       + +  +G    + 
Sbjct: 134 EVYRLPVFTESFCEQFIEELEHF-ESSDVPRGRPNTMNNY------GVLLSDLGFDEHFI 186

Query: 620 EFLRK-YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
             LR+ Y+ P+    F  +  + + +  +F V Y P +   L  H+D++  T+++ L   
Sbjct: 187 NPLRREYLQPITALLFPQWGGDGLDSHKAFTVHYMPGKDTELSYHYDNAEVTLSVCL--- 243

Query: 679 GVDYEGGGCRF--IRY------NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
           G ++ GG   F  +R        C     R  + L+H G+  H H  L  TQG+RY +I
Sbjct: 244 GREFSGGDLYFGDMRQVLLEDTQCTEVENRPTYGLLHRGQ--HMHGALPTTQGSRYNLI 300


>gi|395824281|ref|XP_003785399.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 1
           [Otolemur garnettii]
          Length = 515

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/138 (24%), Positives = 66/138 (47%), Gaps = 9/138 (6%)

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
           G D+  + D+D+ L N   L+ LV++   ++AP+L      +SNFW  +   G+Y R+ +
Sbjct: 59  GADYILFADTDNILTNNQTLRLLVDQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 117

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNL 492
           Y    N  +  +G ++VP + + +L+      A  +        YT    D  + F    
Sbjct: 118 YFPTKNRQR--RGCFSVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYAC 174

Query: 493 RNKGIHLKIDSTQEYGHL 510
           +  G+ + + + Q YG++
Sbjct: 175 QAAGVSVHVCNDQRYGYM 192


>gi|303272359|ref|XP_003055541.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463515|gb|EEH60793.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 896

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 14/180 (7%)

Query: 556 QPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV-GL 614
           Q  PD    P+++E+ C E++   EA+   + G     R    + AVPT D+ +  V  L
Sbjct: 725 QTAPDA---PLLSERECLEWIAAAEAHAAKTRGGWTTSR----HYAVPTTDLPVHAVEAL 777

Query: 615 AGVWAEFLRKYVVPLQEREF--IGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
              W + +R+ + PL       +      VR    FVVRY    Q  L  H D S  ++ 
Sbjct: 778 VPRWNDLMREKLSPLLAAACADVVARASSVRVHDVFVVRYDASAQHHLPIHVDQSAVSLT 837

Query: 673 IALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
           +ALN  G  ++GGG  F       +    G   +  G L   H G  VT+G RY++ +F+
Sbjct: 838 LALNG-GDAFDGGGTTFADLGVTCS-PETGHAAVFRGDL--RHGGAPVTRGVRYVVAAFL 893


>gi|397575536|gb|EJK49747.1| hypothetical protein THAOC_31344 [Thalassiosira oceanica]
          Length = 517

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 46/80 (57%), Gaps = 4/80 (5%)

Query: 656 EQPSLRPHHDSSTYTINIALNQ-VGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHY 714
           E+  L  H D ST+T  IAL++  G DY GGG  F   N  V   R G ML+  G+L   
Sbjct: 435 ERQKLELHTDKSTWTFLIALSEGRGTDYSGGGTFFQALNSTVHLQR-GQMLIFRGKL--R 491

Query: 715 HEGLQVTQGTRYIMISFVDP 734
           H G++++ G RY+++ F+ P
Sbjct: 492 HAGVRISWGCRYLLVGFLVP 511


>gi|426226143|ref|XP_004007209.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase 25 family
           member 3 [Ovis aries]
          Length = 652

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/138 (24%), Positives = 64/138 (46%), Gaps = 9/138 (6%)

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
           G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ D
Sbjct: 224 GADYILFADTDNILTNNQTLQLLIEQGLPVVAPMLDS-QTYYSNFWCGITPQGYYRRTAD 282

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNL 492
           Y    N  +  +G + VP + + +L+     + T     Y  +       D  + F    
Sbjct: 283 YFPTKNRQR--RGCFRVPMVHSTFLVSLRA-EGTAQLAFYPPHPNYTWPFDDIIVFAYAC 339

Query: 493 RNKGIHLKIDSTQEYGHL 510
           +  G+ + + + Q YG+L
Sbjct: 340 QAAGVSVHVCNEQRYGYL 357


>gi|195576767|ref|XP_002078245.1| GD23349 [Drosophila simulans]
 gi|194190254|gb|EDX03830.1| GD23349 [Drosophila simulans]
          Length = 803

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 90/192 (46%), Gaps = 22/192 (11%)

Query: 291 PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYIHNFKTM 348
           P+VLI++ +      L  FL+ +   +Y  ++I++++   ++ +    L   ++ N   +
Sbjct: 30  PTVLIALLVRNKAHILPMFLSYLERQDYSKERIAIWLRCDHSNDDSIDLLRQWLDNSGDL 89

Query: 349 FKNVKY---IAHNSTVN---------SKEARNLAV-ENSLHKG----VDFYFYVDSDSHL 391
           + +V Y       S VN         S+    +A+ E +   G     D+ F++D+D  L
Sbjct: 90  YHSVSYEFKPEEQSFVNETSPYEWPASRFKHLIALKEEAFQYGRDIWADYVFFLDADVLL 149

Query: 392 DNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGI 451
            + D LK L      ++AP+L+     +SNFW  +  D +Y R+ +Y  I +  +  +G 
Sbjct: 150 TSKDSLKVLTRLQLPIVAPMLISE-SLYSNFWCGMTEDYYYRRTDEYKEIYHAKK--QGS 206

Query: 452 WNVPYITNCYLM 463
           + VP +    L+
Sbjct: 207 FPVPMVHTAVLV 218


>gi|170038076|ref|XP_001846879.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167881499|gb|EDS44882.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 496

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 52/90 (57%), Gaps = 3/90 (3%)

Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
           ++D+D  L NP  L  LV+ +  ++AP+L+     +SNFW  +  D +Y R+ +Y  I+N
Sbjct: 27  FLDADVFLTNPKTLTKLVSLSLPIVAPMLLSD-GLYSNFWCGMTPDYYYERTEEYKEILN 85

Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKATNI 473
              G  G + VP + +  ++  ++++A N+
Sbjct: 86  --YGKTGEFTVPMVHSAVMVNINLLEAKNL 113


>gi|47223918|emb|CAG06095.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 660

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 54/97 (55%), Gaps = 3/97 (3%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A++ +     D+   VD D+ L N ++L  L+  N++++AP+L     A+SNFW  +
Sbjct: 165 RQAALDTAREIWADYLLVVDCDNLLTNRELLWKLMRENKTVVAPML-ESRAAYSNFWCGM 223

Query: 427 NADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
            + G+Y R+  Y+ I   ++  +G + VP + +  L+
Sbjct: 224 TSQGYYKRTPAYVPIRKRER--RGCFAVPMVHSTLLV 258


>gi|349804117|gb|AEQ17531.1| putative procollagen-lysine 2-oxoglutarate 5-dioxygenase 3
           [Hymenochirus curtipes]
          Length = 111

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/92 (26%), Positives = 50/92 (54%), Gaps = 1/92 (1%)

Query: 312 KIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAV 371
           ++  L+YP  ++S++++N++ YH      +    K  F ++K +     ++  EAR++ +
Sbjct: 1   RLVLLDYPRNRLSLYIHNSEVYHEKHIQAFWEKHKEDFSSLKIVGPEEALSQGEARDMGM 60

Query: 372 ENSLH-KGVDFYFYVDSDSHLDNPDVLKYLVN 402
           +     +  D+Y+ VD+D  L NPD L  L+ 
Sbjct: 61  DLCRQDETCDYYYSVDADVVLTNPDTLYILIQ 92


>gi|242007889|ref|XP_002424750.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212508253|gb|EEB12012.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 327

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 56/101 (55%), Gaps = 8/101 (7%)

Query: 363 SKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNF 422
            ++A N+A EN      DF F V+ D  L + +  KYLV +N ++  P+L +    +SNF
Sbjct: 94  KEKALNVAREN----WADFIF-VNCDVFLTDNETFKYLVRQNHTVTGPML-KSIGLYSNF 147

Query: 423 WGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
           W  + +  +Y R+ DY  I+  ++  KG +NVP I +  ++
Sbjct: 148 WCGMTSKYYYMRTDDYKPILKREK--KGCFNVPMIHSALII 186


>gi|384498735|gb|EIE89226.1| hypothetical protein RO3G_13937 [Rhizopus delemar RA 99-880]
          Length = 239

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 4/86 (4%)

Query: 648 FVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMH 707
           F+V+Y  +EQ  L  H D   ++I + ++    D+EGGG  F   +  V     G    H
Sbjct: 142 FLVKYSAEEQRGLGLHADGCLFSITLLISHPD-DFEGGGTYFASID-QVVHLGQGDCAYH 199

Query: 708 PGRLTHYHEGLQVTQGTRYIMISFVD 733
             R+   H G+++T+G RY+++ F+D
Sbjct: 200 DARV--MHSGMEITKGERYVLVGFID 223


>gi|323451040|gb|EGB06918.1| hypothetical protein AURANDRAFT_14444, partial [Aureococcus
           anophagefferens]
          Length = 172

 Score = 50.8 bits (120), Expect = 0.003,   Method: Composition-based stats.
 Identities = 46/180 (25%), Positives = 73/180 (40%), Gaps = 34/180 (18%)

Query: 562 FWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEF 621
           + FP+ TE FC   +  ++A+ +          +  G   +   D  M+++G      + 
Sbjct: 1   YAFPLFTEAFCARLLADLDAWERSPLPRRRPNSMNAG--GLVVNDCGMERLG-----DDL 53

Query: 622 LRKYVVPLQEREF--------IGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINI 673
           L + V PL    F        + +HH        F VRY   E  +L  HHD+S  T+N+
Sbjct: 54  LARVVGPLASTLFGDEVFARSLDHHH-------LFAVRYAVGEDETLAMHHDASEVTLNV 106

Query: 674 ALNQVGVDYEGGGCRFI--------RYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
            L   G  +EGG  +F         R        R+G  ++H GR  H H   ++  G R
Sbjct: 107 CLGTAG--FEGGALQFCGRVGDGDHRAASGAFDHRVGTAVLHLGR--HRHGVARLASGER 162


>gi|351697039|gb|EHA99957.1| Glycosyltransferase 25 family member 3 [Heterocephalus glaber]
          Length = 644

 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/194 (19%), Positives = 83/194 (42%), Gaps = 22/194 (11%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY--NNQEYHAPLFDDYI----H 343
            PSV++++        L  +L  +  L+YP  +++++    +N +    +  +++    H
Sbjct: 79  LPSVVLAILARNAEHSLPHYLGALERLDYPRARLALWCATDHNVDNTTAMLREWLAAVGH 138

Query: 344 NFKTMF-------------KNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSH 390
           ++  +                 K+          E +  A+  +   G D+  + D+D+ 
Sbjct: 139 HYAAVIWRPEGEPRSYPDEGGPKHWTRERHQFLMELKQEALTFARDWGADYILFADTDNI 198

Query: 391 LDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKG 450
           L N   L+ L  +   ++AP+L      +SNFW  +   G+Y R+ DY    N  +  +G
Sbjct: 199 LTNNRTLRLLTEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RG 255

Query: 451 IWNVPYITNCYLMK 464
            + VP + + +L+ 
Sbjct: 256 CFRVPMVHSTFLVS 269


>gi|323451068|gb|EGB06946.1| putative 2OG-Fe(II) oxidoreductase like-protein [Aureococcus
           anophagefferens]
          Length = 312

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 59/245 (24%), Positives = 101/245 (41%), Gaps = 54/245 (22%)

Query: 510 LVDSENFDPQKTNPEVYE-LIRNPLDWDLRYIHPEYQKSLLPDTVN-----------NQP 557
           L   E + P +  PE++E L+R+           E+    L D V             + 
Sbjct: 26  LSPEEAYAPLRRTPELFESLLRD-----------EWLAPTLLDVVQAARRGQCHRDLREE 74

Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTR--------DIHM 609
            P VF F + ++ FC +F+Q ++ Y            +++G   +P R         + +
Sbjct: 75  APGVFSFAMFSDAFCRDFLQEVDGY------------MDSG---LPIRRPNSMNNYGLIV 119

Query: 610 KQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY 669
            ++G+  V +E  R+ + P+  R           A  SF+V+YR  E P L  H D S  
Sbjct: 120 NEIGMLDVISELQREVLWPIA-RSLWPKEGSAFHAHHSFMVQYRKTEDPGLDMHTDDSDV 178

Query: 670 TINIALNQV----GVDYEGGGCRFIRYNCNVTATRM-GWMLMHPGRLTHYHEGLQVTQGT 724
           T N+ L +V    G+ + GG  R  R+        + G  ++H G  +  H    ++ GT
Sbjct: 179 TFNVCLGEVFAGAGLTFCGGMRRETRHRFAFQYEHVKGRAVVHLG--SKRHGADDISSGT 236

Query: 725 RYIMI 729
           R  +I
Sbjct: 237 RRNLI 241


>gi|156355248|ref|XP_001623583.1| predicted protein [Nematostella vectensis]
 gi|156210298|gb|EDO31483.1| predicted protein [Nematostella vectensis]
          Length = 285

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 44/179 (24%), Positives = 80/179 (44%), Gaps = 21/179 (11%)

Query: 560 DVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWA 619
           +V+  P+ TE FC +F++ +E + + SD           Y       + +  +G    + 
Sbjct: 75  EVYRLPVFTESFCEQFIEELEHF-ESSDVPRGRPNTMNNY------GVLLSDLGFDEHFI 127

Query: 620 EFLRK-YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
             LR+ Y+ P+    F  +  + + +  +F V Y P +   L  H+D++  T+++ L   
Sbjct: 128 NPLRREYLQPITALLFPQWGGDGLDSHKAFTVHYMPGKDTELSYHYDNAEVTLSVCL--- 184

Query: 679 GVDYEGGGCRF--IRY------NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
           G ++ GG   F  +R        C     R  + L+H G+    H  L  TQG+RY +I
Sbjct: 185 GREFSGGDLYFGDMRQVLLEDTQCTEVENRPTYGLLHRGQ--QMHGALPTTQGSRYNLI 241


>gi|160395571|sp|Q5U309.2|GT253_RAT RecName: Full=Probable inactive glycosyltransferase 25 family
           member 3; AltName: Full=Cerebral endothelial cell
           adhesion molecule
          Length = 572

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/167 (22%), Positives = 75/167 (44%), Gaps = 13/167 (7%)

Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           E +  A+  +   G D+  + D+D+ L N   L+ L++R   ++AP+L      +SNFW 
Sbjct: 101 ELKQEALAFARDWGADYILFADTDNILTNNQTLRLLIDRQLPVVAPMLDSQ-TYYSNFWC 159

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
            +   G+Y R+ +Y    N  +  +G + VP + + +L+     +   +        YT 
Sbjct: 160 GITPQGYYRRTAEYFPTKNRQR--QGCFRVPMVHSTFLVSLQTEETARLAFYPPHPNYTW 217

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL----VDSENFDPQKTN 522
              D  + F    +  G+ + + +   YG++       +  + +KTN
Sbjct: 218 -PFDDIIVFAYACQAAGVSVHVCNDHRYGYMNVGVKPHQGLEEEKTN 263


>gi|58865502|ref|NP_001011962.1| probable inactive glycosyltransferase 25 family member 3 [Rattus
           norvegicus]
 gi|55249709|gb|AAH85782.1| Cerebral endothelial cell adhesion molecule [Rattus norvegicus]
          Length = 517

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/167 (22%), Positives = 75/167 (44%), Gaps = 13/167 (7%)

Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           E +  A+  +   G D+  + D+D+ L N   L+ L++R   ++AP+L      +SNFW 
Sbjct: 46  ELKQEALAFARDWGADYILFADTDNILTNNQTLRLLIDRQLPVVAPMLDSQ-TYYSNFWC 104

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
            +   G+Y R+ +Y    N  +  +G + VP + + +L+     +   +        YT 
Sbjct: 105 GITPQGYYRRTAEYFPTKNRQR--QGCFRVPMVHSTFLVSLQTEETARLAFYPPHPNYTW 162

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL----VDSENFDPQKTN 522
              D  + F    +  G+ + + +   YG++       +  + +KTN
Sbjct: 163 -PFDDIIVFAYACQAAGVSVHVCNDHRYGYMNVGVKPHQGLEEEKTN 208


>gi|83944028|ref|ZP_00956485.1| hypothetical protein EE36_10295 [Sulfitobacter sp. EE-36]
 gi|83845275|gb|EAP83155.1| hypothetical protein EE36_10295 [Sulfitobacter sp. EE-36]
          Length = 303

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/130 (28%), Positives = 56/130 (43%), Gaps = 12/130 (9%)

Query: 603 PTRDIHMKQVGLAGVWAEFLRKYVVPLQEREF---IGYHHEPVRAPMSFVVRYRPDEQPS 659
           P  + ++   G  G + E +  Y+ P+    F   +GY  +       F +R++  +  S
Sbjct: 138 PRSEGYLAAPGFQGFYREMMDAYMRPVSRLLFPDVVGYDTQT----FGFSIRWQASKDTS 193

Query: 660 LRPHHDSSTYTINIALNQVGVDYEGGGCRFI---RYNCNVTATRMGWMLMHPGRLTHYHE 716
           LRPH D+S  T+NI LN     Y G    FI              G  L+H G + H  E
Sbjct: 194 LRPHSDASAVTLNINLNLPDEWYSGSAVSFIDPVSRRVEKLTFEPGTALIHHGSVPHASE 253

Query: 717 GLQVTQGTRY 726
              +T+G RY
Sbjct: 254 --PITEGERY 261


>gi|432095371|gb|ELK26570.1| Glycosyltransferase 25 family member 3 [Myotis davidii]
          Length = 559

 Score = 49.7 bits (117), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/137 (23%), Positives = 63/137 (45%), Gaps = 9/137 (6%)

Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDY 438
            D+  + DSD+ L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y
Sbjct: 102 ADYILFADSDNILTNSQTLRLLIEQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEY 160

Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLR 493
               N  +  +G + VP + + +L+      A  +        YT    D  + F  + +
Sbjct: 161 FPTKNRQR--RGCFQVPMVHSTFLVSLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYSCQ 217

Query: 494 NKGIHLKIDSTQEYGHL 510
             G+ + + +   YG++
Sbjct: 218 AAGVSVHVCNEHRYGYM 234


>gi|296190932|ref|XP_002743399.1| PREDICTED: glycosyltransferase 25 family member 3 isoform 2
           [Callithrix jacchus]
          Length = 548

 Score = 48.9 bits (115), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 33/138 (23%), Positives = 63/138 (45%), Gaps = 9/138 (6%)

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
           G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +
Sbjct: 90  GADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 148

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNL 492
           Y    N  +  +G + VP + + +L+      A  I        YT    D  + F    
Sbjct: 149 YFPTKNRQR--RGCFRVPMVHSTFLVSLRAEGADQIAFYPPHPNYTW-PFDDIIVFAYAC 205

Query: 493 RNKGIHLKIDSTQEYGHL 510
           +  G+ + + +   YG++
Sbjct: 206 QAAGVSVHVCNEHRYGYM 223


>gi|335281050|ref|XP_003353725.1| PREDICTED: glycosyltransferase 25 family member 3-like isoform 2
           [Sus scrofa]
          Length = 517

 Score = 48.9 bits (115), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 31/138 (22%), Positives = 64/138 (46%), Gaps = 9/138 (6%)

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
           G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +
Sbjct: 59  GADYILFADTDNILTNNQTLRLLIEQQLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 117

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNL 492
           Y    N  +  +G + VP + + +L+     + T   + Y  +       D  + F    
Sbjct: 118 YFPTKNRQR--RGCFRVPMVHSTFLISLRA-EGTGQLSFYPPHPNYTWPFDDIIVFAYAC 174

Query: 493 RNKGIHLKIDSTQEYGHL 510
           +  G+ + + +   YG++
Sbjct: 175 QAAGVSVHVCNEHRYGYM 192


>gi|357132260|ref|XP_003567749.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
           [Brachypodium distachyon]
          Length = 393

 Score = 48.9 bits (115), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 48/190 (25%), Positives = 76/190 (40%), Gaps = 27/190 (14%)

Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIH 608
           ++  +P P VF FP++  KFC    + ++ +  W         R  T   Y AV      
Sbjct: 158 SIMAEPIPGVFSFPMLQPKFCDMLFEEVDNFESWVHAMKFKIMRPNTMNKYGAV------ 211

Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
           +   GL  +  +F+ K++ P+ +  +       + +  +FVV Y  D    L  H D S 
Sbjct: 212 LDDFGLETMLNDFMEKFITPISKVFYPEVGGGTLDSHHAFVVEYGKDRDVELGFHVDDSE 271

Query: 669 YTINIALNQVGVDYEGGGCRFIRYNC----NVTATRM---------GWMLMHPGRLTHYH 715
            T+N+ L   G  + GG   F    C    N  A +          GW ++H GR  H H
Sbjct: 272 VTLNVCL---GKQFSGGQLYFRGVRCENHVNSEAQQEEIYDYPHVPGWAVLHRGR--HRH 326

Query: 716 EGLQVTQGTR 725
                + G R
Sbjct: 327 GARPTSSGLR 336


>gi|335281052|ref|XP_001925614.3| PREDICTED: glycosyltransferase 25 family member 3-like isoform 1
           [Sus scrofa]
          Length = 555

 Score = 48.9 bits (115), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 31/138 (22%), Positives = 64/138 (46%), Gaps = 9/138 (6%)

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
           G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +
Sbjct: 97  GADYILFADTDNILTNNQTLRLLIEQQLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 155

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLN-----SMDYDMAFCTNL 492
           Y    N  +  +G + VP + + +L+     + T   + Y  +       D  + F    
Sbjct: 156 YFPTKNRQR--RGCFRVPMVHSTFLISLRA-EGTGQLSFYPPHPNYTWPFDDIIVFAYAC 212

Query: 493 RNKGIHLKIDSTQEYGHL 510
           +  G+ + + +   YG++
Sbjct: 213 QAAGVSVHVCNEHRYGYM 230


>gi|121583693|ref|NP_001073538.1| 2-oxoglutarate and iron-dependent oxygenase domain-containing
           protein 2 [Danio rerio]
 gi|118764167|gb|AAI28873.1| Zgc:158437 [Danio rerio]
          Length = 345

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 46/186 (24%), Positives = 81/186 (43%), Gaps = 21/186 (11%)

Query: 553 VNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV 612
           +  +  P VF F +  ++FC + ++ +E + Q SD           Y  V      + ++
Sbjct: 123 IQTEAAPRVFRFQVFRKEFCKDLLEELEHFEQ-SDAPKGRPNTMNNYGIV------LNEL 175

Query: 613 GLAGVWAEFLRK-YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
           G    +   LR+ Y+ PL    +       + +  +FVV+Y   E  +L  H+D+S  T+
Sbjct: 176 GFDEGFITPLREVYLRPLTALLYSDCGGNCLDSHKAFVVKYDMHEDLNLSYHYDNSEVTL 235

Query: 672 NIALNQVGVDYEGGGCRF--------IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
           N++L   G D+  G   F            C     R+   L+H G+  H H  L ++ G
Sbjct: 236 NVSL---GKDFTEGNLFFGDMRQVPLSETECVEVEHRVTEGLLHRGQ--HMHGALSISSG 290

Query: 724 TRYIMI 729
           TR+ +I
Sbjct: 291 TRWNLI 296


>gi|158302599|ref|XP_561137.5| Anopheles gambiae str. PEST AGAP012933-PA [Anopheles gambiae str.
           PEST]
 gi|157021089|gb|EAL42272.3| AGAP012933-PA [Anopheles gambiae str. PEST]
          Length = 330

 Score = 48.5 bits (114), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 395 DVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNV 454
           + L  L++R   ++AP+LV     +SNFW  + +D +Y R+ DY  I+N DQ G+  W V
Sbjct: 1   NTLGKLIDRKLPIVAPMLVSD-GLYSNFWCGMTSDYYYQRTDDYKKILNYDQIGQ--WPV 57

Query: 455 PYITNCYLMKTSVIKATNI 473
           P +    L+  ++ +   +
Sbjct: 58  PMVHTAVLVSLNIAQTRQL 76


>gi|308801166|ref|XP_003075362.1| Lysyl hydroxylase (ISS) [Ostreococcus tauri]
 gi|116061918|emb|CAL52636.1| Lysyl hydroxylase (ISS), partial [Ostreococcus tauri]
          Length = 233

 Score = 48.5 bits (114), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 68/157 (43%), Gaps = 34/157 (21%)

Query: 128 ANIVFGAERLCWP----DTSLYD-------KY--PAVGSGYRYLNSGGFIGYAKDIKELI 174
           A I+F AE  CWP    D  L D       K+   A GS  +YLNSGG IG    + E+ 
Sbjct: 26  ALILFSAEGNCWPHMAGDQELIDGGREYCAKFHDKAKGSSNKYLNSGGVIGPVSALAEMY 85

Query: 175 SN-RSIKNEEDDQ------LYYALLFLDETLRTKHK---IVLDTLANLFQN-------LY 217
              RS+    DD+        YA    DE   T  K   I LD  A +FQ        + 
Sbjct: 86  QEIRSLMKTVDDEDQMITASVYAKQIDDERSGTHSKRYVIALDHEARVFQTGWHTHLEIT 145

Query: 218 GSLEDIKLN---FDLDEFVHLTNTKYNTNPVIIHGNG 251
           G   + ++N   FD    V   NT++N+ P I H NG
Sbjct: 146 GKYAEPQVNGAYFDTSLGV-FVNTEHNSTPPIAHFNG 181


>gi|412987619|emb|CCO20454.1| Lysyl hydroxylase (ISS) [Bathycoccus prasinos]
          Length = 403

 Score = 48.5 bits (114), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 53/207 (25%), Positives = 89/207 (42%), Gaps = 52/207 (25%)

Query: 97  TDDMIILVTDSYDVII--DGGVNDILERFNTF--DANI---------VFGAERLCWPDT- 142
           + D I+ + D+ DV+   DG    I+E++     DA +         + GAER CWP   
Sbjct: 186 SGDTIVNIADASDVLYFQDGAT--IMEKYKQIVRDAPVDESRKHTIVLIGAERNCWPSMD 243

Query: 143 ----------SLYDKYPAVG--SGYRYLNSGGFIGYAKDIKELISN-RSI----KNEEDD 185
                        +++ AV   S Y +LNSG  +G    +K L+    S+    K  +DD
Sbjct: 244 GEKELIPGGRKYCEQFKAVSGNSSYHFLNSGSLMGRVDAVKALLKRVESVMDGGKQNDDD 303

Query: 186 QLYYALLFLDETLRTKHK----------IVLDTLANLFQNLYGS--------LEDIKLNF 227
           Q    + + +  ++ K +          I+LD  A++FQ  +GS          D    +
Sbjct: 304 QQLLQMQY-ERQIKQKSEGGGKEEDAFTILLDHKASIFQTGWGSHLANGRYAARDPNGAY 362

Query: 228 DLDEFVHLTNTKYNTNPVIIHGNGKSK 254
             +    + NT++N+ P IIH NG  +
Sbjct: 363 YNEAKCAVENTEHNSEPSIIHFNGGKR 389


>gi|223992787|ref|XP_002286077.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977392|gb|EED95718.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 566

 Score = 48.5 bits (114), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 52/186 (27%), Positives = 74/186 (39%), Gaps = 33/186 (17%)

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV-----GLAGVWA 619
           P++++  C   + I E +      T       T + AVPT D+ + Q+         +W 
Sbjct: 345 PMISQTECQNVINIAEQHAARLGWTT------TRHYAVPTTDVPLHQLIELRPWFYKLWT 398

Query: 620 EFLRKYVVPLQEREFI----GYHHEPVRAPMS---------FVVRYRPDE-QPSLRPHHD 665
             LR    P   R+F           V +P S         FVVRY     Q  L PH+D
Sbjct: 399 SRLR----PTLRRQFRISTNTNETATVPSPTSHRDIFIHDVFVVRYDAQGGQRGLPPHYD 454

Query: 666 SSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
            ST++  I LN    +Y+GGG         +     G ML   G     H G  V +G R
Sbjct: 455 ESTHSFVIGLN---TEYQGGGTFIHALGRPLKPKVEGGMLSFSGG-EFLHSGDPVVEGIR 510

Query: 726 YIMISF 731
           YI++ F
Sbjct: 511 YIIVGF 516


>gi|323455517|gb|EGB11385.1| hypothetical protein AURANDRAFT_14407, partial [Aureococcus
           anophagefferens]
          Length = 171

 Score = 48.5 bits (114), Expect = 0.013,   Method: Composition-based stats.
 Identities = 41/133 (30%), Positives = 55/133 (41%), Gaps = 17/133 (12%)

Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPM----SFVVRYRPDEQPSLRPHH 664
           +  VGL         K   PL +  F  + H    A      SF+V+YR DE P L  H 
Sbjct: 42  VNDVGLEPFVKALQDKVCGPLAQALFKAHPHGHPAADFDSTHSFIVKYRGDEDPHLDVHT 101

Query: 665 DSSTYTINIALNQVGVDYEGGGCRFI--------RYNCNVTATRMGWMLMHPGRLTHYHE 716
           D S  T N+ L   G D+EG G  F         R +C     R+G  + H G  +  H 
Sbjct: 102 DDSDVTFNVCL---GRDFEGCGLVFCGMIGAKDHRQHCKTYEHRVGTCVCHLG--SKRHG 156

Query: 717 GLQVTQGTRYIMI 729
              +T+G R  +I
Sbjct: 157 ADDITRGERLNLI 169


>gi|7959265|dbj|BAA96026.1| KIAA1502 protein [Homo sapiens]
          Length = 560

 Score = 48.5 bits (114), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 34/151 (22%), Positives = 68/151 (45%), Gaps = 9/151 (5%)

Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           E +  A+  + + G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW 
Sbjct: 123 ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 181

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
            +   G+Y R+ +Y    N  +  +G + VP + + +L       A  +        YT 
Sbjct: 182 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 239

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
              D  + F    +  G+ + + +   YG++
Sbjct: 240 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 269


>gi|219114901|ref|XP_002178246.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409981|gb|EEC49911.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 449

 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 44/78 (56%), Gaps = 4/78 (5%)

Query: 656 EQPSLRPHHDSSTYTINIAL-NQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHY 714
           E+  L  H D S +T  IAL N  G+DYEGGG  F   +  V   R G  L+ PG+L H 
Sbjct: 351 ERQKLDMHTDKSEWTFLIALSNGSGLDYEGGGTFFECLDSTVHVQR-GHALIFPGKLRHC 409

Query: 715 HEGLQVTQGTRYIMISFV 732
             G ++T G R++++ F+
Sbjct: 410 --GQRITSGLRFLLVGFL 425


>gi|166240093|ref|XP_001732961.1| hypothetical protein DDB_G0270778 [Dictyostelium discoideum AX4]
 gi|165988739|gb|EDR41119.1| hypothetical protein DDB_G0270778 [Dictyostelium discoideum AX4]
          Length = 417

 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 52/221 (23%), Positives = 93/221 (42%), Gaps = 28/221 (12%)

Query: 523 PEVYELIRNPLDWDLRYIHP--EYQKSLLPD----TVNNQPCPDVFWFPIVTEKFCHEFV 576
           PE++EL     D D  +I P  +Y+K+   D     +       ++ F I T +FC + +
Sbjct: 193 PEIFELREEYFDKD--FIEPIKQYKKTKNQDDLLKALTKLTETRIYSFRIFTMEFCTKLL 250

Query: 577 QIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIG 636
           + +E +      T     +   Y AV      + ++G    + +    Y+       +  
Sbjct: 251 EEIENFKNTGLPTARPNSM-NNYGAV------LDEMGFTEFFKQLREDYLSLFTSILYKD 303

Query: 637 YHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI------ 690
           Y+ E + +  +F V+Y+ D++  L  H+D S  T+N+ L   G ++ GG   F       
Sbjct: 304 YNGEKLNSHHAFAVQYKMDKEKELGFHYDESDITVNLCL---GSEFTGGSLYFKGILDKP 360

Query: 691 -RYNCNVTATRM-GWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
             +N       + G  L+H G   H H  L +T G R  +I
Sbjct: 361 ETHNEYFEFKHIPGVALIHIG--VHRHGALGLTSGERTNLI 399


>gi|452824176|gb|EME31181.1| hypothetical protein Gasu_16740 [Galdieria sulphuraria]
          Length = 291

 Score = 48.1 bits (113), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 52/205 (25%), Positives = 79/205 (38%), Gaps = 46/205 (22%)

Query: 96  ITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWP--------------- 140
           + D+ I++  D  D       + + + F   D ++  GAE+ CWP               
Sbjct: 46  LEDNDIVVFLDGRDAFYMRETSGLRKDFANTDKDLFLGAEKNCWPFSYNPFSNISLNLYD 105

Query: 141 ------------------DTSLYDKYPAVGSG-YRYLNSGGFIGYAKDIKELIS------ 175
                             DT   + +   G G Y + N GGFIG  K +KE +       
Sbjct: 106 PKTQERPWVWSFKAEKLCDTLKLESFKRSGEGPYAFPNGGGFIGRWKKVKEFVDLNWEVF 165

Query: 176 NRSIKNEE-DDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGSLEDIKLNFDLDEFVH 234
            + +K E+ DDQ   ++ FL   L     IVLD  A+  Q     LE +  N+ L    +
Sbjct: 166 YKVLKPEQRDDQASTSIAFL---LSIDKSIVLDNKAHFIQ-CTDRLEGVFDNYCLSNSTY 221

Query: 235 LTNTKYNTNPVIIHGNGKSKIELNS 259
           + N    T P   H NG  K+ L +
Sbjct: 222 I-NRDTKTFPYFHHHNGGGKVYLET 245


>gi|145343554|ref|XP_001416384.1| Protein Lysyl hydroxylase fusion protein, putative [Ostreococcus
           lucimarinus CCE9901]
 gi|144576609|gb|ABO94677.1| Protein Lysyl hydroxylase fusion protein, putative [Ostreococcus
           lucimarinus CCE9901]
          Length = 618

 Score = 48.1 bits (113), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 39/168 (23%), Positives = 78/168 (46%), Gaps = 11/168 (6%)

Query: 567 VTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV-GLAGVWAEFLRKY 625
           ++   C  +++  EA+     G + D+     +++V T D+ + ++  +   W     + 
Sbjct: 457 ISPSACSSWIKTAEAHATNRGGWDTDR-----HKSVATTDLPIHEIPSVLREWNLIFGQI 511

Query: 626 VVPLQEREFIGYHHEPVRAPMSFVVRY-RPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
           + P  +  F       +R   +F+V+Y   D Q  L  H D   ++I ++LN   + Y+G
Sbjct: 512 IGPFIQERFRVDGDTNLRVHDAFIVKYDASDGQCQLPVHTDQGHFSITLSLND-PIQYKG 570

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
           GG  F  +   +   + G  +     LTH   G+ +T G RYI+++F+
Sbjct: 571 GGTIFPEHE-FIVRPKCGDFVAFRSYLTH--GGVPITSGVRYIVVAFL 615


>gi|412993664|emb|CCO14175.1| predicted protein [Bathycoccus prasinos]
          Length = 289

 Score = 48.1 bits (113), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 21/104 (20%)

Query: 647 SFVVRY--RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI-------------- 690
           +FV+RY  + ++   LR H D    +  ++L+    +YEGGG  F               
Sbjct: 142 AFVIRYDGKSEKDSHLRMHQDDGPISFQVSLSDAD-EYEGGGTNFYEAKRRRTQFEEKSA 200

Query: 691 --RYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
             R   NV   ++G +L+H G++ H  EG +VT G RY ++ F+
Sbjct: 201 KERAKTNVKLEKIGDVLVHGGQIDH--EGAKVTSGLRYTLVYFL 242


>gi|297460155|ref|XP_001255106.3| PREDICTED: glycosyltransferase 25 family member 3-like, partial
           [Bos taurus]
          Length = 282

 Score = 48.1 bits (113), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 37/177 (20%), Positives = 78/177 (44%), Gaps = 22/177 (12%)

Query: 306 LEEFLNKIANLNYPAKKISMFVYNN----------QEYHAPLFDDYI------HNFKTMF 349
           L  +L  +  L+YP  +++++   +          +E+ A + D+Y             +
Sbjct: 71  LPHYLGALERLDYPRARLALWCATDHNVDNTTAMLREWLAAVGDNYAAVVWRPEGEPRSY 130

Query: 350 KNVKYIAHNSTVNSK---EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNES 406
            + +   H +    +   E +  A+  +   G D+  + D+D+ L N   L+ L+     
Sbjct: 131 PDEEGPKHWTKERHQFLMELKQEALTFARDWGADYILFADTDNILTNNQTLRLLIEPGLP 190

Query: 407 LIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
           ++AP+L      +SNFW  +   G+Y R+ DY    N  +  +G + VP + + +L+
Sbjct: 191 VVAPMLDSQ-TYYSNFWCGITPQGYYRRTADYFPTKNRQR--RGCFRVPMVHSTFLV 244


>gi|111185604|gb|AAI19700.1| Cerebral endothelial cell adhesion molecule [Homo sapiens]
          Length = 517

 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 34/151 (22%), Positives = 69/151 (45%), Gaps = 9/151 (5%)

Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           E +  A+  + + G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW 
Sbjct: 46  ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 104

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
            +   G+Y R+ +Y    N  +  +G ++VP + + +L       A  +        YT 
Sbjct: 105 GITPQGYYRRTAEYFPTKNRQR--RGCFHVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 162

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
              D  + F    +  G+ + + +   YG++
Sbjct: 163 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 192


>gi|5764665|gb|AAD51367.1|AF177203_1 cerebral cell adhesion molecule [Homo sapiens]
          Length = 517

 Score = 47.8 bits (112), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 34/151 (22%), Positives = 68/151 (45%), Gaps = 9/151 (5%)

Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           E +  A+  + + G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW 
Sbjct: 46  ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 104

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
            +   G+Y R+ +Y    N  +  +G + VP + + +L       A  +        YT 
Sbjct: 105 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 162

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
              D  + F    +  G+ + + +   YG++
Sbjct: 163 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 192


>gi|397503528|ref|XP_003822374.1| PREDICTED: glycosyltransferase 25 family member 3 [Pan paniscus]
          Length = 533

 Score = 47.8 bits (112), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 34/151 (22%), Positives = 69/151 (45%), Gaps = 9/151 (5%)

Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           E +  A+  + + G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW 
Sbjct: 62  ELKQEALTFARNWGADYILFADTDNILTNNQTLQLLMGQGLPVVAPMLDSQ-TYYSNFWC 120

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
            +   G+Y R+ +Y    N  +  +G + VP + + +L+      A  +        YT 
Sbjct: 121 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW 178

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
              D  + F    +  G+ + + +   YG++
Sbjct: 179 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 208


>gi|119608196|gb|EAW87790.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_d [Homo
           sapiens]
          Length = 534

 Score = 47.8 bits (112), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 34/151 (22%), Positives = 68/151 (45%), Gaps = 9/151 (5%)

Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           E +  A+  + + G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW 
Sbjct: 63  ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 121

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
            +   G+Y R+ +Y    N  +  +G + VP + + +L       A  +        YT 
Sbjct: 122 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 179

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
              D  + F    +  G+ + + +   YG++
Sbjct: 180 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 209


>gi|22760015|dbj|BAC11036.1| unnamed protein product [Homo sapiens]
 gi|22760023|dbj|BAC11040.1| unnamed protein product [Homo sapiens]
 gi|111185706|gb|AAI19699.1| Cerebral endothelial cell adhesion molecule [Homo sapiens]
 gi|119608195|gb|EAW87789.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_c [Homo
           sapiens]
 gi|127802779|gb|AAH98432.2| Cerebral endothelial cell adhesion molecule [Homo sapiens]
          Length = 517

 Score = 47.8 bits (112), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 34/151 (22%), Positives = 68/151 (45%), Gaps = 9/151 (5%)

Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           E +  A+  + + G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW 
Sbjct: 46  ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 104

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
            +   G+Y R+ +Y    N  +  +G + VP + + +L       A  +        YT 
Sbjct: 105 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 162

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
              D  + F    +  G+ + + +   YG++
Sbjct: 163 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 192


>gi|194388556|dbj|BAG60246.1| unnamed protein product [Homo sapiens]
          Length = 548

 Score = 47.8 bits (112), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 34/151 (22%), Positives = 68/151 (45%), Gaps = 9/151 (5%)

Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           E +  A+  + + G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW 
Sbjct: 77  ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 135

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
            +   G+Y R+ +Y    N  +  +G + VP + + +L       A  +        YT 
Sbjct: 136 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 193

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
              D  + F    +  G+ + + +   YG++
Sbjct: 194 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 223


>gi|119608194|gb|EAW87788.1| cerebral endothelial cell adhesion molecule 1, isoform CRA_b [Homo
           sapiens]
          Length = 539

 Score = 47.8 bits (112), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 34/151 (22%), Positives = 68/151 (45%), Gaps = 9/151 (5%)

Query: 365 EARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWG 424
           E +  A+  + + G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW 
Sbjct: 68  ELKQEALTFARNWGADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWC 126

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTL 479
            +   G+Y R+ +Y    N  +  +G + VP + + +L       A  +        YT 
Sbjct: 127 GITPQGYYRRTAEYFPTKNRQR--RGCFRVPMVHSTFLASLRAEGADQLAFYPPHPNYTW 184

Query: 480 NSMDYDMAFCTNLRNKGIHLKIDSTQEYGHL 510
              D  + F    +  G+ + + +   YG++
Sbjct: 185 -PFDDIIVFAYACQAAGVSVHVCNEHRYGYM 214


>gi|281202578|gb|EFA76780.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
           pallidum PN500]
          Length = 461

 Score = 47.8 bits (112), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 35/130 (26%), Positives = 57/130 (43%), Gaps = 15/130 (11%)

Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
           + ++G  G + E    Y+ P     +  Y+   + +  +FVV+Y+ D++  L  H+D S 
Sbjct: 325 LDEMGFTGFFTELRENYLKPFTSVLYADYNGAQLDSHHAFVVQYKIDKEKELGFHYDESD 384

Query: 669 YTINIALNQVGVDYEGGGCRFIRYNCNVTATRM---------GWMLMHPGRLTHYHEGLQ 719
            T+N+ L   G  + GG   F R   +   T           G  L+H G   H H  L 
Sbjct: 385 VTLNLCL---GKQFTGGSLYF-RGILDKPETHQEYFEVKHTPGTALLHIG--VHRHGALG 438

Query: 720 VTQGTRYIMI 729
           +T G R  +I
Sbjct: 439 ITSGERTNLI 448


>gi|338720573|ref|XP_001499943.3| PREDICTED: glycosyltransferase 25 family member 3-like [Equus
           caballus]
          Length = 517

 Score = 47.4 bits (111), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 31/137 (22%), Positives = 62/137 (45%), Gaps = 9/137 (6%)

Query: 379 VDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDY 438
            D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +Y
Sbjct: 60  ADYILFADTDNILTNNQTLRLLIEKGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAEY 118

Query: 439 MNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNLR 493
               N  +  +G + VP + + +L+      A  +        YT    D  + F    +
Sbjct: 119 FPTKNRQR--RGCFRVPMVHSTFLISLRAEGAAQLAFYPPHPNYTW-PFDDIIVFAYACQ 175

Query: 494 NKGIHLKIDSTQEYGHL 510
             G+ + + +   YG++
Sbjct: 176 AAGVSVHVCNQHRYGYM 192


>gi|355753026|gb|EHH57072.1| hypothetical protein EGM_06633 [Macaca fascicularis]
          Length = 536

 Score = 47.4 bits (111), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 32/138 (23%), Positives = 63/138 (45%), Gaps = 9/138 (6%)

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
           G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +
Sbjct: 81  GADYILFADTDNILTNNQTLRLLLGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 139

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNL 492
           Y    N  +  +G + VP + + +L+      A  +        YT    D  + F    
Sbjct: 140 YFPTKNRQR--RGCFRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYAC 196

Query: 493 RNKGIHLKIDSTQEYGHL 510
           +  G+ + + +   YG++
Sbjct: 197 QAAGVAVHVCNEHRYGYI 214


>gi|76154956|gb|AAX26343.2| SJCHGC08516 protein [Schistosoma japonicum]
          Length = 264

 Score = 47.4 bits (111), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 47/219 (21%), Positives = 92/219 (42%), Gaps = 47/219 (21%)

Query: 290 FPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVY------------------NNQ 331
            P++ I V I      L  FLN +    YP K+I +  Y                  N +
Sbjct: 19  MPTLCIGVLIRNKAHTLPYFLNGLEQQQYPTKRIILIFYVDNTIDTSELILNAWIQCNQK 78

Query: 332 EYHAPLFD-DYIHNFKTMFKNV-KYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDS 389
           +YH  + + D  +  +  ++NV K    N   +  + R   ++ +     +FY  +D+D 
Sbjct: 79  KYHKIILEVDKSNTSQLEYENVNKMWTVNHYQHVIKLRQKLLDKARDLWANFYLSIDADV 138

Query: 390 HLDNPDVLKYLVNRNES------------------------LIAPLL-VRPFKAWSNFWG 424
            L N   +++L+N                            ++APL+     + +SNFWG
Sbjct: 139 ILMNSLTIEHLINAMHPSQSNNNNNNNNDNNNNNTMNKNIIILAPLINCTTSEYYSNFWG 198

Query: 425 ALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
           A++ +G+Y RS  Y +++   +  +G++ V  + + +L+
Sbjct: 199 AMSEEGYYVRSEHYFDLL--KRHIQGVYPVAMVHSIFLV 235


>gi|323452702|gb|EGB08575.1| hypothetical protein AURANDRAFT_63938 [Aureococcus anophagefferens]
          Length = 640

 Score = 47.4 bits (111), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 6/88 (6%)

Query: 648 FVVRY---RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWM 704
           F VRY    P  Q  L  H D+S  + ++A++    D+ GGG RF+     +     G +
Sbjct: 516 FYVRYDADAPGAQTELEAHRDASLLSFSVAMSSPD-DFVGGGTRFVGSGRVLRPEAAGDL 574

Query: 705 LMHPGRLTHYHEGLQVTQGTRYIMISFV 732
           + H G++   H G  VT G R I++ FV
Sbjct: 575 VAHSGKV--LHAGEAVTAGVRDILVGFV 600


>gi|47181445|emb|CAG13372.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 47

 Score = 47.4 bits (111), Expect = 0.028,   Method: Composition-based stats.
 Identities = 20/45 (44%), Positives = 29/45 (64%), Gaps = 1/45 (2%)

Query: 450 GIWNVPYITNCYLMKTSVIKAT-NIKTIYTLNSMDYDMAFCTNLR 493
           G+WN+PY+ + YL+K S +K   N +  + L  +D DMAFC N R
Sbjct: 1   GVWNIPYMAHVYLIKGSALKKELNERNYFVLEKLDPDMAFCRNAR 45


>gi|397639611|gb|EJK73670.1| hypothetical protein THAOC_04695, partial [Thalassiosira oceanica]
          Length = 543

 Score = 47.4 bits (111), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 34/151 (22%), Positives = 66/151 (43%), Gaps = 6/151 (3%)

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN---ADGFYAR 434
           G      VDSD  +     ++ L   + S++     RP K W+N+W  ++      +Y R
Sbjct: 41  GAGQALMVDSDVIIARSSAVQDLAGWSRSVVVGHAQRPGKYWANYWTDMDHTAGSQWYKR 100

Query: 435 SFDYMNIINGDQGGKGIWNVPYITNCYLMKTSVIKA-TNIKTIYTLNSMDYDMAFCTNLR 493
            FD ++I N  +  +G++ VP+     L++    K   ++ +    +  D     C    
Sbjct: 101 GFDTLDIYN--RARQGLFQVPFGRGLVLVQRGEFKRLADLFSKLKGHGSDTMRRLCLKST 158

Query: 494 NKGIHLKIDSTQEYGHLVDSENFDPQKTNPE 524
           + G+ L ID+ + YG + D +  +    + E
Sbjct: 159 DVGLPLYIDNQRNYGRIYDPDAKESDANDDE 189


>gi|440791367|gb|ELR12605.1| ankyrin repeat-containing protein [Acanthamoeba castellanii str.
           Neff]
          Length = 463

 Score = 47.4 bits (111), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 39/178 (21%), Positives = 74/178 (41%), Gaps = 26/178 (14%)

Query: 559 PDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW 618
           P ++ FP++   FC   ++ ++   Q          +   Y  +      +  VG   + 
Sbjct: 285 PGIYSFPMLKLSFCDRLLEELDHLEQSGLSLKRPNSM-NAYGVI------LSDVGFKEMM 337

Query: 619 AEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV 678
            + +R+YV PL    +     + +    SFVV+Y+  E   L+ H D S  T+N++L   
Sbjct: 338 HQLMRRYVCPLATLLYAEQGGDSLDRLHSFVVKYKIGEDLDLKEHVDDSEVTLNVSL--- 394

Query: 679 GVDYEGGGCRF-----------IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTR 725
           G  + GG   F             + C  +    G  ++H G  +H+H  L++++  R
Sbjct: 395 GKSFAGGDLDFNGVANTPTSKNDHFTCGHSP---GVAVLHLG--SHWHSALKISECRR 447


>gi|356536266|ref|XP_003536660.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
           [Glycine max]
          Length = 344

 Score = 47.4 bits (111), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 40/194 (20%), Positives = 82/194 (42%), Gaps = 25/194 (12%)

Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIH--- 608
           ++  +PC  V+ F ++  +FC + +  ++ + +W  GT    +L+        ++ H   
Sbjct: 111 SIMAEPCKGVYTFEMLQPQFCKKLMSEVDHFERWVHGT----KLKIMRPNAMNKNKHGVI 166

Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
           +       +   F+  ++ P+ +  +       + +   FVV Y  ++   L  H D + 
Sbjct: 167 LDDFAFEAMLDRFMCDFIRPISQVFYPELGGSSLDSHHGFVVEYGINKDVELGLHEDEAE 226

Query: 669 YTINIALNQVGVDYEGGGCRF--IRYNCNVTATRM-----------GWMLMHPGRLTHYH 715
            T+N+ L   G ++ GG   F  +R + +VT+              G  ++HPGR  + H
Sbjct: 227 VTLNVCL---GKEFSGGELFFQGVRCDAHVTSNAQPEEAFNYSHVPGHAILHPGR--NRH 281

Query: 716 EGLQVTQGTRYIMI 729
                T G R  +I
Sbjct: 282 GARPTTSGNRMNLI 295


>gi|428169371|gb|EKX38306.1| hypothetical protein GUITHDRAFT_144412 [Guillardia theta CCMP2712]
          Length = 233

 Score = 47.0 bits (110), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 28/104 (26%), Positives = 48/104 (46%), Gaps = 10/104 (9%)

Query: 83  KVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
           K+  LK+ LD +   +D II+  D+YDV+ +  +  +L+ F      +V+ AE  C    
Sbjct: 39  KIYALKDFLDMVPFEEDNIIVFVDAYDVLFNRTIKYLLKEFLKMKHRVVYSAEVGCSAGR 98

Query: 143 SLYDK----------YPAVGSGYRYLNSGGFIGYAKDIKELISN 176
               +          YP V +   YLNSG  + Y + +K  + +
Sbjct: 99  EALSRRSTACDRGWPYPGVNTVAPYLNSGATMAYQRQLKLFLES 142


>gi|355567431|gb|EHH23772.1| hypothetical protein EGK_07313, partial [Macaca mulatta]
          Length = 595

 Score = 47.0 bits (110), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 32/138 (23%), Positives = 62/138 (44%), Gaps = 9/138 (6%)

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFD 437
           G D+  + D+D+ L N   L+ L+ +   ++AP+L      +SNFW  +   G+Y R+ +
Sbjct: 137 GADYILFADTDNILTNNQTLRLLMGQGLPVVAPMLDSQ-TYYSNFWCGITPQGYYRRTAE 195

Query: 438 YMNIINGDQGGKGIWNVPYITNCYLMKTSVIKATNIKTI-----YTLNSMDYDMAFCTNL 492
           Y    N  +  +G   VP + + +L+      A  +        YT    D  + F    
Sbjct: 196 YFPTKNRQR--RGCLRVPMVHSTFLVSLRAEGADQLAFYPPHPNYTW-PFDDIIVFAYAC 252

Query: 493 RNKGIHLKIDSTQEYGHL 510
           +  G+ + + +   YG++
Sbjct: 253 QAAGVAVHVCNEHRYGYI 270


>gi|359451740|ref|ZP_09241134.1| hypothetical protein P20480_3882 [Pseudoalteromonas sp. BSi20480]
 gi|358042468|dbj|GAA77383.1| hypothetical protein P20480_3882 [Pseudoalteromonas sp. BSi20480]
          Length = 304

 Score = 47.0 bits (110), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 40/152 (26%), Positives = 62/152 (40%), Gaps = 20/152 (13%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
           G   DKR E GY A P+             + E L  Y+ P+      E  GY  +    
Sbjct: 134 GVMLDKRSE-GYLAAPS---------FQTFYNEMLNTYMRPIARLLFPEITGYDTQT--- 180

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF---IRYNCNVTATRM 701
              F + Y P    S+RPH D+S  T+NI LN  G ++ G    F   +       + + 
Sbjct: 181 -FGFSIYYDPSTDASIRPHTDASAVTLNINLNLPGEEFTGSELDFYDLVTGKVTQLSFKP 239

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           G  ++H G + H  + +     T +++  F D
Sbjct: 240 GIAMIHRGSVAHAAKPITSGDRTNFVLWLFGD 271


>gi|351702454|gb|EHB05373.1| Glycosyltransferase 25 family member 1, partial [Heterocephalus
           glaber]
          Length = 517

 Score = 47.0 bits (110), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 34/130 (26%), Positives = 64/130 (49%), Gaps = 12/130 (9%)

Query: 388 DSHLDNPD-VLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIINGDQ 446
           +S L++P  V   L+  N +++AP+L     A+SNFW  +   G+Y R+  Y+ I   ++
Sbjct: 82  ESALNSPGFVSSLLIAENRTVVAPML-DSRAAYSNFWCGMTPQGYYRRTPAYIPIRKRER 140

Query: 447 GGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYDMA------FCTNLRNKGIHLK 500
             +G + VP + + +L+   + KA +    +     DY  A      F  + +  G+ + 
Sbjct: 141 --QGCFAVPMVHSTFLL--DLRKAASRSLAFYPPHPDYTWAFDDIIVFAFSCKQAGVQMY 196

Query: 501 IDSTQEYGHL 510
           + + Q YG L
Sbjct: 197 VCNKQVYGFL 206


>gi|299473298|emb|CBN77697.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 713

 Score = 47.0 bits (110), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 74/170 (43%), Gaps = 13/170 (7%)

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVW-AEFLRK 624
           +++ + C   +Q  E Y Q + G    +     + AVPT D+ +  +     W    +R+
Sbjct: 453 VLSPEECRGVIQAAEDYSQANGGWTTSR-----HYAVPTTDLPVHALKSTLPWFRSLVRE 507

Query: 625 YVVPLQEREF-IGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVD-Y 682
            + P   + F +      V    +FVVRY   +Q  L  H D ST++  IALN  G+D Y
Sbjct: 508 RLFPALAKRFNLAAGPRRVFVHDAFVVRYEEGKQRHLPLHRDQSTHSFTIALN--GLDQY 565

Query: 683 EGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
            GGG  F     ++       +    G L   H G  + +G RYI+  F 
Sbjct: 566 TGGGTFFPSLGRSLRPAEGHALSFRGGIL---HGGDPLLKGVRYIIACFC 612


>gi|119470273|ref|ZP_01613032.1| hypothetical protein ATW7_12953 [Alteromonadales bacterium TW-7]
 gi|119446445|gb|EAW27720.1| hypothetical protein ATW7_12953 [Alteromonadales bacterium TW-7]
          Length = 304

 Score = 47.0 bits (110), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 40/152 (26%), Positives = 62/152 (40%), Gaps = 20/152 (13%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
           G   DKR E GY A P+             + E L  Y+ P+      E  GY  +    
Sbjct: 134 GVMLDKRSE-GYLAAPS---------FQTFYNEMLNTYMRPIARLLFPEITGYDTQT--- 180

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF---IRYNCNVTATRM 701
              F + Y P    S+RPH D+S  T+NI LN  G ++ G    F   +       + + 
Sbjct: 181 -FGFSIYYDPSTDASIRPHTDASAVTLNINLNLPGEEFTGSELDFYDLVTGKVTQLSFKP 239

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           G  ++H G + H  + +     T +++  F D
Sbjct: 240 GIAMIHRGSVAHAAKPITSGDRTNFVLWLFGD 271


>gi|320101633|ref|YP_004177224.1| alkyl hydroperoxide reductase [Isosphaera pallida ATCC 43644]
 gi|319748915|gb|ADV60675.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Isosphaera pallida ATCC 43644]
          Length = 367

 Score = 46.6 bits (109), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 44/182 (24%), Positives = 70/182 (38%), Gaps = 27/182 (14%)

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPT--------RDIHMKQVGLAGV 617
           I    FC + ++I EA G +  G       E G + VP         RD  +K + +   
Sbjct: 177 IFEPHFCRQLIEIYEADGGYESGFMR----EVGGKTVPVHDHSHKRRRDCEIKDLQVIQA 232

Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST-------YT 670
               L++ ++P   + F     E  R     V  Y        R H D++T       + 
Sbjct: 233 CQMRLKRRLIPEIHKSF---QFEATRIERHIVACYDASTGGHFRAHRDNTTKGTAHRRFA 289

Query: 671 INIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMIS 730
           I++ LN    D++GG  RF  +        +G  ++    L   HE   VT G RY  + 
Sbjct: 290 ISLNLND---DFQGGDLRFAEFGPRTYRAPVGGAVVFSCSL--LHEATPVTAGKRYAFLP 344

Query: 731 FV 732
           F+
Sbjct: 345 FL 346


>gi|410629220|ref|ZP_11339927.1| hypothetical protein GMES_4430 [Glaciecola mesophila KMM 241]
 gi|410151244|dbj|GAC26696.1| hypothetical protein GMES_4430 [Glaciecola mesophila KMM 241]
          Length = 303

 Score = 46.6 bits (109), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 38/152 (25%), Positives = 65/152 (42%), Gaps = 20/152 (13%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
           G   D R E GY A P+     +Q+         L +Y+ P+      E +GY  +    
Sbjct: 133 GAMLDSRSE-GYLAAPSFQAFYRQI---------LDRYMRPIARLLFPEIVGYDTQT--- 179

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVT---ATRM 701
              F + Y+P+   S+RPH D+S  T+NI LN     + G    F   +       A + 
Sbjct: 180 -FGFSIHYKPNTDTSIRPHTDASAVTLNINLNLPDESFTGSNVDFYDPSTGKMIGLAFKP 238

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           G  ++H G + H  + +   + T +++  + D
Sbjct: 239 GSAMIHRGNVVHAAQPITSGERTNFVLWLYGD 270


>gi|374619955|ref|ZP_09692489.1| putative iron-regulated protein [gamma proteobacterium HIMB55]
 gi|374303182|gb|EHQ57366.1| putative iron-regulated protein [gamma proteobacterium HIMB55]
          Length = 341

 Score = 46.6 bits (109), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 46/185 (24%), Positives = 73/185 (39%), Gaps = 27/185 (14%)

Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGL 614
           N P P +     + E    + +  ++  G+   G   D   +      P RD+  +    
Sbjct: 136 NLPAPVLVIPDAIDEALAEDLIHYLD--GREDHGFVADGDFKRRLHIHPDRDLEHR---- 189

Query: 615 AGVWAEFLRKYVVPLQEREFIG--YHHEPVRAPMSFVVRYRPDEQPSLRPHHDS------ 666
                + L K V+P  E+ F     H E  +     + RY          H D+      
Sbjct: 190 ---LDDKLCKSVLPEIEKVFYSEITHRETYK-----ICRYDGTNSGKFGKHRDTIAPHLH 241

Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRY 726
             Y I + LN    DYEGGG  F  YN  V +      ++ PG L  +H+   ++ G+RY
Sbjct: 242 RRYAITLVLND---DYEGGGIAFPEYNSEVLSIPKYGAVVFPGSL--FHQVNNISSGSRY 296

Query: 727 IMISF 731
           ++ISF
Sbjct: 297 VIISF 301


>gi|333894574|ref|YP_004468449.1| 2OG-Fe(II) oxygenase [Alteromonas sp. SN2]
 gi|332994592|gb|AEF04647.1| 2OG-Fe(II) oxygenase [Alteromonas sp. SN2]
          Length = 303

 Score = 46.2 bits (108), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 38/147 (25%), Positives = 60/147 (40%), Gaps = 20/147 (13%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
           G   DKR E GY A P+             +   L  Y+ P+      E +GY  +    
Sbjct: 133 GAMLDKRSE-GYLAAPS---------FQAFYRTMLDTYMRPIARLLFPEIMGYDAQT--- 179

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNC---NVTATRM 701
              F + Y+P+   S+RPH D+S  T+NI LN  G  + G    F        N      
Sbjct: 180 -FGFSIHYQPNTDTSIRPHTDASAVTLNINLNVPGETFTGSTVDFYDVKAGKVNPLTFTP 238

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIM 728
           G  ++H G + H  + +   + T +++
Sbjct: 239 GSAMLHRGNVPHAAQPITSGERTNFVL 265


>gi|390176657|ref|XP_003736142.1| GA16561 [Drosophila pseudoobscura pseudoobscura]
 gi|160395573|sp|Q29NU5.2|GLT25_DROPS RecName: Full=Glycosyltransferase 25 family member; Flags:
           Precursor
 gi|388858689|gb|EIM52215.1| GA16561 [Drosophila pseudoobscura pseudoobscura]
          Length = 626

 Score = 46.2 bits (108), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 45/210 (21%), Positives = 86/210 (40%), Gaps = 33/210 (15%)

Query: 289 QFPSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFDDYIHNFKTM 348
           Q P+VL+++ +      L  FL+ +   +YP  +I+ ++  +        DD I   K  
Sbjct: 34  QPPTVLVALLVRNKAHILPMFLSYLEQQDYPKDRIAFWLRCDHSS-----DDSIDLLKQW 88

Query: 349 FKNVKYIAH--NSTVNSKEARNLAVENSLHKG-----------------------VDFYF 383
            K+   + H  N   +S        E+S +                          DF F
Sbjct: 89  LKHSGDLYHSVNYAFDSDGPHGYQNESSPYDWTVSRFKHVIALKEEAFTYARDIWADFVF 148

Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
           ++D+D  L +   L+ L      ++AP+L+     +SNFW  +  + +Y R+ +Y  I +
Sbjct: 149 FLDADVLLTSQQALRTLTALRLPIVAPMLLSE-SLYSNFWCGMTEEYYYQRTDEYKEIYH 207

Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKATNI 473
             +  +G + VP +    L+  +   A N+
Sbjct: 208 VKK--QGSFPVPMVHTAVLVDMNHKGARNL 235


>gi|412990294|emb|CCO19612.1| predicted protein [Bathycoccus prasinos]
          Length = 672

 Score = 46.2 bits (108), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 49/226 (21%), Positives = 96/226 (42%), Gaps = 21/226 (9%)

Query: 513 SENFDPQKTNPEVY--ELIRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPI--VT 568
           S++   QK   E Y   L R+P +   ++  P + K+ +           ++  P   ++
Sbjct: 454 SQSLKKQKRKDEAYLTRLSRSPSNLWHKF-EPSFSKTWITLAAGA-----IWLLPSHNIS 507

Query: 569 EKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV-GLAGVWAEFLRKYVV 627
           ++ C  ++ + E +     G   ++ +     +VPT D+ +  +  L   W  F+   ++
Sbjct: 508 KRTCEHWIVLAEKFASLQGGWCTNRHI-----SVPTTDLPVHLIPELVDEWNLFVFSDLI 562

Query: 628 PLQEREFI-GYHHEPVRAPMSFVVRYRPDEQPSLRP-HHDSSTYTINIALNQVGVDYEGG 685
           PL ++        + +    +F+V+Y   +     P H D S  ++ IALN    D+ GG
Sbjct: 563 PLAKQILTTSMFRKRLCVHDAFIVKYDASKGCDHLPIHRDQSEISVTIALNS-NSDFSGG 621

Query: 686 GCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISF 731
           G             ++G +L+  G L   H G  +  G RYI+ +F
Sbjct: 622 GGTMFPNLGITICPKIGEILLFRGDLE--HSGFPINGGIRYIVAAF 665


>gi|356531172|ref|XP_003534152.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
           [Glycine max]
          Length = 379

 Score = 45.8 bits (107), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 48/207 (23%), Positives = 80/207 (38%), Gaps = 33/207 (15%)

Query: 537 LRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLE 596
           LR I+   ++S+   ++ ++P P +F F I    FC   +  +E + +W +        E
Sbjct: 131 LRAINDNTEQSI--RSIVSEPSPGIFIFDIFQTHFCELLLSEIENFEKWVN--------E 180

Query: 597 TGYEAVPTRDIH-----MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVR 651
           T +  +    ++     +   GL  +  + +  ++ PL    F       + +   FVV 
Sbjct: 181 TKFRIMRPNTMNKFGAVLDDFGLETMLDKLMEGFIRPLSRVFFAEVGGSTLDSHHGFVVE 240

Query: 652 YRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNV---TATR-------- 700
           Y  D    L  H D S  T+N+ L   G  + GG   F    C     T T         
Sbjct: 241 YGKDRDVDLGFHVDDSEVTLNVCL---GKQFSGGELFFRGIRCEKHVNTGTHSEEIFDYS 297

Query: 701 --MGWMLMHPGRLTHYHEGLQVTQGTR 725
             +G  ++H GR  H H     T G R
Sbjct: 298 HVLGRAVLHRGR--HRHGARATTSGNR 322


>gi|169234704|ref|NP_001108473.1| chromosome associated protein D3 [Bombyx mori]
 gi|18700451|dbj|BAB85193.1| hypothetical protein [Bombyx mori]
 gi|22474509|dbj|BAC10614.1| hypothetical protein [Bombyx mori]
          Length = 407

 Score = 45.8 bits (107), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 62/121 (51%), Gaps = 10/121 (8%)

Query: 385 VDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIING 444
           +D+D  L N + LK L+ ++ ++++P+L+     +SNFW  +  + +Y R+ DY  I+N 
Sbjct: 2   LDADVILTNIETLKVLIAKDFTVVSPMLMSD-GVYSNFWCGMTENYYYKRTDDYKPILN- 59

Query: 445 DQGGKGIWNVPYITNCYLMKTSVIKATNIKTIYTLNSMDYD------MAFCTNLRNKGIH 498
            +   G ++VP + +  L+       ++  T Y     +YD      +AF  N +  G H
Sbjct: 60  -RKKTGCFDVPMVHSAVLISMRY-DVSDKLTYYPSKITNYDGPEDDIIAFALNSKALGEH 117

Query: 499 L 499
           +
Sbjct: 118 V 118


>gi|410615971|ref|ZP_11326967.1| hypothetical protein GPLA_0186 [Glaciecola polaris LMG 21857]
 gi|410164453|dbj|GAC31105.1| hypothetical protein GPLA_0186 [Glaciecola polaris LMG 21857]
          Length = 304

 Score = 45.8 bits (107), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 65/149 (43%), Gaps = 18/149 (12%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMS 647
           G   D R E GY A P+     +Q  +  ++   + + + P    E IGY  +       
Sbjct: 134 GAMLDSRSE-GYLAAPSFQTFYRQ--MIDMYMRPIARMLFP----EIIGYDDQA----FG 182

Query: 648 FVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATR-----MG 702
           F + YRP+   S+RPH D+S  T+NI LN     + G    F  Y+      +      G
Sbjct: 183 FSIHYRPNTDNSIRPHTDASAVTLNINLNPPDALFTGSTVDF--YDSETGKMKGITFTPG 240

Query: 703 WMLMHPGRLTHYHEGLQVTQGTRYIMISF 731
             ++H G++ H  + +   + T +++  F
Sbjct: 241 SAILHRGKVVHAAQPITSGERTNFVLWLF 269


>gi|414070939|ref|ZP_11406917.1| hypothetical protein D172_2149 [Pseudoalteromonas sp. Bsw20308]
 gi|410806688|gb|EKS12676.1| hypothetical protein D172_2149 [Pseudoalteromonas sp. Bsw20308]
          Length = 304

 Score = 45.8 bits (107), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 38/147 (25%), Positives = 62/147 (42%), Gaps = 20/147 (13%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
           G   D R E GY A P+             + E L  Y+ P+      E IGY  +    
Sbjct: 134 GAMLDSRSE-GYLAAPS---------FQAFYNEILNTYMRPISRLLFPEIIGYDTQT--- 180

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIR-YNCNVTAT--RM 701
              F + Y P    S+RPH D+S  T+NI LN    ++ G    F   Y  ++ +   + 
Sbjct: 181 -FGFSIYYDPSTDASIRPHTDASAVTLNINLNLPSEEFTGSEVDFYHPYTGDIKSLTFKP 239

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIM 728
           G  ++H G + H  + +   + T +++
Sbjct: 240 GTAMLHRGNIAHAAKPITSGERTNFVL 266


>gi|158513401|sp|A3KGZ2.1|OGFD2_DANRE RecName: Full=2-oxoglutarate and iron-dependent oxygenase
           domain-containing protein 2
          Length = 345

 Score = 45.8 bits (107), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 45/186 (24%), Positives = 80/186 (43%), Gaps = 21/186 (11%)

Query: 553 VNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV 612
           +  +    VF F +  ++FC + ++ +E + Q SD           Y  V      + ++
Sbjct: 123 IQTEAASRVFRFQVFRKEFCKDLLEELEHFEQ-SDAPKGRPNTMNNYGIV------LNEL 175

Query: 613 GLAGVWAEFLRK-YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
           G    +   LR+ Y+ PL    +       + +  +FVV+Y   E  +L  H+D+S  T+
Sbjct: 176 GFDEGFITPLREVYLRPLTALLYSDCGGNCLDSHKAFVVKYDMHEDLNLSYHYDNSEVTL 235

Query: 672 NIALNQVGVDYEGGGCRF--------IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQG 723
           N++L   G D+  G   F            C     R+   L+H G+  H H  L ++ G
Sbjct: 236 NVSL---GKDFTEGNLFFGDMRQVPLSETECVEVEHRVTEGLLHRGQ--HMHGALSISSG 290

Query: 724 TRYIMI 729
           TR+ +I
Sbjct: 291 TRWNLI 296


>gi|332535541|ref|ZP_08411316.1| hypothetical protein PH505_cy00110 [Pseudoalteromonas haloplanktis
           ANT/505]
 gi|332035040|gb|EGI71558.1| hypothetical protein PH505_cy00110 [Pseudoalteromonas haloplanktis
           ANT/505]
          Length = 304

 Score = 45.4 bits (106), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 37/147 (25%), Positives = 64/147 (43%), Gaps = 20/147 (13%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQER---EFIGYHHEPVRA 644
           G   D R E GY A P+             + E L  Y+ P+      E +G+  +    
Sbjct: 134 GAMLDSRSE-GYLAAPS---------FQAFYNEILNTYMRPISRMLFPEVMGFDTQT--- 180

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIR-YNCNVTAT--RM 701
              F + Y P+   S+RPH D+S  T+NI LN  G ++ G    F   Y  ++ +   + 
Sbjct: 181 -FGFSIYYEPNTDSSIRPHTDASAVTLNINLNLPGEEFTGSQVGFYHPYTGDIKSLTFKP 239

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIM 728
           G  ++H G + H  + +   + T +++
Sbjct: 240 GTAMLHRGNIAHAAKPITSGERTNFVL 266


>gi|342876206|gb|EGU77862.1| hypothetical protein FOXB_11626 [Fusarium oxysporum Fo5176]
          Length = 517

 Score = 45.4 bits (106), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 60/262 (22%), Positives = 100/262 (38%), Gaps = 63/262 (24%)

Query: 20  SVHCN---KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMS 76
           SVH     K K I   +   +  ASN  D +   + SA VN+     +     W G    
Sbjct: 58  SVHAPSKPKKKVIKTSQLHFLLPASNPNDMFCAIVTSALVNRYPAPYM---VGWKGEGKY 114

Query: 77  SLGGGYKVNL--LKNELDEMDIT--DDMIILVTDSYDVIIDGGVNDILERFNTFDAN--- 129
           +    +   L  +K  LDE+     DD ++   D YDV+    V  ++ER+    A+   
Sbjct: 115 NASAAHTAKLYSIKKYLDELPQGGDDDDLVFFGDGYDVMAQLPVEVVIERYFKVAADADR 174

Query: 130 ---------------------IVFGAERLCWPD------TSLYDKY-------PAVGSG- 154
                                + +GA+++CWP       T +   +       P  G G 
Sbjct: 175 RLADRFGITVEEAHKRGLKQTLFWGADKMCWPALNEAQCTKIPGSHLPRNVYGPKTGGGD 234

Query: 155 --YR---YLNSGGFIGYAKDIKELIS----------NRSIKNEEDDQLYYALLFLDETLR 199
             YR   Y NSG  IG   D++  I+          + + K +  DQ+Y A L+  + L 
Sbjct: 235 VTYRDAKYFNSGSVIGPVGDLRNFINAGIASLEETFDPNFKYKTSDQIYLARLYARQELS 294

Query: 200 TKHKIVLDTLANLFQNLYGSLE 221
              +I  +++   F +   ++E
Sbjct: 295 RAEQIENESMMASFGDNATAVE 316


>gi|428178407|gb|EKX47282.1| hypothetical protein GUITHDRAFT_46957, partial [Guillardia theta
           CCMP2712]
          Length = 314

 Score = 45.4 bits (106), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 49/192 (25%), Positives = 78/192 (40%), Gaps = 36/192 (18%)

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
           V+ FP++    C + V     +  ++      ++LE  +++     + M ++G    W  
Sbjct: 100 VYSFPLLQPSLCQQIVSCANDFAAFT----RQEKLEGKFDSERPAVLDMMKLG----WIN 151

Query: 621 --FLRKYVVPLQEREFIG-YHHEPVRAPMSFVVRYRPDEQ----PSLRP-------HHDS 666
              LR+ V PL E  F      E +     ++V Y P +        RP       H D 
Sbjct: 152 DMLLRQVVSPLAEALFEDELEGETLDWRHGYIVGYAPKDPGQAGTEFRPRRNHLVSHTDD 211

Query: 667 STYTINIALNQVGVDYEGGGCRF---------IRYNCNVTATRMGWMLMHPGRLTHYHEG 717
           S  T+N+ L     DY+GG   F         ++   +  A   GW ++H GR    HE 
Sbjct: 212 SEITLNVCLQS---DYQGGELVFHGRRGSGEELKTLGSFKAPAPGWAVLHVGR--QLHEV 266

Query: 718 LQVTQGTRYIMI 729
           L VT G RY +I
Sbjct: 267 LPVTGGKRYGLI 278


>gi|428164686|gb|EKX33703.1| hypothetical protein GUITHDRAFT_147722 [Guillardia theta CCMP2712]
          Length = 771

 Score = 45.4 bits (106), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 48/100 (48%), Gaps = 4/100 (4%)

Query: 633 EFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRY 692
           E  G+    V     F+VRY    Q  L  H D +  T ++ LN+   D++GGG  F   
Sbjct: 342 ERFGFRAGEVTPVDVFLVRYTGAGQNQLSVHRDGALMTFSLLLNEAS-DFQGGGT-FFEE 399

Query: 693 NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
           +  V     G  ++H G++ H   G  ++ G+RYI++ F 
Sbjct: 400 DGLVFRHEQGVAVLHSGKIRHG--GYPISSGSRYILVGFC 437


>gi|392532697|ref|ZP_10279834.1| 2OG-Fe(II) oxygenase [Pseudoalteromonas arctica A 37-1-2]
          Length = 304

 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 37/147 (25%), Positives = 63/147 (42%), Gaps = 20/147 (13%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQER---EFIGYHHEPVRA 644
           G   D R E GY A P+             + E L  Y+ P+      E  G+  +    
Sbjct: 134 GAMLDSRSE-GYLAAPS---------FQAFYNEILNTYMRPISRMLFPEVTGFDTQT--- 180

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIR-YNCNVTAT--RM 701
              F + Y P+   S+RPH D+S  T+NI LN  G ++ G    F   Y  ++ +   + 
Sbjct: 181 -FGFSIYYEPNTDSSIRPHTDASAVTLNINLNLPGEEFTGSEVDFYHPYTGDIKSLTFKP 239

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIM 728
           G  ++H G + H  + +   + T +++
Sbjct: 240 GTAMLHRGNIAHAAKPITSGERTNFVL 266


>gi|348522203|ref|XP_003448615.1| PREDICTED: 2-oxoglutarate and iron-dependent oxygenase
           domain-containing protein 2-like [Oreochromis niloticus]
          Length = 350

 Score = 45.1 bits (105), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 54/225 (24%), Positives = 96/225 (42%), Gaps = 26/225 (11%)

Query: 519 QKTNPEVYELIRNPLDWDLRYIHPEYQKS-----LLPDTVNNQPCPDVFWFPIVTEKFCH 573
           Q  +P VY L  + L  + + I    Q S      L D +  Q  P V+ FP+  + FC 
Sbjct: 91  QPLHPHVYHLQESYLASEFKQIVEYCQSSNATEEGLLDLLEEQAAPRVYRFPLFDKSFCE 150

Query: 574 EFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLR-KYVVPLQER 632
           + ++ +E + Q S            Y       I + ++G    +   LR +Y++PL   
Sbjct: 151 DLMEELEHFEQ-SGAPKGRPNTMNQY------GILLNELGFDEHFITPLREQYLLPLTSL 203

Query: 633 EFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF--- 689
            +       + +  +FVV+Y  +E   L  H+D++  T+N++L   G ++  G   F   
Sbjct: 204 LYPDCGGRCLDSHKAFVVKYDMNEDLDLSYHYDNAEVTLNVSL---GKEFTEGNLYFGDM 260

Query: 690 -----IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
                    C+    R+   L+H G+  H H  L ++ G R+ +I
Sbjct: 261 RQVPVSETECSEVEHRVTEGLLHRGQ--HMHGALPISSGQRWNLI 303


>gi|444721252|gb|ELW61996.1| Outer dense fiber protein 2 [Tupaia chinensis]
          Length = 1465

 Score = 45.1 bits (105), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 24/90 (26%), Positives = 45/90 (50%), Gaps = 3/90 (3%)

Query: 384 YVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALNADGFYARSFDYMNIIN 443
           + D+D+ L N   L+ L+ R   ++AP+L      +SNFW  +   G+Y R+ +Y    N
Sbjct: 122 FADTDNILTNNQTLRLLLERGLPVVAPML-DSQTYYSNFWCGITPQGYYRRTAEYFPTKN 180

Query: 444 GDQGGKGIWNVPYITNCYLMKTSVIKATNI 473
             +  +G + VP + + +L+      A  +
Sbjct: 181 RQR--QGCFRVPMVHSTFLVSLRAEGAAQL 208


>gi|404370072|ref|ZP_10975399.1| hypothetical protein CSBG_02623 [Clostridium sp. 7_2_43FAA]
 gi|226913796|gb|EEH98997.1| hypothetical protein CSBG_02623 [Clostridium sp. 7_2_43FAA]
          Length = 641

 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 42/169 (24%), Positives = 71/169 (42%), Gaps = 23/169 (13%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWS----NF 422
           +N  +E +L +  D+ F VDSD  + NP  LK LV+ N+ +++ +    +K  S      
Sbjct: 94  KNTIIEKALKEDYDYLFLVDSDLVM-NPKTLKRLVSLNKEIVSNIFWTRWKPNSYEQPQV 152

Query: 423 W-------------GALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM-KTSVI 468
           W               L       R+ D++N++       G + V  +  C L+ K ++ 
Sbjct: 153 WLKDMYTLYDFEHGERLRESEVIKRTADFINMLRK----PGTYKVGGLGACTLISKEALS 208

Query: 469 KATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVDSENFD 517
           K  N   IY ++    D  FC      GI L +D+     H+   E+ D
Sbjct: 209 KGVNFNPIYNVSFWGEDRHFCIRAAVLGIQLYVDTYYPAHHIYRDEDLD 257


>gi|449678073|ref|XP_002156385.2| PREDICTED: procollagen galactosyltransferase 1-like, partial [Hydra
           magnipapillata]
          Length = 425

 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 61/114 (53%), Gaps = 12/114 (10%)

Query: 407 LIAPLLVRPFKA---WSNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM 463
           +++P+L R F A   +SNFWGA++  G+Y R  +Y  ++  +    G++ VP + +  L+
Sbjct: 2   IVSPML-RSFDADGLYSNFWGAMDERGYYKRVPEYFTLLKRET--LGVYYVPMVHSTMLI 58

Query: 464 K-----TSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVD 512
                 T  +    I   Y+   +D  + F  + ++ GI+L I +T+ +G L++
Sbjct: 59  DMRSNLTDTLMFYPIPLSYS-GVIDDILVFAQSAKHSGINLAICNTEVFGFLLN 111


>gi|449279299|gb|EMC86934.1| 2-oxoglutarate and iron-dependent oxygenase domain-containing
           protein 2, partial [Columba livia]
          Length = 289

 Score = 44.7 bits (104), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 42/176 (23%), Positives = 74/176 (42%), Gaps = 17/176 (9%)

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
           ++  P+ TE+FC  FV  +E + Q SD           Y  +      + ++G+   +  
Sbjct: 74  IYRLPVFTEEFCQAFVDELENFEQ-SDMPKGRPNSMNNYGVL------LNELGMDETFIT 126

Query: 621 FLR-KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
            LR KY+ P+    +       + +  +FVV+Y   E   L  H+D++  T+N++L   G
Sbjct: 127 PLREKYLRPITALLYPDLGGSCLDSHKAFVVKYSLHEDLDLSSHYDNAEVTLNVSL---G 183

Query: 680 VDYEGGGCRFIRYNCNVTATRMGWMLMHPG------RLTHYHEGLQVTQGTRYIMI 729
            D+  G   F  +N +         + H G      R    H  L +  G R+ +I
Sbjct: 184 KDFTEGNLYFGDFNQDPAPVPKYIEIEHVGAHGLLHRGGQIHGALPIASGERWNLI 239


>gi|392537775|ref|ZP_10284912.1| 2OG-Fe(II) oxygenase [Pseudoalteromonas marina mano4]
          Length = 304

 Score = 44.7 bits (104), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 40/148 (27%), Positives = 62/148 (41%), Gaps = 20/148 (13%)

Query: 592 DKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRAPMSF 648
           DKR  +GY A P              + E L  Y+ P+      E +GY  +       F
Sbjct: 138 DKR-SSGYLAAP---------NFQAFYNEILNNYMRPISRLLFPEIMGYDTQT----FGF 183

Query: 649 VVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFI-RYNCNVT--ATRMGWML 705
            + Y P+   S+RPH D+S  T+NI LN     + G    F  +    VT  + + G  +
Sbjct: 184 SIYYDPNTDASIRPHTDASAVTLNINLNLPEEKFTGSELDFYDQQTGKVTQLSFKPGCAM 243

Query: 706 MHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           +H G + H  + +     T ++M  F D
Sbjct: 244 IHRGNVAHAAKPILTGDRTNFVMWLFGD 271


>gi|109898945|ref|YP_662200.1| 2OG-Fe(II) oxygenase [Pseudoalteromonas atlantica T6c]
 gi|109701226|gb|ABG41146.1| 2OG-Fe(II) oxygenase [Pseudoalteromonas atlantica T6c]
          Length = 306

 Score = 44.7 bits (104), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 38/152 (25%), Positives = 64/152 (42%), Gaps = 20/152 (13%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREF---IGYHHEPVRA 644
           G   D R E GY A P+  +  +         E L +Y+ P+    F   +GY  +    
Sbjct: 136 GAMLDSRSE-GYLAAPSFQVFYR---------EMLDRYMRPIARLLFPDIVGYDTQT--- 182

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVT---ATRM 701
              F + Y+P+   S+RPH D+S  T+NI LN     + G    F           A + 
Sbjct: 183 -FGFSIHYKPNTDTSIRPHTDASAVTLNINLNLPDEVFTGSNVDFYDPTTGKMIGLAFKP 241

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           G  ++H G + H  + +   + T +++  + D
Sbjct: 242 GSAMIHRGNVVHAAQPITSGERTNFVLWLYGD 273


>gi|359443514|ref|ZP_09233350.1| hypothetical protein P20429_3737 [Pseudoalteromonas sp. BSi20429]
 gi|358034560|dbj|GAA69599.1| hypothetical protein P20429_3737 [Pseudoalteromonas sp. BSi20429]
          Length = 304

 Score = 44.3 bits (103), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 37/147 (25%), Positives = 63/147 (42%), Gaps = 20/147 (13%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQER---EFIGYHHEPVRA 644
           G   D R E GY A P+             + E L  Y+ P+      E  G+  +    
Sbjct: 134 GAMLDSRSE-GYLAAPS---------FQAFYNEILNTYMRPISRMLFPEVTGFDTQT--- 180

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIR-YNCNVTAT--RM 701
              F + Y P+   S+RPH D+S  T+NI LN  G ++ G    F   Y  ++ +   + 
Sbjct: 181 -FGFSIYYEPNTDSSIRPHTDASAVTLNINLNLPGEEFTGSEVDFYHPYTGDIKSLTFKP 239

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIM 728
           G  ++H G + H  + +   + T +++
Sbjct: 240 GTAMLHRGNIAHAAKLITSGERTNFVL 266


>gi|410643991|ref|ZP_11354476.1| hypothetical protein GAGA_0010 [Glaciecola agarilytica NO2]
 gi|410136443|dbj|GAC02875.1| hypothetical protein GAGA_0010 [Glaciecola agarilytica NO2]
          Length = 303

 Score = 44.3 bits (103), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 39/152 (25%), Positives = 62/152 (40%), Gaps = 20/152 (13%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
           GT  D R E GY A P+             + E L  Y+ P+      E +GY  +    
Sbjct: 133 GTMLDSRSE-GYLAAPS---------FQAFYREILNTYMRPIARLLFPEIMGYDTQT--- 179

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVT---ATRM 701
              F + Y+P+   S+RPH D+S  T+NI +N     + G    F   +       A   
Sbjct: 180 -FGFSIHYKPNTDTSIRPHTDASAVTLNININLPDEPFTGSTVDFYDPSAGKMIPLAFTS 238

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           G  ++H G + H  + +   + T  ++  F D
Sbjct: 239 GSAMIHRGNVVHAAQPITSGERTNLVLWLFGD 270


>gi|303291270|ref|XP_003064921.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226453592|gb|EEH50901.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 521

 Score = 44.3 bits (103), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 56/201 (27%), Positives = 87/201 (43%), Gaps = 33/201 (16%)

Query: 34  FLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNLLKNELDE 93
            L +T ++    G    I+S++++ + +  LG    W G          K  L    L  
Sbjct: 276 LLAVTYSNKLNPGLDLLIESSQLHDVPIHVLG----W-GEKHPQPAQKLKATL--KFLSR 328

Query: 94  MDITDDMIILVTDSYDVIIDGGVNDILERFNTFDAN-IVFGAERLCWPDTSLY------- 145
           +D +    +L  D++D II     +IL RF  F+++ ++FGAE  C+P +  Y       
Sbjct: 329 IDPS--TTVLFVDAFDSIIVKDSYEILRRFKEFNSSSLIFGAENNCFPLSYPYFNLGYDF 386

Query: 146 ---DKYPAVGSGY-RYLNSGGFIGYAKDIKELISNRSIKNEED--------DQLYYALLF 193
              + Y     GY  YLNSG +IG A   + L S+  +   ED        DQ  YA+  
Sbjct: 387 CGDENYILKHKGYPSYLNSGQWIGKAGVARRLFSHYMLLVGEDLAETFTGTDQ--YAMEL 444

Query: 194 LDETLRTKHKIVLDTLANLFQ 214
           +  T      I +D  A LFQ
Sbjct: 445 MRMT--KAWNIEVDHEARLFQ 463


>gi|345562733|gb|EGX45769.1| hypothetical protein AOL_s00140g85 [Arthrobotrys oligospora ATCC
           24927]
          Length = 441

 Score = 44.3 bits (103), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 34/152 (22%), Positives = 61/152 (40%), Gaps = 34/152 (22%)

Query: 108 YDVIIDGGVNDILERFNTFDANIVFGAERLCWPD------TSLYDKYPAVGSGY------ 155
           +DV     +  +L RF      +VFGA++ CWP+       +   + P  G  +      
Sbjct: 143 FDVWFQLPLQVLLSRFLKMGVPVVFGADKKCWPNDFKSVACTAIPQSPLPGDVFGDDTDR 202

Query: 156 ----------------RYLNSGGFIGYAKDIKELISNRSIKNEE------DDQLYYALLF 193
                           R++NSG  IGYA  ++ +      K E+       DQ+  A ++
Sbjct: 203 QTILRSRREKYDNFRPRWVNSGTIIGYASHVRSIYDEAWKKVEQAGQEVDSDQMILAEVY 262

Query: 194 LDETLRTKHKIVLDTLANLFQNLYGSLEDIKL 225
            +  ++  + + +D  + LFQ +  S  DI  
Sbjct: 263 GERVVKGDNSMSVDFYSTLFQTMTYSHNDIAF 294


>gi|356574250|ref|XP_003555263.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
           [Glycine max]
          Length = 380

 Score = 44.3 bits (103), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 40/194 (20%), Positives = 79/194 (40%), Gaps = 25/194 (12%)

Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIH--- 608
           ++  +PC  V+ F ++  +FC + +  ++ + +W  GT    +L         ++ H   
Sbjct: 147 SIMAEPCKGVYTFEMLQPQFCKKLMSEVDHFERWVHGT----KLRIMRPNAMNKNKHGVI 202

Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
           +       +   F+  ++ P+    +       + +   FVV Y  ++   L  H D + 
Sbjct: 203 LDDFAFEAMLDRFMCDFIQPISRVFYPELGGSSLDSHHGFVVEYGINKDVELGLHEDEAE 262

Query: 669 YTINIALNQVGVDYEGGGCRF--IRYNCNVTATRM-----------GWMLMHPGRLTHYH 715
            T+N+ L   G ++ GG   F  +R + +VT               G  ++HPGR  + H
Sbjct: 263 VTLNVCL---GKEFSGGDLFFQGVRCDAHVTTNTQPEEAFNYSHVPGHAILHPGR--NRH 317

Query: 716 EGLQVTQGTRYIMI 729
                T G R  +I
Sbjct: 318 GTRPTTSGNRMNLI 331


>gi|347738636|ref|ZP_08870086.1| alkyl hydroperoxide reductase/thiol specific antioxidant/Mal
           allergen [Azospirillum amazonense Y2]
 gi|346918273|gb|EGY00325.1| alkyl hydroperoxide reductase/thiol specific antioxidant/Mal
           allergen [Azospirillum amazonense Y2]
          Length = 366

 Score = 43.9 bits (102), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 72/182 (39%), Gaps = 15/182 (8%)

Query: 561 VFWFPIVTEK-FCHEFVQIMEAYGQWSDGTNN--DKRLETGYEAVPTR--DIHMKQVGLA 615
           V   P V E  FC   + + +  G    G  N  D RL   Y+    R  D  M++  L 
Sbjct: 165 VLVVPRVFEPDFCRLLIALHQDRGGLDSGVMNEVDGRLVGVYDYTRKRRRDYFMEEEALC 224

Query: 616 GVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY-----T 670
               + + + ++P Q R+   +  E  R     V  Y   E    RPH D+++       
Sbjct: 225 QAATQRIARRLLP-QVRQAFAF--EATRMERHVVACYDAAEGGYFRPHRDNTSAGTAHRR 281

Query: 671 INIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMIS 730
             + LN    DYEGG  RF  +         G  ++    L   HE L VT+G RY  + 
Sbjct: 282 FAVTLNLNTEDYEGGELRFPEFGPRTYRAPTGGAVVFSCSL--LHEALPVTRGRRYAYLP 339

Query: 731 FV 732
           F+
Sbjct: 340 FL 341


>gi|414877514|tpg|DAA54645.1| TPA: oxidoreductase, partial [Zea mays]
          Length = 349

 Score = 43.9 bits (102), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 44/190 (23%), Positives = 77/190 (40%), Gaps = 27/190 (14%)

Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIH 608
           ++  +P P V+ F ++   FC   ++ +E + +W         R  T   Y AV      
Sbjct: 161 SIMTEPIPGVYSFAMLQPTFCEMLLEEVENFEKWVHAMKFKIMRPNTMNKYGAV------ 214

Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
           +   GL  +  +F+ +++ P+ +  +       + +  +F+V Y  D    L  H D S 
Sbjct: 215 LDDFGLEAMLNQFMEQFIAPISKVLYPEVGGGTLDSHHAFIVEYGKDRDVELGFHVDDSE 274

Query: 669 YTINIALNQVGVDYEGGGCRF--IRYNCNVTATRM-----------GWMLMHPGRLTHYH 715
            T+N+ L   G  + GG   F  IR   +V +              GW ++H GR  H H
Sbjct: 275 VTLNVCL---GKQFFGGELYFRGIRCENHVNSETQHEEMYDYTHIPGWAVLHHGR--HRH 329

Query: 716 EGLQVTQGTR 725
                + G R
Sbjct: 330 GARATSSGLR 339


>gi|410635006|ref|ZP_11345628.1| hypothetical protein GLIP_0179 [Glaciecola lipolytica E3]
 gi|410145432|dbj|GAC12833.1| hypothetical protein GLIP_0179 [Glaciecola lipolytica E3]
          Length = 136

 Score = 43.9 bits (102), Expect = 0.31,   Method: Composition-based stats.
 Identities = 27/96 (28%), Positives = 45/96 (46%), Gaps = 9/96 (9%)

Query: 633 EFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRY 692
           E IGY  +       F + Y+P+   S+RPH D+S+ T+N+ LN     + G    F   
Sbjct: 4   EIIGYDSQS----FGFSIHYQPNTDTSIRPHTDASSVTLNVNLNTPEELFSGSAVNFYDT 59

Query: 693 NCNVTATRM---GWMLMHPGRLTHYHEGLQVTQGTR 725
              +T   +   G  ++H G + H  +   +T G+R
Sbjct: 60  KQGLTKEHIFKSGTAVIHRGHVPHAAQ--HITSGSR 93


>gi|149185860|ref|ZP_01864175.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Erythrobacter
           sp. SD-21]
 gi|148830421|gb|EDL48857.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Erythrobacter
           sp. SD-21]
          Length = 363

 Score = 43.9 bits (102), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 51/220 (23%), Positives = 86/220 (39%), Gaps = 28/220 (12%)

Query: 533 LDWDLRYI--HPEYQ-KSLLPDTVNNQPCPDVFWFPIVT------EKFCHEFVQIMEAYG 583
           LD +LR +  +P  + ++ L +     P  +  W P++T      E  C   + + E  G
Sbjct: 124 LDRELRIVGRYPLIEGEAALAELKRRLPQVEDSWAPVLTVPGVFDEALCKHLISLYENDG 183

Query: 584 QWSDGTNNDKR------LETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGY 637
               G   D        L+ G++    RD  +    L  + ++ + + V P  ER F   
Sbjct: 184 GTPSGFMRDVNGKTTHILDDGFKQ--RRDTTITDPKLIQLLSQRIARRVAPAIERAFA-- 239

Query: 638 HHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY-----TINIALNQVGVDYEGGGCRFIRY 692
             +  R     V  Y   +    RPH D++T+        + +N    +YEGG  RF  +
Sbjct: 240 -FKATRIERHIVACYEAGKG-HFRPHRDNTTFGTAHRRFAVTVNLNAEEYEGGNLRFPEF 297

Query: 693 NCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
                    G  ++    L   HE   VT+G RY  + F+
Sbjct: 298 GQRTYRAPTGGAVVFSCSL--LHEATPVTRGERYAFLPFL 335


>gi|219121537|ref|XP_002181121.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407107|gb|EEC47044.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 427

 Score = 43.9 bits (102), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 53/194 (27%), Positives = 77/194 (39%), Gaps = 39/194 (20%)

Query: 560 DVFWFPIVTEKFC---HEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAG 616
           DV+   I++E FC    EFV  +   GQ    T     L+ G      R I +  +GL  
Sbjct: 221 DVYSLSILSESFCGRVREFVSEVSRLGQ----TEKYANLQMGR-----RPIDLDTIGLG- 270

Query: 617 VWAEFLRKYVV--PLQEREF--------IGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
            W   L  Y++  P+    F        + +    V A  +     RP  Q  L  H D 
Sbjct: 271 -WINDLLFYLILRPISRHLFESSESFGDLNWRQGYVAAYSANPTEGRPRAQ--LITHTDD 327

Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNVTA--------TRMGWMLMHPGRLTHYHEGL 718
           S  T+NI L   G ++ GG   F        A         R+G  L+H GR  H+H+  
Sbjct: 328 SEVTLNIGL---GENFTGGAIEFRGLRGTPEAGKLIGTIQPRVGVALIHAGR--HFHDVT 382

Query: 719 QVTQGTRYIMISFV 732
            VT G R+ ++ + 
Sbjct: 383 TVTSGDRFALVMWA 396


>gi|308801735|ref|XP_003078181.1| unnamed protein product [Ostreococcus tauri]
 gi|116056632|emb|CAL52921.1| unnamed protein product [Ostreococcus tauri]
          Length = 278

 Score = 43.9 bits (102), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 53/120 (44%), Gaps = 16/120 (13%)

Query: 624 KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYE 683
           K ++PL   + +   H        +VVRY   E P    H D    T  ++LN V  +YE
Sbjct: 105 KTLIPLAREQCVIDDHLKFEDDDWYVVRYDAKEFPRASRHRDGGHMTFVVSLNNV-TEYE 163

Query: 684 GGGCRF--IRYNC-----------NVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMIS 730
           GGG  F  + ++            ++ A  +G   +H  +L H+     VT GTRY+++ 
Sbjct: 164 GGGSVFEGLAFSVPHGGVHKLEDHDIPAQPIGGTTVHGSQLMHWSNA--VTSGTRYVLVG 221


>gi|440798305|gb|ELR19373.1| hypothetical protein ACA1_265980 [Acanthamoeba castellanii str.
           Neff]
          Length = 329

 Score = 43.5 bits (101), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 37/144 (25%), Positives = 67/144 (46%), Gaps = 24/144 (16%)

Query: 83  KVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
           KV +++N L    I  D I++  DS+D I+    +++L  +         G +R      
Sbjct: 108 KVLMVRNYL--ATIPGDQIVVFIDSFDSILYATPSELLASWRR-------GLDR----QN 154

Query: 143 SLYDKYPAVGSGYRYLNSGGFIGYAKDIKELISNRSIKNEEDDQLYYALLFLDETLRTKH 202
           +     P+   G        ++G A+D+ E ++  +I  E DDQL + L+++D       
Sbjct: 155 ARRRPTPSPIRGI-------YMGRARDLLEALTRAAIYEERDDQLAWELVYVD----NPG 203

Query: 203 KIVLDTLANLFQNLYGSLEDIKLN 226
            + LD  A+L  N+Y S +D+ L 
Sbjct: 204 MVALDYHADLVANMYLSCDDLALR 227


>gi|357127527|ref|XP_003565431.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
           [Brachypodium distachyon]
          Length = 355

 Score = 43.5 bits (101), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 45/184 (24%), Positives = 71/184 (38%), Gaps = 19/184 (10%)

Query: 558 CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGV 617
            P V  FP++   FC   V  ++ +  W+  T   K L T         + +  +G+ GV
Sbjct: 145 APVVVAFPMLRPGFCDMLVAEVQNFYMWA-CTTKQKILRTNALNTSPYGVVLSDMGMQGV 203

Query: 618 WAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQ 677
             + ++++V P+    F       + + +SFV  Y  D+      H D S  T+++ L  
Sbjct: 204 LDDLMKQFVSPISTVFFSEVGGGSLDSHVSFVNLYHGDDNNGTDWHVDDSEVTLSVCL-- 261

Query: 678 VGVDYEGGGCRFIRYNCNVTATRM-------------GWMLMHPGRLTHYHEGLQVTQGT 724
            G ++ GG   F    C    T M             G  L+H GR  H H       G 
Sbjct: 262 -GKEFTGGEMYFNGRRCENHTTSMEKDEEKVIHPQVPGEALLHHGR--HRHSVFPTFSGF 318

Query: 725 RYIM 728
           R  M
Sbjct: 319 RADM 322


>gi|359453402|ref|ZP_09242720.1| hypothetical protein P20495_1464 [Pseudoalteromonas sp. BSi20495]
 gi|358049553|dbj|GAA78969.1| hypothetical protein P20495_1464 [Pseudoalteromonas sp. BSi20495]
          Length = 235

 Score = 43.5 bits (101), Expect = 0.42,   Method: Composition-based stats.
 Identities = 31/96 (32%), Positives = 42/96 (43%), Gaps = 17/96 (17%)

Query: 592 DKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRAPMSF 648
           D R E GY A P+             + E L  Y+ P+      E IGY  +       F
Sbjct: 138 DSRSE-GYLAAPS---------FQAFYNEILNTYMRPISRLLFPEIIGYDTQT----FGF 183

Query: 649 VVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
            + Y P    S+RPH D+S  T+NI LN  G ++ G
Sbjct: 184 SIYYDPSTDASIRPHTDASAVTLNINLNLPGEEFTG 219


>gi|219114765|ref|XP_002178178.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409913|gb|EEC49843.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 456

 Score = 43.5 bits (101), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 34/69 (49%), Gaps = 6/69 (8%)

Query: 663 HHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
           H+D    T N+ L      + GGG  F          + G  L+HPG L   H+GL VT 
Sbjct: 375 HYDRCDVTANLLLAHA---FRGGGT-FFPAALTTVHLQPGEFLLHPGSL--IHQGLDVTA 428

Query: 723 GTRYIMISF 731
           GTRY+M+ F
Sbjct: 429 GTRYLMVMF 437


>gi|397642554|gb|EJK75306.1| hypothetical protein THAOC_02971 [Thalassiosira oceanica]
          Length = 782

 Score = 43.1 bits (100), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 44/168 (26%), Positives = 69/168 (41%), Gaps = 11/168 (6%)

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQV-GLAGVWAEFLRK 624
           +++   C+  +QI E +      T +       + AVPT D+ +  + GL  ++      
Sbjct: 594 VLSHGECNRMIQIAEDHAVRLGWTTSR------HFAVPTTDMPIHDLPGLQAIFCRAWEN 647

Query: 625 YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEG 684
            + PL  ++F            +F+V+Y    Q  L PH D S  +  +ALN     +EG
Sbjct: 648 KIRPLLRQQFRIPSDSECHIHDAFLVKYGASMQRYLPPHVDESNLSFVVALND--DSFEG 705

Query: 685 GGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
           GG  +I          +G ML   G     H G  V  G RYI+  F 
Sbjct: 706 GGT-YIHTLGKTLKPPVGGMLSFCGGEI-LHSGDPVVSGIRYIVAGFC 751


>gi|332306013|ref|YP_004433864.1| 2OG-Fe(II) oxygenase [Glaciecola sp. 4H-3-7+YE-5]
 gi|332173342|gb|AEE22596.1| 2OG-Fe(II) oxygenase [Glaciecola sp. 4H-3-7+YE-5]
          Length = 303

 Score = 43.1 bits (100), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 40/154 (25%), Positives = 62/154 (40%), Gaps = 24/154 (15%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
           G   D R E GY A P+             + E L  Y+ P+      E +GY  +    
Sbjct: 133 GAMLDSRSE-GYLAAPS---------FQAFYREILNTYMRPIARLLFPEIMGYDTQT--- 179

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNV-----TAT 699
              F + Y+P+   S+RPH D+S  T+NI +N     + G    F  YN         A 
Sbjct: 180 -FGFSIHYKPNTDTSIRPHTDASAVTLNININLPDEPFTGSTVDF--YNPGAGKMIPLAF 236

Query: 700 RMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
             G  ++H G + H  + +   + T  ++  F D
Sbjct: 237 TSGSAMIHRGNVVHAAQPITSGERTNLVLWLFGD 270


>gi|163795958|ref|ZP_02189921.1| hypothetical protein BAL199_28055 [alpha proteobacterium BAL199]
 gi|159178713|gb|EDP63251.1| hypothetical protein BAL199_28055 [alpha proteobacterium BAL199]
          Length = 383

 Score = 43.1 bits (100), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 56/237 (23%), Positives = 92/237 (38%), Gaps = 24/237 (10%)

Query: 508 GHLVDSENFDPQKTNPEVYELIRNPLDW---DLRYIHPEYQKSLLPDTVNNQPCPDVFWF 564
           G L+  EN   Q        +   P+D     LR    EY   + P  +  Q  P +   
Sbjct: 122 GALLVRENLRAQAI------VAAQPVDGFGERLRTAIAEYPVRMPPQAMQ-QHAPVLMIP 174

Query: 565 PIVTEKFCHEFVQIMEAYGQWSDGTNND-KRLETGY---EAVPTRDIHMKQVGLAGVWAE 620
            +V+  FC + +   EA G  + G   D   L  G    +    +D  ++   L      
Sbjct: 175 DVVSPAFCRQLIDYYEARGGGASGFMRDVDGLTRGLLDPKMKRRKDCSIEDESLLKQLRR 234

Query: 621 FLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYT-----INIAL 675
            L   V+P   + F GY     R     +  Y   +Q   + H D+++         ++L
Sbjct: 235 ALETRVIPEIGKAF-GYRVS--RVERYIIGCYDAADQGFFKAHRDNTSKATAHRKFAMSL 291

Query: 676 NQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
           N    +YEGG  RF  Y  +     +G  ++    L  +HE   VT+G RY+++ F+
Sbjct: 292 NLNTDEYEGGALRFPEYGQHTYKPGVGCAVVFSCSL--FHEATPVTRGRRYVVLPFL 346


>gi|212724054|ref|NP_001131663.1| uncharacterized protein LOC100193023 [Zea mays]
 gi|194692190|gb|ACF80179.1| unknown [Zea mays]
          Length = 392

 Score = 43.1 bits (100), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 44/190 (23%), Positives = 77/190 (40%), Gaps = 27/190 (14%)

Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIH 608
           ++  +P P V+ F ++   FC   ++ +E + +W         R  T   Y AV      
Sbjct: 161 SIMTEPIPGVYSFAMLQPTFCEMLLEEVENFEKWVHAMKFKIMRPNTMNKYGAV------ 214

Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
           +   GL  +  +F+ +++ P+ +  +       + +  +F+V Y  D    L  H D S 
Sbjct: 215 LDDFGLEAMLNQFMEQFIAPISKVLYPEVGGGTLDSHHAFIVEYGKDRDVELGFHVDDSE 274

Query: 669 YTINIALNQVGVDYEGGGCRF--IRYNCNVTATRM-----------GWMLMHPGRLTHYH 715
            T+N+ L   G  + GG   F  IR   +V +              GW ++H GR  H H
Sbjct: 275 VTLNVCL---GKQFFGGELYFRGIRCENHVNSETQHEEMYDYTHIPGWAVLHHGR--HRH 329

Query: 716 EGLQVTQGTR 725
                + G R
Sbjct: 330 GARATSSGLR 339


>gi|224005507|ref|XP_002291714.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220972233|gb|EED90565.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 662

 Score = 43.1 bits (100), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 37/71 (52%), Gaps = 4/71 (5%)

Query: 663 HHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
           H+D    T    L  +  +YEGGG  F R        + G +L+HPG L  YH+G+ +T 
Sbjct: 570 HYDGCDVTWQAMLTDIN-EYEGGGTYF-RCLRQTIKLQQGQVLVHPGEL--YHKGIDITC 625

Query: 723 GTRYIMISFVD 733
           G R +++ F D
Sbjct: 626 GVRTLLVCFTD 636


>gi|428216499|ref|YP_007100964.1| alkyl hydroperoxide reductase [Pseudanabaena sp. PCC 7367]
 gi|427988281|gb|AFY68536.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Pseudanabaena sp. PCC 7367]
          Length = 376

 Score = 43.1 bits (100), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 45/189 (23%), Positives = 74/189 (39%), Gaps = 18/189 (9%)

Query: 553 VNNQPCPDVFWFP-IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET----GYEAVPTRDI 607
           VN+ P   V   P +++ +FC E + +    G    G    +  +T     YE    RD 
Sbjct: 168 VNHAP---VLLIPNVISPEFCQELIDVWHTRGNQDSGFMRSEGEKTVGYLDYEHKIRRDH 224

Query: 608 HMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSS 667
            +++  L       + + V P  ++ F    +E  R     +  Y        RPH D+ 
Sbjct: 225 FVREGQLRDRIDRIMNRRVFPEIKKAFC---YEVTRREAYKIACYNSASGGYFRPHRDNL 281

Query: 668 T-----YTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQ 722
           T         + LN    +YEGG  +F  Y  ++     G  ++    L H  E   VT 
Sbjct: 282 TGGTAHRKFAMTLNLNVEEYEGGYLKFAEYGPHLYKPTTGSAVIFSCSLLH--EATDVTA 339

Query: 723 GTRYIMISF 731
           G R+ ++SF
Sbjct: 340 GIRFALLSF 348


>gi|348688347|gb|EGZ28161.1| hypothetical protein PHYSODRAFT_469968 [Phytophthora sojae]
          Length = 464

 Score = 42.7 bits (99), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 6/88 (6%)

Query: 648 FVVRY--RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWML 705
           F V+Y  R  E+  L  H D S  + N+ LN    D+ GGG  F      V  T+ G   
Sbjct: 375 FFVKYEARKGERSELALHRDGSVLSFNLLLNSAD-DFTGGGTYFDATKHTVHITQ-GDAA 432

Query: 706 MHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           +H G++   H G  V  G R I++ F+D
Sbjct: 433 VHSGKV--LHAGAPVVSGIRQILVGFLD 458


>gi|410643600|ref|ZP_11354096.1| hypothetical protein GCHA_4365 [Glaciecola chathamensis S18K6]
 gi|410137010|dbj|GAC12283.1| hypothetical protein GCHA_4365 [Glaciecola chathamensis S18K6]
          Length = 303

 Score = 42.7 bits (99), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 38/152 (25%), Positives = 61/152 (40%), Gaps = 20/152 (13%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
           G   D R E GY A P+             + E L  Y+ P+      E +GY  +    
Sbjct: 133 GAMLDSRSE-GYLAAPS---------FQAFYREILNTYMRPIARLLFPEIMGYDTQT--- 179

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVT---ATRM 701
              F + Y+P+   S+RPH D+S  T+NI +N     + G    F   +       A   
Sbjct: 180 -FGFSIHYKPNTDTSIRPHTDASAVTLNININLPDEPFTGSTVDFYDPSAGKMIPLAFTS 238

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           G  ++H G + H  + +   + T  ++  F D
Sbjct: 239 GSAMIHRGNVVHAAQPITSGERTNLVLWLFGD 270


>gi|297834722|ref|XP_002885243.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297331083|gb|EFH61502.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 394

 Score = 42.7 bits (99), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 74/191 (38%), Gaps = 27/191 (14%)

Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIHMKQ 611
           ++P P VF F ++   FC   +  ++ + +W   T     R  T   Y AV      +  
Sbjct: 163 SEPSPGVFVFDMLQPSFCEMMLSEIDNFERWVGETKFRIMRPNTMNKYGAV------LDD 216

Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
            GL  +  + +  ++ P+ +  F       + +   FVV Y  D    L  H D S  T+
Sbjct: 217 FGLDTMLDKLMEGFIRPISKLFFSDVGGASLDSHHGFVVEYGKDRDVDLGFHVDDSEVTL 276

Query: 672 NIALNQVGVDYEGGGCRFIRYNCNV---TATRM----------GWMLMHPGRLTHYHEGL 718
           N+ L   G  + GG   F    C     TAT+           G  ++H GR  H H   
Sbjct: 277 NVCL---GNQFVGGELFFRGTRCEKHVNTATKADLTFDYDHIPGQAVLHRGR--HRHGAR 331

Query: 719 QVTQGTRYIMI 729
             T G R  M+
Sbjct: 332 ATTSGHRVNML 342


>gi|409993565|ref|ZP_11276702.1| hypothetical protein APPUASWS_20677 [Arthrospira platensis str.
           Paraca]
 gi|409935585|gb|EKN77112.1| hypothetical protein APPUASWS_20677 [Arthrospira platensis str.
           Paraca]
          Length = 377

 Score = 42.7 bits (99), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 63/263 (23%), Positives = 105/263 (39%), Gaps = 43/263 (16%)

Query: 499 LKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI-----HP--EYQKSLL-- 549
           LK + + +YG  +     D + +N  VY  +   LD +LR +     HP  E+ +  L  
Sbjct: 99  LKGEVSTKYGAYI----CDGKNSNTIVYNRVAFLLDRNLRILKIYPLHPLEEFTQQFLGE 154

Query: 550 -PDTVNNQP-------CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDG-TNNDKRLETGY- 599
             D V  +P        P +    ++  +FC E + I E  G    G    +     GY 
Sbjct: 155 IQDLVAQEPPRLIKMQAPVLLIPKVLDLRFCRELIHIWETQGNDESGFMKREGEKTVGYV 214

Query: 600 -EAVPTRDIHMKQVGLAGVWAE-FLRKYVVP--LQEREFIGYHHEPVRAPMSFVVRYRPD 655
             +   R  H  Q G    + +  +++ V P  LQ  +F     +  R     +  Y  +
Sbjct: 215 DPSFKRRRDHFIQDGPVKNYIDSIMQRRVFPEILQAFQF-----QLTRRECYKIGCYDSE 269

Query: 656 EQPSLRPHHDSST-------YTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHP 708
                RPH D++T       + + I LN    +YEGG  RF  +  ++     G  ++  
Sbjct: 270 SGGFFRPHRDNTTGGTFHRRFAMTINLN--AEEYEGGCLRFPEHAPHLYKPATGDAIIFS 327

Query: 709 GRLTHYHEGLQVTQGTRYIMISF 731
              +  HE   VT G R+ ++SF
Sbjct: 328 --CSTMHEATDVTSGRRFALLSF 348


>gi|390338764|ref|XP_001180150.2| PREDICTED: 2-oxoglutarate and iron-dependent oxygenase
           domain-containing protein 2-like [Strongylocentrotus
           purpuratus]
          Length = 361

 Score = 42.4 bits (98), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 42/187 (22%), Positives = 82/187 (43%), Gaps = 15/187 (8%)

Query: 549 LPDTVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIH 608
           L   +  +    VF FP+ T +FC  FV+ +  +        N    +     +    + 
Sbjct: 130 LAKRLTRENASRVFSFPVFTAEFCDRFVEEITYF-------ENSPLPKGRPNTMNNYGVL 182

Query: 609 MKQVGLAGVWAEFLR-KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSS 667
           + ++G  G +   LR  Y+ P+    +       + +  +F+V+Y+  E   L  H+D++
Sbjct: 183 LMELGFDGNFLNPLRMDYLAPIASLLYPDVGGNSLDSHRAFIVKYKLGEDVDLNYHYDNA 242

Query: 668 TYTINIALNQVGVDYE--GGGCRFIRYNCNVTAT---RMGWMLMHPGRLTHYHEGLQVTQ 722
             TIN++L +   D E   G  R +  +  + A    +    L+H G+  H H  + +++
Sbjct: 243 EVTINVSLGKEFSDGELYFGDMRQMPRDETMYARFEHKKTIGLLHRGQ--HMHGAMPISE 300

Query: 723 GTRYIMI 729
           G RY +I
Sbjct: 301 GERYNLI 307


>gi|301117344|ref|XP_002906400.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262107749|gb|EEY65801.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 462

 Score = 42.4 bits (98), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 42/88 (47%), Gaps = 6/88 (6%)

Query: 648 FVVRY--RPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWML 705
           F V+Y  R  E+  L  H D S  + NI LN    D+ GGG  F      V  T+ G   
Sbjct: 373 FFVKYEARKGERSELALHRDGSVLSFNILLNSAD-DFTGGGTYFDSTKRTVHITQ-GDAA 430

Query: 706 MHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           +H G++   H G  V  G R I++ F+D
Sbjct: 431 VHSGKV--LHGGAPVLTGIRQILVGFLD 456


>gi|392407556|ref|YP_006444164.1| glycosyltransferase [Anaerobaculum mobile DSM 13181]
 gi|390620692|gb|AFM21839.1| putative glycosyltransferase [Anaerobaculum mobile DSM 13181]
          Length = 277

 Score = 42.4 bits (98), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 39/158 (24%), Positives = 72/158 (45%), Gaps = 28/158 (17%)

Query: 339 DDYIHNFKTMFKNVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLK 398
           +D +  F T+   ++YI ++S +    A N+A+  S+  GV ++  ++SD H +N +V+K
Sbjct: 42  NDSLRKFSTLDDRIEYIFNDSNLGYGRAHNIAIRESIKAGVPYHVVLNSDVHFNN-EVIK 100

Query: 399 YL-----VNRNESLIAP--------------LLVRPFKAWSNF---WGALNADGFYARSF 436
            L      N +  L+ P              LL  PF ++      WG       Y    
Sbjct: 101 VLYDFMNANPDVGLVMPKILYPNGELQYDCKLLPTPFDSFGRRFLNWGPFKK---YVEKR 157

Query: 437 DYMNIINGDQGGKGIWNVPYITNCYL-MKTSVIKATNI 473
           +++  +      K I NVPY+  C++ ++ SV+K   +
Sbjct: 158 NHIYELRFADYDK-IMNVPYLCGCFIFLRVSVLKEIGL 194


>gi|195652137|gb|ACG45536.1| oxidoreductase [Zea mays]
          Length = 392

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 43/190 (22%), Positives = 76/190 (40%), Gaps = 27/190 (14%)

Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIH 608
           ++  +P P V+ F ++   FC   ++ +E + +W         R  T   Y AV      
Sbjct: 161 SIMTEPIPGVYSFAMLQPTFCEMLLEEVENFEKWVHAMKFKIMRPNTMNKYGAV------ 214

Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
           +   GL  +  +F+ +++ P+ +  +       + +  +F+V Y  D    L  H D S 
Sbjct: 215 LDDFGLEAMLNQFMEQFIAPISKVLYPEVGGGTLDSHHAFIVEYGKDRDVELGFHVDDSE 274

Query: 669 YTINIALNQVGVDYEGGGCRF--IRYNCNVTATRM-----------GWMLMHPGRLTHYH 715
            T+N+ L   G  + GG   F  IR   +V +               W ++H GR  H H
Sbjct: 275 VTLNVCL---GKQFSGGELYFRGIRCENHVNSETQHEEMYDYTHIPSWAVLHHGR--HRH 329

Query: 716 EGLQVTQGTR 725
                + G R
Sbjct: 330 GARATSSGLR 339


>gi|326929627|ref|XP_003210960.1| PREDICTED: 2-oxoglutarate and iron-dependent oxygenase
           domain-containing protein 2-like [Meleagris gallopavo]
          Length = 293

 Score = 42.0 bits (97), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 41/176 (23%), Positives = 75/176 (42%), Gaps = 17/176 (9%)

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
           +F  P+ TE+FC  F++ +E + Q SD           Y       + + ++G+   +  
Sbjct: 78  IFRLPVFTEEFCQAFIEELENFEQ-SDMPKGRPNSMNNY------GVLLNELGMDESFIT 130

Query: 621 FLR-KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
            LR KY+ P+    +       + +  +FVV+Y   E   L  H+D++  T+N++L   G
Sbjct: 131 PLREKYLRPITALLYPDLGGACLDSHKAFVVKYSLHEDLDLSSHYDNAEVTLNVSL---G 187

Query: 680 VDYEGGGCRFIRYNCNVTATRMGWMLMHPG------RLTHYHEGLQVTQGTRYIMI 729
            D+  G   F  +  + +       + H G      R    H  L +  G R+ +I
Sbjct: 188 KDFTEGNLYFGDFRQDPSPVPSYIEVEHVGTQGLLHRGGQIHGALPIASGERWNLI 243


>gi|432875843|ref|XP_004072935.1| PREDICTED: 2-oxoglutarate and iron-dependent oxygenase
           domain-containing protein 2-like [Oryzias latipes]
          Length = 290

 Score = 42.0 bits (97), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 42/179 (23%), Positives = 80/179 (44%), Gaps = 23/179 (12%)

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
           V+ FP+    FC E V+ ++ + Q          +           I + ++G    +  
Sbjct: 78  VYRFPVFERDFCRELVEELDHFEQSPAPKGRPNTMNNS-------GILLDELGFDEAFVT 130

Query: 621 FLR-KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
            LR +Y++PL    +       + +  +FVV+Y  +E   L  H+D++  T+N++   +G
Sbjct: 131 PLREQYLLPLTSLLYPDCGGRCLDSHKAFVVKYDMNEDLELSYHYDNAEVTLNVS---IG 187

Query: 680 VDYEGGGCRF--IRYNCNVTATRMGWM-------LMHPGRLTHYHEGLQVTQGTRYIMI 729
            D+  G   F  +R +  V+ TR+          L+H G+  H H  L ++ G R+ +I
Sbjct: 188 KDFTEGNLYFGDMRQD-PVSETRLTEAEHRITEGLLHRGQ--HMHGALPISHGQRWNLI 243


>gi|345857005|ref|ZP_08809460.1| glycosyl transferase, group 2 family protein [Desulfosporosinus sp.
           OT]
 gi|344329850|gb|EGW41173.1| glycosyl transferase, group 2 family protein [Desulfosporosinus sp.
           OT]
          Length = 558

 Score = 42.0 bits (97), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 35/151 (23%), Positives = 72/151 (47%), Gaps = 23/151 (15%)

Query: 371 VENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIA--------PLLVRPFKAW--- 419
           ++ +L +G D+ F VDSD +LD P+ L +L++ N+ +++        P L+   + W   
Sbjct: 95  IKIALDEGYDYLFLVDSDLYLD-PNTLPHLLSLNKDIVSEVYWTRWNPKLIPLPQVWIRD 153

Query: 420 ------SNFWGALNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM-KTSVIKATN 472
                 S+   AL+ +    R+ +++ +++      G + V  +  C L+ +T++ +  +
Sbjct: 154 QYTLYVSSRGEALSEEEMNKRTKEFIKMLS----HPGTYKVGGLGACTLISRTALERGVS 209

Query: 473 IKTIYTLNSMDYDMAFCTNLRNKGIHLKIDS 503
            + IY L     D  FC      G+ L  D+
Sbjct: 210 FQEIYNLGFTGEDRHFCVRAAALGLELYADT 240


>gi|325286683|ref|YP_004262473.1| 2OG-Fe(II) oxygenase [Cellulophaga lytica DSM 7489]
 gi|324322137|gb|ADY29602.1| 2OG-Fe(II) oxygenase [Cellulophaga lytica DSM 7489]
          Length = 317

 Score = 42.0 bits (97), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 30/133 (22%), Positives = 58/133 (43%), Gaps = 7/133 (5%)

Query: 603 PTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRP 662
           P  + H+        + + + +Y+ P+  R  +G      +    F +RY PD++  L+ 
Sbjct: 139 PRSEGHLGAPNFQAFYNDIMDRYMRPIS-RLLLGTQGYDSQT-FGFSIRYNPDKEKDLQA 196

Query: 663 HHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRM---GWMLMHPGRLTHYHEGLQ 719
           H D+S+ T+NI +N    +Y G    F   +   T       G  ++H G + H      
Sbjct: 197 HTDASSATLNININLPDEEYTGSEVDFYDKSTKQTVQTFFEPGKAILHRGNVPHATH--P 254

Query: 720 VTQGTRYIMISFV 732
           +T G R  ++ ++
Sbjct: 255 ITSGQRSNLVVWL 267


>gi|197104861|ref|YP_002130238.1| hypothetical protein PHZ_c1395 [Phenylobacterium zucineum HLK1]
 gi|196478281|gb|ACG77809.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1]
          Length = 345

 Score = 42.0 bits (97), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 46/179 (25%), Positives = 69/179 (38%), Gaps = 22/179 (12%)

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGY---EAVPTRDIHMKQVGLAGVWAEFL 622
           I+  + C   +++ E  G    G   D    T Y   E    RD+ ++  GL       L
Sbjct: 154 ILEPELCRALIELHEGDGGAFTGVMRDAGDRTVYVMDELKRRRDVVVRDPGLVEALRTRL 213

Query: 623 RKYVVPLQERE--FIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST-------YTINI 673
            + + PL ER   F   H E        V  Y   +    RPH D++T       +  +I
Sbjct: 214 ERRLFPLIERALGFKATHIE-----RYLVSCYDEADGGVFRPHRDNTTLGTAHRAFACSI 268

Query: 674 ALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
            LN     +EGG  RF  +        +G + +    L   HE L V +G RY  + F+
Sbjct: 269 NLND---GFEGGDLRFPEFGPATYRPPVGGVCVFACGL--MHEALPVMEGRRYAFVPFL 322


>gi|408391417|gb|EKJ70794.1| hypothetical protein FPSE_09030 [Fusarium pseudograminearum CS3096]
          Length = 528

 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 53/232 (22%), Positives = 92/232 (39%), Gaps = 65/232 (28%)

Query: 40  ASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNL--LKNELDEM--D 95
           ASN  D +   + SA VN+     +     W G    +    +   L  +K  LD++   
Sbjct: 92  ASNPNDMFCAIVASALVNRYPAPYM---VGWKGEGKYNASAAHTAKLYSIKKYLDKLPNG 148

Query: 96  ITDDMIILVTDSYDVIIDGGVNDILERFNTFDAN------------------------IV 131
             DD ++   D YDV+    V  ++ER+    A+                        + 
Sbjct: 149 GDDDDLVFFGDGYDVMAQLPVEVVIERYFKVAADADQRLADRFGISVQEAHKRGLKQTLF 208

Query: 132 FGAERLCWP---------------DTSLYDKYPAVGSG---YR---YLNSGGFIGYAKDI 170
           +GA+++CWP                +++Y   P  G+G   YR   Y NSG  IG   D+
Sbjct: 209 WGADKMCWPAINEAQCTKIPGSHLASTVYG--PKTGNGDLNYRDAKYFNSGSVIGPIGDL 266

Query: 171 KELIS----------NRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANL 212
           ++ I+          + + K +  DQ+Y A ++  + L ++ K + D L N 
Sbjct: 267 RKFINAGVTALEETFDPNFKYKTSDQIYLARVYARQEL-SRAKQIEDELLNF 317


>gi|255570701|ref|XP_002526305.1| oxidoreductase, putative [Ricinus communis]
 gi|223534386|gb|EEF36094.1| oxidoreductase, putative [Ricinus communis]
          Length = 379

 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 47/187 (25%), Positives = 76/187 (40%), Gaps = 21/187 (11%)

Query: 556 QPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIHMKQV 612
           +P P V+ F ++   FC   +  +E + +W   T     R  T   Y AV      +   
Sbjct: 148 EPTPGVYVFEMLQPNFCEMLMSEVENFERWVHETKFRIMRPNTMNNYGAV------LDDF 201

Query: 613 GLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
           GL  +  + + +Y+ P+ +  F       + +   F+V Y  D    L  H D S  T+N
Sbjct: 202 GLETMLDKLMDEYIRPMSKLFFPEVGGSTLDSHHGFIVEYGVDRDVELGFHVDDSEVTLN 261

Query: 673 IALNQ--VGVDYEGGGCRFIRY-NCNVTATRM-------GWMLMHPGRLTHYHEGLQVTQ 722
           + L++  VG D    G R  ++ N    A  +       G  ++H GR  H H     T 
Sbjct: 262 VCLSKQFVGGDLFFRGVRCDKHVNTETQAEEILDYVHVQGHAVLHHGR--HRHGARATTS 319

Query: 723 GTRYIMI 729
           G R  +I
Sbjct: 320 GRRVNLI 326


>gi|326495342|dbj|BAJ85767.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 392

 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 36/147 (24%), Positives = 62/147 (42%), Gaps = 12/147 (8%)

Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIH 608
           ++  +P P VF F ++  KFC   ++ +E + +W         R  T   Y AV      
Sbjct: 158 SIMTEPTPGVFSFAMLQPKFCDMLLEEVENFEKWVHAMKFKIMRPNTMNKYGAV------ 211

Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
           +   GL  +  +F+ +++ P+ +  +       + +  +FVV Y  D    L  H D S 
Sbjct: 212 LDDFGLEAMLNQFMEEFIAPISKVFYPEVGGGTLDSHHAFVVEYGKDRDVELGFHVDDSE 271

Query: 669 YTINIALNQVGVDYEGGGCRFIRYNCN 695
            T+N+ L   G  + GG   F    C 
Sbjct: 272 VTLNVCL---GKQFSGGELYFRGIRCE 295


>gi|290978569|ref|XP_002672008.1| prolyl 4-hydroxylase alpha subunit family protein [Naegleria
           gruberi]
 gi|284085581|gb|EFC39264.1| prolyl 4-hydroxylase alpha subunit family protein [Naegleria
           gruberi]
          Length = 659

 Score = 41.6 bits (96), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 39/78 (50%), Gaps = 13/78 (16%)

Query: 661 RPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHP-----GRLT--- 712
           R  H+ S YT+ I LNQ   D++GG  RF        +    + L+H      G+L    
Sbjct: 168 RSEHERSIYTLLIYLNQ---DFKGGETRFYNDPTKTDSDFEEYSLLHTLKPSLGQLALFN 224

Query: 713 --HYHEGLQVTQGTRYIM 728
              YHEG  VT+GT+YI+
Sbjct: 225 QDFYHEGCPVTKGTKYIL 242


>gi|326523063|dbj|BAJ88572.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 392

 Score = 41.6 bits (96), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 36/147 (24%), Positives = 62/147 (42%), Gaps = 12/147 (8%)

Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIH 608
           ++  +P P VF F ++  KFC   ++ +E + +W         R  T   Y AV      
Sbjct: 158 SIMTEPTPGVFSFAMLQPKFCDMLLEEVENFEKWVHAMKFKIMRPNTMNKYGAV------ 211

Query: 609 MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST 668
           +   GL  +  +F+ +++ P+ +  +       + +  +FVV Y  D    L  H D S 
Sbjct: 212 LDDFGLEAMLNQFMEEFIAPISKVFYPEVGGGTLDSHHAFVVEYGKDRDVELGFHVDDSE 271

Query: 669 YTINIALNQVGVDYEGGGCRFIRYNCN 695
            T+N+ L   G  + GG   F    C 
Sbjct: 272 VTLNVCL---GKQFSGGELYFRGIRCE 295


>gi|134299269|ref|YP_001112765.1| glycosyl transferase family protein [Desulfotomaculum reducens
           MI-1]
 gi|134051969|gb|ABO49940.1| glycosyl transferase, family 2 [Desulfotomaculum reducens MI-1]
          Length = 826

 Score = 41.6 bits (96), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 76/379 (20%), Positives = 145/379 (38%), Gaps = 66/379 (17%)

Query: 173 LISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLYGS---LEDIKLNF-- 227
           LI  + + N+ +D      L L+   R      +  L      L GS    ED+ LN   
Sbjct: 189 LIITKELANKPNDPFLLYSLALEHYQRKNILEGVQCLKKALTQLRGSEGYFEDVILNTAI 248

Query: 228 ------DLDEFVHLTNTKYNTNPVIIHGNGKSKIELNSFGNYLAKSWKTSGCTRCNLIKH 281
                  L+E +   N       +++    K  I +    N   K +  +  T   L K 
Sbjct: 249 GLLQLGRLEELMDFIN-----KSLLMLPEQKDLILMRRLANQGLKRYLKAADT---LEKS 300

Query: 282 LDSLKPDQF--PSVLISVFIDKPTAFLEEFLNKIANLNYPAKKISMFVYNNQEYHAPLFD 339
           +DS   + F    V+++  + +    L++FL  +  L     ++     N+   H     
Sbjct: 301 IDSRGKESFMKTRVMVASPVKQKEVILKQFLESLNKLEKSELELDFVFINDNNEH----- 355

Query: 340 DYIHNFKTMFKNVKYI---AHNSTVNSKEA--------------RNLAVENSLHKGVDFY 382
           + +  F    KNV+ I   +++S +  +E               +N  ++ +L +G D+ 
Sbjct: 356 NLLEKFSRGKKNVRIIKATSNDSYICDEETHRWSEELIWKVAAYKNSFIKMALEEGYDYL 415

Query: 383 FYVDSDSHLDNPDVLKYLVNRNESLIAPLLVR----PFKAWSNFWG-------------A 425
           F VDSD +L +P  +K+L++  + +++ +        FK     WG             A
Sbjct: 416 FLVDSDLYL-HPKTIKHLISLKKDIVSEVFWTRWGPEFKILPQVWGSDQYELYHVSRGQA 474

Query: 426 LNADGFYARSFDYMNIINGDQGGKGIWNVPYITNCYLM-KTSVIKATNIKTIYTLNSMDY 484
           L+ +    R  +++  ++      G + V  +  C L+ + ++ K  +   IY L+    
Sbjct: 475 LSEEEKIQRIEEFIEKLS----KPGTYKVGGLGACTLISQKALAKGVSFSEIYNLSFWGE 530

Query: 485 DMAFCTNLRNKGIHLKIDS 503
           D  FC      G  L  D+
Sbjct: 531 DRHFCIRAVALGFELYADT 549


>gi|344297298|ref|XP_003420336.1| PREDICTED: 2-oxoglutarate and iron-dependent oxygenase
           domain-containing protein 2 [Loxodonta africana]
          Length = 350

 Score = 41.6 bits (96), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 33/129 (25%), Positives = 59/129 (45%), Gaps = 11/129 (8%)

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
           ++  P+ T  FC   ++ +E + Q SD           Y       + + ++GL      
Sbjct: 139 IYRVPVFTASFCQALLEELEHFEQ-SDLPKGRPNTMNNY------GVLLHELGLDEPLVT 191

Query: 621 FLRK-YVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQV- 678
            LR+ ++ PL    +  Y   P+ +  +FVV+Y P +   L  H+D++  T+N+AL +  
Sbjct: 192 PLREHFLQPLMALLYPEYSGGPLDSHRAFVVKYAPGQDRELGCHYDNAELTLNVALGKAF 251

Query: 679 --GVDYEGG 685
             G  Y GG
Sbjct: 252 TGGALYFGG 260


>gi|376007582|ref|ZP_09784776.1| Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Arthrospira sp. PCC 8005]
 gi|375324049|emb|CCE20529.1| Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Arthrospira sp. PCC 8005]
          Length = 377

 Score = 41.2 bits (95), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 63/263 (23%), Positives = 104/263 (39%), Gaps = 43/263 (16%)

Query: 499 LKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPLDWDLRYI-----HP--EYQKSLL-- 549
           LK + +  YG  +     D + +N  VY  +   LD  LR +     HP  E+ +  L  
Sbjct: 99  LKGEVSTRYGAYI----CDGKNSNTIVYNRVAFLLDRSLRILKIYPLHPLEEFTQKFLGE 154

Query: 550 -PDTVNNQP-------CPDVFWFPIVTEKFCHEFVQIMEAYGQWSDG-TNNDKRLETGY- 599
             D V  +P        P +    ++  +FC E ++I E  G    G    +     GY 
Sbjct: 155 IQDLVAQEPPRLIEMQAPVLLIPKVLDLRFCRELIKIWETQGNDESGFMKREGEKTVGYV 214

Query: 600 -EAVPTRDIHMKQVGLAGVWAE-FLRKYVVP--LQEREFIGYHHEPVRAPMSFVVRYRPD 655
             +   R  H  Q G    + +  +++ V P  LQ  +F     +  R     +  Y  +
Sbjct: 215 DPSFKRRRDHFIQDGPVKNYIDSIMQRRVFPEILQAFQF-----QLTRRECYKIGCYDSE 269

Query: 656 EQPSLRPHHDSST-------YTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHP 708
                RPH D++T       + + I LN    +YEGG  RF  +  ++     G  ++  
Sbjct: 270 SGGFFRPHRDNTTGGTLHRRFAMTINLNTE--EYEGGCLRFPEHAPHLYKPATGDAIIFS 327

Query: 709 GRLTHYHEGLQVTQGTRYIMISF 731
              +  HE   VT G R+ ++SF
Sbjct: 328 --CSTMHEATDVTSGRRFALLSF 348


>gi|389878362|ref|YP_006371927.1| alkyl hydroperoxide reductase [Tistrella mobilis KA081020-065]
 gi|388529146|gb|AFK54343.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Tistrella mobilis KA081020-065]
          Length = 404

 Score = 41.2 bits (95), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 53/204 (25%), Positives = 78/204 (38%), Gaps = 30/204 (14%)

Query: 547 SLLPDTVNNQPCPDVFWFPIVTE-KFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTR 605
           ++ P     QP   V   P V E  FC    ++M  Y +   G  +    E G   V  R
Sbjct: 181 TVAPADDAGQPWAPVLAVPRVFEPAFCR---RLMAEYDRLG-GEESGFMREVGGRTVEMR 236

Query: 606 DIHMKQVG---------LAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDE 656
           D   K+            AG+ A   R+ +  LQ+     + ++  R     V  Y  D 
Sbjct: 237 DYGHKRRADCLIEDETLRAGIRARIERRLLPELQK----AFQYKATRIERYIVACYDGDG 292

Query: 657 QPS-LRPHHDSST-------YTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHP 708
                RPH D++T       + + I LN    DYEGG  RF  +         G  ++  
Sbjct: 293 AGGYFRPHRDNTTRGTAHRRFAVTINLN--AEDYEGGELRFPEFGDRRYRAPTGGAVVFS 350

Query: 709 GRLTHYHEGLQVTQGTRYIMISFV 732
             L   HE L VT+G R+  + F+
Sbjct: 351 CSL--LHEALAVTRGRRFACLPFL 372


>gi|255567788|ref|XP_002524872.1| oxidoreductase, putative [Ricinus communis]
 gi|223535835|gb|EEF37496.1| oxidoreductase, putative [Ricinus communis]
          Length = 411

 Score = 41.2 bits (95), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 53/223 (23%), Positives = 89/223 (39%), Gaps = 39/223 (17%)

Query: 531 NPLDWDLRYIHPE------YQKSLLPDT------VNNQPCPDVFWFPIVTEKFCHEFVQI 578
            PL+ +L  +HP       + K++  +T      + ++P P VF F ++   FC+  +  
Sbjct: 143 QPLNRELYAMHPSSFFVPSFIKAINDNTEESFRHIMSEPSPGVFTFEMLQPHFCNLLLSE 202

Query: 579 MEAYGQW-SDGTNNDKRLET--GYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQEREFI 635
           +E + +W +D      R  T   Y AV      +   GL  +  + +  ++ P+ +  F 
Sbjct: 203 VENFEKWVNDSKFRIMRPNTMNKYGAV------LDDFGLETMLDKLMDGFIRPISKVFFP 256

Query: 636 GYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCN 695
                 + +   FVV Y  D    L  H D S  T+N+ L   G  + GG   F    C+
Sbjct: 257 EVGGSTLDSHHGFVVEYGKDRDVDLGFHVDDSEVTLNVCL---GKQFSGGDLFFRGIRCD 313

Query: 696 V---TATRM----------GWMLMHPGRLTHYHEGLQVTQGTR 725
               T ++           G  ++H GR  H H     T G R
Sbjct: 314 KHVNTGSQSEEIYDYKHEPGKAVLHRGR--HRHGARATTTGHR 354


>gi|219112429|ref|XP_002177966.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410851|gb|EEC50780.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 360

 Score = 41.2 bits (95), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 48/233 (20%), Positives = 91/233 (39%), Gaps = 42/233 (18%)

Query: 529 IRNPLDWDLRYIHPEYQKSLLPDTVNNQPCPDVFWFPIVTEKFCHEFVQI----MEAYGQ 584
           IR+     L +  P + +  + D +   P  ++    I+++      ++I     +A G+
Sbjct: 25  IRSAASTRLIFTLPRFYEDKVDDAIYPSPLHNIHVRTILSDDEAKACLRISSDFAKATGR 84

Query: 585 WSDGTNNDKR-----LETGYEAVPTRDIHMKQVGLAG-VWAEFLRKYVVPLQEREFIGYH 638
           W D  ++D+       +   E     + +++++G  G ++ E    Y V  ++  F+   
Sbjct: 85  W-DRPDSDRHASYATCDFAVEDCTILEDYLEKIGFTGRIFDELNEVYGVEQEDMSFLDL- 142

Query: 639 HEPVRAPMSFVVRYRPD------EQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF--- 689
                    F   Y+            L PH D S  + +I +N    D+EGGG  F   
Sbjct: 143 ---------FCAHYQTKTDCNQGSMDRLEPHRDGSILSFSITINDPD-DFEGGGTLFDGL 192

Query: 690 ---------IRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
                    ++    V  TR G  + H G+  H      +T G R +++ FVD
Sbjct: 193 RDVVSTSSVLKNGGVVRPTRAGDAVFHSGKALHGANA--ITSGKRTVLVGFVD 243


>gi|431915931|gb|ELK16185.1| Glycosyltransferase 25 family member 2 [Pteropus alecto]
          Length = 166

 Score = 41.2 bits (95), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 19/64 (29%), Positives = 33/64 (51%), Gaps = 1/64 (1%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A+  +  K  D+  +VD D+ L NP  L  ++  N++++AP+L      +SNFW  +
Sbjct: 93  RQTALRTAREKWSDYILFVDVDNFLTNPQTLNLMIAENKTIVAPML-ESRGLYSNFWCGI 151

Query: 427 NADG 430
               
Sbjct: 152 TPQA 155


>gi|412993564|emb|CCO14075.1| predicted protein [Bathycoccus prasinos]
          Length = 486

 Score = 41.2 bits (95), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 33/122 (27%), Positives = 55/122 (45%), Gaps = 11/122 (9%)

Query: 25  KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGG-GYK 83
           K K +D    +V   A+N        + SA+ N L++         + G+ ++  G   K
Sbjct: 162 KSKGVDSAPVVVTAHATNLKSNGWVILDSAKKNGLEIV--------ISGNGTTFHGFADK 213

Query: 84  VNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTS 143
           +  LK  L    I  + II+  D+ DV++  G  +  +RF   DA+ +FG E   WP+  
Sbjct: 214 MMGLKAAL--HSIPGNPIIVNADATDVLLQCGPEEFQKRFEQADADFIFGGETQLWPEIR 271

Query: 144 LY 145
            Y
Sbjct: 272 KY 273


>gi|407068468|ref|ZP_11099306.1| 2OG-Fe(II) oxygenase [Vibrio cyclitrophicus ZF14]
          Length = 303

 Score = 41.2 bits (95), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 62/148 (41%), Gaps = 22/148 (14%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
           G   D R E GY A P+             + + L  Y+ P+      E +GY  +    
Sbjct: 133 GAMLDSRSE-GYLAAPS---------FQAFYRDLLDSYMRPIARLLFPEIMGYDTQT--- 179

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF---IRYNCNVTATRM 701
              F ++Y+ ++  SLR H D+S+ T+NI +N    ++ G    F        N T    
Sbjct: 180 -FGFSIQYQANKDTSLRLHTDASSVTLNININMPDEEFSGSELNFYDPATGKMNETTFTP 238

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
           G  ++H G +   H  L +T G R  ++
Sbjct: 239 GVAMIHRGNVA--HAALPITSGERSNLV 264


>gi|297719687|ref|NP_001172205.1| Os01g0180900 [Oryza sativa Japonica Group]
 gi|255672938|dbj|BAH90935.1| Os01g0180900 [Oryza sativa Japonica Group]
          Length = 433

 Score = 41.2 bits (95), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 44/192 (22%), Positives = 72/192 (37%), Gaps = 22/192 (11%)

Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQ 611
           ++  +P P VF FP++   FC   +  +  + +W+   N      T  +    R   +  
Sbjct: 194 SIMMEPAPGVFAFPMLKPSFCQMLMSEVNNFLRWAQSANQRIMRPTSLDRH-GRGAALSD 252

Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHH-DSSTYT 670
            GL  +    ++ ++ P+    F       + +  +FV+ Y   E    R  H D S  T
Sbjct: 253 FGLQEMLDNLMKDFISPMSTVLFPEVGGNTLDSHHTFVLEY--GEADGARGFHVDDSEVT 310

Query: 671 INIALNQVGVDYEGGGCRFIRYNCN-------------VTATRMGWMLMHPGRLTHYHEG 717
           +NI L   G  + G    F    C              V     G +L+H G  +H H  
Sbjct: 311 LNICL---GKHFTGADMYFRGIRCGNHVNSGTHDEEYFVHPNVPGQVLLHHG--SHRHGV 365

Query: 718 LQVTQGTRYIMI 729
             VT G R  M+
Sbjct: 366 FSVTSGRRVNMV 377


>gi|421502614|ref|ZP_15949567.1| alkyl hydroperoxide reductase [Pseudomonas mendocina DLHK]
 gi|400346598|gb|EJO94955.1| alkyl hydroperoxide reductase [Pseudomonas mendocina DLHK]
          Length = 377

 Score = 41.2 bits (95), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 46/184 (25%), Positives = 71/184 (38%), Gaps = 19/184 (10%)

Query: 561 VFWFPIVTE-KFCHEFVQIMEAYGQWSDGTNNDKRLET----GYEAVPTRDIHMKQVGLA 615
           V   P V E   C   +    A G    G   D   +T    G      RD  ++   L 
Sbjct: 166 VLVLPRVFEPSLCQALMDYYAARGGEPSGYMQDIDGKTVQVIGQAHKSRRDCLVEDEALR 225

Query: 616 GVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSST------- 668
                 + + +VP  ER F     +  R     +  Y   EQ   RPH D++T       
Sbjct: 226 EACRLRIYQRLVPQIERAF---QFKVSRMERYLIGCYDATEQGHFRPHRDNTTKGTAHRR 282

Query: 669 YTINIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIM 728
           + +++ LN    +YEGG  RF  +   +     G  ++    L H  E L VT+G R++ 
Sbjct: 283 FAVSLFLNSG--EYEGGWLRFPEFGSALYGAPTGGAVVFACSLLH--EALPVTRGRRFMF 338

Query: 729 ISFV 732
           + F+
Sbjct: 339 LPFL 342


>gi|46137635|ref|XP_390509.1| hypothetical protein FG10333.1 [Gibberella zeae PH-1]
          Length = 507

 Score = 40.8 bits (94), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 53/232 (22%), Positives = 89/232 (38%), Gaps = 65/232 (28%)

Query: 40  ASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKVNL--LKNELDEM--D 95
           AS+  D +   + SA VN+     +     W G    +    +   L  +K  LD++   
Sbjct: 92  ASDPNDMFCAIVASALVNRYPAPYM---VGWKGEGKYNASAAHTAKLYSIKKYLDKLPNG 148

Query: 96  ITDDMIILVTDSYDVIIDGGVNDILERFNTFDAN------------------------IV 131
             DD ++   D YDV+    V  I+ER+    A+                        + 
Sbjct: 149 GDDDDLVFFGDGYDVMAQLPVEVIIERYFKVAADADQRLADRFGITVEEAHKRGLKQTLF 208

Query: 132 FGAERLCWP---------------DTSLYDKYPAVGSG------YRYLNSGGFIGYAKDI 170
           +GA+++CWP                +++Y   P  G+G       +Y NSG  IG   D+
Sbjct: 209 WGADKMCWPALNEAQCTKIPSSHLPSTVYG--PKTGNGNTHNRDAKYFNSGSVIGPIGDL 266

Query: 171 KELISNRSIKNEE----------DDQLYYALLFLDETLRTKHKIVLDTLANL 212
           ++ I+      EE           DQ+Y A  F  + L ++ K + D L N+
Sbjct: 267 RKFINAGVTALEETFDPNFKYKTSDQIYLARTFARQEL-SRAKQIEDELHNI 317


>gi|398803811|ref|ZP_10562825.1| Peroxiredoxin [Polaromonas sp. CF318]
 gi|398095675|gb|EJL86010.1| Peroxiredoxin [Polaromonas sp. CF318]
          Length = 373

 Score = 40.8 bits (94), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 44/182 (24%), Positives = 72/182 (39%), Gaps = 27/182 (14%)

Query: 566 IVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET------GYEAVPTRDI-HMKQVGLAGVW 618
           +    FC E + + E +G    G   ++  +T      G++     DI  +  V  A  W
Sbjct: 178 VFPPGFCRELISLYETHGGKESGFMREENGKTVLAHDHGHKRREDYDITDLAVVKAARAW 237

Query: 619 AEFLRKYVVPLQEREFIGYHH-EPVRAPMSFVVRYRPDEQPSLRPHHDSST-------YT 670
              +++ +VP    E    H  +  R     +  YR D+Q    PH D++T       + 
Sbjct: 238 ---IQRRIVP----EIAKVHQFKATRMERYIIGCYRADQQAHFSPHRDNTTRGTAHRRFA 290

Query: 671 INIALNQVGVDYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMIS 730
           ++I LN    D+EGG   F  Y         G  ++    L H      VT+G RY  + 
Sbjct: 291 VSINLND---DFEGGEVSFPEYGPRSFKPPPGGAVVFSCSLLHAVS--TVTRGRRYAFLP 345

Query: 731 FV 732
           F+
Sbjct: 346 FL 347


>gi|424513685|emb|CCO66307.1| predicted protein [Bathycoccus prasinos]
          Length = 476

 Score = 40.8 bits (94), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 2/65 (3%)

Query: 83  KVNLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDT 142
           KV  LK  L  M    + I++V D+ DV +    ++  +RF   +A++VFG E   WP+ 
Sbjct: 212 KVIGLKAALHAM--PGNPIVVVADASDVFLQCSASEFKDRFTKAEADMVFGGETQLWPEV 269

Query: 143 SLYDK 147
           S Y K
Sbjct: 270 SDYFK 274


>gi|18401806|ref|NP_566600.1| oxidoreductase [Arabidopsis thaliana]
 gi|145332623|ref|NP_001078177.1| oxidoreductase [Arabidopsis thaliana]
 gi|14423468|gb|AAK62416.1|AF386971_1 Unknown protein [Arabidopsis thaliana]
 gi|9294077|dbj|BAB02034.1| unnamed protein product [Arabidopsis thaliana]
 gi|30725644|gb|AAP37844.1| At3g18210 [Arabidopsis thaliana]
 gi|332642543|gb|AEE76064.1| oxidoreductase [Arabidopsis thaliana]
 gi|332642544|gb|AEE76065.1| oxidoreductase [Arabidopsis thaliana]
          Length = 394

 Score = 40.8 bits (94), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 74/191 (38%), Gaps = 27/191 (14%)

Query: 555 NQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIHMKQ 611
           ++P P VF F ++   FC   +  ++ + +W   T     R  T   Y AV      +  
Sbjct: 163 SEPSPGVFVFDMLQPSFCEMMLAEIDNFERWVGETKFRIMRPNTMNKYGAV------LDD 216

Query: 612 VGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTI 671
            GL  +  + +  ++ P+ +  F       + +   FVV Y  D    L  H D S  T+
Sbjct: 217 FGLDTMLDKLMEGFIRPISKVFFSDVGGATLDSHHGFVVEYGKDRDVDLGFHVDDSEVTL 276

Query: 672 NIALNQVGVDYEGGGCRFIRYNCNV---TATRM----------GWMLMHPGRLTHYHEGL 718
           N+ L   G  + GG   F    C     TAT+           G  ++H GR  H H   
Sbjct: 277 NVCL---GNQFVGGELFFRGTRCEKHVNTATKADETYDYCHIPGQAVLHRGR--HRHGAR 331

Query: 719 QVTQGTRYIMI 729
             T G R  M+
Sbjct: 332 ATTCGHRVNML 342


>gi|432089369|gb|ELK23320.1| Procollagen galactosyltransferase 2 [Myotis davidii]
          Length = 162

 Score = 40.8 bits (94), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 21/70 (30%), Positives = 36/70 (51%), Gaps = 2/70 (2%)

Query: 367 RNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGAL 426
           R  A+  +  K  D+  ++D D+ L NP  L  L+  N++++AP+L      +SNFW  +
Sbjct: 89  RQAALRTAREKWSDYILFIDVDNFLTNPQTLNLLIAENKTIVAPML-ESRGLYSNFWCGI 147

Query: 427 NADG-FYARS 435
                 + RS
Sbjct: 148 TPQASLWLRS 157


>gi|326433362|gb|EGD78932.1| hypothetical protein PTSG_01907 [Salpingoeca sp. ATCC 50818]
          Length = 423

 Score = 40.4 bits (93), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 62/293 (21%), Positives = 103/293 (35%), Gaps = 61/293 (20%)

Query: 485 DMAFCT-NLR----------NKGIHLKIDSTQEYGHLVDSENFDPQKTNPEVYELIRNPL 533
           D AFC  N R          N+ + +  D      H  +      ++   E Y  I    
Sbjct: 126 DEAFCKKNARLLRSWTADELNEALAILRDERARREHAAERSKERRERIKAE-YTFITRCK 184

Query: 534 DWDLRYIHPEYQKSL------------LPDTVNNQP----------CPDVFWFPIVTEKF 571
           +  L ++ PE +  +            LP T N Q            P ++  P+ T K+
Sbjct: 185 ELQLHHLRPEIRSLMTAIEETWTMDGRLPPTTNAQALVGTARLLELSPGIYAVPMFTAKY 244

Query: 572 CHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVV-PLQ 630
           C E  + +  +   +      K L+    ++    + + ++G    ++  +   +  PL 
Sbjct: 245 CAELEKELSNFRHVAS-----KDLKQSINSMNKHGVSLHELGFTPTFSNVIMASIANPLV 299

Query: 631 ER----EFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG 686
           E     EF    H        F V Y       L  H+D S  T NI +  +   +EGG 
Sbjct: 300 EALYGAEFATLDHHKC-----FTVEYGEKADTDLSLHYDHSLITFNICITSL---FEGGD 351

Query: 687 CRFI-------RYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
            +F        R        R GW ++H GR   +H  L V  G R  +I ++
Sbjct: 352 LQFFGDSRAAPRDTPVTWRHRCGWAVIHRGR--GWHRALPVRYGHRTNIIMWL 402


>gi|356520629|ref|XP_003528963.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
           [Glycine max]
          Length = 370

 Score = 40.4 bits (93), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 43/192 (22%), Positives = 70/192 (36%), Gaps = 31/192 (16%)

Query: 552 TVNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIH--- 608
           ++ ++P P +F F I    FC   +  +E + +W          ET +  +    ++   
Sbjct: 135 SIVSEPFPGIFIFDIFQTHFCELLLSEIENFEKWVT--------ETKFRIMHPNTMNKFG 186

Query: 609 --MKQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDS 666
             +   GL  +  + +  ++ PL    F       + +   FVV Y  D    L  H D 
Sbjct: 187 AVLDDFGLETMLDKLMEGFIRPLSRVFFAEVGGSTLDSHHGFVVEYGKDRDVDLGFHVDD 246

Query: 667 STYTINIALNQVGVDYEGGGCRFIRYNCNV---TATRM----------GWMLMHPGRLTH 713
           S  T+N+ L   G  + GG   F    C     T +            G  ++H GR  H
Sbjct: 247 SEVTLNVCL---GKQFSGGELFFRGVRCEKHVNTGSHSEEIFDYSHVPGRAVLHRGR--H 301

Query: 714 YHEGLQVTQGTR 725
            H     T G R
Sbjct: 302 RHGARATTSGNR 313


>gi|163915483|gb|AAI57324.1| ogfod2 protein [Xenopus (Silurana) tropicalis]
          Length = 288

 Score = 40.4 bits (93), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 40/178 (22%), Positives = 74/178 (41%), Gaps = 21/178 (11%)

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLA-GVWA 619
           ++  P+   +FC + V+ +E + + SD           Y       I + ++G    + A
Sbjct: 74  IYRLPVFIPEFCAKLVEELENF-ERSDLPKGRPNTMNNY------GILLNELGFVDALTA 126

Query: 620 EFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
               KY+ PL    F  +    + +  +FVV+Y   E   L  H+D++  T+N++L   G
Sbjct: 127 PLCEKYIEPLTSLLFPDWGGGCLDSHRAFVVKYALQEDLDLSCHYDNAEVTLNVSL---G 183

Query: 680 VDYEGGGCRFIRYNCNVTATR--------MGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
            ++  G   F          R         G  ++H G+  H H  L ++ G R+ +I
Sbjct: 184 KEFTDGNLYFSDMKEVPVNERTYAEVEHITGQGILHRGQ--HVHGALPISSGERWNLI 239


>gi|84387462|ref|ZP_00990481.1| hypothetical protein V12B01_14976 [Vibrio splendidus 12B01]
 gi|84377715|gb|EAP94579.1| hypothetical protein V12B01_14976 [Vibrio splendidus 12B01]
          Length = 303

 Score = 40.0 bits (92), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 62/148 (41%), Gaps = 22/148 (14%)

Query: 588 GTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAEFLRKYVVPLQE---REFIGYHHEPVRA 644
           G   D R E GY A P+             + + L  Y+ P+      E +GY  +    
Sbjct: 133 GAMLDSRSE-GYLAAPS---------FQAFYRDLLDSYMRPIARLLFPEIMGYDTQT--- 179

Query: 645 PMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF---IRYNCNVTATRM 701
              F ++Y+ ++  SLR H D+S+ T+NI +N    ++ G    F        N T    
Sbjct: 180 -FGFSIQYQANKDTSLRLHTDASSVTLNINVNMPDEEFSGSELNFYDPATGKMNETIFTP 238

Query: 702 GWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
           G  ++H G +   H  L +T G R  ++
Sbjct: 239 GVAMIHRGNVA--HAALPITSGERSNLV 264


>gi|300855078|ref|YP_003780062.1| glycosyltransferase [Clostridium ljungdahlii DSM 13528]
 gi|300435193|gb|ADK14960.1| putative glycosyltransferase [Clostridium ljungdahlii DSM 13528]
          Length = 648

 Score = 40.0 bits (92), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 42/185 (22%), Positives = 77/185 (41%), Gaps = 23/185 (12%)

Query: 351 NVKYIAHNSTVNSKEARNLAVENSLHKGVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAP 410
           N+ Y   N      E +N  ++ +     D+ F +DSD  + +P  L  L+N N+ +I+ 
Sbjct: 81  NMHYWKENLIWKIAEYKNRIIDYTKKNHYDYLFLIDSDIMV-HPKTLLSLINSNKDIISE 139

Query: 411 LLVRPFKAWSNFWGALNADGFYARSFDYMN-----IINGDQGGK------------GIWN 453
           +    +  W    G L    +    + ++N     I++ D+  K            G++ 
Sbjct: 140 IF---WTRWQKDSGEL-PQVWVCDEYSFVNKERNEILSQDEFNKKYTDFIEKLKKPGVYE 195

Query: 454 VPYITNCYLM-KTSVIKATNIKTIYTLNSMDYDMAFCTNLRNKGIHLKIDSTQEYGHLVD 512
           V  +  C L+ K ++ K  N   IY L+    D  FC      G+ L +D+T    H+  
Sbjct: 196 VGGLGACTLISKEAIEKGVNFNKIYNLSFWGEDRHFCIRAAALGLKLYVDTTYPAYHIYR 255

Query: 513 SENFD 517
            +N +
Sbjct: 256 KDNLE 260


>gi|303279707|ref|XP_003059146.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458982|gb|EEH56278.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 382

 Score = 40.0 bits (92), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 50/183 (27%), Positives = 66/183 (36%), Gaps = 58/183 (31%)

Query: 603 PTRDIHMKQVGLAGVWAEFLRKYV-VPLQEREFIGYHHEPVRAPMS-FVVRYRPDE--QP 658
           PT D+      L G WA     +  +    R   G     V  P   F+V+Y   E  Q 
Sbjct: 91  PTTDLPWG--ALPGTWAVLNETWTRMEADVRARCGIKSNDVLTPNDIFLVKYDASEGGQK 148

Query: 659 SLRPHHDSSTYTINIALNQVGVDYEGGGCRFIRYNCNVTATRMG----W---MLMHPGRL 711
            LR H D ST++ N+ L+  G DY GGG R   +N   T +R G    W   +   PGR 
Sbjct: 149 GLRRHRDGSTFSFNMMLSNPG-DYGGGGTRV--WNATDTESREGRERFWRAEVTKDPGRF 205

Query: 712 ------------------------------------------THYHEGLQVTQGTRYIMI 729
                                                      + H+G+ VT GTRYI+ 
Sbjct: 206 PGVNLTRGDRMPRNFVPNIHMYPEDESTLHVLEKGQMLVGGGANVHQGVPVTTGTRYIVA 265

Query: 730 SFV 732
            FV
Sbjct: 266 GFV 268


>gi|84494478|ref|ZP_00993597.1| hypothetical protein JNB_06769 [Janibacter sp. HTCC2649]
 gi|84383971|gb|EAP99851.1| hypothetical protein JNB_06769 [Janibacter sp. HTCC2649]
          Length = 711

 Score = 39.7 bits (91), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 39/86 (45%), Gaps = 6/86 (6%)

Query: 648 FVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGG--CRFIRYNCNVTATRMGWML 705
           FV  +    +P +  H D S +T+N+ L     D   GG     +     V   R GW +
Sbjct: 612 FVRHFSERTRPFIPFHPDDSHWTVNVPLEDP--DQTSGGELVMLLDGGLRVVERRRGWAI 669

Query: 706 MHPGRLTHYHEGLQVTQGTRYIMISF 731
            HPG L H     +VT G R+ +I+F
Sbjct: 670 SHPGALIHGVR--RVTHGDRWSLIAF 693


>gi|449458771|ref|XP_004147120.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
           [Cucumis sativus]
 gi|449503401|ref|XP_004161984.1| PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like
           [Cucumis sativus]
          Length = 384

 Score = 39.7 bits (91), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 45/189 (23%), Positives = 75/189 (39%), Gaps = 27/189 (14%)

Query: 553 VNNQPCPDVFWFPIVTEKFCHEFVQIMEAYGQWSDGTN-NDKRLET--GYEAVPTRDIHM 609
           + ++P P ++ F ++  +FC + +  +E++ +W   T     R  T   Y AV      +
Sbjct: 150 IMSEPSPGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYGAV------L 203

Query: 610 KQVGLAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTY 669
              GL  +  + +  ++ P+    F       + +   FVV Y  D    L  H D S  
Sbjct: 204 DDFGLETMLDKLMDDFIRPISRVFFPEVGGATLDSHHGFVVEYGIDRDVELGFHVDDSEV 263

Query: 670 TINIALNQVGVDYEGGGCRFIRYNCNV---TATRM----------GWMLMHPGRLTHYHE 716
           T+N+ L   G  + GG   F    C+    T T+           G  ++H GR  H H 
Sbjct: 264 TLNVCL---GKQFSGGELFFRGIRCDKHVNTETQSEEIFDYLHVPGHAVLHRGR--HRHG 318

Query: 717 GLQVTQGTR 725
               T G R
Sbjct: 319 ARATTSGRR 327


>gi|390338649|ref|XP_786011.3| PREDICTED: 2-oxoglutarate and iron-dependent oxygenase
           domain-containing protein 2-like [Strongylocentrotus
           purpuratus]
          Length = 318

 Score = 39.7 bits (91), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 40/180 (22%), Positives = 76/180 (42%), Gaps = 25/180 (13%)

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
           VF FP+ T +FC  FV+ +  +        N    +     +    + + ++G  G +  
Sbjct: 99  VFSFPVFTAEFCDRFVEEITHF-------ENSPLPKGRPNTMNNYGVLLMELGFDGNFLN 151

Query: 621 FLR-KYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINIALNQVG 679
            LR  Y+ P+    +       + +  +F+V+Y+  E   L  H D++  TIN++L +  
Sbjct: 152 PLRMDYLAPIASLLYPDVGGNSLDSHRAFIVKYKLGEDVDLNYHFDNAEVTINVSLGKEF 211

Query: 680 VDYE----------GGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMI 729
            D E              ++ R+    T       L+H G+  H H  + +++G RY +I
Sbjct: 212 SDGELYFGDMRQMPRDETKYARFEHKKTIG-----LLHRGQ--HMHGAMPISEGERYNLI 264


>gi|168027274|ref|XP_001766155.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682587|gb|EDQ69004.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 390

 Score = 39.7 bits (91), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 43/189 (22%), Positives = 76/189 (40%), Gaps = 35/189 (18%)

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET-------GYEAVPTRDIHMKQVG 613
           VF F ++   FC + ++ +E + +W+     + R++         Y AV      +  +G
Sbjct: 168 VFTFSMLKPSFCSKMLEEVEHFERWA----QEARVKVMRPNTMNNYGAV------LDDIG 217

Query: 614 LAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINI 673
           +  +    + +Y+ P+    F+      +     FVV Y  D    L  H D S  T+N+
Sbjct: 218 MEVMLNHLMLRYLKPMAAVLFLNVGGSSLDTHHGFVVEYAMDRDLDLGFHVDDSEVTLNV 277

Query: 674 ALNQVGVDYEGGGC--RFIRYNCNVTATRM-----------GWMLMHPGRLTHYHEGLQV 720
            L   G  ++GG    R +R + +V                G  ++H GR  H H    +
Sbjct: 278 CL---GKKFDGGELFFRGVRCDKHVNGEARSEEVLEYSHVPGDAILHAGR--HRHGAKAI 332

Query: 721 TQGTRYIMI 729
           T G R  +I
Sbjct: 333 TSGQRTNLI 341


>gi|393227154|gb|EJD34846.1| HECT-domain-containing protein [Auricularia delicata TFB-10046 SS5]
          Length = 997

 Score = 39.7 bits (91), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 64/133 (48%), Gaps = 24/133 (18%)

Query: 112 IDGGVNDILERFNTFDANIVFGAERLCWPDTSLYDKYP---AVGSGYRYLNSGGFIGYAK 168
           IDGG   + + F T  +  VF A+R  W  TS ++ YP   ++ +   +LN   F+G   
Sbjct: 684 IDGG--GVFKEFLTSLSKEVFNADRGLWLTTSQHELYPNPMSIATEPHHLNWYRFVGR-- 739

Query: 169 DIKELISNRSIKNEEDDQLYYALLFLDETLRTKHKIVLDTLANLFQNLY----------G 218
                I  +++      ++ +A  FL + L  +    LD LA+L + LY          G
Sbjct: 740 -----ILGKALYQGILVEVAFASFFLAKWLSKQS--FLDDLASLDRELYNGLIFLKHYQG 792

Query: 219 SLEDIKLNFDLDE 231
           +LED+ LNF ++E
Sbjct: 793 NLEDLALNFTINE 805


>gi|168016296|ref|XP_001760685.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162688045|gb|EDQ74424.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 387

 Score = 39.7 bits (91), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 43/189 (22%), Positives = 77/189 (40%), Gaps = 35/189 (18%)

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLET-------GYEAVPTRDIHMKQVG 613
           VF F ++   FC + ++ +E + +W+     + R++         Y AV      +  +G
Sbjct: 169 VFTFSMLKPSFCVKMLEEVEHFERWA----QEARVKVMRPNTMNNYGAV------LDDIG 218

Query: 614 LAGVWAEFLRKYVVPLQEREFIGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTINI 673
           +  +    + +Y+ P+    F+      +     FVV Y  D    L  H D S  T+N+
Sbjct: 219 MESMLNHLMIRYLKPMAAVLFLNVGGCSLDTHHGFVVEYAMDRDLDLGFHVDDSEVTLNV 278

Query: 674 ALNQVGVDYEGGGC--RFIRYNCNVTATRM-----------GWMLMHPGRLTHYHEGLQV 720
            L   G +++GG    R +R + +V                G  ++H GR  H H    +
Sbjct: 279 CL---GKEFDGGELFFRGVRCDKHVNGEARPEEVLEYSHVPGHAILHAGR--HRHGAKAI 333

Query: 721 TQGTRYIMI 729
           T G R  +I
Sbjct: 334 TSGQRTNLI 342


>gi|320166370|gb|EFW43269.1| Ogfod2 protein [Capsaspora owczarzaki ATCC 30864]
          Length = 400

 Score = 39.3 bits (90), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 42/94 (44%), Gaps = 13/94 (13%)

Query: 647 SFVVRYRPDEQPSLRPHHDSSTYTINIALNQVGVDYEGGGCRF--------IRYNCNVTA 698
           +FVV+YR  E   L+ H D +  T+N+ L   G ++ GG   F                 
Sbjct: 276 TFVVQYRMAEDRELKFHFDDAEVTLNVCL---GTEFTGGALYFGGLFDAPETHDESLAVQ 332

Query: 699 TRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFV 732
            ++G   +H G+  H H    +T G RY MI ++
Sbjct: 333 HQLGRATLHLGK--HRHAAKPITSGERYNMIMWM 364


>gi|397630386|gb|EJK69753.1| hypothetical protein THAOC_08956, partial [Thalassiosira oceanica]
          Length = 533

 Score = 39.3 bits (90), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 30/53 (56%), Gaps = 3/53 (5%)

Query: 681 DYEGGGCRFIRYNCNVTATRMGWMLMHPGRLTHYHEGLQVTQGTRYIMISFVD 733
           +YEGGG  F      V   + G +L+HPG L  YH+G  +T G R +++ F D
Sbjct: 458 EYEGGGTYFRSLRKTVI-LQQGQVLVHPGEL--YHKGNDITYGVRCLLVCFTD 507


>gi|303285766|ref|XP_003062173.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456584|gb|EEH53885.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 174

 Score = 39.3 bits (90), Expect = 7.5,   Method: Composition-based stats.
 Identities = 43/171 (25%), Positives = 73/171 (42%), Gaps = 35/171 (20%)

Query: 561 VFWFPIVTEKFCHEFVQIMEAYGQWSDGTNNDKRLETGYEAVPTRDIHMKQVGLAGVWAE 620
           V+ F +  E+FC    + ++AY    + +   KR      A     + + ++G+ G+  +
Sbjct: 1   VYAFDLFEERFCAMLTEEVDAY----EVSGLPKRRPNTMNA---SGLIVNEIGMWGLMTD 53

Query: 621 FLRKYVVPLQEREF--------IGYHHEPVRAPMSFVVRYRPDEQPSLRPHHDSSTYTIN 672
            ++    PL    +        + +HH       SFVV Y  D+   L  HHD+S  T+N
Sbjct: 54  VVKALASPLAAALYRDEIFADSLDHHH-------SFVVHYARDKDTRLDMHHDASEVTLN 106

Query: 673 IALNQVGVD-YEGGGCRFI--------RYNCNVTATRM-GWMLMHPGRLTH 713
           +    +G D +EG G RF         R   +   + + G  +MH GR  H
Sbjct: 107 VC---IGRDHFEGAGLRFCGRFGDANHRSGPSFAVSHVPGRAVMHLGRQRH 154


>gi|441623716|ref|XP_003264022.2| PREDICTED: glycosyltransferase 25 family member 3 [Nomascus
           leucogenys]
          Length = 789

 Score = 39.3 bits (90), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 21/71 (29%), Positives = 38/71 (53%), Gaps = 6/71 (8%)

Query: 378 GVDFYFYVDSDSHLDNPDVLKYLVNRNESLIAPLLVRPFKAWSNFWGALN-----ADGFY 432
           G D+  + D+D+ L N   L+ LV +   ++AP+L      +SNFW  +      + G+Y
Sbjct: 358 GADYILFADTDNILTNNQTLRLLVGQGLPVVAPMLDS-QTYYSNFWCGITPQHSFSPGYY 416

Query: 433 ARSFDYMNIIN 443
            R+ +Y  ++N
Sbjct: 417 RRTAEYFPMLN 427


>gi|412993728|emb|CCO14239.1| predicted protein [Bathycoccus prasinos]
          Length = 816

 Score = 38.9 bits (89), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 40/144 (27%), Positives = 62/144 (43%), Gaps = 10/144 (6%)

Query: 25  KVKNIDEDKFLVITVASNETDGYKRFIQSAEVNKLQVKTLGLHQPWLGGDMSSLGGGYKV 84
           K   +D    +V   A+N        + SA+ N LQV   G      G D    G   K+
Sbjct: 483 KTHGVDSADVVVTAHATNIKSTGWVIVDSAKRNGLQVVISGN-----GTDFH--GFADKM 535

Query: 85  NLLKNELDEMDITDDMIILVTDSYDVIIDGGVNDILERFNTFDANIVFGAERLCWPDTSL 144
             LK  L    I  + I++ TD+ DV++     +   RFN  +A+ +FG E   WP+   
Sbjct: 536 MGLKAAL--HSINGNPIVINTDANDVMLQCSGQEFKNRFNQANADFIFGGETQLWPEIHA 593

Query: 145 Y-DKYPAVGSGYRYLNSGGFIGYA 167
           Y +K   +    +  ++ G IG A
Sbjct: 594 YFEKTDEIAWKEKMSDTLGKIGAA 617


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.139    0.428 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,233,670,100
Number of Sequences: 23463169
Number of extensions: 552722575
Number of successful extensions: 1179237
Number of sequences better than 100.0: 742
Number of HSP's better than 100.0 without gapping: 348
Number of HSP's successfully gapped in prelim test: 394
Number of HSP's that attempted gapping in prelim test: 1176183
Number of HSP's gapped (non-prelim): 870
length of query: 734
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 584
effective length of database: 8,839,720,017
effective search space: 5162396489928
effective search space used: 5162396489928
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)