Citrus Sinensis ID: 029399


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190----
MVMAPSMKIVFGLLTFVTFGMIIGALFQLAFIRKLEDSYGTDFPSFMRRQKNGYLQLPRGVTFWDNDKEAELLRLGYVKPEVISWSPRILVLHNFLSMEECDYLRAIARPHLQVSTVVDTKTGKGIKSNVRTSSGMFLSPEEKKYPMIQAIEKRISVFSQVPVENGELIQVLSTGMKKISITSPITITFQTLST
cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHccccccccccEEEcccccccEEEccccccEEEcccccEEEEcccccHHHHHHHHHHHcccccccEEEcccccccEEEcEEEEEcccccccccccHHHHHHHHHHHHHcccccccccccEEEEEcccccccccccccccccccc
cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccHHHHccccccccccccccccccccEEcccccEEEEccccEEEEEcccccHHHHHHHHHHHcHHHcccEEEccccccEEEccEEcccccEccccccccHHHHHHHHHHHHHcccccccccHHEEEEcccccccccccEEEEHHHccc
MVMAPSMKIVFGLLTFVTFGMIIGALFQLAFIRKLedsygtdfpsfmrrqkngylqlprgvtfwdndKEAELLRLgyvkpeviswsprILVLHNFLSMEECDYLRAIArphlqvstvvdtktgkgiksnvrtssgmflspeekkypMIQAIEKRISVfsqvpvengELIQVLSTGmkkisitspititfqtlst
MVMAPSMKIVFGLLTFVTFGMIIGALFQLAFIRKLEDSYGTDFPSFMRRQKNGYLQLPRGVTFWDNDKEAELLRLGyvkpeviswsPRILVLHNFLSMEECDYLRAIARPhlqvstvvdtktgkgiksnvrtssgmflspeEKKYPMIQAIEKRISVFSQVPVENGELIQVLStgmkkisitspititfqtlst
MVMAPSMKIVFGLLTFVTFGMIIGALFQLAFIRKLEDSYGTDFPSFMRRQKNGYLQLPRGVTFWDNDKEAELLRLGYVKPEVISWSPRILVLHNFLSMEECDYLRAIARPHLQVSTVVDTKTGKGIKSNVRTSSGMFLSPEEKKYPMIQAIEKRISVFSQVPVENGELIQVLSTGMKKisitspititFQTLST
******MKIVFGLLTFVTFGMIIGALFQLAFIRKLEDSYGTDFPSFMRRQKNGYLQLPRGVTFWDNDKEAELLRLGYVKPEVISWSPRILVLHNFLSMEECDYLRAIARPHLQVSTVVDTKTG*********************YPMIQAIEKRISVFSQVPVENGELIQVLSTGMKKISITSPITITF*****
*****SM**VFGLLTFVTFGMIIGALFQLAFI******************************FWDNDKEAELLRLGYVKPEVISWSPRILVLHNFLSMEECDYLRAIAR***************GIKSNVRTSSGMFLSPEEKKYPMIQAIEKRISVFSQVPVENGELIQVLSTGMKKISITSPITITFQTLS*
MVMAPSMKIVFGLLTFVTFGMIIGALFQLAFIRKLEDSYGTDFPSFMRRQKNGYLQLPRGVTFWDNDKEAELLRLGYVKPEVISWSPRILVLHNFLSMEECDYLRAIARPHLQVSTVVDTKTGKGIKSNVRTSSGMFLSPEEKKYPMIQAIEKRISVFSQVPVENGELIQVLSTGMKKISITSPITITFQTLST
***APSMKIVFGLLTFVTFGMIIGALFQLAFIRK*******DFPSFM***KNG*LQLPRGVTFWDNDKEAELLRLGYVKPEVISWSPRILVLHNFLSMEECDYLRAIARPHLQVSTVVDTKTGKGIKSNVRTSSGMFLSPEEKKYPMIQAIEKRISVFSQVPVENGELIQVLSTGMKKISITSPITITFQTLST
iiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSiiHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MVMAPSMKIVFGLLTFVTFGMIIGALFQLAFIRKLEDSYGTDFPSFMRRQKNGYLQLPRGVTFWDNDKEAELLRLGYVKPEVISWSPRILVLHNFLSMEECDYLRAIARPHLQVSTVVDTKTGKGIKSNVRTSSGMFLSPEEKKYPMIQAIEKRISVFSQVPVENGELIQVLSTGMKKISITSPITITFQTLST
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query194 2.2.26 [Sep-21-2011]
Q60716537 Prolyl 4-hydroxylase subu yes no 0.458 0.165 0.329 1e-06
P16924516 Prolyl 4-hydroxylase subu yes no 0.618 0.232 0.302 2e-06
Q5ZLK5534 Prolyl 4-hydroxylase subu no no 0.618 0.224 0.294 2e-06
O15460535 Prolyl 4-hydroxylase subu yes no 0.458 0.166 0.318 5e-06
P54001534 Prolyl 4-hydroxylase subu no no 0.453 0.164 0.322 6e-06
Q20065 539 Prolyl 4-hydroxylase subu yes no 0.551 0.198 0.247 6e-05
Q10576 559 Prolyl 4-hydroxylase subu no no 0.5 0.173 0.252 0.0001
Q1RMU3534 Prolyl 4-hydroxylase subu no no 0.453 0.164 0.288 0.0008
Q5RAG8534 Prolyl 4-hydroxylase subu no no 0.453 0.164 0.288 0.0009
P13674534 Prolyl 4-hydroxylase subu no no 0.453 0.164 0.288 0.0009
>sp|Q60716|P4HA2_MOUSE Prolyl 4-hydroxylase subunit alpha-2 OS=Mus musculus GN=P4ha2 PE=2 SV=1 Back     alignment and function desciption
 Score = 53.1 bits (126), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 52/91 (57%), Gaps = 2/91 (2%)

Query: 86  SPRILVLHNFLSMEECDYLRAIARPHLQVSTVVDTKTGKGIKSNVRTSSGMFLSPEEKKY 145
           SP I+  ++ +S EE + ++ IA+P L  +TV D KTG    ++ R S   +L  EE   
Sbjct: 337 SPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWL--EEDDD 394

Query: 146 PMIQAIEKRISVFSQVPVENGELIQVLSTGM 176
           P++  + +R+   + + V+  EL+QV + GM
Sbjct: 395 PVVARVNRRMQHITGLTVKTAELLQVANYGM 425




Catalyzes the post-translational formation of 4-hydroxyproline in -Xaa-Pro-Gly- sequences in collagens and other proteins.
Mus musculus (taxid: 10090)
EC: 1EC: .EC: 1EC: 4EC: .EC: 1EC: 1EC: .EC: 2
>sp|P16924|P4HA1_CHICK Prolyl 4-hydroxylase subunit alpha-1 OS=Gallus gallus GN=P4HA1 PE=1 SV=1 Back     alignment and function description
>sp|Q5ZLK5|P4HA2_CHICK Prolyl 4-hydroxylase subunit alpha-2 OS=Gallus gallus GN=P4HA2 PE=2 SV=1 Back     alignment and function description
>sp|O15460|P4HA2_HUMAN Prolyl 4-hydroxylase subunit alpha-2 OS=Homo sapiens GN=P4HA2 PE=1 SV=1 Back     alignment and function description
>sp|P54001|P4HA1_RAT Prolyl 4-hydroxylase subunit alpha-1 OS=Rattus norvegicus GN=P4ha1 PE=2 SV=2 Back     alignment and function description
>sp|Q20065|P4HA2_CAEEL Prolyl 4-hydroxylase subunit alpha-2 OS=Caenorhabditis elegans GN=phy-2 PE=1 SV=1 Back     alignment and function description
>sp|Q10576|P4HA1_CAEEL Prolyl 4-hydroxylase subunit alpha-1 OS=Caenorhabditis elegans GN=dpy-18 PE=1 SV=2 Back     alignment and function description
>sp|Q1RMU3|P4HA1_BOVIN Prolyl 4-hydroxylase subunit alpha-1 OS=Bos taurus GN=P4HA1 PE=1 SV=1 Back     alignment and function description
>sp|Q5RAG8|P4HA1_PONAB Prolyl 4-hydroxylase subunit alpha-1 OS=Pongo abelii GN=P4HA1 PE=2 SV=1 Back     alignment and function description
>sp|P13674|P4HA1_HUMAN Prolyl 4-hydroxylase subunit alpha-1 OS=Homo sapiens GN=P4HA1 PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query194
255641158195 unknown [Glycine max] 0.989 0.984 0.733 6e-80
363814557238 uncharacterized protein LOC100794585 [Gl 0.876 0.714 0.763 6e-73
356576923287 PREDICTED: prolyl 4-hydroxylase subunit 0.876 0.592 0.757 1e-72
224069056287 predicted protein [Populus trichocarpa] 0.876 0.592 0.763 2e-72
255637879287 unknown [Glycine max] 0.876 0.592 0.763 2e-72
15224220283 P4H isoform 1 [Arabidopsis thaliana] gi| 0.860 0.590 0.744 6e-67
449468746290 PREDICTED: putative prolyl 4-hydroxylase 0.881 0.589 0.718 3e-65
297824279283 AT-P4H-1 [Arabidopsis lyrata subsp. lyra 0.860 0.590 0.726 1e-64
225433714287 PREDICTED: prolyl 4-hydroxylase subunit 0.876 0.592 0.664 1e-61
357445147281 Prolyl 4-hydroxylase subunit alpha-1 [Me 0.835 0.576 0.705 3e-61
>gi|255641158|gb|ACU20856.1| unknown [Glycine max] Back     alignment and taxonomy information
 Score =  301 bits (772), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 143/195 (73%), Positives = 171/195 (87%), Gaps = 3/195 (1%)

Query: 3   MAPSMKIVFGLLTFVTFGMIIGALFQLAFIRKLEDSYGTDFPSFMRRQ---KNGYLQLPR 59
           MAP+M+IVFGLLTFVT GMIIGAL QLA IR+LEDS+GTD   F R +    + +LQLPR
Sbjct: 1   MAPAMRIVFGLLTFVTVGMIIGALSQLAIIRRLEDSHGTDSLPFSRLRGLDTDRHLQLPR 60

Query: 60  GVTFWDNDKEAELLRLGYVKPEVISWSPRILVLHNFLSMEECDYLRAIARPHLQVSTVVD 119
           G+ FW+NDKEAE+LRLGYVKPEV++WSPRI++LHNFLSMEECDYLRAIA P L +S VVD
Sbjct: 61  GIPFWNNDKEAEVLRLGYVKPEVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVD 120

Query: 120 TKTGKGIKSNVRTSSGMFLSPEEKKYPMIQAIEKRISVFSQVPVENGELIQVLSTGMKKI 179
           TKTGKGIKS+VRTSSGMFL+P+E+KYPM+QAIEKRISV+SQ+P+ENGEL+QVLSTG ++I
Sbjct: 121 TKTGKGIKSDVRTSSGMFLNPQERKYPMVQAIEKRISVYSQIPIENGELMQVLSTGTRRI 180

Query: 180 SITSPITITFQTLST 194
           +IT+PI   F  LST
Sbjct: 181 NITNPIMTIFLILST 195




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max] gi|255628535|gb|ACU14612.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max] Back     alignment and taxonomy information
>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa] gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255637879|gb|ACU19258.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana] gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana] gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana] gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana] gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana] gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata] gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera] gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula] gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query194
TAIR|locus:2041001283 AT-P4H-1 "P4H isoform 1" [Arab 0.860 0.590 0.744 1.8e-62
TAIR|locus:2156852289 AT5G66060 [Arabidopsis thalian 0.463 0.311 0.586 3.4e-22
TAIR|locus:2012903287 AT1G20270 [Arabidopsis thalian 0.463 0.313 0.543 4.9e-21
TAIR|locus:2827906291 P4H5 "prolyl 4-hydroxylase 5" 0.463 0.309 0.554 2.1e-20
TAIR|locus:2125329290 AT4G35810 [Arabidopsis thalian 0.463 0.310 0.543 2.1e-20
TAIR|locus:2125344272 AT4G35820 [Arabidopsis thalian 0.469 0.334 0.435 1.8e-14
TAIR|locus:2081106 299 P4H2 "prolyl 4-hydroxylase 2" 0.474 0.307 0.425 2.3e-13
TAIR|locus:2088892 288 AT3G28490 [Arabidopsis thalian 0.453 0.305 0.406 2.4e-13
TAIR|locus:2144960 298 AT5G18900 [Arabidopsis thalian 0.474 0.308 0.425 6.5e-13
TAIR|locus:2118889288 AT4G33910 [Arabidopsis thalian 0.5 0.336 0.3 1.9e-07
TAIR|locus:2041001 AT-P4H-1 "P4H isoform 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 638 (229.6 bits), Expect = 1.8e-62, P = 1.8e-62
 Identities = 128/172 (74%), Positives = 147/172 (85%)

Query:     3 MAPSMKIVFGLLTFVTFGMIIGALFQLAFIRKLEDSYGTDFPSF--MRRQKNGYLQLPRG 60
             MAP+MKIVFGLLTFVT GM+IG+L QLAFI +LEDSYGT FPS   +R Q   YL   R 
Sbjct:     1 MAPAMKIVFGLLTFVTVGMVIGSLLQLAFINRLEDSYGTGFPSLRGLRGQNTRYL---RD 57

Query:    61 VTFWDNDKEAELLRLGYVKPEVISWSPRILVLHNFLSMEECDYLRAIARPHLQVSTVVDT 120
             V+ W NDK+AELLR+G VKPEV+SWSPRI+VLH+FLS EEC+YL+AIARP LQVSTVVD 
Sbjct:    58 VSRWANDKDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDV 117

Query:   121 KTGKGIKSNVRTSSGMFLSPEEKKYPMIQAIEKRISVFSQVPVENGELIQVL 172
             KTGKG+KS+VRTSSGMFL+  E+ YP+IQAIEKRI+VFSQVP ENGELIQVL
Sbjct:   118 KTGKGVKSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVL 169




GO:0005506 "iron ion binding" evidence=IEA
GO:0005576 "extracellular region" evidence=ISM
GO:0016491 "oxidoreductase activity" evidence=IEA
GO:0016705 "oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen" evidence=IEA
GO:0016706 "oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors" evidence=IEA;ISS
GO:0018401 "peptidyl-proline hydroxylation to 4-hydroxy-L-proline" evidence=ISS;IDA
GO:0031418 "L-ascorbic acid binding" evidence=IEA
GO:0055114 "oxidation-reduction process" evidence=IEA
GO:0004656 "procollagen-proline 4-dioxygenase activity" evidence=IDA
GO:0005794 "Golgi apparatus" evidence=IDA
GO:0006487 "protein N-linked glycosylation" evidence=RCA
TAIR|locus:2156852 AT5G66060 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2012903 AT1G20270 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2827906 P4H5 "prolyl 4-hydroxylase 5" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2125329 AT4G35810 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2125344 AT4G35820 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2081106 P4H2 "prolyl 4-hydroxylase 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2088892 AT3G28490 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2144960 AT5G18900 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2118889 AT4G33910 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query194
PLN00052 310 PLN00052, PLN00052, prolyl 4-hydroxylase; Provisio 5e-23
smart00702165 smart00702, P4Hc, Prolyl 4-hydroxylase alpha subun 2e-09
>gnl|CDD|177683 PLN00052, PLN00052, prolyl 4-hydroxylase; Provisional Back     alignment and domain information
 Score = 93.2 bits (231), Expect = 5e-23
 Identities = 43/90 (47%), Positives = 62/90 (68%), Gaps = 2/90 (2%)

Query: 83  ISWSPRILVLHNFLSMEECDYLRAIARPHLQVSTVVDTKTGKGIKSNVRTSSGMFLSPEE 142
           +SW PRI V   FLS  ECD+L  +A+  +Q S V D K+GK + S VRTSSGMFL  ++
Sbjct: 50  VSWQPRIFVYKGFLSDAECDHLVKLAKKKIQRSMVADNKSGKSVMSEVRTSSGMFL--DK 107

Query: 143 KKYPMIQAIEKRISVFSQVPVENGELIQVL 172
           ++ P++  IE+RI+ ++ +P EN E IQ+L
Sbjct: 108 RQDPVVSRIEERIAAWTFLPEENAENIQIL 137


Length = 310

>gnl|CDD|214780 smart00702, P4Hc, Prolyl 4-hydroxylase alpha subunit homologues Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 194
KOG1591289 consensus Prolyl 4-hydroxylase alpha subunit [Amin 100.0
PLN00052 310 prolyl 4-hydroxylase; Provisional 99.97
smart00702178 P4Hc Prolyl 4-hydroxylase alpha subunit homologues 99.76
PRK05467 226 Fe(II)-dependent oxygenase superfamily protein; Pr 97.75
PF13532194 2OG-FeII_Oxy_2: 2OG-Fe(II) oxygenase superfamily; 87.42
PHA02813 354 hypothetical protein; Provisional 83.32
PHA02869 418 C4L/C10L-like gene family protein; Provisional 83.03
>KOG1591 consensus Prolyl 4-hydroxylase alpha subunit [Amino acid transport and metabolism] Back     alignment and domain information
Probab=100.00  E-value=2.6e-36  Score=264.07  Aligned_cols=141  Identities=42%  Similarity=0.617  Sum_probs=124.9

Q ss_pred             ccccccccCCCc------------eEEEeCCCCcceEEeecceeEEeecCccEEEEcCCCCHHHHHHHHHHhcCCccccE
Q 029399           49 RQKNGYLQLPRG------------VTFWDNDKEAELLRLGYVKPEVISWSPRILVLHNFLSMEECDYLRAIARPHLQVST  116 (194)
Q Consensus        49 ~~~~~y~~lcrg------------l~C~y~~~~~pfl~LaP~K~E~LS~~P~I~~~hdfLSd~Ecd~Li~lA~p~L~rS~  116 (194)
                      .++..|+..|||            +.|++.+.  ||++++|+|+|++||+|+|++||||||++|||+|+.+|+|+|++++
T Consensus        49 ~~~~~~~~~c~g~~~~~~~~~~~~~~~~~~~~--~~~~~ap~k~E~lsw~P~~~~yhd~ls~~e~d~l~~lak~~l~~st  126 (289)
T KOG1591|consen   49 QEFTVYEQGCRGELPPLTKLTLRRLSCRNRAG--PFLRLAPVKLEELSWDPRVVLYHDFLSDEECDHLISLAKPKLERST  126 (289)
T ss_pred             ccccchhhhccCccCccchhHhhhhhcccccC--cceeecchhhhhcccCCceEeehhcCCHHHHHHHHHhhhhhhhcee
Confidence            456678888887            45665544  9999999999999999999999999999999999999999999999


Q ss_pred             E-eeCCCCCeeecceeeeeeeecCCCCCcchHHHHHHHHHHHhcCCCCCCCccceEEEeCCCcccccc------------
Q 029399          117 V-VDTKTGKGIKSNVRTSSGMFLSPEEKKYPMIQAIEKRISVFSQVPVENGELIQVLSTGMKKISITS------------  183 (194)
Q Consensus       117 V-~~~~tG~~~~s~~RtS~~awL~~~~~~~pvv~rI~~Ria~ltGl~~~~~E~LQV~nYg~Gg~~~~~------------  183 (194)
                      | .+.++|....+.+|+|+++|+..++  ++++++|++||+++||+|.+++|+|||+|||+||+|..|            
T Consensus       127 v~~~~~~~~~~~~~~R~S~~t~l~~~~--~~~~~~i~~ri~~~T~l~~e~~E~lqVlnYg~Gg~Y~~H~D~~~~~~~~~~  204 (289)
T KOG1591|consen  127 VVADKGTGHSTTSAVRTSSGTFLPDGA--SPVVSRIEQRIADLTGLPVENGESLQVLNYGLGGHYEPHYDYFLPEEDETF  204 (289)
T ss_pred             eeccCCcccccceeeEecceeEecCCC--CHHHHHHHHHHHhccCCCcccCccceEEEecCCccccccccccccccchhh
Confidence            9 4556677778889999999999954  899999999999999999999999999999999998555            


Q ss_pred             -------ceeEEEEecc
Q 029399          184 -------PITITFQTLS  193 (194)
Q Consensus       184 -------~~~~~f~~~~  193 (194)
                             ||+|+-.|||
T Consensus       205 ~~~~~g~RiaT~l~yls  221 (289)
T KOG1591|consen  205 NGLNGGNRIATVLMYLS  221 (289)
T ss_pred             hhcccCCcceeEEEEec
Confidence                   6777777776



>PLN00052 prolyl 4-hydroxylase; Provisional Back     alignment and domain information
>smart00702 P4Hc Prolyl 4-hydroxylase alpha subunit homologues Back     alignment and domain information
>PRK05467 Fe(II)-dependent oxygenase superfamily protein; Provisional Back     alignment and domain information
>PF13532 2OG-FeII_Oxy_2: 2OG-Fe(II) oxygenase superfamily; PDB: 2IUW_A 3BTZ_A 3RZL_A 3RZH_A 3S5A_A 3RZG_A 3RZJ_A 3BUC_A 3H8X_A 3H8R_A Back     alignment and domain information
>PHA02813 hypothetical protein; Provisional Back     alignment and domain information
>PHA02869 C4L/C10L-like gene family protein; Provisional Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query194
3gze_A225 Algal Prolyl 4-Hydroxylase Complexed With Zinc And 4e-18
2jij_A233 Crystal Structure Of The Apo Form Of Chlamydomonas 5e-18
2jig_A224 Crystal Structure Of Chlamydomonas Reinhardtii Prol 5e-18
2v4a_A233 Crystal Structure Of The Semet-Labeled Prolyl-4 Hyd 8e-18
3itq_A216 Crystal Structure Of A Prolyl 4-Hydroxylase From Ba 2e-08
>pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And (Ser-Pro)5 Peptide Substrate Length = 225 Back     alignment and structure

Iteration: 1

Score = 87.4 bits (215), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 42/90 (46%), Positives = 61/90 (67%), Gaps = 2/90 (2%) Query: 83 ISWSPRILVLHNFLSMEECDYLRAIARPHLQVSTVVDTKTGKGIKSNVRTSSGMFLSPEE 142 +SWSPR +L NFLS EECDY+ ARP + S+VVD ++GK + S +RTS+G + + E Sbjct: 17 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWFAKGE 76 Query: 143 KKYPMIQAIEKRISVFSQVPVENGELIQVL 172 +I IEKR++ + +P+EN E +QVL Sbjct: 77 DS--VISKIEKRVAQVTMIPLENHEGLQVL 104
>pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas Reinhardtii Prolyl-4 Hydroxylase Type I Length = 233 Back     alignment and structure
>pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4 Hydroxylase Type I Complexed With Zinc And Pyridine-2,4- Dicarboxylate Length = 224 Back     alignment and structure
>pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4 Hydroxylase (P4h) Type I From Green Algae Chlamydomonas Reinhardtii. Length = 233 Back     alignment and structure
>pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus Anthracis Length = 216 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query194
2jig_A224 Prolyl-4 hydroxylase; hydrolase; HET: PD2; 1.85A { 3e-33
3itq_A216 Prolyl 4-hydroxylase, alpha subunit domain protei; 1e-31
>2jig_A Prolyl-4 hydroxylase; hydrolase; HET: PD2; 1.85A {Chlamydomonas reinhardtii} PDB: 3gze_A 2v4a_A 2jij_A Length = 224 Back     alignment and structure
 Score =  117 bits (294), Expect = 3e-33
 Identities = 41/94 (43%), Positives = 61/94 (64%), Gaps = 2/94 (2%)

Query: 78  VKPEVISWSPRILVLHNFLSMEECDYLRAIARPHLQVSTVVDTKTGKGIKSNVRTSSGMF 137
            +   +SWSPR  +L NFLS EECDY+   ARP +  S+VVD ++GK + S +RTS+G +
Sbjct: 11  GEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTW 70

Query: 138 LSPEEKKYPMIQAIEKRISVFSQVPVENGELIQV 171
            +  E    +I  IEKR++  + +P+EN E +QV
Sbjct: 71  FAKGE--DSVISKIEKRVAQVTMIPLENHEGLQV 102


>3itq_A Prolyl 4-hydroxylase, alpha subunit domain protei; double-stranded beta helix, alpha-keto dependent non-heme iron oxygenase; 1.40A {Bacillus anthracis str} Length = 216 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query194
2jig_A224 Prolyl-4 hydroxylase; hydrolase; HET: PD2; 1.85A { 99.95
3itq_A216 Prolyl 4-hydroxylase, alpha subunit domain protei; 99.93
3dkq_A 243 PKHD-type hydroxylase SBAL_3634; putative oxygenas 99.25
2hbt_A247 EGL nine homolog 1; prolyl hydroxylase, hypoxia in 98.62
2iuw_A238 Alkylated repair protein ALKB homolog 3; oxidoredu 92.11
3s57_A204 Alpha-ketoglutarate-dependent dioxygenase ALKB HO; 81.24
>2jig_A Prolyl-4 hydroxylase; hydrolase; HET: PD2; 1.85A {Chlamydomonas reinhardtii} PDB: 3gze_A 2v4a_A 2jij_A Back     alignment and structure
Probab=99.95  E-value=2.2e-27  Score=197.72  Aligned_cols=118  Identities=36%  Similarity=0.545  Sum_probs=110.4

Q ss_pred             EeecceeEEeecCccEEEEcCCCCHHHHHHHHHHhcCCccccEEeeCCCCCeeecceeeeeeeecCCCCCcchHHHHHHH
Q 029399           74 RLGYVKPEVISWSPRILVLHNFLSMEECDYLRAIARPHLQVSTVVDTKTGKGIKSNVRTSSGMFLSPEEKKYPMIQAIEK  153 (194)
Q Consensus        74 ~LaP~K~E~LS~~P~I~~~hdfLSd~Ecd~Li~lA~p~L~rS~V~~~~tG~~~~s~~RtS~~awL~~~~~~~pvv~rI~~  153 (194)
                      .++|.|+|+||++|+|+++|||||++||++|++++++++++|++.+..+|+...+.+|+|+++||+..+  ++++++|.+
T Consensus         7 ~~~~~k~e~ls~~P~i~~~~~fLs~~Ec~~li~~~~~~~~~s~v~~~~~g~~~~~~~R~s~~~~l~~~~--~~~~~~i~~   84 (224)
T 2jig_A            7 EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWFAKGE--DSVISKIEK   84 (224)
T ss_dssp             CCCCCCEEEEETTTTEEEETTCSCHHHHHHHHHHHGGGCEECEEEETTTTEEEECSSBCSEEEECCTTC--SHHHHHHHH
T ss_pred             cccCcceEEEeCCCCEEEEcccCCHHHHHHHHHHhhccCeeeeeecCCCCcccccCCEEEeeeEecCCC--CHHHHHHHH
Confidence            578999999999999999999999999999999999999999999987788888899999999999865  799999999


Q ss_pred             HHHHhcCCCCCCCccceEEEeCCCcccc-----------------ccceeEEEEecc
Q 029399          154 RISVFSQVPVENGELIQVLSTGMKKISI-----------------TSPITITFQTLS  193 (194)
Q Consensus       154 Ria~ltGl~~~~~E~LQV~nYg~Gg~~~-----------------~~~~~~~f~~~~  193 (194)
                      ||++++|+|....|.+||+||++||+|.                 .+|+.|+.+||+
T Consensus        85 ri~~~~gl~~~~~e~~qv~rY~~G~~y~~H~D~~~~~~~~~~~~~~~R~~T~l~YLn  141 (224)
T 2jig_A           85 RVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQRVVTMLMYLT  141 (224)
T ss_dssp             HHHHHHTCCGGGBCCCEEEEEETTCCEEEECCSSCCTTSSSCCCCSCEEEEEEEECS
T ss_pred             HHHHHhCCCcccccceEEEecCCCccccCcccCCCCccccccccCCCeEEEEEEEec
Confidence            9999999999999999999999999973                 468899999997



>3itq_A Prolyl 4-hydroxylase, alpha subunit domain protei; double-stranded beta helix, alpha-keto dependent non-heme iron oxygenase; 1.40A {Bacillus anthracis str} Back     alignment and structure
>3dkq_A PKHD-type hydroxylase SBAL_3634; putative oxygenase, structural genomics, JOI for structural genomics, JCSG; 2.26A {Shewanella baltica OS155} Back     alignment and structure
>2hbt_A EGL nine homolog 1; prolyl hydroxylase, hypoxia inducible factor, HIF, 2- oxoglutarate, oxygenase, oxidoreductase; HET: UN9; 1.60A {Homo sapiens} PDB: 2hbu_A* 2g1m_A* 3hqu_A* 3hqr_A* 2y33_A* 2y34_A* 2g19_A* 3ouj_A* 3ouh_A* 3oui_A* Back     alignment and structure
>2iuw_A Alkylated repair protein ALKB homolog 3; oxidoreductase, DNA/RNA repair, demethylase, beta jellyroll; HET: AKG; 1.50A {Homo sapiens} SCOP: b.82.2.10 Back     alignment and structure
>3s57_A Alpha-ketoglutarate-dependent dioxygenase ALKB HO; protein-DNA complex, jelly-roll fold, dioxygenase, dsDNA BIN plasma, oxidoreductase-DNA complex; HET: AKG; 1.60A {Homo sapiens} PDB: 3s5a_A* 3rzg_A 3rzl_A 3rzh_A* 3rzj_A* 3rzk_A* 3rzm_A 3bty_A* 3buc_A* 3h8r_A* 3h8o_A* 3h8x_A* 3btx_A* 3bu0_A* 3btz_A* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query194
d2iuwa1210 AlkB homolog 3 {Human (Homo sapiens) [TaxId: 9606] 95.76
>d2iuwa1 b.82.2.10 (A:70-279) AlkB homolog 3 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: All beta proteins
fold: Double-stranded beta-helix
superfamily: Clavaminate synthase-like
family: AlkB-like
domain: AlkB homolog 3
species: Human (Homo sapiens) [TaxId: 9606]
Probab=95.76  E-value=0.033  Score=42.64  Aligned_cols=101  Identities=14%  Similarity=0.093  Sum_probs=60.4

Q ss_pred             eeEEeecCcc----EEEEcCCCCHHHHHHHHHHhcC--CccccEEeeCCCCCeeec----------ceeeeeeeecCCCC
Q 029399           79 KPEVISWSPR----ILVLHNFLSMEECDYLRAIARP--HLQVSTVVDTKTGKGIKS----------NVRTSSGMFLSPEE  142 (194)
Q Consensus        79 K~E~LS~~P~----I~~~hdfLSd~Ecd~Li~lA~p--~L~rS~V~~~~tG~~~~s----------~~RtS~~awL~~~~  142 (194)
                      +.-++++.|.    +.++.|||+++|.+.|.+....  .+.+.....  -|+....          .++.+...+.....
T Consensus         7 ~~~~~~~~p~g~~~l~y~p~fLs~~e~~~Ll~~L~~e~~w~~~~~~~--~g~~~~~pR~~~~~~d~~y~y~~~~~~~~~~   84 (210)
T d2iuwa1           7 GVYEISLSPTGVSRVCLYPGFVDVKEADWILEQLCQDVPWKQRTGIR--EDITYQQPRLTAWYGELPYTYSRITMEPNPH   84 (210)
T ss_dssp             EEEEEECCSSSCEEEEEETTSSCHHHHHHHHHHHHHHSCCBCCEEES--SSCEEECSSEEEEEECCCTTSCHHHHCCBSS
T ss_pred             CceEeccCCCCcceEEEECCcCCHHHHHHHHHHHHhhCCccccceec--cCcccceeeeeEEecCcCccccccccccCCC
Confidence            4557788885    8999999999999999875421  222222211  1121111          11212111111110


Q ss_pred             CcchHHHHHHHHHHHhcCCCCCCCccceEEEeCCCccccccce
Q 029399          143 KKYPMIQAIEKRISVFSQVPVENGELIQVLSTGMKKISITSPI  185 (194)
Q Consensus       143 ~~~pvv~rI~~Ria~ltGl~~~~~E~LQV~nYg~Gg~~~~~~~  185 (194)
                       -.+.+..+.+++...++.+.++   ..+.+|..|+..+..|.
T Consensus        85 -~~~~l~~l~~~~~~~~~~~~~~---~~ln~Y~~~~~~I~~H~  123 (210)
T d2iuwa1          85 -WHPVLRTLKNRIEENTGHTFNS---LLCNLYRNEKDSVDWHS  123 (210)
T ss_dssp             -CCHHHHHHHHHHHHHHSCCCCE---EEEEEECSTTCCEEEEC
T ss_pred             -CcHHHHHHHHhhhhhcCccchh---hhhhccccCCCccccCc
Confidence             1478999999999999887543   46788988888766553