Citrus Sinensis ID: 045722


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--
MKWGQFLWLVESMNKNNAIANMSETKLSMHAGPHVDAPDHMFDHYFDVDSLDLEQFQHCFNWICYICPELLVNFTKDKNILN
cccccHHHHHHHHHHcccccEEcEEEEEEcccccccccccccccccccccccccccccccEEEEEcccEEEEEccccccccc
cccccEEEHHHHHHcccccccccEEEEEccccccccccccccccccccccccHHHHcccccEEEccccEEEEEccccccccc
MKWGQFLWLVESMNKNNAIANMSetklsmhagphvdapdhmfdhyfdvdsldleqFQHCFNWICYICPELLVNFTKDKNILN
MKWGQFLWLVESMNKNNAIANMSETKLSMHAGPHVDAPDHMFDHYFDVDSLDLEQFQHCFNWICYICPELLVNFTKDKNILN
MKWGQFLWLVESMNKNNAIANMSETKLSMHAGPHVDAPDHMFDHYFDVDSLDLEQFQHCFNWICYICPELLVNFTKDKNILN
**WGQFLWLVESMN********************VDAPDHMFDHYFDVDSLDLEQFQHCFNWICYICPELLVNFTK******
*KWGQFLWLVESMNKNNAIANMSETKLSMHAGPHVDAPDHMFDHYFDVDSLDLEQFQHCFNWICYICPELLVNF*K******
MKWGQFLWLVESMNKNNAIANMSETKLSMHAGPHVDAPDHMFDHYFDVDSLDLEQFQHCFNWICYICPELLVNFTKDKNILN
MKWGQFLWLVESMNKNNAIANMSETKLSMHAGPHVDAPDHMFDHYFDVDSLDLEQFQHCFNWICYICPELLVNFTKD*****
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKWGQFLWLVESMNKNNAIANMSETKLSMHAGPHVDAPDHMFDHYFDVDSLDLEQFQHCFNWICYICPELLVNFTKDKNILN
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query82
449438375 272 PREDICTED: kynurenine formamidase-like [ 0.817 0.246 0.543 1e-13
388504058 254 unknown [Medicago truncatula] 0.817 0.263 0.567 1e-13
359475423 270 PREDICTED: LOW QUALITY PROTEIN: kynureni 0.817 0.248 0.530 2e-13
449533619 270 PREDICTED: kynurenine formamidase-like, 0.817 0.248 0.543 2e-13
147838051 246 hypothetical protein VITISV_019335 [Viti 0.817 0.272 0.530 2e-12
255555315 273 conserved hypothetical protein [Ricinus 0.817 0.245 0.518 2e-12
356536188 255 PREDICTED: LOW QUALITY PROTEIN: kynureni 0.817 0.262 0.518 3e-12
449438373 270 PREDICTED: kynurenine formamidase-like [ 0.817 0.248 0.506 6e-12
255555311 274 conserved hypothetical protein [Ricinus 0.804 0.240 0.524 1e-11
255555313 260 conserved hypothetical protein [Ricinus 0.817 0.257 0.456 1e-11
>gi|449438375|ref|XP_004136964.1| PREDICTED: kynurenine formamidase-like [Cucumis sativus] Back     alignment and taxonomy information
 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 44/81 (54%), Positives = 51/81 (62%), Gaps = 14/81 (17%)

Query: 4   GQFLWLVESMNKNNAIANMSETKLSMHAGPHVDAPDHMFDHY----FDVDSLDLEQFQHC 59
           GQFLWL +SM KN ++AN SE KL  H G HVDAP H+FDHY    FDVD+LDLE     
Sbjct: 78  GQFLWLPKSM-KNGSLANNSEMKLPAHTGTHVDAPGHVFDHYFDAGFDVDTLDLEVLNG- 135

Query: 60  FNWICYICPELLVNFTKDKNI 80
                   P LLV+  +DKNI
Sbjct: 136 --------PGLLVDVPRDKNI 148




Source: Cucumis sativus

Species: Cucumis sativus

Genus: Cucumis

Family: Cucurbitaceae

Order: Cucurbitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|388504058|gb|AFK40095.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|359475423|ref|XP_003631683.1| PREDICTED: LOW QUALITY PROTEIN: kynurenine formamidase-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|449533619|ref|XP_004173770.1| PREDICTED: kynurenine formamidase-like, partial [Cucumis sativus] Back     alignment and taxonomy information
>gi|147838051|emb|CAN60920.1| hypothetical protein VITISV_019335 [Vitis vinifera] Back     alignment and taxonomy information
>gi|255555315|ref|XP_002518694.1| conserved hypothetical protein [Ricinus communis] gi|223542075|gb|EEF43619.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356536188|ref|XP_003536621.1| PREDICTED: LOW QUALITY PROTEIN: kynurenine formamidase-like [Glycine max] Back     alignment and taxonomy information
>gi|449438373|ref|XP_004136963.1| PREDICTED: kynurenine formamidase-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|255555311|ref|XP_002518692.1| conserved hypothetical protein [Ricinus communis] gi|223542073|gb|EEF43617.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|255555313|ref|XP_002518693.1| conserved hypothetical protein [Ricinus communis] gi|223542074|gb|EEF43618.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query82
TAIR|locus:2122053 272 AT4G35220 "AT4G35220" [Arabido 0.817 0.246 0.493 3.5e-12
TAIR|locus:2124231 255 AT4G34180 "AT4G34180" [Arabido 0.792 0.254 0.493 4.9e-11
TAIR|locus:2823650 271 AT1G44542 "AT1G44542" [Arabido 0.817 0.247 0.493 8.5e-11
TAIR|locus:2122053 AT4G35220 "AT4G35220" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 165 (63.1 bits), Expect = 3.5e-12, P = 3.5e-12
 Identities = 40/81 (49%), Positives = 50/81 (61%)

Query:     4 GQFLWLVESMNKNNAIANMSETKLSMHAGPHVDAPDHMFDHY----FDVDSLDLEQFQHC 59
             G+FLWL  SM KN ++AN SE K+  H G HVD+P H++D Y    FDVDSLDL+     
Sbjct:    79 GRFLWLAASM-KNGSLANNSEMKIPTHTGTHVDSPGHVYDKYYDAGFDVDSLDLQ----V 133

Query:    60 FNWICYICPELLVNFTKDKNI 80
              N +      LLV+  KDKNI
Sbjct:   134 LNGLA-----LLVDVPKDKNI 149




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0005794 "Golgi apparatus" evidence=IDA
GO:0005783 "endoplasmic reticulum" evidence=IDA
TAIR|locus:2124231 AT4G34180 "AT4G34180" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2823650 AT1G44542 "AT1G44542" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query82
pfam04199162 pfam04199, Cyclase, Putative cyclase 3e-05
COG1878 218 COG1878, COG1878, Kynurenine formamidase [Amino ac 1e-04
>gnl|CDD|217960 pfam04199, Cyclase, Putative cyclase Back     alignment and domain information
 Score = 39.3 bits (92), Expect = 3e-05
 Identities = 16/59 (27%), Positives = 21/59 (35%), Gaps = 5/59 (8%)

Query: 3  WGQFLWLVE---SMNKNNAIANMSETKLSMHAGPHVDAPDHMFD--HYFDVDSLDLEQF 56
          +       E      ++    N  +   S H G HVDAP H         +D L LE F
Sbjct: 15 YPGHPTSGEAGDGTGEDGDGYNTDKITTSTHTGTHVDAPGHFIFGPDGRTIDELPLEWF 73


Proteins in this family are thought to be cyclase enzymes. They are found in proteins involved in antibiotic synthesis. However they are also found in organisms that do not make antibiotics pointing to a wider role for these proteins. The proteins contain a conserved motif HXGTHXDXPXH that is likely to form part of the active site. Length = 162

>gnl|CDD|224790 COG1878, COG1878, Kynurenine formamidase [Amino acid transport and metabolism] Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 82
COG1878 218 Kynurenine formamidase [Amino acid transport and m 99.78
TIGR03035 206 trp_arylform arylformamidase. One of several pathw 99.75
PF04199171 Cyclase: Putative cyclase; InterPro: IPR007325 Pro 99.67
>COG1878 Kynurenine formamidase [Amino acid transport and metabolism] Back     alignment and domain information
Probab=99.78  E-value=2.9e-19  Score=132.43  Aligned_cols=59  Identities=27%  Similarity=0.528  Sum_probs=57.1

Q ss_pred             eEEEEEeecCCceeEEeeEEeeccccccccCCccccCCCCCccCCChHHHHhhcccceecCCEEEEEccC
Q 045722            7 LWLVESMNKNNAIANMSETKLSMHAGPHVDAPDHMFDHYFDVDSLDLEQFQHCFNWICYICPELLVNFTK   76 (82)
Q Consensus         7 ~~~~~~~~~~g~~~~~~~i~l~~H~GTHvDAP~H~~~~g~~id~lple~f~~~~~~~~~~gp~~vvd~~~   76 (82)
                      +++..+++++|  |+++.+++++|+|||||||.||.++|.+||++|+++|+         ||++|||+++
T Consensus        28 ie~~~~~~~~g--~~~~~i~~~~HtGTHiDAP~Hf~~~G~~id~ipl~~~i---------g~~~Vid~~~   86 (218)
T COG1878          28 IERIATLAKDG--FNVSIITLGEHTGTHIDAPLHFIEGGLSIDEIPLESFI---------GPGVVIDISG   86 (218)
T ss_pred             EEEeEEecCCc--EEEEEEEecCccccccccHHhhcCCCCccccCCHHHhE---------eeeEEEEecC
Confidence            88999998888  99999999999999999999999999999999999999         9999999996



>TIGR03035 trp_arylform arylformamidase Back     alignment and domain information
>PF04199 Cyclase: Putative cyclase; InterPro: IPR007325 Proteins in this family are thought to be cyclase enzymes Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query82
1r61_A 207 Metal-dependent hydrolase; zinc-dependent hydrolas 4e-07
>1r61_A Metal-dependent hydrolase; zinc-dependent hydrolase, structural genomics, cyclase, PSI, protein structure initiative; 2.50A {Geobacillus stearothermophilus} SCOP: c.8.8.1 PDB: 3krv_A Length = 207 Back     alignment and structure
 Score = 44.0 bits (104), Expect = 4e-07
 Identities = 11/47 (23%), Positives = 19/47 (40%)

Query: 10 VESMNKNNAIANMSETKLSMHAGPHVDAPDHMFDHYFDVDSLDLEQF 56
           +     N     S   + +H G H+DAP HM +     +++ L   
Sbjct: 27 PKRTTITNGYVTESRIDMDVHTGTHIDAPLHMVEGGATFETIPLNDL 73


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query82
1r61_A 207 Metal-dependent hydrolase; zinc-dependent hydrolas 99.64
2b0a_A 186 Hypothetical protein MJ0783; structural genomics, 99.44
>1r61_A Metal-dependent hydrolase; zinc-dependent hydrolase, structural genomics, cyclase, PSI, protein structure initiative; 2.50A {Geobacillus stearothermophilus} SCOP: c.8.8.1 PDB: 3krv_A Back     alignment and structure
Probab=99.64  E-value=1.6e-16  Score=113.63  Aligned_cols=59  Identities=24%  Similarity=0.324  Sum_probs=53.2

Q ss_pred             eEEEEEeecCCceeEEeeEEeeccccccccCCccccCCCCCccCCChHHHHhhcccceecCCEEEEEccCC
Q 045722            7 LWLVESMNKNNAIANMSETKLSMHAGPHVDAPDHMFDHYFDVDSLDLEQFQHCFNWICYICPELLVNFTKD   77 (82)
Q Consensus         7 ~~~~~~~~~~g~~~~~~~i~l~~H~GTHvDAP~H~~~~g~~id~lple~f~~~~~~~~~~gp~~vvd~~~~   77 (82)
                      ++...++++ +  ++.+.+++++|+|||+|||.||.++|.+++++|++.|+         ||++|||++..
T Consensus        27 ~~~~~~~~~-~--~~~~~i~~~~H~GTH~Dap~H~~~~g~~i~~~pl~~~~---------g~~vviD~~~~   85 (207)
T 1r61_A           27 PKRTTITNG-Y--VTESRIDMDVHTGTHIDAPLHMVEGGATFETIPLNDLV---------GPCKLFDLTHV   85 (207)
T ss_dssp             CEEEEEEET-T--EEEEEEEEETTSSSEEECCGGGSCSSCTTCCCCCTTTE---------EEEEEEECTTC
T ss_pred             EEEeeecCC-c--eEEeEEEECCCCchhhchHhhcCCCCcChhhCChhHcc---------ceEEEEEEeCC
Confidence            556666644 3  99999999999999999999999999999999999999         99999999864



>2b0a_A Hypothetical protein MJ0783; structural genomics, PSI, protein STRU initiative; 1.45A {Methanocaldococcus jannaschii} SCOP: c.8.8.1 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 82
d1r61a_ 205 c.8.8.1 (A:) Unnamed protein {Bacillus stearotherm 6e-06
d2b0aa1186 c.8.8.1 (A:1-186) Hypothetical protein MJ0783 {Met 9e-05
>d1r61a_ c.8.8.1 (A:) Unnamed protein {Bacillus stearothermophilus [TaxId: 1422]} Length = 205 Back     information, alignment and structure

class: Alpha and beta proteins (a/b)
fold: The "swivelling" beta/beta/alpha domain
superfamily: Putative cyclase
family: Putative cyclase
domain: Unnamed protein
species: Bacillus stearothermophilus [TaxId: 1422]
 Score = 39.7 bits (92), Expect = 6e-06
 Identities = 11/59 (18%), Positives = 19/59 (32%)

Query: 10 VESMNKNNAIANMSETKLSMHAGPHVDAPDHMFDHYFDVDSLDLEQFQHCFNWICYICP 68
           +     N     S   + +H G H+DAP HM +     +++ L               
Sbjct: 27 PKRTTITNGYVTESRIDMDVHTGTHIDAPLHMVEGGATFETIPLNDLVGPCKLFDLTHV 85


>d2b0aa1 c.8.8.1 (A:1-186) Hypothetical protein MJ0783 {Methanococcus jannaschii [TaxId: 2190]} Length = 186 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query82
d1r61a_ 205 Unnamed protein {Bacillus stearothermophilus [TaxI 99.69
d2b0aa1 186 Hypothetical protein MJ0783 {Methanococcus jannasc 99.5
>d1r61a_ c.8.8.1 (A:) Unnamed protein {Bacillus stearothermophilus [TaxId: 1422]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: The "swivelling" beta/beta/alpha domain
superfamily: Putative cyclase
family: Putative cyclase
domain: Unnamed protein
species: Bacillus stearothermophilus [TaxId: 1422]
Probab=99.69  E-value=8.4e-18  Score=118.09  Aligned_cols=49  Identities=27%  Similarity=0.451  Sum_probs=47.5

Q ss_pred             eEEeeEEeeccccccccCCccccCCCCCccCCChHHHHhhcccceecCCEEEEEccCC
Q 045722           20 ANMSETKLSMHAGPHVDAPDHMFDHYFDVDSLDLEQFQHCFNWICYICPELLVNFTKD   77 (82)
Q Consensus        20 ~~~~~i~l~~H~GTHvDAP~H~~~~g~~id~lple~f~~~~~~~~~~gp~~vvd~~~~   77 (82)
                      ++.++++|++|+|||+|||+||.++|++|+++||++|+         ||++|||+++.
T Consensus        37 ~~~~~i~~~~H~GTHiDaP~H~~~~g~~i~~~~l~~~~---------g~~vviD~~~~   85 (205)
T d1r61a_          37 VTESRIDMDVHTGTHIDAPLHMVEGGATFETIPLNDLV---------GPCKLFDLTHV   85 (205)
T ss_dssp             EEEEEEEEETTSSSEEECCGGGSCSSCTTCCCCCTTTE---------EEEEEEECTTC
T ss_pred             EEEEEEEECCcCccccccHHHhcCCCCceeeEChhHhC---------ceEEEEEeecc
Confidence            89999999999999999999999999999999999999         99999999874



>d2b0aa1 c.8.8.1 (A:1-186) Hypothetical protein MJ0783 {Methanococcus jannaschii [TaxId: 2190]} Back     information, alignment and structure