Abstract
Measuring the concentrations of metabolites and estimating the reaction rates of each reaction step consisting of metabolic pathways are significant for an improvement in microorganisms used in maximizing the production of materials. Although the reaction pathway must be identified for such an improvement, doing so is not easy. Numerous reaction steps have been reported; however, the actual reaction steps activated vary or change according to the conditions. Furthermore, to build mathematical models for a dynamical analysis, the reaction mechanisms and parameter values must be known; however, to date, sufficient information has yet to be published for many cases. In addition, experimental observations are expensive. A new mathematical approach that is applicable to small sample data, and that requires no detailed reaction information, is strongly needed. S-system is one such model that can use smaller samples than other ordinary differential equation models. We propose a simplified S-system to apply minimal quantities of samples for a dynamic analysis of the metabolic pathways. We applied the model to the phenyl lactate production pathway of Escherichia coli. The model obtained suggests that actually activated reaction steps and feedback are inhibitions within the pathway.
Introduction
Mathematical model for metabolic pathways
The artificial and industrial uses of microorganisms for material production have a long history of more than a thousand years. Recently, genetic operations have been widely applied to improve production. Two generally considered approaches introduce enzymes that have higher activities from other organisms or species and introduce enzymes to realize metabolic pathways that do not naturally occur in the microorganisms. The former method is popular because its operation is simpler and improvements are more predictable than with the latter method. Conventional gene modifications using ultraviolet or other radiation types are easy to achieve and have been widely applied in many industries. Nevertheless, the efficiency of such improvements is quite low because gene modifications occur accidentally and uncontrollably, and progress is made serendipitously. Therefore, gene introduction is currently used along with conventional methods.
Target genes for modification are chosen based on information including the reaction rates of the respective reaction steps within the metabolic pathway and include the production materials and substrates of microorganisms, as well as changes in the reaction rates through changes in the concentrations of the metabolites that consist of the pathway. Bottleneck reaction steps and feedback loop inhibitions are suggested based on this information. The genes of enzymes used in such reactions are candidates for modification.
The rates of enzymic reactions are generally defined as the limit of changes in the compounds over time.1,2 Several formulae are established for the types of enzymic reactions, such as inhibition schemes. The most popular is the Michaelis-Menten law3,4 for a simple one-to-one enzymic reaction without any inhibition or catalysis using an enzyme. The reaction rate is modeled based on the following reaction scheme:
in which
Therein, a pair of square brackets denotes the concentration of the compound,
Generally, finding the parameter values is difficult and expensive because it requires enzyme isolation and measurements of the reaction rates in test tubes (in vitro measurements). Although the amount of enzyme information in the literature and public databases is growing, 7 the values of the kinetic parameters have not been sufficiently published or accumulated. An enzyme generally has different parameter values for the various conditions and species of organisms. Moreover, the parameter values generally differ between in vitro and in vivo (in living cells) conditions.8,9 For most industrial applications, a dynamical analysis of the pathways must be conducted without reaction scheme information or kinetic parameter values.
Canonical ODE model
The ODE systems in canonical forms are applicable because they are independent of the molecular mechanism of the reaction scheme. The S-system 6 is one such canonical ODE model. For a reaction scheme with 2 reactions, the following is used:
where
where
The S-system above is a simplified form of the general mass action law, 6 which can be presented as follows:
Therein, the
The parameter values of
Method for small sample
Numerical optimizations require a sufficient number of observed samples. Smaller needs are better because observations entail a certain amount of costs. A mathematical model with fewer parameters requires fewer samples. We propose a canonical ODE model for small samples through the simplification of the second term of Equation (4), as shown below:
By the mass action law, the decomposition rate of a compound depends solely on its concentration in many biological processes, such as Michaelis-Menten–type reactions shown in Equation (2). We introduce this idea as an assumption in Equation (6). Although decomposition reactions are often modeled as linear ODE, such as
our model includes a nonlinear decomposition term because we suspect that a linear term might be too simple for the metabolite in a complex biological network system containing many unknown reactions. Our assumption is reasonable when regulation of the degradation processes by these unknown reactions is not significant.
The resulting time series of the model based on a numerical integration varies greatly through a change in the initial value. Finding the best initial value is difficult because the observed initial value frequently has errors, particularly for small sample data sets. Therefore, we compare the model and data in differential spaces. The differential of the observed data can be calculated through a numerical differentiation, and the parameter values can be evaluated by comparing the differential values with the values of Equation (6).
We evaluated the proposed method according to its application to the phenyl lactate (PL) production pathway from glucose using Escherichia coli. For a pathway that includes branches and feedback loops, we estimated the actually activated reaction steps and activities of the feedback inhibitions suggesting strategies for an improvement in production.
Method
First, we build a pathway map based on information from the literature and different databases. We then choose some metabolites in the map as observation targets. A pathway map is reconstructed using only the target metabolites. The observations are measurements of the target metabolite concentration at sampling time points with equal intervals. A simplified S-system model is defined based on the reconstructed pathway map by fixing some
Observed time series data on the concentration of metabolites are numerically differentiated. Optimal values of

Main pathway of phenyl lactate metabolism of Escherichia coli. A dashed line indicates that the path consists of plural reactions. A solid 1-directional line indicates a single enzymic reaction. A bidirectional line indicates 2 reactions: forward and backward processes catalyzed by the same or different enzymes. A double-lined circle indicates a target metabolite selected for observation.

Reconstructed pathway consisting of observation target metabolites.

Observed time series of concentrations of target metabolites at the 6 time points. SA, PP, PA, and PL indicate shikimic acid, phenyl pyruvate, phenylalanine, and PL, respectively.
Pathway map construction and observation
The pathway from glucose to PL consists roughly of glycolysis and shikimic acid (SA) pathways. Phenyl lactate is produced from phenyl pyruvate (PP), PP is from phenylalanine (PA), and PA is from prephenic acid. In addition, prephenic acid is from chorismic acid in the SA pathway. Choosing phosphoenolpyruvate (PEP), erythrose 4-phosphate (E4P), SA, PA, PP, and PL as the observation targets, we then reconstruct the pathway for these 6 metabolites (Figure 2). The pathway includes a branching point and 2 feedback loops.
ODE models for the respective metabolite
Changes in concentration of metabolites in the reconstructed pathway are modeled mathematically using the following ODE models:
Actually, production of PEP and E4P is not controlled by any other metabolites, and these compounds are independent variables in the ODE system above. Their respective dynamics are not modeled. Parameter
Time differential values of the metabolite concentration are calculable using the ODE system by determining all parameter values. We introduce the differential evolution algorithm 11 to find the parameter values that minimize the differences between the differential of the concentration values calculated using the ODE system, as shown below:
Therein,
Result
Model interpretation
The ODE system with the determined parameter values is shown below (Figure 4):

Estimated actual reaction activities on the reconstructed pathway. Numerical values shown in the figure are
The ODE system suggests that the reaction chain to SA from PEP is more active than that from E4P. In addition, PP production is inhibited by feedback loops from both PA and PL.
There are 3 negative value parameters. Reaction steps that are represented by these parameters consist of only productive enzymic reactions in the literature.12,13 These negative parameters may suggest unknown regulatory pathways whose effects look substrate inhibitions.
Discussion
Numerical advantages of our model
The changes in metabolite concentration over time can be modeled using the Michaelis-Menten law, as shown below:
For each metabolite, there are 8 parameters for SA, 10 for PP, 6 for PA, and 4 for PL because a model of the reaction step (single enzymic reaction) has 2 parameters. Changes of concentrations in time of SA and PP cannot be modeled using the 6 sampling data in this study. There are a total of 16 parameters for the reconstructed pathway because some parameters are common (1 of the outgoing reactions of SA is an incoming reaction of PP). Some numerical optimization methods can search for the 16 parameters simultaneously. However, such a simultaneous nonlinear numerical optimization is not easy, and the difficulty increases significantly with an increase in the number of parameters (“curse of dimensionality”).
The form of the S-system model is reduction or simplification of general mass action (GMA) law model shown as follows:
For each metabolite, there are 7 S-system parameters for SA, 8 for PP, 5 for PA, and 4 for PL, or 18 in total because some of the parameters are the same. The number of parametes are reduced for SA, PP, and PA compared with Michaelis-Menten models; however, concentrations of SA and PP cannot be modeled even when using the GMA. In our proposed model, the corresponding numbers are 5, 6, 5, and 4, or 20 in all, which is not a small number; however, it is not a problem that the total number of parameters is larger than in the S-system because no parameters are common to any 2 metabolites, and the parameters of each metabolite are optimized independently of the other metabolites. Our model has fewer parameters for each metabolite, which indicates that our model is more robust against errors than the S-system and Michaelis-Menten models.
The less number of model parameters means that the model needs a less number of data. Michaelis-Menten model consists of 2 parameters for each reaction, thus
Although the model and data do not match perfectly because the data generally include errors in the probabilistic distributions, the optimization precision of the parameters (model fitness to the data) shown in Figure 5 is apparently sufficient. In terms of biochemical engineering, the values of some parameters shown in Figure 4 are large as the reaction order. Perhaps because of fluxes of pathways other than the reconstructed pathway from which we omitted pathways other than the main reaction chain, glycolysis has many branches to other subsystems. However, the parameter values for PA (PA in Figure 4), PP, and PL might be more reliable because the reaction steps occurring naturally around these metabolites may be nearly the same as those of the reconstructed pathway.

Values of differentials calculated from the observed data through a quadratic interpolation and calculated from the optimized ODE model (simplified S-system model). The dashed line shows differential values calculated from the ODE model with the optimized parameters. A solid line represents the numerical differentiation from the observed data. SA, PP, PA, and PL indicate shikimic acid, phenyl pyruvate, phenylalanine, and phenyl lactate, respectively. Michaelis-Menten model and the full S-system model cannot be obtained due to the much number of parameters of the models compared with the number of data. SA, PP, PA, and PL indicate shikimic acid, phenyl pyruvate, phenylalanine, and PL, respectively.
The parameter values of the
Two negative feedback loops exist: PP production is inhibited by PL, and PA production is inhibited by PP. Although it can be readily imagined that the inhibition of the feedback reactions increases the production of PL, the main inhibitory effect on PP production is from the SA. Disrupting one or more genes of the reactions to PP from SA might improve the PL production.
Phenylalanine decreases gradually (Figure 3), but shows no natural decomposition, which is represented as
In conclusion, the results show that the reliability of the estimated parameter values might not be the best or even very high because the reconstructed pathway and the ODE system are simplified. These values suggest that the target genes can be modified for an industrial improvement in production using microorganisms. This case study presents several suggestions that may be useful when constrained to only a few samples or low observation costs.
Footnotes
Acknowledgements
The stock of microorganisms used in this study was provided by professor Naoki Takaya at the Graduate School of Life and Environmental Science, Tsukuba University.
Funding:
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the New Energy and Industrial Technology Development Organization, Japan.
Declaration of conflicting interests:
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Author Contributions
DT and SA conceived the mathematical analyses. DT carried out the mathematical analyses. HK and TH conceived the experimental design for observations. HK, TH and CO prepared biological and biochemical materials. HK and YH carried out the experiments and observations.
