Abstract
Nonlinearities, uncertainties, and external disturbances commonly exist in a wastewater treatment process (WWTP). These issues present great challenges to the control of the dissolved oxygen (DO) concentration in a WWTP. In this paper, an active disturbance rejection control (ADRC) is utilized to estimate the total disturbance and drive the DO concentration to track the set-value. Simultaneously, an iterative learning strategy is employed to adjust the parameters of an extended state observer (ESO), improving the estimation accuracy and reducing the dependence on experience in determining the parameters. By combining the advantages of ADRC and the iterative learning strategy, an iterative learning based active disturbance rejection control (ILADRC) is constructed, and the closed-loop stability is analyzed. The benchmark simulation model no. 1 (BSM1) is utilized to validate the ILADRC. Numerical results show that the ILADRC is more effective for DO concentration control.
Introduction
With the development of human society, freshwater scarcity has become one of the most pressing problems we face, owing to increasing demand and serious pollution.1 The water crisis significantly affects many industries and even entire countries.2 Wastewater treatment has therefore attracted much attention. In the early years, wastewater treatment processes (WWTPs) relied mainly on manual control; the efficiency was low and the effluent quality could not satisfy increasingly strict effluent standards.3 Thus, automatic control systems have been applied in WWTPs.
Dissolved oxygen (DO) concentration directly affects the effluent quality, and it has been commonly recognized as a key variable in a WWTP. Therefore, the control of the DO concentration has become a hot topic in the automatic control of WWTPs. However, many control methods fail to regulate the DO at a desired level because of WWTPs' unique features compared with other process industries, such as varying inflow rates, unstable compositions and concentrations, and unknown biochemical reactions.3,4 Much effort has been devoted to addressing those challenges. PID control, a commonly used approach, has been widely utilized in DO concentration control.5 However, the performance of PID control degrades significantly under the strong nonlinearities and disturbances in WWTPs.5 To improve the tracking performance, a feedback linearization-based PI controller was designed.6 A drawback of the feedback linearization method is that robustness cannot be guaranteed in the presence of uncertainties.7 Nowadays, model predictive control (MPC) has a wide variety of applications.8 In Zeng and Liu,9 a centralized economic model predictive control (EMPC) was applied to a WWTP; the simulation results show that it can reduce the operating cost and improve the effluent quality simultaneously. However, the computational complexity of solving the optimization problem and the relatively poor fault tolerance of a centralized control challenge its implementation.10 Belchior11 proposed a stable adaptive fuzzy control to regulate the DO in a WWTP. The algorithm achieved promising results. However, traditional fuzzy control depends heavily on the input dimension: if more input dimensions are defined, more fuzzy rules are necessary and more computational complexity is involved.12 In Bo and Zhang,13 an echo state network based online adaptive dynamic programming approach was taken to control the DO concentration. Its design depends mainly on online data, and only minimal prior knowledge is required.
It should be noted that the key point of DO control in a WWTP is how to deal with the uncertainties, unknown dynamics, and disturbances. To estimate those undesired factors, various disturbance observer-based methods have been designed. Lin14 proposed an adaptive neural control combined with a nonlinear disturbance observer for a WWTP, and satisfactory performance was achieved. An active disturbance rejection control (ADRC)15 was employed to control the DO concentration, in which the extended state observer (ESO) estimated the system states and the total disturbance. Simulation results showed that the ADRC can achieve satisfactory performance in DO concentration control. Based on the ESO, the system can be dynamically compensated into a linear cascade-of-integrators form. Then, a controller based on the idea of U-model control was designed to control the DO concentration.16 It achieves a faster and more robust response.
Owing to its ease of implementation, its effectiveness in dealing with strong disturbances and uncertainties, and its satisfactory performance, ADRC has been applied in many fields.17–19 In this paper, we also focus on ADRC. However, determining the parameters of an ESO depends, to some extent, on engineers' experience. To make the ESO more accurate in its estimation and to reduce the dependence on experience in fixing the parameters, a P-type iterative learning algorithm20 is utilized, and an iterative learning based ADRC (ILADRC)21 is designed for DO concentration control. The main advantage of the ILADRC is that its ESO can acquire better estimation performance through an iterative learning approach. Simultaneously, it reduces the requirement on an engineer's experience.
The rest of this paper is organized as follows. Section II describes the model and the control difficulties of a WWTP. Section III presents the controller design and stability analysis. Simulation results are shown in Section IV. Finally, a conclusion is drawn in Section V.
System description and control problem statement
System description
Benchmark simulation model no. 1 (BSM1) is a standard simulation platform for WWTPs. It contains influent data for dry, rain, and storm weather, and it provides a set of standard evaluation criteria for different control strategies.22 For fair comparisons among different control strategies, the BSM1 has become a commonly accepted platform for benchmarking control strategies for WWTPs.22 The layout of the BSM1 is presented in Figure 1.23 It consists of two parts: a biological reactor and a secondary clarifier. The biological reactor comprises two anoxic tanks (V1 = V2 = 1000 m3) and three aerobic tanks (V3 = V4 = V5 = 1333 m3). Biological and physical phenomena in the tanks are described by the activated sludge model no. 1 (ASM1). The secondary clarifier is modeled as a ten-layer non-reactive unit.

Layout of the BSM1.
Control problem statement
The DO concentration in the fifth tank is a key variable, which affects the growth of microorganisms and the effluent quality. Keeping it at a desirable level is critical for a WWTP. However, the DO concentration in the fifth tank is affected by various uncertainties, such as time-varying influent components, concentrations, and inflow rates. Meanwhile, in practice, both the parameters and the dynamics of a WWTP are only partially known or even completely unknown. Moreover, most of these factors are coupled with each other, and most of them are not measurable. In other words, disturbances, uncertain dynamics, and strong couplings are the difficulties in keeping the DO concentration at a satisfactory level. Thus, it remains a challenge to develop and apply an effective control strategy that satisfies the discharge requirements.
Therefore, the aim of this paper is to control the DO concentration in the fifth tank within a satisfactory range via the oxygen transfer coefficient KLa5, in the presence of disturbances, uncertain dynamics, and strong nonlinear couplings.
Iterative learning based active disturbance rejection control
Structure of the ILADRC
ADRC can estimate and cancel out the total disturbance in real time to guarantee the closed-loop system performance. Thus, an accurate mathematical model is not necessary. However, the parameters of an ESO greatly determine its estimation ability. To reduce the dependence on engineers' experience in tuning the parameters and to improve the ESO's estimation ability, an iterative learning based ESO is designed. The structure of the ILADRC is given in Figure 2.

The ILADRC for a WWTP.
Here r is the set-value, u is the control signal, and y is the system output.
The ESO is designed as24

$$\begin{cases}\dot{z}_1 = z_2 + \beta_1(y - z_1) + b_0 u\\ \dot{z}_2 = \beta_2(y - z_1)\end{cases}\tag{1}$$

where b0, β1, β2 are adjustable gains of the ESO, y is the system output, u is the control signal, z1 is the estimate of y, and z2 is the estimate of the total disturbance.
The control law is

$$u = \frac{k_p(r - z_1) - z_2}{b_0}$$

where kp is an adjustable control gain.
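As a concrete illustration, the first-order linear ADRC loop described above (ESO plus disturbance-cancelling control law) can be sketched in discrete time. This is a minimal simulation against a toy plant: the plant dynamics, kp, ωo, and b0 below are illustrative assumptions, not the tuned values used in the paper.

```python
import numpy as np

def simulate_adrc(b0=1.0, kp=8.0, wo=20.0, dt=1e-3, T=5.0, r=2.0):
    """Forward-Euler simulation of a first-order plant under linear ADRC.

    Plant (assumed, for illustration only): y' = f(y, t) + b0*u, where f
    lumps unknown dynamics and disturbances. ESO gains follow the
    bandwidth parameterization beta1 = 2*wo, beta2 = wo**2.
    """
    beta1, beta2 = 2.0 * wo, wo ** 2
    y, z1, z2 = 0.0, 0.0, 0.0          # plant output and ESO states
    out = np.empty(int(T / dt))
    for i in range(out.size):
        # control law: cancel the estimated total disturbance z2
        u = (kp * (r - z1) - z2) / b0
        # "unknown" plant dynamics: nonlinear drift + sinusoidal disturbance
        t = i * dt
        f = -0.5 * y + 0.2 * np.sin(2 * np.pi * t)
        y += dt * (f + b0 * u)
        # extended state observer: z1 tracks y, z2 tracks f
        e = y - z1
        z1 += dt * (z2 + beta1 * e + b0 * u)
        z2 += dt * beta2 * e
        out[i] = y
    return out

y_hist = simulate_adrc()
```

With these assumed settings the output settles close to the set-value r despite the unmodeled drift and the sinusoidal disturbance, which is the behavior the ESO-based compensation is meant to deliver.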
In this paper, to decrease the estimation error of the ESO and to reduce the dependence on experience in determining the ESO's parameters, a P-type iterative learning algorithm is utilized to adjust the parameters of the ESO, and it is designed as
where k is the current iteration number.
According to the bandwidth-parameterization approach,24 one has

$$\beta_1 = 2\omega_o,\qquad \beta_2 = \omega_o^2$$

where ωo is the observer bandwidth.
The bound of sat(·) should be chosen large enough that the parameters actually generated always lie within it; the generated parameters are therefore always bounded. After the algorithm iterates n times, the iterative learning based ESO achieves much smaller estimation errors.
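One learning step can be sketched as follows. The paper's exact P-type update law is not reproduced above, so the proportional correction of the observer bandwidth by the learning gain kl below is a hypothetical, assumed form; only the saturation sat(·) and the bandwidth parameterization β1 = 2ωo, β2 = ωo² follow the text directly.

```python
import numpy as np

def p_type_update(wo_k, kl, est_error_k, bounds=(1.0, 200.0)):
    """One P-type iterative-learning step on the ESO bandwidth.

    Hypothetical form: the bandwidth at iteration k+1 is the previous
    bandwidth plus kl times a measure of iteration k's estimation error,
    saturated (sat(.)) so the generated parameters stay bounded.
    """
    wo_next = float(np.clip(wo_k + kl * est_error_k, *bounds))
    # bandwidth parameterization: both ESO gains from the single parameter wo
    beta1, beta2 = 2.0 * wo_next, wo_next ** 2
    return wo_next, beta1, beta2

wo, b1, b2 = p_type_update(20.0, 19.0, 0.1)
```

The saturation bound (1, 200) here is deliberately generous, in line with the remark above that the bound of sat(·) should be chosen as large as practical.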
Stability analysis
Consider a first-order system

$$\dot{y} = f + b_0 u\tag{6}$$

where f represents the total disturbance, b0 is a non-zero constant, u is the control input, and y is the controlled system output.
Let $x = [x_1, x_2]^{\mathrm{T}} = [y, f]^{\mathrm{T}}$ be the extended state; then equation (6) can be rewritten as

$$\dot{x} = Ax + Bu + E\dot{f},\qquad y = Cx\tag{7}$$

where

$$A = \begin{bmatrix}0 & 1\\ 0 & 0\end{bmatrix},\quad B = \begin{bmatrix}b_0\\ 0\end{bmatrix},\quad E = \begin{bmatrix}0\\ 1\end{bmatrix},\quad C = \begin{bmatrix}1 & 0\end{bmatrix}$$

Similarly, the ESO (1) can be rewritten as

$$\dot{z} = Az + Bu + L(y - Cz)\tag{8}$$

where $z = [z_1, z_2]^{\mathrm{T}}$ and $L = [\beta_1, \beta_2]^{\mathrm{T}}$.
Next, the convergence of the iterative learning based ESO and the closed-loop stability of the ILADRC are analyzed.
Convergence of the iterative learning based ESO
Subtracting equation (8) from equation (7), one has

$$\dot{e} = \bar{A}e + E\dot{f}$$

where $e = x - z$ is the estimation error and $\bar{A} = A - LC$.
Let
where
Here,
Then, Theorem 1 is obtained.
Let
For j = 1,2, one has
For
System matrix
for all t≥T1, j, k = 1,2.
Thus
for all t≥T1.
Let
From equations (13), (14), and (17), one has
for all t≥T1, j = 1,2.
Let
for all t≥T1, j = 1,2.
From equation (11), one has
Let
for all t≥T1, j = 1,2.
Thus, when h(
Closed-loop stability of the ILADRC
The control law of the ILADRC is

$$u = \frac{k_p(r - z_1) - z_2}{b_0}\tag{22}$$

Substituting equation (22) into equation (6), one has

$$\dot{y} = k_p(r - z_1) + (f - z_2)\tag{23}$$
Let
Let
According to equation (25) and Theorem 1, one has
Let
Then
According to Gao,24 one has
for all t≥T2.
Let T3 = max{T1, T2}; one has
for all t≥T3.
Then
for all t≥T3.
From equations (29) and (33), one has
for all t≥T3.
From equation (26), one has
Then,
Based on Theorem 1 and Theorem 2, one can find that, if the estimation errors of the ESO are bounded, the tracking errors of the closed-loop system will also be bounded. Then, for a bounded set-value
Therefore, by choosing proper parameters, the iterative learning based ESO is bounded, and the closed-loop system is also stable.
Simulation results
In this section, the iterative learning based ADRC is designed and verified on the BSM1 under dry, rain, and storm weather. The aim is to keep the DO concentration in the fifth reactor (SO,5) at 2 mg/L by manipulating the oxygen transfer coefficient (KLa5). To validate the ILADRC, the conventional ADRC is used for comparison. All experiments are run in the same numerical environment. The parameters of the ADRC and the ILADRC are listed in Table 1.
Parameters of the ADRC and ILADRC.
The performance is evaluated by the integral of absolute error (IAE), the integral of squared error (ISE), and the maximal deviation from the set-value (DEVmax). IAE, ISE, and DEVmax can be calculated as22

$$\mathrm{IAE}=\int_{t_0}^{t_f}|e(t)|\,dt,\qquad \mathrm{ISE}=\int_{t_0}^{t_f}e^2(t)\,dt,\qquad \mathrm{DEV}_{\max}=\max_{t}|e(t)|$$

where e(t) = r(t) − y(t) is the tracking error and [t0, tf] is the evaluation period.
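Given a sampled tracking error, the three indices can be computed as follows; the rectangle-rule approximation of the integrals is an implementation choice for this sketch, not something specified by the benchmark.

```python
import numpy as np

def performance_indices(e, dt):
    """IAE, ISE and DEVmax from a sampled tracking error e = r - y."""
    e = np.asarray(e, dtype=float)
    iae = np.sum(np.abs(e)) * dt       # integral of absolute error
    ise = np.sum(e ** 2) * dt          # integral of squared error
    dev_max = np.max(np.abs(e))        # maximal deviation from set-value
    return iae, ise, dev_max

iae, ise, dev = performance_indices([1.0, -1.0, 0.5], dt=0.1)
```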
For the ILADRC, in each experiment, the initial learning gain kl is determined by trial and error, and the values are listed in Table 1. To obtain a better learning gain, Monte Carlo experiments are carried out. To ensure that the learning gains kl in the Monte Carlo experiments are selected properly, they vary randomly within ±20% of their initial values; in other words, for each weather scenario, every candidate kl lies within ±20% of the corresponding initial value.
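The ±20% Monte Carlo sampling of the learning gain can be sketched as below; the uniform distribution, the 30 trials, and the example initial gain of 19.0 are assumptions for illustration (the paper does not state the sampling distribution).

```python
import numpy as np

def sample_learning_gains(kl_initial, n_trials=30, spread=0.2, seed=0):
    """Draw Monte Carlo candidates for kl within +/-20% of its initial value."""
    rng = np.random.default_rng(seed)
    lo, hi = (1.0 - spread) * kl_initial, (1.0 + spread) * kl_initial
    return rng.uniform(lo, hi, size=n_trials)

gains = sample_learning_gains(19.0)   # 19.0 is an illustrative initial gain
```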
Based on these kl values, the bandwidths ωo and gains b0 of the ILADRC can be obtained. The ωo and b0 values after 30 iterations are listed in Table 2. The other parameters are the same as those given in Table 1.

Learning gains (kl values) and RMSEs in dry weather.
Main parameters of the ILADRC after 30 iterations.
Figure 4 shows the trend of the IAE values when kl = 19.0118. In this case, the outputs of the ILADRC are given in Figure 5, together with the outputs of the ADRC. From Figure 5, it is easy to find that the ILADRC achieves better tracking performance in dry weather. Figures 6 and 7 show the tracking errors and estimation errors of the ADRC and the ILADRC. The medians of both the tracking error and the estimation error equal zero, which means the average level of those errors is zero. The distance between the upper and lower quartiles in a boxplot reflects the degree of variation of the tracking and/or estimation errors: the taller the boxplot, the larger the fluctuation of the errors. Figures 6 and 7 show that, compared with the ADRC, both the tracking errors and the estimation errors of the ILADRC vary less and lie closer to the median. The smaller number of outliers for the ILADRC also indicates that it is more effective. The performance values listed in Table 3 confirm these observations quantitatively.

IAE values in dry weather (kl = 19.0118).

Tracking performance in dry weather (kl = 19.0118).

Tracking errors of the ADRC and the ILADRC in dry weather (kl = 19.0118).

Estimation errors of the ADRC and the ILADRC in dry weather (kl = 19.0118).
Performance values of the ILADRC and the ADRC.
Figure 8 shows the relationship between the learning gains (kl) and the RMSEs. It can be found that the RMSE is minimal when the learning gain kl = 20.2000.

Learning gains (kl values) and RMSEs in rain weather.
Figure 9 presents the IAE values of the estimation errors. The system outputs of the ADRC and the ILADRC are shown in Figure 10. From Figure 10, it is easy to see that better tracking performance is obtained by the ILADRC in rain weather. Figures 11 and 12 show the tracking errors and estimation errors of the ADRC and the ILADRC. Both the tracking and estimation errors of the ILADRC are smaller than those of the ADRC. The performance values are shown in Table 3.

IAE values in rain weather (kl = 20.2000).

Tracking performance in rain weather (kl = 20.2000).

Tracking errors of the ADRC and the ILADRC in rain weather (kl = 20.2000).

Estimation errors of the ADRC and the ILADRC in rain weather (kl = 20.2000).
Figure 13 shows the relationship between the learning gains (kl) and the RMSEs in storm weather. The RMSE is minimal when kl = 18.9395.

Learning gains (kl values) and RMSEs in storm weather.
Figure 14 shows the IAE values of the estimation errors; it illustrates that the learning process is convergent. The system outputs of the ADRC and the ILADRC are shown in Figure 15, which indicates that the ILADRC has better tracking performance. Figures 16 and 17 show that the average levels of the tracking errors and estimation errors are close to zero. In addition, from the distance between the upper and lower quartiles in the boxplots, one can see that the tracking errors and estimation errors of the ILADRC system are closer to zero. Performance comparisons in storm weather are also given in Table 3.

IAE values in storm weather (kl = 18.9395).

Tracking performance in storm weather (kl = 18.9395).

Tracking errors of the ADRC and the ILADRC in storm weather (kl = 18.9395).

Estimation errors of the ADRC and the ILADRC in storm weather (kl = 18.9395).
In Table 3, the improvements signify that the ILADRC is superior to the ADRC in terms of IAE, ISE, and DEVmax. For example, in dry weather, the ISE value of the ILADRC is improved by 45.85% compared with the ADRC. The data listed in Table 3 confirm the advantage of the iterative learning based ESO.
From the simulation results in dry, rain, and storm weather, it is obvious that, owing to the iterative learning based ESO, the ILADRC tracks the set-value more accurately. This demonstrates that, compared with a conventional ESO, an iterative learning based ESO can estimate the total disturbance more effectively. Simultaneously, the iterative learning based ESO is less dependent on an engineer's experience when choosing the parameters, which helps improve the estimation and control performance of the ADRC.
Conclusion
A WWTP is a time-varying system with strong nonlinearities and couplings, in which various uncertainties and disturbances also exist. Therefore, it is impractical to establish an accurate model for the control of a WWTP. In this paper, an iterative learning based ADRC is designed for DO control in a WWTP. The iterative learning method is utilized to optimize the parameters of the ESO. Compared with a conventional ESO whose parameters are fixed, the iterative learning based ESO achieves a more accurate estimation, so the closed-loop DO concentration control of the WWTP is more satisfactory. The advantages of the iterative learning based ESO and the ILADRC show that this may be a promising way to control the DO in a WWTP. However, the approach should still be verified in a real wastewater treatment process, which is our future work.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research is supported by the Key Program of the Beijing Municipal Education Commission (grant no. KZ201810011012) and the National Natural Science Foundation of China (grant no. 61873005).
